
Huggingface quantisation #409

Closed
wants to merge 1 commit

Conversation

Archit-Kohli
Contributor

  • Documentation Update

  • Why was this update required?: It is difficult to load a model with this many parameters, so it needs to be loaded via bitsandbytes; a minimal loading sketch follows below

  • Other information: Added link to Hugging Face's official bitsandbytes notebook for reference
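
For context, a minimal sketch of the quick 4-bit loading path the docs point to, assuming transformers with bitsandbytes and accelerate installed; the model id is a placeholder, not one from the original docs:

from transformers import AutoModelForCausalLM

# Placeholder model id; any large causal LM from the Hub works the same way.
# load_in_4bit requires the bitsandbytes package; device_map="auto" requires accelerate.
model = AutoModelForCausalLM.from_pretrained(
    "huggingface/large-model",
    load_in_4bit=True,
    device_map="auto",
)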

Added link to Hugging Face's official bitsandbytes notebook for reference
@vercel

vercel bot commented Oct 4, 2023

The latest updates on your projects:

Name           | Status  | Updated (UTC)
docs-gpt       | ✅ Ready | Oct 4, 2023 1:56pm
nextra-docsgpt | ✅ Ready | Oct 4, 2023 1:56pm

@dartpain
Contributor

dartpain commented Oct 4, 2023

Unfortunately, users will need to change some code in application/llm/huggingface.py.
What you could do is create a different class, or add a parameter (say, q) that users can pass to enable quantisation explicitly. It would only need a few lines of code here.

Basically, add

import torch
from transformers import BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # load weights in 4-bit precision
    bnb_4bit_use_double_quant=True,         # nested quantisation for extra memory savings
    bnb_4bit_quant_type="nf4",              # NormalFloat4 data type
    bnb_4bit_compute_dtype=torch.bfloat16,  # run compute in bfloat16
)

and pass it to AutoModelForCausalLM.from_pretrained (as quantization_config), with a conditional import.

this would be much appreciated
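
A rough sketch of how that could be wired up in application/llm/huggingface.py (the class name, constructor signature, and default model id here are illustrative guesses, not the repo's actual code):

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

class HuggingFaceLLM:
    def __init__(self, llm_name="huggingface/large-model", q=False):
        if q:
            # Conditional import: the bitsandbytes-backed config is only
            # needed when quantisation is explicitly requested.
            from transformers import BitsAndBytesConfig
            bnb_config = BitsAndBytesConfig(
                load_in_4bit=True,
                bnb_4bit_use_double_quant=True,
                bnb_4bit_quant_type="nf4",
                bnb_4bit_compute_dtype=torch.bfloat16,
            )
            self.model = AutoModelForCausalLM.from_pretrained(
                llm_name,
                quantization_config=bnb_config,
                device_map="auto",
            )
        else:
            self.model = AutoModelForCausalLM.from_pretrained(llm_name)
        self.tokenizer = AutoTokenizer.from_pretrained(llm_name)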


@dartpain dartpain changed the title Update README.md Huggingface quantisation Oct 4, 2023
@Archit-Kohli
Contributor Author

Updated in a new pull request (#425), so you can close this one.

@dartpain dartpain closed this Oct 5, 2023