Add the Quantized model and also a Demo of the Quantized model #587

Ruhaan838 · 2025-01-09T16:55:06Z

Post Training Quantization for GPT2

This PR adds post-training quantization for GPT2; most of the code remains the same.
The quantized model lives in a new file, quantized_model.py, which I have tried my best to get right.
If anything looks wrong, let me know and I will do my best to fix it.
I did not change the original code; everything is in the new file.
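
For context, post-training quantization replaces trained float weights with low-bit integers plus a scale factor, without retraining. The snippet below is only a minimal pure-Python sketch of that idea, assuming symmetric per-tensor int8 quantization. The function names `quantize_int8` and `dequantize_int8` are hypothetical and are not taken from quantized_model.py, which may use a different scheme.

```python
from typing import List, Tuple


def quantize_int8(weights: List[float]) -> Tuple[List[int], float]:
    """Symmetric per-tensor quantization: w is approximated by scale * q,
    with q an integer clamped to [-127, 127]. (Illustrative helper.)"""
    max_abs = max((abs(w) for w in weights), default=0.0)
    # Guard against an all-zero tensor so we never divide by zero.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale


def dequantize_int8(q: List[int], scale: float) -> List[float]:
    """Map the stored integers back to approximate float weights."""
    return [scale * v for v in q]


# Toy weights standing in for one row of a linear layer.
weights = [0.5, -1.27, 0.003, 1.0]
q, scale = quantize_int8(weights)
restored = dequantize_int8(q, scale)

# Rounding to the nearest integer keeps the per-weight error within scale / 2.
max_error = max(abs(a - b) for a, b in zip(weights, restored))
```

Only the integers and a single scale are stored, so the tensor takes roughly a quarter of the float32 memory, at the cost of a rounding error of at most half the scale per weight.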

I still can't figure out how to generate text with the quantized model.
After the quantization engine is applied, it raises a NotImplementedError for the embedding layers.

@karpathy If you have time, please review this code and merge it, or tell me if something looks wrong and I will do my best to fix it.

Ruhaan838 added 2 commits January 9, 2025 21:36

add quantized model and demo also

ac7adfe

some changes not able to generate text some error later fix

bd97adc