
GGUF/GPTQ models for LLaVA-1.5 #526

Closed
shahizat opened this issue Oct 11, 2023 · 3 comments

@shahizat

Question

Hello,

Is anyone aware of 4-bit quantized models for LLaVA-1.5 available on Hugging Face?

Thanks in advance!

@barshag

barshag commented Oct 12, 2023

What are the system requirements for that? (Running solely on CPU?)

@aiaicode

https://huggingface.co/mys/ggml_llava-v1.5-7b
https://huggingface.co/mys/ggml_llava-v1.5-13b

7b works better than 13b for now.

Clone the llama.cpp repo, run `make`, and use the `./llava` binary.

Runs on CPU. More info on this PR
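
For anyone following along, a minimal sketch of those steps (the file names below are the ones shipped in the linked 7b repo; substitute whichever quantization you downloaded):

```sh
# Build llama.cpp from source (CPU build)
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
make

# Run the llava example with a quantized model plus the CLIP projector file,
# both downloaded from https://huggingface.co/mys/ggml_llava-v1.5-7b
./llava -m ggml-model-q4_k.gguf \
        --mmproj mmproj-model-f16.gguf \
        --image /path/to/image.jpg
```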

@haotian-liu
Copy link
Owner

The bug seems to be fixed in ggml-org/llama.cpp#3645.
Now the quality for both 7b and 13b is improved.

Closing this now.
