Error when starting: "llama_init_from_file: failed to load model" #18

Closed
mkinney opened this issue Apr 7, 2023 · 13 comments · Fixed by #57

Comments
@mkinney (Contributor) commented Apr 7, 2023

I'm getting an error when starting:

(venv) sweet gpt4all-ui % python app.py
llama_model_load: loading model from './models/gpt4all-lora-quantized-ggml.bin' - please wait ...
./models/gpt4all-lora-quantized-ggml.bin: invalid model file (bad magic [got 0x67676d66 want 0x67676a74])
	you most likely need to regenerate your ggml files
	the benefit is you'll get 10-100x faster load times
	see https://github.com/ggerganov/llama.cpp/issues/91
	use convert-pth-to-ggml.py to regenerate from original pth
	use migrate-ggml-2023-03-30-pr613.py if you deleted originals
llama_init_from_file: failed to load model
Checking discussions database...
Ok
llama_generate: seed = 1680843169

system_info: n_threads = 8 / 16 | AVX = 1 | AVX2 = 1 | AVX512 = 0 | FMA = 1 | NEON = 0 | ARM_FMA = 0 | F16C = 1 | FP16_VA = 0 | WASM_SIMD = 0 | BLAS = 1 | SSE3 = 1 | VSX = 0 |
zsh: segmentation fault  python app.py

I have validated the md5sum matches:

sweet gpt4all-ui % md5sum models/gpt4all-lora-quantized-ggml.bin
387eeb7cba52aaa278ebc2fe386649b1  models/gpt4all-lora-quantized-ggml.bin
sweet gpt4all-ui % cat models/gpt4all-lora-quantized-ggml.bin.md5
387eeb7cba52aaa278ebc2fe386649b1
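
For what it's worth, the two magic values in the error are just four ASCII characters each: 0x67676d66 is "ggmf" (the older ggml container this download ships in) and 0x67676a74 is "ggjt" (the newer container that current llama.cpp / pyllamacpp expects). So the file isn't corrupt, it's just in the old format. A quick sketch to check which magic a file carries, assuming the default model path used above:

```python
# Peek at the magic word of a ggml model file (path is the repo's default).
# llama.cpp writes the magic as a native uint32, i.e. little-endian on x86/ARM.
import struct

KNOWN_MAGICS = {
    0x67676d6c: "ggml (original, unversioned format)",
    0x67676d66: "ggmf (versioned format - what this download currently uses)",
    0x67676a74: "ggjt (format expected by current llama.cpp / pyllamacpp)",
}

with open("./models/gpt4all-lora-quantized-ggml.bin", "rb") as f:
    magic = struct.unpack("<I", f.read(4))[0]

print(hex(magic), "->", KNOWN_MAGICS.get(magic, "unknown magic"))
```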
@rafatzbr commented Apr 7, 2023

Same error here.

@mkinney (Contributor, Author) commented Apr 7, 2023

I should add that I'm on an Intel Mac. The GPT4All chat binary gpt4all-lora-quantized-OSX-intel works for me.

@rafatzbr commented Apr 7, 2023

I'm running on Linux. I was also able to run gpt4all-lora-quantized-Linux-x86.

@rafatzbr commented Apr 7, 2023

I also tried to run this code:

```python
from pyllamacpp.model import Model

def new_text_callback(text: str):
    print(text, end="")

model = Model(ggml_model='./models/gpt4all-lora-quantized-ggml.bin', n_ctx=512)
model.generate("Once upon a time, ", n_predict=55, new_text_callback=new_text_callback, n_threads=8)
```

and got the same error, which points to an invalid file.

@ParisNeo (Owner) commented Apr 7, 2023

Yes, you are right. The model provided by Nomic-ai is not compatible with Windows. I'll ask them to provide a working version soon.

In the meantime, you can download the model and convert it yourself as presented in:
https://github.com/nomic-ai/pyllamacpp

Thanks for your understanding.
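
If you want to do the conversion by hand before an installer update lands, here is a minimal sketch driven from Python. It assumes llama.cpp has already been cloned into tmp/llama.cpp and that the downloaded model sits at the default path; the migrate script is the same one the error message itself suggests:

```python
# Sketch only: migrate the old-format model in place, keeping the original file.
# Assumes llama.cpp was cloned beforehand:
#   git clone https://github.com/ggerganov/llama.cpp.git tmp/llama.cpp
import shutil
import subprocess

model = "models/gpt4all-lora-quantized-ggml.bin"
original = model + ".original"

shutil.move(model, original)  # set the downloaded file aside
subprocess.run(
    ["python", "tmp/llama.cpp/migrate-ggml-2023-03-30-pr613.py", original, model],
    check=True,
)
```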

@ParisNeo (Owner) commented Apr 8, 2023

A new version of the installer fixes this issue.
Please confirm that it works for you and close the issue.

@zpdg commented Apr 8, 2023

> A new version of the installer fixes this issue. Please confirm that it works for you and close the issue.

I am using Ubuntu. The same error occurs: "./models/gpt4all-lora-quantized-ggml.bin: invalid model file (bad magic [got 0x67676d66 want 0x67676a74]) ... llama_init_from_file: failed to load model"

@BoQsc commented Apr 8, 2023

Windows 10 Home, Intel, NVIDIA laptop, all from a recent GitHub clone. I chose to download the model with the browser; the download was successful, but I still get the bad magic error.

llama_model_load: loading model from './models/gpt4all-lora-quantized-ggml.bin' - please wait ...
./models/gpt4all-lora-quantized-ggml.bin: invalid model file (bad magic [got 0x67676d66 want 0x67676a74])
        you most likely need to regenerate your ggml files
        the benefit is you'll get 10-100x faster load times
        see https://github.com/ggerganov/llama.cpp/issues/91
        use convert-pth-to-ggml.py to regenerate from original pth
        use migrate-ggml-2023-03-30-pr613.py if you deleted originals
llama_init_from_file: failed to load model
llama_generate: seed = 1680974915

@ngonzalezromero

macOS Ventura, Intel, same error:
```
llama_model_load: loading model from './models/gpt4all-lora-quantized-ggml.bin' - please wait ...
./models/gpt4all-lora-quantized-ggml.bin: invalid model file (bad magic [got 0x67676d66 want 0x67676a74])
        you most likely need to regenerate your ggml files
        the benefit is you'll get 10-100x faster load times
        see ggml-org/llama.cpp#91
        use convert-pth-to-ggml.py to regenerate from original pth
        use migrate-ggml-2023-03-30-pr613.py if you deleted originals
llama_init_from_file: failed to load model
llama_generate: seed = 1680976601

system_info: n_threads = 8 / 12 | AVX = 1 | AVX2 = 1 | AVX512 = 0 | FMA = 1 | NEON = 0 | ARM_FMA = 0 | F16C = 1 | FP16_VA = 0 | WASM_SIMD = 0 | BLAS = 1 | SSE3 = 1 | VSX = 0 |
./run.sh: line 43: 14830 Segmentation fault: 11 python app.py
```

@ParisNeo (Owner) commented Apr 8, 2023

For people using Windows, try running install.bat. The script downloads the right model and does the conversion and everything else for you. The model you download needs to be converted to the new llama.cpp format; take a look at the install.bat code.
I will change install.sh to do the same.

@BoQsc commented Apr 8, 2023

Windows 10

Conversion seems to fix this:
./models/gpt4all-lora-quantized-ggml.bin: invalid model file (bad magic [got 0x67676d66 want 0x67676a74])

"gpt4all-ui/convert.cmd"

@ECHO OFF
echo Converting the model to the new format
if not exist tmp/llama.cpp git clone https://github.com/ggerganov/llama.cpp.git tmp\llama.cpp
move models\gpt4all-lora-quantized-ggml.bin models\gpt4all-lora-quantized-ggml.bin.original
python tmp\llama.cpp\migrate-ggml-2023-03-30-pr613.py models\gpt4all-lora-quantized-ggml.bin.original models\gpt4all-lora-quantized-ggml.bin
echo The model file (gpt4all-lora-quantized-ggml.bin) has been fixed.

PAUSE

Then just run run.bat and have lots of patience.

@ngonzalezromero

I fixed the error by just adding these lines to run.sh:
```sh
echo "Converting the model to the new format"

if [ ! -d "tmp/llama.cpp" ]
then
    git clone https://github.com/ggerganov/llama.cpp.git tmp/llama.cpp
fi

mv models/gpt4all-lora-quantized-ggml.bin models/gpt4all-lora-quantized-ggml.bin.original

python tmp/llama.cpp/migrate-ggml-2023-03-30-pr613.py models/gpt4all-lora-quantized-ggml.bin.original models/gpt4all-lora-quantized-ggml.bin
```

@ngonzalezromero

Happy to create a new PR with this fix.
