mpt-7b-ggml generating garbled characters #272

Open
taiyou2000 opened this issue Jun 28, 2023 · 2 comments
Labels
bug Something isn't working

Comments

@taiyou2000

I tried mpt-7b-ggml-q5_1 (https://huggingface.co/TheBloke/MPT-7B-GGML) with koboldcpp (commit hash: e6ddb15) on Ubuntu 22.04. English text generates fine, but for characters in languages other than English the output is garbled, like this:

������
2.
��都市
3.
����都

And the terminal is showing: gpt_tokenize: unknown token '�'

I also tried to run MPT with PyTorch, both in Colab and on my own computer, but both ran out of memory (OOM), so I can't tell whether this is an issue on the ggml side or on the PyTorch/transformers side. I think it is on the ggml side.
I suspected a misconfigured terminal encoding, but the terminal was set to UTF-8 (ja_JP.UTF-8), so that is unlikely to be the cause.
https://github.com/ggerganov/ggml produced the same result.

A similar issue seems to have been discussed in the early llama.cpp repository: ggerganov#73
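
One possible explanation for the symptom above, not confirmed anywhere in this thread, is that a multi-byte UTF-8 character gets split across token boundaries and each token's bytes are decoded on their own. The following is a minimal Python sketch of that effect. The token boundaries in `pieces` are made up for illustration and do not reflect the MPT vocabulary or koboldcpp's actual output path.

```python
import codecs

# Sample string taken from the garbled output reported above.
text = "都市"
raw = text.encode("utf-8")  # 6 bytes, 3 bytes per character

# Hypothetical token boundaries that cut through the middle of a character.
pieces = [raw[:2], raw[2:4], raw[4:]]

# Decoding each piece on its own: incomplete byte sequences cannot be
# decoded and are replaced with U+FFFD (the replacement character).
naive = "".join(p.decode("utf-8", errors="replace") for p in pieces)

# An incremental decoder buffers incomplete sequences until the remaining
# bytes arrive, so the original characters survive.
decoder = codecs.getincrementaldecoder("utf-8")(errors="replace")
streamed = "".join(decoder.decode(p) for p in pieces)
streamed += decoder.decode(b"", final=True)

print(repr(naive))
print(repr(streamed))
```

If this is what is happening, buffering bytes until a full character is available would avoid the replacement characters. Whether ggml or koboldcpp actually needs such a change is not established here.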

@h3ndrik

h3ndrik commented Jun 28, 2023

I think there is still something wrong with MPT. The larger MPT-30b also output weird characters here and there for me. And yesterday koboldcpp kept crashing with a segmentation fault while generating close to and beyond 2048 tokens. Prompt ingestion worked fine. Also, I'm not sure how intelligent it's supposed to be. Either there is something wrong with it, or it's considerably less 'good' than a llama-based model.
(EDIT: The segmentation fault was my own fault: I missed the --contextsize parameter. Maybe we need to check whether the user gives contradictory values on the CLI and in KoboldAI Lite.)
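
The kind of check suggested in the EDIT could look roughly like the sketch below. This is not koboldcpp code: the function name `check_generation_request` and its parameters are hypothetical, and it only assumes that the prompt plus the requested generation length must fit in the configured context size.

```python
def check_generation_request(context_size: int,
                             prompt_tokens: int,
                             max_new_tokens: int) -> None:
    """Raise ValueError if the request cannot fit in the context window.

    Hypothetical helper: all three parameters are illustrative names,
    not actual koboldcpp or KoboldAI Lite settings.
    """
    if context_size <= 0:
        raise ValueError("context_size must be positive")
    total = prompt_tokens + max_new_tokens
    if total > context_size:
        raise ValueError(
            f"prompt ({prompt_tokens}) + generation ({max_new_tokens}) = "
            f"{total} tokens exceeds context size {context_size}"
        )


# A request that fits passes silently.
check_generation_request(context_size=2048, prompt_tokens=1500, max_new_tokens=512)
```

Rejecting the request up front would turn a crash into a readable error message, at the cost of refusing requests that a context-shifting scheme could otherwise have handled.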

@LostRuins LostRuins added the bug Something isn't working label Jun 28, 2023
@e576082c

Interesting! I think I had a similar problem with TheBloke/PULI-GPT-3SX-GGML (puli-gpt-3sx.ggmlv1.q8_0.bin).

It generates garbled characters, gives the unknown token '�' error, and generally produces only nonsense, regardless of the temperature or sampling preset used. I decided it wasn't worth the effort and gave up on using it.
