Performance degrading over time #832

Closed
vashat opened this issue Apr 7, 2023 · 1 comment

vashat commented Apr 7, 2023

Expected Behavior

When running this command:

./main -i --interactive-first -r "### Human:" --temp 0 -c 2048 -n -1 --ignore-eos --repeat_penalty 1.2 --threads 4 --instruct -m models/ggml-vicuna-13b-4bit.bin

I expect the performance to be the same over time when the model is answering my questions.

Current Behavior

The performance is good in the beginning: answers are written out quickly and 4 CPU cores are fully utilized. Over time, however, the speed degrades until output slows to about one word every 30 seconds while the CPU cores sit idle.

Environment and Context

Apple M1 Mac Mini 16GB RAM. Ventura 13.3.

Python 3.8.13
GNU Make 3.81
Apple clang version 14.0.0 (clang-1400.0.29.202)

numpy                         1.23.4
rotary-embedding-torch        0.2.1
sentencepiece                 0.1.97
torch                         2.1.0.dev20230307
torchaudio                    2.0.0.dev20230307
torchvision                   0.15.0.dev20230307

Steps to Reproduce

Ask questions for a while. The speed should degrade after about 10 questions that require longer answers.

vashat changed the title from "[User] Insert summary of your issue or enhancement.." to "Performance degrading over time" on Apr 7, 2023

vashat commented Apr 7, 2023

Found the solution here: #767. I needed to add the --mlock parameter.
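
For anyone landing here with the same symptom, a minimal sketch of the original invocation with --mlock added is below. The flag asks the OS to keep the model in RAM rather than swapping or compressing it. The ./main binary and the model path are taken from the report above and are assumed to exist locally; the fallback branch that only prints the command is an addition for illustration and is not part of llama.cpp.

```shell
# Original arguments from the report, with --mlock added before -m.
# Assumption: ./main and models/ggml-vicuna-13b-4bit.bin exist in the current directory.
set -- -i --interactive-first -r "### Human:" --temp 0 -c 2048 -n -1 \
  --ignore-eos --repeat_penalty 1.2 --threads 4 --instruct \
  --mlock -m models/ggml-vicuna-13b-4bit.bin

if [ -x ./main ]; then
  # Run the interactive session with the model locked in memory.
  ./main "$@"
else
  # Binary not found: only show the command that would be run.
  printf '%s\n' "./main $*"
fi
```

Locking a 13B 4-bit model needs enough free RAM to hold it entirely, so on a 16GB machine it may help to close other memory-heavy applications first.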

vashat closed this as completed on Apr 7, 2023