Added Whisper.net x LLamaSharp examples for Speech Chat #656
Overall this looks great, apart from some minor issues. Thanks a lot for your contribution!
In addition to the comments, could you please print a brief description of this example when the user runs it? For example: "This example shows how to recognize the voice and then get a response from the LLM. It uses whisper.net, and models can be found in....... To use it, please turn on your microphone and speak to it, then......"
Merging this PR now, thank you for the contribution!
I love you, but where do I download the llama models? An automatic model downloader would be extremely useful and nice.
@Mrgaton Thanks :) You mean the quantized language models? Those can be obtained from Hugging Face. You can either download a prequantized model (GGUF), or take a non-quantized version (FP32, FP16, BF16) and convert it to GGUF with the conversion script from llama.cpp: https://github.com/ggerganov/llama.cpp/blob/master/convert.py Here are some GGUF quants for LLaMA3-8B-Instruct: https://huggingface.co/bartowski/Meta-Llama-3-8B-Instruct-GGUF
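For readers who want to script the "download a prequantized GGUF" route, here is a minimal sketch using only the Python standard library. It assumes the usual Hugging Face direct-download URL layout (`/<repo>/resolve/<revision>/<file>`). The helper names `gguf_url` and `download_gguf` are hypothetical, and so is the file name in the example; check the repository's file list for the quant you actually want. This is not the model downloader being discussed for LLamaSharp.

```python
import shutil
import urllib.request
from pathlib import Path
from urllib.parse import quote

HF_BASE = "https://huggingface.co"


def gguf_url(repo_id: str, filename: str, revision: str = "main") -> str:
    """Build the direct-download URL for one file in a Hugging Face model repo."""
    # quote() keeps "/" by default, so "owner/repo" stays intact.
    return f"{HF_BASE}/{quote(repo_id)}/resolve/{quote(revision, safe='')}/{quote(filename)}"


def download_gguf(repo_id: str, filename: str, dest_dir: str = "models") -> Path:
    """Download the file into dest_dir unless it is already there; return its path."""
    dest = Path(dest_dir) / filename
    if dest.exists():
        return dest  # already downloaded, skip the multi-gigabyte transfer
    dest.parent.mkdir(parents=True, exist_ok=True)
    # Write to a temporary name first so an interrupted download
    # never leaves a truncated file under the final name.
    tmp = dest.parent / (dest.name + ".part")
    with urllib.request.urlopen(gguf_url(repo_id, filename)) as resp, open(tmp, "wb") as out:
        shutil.copyfileobj(resp, out)
    tmp.replace(dest)
    return dest


if __name__ == "__main__":
    # Only prints the URL; call download_gguf(...) yourself to fetch the file.
    # The file name below is an assumed example, not confirmed by this thread.
    print(gguf_url("bartowski/Meta-Llama-3-8B-Instruct-GGUF",
                   "Meta-Llama-3-8B-Instruct-Q4_K_M.gguf"))
```

The path returned by `download_gguf` is the model file you would then point LLamaSharp at when loading the model.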
I managed to find a quantized Llama 3 8B model, but it is horrible; it is worse than GPT-3.5.
It is also extremely slow, and I don't know why it's not using my GPU.
@Mrgaton Well, GPT-3.5 is better than any released open-source model atm :p You've set the bar too high.
The model downloading feature is on the way now. :)
Added whisper.net usage example & usage in combination with LlamaSharp.
Note that while the example is named 'Speech Chat', it actually uses non-instruct prompting. Let me know your thoughts!
Also, I added a note on where to download the models from, even though Whisper.net supports automatic downloading.
This could be embedded by downloading the 150 MB base model and copying it to build, but it's up to you to decide.
Videos:
LlamaSharp-Speech.Chat.mp4
LlamaSharp-Audio.Model.Prompt.mp4