-
Notifications
You must be signed in to change notification settings - Fork 364
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unable to get a simple example to work with Llamasharp and Semantic Kernel #186
Comments
Hi, using your code as source, I had to change the |
@negatron99 thanks for spotting this. In your opinion, is the performance acceptable? For the second question, even setting the minrelevance to 0.1, does not result in correct memory being recalled. Is this a model issue, or an issue related to SK integration? |
@bancroftway I got that too, the questions often returned with the expected answer in 2nd place. I'm running an AI locally, and I think that has a severe impact because of the size of model I'm using. (note: I'm new to this) |
@xbotter Could you please have a quick review for this issue to see if it's a problem of semantic-kernel integration or LLamaSharp it self? |
I think it is closely related to the model. I tried the following 5 models, and the similarity results obtained are as follows.
As a comparison, here are the results generated using OpenAI.
I am using LLamaSharp.semantic-kernel 0.7.0, and Microsoft.SemanticKernel.Core 1.0.1-beta.Compared to version 0.5.1, there is not much change in terms of logic. |
There're different optimization strategies on cpu and cuda so that slight difference is okay. @bancroftway @negatron99 Would you like to try it with v0.7.0. I remember that there were some fix in semantic-kernel integration package after v0.5.1. |
This is another comparison result from the bge-large-en-v1.5 model.
The result seems relatively better. But the model is base BERT, unable to convert into a usable format for llama.cpp at the moment. I used bert.cpp for processing. |
I am using packages LLamasharp 0.5.1, LLamaSharp.semantic-kernel 0.5.1, and Microsoft.SemanticKernel.Core 0.24.230918.1-preview. In a very simple example inspired by this video, I am unable to get any results. Please advise what am I doing wrong?
The text was updated successfully, but these errors were encountered: