-
Notifications
You must be signed in to change notification settings - Fork 177
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support for SentenceTransformers with deepsparse.sentence_transformers.SentenceTransformer
#1301
Conversation
…s.SentenceTransformer`
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM overall - assuming a readme will come separately. will fit in nicely with the pipelines refactor - should be able to swap in optimum.deepsparse
the way we swap in any other engine to our pipelines
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM pending coment
Direct replacement for SentenceTransformer implemented using DeepSparse and optimum-deepsparse
Performance optimizations with bucketing/batching will come in future work
Smoke Test:
Full test through evaluation with MTEB
Output: