Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add olmoe and granitemoe architecture to whitelist #3449

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

ThiloteE
Copy link
Collaborator

@ThiloteE ThiloteE commented Jan 31, 2025

Describe your changes

Those two model architectures are supported in llama.cpp and technically in GPT4All 3.8.0 as well, but they have to be whitelisted to make them accessible to users, so do it.

Here are the models:

They are really small and not the brightest, but very fast. I am hoping to make use of them for sentence similarity search with the LocalDocs feature. There was also a user that wanted to have tiny LLMs for educational purposes in Africa, because of hardware constraints.

Because they are so fast and tiny and not the brightest, not having them supported on the compute backend could be argued to be justified. They are compatible with the Cuda backend though, so that's good.

Checklist before requesting a review

  • I have performed a self-review of my code.
  • If it is a core feature, I have added thorough tests.
  • I have added thorough documentation for my code.
  • I have tagged PR with relevant project labels. I acknowledge that a PR without labels may be dismissed.
  • If this PR addresses a bug, I have provided both a screenshot/video of the original bug and the working solution.

Those two model architectures are supported in llama.cpp and technically in GPT4All 3.8.0 as well, but they have to be whitelisted to make them accessible to users, so do it.

Here are the models:
- https://huggingface.co/allenai/OLMoE-1B-7B-0924-Instruct
- https://huggingface.co/ibm-granite/granite-3.1-3b-a800m-instruct

They are really small and not the brightest, but very fast. I am hoping to make use of them for sentence similarity search with the LocalDocs feature. There was also a user that wanted to have tiny LLMs for educational purposes in Africa, because of hardware constraints.

Signed-off-by: ThiloteE <[email protected]>
@ThiloteE ThiloteE added the model-architecture A request to add support for a new model architecture. label Jan 31, 2025
@ThiloteE ThiloteE requested a review from manyoso February 2, 2025 16:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
model-architecture A request to add support for a new model architecture.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant