Commit

Add Mistral Small 3 model (#3308)
yifanmai authored Feb 3, 2025
1 parent 228e0f1 commit 2401e5e
Showing 3 changed files with 22 additions and 0 deletions.
7 changes: 7 additions & 0 deletions src/helm/config/model_deployments.yaml
@@ -1666,6 +1666,13 @@ model_deployments:
    client_spec:
      class_name: "helm.clients.mistral_client.MistralAIClient"

  - name: mistralai/mistral-small-2501
    model_name: mistralai/mistral-small-2501
    tokenizer_name: mistralai/Mistral-Small-24B-Instruct-2501
    max_sequence_length: 32000
    client_spec:
      class_name: "helm.clients.mistral_client.MistralAIClient"

  - name: mistralai/mistral-medium-2312
    model_name: mistralai/mistral-medium-2312
    tokenizer_name: mistralai/Mistral-7B-v0.1
9 changes: 9 additions & 0 deletions src/helm/config/model_metadata.yaml
@@ -2134,6 +2134,15 @@ models:
    release_date: 2024-09-18
    tags: [TEXT_MODEL_TAG, LIMITED_FUNCTIONALITY_TEXT_MODEL_TAG, INSTRUCTION_FOLLOWING_MODEL_TAG]

  - name: mistralai/mistral-small-2501
    display_name: Mistral Small 3 (2501)
    description: Mistral Small 3 (2501) is a pre-trained and instructed model catered to the '80%' of generative AI tasks—those that require robust language and instruction following performance, with very low latency. ([blog](https://mistral.ai/news/mistral-small-3/))
    creator_organization_name: Mistral AI
    access: open
    num_parameters: 23600000000
    release_date: 2025-01-30
    tags: [TEXT_MODEL_TAG, LIMITED_FUNCTIONALITY_TEXT_MODEL_TAG, INSTRUCTION_FOLLOWING_MODEL_TAG]

  - name: mistralai/mistral-medium-2312
    display_name: Mistral Medium (2312)
    description: Mistral is a transformer model that uses Grouped-Query Attention (GQA) and Sliding-Window Attention (SWA).
6 changes: 6 additions & 0 deletions src/helm/config/tokenizer_configs.yaml
@@ -484,6 +484,12 @@ tokenizer_configs:
    end_of_text_token: "</s>"
    prefix_token: "<s>"

  - name: mistralai/Mistral-Small-24B-Instruct-2501
    tokenizer_spec:
      class_name: "helm.tokenizers.huggingface_tokenizer.HuggingFaceTokenizer"
    end_of_text_token: "</s>"
    prefix_token: "<s>"

  # Nectec
  - name: nectec/OpenThaiLLM-Prebuilt-7B
    tokenizer_spec:
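
The three hunks above only work together if the names line up: the deployment's model_name has to match an entry in model_metadata.yaml, and its tokenizer_name has to match an entry in tokenizer_configs.yaml. A minimal sketch of that cross-check follows. The function check_deployment and the hand-copied dicts are hypothetical and written for illustration; they are not part of HELM, and the sketch does not load the real YAML files.

```python
# Hypothetical cross-check, not a HELM API. Values are copied by hand
# from the three entries added in this commit.
deployment = {
    "name": "mistralai/mistral-small-2501",
    "model_name": "mistralai/mistral-small-2501",
    "tokenizer_name": "mistralai/Mistral-Small-24B-Instruct-2501",
    "max_sequence_length": 32000,
}
metadata_names = {"mistralai/mistral-small-2501"}
tokenizer_names = {"mistralai/Mistral-Small-24B-Instruct-2501"}


def check_deployment(deployment, metadata_names, tokenizer_names):
    """Return a list of problems; an empty list means the names line up."""
    problems = []
    if deployment["model_name"] not in metadata_names:
        problems.append(f"no model metadata for {deployment['model_name']}")
    if deployment["tokenizer_name"] not in tokenizer_names:
        problems.append(f"no tokenizer config for {deployment['tokenizer_name']}")
    if deployment["max_sequence_length"] <= 0:
        problems.append("max_sequence_length must be positive")
    return problems


problems = check_deployment(deployment, metadata_names, tokenizer_names)
print(problems or "deployment, metadata and tokenizer entries are consistent")
```

With the values from this commit the list of problems is empty. A typo in tokenizer_name would surface as a single "no tokenizer config" entry.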
