add mlx support #1089
Conversation
Documentation for this PR has been built. You can view it at: https://distilabel.argilla.io/pr-1089/

CodSpeed Performance Report: Merging #1089 will not alter performance.
update mlx
Let's also add the import of MlxLLM to src/distilabel/llms.py to avoid confusion until we deprecate it in 1.7.0.
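A minimal sketch of what that backwards-compatible re-export could look like (the deprecation warning and its exact wording are assumptions, not the PR's actual diff):

```python
# src/distilabel/llms.py -- backwards-compatible alias module, kept until 1.7.0.
# Sketch only: the real module may re-export more names or phrase the warning differently.
import warnings

from distilabel.models.llms import MlxLLM  # noqa: F401

warnings.warn(
    "Importing from `distilabel.llms` is deprecated and will be removed in 1.7.0; "
    "import from `distilabel.models.llms` instead.",
    DeprecationWarning,
    stacklevel=2,
)
```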
- Introduced MlxLLM class in mlx.py, integrating it into the llms module.
- Updated output preparation logic to include token computation and logprobs in utils.py.
- Modified __init__.py to export MlxLLM.
- Enhanced type annotations for tokenizer_config and model_config in MlxLLM.
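For context, a sketch of what the updated output-preparation helper in utils.py might look like; the function name `prepare_output`, its signature, and the `statistics` layout are assumptions inferred from the commit message, not the PR's actual code:

```python
from typing import Any, Dict, List, Optional


def prepare_output(
    generations: List[str],
    input_tokens: Optional[List[int]] = None,
    output_tokens: Optional[List[int]] = None,
    logprobs: Optional[List[Any]] = None,
) -> Dict[str, Any]:
    """Bundle generations with token counts and, optionally, logprobs."""
    output: Dict[str, Any] = {
        "generations": generations,
        "statistics": {
            "input_tokens": input_tokens or [],
            "output_tokens": output_tokens or [],
        },
    }
    if logprobs:
        output["logprobs"] = logprobs
    return output
```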
for more information, see https://pre-commit.ci
- Imported LlamaCppEmbeddings in models/__init__.py.
- Added LlamaCppEmbeddings to the __all__ exports in both models/__init__.py and embeddings/__init__.py.
- Removed duplicate entry of LlamaCppEmbeddings from embeddings/__init__.py exports.
Closes #995
Use it individually
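A minimal standalone usage sketch (the `path_or_hf_repo` argument and the quantized model id follow mlx-lm conventions and are assumptions, not taken from this PR):

```python
from distilabel.models.llms import MlxLLM

# Load a quantized model from the mlx-community Hub organization.
llm = MlxLLM(path_or_hf_repo="mlx-community/Meta-Llama-3.1-8B-Instruct-4bit")
llm.load()

# Generate a completion for a single chat-formatted input.
output = llm.generate_outputs(
    inputs=[[{"role": "user", "content": "What is MLX?"}]],
)
print(output)
```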
Use it with magpie
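And a sketch of pairing it with the Magpie task; the `use_magpie_template` and `magpie_pre_query_template` parameters mirror distilabel's other Magpie-capable LLMs, so treat the exact values as assumptions:

```python
from distilabel.models.llms import MlxLLM
from distilabel.steps.tasks import Magpie

llm = MlxLLM(
    path_or_hf_repo="mlx-community/Meta-Llama-3.1-8B-Instruct-4bit",
    use_magpie_template=True,
    magpie_pre_query_template="llama3",
)

magpie = Magpie(llm=llm, n_turns=1)
magpie.load()

# Magpie synthesizes the user query itself, so the input only needs a system prompt.
result = next(
    magpie.process(inputs=[{"system_prompt": "You are a helpful assistant."}])
)
print(result)
```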
It is relatively easy to spin up an mlx server, but no public Python API clients are available except for LangChain (https://python.langchain.com/docs/integrations/chat/mlx/). Currently, the OpenAI API client does not align with the payloads of either the chat or the text generation endpoint.