
ggml : skip register metal backend on os simulator #10132

Open: wants to merge 1 commit into master from fix-ios-disable-gpu
Conversation

@jhen0409 (Collaborator) commented Nov 2, 2024

Fix for #10089.

jhen0409 added a commit to a-ghorbani/llama.rn that referenced this pull request Nov 2, 2024
jhen0409 added a commit to mybigday/llama.rn that referenced this pull request Nov 2, 2024
* feat: sync llama.cpp

* fix: fix submodule update - as part of llama.cpp sync

* chore: remove unnecessary comment

* chore(example): revert unnecessary changes

* feat: sync llama.cpp

* fix: remove tfs_z

ref: ggerganov/llama.cpp#10071

* fix(cpp): skip gpu device if n_gpu_layers <= 0

ref: ggerganov/llama.cpp#10132

---------

Co-authored-by: Jhen-Jie Hong <[email protected]>
@ggerganov (Owner) left a comment


This is not a good solution - we should avoid special-casing backends like this from now on. In theory, even with ngl == 0, a backend can still be utilized to offload very heavy compute ops (for example, like we do with large batches with the CUDA backend).

@jhen0409 (Collaborator, Author) commented Nov 2, 2024

> This is not a good solution - we should avoid special-casing backends like this from now on. In theory, even with ngl == 0, a backend can still be utilized to offload very heavy compute ops (for example, like we do with large batches with the CUDA backend).

Got it. Maybe just skipping the Metal backend registration in the simulator will be enough to fix this issue.

For disabling the device, it looks like the TODO will be a better approach.

@jhen0409 force-pushed the fix-ios-disable-gpu branch from 7ef6580 to cd457dc on November 2, 2024 10:32
@github-actions bot added the ggml label (changes relating to the ggml tensor library for machine learning) on Nov 2, 2024
@jhen0409 jhen0409 changed the title llama : skip metal device if n_gpu_layers <= 0 ggml : skip register metal backend on os simulator Nov 2, 2024