fix: use `vm_allocate` instead of `posix_memalign` for Metal on macOS #7078

giladgd · 2024-05-04T21:36:05Z

Creating a context in an Electron app using node-llama-cpp crashes the process with some models (issue), so I've investigated what's happening and found that allocating a large memory block using posix_memalign is the culprit.

For some reason, it happens only on Electron and not on Nodejs, but I couldn't figure out why.

From my testings in Electron:

posix_memalign((void **) &data, 16384, 587218944) - works fine
posix_memalign((void **) &data, 16384, 1073741824) - crashes the process with SIGTRAP

Tested on an M1 Max machine with 32GB of RAM

I tried switching from posix_memalign to malloc in ggml-metal.m, and it seems that everything still works correctly, but maybe I'm missing something.
I assume that posix_memalign is used there for a reason, but since it seems to me that everything still works great with malloc, maybe the original reason for using posix_memalign is irrelevant by now?

I'm not sure whether the change I made in this PR is a good idea, so I opened it so someone more knowledgable in this area can take a look.

This may be a bug specific to Electron, so I shared my findings on the Electron repo, but since I haven't noticed any side effect of this workaround in llama.cpp, I think it may be a good idea to also solve this issue here.

…ke it not crash Electron proccesses

slaren · 2024-05-04T22:05:30Z

Metal requires a page-aligned pointer, which is why posix_memalign is used.

giladgd · 2024-05-04T22:51:58Z

I've switched to use vm_allocate instead since I found that this is what the Apple documentation recommends, and it seems to also fix the issue with Electron.

vm_allocate also allocates page-aligned memory.
Is the new change I made ok?

…al_host_malloc` returns `NULL`

giladgd · 2024-05-07T21:30:31Z

I've been using this for the past few days, and it seems to work great.
I've seen that all the tests passed, so I think this may be a good solution to the Electron issue without affecting other things.

giladgd added 2 commits May 5, 2024 00:06

fix: use malloc instead of posix_memalign in ggml-metal.m to ma…

571dca5

…ke it not crash Electron proccesses

fix: typo

a53e517

fix: use vm_allocate instead of posix_memalign

bfa4dae

giladgd added 2 commits May 5, 2024 01:56

fix: don't call newBufferWithBytesNoCopy with NULL when `ggml_met…

a92efec

…al_host_malloc` returns `NULL`

fix: use vm_allocate only on macOS

78214ac

giladgd changed the title ~~fix: workaround to not crash when running in Electron~~ fix: use vm_allocate instead of posix_memalign for Metal on macOS May 5, 2024

slaren approved these changes May 7, 2024

View reviewed changes

ggerganov merged commit 26458af into ggml-org:master May 8, 2024
58 checks passed

giladgd deleted the metalPosixMemalignWorkaround branch May 8, 2024 20:19

giladgd mentioned this pull request May 8, 2024

feat: split gguf files support withcatai/node-llama-cpp#214

Merged

7 tasks

beshkenadze mentioned this pull request May 9, 2024

[Bug]: [macos] Electron or child_process/worker crashes when using Metal API electron/electron#41513

Closed

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: use `vm_allocate` instead of `posix_memalign` for Metal on macOS #7078

fix: use `vm_allocate` instead of `posix_memalign` for Metal on macOS #7078

giladgd commented May 4, 2024 •

edited

Loading

slaren commented May 4, 2024

giladgd commented May 4, 2024 •

edited

Loading

giladgd commented May 7, 2024

fix: use vm_allocate instead of posix_memalign for Metal on macOS #7078

fix: use vm_allocate instead of posix_memalign for Metal on macOS #7078

Conversation

giladgd commented May 4, 2024 • edited Loading

slaren commented May 4, 2024

giladgd commented May 4, 2024 • edited Loading

giladgd commented May 7, 2024

fix: use `vm_allocate` instead of `posix_memalign` for Metal on macOS #7078

fix: use `vm_allocate` instead of `posix_memalign` for Metal on macOS #7078

giladgd commented May 4, 2024 •

edited

Loading

giladgd commented May 4, 2024 •

edited

Loading