fix: use `vm_allocate` instead of `posix_memalign` for Metal on macOS #7078

giladgd · 2024-05-04T21:36:05Z

Creating a context in an Electron app using node-llama-cpp crashes the process with some models (issue), so I've investigated what's happening and found that allocating a large memory block using posix_memalign is the culprit.

For some reason, it happens only on Electron and not on Nodejs, but I couldn't figure out why.

From my testings in Electron:

posix_memalign((void **) &data, 16384, 587218944) - works fine
posix_memalign((void **) &data, 16384, 1073741824) - crashes the process with SIGTRAP

Tested on an M1 Max machine with 32GB of RAM

I tried switching from posix_memalign to malloc in ggml-metal.m, and it seems that everything still works correctly, but maybe I'm missing something.
I assume that posix_memalign is used there for a reason, but since it seems to me that everything still works great with malloc, maybe the original reason for using posix_memalign is irrelevant by now?

I'm not sure whether the change I made in this PR is a good idea, so I opened it so someone more knowledgable in this area can take a look.

This may be a bug specific to Electron, so I shared my findings on the Electron repo, but since I haven't noticed any side effect of this workaround in llama.cpp, I think it may be a good idea to also solve this issue here.

…ke it not crash Electron proccesses

slaren · 2024-05-04T22:05:30Z

Metal requires a page-aligned pointer, which is why posix_memalign is used.

giladgd · 2024-05-04T22:51:58Z

I've switched to use vm_allocate instead since I found that this is what the Apple documentation recommends, and it seems to also fix the issue with Electron.

vm_allocate also allocates page-aligned memory.
Is the new change I made ok?

…al_host_malloc` returns `NULL`

giladgd · 2024-05-07T21:30:31Z

I've been using this for the past few days, and it seems to work great.
I've seen that all the tests passed, so I think this may be a good solution to the Electron issue without affecting other things.

giladgd added 2 commits May 5, 2024 00:06

fix: use malloc instead of posix_memalign in ggml-metal.m to ma…

571dca5

…ke it not crash Electron proccesses

fix: typo

a53e517

fix: use vm_allocate instead of posix_memalign

bfa4dae

giladgd added 2 commits May 5, 2024 01:56

fix: don't call newBufferWithBytesNoCopy with NULL when `ggml_met…

a92efec

…al_host_malloc` returns `NULL`

fix: use vm_allocate only on macOS

78214ac

giladgd changed the title ~~fix: workaround to not crash when running in Electron~~ fix: use vm_allocate instead of posix_memalign for Metal on macOS May 5, 2024

slaren approved these changes May 7, 2024

View reviewed changes

ggerganov merged commit 26458af into ggml-org:master May 8, 2024

giladgd deleted the metalPosixMemalignWorkaround branch May 8, 2024 20:19

giladgd mentioned this pull request May 8, 2024

feat: split gguf files support withcatai/node-llama-cpp#214

Merged

7 tasks

beshkenadze mentioned this pull request May 9, 2024

[Bug]: [macos] Electron or child_process/worker crashes when using Metal API electron/electron#41513

Closed

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: use `vm_allocate` instead of `posix_memalign` for Metal on macOS #7078

fix: use `vm_allocate` instead of `posix_memalign` for Metal on macOS #7078

Uh oh!

giladgd commented May 4, 2024 •

edited

Loading

Uh oh!

slaren commented May 4, 2024

Uh oh!

giladgd commented May 4, 2024 •

edited

Loading

Uh oh!

giladgd commented May 7, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fix: use vm_allocate instead of posix_memalign for Metal on macOS #7078

fix: use vm_allocate instead of posix_memalign for Metal on macOS #7078

Uh oh!

Conversation

giladgd commented May 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

slaren commented May 4, 2024

Uh oh!

giladgd commented May 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

giladgd commented May 7, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fix: use `vm_allocate` instead of `posix_memalign` for Metal on macOS #7078

fix: use `vm_allocate` instead of `posix_memalign` for Metal on macOS #7078

giladgd commented May 4, 2024 •

edited

Loading

giladgd commented May 4, 2024 •

edited

Loading