Llama 3.1 update binaries #874
Conversation
…hen referring to the C++ - Exposed `KvCacheMaxPosition` on `SafeLLamaContextHandle`
Tests passed on Linux CUDA. Not sure if we need to test Linux/macOS CPU, as that should be covered by CI.
Does the macOS CI runner have Metal, or is it CPU only?
Only CPU. The Metal emulation fails.
It works on macOS with Metal.
Tested
Excellent, thanks for testing that @SignalRT. I'll try to run through the release process tonight or tomorrow.
Resolved conflicts, will merge once CI completes.
Update to new binaries for llama.cpp 345c8c0c87a97c1595f9c8b14833d531c8c7d8df, built with this build action. This should include the RoPE fixes for llama 3.1.

Testing: