
changelog : libllama API #9289

Open
ggerganov opened this issue Sep 3, 2024 · 4 comments
Labels
documentation Improvements or additions to documentation

ggerganov (Owner) commented Sep 3, 2024

Overview

This is a list of changes to the public interface of the llama library. Collaborators are encouraged to edit this post so that it reflects important API changes that end up merged into the master branch.

If you are building a third-party project that relies on libllama, it is recommended to follow this issue and to check it before upgrading to new versions.

See also:

Recent API changes (most recent at the top)

| version | PR | description |
| --- | --- | --- |
| TBD | #11063 | Update `llama_model` API naming |
| b4357 | #10784 | Remove `llama_model_get_tensor()` |
| b4337 | #10803 | Change `llama_sampler_init_penalties()` |
| b4282 | #10446 | Remove support for Q4_0_N_M model files in favor of automatic repacking of Q4_0 |
| b4167 | #10497 | Add `devices` to `llama_model_params` |
| b3988 | #10071 | Remove Tail-Free sampling |
| b3948 | #9897 | Deprecate softmax sampler and update dist sampler |
| b3943 | #9745 | Remove `all_pos_0`, `all_pos_1`, `all_seq_id` from `llama_batch` |
| b3908 | #9798 | Update FIM-related API |
| b3841 | #9510 | Add `LLAMA_POOLING_TYPE_RANK` |
| b3774 | #9512 | Add `llama_n_head()` |
| b3750 | #9355 | Add `llama_perf` API + param to disable internal profiling |
| b3749 | #9445 | Add `llama_sampler_chain_remove()` |
| b3681 | #9294 | Major changes to the sampling API (see PR; a brief sketch follows the table) |
| b3651 | #8980 | Add `LLAMA_VOCAB_TYPE_RWKV` enum value |
| b3644 | #8672 | Add `llama_threadpool` API + change `uint32_t` -> `int32_t` |
| b3614 | #8526 | Add `llama_model_is_recurrent()` |
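
As a rough illustration of the sampler-chain interface introduced in #9294 and extended by #9445, here is a minimal sketch in C. Function names are taken from `llama.h` around those builds and may have shifted in later releases, so treat this as a sketch to verify against your header rather than a definitive example:

```c
#include "llama.h"

// Build a sampler chain (top-k -> temperature -> final distribution sampling)
// and draw one token. Assumes `ctx` already holds logits from llama_decode().
llama_token sample_next(struct llama_context * ctx) {
    struct llama_sampler_chain_params sparams = llama_sampler_chain_default_params();
    struct llama_sampler * chain = llama_sampler_chain_init(sparams);

    llama_sampler_chain_add(chain, llama_sampler_init_top_k(40));
    llama_sampler_chain_add(chain, llama_sampler_init_temp(0.8f));
    llama_sampler_chain_add(chain, llama_sampler_init_dist(LLAMA_DEFAULT_SEED));

    // idx = -1 samples from the logits of the last token in the batch
    const llama_token id = llama_sampler_sample(chain, ctx, -1);

    llama_sampler_free(chain);
    return id;
}
```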

For older changes, use:

git log --oneline -p b3614 -- include/llama.h

(For collaborators) To map between PR numbers and build numbers:

git log --oneline | tail -r | nl
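
Note that `tail -r` is a BSD/macOS option; on GNU systems the same reversal is available as `tac`, e.g. `git log --oneline | tac | nl`.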

Upcoming API changes

  • TBD
ggerganov added the documentation label Sep 3, 2024
ggerganov pinned this issue Sep 3, 2024
ggerganov (Owner) commented

#9355 restores the functionality for getting performance measurements from within libllama (which was removed in #9294) via a new `llama_perf` API. `llama_context_params` is extended with a new `bool no_perf` parameter that can be used to disable the internal timings during libllama compute.
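
As a rough usage sketch (function and field names as they appear in `llama.h` around b3750; later changes such as the #11063 renaming may differ, so check your header):

```c
#include <stdio.h>
#include "llama.h"

// Create a context with internal profiling enabled, then query the timings.
// Assumes `model` is an already-loaded llama_model.
void report_perf(struct llama_model * model) {
    struct llama_context_params cparams = llama_context_default_params();
    cparams.no_perf = false; // set to true to skip timing collection during compute

    struct llama_context * ctx = llama_new_context_with_model(model, cparams);

    // ... tokenize and llama_decode() as usual ...

    struct llama_perf_context_data pd = llama_perf_context(ctx);
    printf("eval: %d tokens in %.2f ms\n", pd.n_eval, pd.t_eval_ms);

    llama_perf_context_print(ctx); // or let libllama format the full report

    llama_free(ctx);
}
```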

ddh0 (Contributor) commented Jan 3, 2025

Looks like `llama_model_get_tensor` was removed from the API, but that change was not documented here.

ggerganov (Owner) commented

> Looks like `llama_model_get_tensor` was removed from the API, but that change was not documented here.

I didn't expect that this function was being used by anyone, so I skipped updating the changelog. It's updated now.

Btw, what do you use this call for?

ddh0 (Contributor) commented Jan 5, 2025

> I didn't expect that this function was being used by anyone, so I skipped updating the changelog. It's updated now.
>
> Btw, what do you use this call for?

I don't use it personally, but the function was included in my Python code; I started to get ctypes "symbol not found" errors and had to do some digging to figure out why. No worries!
