feat(option): to use different samplers #20
Seems to be working as before. The new sampling parameter defaults in llama.cpp are:

```cpp
tfs_z             = 1.0f;
typical_p         = 1.0f;
frequency_penalty = 0.0f;
presence_penalty  = 0.0f;
mirostat          = 0;    // 0 = disabled, 1 = Mirostat v1, 2 = Mirostat v2
mirostat_tau      = 5.0f;
mirostat_eta      = 0.1f;
```

Might as well add those in and try.
These new ones are disabled by default. Issue #20
Hrm... for mirostat, it looks like we need to remember a …
This will make it easier to maintain other state variables. Issue #20
ggml-org/llama.cpp#1126 introduced some new ones. Right now, we use repetition penalty. It does a decent job of avoiding repeated content for a while, but it's certainly not perfect. For example, a large window penalizes a lot of punctuation and causes run-on sentences. We can already change this by excluding tokens from the penalized list, but it's a balancing act that I'm not very good at.
First order of business is to get repeat_penalty working with the new API.