Local LLM-assisted text completion extension for VS Code
TODO: image
TODO: gif
- Auto-suggest on cursor movement
- Toggle the suggestion manually by pressing `Ctrl+L`
- Accept a suggestion with `Tab`
- Accept the first line of a suggestion with `Shift+Tab`
- Accept the next word with `Ctrl+Right`
- Control max text generation time
- Configure scope of context around the cursor
- Ring context with chunks from open and edited files and yanked text
- Supports very large contexts even on low-end hardware via smart context reuse
- Display performance stats
TODO: write instructions
The plugin requires a llama.cpp server instance to be running at the configured endpoint:
TODO: add an image of the config
On macOS, llama.cpp can be installed with Homebrew: `brew install llama.cpp`. On other platforms, either build it from source or use the latest binaries: https://github.com/ggerganov/llama.cpp/releases
Here are recommended settings, depending on the amount of VRAM that you have:
- More than 16GB VRAM:

      llama-server \
          -hf ggml-org/Qwen2.5-Coder-7B-Q8_0-GGUF \
          --port 8012 -ngl 99 -fa -ub 1024 -b 1024 -dt 0.1 \
          --ctx-size 0 --cache-reuse 256

- Less than 16GB VRAM:

      llama-server \
          -hf ggml-org/Qwen2.5-Coder-1.5B-Q8_0-GGUF \
          --port 8012 -ngl 99 -fa -ub 1024 -b 1024 -dt 0.1 \
          --ctx-size 0 --cache-reuse 256
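Once the server is running, it can be sanity-checked before enabling the extension. This is a minimal sketch, assuming the default port 8012 from the commands above: `/health` reports whether the model has finished loading, and `/infill` performs a raw fill-in-the-middle request of the kind the extension issues.

```bash
# Check that the server is up and the model is loaded
# (returns {"status":"ok"} when ready).
curl http://127.0.0.1:8012/health

# Minimal fill-in-the-middle request: generate the code between
# input_prefix (text before the cursor) and input_suffix (text after it).
curl http://127.0.0.1:8012/infill -d '{
  "input_prefix": "def add(a, b):\n    ",
  "input_suffix": "\n\nprint(add(1, 2))\n",
  "n_predict": 32
}'
```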
The plugin requires FIM-compatible models: HF collection
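"FIM-compatible" means the model was trained with fill-in-the-middle special tokens, so it can generate the text between a prefix and a suffix rather than only continuing text. As a rough illustration (assuming the Qwen2.5-Coder models suggested above; other FIM models use different tokens, and the server's `/infill` endpoint assembles this prompt for you), the raw prompt around the cursor looks like:

```
<|fim_prefix|>def add(a, b):
    <|fim_suffix|>

print(add(1, 2))<|fim_middle|>
```

The model then generates the missing middle (here, something like `return a + b`).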
TODO: add examples
The extension aims to be very simple and lightweight while providing high-quality, performant local FIM completions, even on consumer-grade hardware.
- The initial implementation was done by Ivaylo Gardev @igardev
- Initial implementation and technical description: ggml-org/llama.cpp#9787