OpenAI API compatible Docker images. We select models and package them in high-performance Docker containers.
Image Name | Parameters | Type | Context Size | Image Size | Quantization | RAM Requirements |
---|---|---|---|---|---|---|
llama-2-7b-chat:1.1.3 | 7b | Llama 2 | 2K | 5GB | Yes | 16GB |
llama-3.2-3b:1.1.3 | 3b | Llama 3.2 | 2K | 5GB | Yes | 16GB |
embeddings:1.1.3 | 7b | bge:small-en-v1.5-q8_0 | 2K | 5GB | Yes | 16GB |
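The images in the table can be pulled directly from the GitHub Container Registry; the registry prefix below is assumed from the run command further down, so adjust it if your setup differs.

```sh
# Pull one of the images listed above (registry prefix assumed from the
# example run command later in this README).
docker pull ghcr.io/bionic-gpt/llama-2-7b-chat:1.1.3
```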
- The API implements batching, so when multiple users make requests at the same time they are served in parallel (see the sketch after this list).
- The Docker image contains the model, so there is no need to download the model in a separate step.
- Images are optimized for different architectures [coming soon]
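To see the parallel serving in action, start one of the containers (see the docker run command below) and fire several requests at once. A minimal sketch:

```sh
# Minimal sketch: send four chat requests concurrently to a running container.
# Batching means these are served in parallel rather than queued one by one.
for i in 1 2 3 4; do
  curl -s http://localhost:3000/v1/chat/completions \
    -H "Content-Type: application/json" \
    -d "{\"model\": \"llama2\", \"messages\": [{\"role\": \"user\", \"content\": \"Request $i: say hello\"}]}" &
done
wait
```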
docker run -it --rm -p 3000:11434 ghcr.io/bionic-gpt/llama-3.2-3b:1.1.3
curl http://localhost:3000/v1/chat/completions -H "Content-Type: application/json" -d '{
"model": "llama2",
"messages": [{"role": "user", "content": "How are you?"}],
"temperature": 0.1
}'
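The embeddings image listed in the table should work the same way. The sketch below assumes the same registry path and port mapping, and that the container exposes the standard OpenAI /v1/embeddings endpoint; the model value is a placeholder.

```sh
# Sketch only: run the embeddings image with the same port mapping,
# then call the OpenAI-style embeddings endpoint.
docker run -it --rm -p 3000:11434 ghcr.io/bionic-gpt/embeddings:1.1.3

curl http://localhost:3000/v1/embeddings -H "Content-Type: application/json" -d '{
  "model": "bge-small-en-v1.5",
  "input": "An example sentence to embed"
}'
```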