llama_model_loader: support multiple split/shard GGUFs #9817

Annotations

1 warning

Push Docker image to Docker Hub (server-cuda, .devops/server-cuda.Dockerfile, linux/amd64)

succeeded Mar 22, 2024 in 9m 32s