Skip to content

Actions: EricLBuehler/mistral.rs

docs

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
1,256 workflow runs
1,256 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Fix chat sampling response (#1154)
docs #1256: Commit 8d89c14 pushed by EricLBuehler
February 19, 2025 17:02 7m 52s master
February 19, 2025 17:02 7m 52s
Fix non-cuda
docs #1255: Commit 71650a4 pushed by EricLBuehler
February 16, 2025 21:11 8m 13s master
February 16, 2025 21:11 8m 13s
Blockwise FP8 CUDA fix for cc < 800 (#1150)
docs #1254: Commit 9ec86d8 pushed by EricLBuehler
February 16, 2025 19:59 8m 3s master
February 16, 2025 19:59 8m 3s
FP8 blockwise dequant CUDA kernel (#1149)
docs #1253: Commit 1b8c077 pushed by EricLBuehler
February 16, 2025 19:44 7m 59s master
February 16, 2025 19:44 7m 59s
February 16, 2025 18:24 7m 45s
February 16, 2025 04:55 8m 3s
Patch check for multi-node at the same time as pipeline parallel
docs #1250: Commit 098776f pushed by EricLBuehler
February 16, 2025 03:59 7m 57s master
February 16, 2025 03:59 7m 57s
Handle HF_HUB_CACHE env var (#1146)
docs #1249: Commit 6a92f70 pushed by EricLBuehler
February 16, 2025 03:29 7m 52s master
February 16, 2025 03:29 7m 52s
Use cudarc 0.13.5 - CUDA 12.8 support (#1145)
docs #1248: Commit 2e66544 pushed by EricLBuehler
February 16, 2025 03:07 11m 32s master
February 16, 2025 03:07 11m 32s
Integrate fused MLP mul-act for more models! (#1144)
docs #1247: Commit e2830b5 pushed by EricLBuehler
February 16, 2025 03:03 7m 44s master
February 16, 2025 03:03 7m 44s
Short-circuit dry sampling (#1143)
docs #1246: Commit 9f4fbd2 pushed by EricLBuehler
February 15, 2025 22:34 7m 48s master
February 15, 2025 22:34 7m 48s
Fuse MLP mul-and-act (#1142)
docs #1245: Commit c65d8f6 pushed by EricLBuehler
February 15, 2025 03:59 8m 49s master
February 15, 2025 03:59 8m 49s
Revamp speculative decoding! (#1027)
docs #1244: Commit dd5aee1 pushed by EricLBuehler
February 15, 2025 03:13 8m 0s master
February 15, 2025 03:13 8m 0s
Remove failing cp command from readme (#1141)
docs #1243: Commit 5e689c9 pushed by EricLBuehler
February 14, 2025 18:00 8m 3s master
February 14, 2025 18:00 8m 3s
Some fixes for Qwen, #1134
docs #1242: Commit 87a7c23 pushed by EricLBuehler
February 13, 2025 22:33 8m 13s master
February 13, 2025 22:33 8m 13s
FIx for llama multi node (#1136)
docs #1241: Commit c9ac321 pushed by EricLBuehler
February 13, 2025 01:25 7m 44s master
February 13, 2025 01:25 7m 44s
Add jinja strftime_now function (#1132)
docs #1240: Commit 323e7cd pushed by EricLBuehler
February 12, 2025 01:37 7m 47s master
February 12, 2025 01:37 7m 47s
Fix mistral 2501 gguf (#1131)
docs #1239: Commit 8dff440 pushed by EricLBuehler
February 12, 2025 00:44 8m 3s master
February 12, 2025 00:44 8m 3s
Add an NCCL feature flag (#1129)
docs #1238: Commit bd5532c pushed by EricLBuehler
February 11, 2025 23:23 8m 13s master
February 11, 2025 23:23 8m 13s
Multi-node support for tensor parallelism (#1125)
docs #1237: Commit 844cdc0 pushed by EricLBuehler
February 11, 2025 22:20 8m 17s master
February 11, 2025 22:20 8m 17s
Fix isq with bias for column parallel (#1128)
docs #1236: Commit 3fb29cc pushed by EricLBuehler
February 9, 2025 03:16 7m 38s master
February 9, 2025 03:16 7m 38s
New file format for imatrix: .cimatrix (#1004)
docs #1235: Commit b99035b pushed by EricLBuehler
February 6, 2025 23:52 7m 38s master
February 6, 2025 23:52 7m 38s
Allow chat streaming to use tools (#1088)
docs #1234: Commit 95265b8 pushed by EricLBuehler
February 4, 2025 12:56 8m 42s master
February 4, 2025 12:56 8m 42s
Bump openssl from 0.10.69 to 0.10.70 (#1121)
docs #1233: Commit c1d06e7 pushed by EricLBuehler
February 3, 2025 19:57 7m 59s master
February 3, 2025 19:57 7m 59s
Tensor parallelism and pipeline parallelism (#1113)
docs #1232: Commit 875a940 pushed by EricLBuehler
February 3, 2025 18:54 7m 51s master
February 3, 2025 18:54 7m 51s