Skip to content

Add support for nvidia modelopt fp8 kv cache #5106

Add support for nvidia modelopt fp8 kv cache

Add support for nvidia modelopt fp8 kv cache #5106

Triggered via pull request January 30, 2025 22:53
Status Skipped
Total duration 6s
Artifacts

pr-test.yml

on: pull_request
Matrix: unit-test-backend-1-gpu
unit-test-frontend
0s
unit-test-frontend
unit-test-backend-2-gpu
0s
unit-test-backend-2-gpu
performance-test-1-gpu-part-1
0s
performance-test-1-gpu-part-1
performance-test-1-gpu-part-2
0s
performance-test-1-gpu-part-2
performance-test-2-gpu
0s
performance-test-2-gpu
accuracy-test-1-gpu
0s
accuracy-test-1-gpu
accuracy-test-2-gpu
0s
accuracy-test-2-gpu
Fit to window
Zoom out
Zoom in