Skip to content

Actions: huggingface/lighteval

Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
413 workflow run results
413 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Fixing some silent bugs in Arabic Custom Tasks
Tests #2273: Pull request #556 synchronize by alielfilali01
March 7, 2025 11:29 Action required alielfilali01:main
March 7, 2025 11:29 Action required
VLLM + Math-Verify fixes (#603)
Tests #2265: Commit d81bafd pushed by clefourrier
March 4, 2025 16:36 50m 46s main
March 4, 2025 16:36 50m 46s
Update links in readme
Tests #2260: Pull request #527 synchronize by NathanHB
March 4, 2025 13:20 -1s jaysonfrancis:main
March 4, 2025 13:20 -1s
Fixing backend error in main_sglang. (#597)
Tests #2258: Commit f2ddc52 pushed by NathanHB
March 4, 2025 12:51 38m 50s main
March 4, 2025 12:51 38m 50s
Add subsets for lcb (#587)
Tests #2247: Commit ed08481 pushed by NathanHB
February 26, 2025 09:52 37m 51s main
February 26, 2025 09:52 37m 51s
Fixing some silent bugs in Arabic Custom Tasks
Tests #2246: Pull request #556 synchronize by alielfilali01
February 26, 2025 08:00 39m 17s alielfilali01:main
February 26, 2025 08:00 39m 17s
Fixing some silent bugs in Arabic Custom Tasks
Tests #2245: Pull request #556 synchronize by alielfilali01
February 26, 2025 07:43 Action required alielfilali01:main
February 26, 2025 07:43 Action required
adds aime24, 25 and math500 (#586)
Tests #2243: Commit 4c9af85 pushed by NathanHB
February 25, 2025 17:06 37m 45s main
February 25, 2025 17:06 37m 45s
docs: update README to reflect new model evaluation entry points (#581)
Tests #2232: Commit 066f84f pushed by NathanHB
February 25, 2025 09:50 38m 27s main
February 25, 2025 09:50 38m 27s
parse seed for vllm (#585)
Tests #2231: Commit 95068aa pushed by NathanHB
February 25, 2025 09:50 37m 48s main
February 25, 2025 09:50 37m 48s
Push details without converting fields to str (#572)
Tests #2230: Commit 7b42113 pushed by NathanHB
February 25, 2025 09:06 37m 56s main
February 25, 2025 09:06 37m 56s
Add turkish and word (#583)
Tests #2227: Commit bd578a8 pushed by clefourrier
February 24, 2025 07:11 38m 35s main
February 24, 2025 07:11 38m 35s
Add Turkish and word
Tests #2226: Pull request #583 opened by bezir
February 23, 2025 19:19 38m 9s bezir:main
February 23, 2025 19:19 38m 9s
new metrics and pr-fouras dataset add
Tests #2218: Pull request #558 synchronize by NathanHB
February 21, 2025 13:12 38m 28s BertrandCabotIDRIS:main
February 21, 2025 13:12 38m 28s
Fix vLLM generation with sampling params (#578)
Tests #2216: Commit ebb7377 pushed by lewtun
February 21, 2025 10:30 43m 19s main
February 21, 2025 10:30 43m 19s
Humanity's last exam (#520)
Tests #2207: Commit 782afe8 pushed by NathanHB
February 18, 2025 16:01 38m 54s main
February 18, 2025 16:01 38m 54s
Let lighteval support sglang (#552)
Tests #2202: Commit 086cf90 pushed by NathanHB
February 18, 2025 14:13 39m 46s main
February 18, 2025 14:13 39m 46s
raise exception when generation size is more than model length (#571)
Tests #2200: Commit bee02f7 pushed by NathanHB
February 18, 2025 14:04 38m 33s main
February 18, 2025 14:04 38m 33s
Add extended task for LiveCodeBench codegeneration (#548)
Tests #2194: Commit fd479ee pushed by NathanHB
February 18, 2025 09:54 38m 26s main
February 18, 2025 09:54 38m 26s
Let lighteval support sglang
Tests #2192: Pull request #552 synchronize by Jayon02
February 18, 2025 02:33 40m 35s Jayon02:main
February 18, 2025 02:33 40m 35s
new metrics and pr-fouras dataset add
Tests #2191: Pull request #558 synchronize by BertrandCabotIDRIS
February 17, 2025 15:01 38m 44s BertrandCabotIDRIS:main
February 17, 2025 15:01 38m 44s
Let lighteval support sglang
Tests #2184: Pull request #552 synchronize by Jayon02
February 16, 2025 01:01 38m 43s Jayon02:main
February 16, 2025 01:01 38m 43s
new metrics and pr-fouras dataset add
Tests #2177: Pull request #558 opened by BertrandCabotIDRIS
February 13, 2025 17:05 38m 32s BertrandCabotIDRIS:main
February 13, 2025 17:05 38m 32s
allows better flexibility for litellm endpoints (#549)
Tests #2175: Commit d6de1fe pushed by NathanHB
February 13, 2025 13:37 38m 14s main
February 13, 2025 13:37 38m 14s
Fixing some silent bugs in Arabic Custom Tasks
Tests #2173: Pull request #556 opened by alielfilali01
February 13, 2025 06:19 38m 10s alielfilali01:main
February 13, 2025 06:19 38m 10s