Skip to content

Actions: huggingface/lighteval

Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
1,898 workflow runs
1,898 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

docs: update README to reflect new model evaluation entry points
Tests #2225: Pull request #581 opened by czakop
February 23, 2025 13:15 Action required czakop:update-readme
February 23, 2025 13:15 Action required
Push details without converting fields to str
Tests #2224: Pull request #572 synchronize by NathanHB
February 21, 2025 16:06 37m 58s nathan-fix-details-to-str
February 21, 2025 16:06 37m 58s
Add swiss legal evals as new community tasks
Tests #2223: Pull request #389 synchronize by JoelNiklaus
February 21, 2025 13:27 Action required JoelNiklaus:add_swiss_legal_evals
February 21, 2025 13:27 Action required
Fix attribute and parameter names in loggers
Tests #2222: Pull request #476 synchronize by NathanHB
February 21, 2025 13:22 6h 0m 25s albertvillanova:fix-loggers
February 21, 2025 13:22 6h 0m 25s
Config fixes for VLLMModel
Tests #2221: Pull request #472 synchronize by NathanHB
February 21, 2025 13:21 37m 25s anton-l:vllm_quick_fixes
February 21, 2025 13:21 37m 25s
feat: add JGLUE tasks
Tests #2220: Pull request #469 synchronize by NathanHB
February 21, 2025 13:20 Action required ryan-minato:jglue
February 21, 2025 13:20 Action required
Added custom model inference.
Tests #2219: Pull request #437 synchronize by NathanHB
February 21, 2025 13:15 Action required JoelNiklaus:add-custom-model
February 21, 2025 13:15 Action required
new metrics and pr-fouras dataset add
Tests #2218: Pull request #558 synchronize by NathanHB
February 21, 2025 13:12 Action required BertrandCabotIDRIS:main
February 21, 2025 13:12 Action required
Add draft functionality for a generic sandboxed code running
Tests #2217: Pull request #580 opened by plaguss
February 21, 2025 12:44 38m 28s plaguss:code-run
February 21, 2025 12:44 38m 28s
Fix vLLM generation with sampling params (#578)
Tests #2216: Commit ebb7377 pushed by lewtun
February 21, 2025 10:30 43m 19s main
February 21, 2025 10:30 43m 19s
Fix vLLM generation with sampling params
Tests #2215: Pull request #578 synchronize by lewtun
February 21, 2025 09:51 37m 23s lewtun/fix-lcb
February 21, 2025 09:51 37m 23s
Fix vLLM generation with sampling params
Tests #2214: Pull request #578 synchronize by lewtun
February 21, 2025 09:47 38m 33s lewtun/fix-lcb
February 21, 2025 09:47 38m 33s
Fix vLLM generation with sampling params
Tests #2213: Pull request #578 synchronize by lewtun
February 21, 2025 08:33 38m 56s lewtun/fix-lcb
February 21, 2025 08:33 38m 56s
Fix vLLM generation with sampling params
Tests #2212: Pull request #578 synchronize by lewtun
February 21, 2025 08:15 38m 50s lewtun/fix-lcb
February 21, 2025 08:15 38m 50s
Fix vLLM generation with sampling params
Tests #2211: Pull request #578 synchronize by lewtun
February 20, 2025 21:57 38m 32s lewtun/fix-lcb
February 20, 2025 21:57 38m 32s
Fix vLLM generation with sampling params
Tests #2210: Pull request #578 opened by lewtun
February 20, 2025 21:33 43m 14s lewtun/fix-lcb
February 20, 2025 21:33 43m 14s
Multi node vLLM
Tests #2209: Pull request #530 synchronize by ncassereau
February 20, 2025 09:04 39m 2s ncassereau:multi_node_vllm
February 20, 2025 09:04 39m 2s
Add CodeElo
Tests #2208: Pull request #575 opened by plaguss
February 19, 2025 07:50 38m 28s plaguss:codeelo
February 19, 2025 07:50 38m 28s
Humanity's last exam (#520)
Tests #2207: Commit 782afe8 pushed by NathanHB
February 18, 2025 16:01 38m 54s main
February 18, 2025 16:01 38m 54s
Humanity's last exam
Tests #2206: Pull request #520 synchronize by NathanHB
February 18, 2025 14:52 40m 8s clem_last_exam
February 18, 2025 14:52 40m 8s
Humanity's last exam
Tests #2205: Pull request #520 synchronize by NathanHB
February 18, 2025 14:52 38m 23s clem_last_exam
February 18, 2025 14:52 38m 23s
Humanity's last exam
Tests #2204: Pull request #520 synchronize by NathanHB
February 18, 2025 14:50 39m 34s clem_last_exam
February 18, 2025 14:50 39m 34s
Humanity's last exam
Tests #2203: Pull request #520 synchronize by NathanHB
February 18, 2025 14:22 39m 33s clem_last_exam
February 18, 2025 14:22 39m 33s
Let lighteval support sglang (#552)
Tests #2202: Commit 086cf90 pushed by NathanHB
February 18, 2025 14:13 39m 46s main
February 18, 2025 14:13 39m 46s
Push details without converting fields to str
Tests #2201: Pull request #572 synchronize by NathanHB
February 18, 2025 14:04 6h 0m 26s nathan-fix-details-to-str
February 18, 2025 14:04 6h 0m 26s