Skip to content

Actions: huggingface/open-r1

Actions

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
559 workflow runs
559 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Add script to decontaminate datasets against benchmark datasets
Tests #184: Pull request #416 synchronize by plaguss
February 24, 2025 14:54 2m 26s decontaminate
February 24, 2025 14:54 2m 26s
add sft recipe (#415)
Tests #183: Commit 5355687 pushed by lewtun
February 24, 2025 14:43 3m 5s main
February 24, 2025 14:43 3m 5s
Add script to decontaminate datasets against benchmark datasets
Tests #182: Pull request #416 opened by plaguss
February 24, 2025 14:31 2m 18s decontaminate
February 24, 2025 14:31 2m 18s
add sft recipe
Tests #181: Pull request #415 opened by eliebak
February 24, 2025 14:16 2m 20s add-config-sft
February 24, 2025 14:16 2m 20s
Start agent traces
Tests #180: Pull request #414 opened by aymeric-roucher
February 24, 2025 14:06 2m 17s agent-traces
February 24, 2025 14:06 2m 17s
pip in /. - Update #969618802
Dependabot Updates #13: by dependabot bot
February 24, 2025 13:58 52s main
February 24, 2025 13:58 52s
Fix reasoning_steps_reward function
Tests #179: Pull request #335 synchronize by rocke2020
February 24, 2025 02:14 Action required rocke2020:main
February 24, 2025 02:14 Action required
Bump Liger kernel (#399)
Tests #178: Commit 3f9d75a pushed by kashif
February 23, 2025 16:44 2m 16s main
February 23, 2025 16:44 2m 16s
add debug and qwen0.5b config file
Tests #177: Pull request #402 opened by GitMonkey0
February 23, 2025 15:59 Action required GitMonkey0:dev
February 23, 2025 15:59 Action required
Fix typo in vLLM MODEL_ARGS
Tests #176: Pull request #401 opened by Maxwell-Jia
February 23, 2025 15:39 Action required Maxwell-Jia:typo
February 23, 2025 15:39 Action required
Bump Liger kernel
Tests #175: Pull request #399 opened by lewtun
February 23, 2025 14:09 2m 33s lewtun-patch-1
February 23, 2025 14:09 2m 33s
feat: make reward functions to support parallel computation
Tests #174: Pull request #398 opened by 0x404
February 23, 2025 09:39 Action required 0x404:zqh/refactor_reward
February 23, 2025 09:39 Action required
Update sft.py
Tests #173: Pull request #397 opened by zhangshan-zs94
February 23, 2025 08:37 Action required zhangshan-zs94:main
February 23, 2025 08:37 Action required
WIP new GRPO dataset and task: formally-verified program correctness
Tests #172: Pull request #379 synchronize by ocramz
February 23, 2025 03:52 Action required unfoldml:feature/htgen-dataset
February 23, 2025 03:52 Action required
WIP "Faster" grpo trainer
Tests #171: Pull request #371 synchronize by edbeeching
February 22, 2025 22:01 2m 23s faster-grpo-trainer
February 22, 2025 22:01 2m 23s
Update prompt template and sampling parameters for evaluation (#392)
Tests #170: Commit eeca246 pushed by lewtun
February 22, 2025 14:21 2m 27s main
February 22, 2025 14:21 2m 27s
Update prompt template and sampling parameters for evaluation
Tests #169: Pull request #392 synchronize by lewtun
February 22, 2025 14:17 2m 23s fix-math
February 22, 2025 14:17 2m 23s
Pin dependencies (#393)
Tests #168: Commit 49d9b74 pushed by lewtun
February 22, 2025 13:46 2m 21s main
February 22, 2025 13:46 2m 21s
Update prompt template and sampling parameters for evaluation
Tests #167: Pull request #392 synchronize by lewtun
February 22, 2025 13:42 2m 34s fix-math
February 22, 2025 13:42 2m 34s
WIP new GRPO dataset and task: formally-verified program correctness
Tests #166: Pull request #379 synchronize by ocramz
February 22, 2025 11:56 Action required unfoldml:feature/htgen-dataset
February 22, 2025 11:56 Action required
Pin training dependencies
Tests #165: Pull request #393 opened by lewtun
February 22, 2025 10:52 2m 21s pin-deps
February 22, 2025 10:52 2m 21s
Update prompt template and sampling parameters for evaluation
Tests #164: Pull request #392 synchronize by lewtun
February 22, 2025 10:17 2m 39s fix-math
February 22, 2025 10:17 2m 39s
Update prompt template and sampling parameters for evaluation
Tests #163: Pull request #392 synchronize by lewtun
February 22, 2025 10:04 2m 31s fix-math
February 22, 2025 10:04 2m 31s
Update prompt template and sampling parameters for evaluation
Tests #162: Pull request #392 opened by lewtun
February 22, 2025 08:34 2m 29s fix-math
February 22, 2025 08:34 2m 29s
WIP new GRPO dataset and task: formally-verified program correctness
Tests #161: Pull request #379 synchronize by ocramz
February 22, 2025 08:18 Action required unfoldml:feature/htgen-dataset
February 22, 2025 08:18 Action required