Documentation Index
Fetch the complete documentation index at: https://docs.rllm-project.com/llms.txt
Use this file to discover all available pages before exploring further.
Read the full write-up
Notion blog post with full details
o3-mini and o1 performance on competitive programming benchmarks.
Results
| Model | Parameters | LiveCodeBench Pass@1 |
|---|---|---|
| DeepCoder | 14B | 60.6% |
| o3-mini (Low) | Unknown | ~60% |
| o1 | Unknown | ~60% |
Approach
Fine-tuned from DeepSeek-R1-Distill-Qwen-14B on coding competition problems using RL with code execution feedback. See thecookbooks/deepcoder cookbook (single-turn coding with hidden-test grading) for the AgentFlow-based reproducer.
Released: April 2025
