DeepCoder - rLLM

Read the full write-up

Notion blog post with full details

DeepCoder is a 14B-parameter coding model that achieves 60.6% Pass@1 on LiveCodeBench, an improvement of roughly 8 percentage points over its base model. It matches the performance of o3-mini (Low) and o1 on competitive programming benchmarks.

Results

Model	Parameters	LiveCodeBench Pass@1
DeepCoder	14B	60.6%
o3-mini (Low)	Unknown	~60%
o1	Unknown	~60%
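
For readers unfamiliar with the metric in the table, Pass@1 is commonly computed as the fraction of sampled solutions that pass every test, averaged over problems. The following is a minimal sketch of that calculation, not the LiveCodeBench or rLLM evaluation code; the function name `pass_at_1` and the nested-list input layout are assumptions made for illustration.

```python
from typing import Sequence


def pass_at_1(results: Sequence[Sequence[bool]]) -> float:
    """Estimate Pass@1 from per-sample outcomes.

    results[i][j] is True when sample j for problem i passed all tests.
    (Hypothetical layout, chosen for this sketch.)
    """
    if not results or any(len(samples) == 0 for samples in results):
        raise ValueError("every problem needs at least one sample")
    # Per-problem success rate, then an unweighted mean over problems.
    per_problem = [sum(samples) / len(samples) for samples in results]
    return sum(per_problem) / len(per_problem)


if __name__ == "__main__":
    # Two problems with two samples each: rates 0.5 and 1.0.
    print(pass_at_1([[True, False], [True, True]]))
```

With a single sample per problem this reduces to the plain share of problems solved on the first attempt.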

Approach

DeepCoder was fine-tuned from DeepSeek-R1-Distill-Qwen-14B on coding competition problems using reinforcement learning (RL) with code execution feedback. For an AgentFlow-based reproducer, see the cookbooks/deepcoder cookbook (single-turn coding with hidden-test grading).

Released: April 2025
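
To make "code execution feedback" with hidden-test grading concrete, here is a minimal sketch of a reward function that runs a candidate program against held-out input/output pairs. It is an illustration under stated assumptions, not the rLLM or cookbook implementation: the names `run_program` and `hidden_test_reward`, the stdin/stdout test format, the 5-second timeout, and the all-or-nothing reward are all choices made for this example. It also runs code without sandboxing, which a real training setup would need.

```python
import subprocess
import sys
import tempfile
from pathlib import Path
from typing import Optional, Sequence, Tuple


def run_program(source: str, stdin_text: str, timeout_s: float = 5.0) -> Optional[str]:
    """Run a Python solution on one input; return stdout, or None on failure.

    Hypothetical helper. NOT sandboxed: only use with trusted code.
    """
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "solution.py"
        path.write_text(source)
        try:
            proc = subprocess.run(
                [sys.executable, str(path)],
                input=stdin_text,
                capture_output=True,
                text=True,
                timeout=timeout_s,
            )
        except subprocess.TimeoutExpired:
            return None
    # A non-zero exit code (crash, uncaught exception) counts as a failure.
    return proc.stdout if proc.returncode == 0 else None


def hidden_test_reward(source: str, tests: Sequence[Tuple[str, str]]) -> float:
    """Return 1.0 only if the solution passes every hidden test, else 0.0.

    The binary reward is an assumption of this sketch.
    """
    for stdin_text, expected in tests:
        output = run_program(source, stdin_text)
        if output is None or output.strip() != expected.strip():
            return 0.0
    return 1.0


if __name__ == "__main__":
    candidate = "print(int(input()) * 2)"
    hidden_tests = [("2\n", "4"), ("5\n", "10")]
    print(hidden_test_reward(candidate, hidden_tests))
```

The model never sees the hidden tests; it only receives the scalar reward, which the RL algorithm then uses to update the policy.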
