Read the full write-up
Notion blog post with full details
Results
| Model | Parameters | AIME Pass@1 |
|---|---|---|
| DeepScaleR | 1.5B | 43.1% |
| O1-Preview | Unknown | 42.0% |
A 1.5B model that surpasses O1-Preview by scaling RL on math reasoning
| Model | Parameters | AIME Pass@1 |
|---|---|---|
| DeepScaleR | 1.5B | 43.1% |
| O1-Preview | Unknown | 42.0% |