Paper

arXiv:2509.25052
Cogito, Ergo Ludo combines reasoning and planning to build game-playing agents. It uses rLLM’s RL training pipeline to train agents that reason about game states and develop strategies through self-play.