An agent that learns to play games by reasoning and planning
Paper: arXiv:2509.25052
Cogito, Ergo Ludo combines reasoning and planning to build game-playing agents. Using rLLM’s RL training pipeline, it trains agents that can think through game states and develop strategies through self-play.
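To make "thinking through game states" and "self-play" concrete, here is a minimal sketch on a toy game. Everything in it is an assumption chosen for illustration: the game (single-pile Nim), the names `can_win`, `plan_move`, and `self_play`, and the use of exhaustive lookahead in place of the model's reasoning. It does not use rLLM's API and leaves out the RL update entirely; it only shows the plan-then-act loop and how self-play yields trajectories with an outcome that a training pipeline could consume.

```python
from functools import lru_cache

MAX_TAKE = 3  # toy rule (assumed): remove 1-3 stones; taking the last stone wins


@lru_cache(maxsize=None)
def can_win(stones: int) -> bool:
    """True if the player to move can force a win from this state."""
    # With 0 stones there are no moves, so any() is False: the mover has lost.
    return any(not can_win(stones - k) for k in range(1, min(MAX_TAKE, stones) + 1))


def plan_move(stones: int) -> int:
    """Look ahead and pick a move that leaves the opponent in a losing state."""
    for k in range(1, min(MAX_TAKE, stones) + 1):
        if not can_win(stones - k):
            return k
    return 1  # no winning move exists; fall back to the smallest legal move


def self_play(stones: int):
    """Play the agent against itself; return (trajectory, winner)."""
    trajectory, player = [], 0
    while stones > 0:
        move = plan_move(stones)
        trajectory.append((player, stones, move))  # (who moved, state, action)
        stones -= move
        if stones == 0:
            return trajectory, player  # the player who took the last stone wins
        player = 1 - player
    raise ValueError("game must start with at least one stone")


if __name__ == "__main__":
    trajectory, winner = self_play(10)
    print(trajectory, "winner:", winner)
```

In a learned agent, the exact search in `plan_move` would be replaced by the model's own reasoning about the state, and the recorded trajectory and winner would serve as the training signal.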