Terminal-Bench-RL

GitHub repository

View source code and documentation

Terminal-Bench-RL is a benchmark and training framework for long-horizon task completion in terminal environments. It provides a suite of tasks for evaluating and training terminal agents using rLLM’s RL pipeline.

Tongyi DeepResearch

PettingLLMs

⌘I