This guide gets you from zero to running an evaluation and launching RL training using only the rllm CLI — no Python scripts required.

Prerequisites
- rLLM installed (see installation)
- An API key for a model provider (OpenAI, Anthropic, Together, etc.)
Step 1: Configure your model
Run the interactive setup to select a provider and model. The wizard will prompt you to:
- Choose a provider (e.g., OpenAI)
- Enter your API key
- Pick a default model (e.g., gpt-4o)
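The prompts above come from an interactive setup wizard; a minimal sketch of invoking it (the `rllm setup` subcommand name is an assumption, not confirmed by this page — check `rllm --help` for the actual command):

```shell
# Hypothetical: start the interactive provider/model wizard
rllm setup
```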
Your configuration is saved to `~/.rllm/config.json`. You can switch providers later with `rllm model swap`.

Step 2: Explore available datasets
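A sketch of what browsing the catalog might look like (the `rllm data list` subcommand and `--domain` filter are assumptions, not documented on this page):

```shell
# Hypothetical: list every registered benchmark dataset
rllm data list

# Hypothetical: narrow the listing to one domain
rllm data list --domain math
```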
Browse the full catalog of 50+ benchmarks.

Step 3: Run an evaluation
Evaluate your model on a benchmark. A single evaluation command will:
- Auto-pull the dataset from HuggingFace
- Start a local LiteLLM proxy for your configured provider
- Resolve the default agent and evaluator from the catalog
- Run the evaluation with 64 concurrent requests
- Print accuracy, error count, and per-signal metrics
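Under those assumptions, an evaluation run might be invoked as sketched below (the `rllm eval` subcommand, its flags, and the `aime24` dataset name are all assumptions; only the behavior listed in the bullets above comes from this page):

```shell
# Hypothetical: evaluate the configured default model on a benchmark,
# auto-pulling the dataset and starting the LiteLLM proxy
rllm eval --dataset aime24 --concurrency 64
```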
For a quick test run, limit the number of examples:
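For example, a `--limit`-style option might cap the run (the flag name is an assumption):

```shell
# Hypothetical: run only the first 10 examples as a smoke test
rllm eval --dataset aime24 --limit 10
```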
Evaluate with a local model
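Assuming the eval command accepts an OpenAI-compatible base-URL override (the flag names and model name below are assumptions), pointing at a local vLLM or SGLang server might look like:

```shell
# Hypothetical: evaluate against a local OpenAI-compatible server
rllm eval --dataset aime24 \
  --base-url http://localhost:8000/v1 \
  --model qwen2.5-7b-instruct
```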
If you’re running a model server (vLLM, SGLang, etc.), point to it directly.

Step 4: Train with RL
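Launching a training run might look like the sketch below (the `rllm train` subcommand and its flags are assumptions; only "RL training on a benchmark" comes from this page):

```shell
# Hypothetical: start RL training on a benchmark dataset
rllm train --dataset aime24 --model qwen2.5-7b-instruct
```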
Launch reinforcement learning training on a benchmark.

Step 5: Build a custom agent
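Scaffolding might look like the following (the `rllm init` subcommand name and project name are assumptions):

```shell
# Hypothetical: create a new agent project skeleton
rllm init my-agent
cd my-agent
```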
Scaffold a new agent project.

What’s next
- CLI reference: full reference for all commands and flags
- Supported datasets: browse 50+ benchmarks across math, code, QA, VLM, and more
- Unified trainer: dive into the training pipeline and configuration
- SDK overview: use any LLM framework with SDK-based training

