An open-source reinforcement learning framework for training LLM-based agents — supporting GRPO, PPO, RLHF, multi-turn reasoning, tool use, and distributed training.
-
Updated
Feb 3, 2026 - Python
An open-source reinforcement learning framework for training LLM-based agents — supporting GRPO, PPO, RLHF, multi-turn reasoning, tool use, and distributed training.
Scalable and extensible reinforcement learning for LM agents.
Train SLM to use Tools with RL
Add a description, image, and links to the agent-rl topic page so that developers can more easily learn about it.
To associate your repository with the agent-rl topic, visit your repo's landing page and select "manage topics."