An open-source reinforcement learning framework for training LLM-based agents — supporting GRPO, PPO, RLHF, multi-turn reasoning, tool use, and distributed training.
-
Updated
Feb 3, 2026 - Python
An open-source reinforcement learning framework for training LLM-based agents — supporting GRPO, PPO, RLHF, multi-turn reasoning, tool use, and distributed training.
This project aims to present an automated method for evaluating Brazilian companies listed on the B3 stock exchange based on the analysis of their balance sheets, income statements, and cash flow statements.
Add a description, image, and links to the entropy-method topic page so that developers can more easily learn about it.
To associate your repository with the entropy-method topic, visit your repo's landing page and select "manage topics."