Overview# Welcome to AReaL’s documentation!# Version History Key Milestones Getting Started with AReaL-lite Running GRPO on GSM8K Dataset Tutorial Installation Installation (Ascend NPU) Quickstart OpenAI-Compatible Workflows Agentic Reinforcement Learning Evaluation Fine-tuning Large MoE Models Configurations Best Practices Diagnosing RL Performance Debugging Guide Handling OOM Issues Performance Profiling Customization Dataset Rollout and Agentic RL Training Algorithm Algorithms Asynchronous RL PPO, GRPO, and Related Algorithms Second-Moment Trust Policy Optimization (M2PO) Proximal Log-Probability Approximation