Introduction
The Neurenix RL module provides a comprehensive framework for reinforcement learning, enabling you to train intelligent agents that learn from interaction with environments. The module implements state-of-the-art algorithms including DQN, PPO, SAC, A2C, and DDPG.Key Features
- Modern Algorithms: DQN, PPO, SAC, A2C, DDPG implementations
- Flexible Policies: Support for discrete and continuous action spaces
- Value Functions: Q-functions, value networks, and advantage functions
- Experience Replay: Efficient memory-based learning
- Multi-Agent Systems: Support for multi-agent reinforcement learning
- Custom Environments: Easy-to-use environment interface
Quick Start
Core Components
Agents
Agents are the learning entities that interact with environments:neurenix/rl/agent.py:18
Environments
Environments define the world in which agents operate:neurenix/rl/environment.py:15
Policies
Policies map states to actions:neurenix/rl/policy.py:174
Value Functions
Value functions estimate the value of states or state-action pairs:neurenix/rl/value.py:101
Training Loop
The standard training loop follows this pattern:neurenix/rl/agent.py:99
Multi-Agent Systems
Support for multiple agents in shared environments:neurenix/rl/agent.py:393
Saving and Loading
Persist trained agents for later use:neurenix/rl/agent.py:189
Next Steps
Policies
Learn about different policy types
Algorithms
Explore RL algorithms
Training
Master training techniques
Algorithms
Explore RL algorithms