trinity.algorithm.advantage_fn.ppo_advantage module
PPO advantage computation using Generalized Advantage Estimation (GAE).
Ref: https://github.com/volcengine/verl/blob/main/verl/trainer/ppo/core_algos.py
- class trinity.algorithm.advantage_fn.ppo_advantage.PPOAdvantageFn(gamma: float = 1.0, lam: float = 1.0)
  Bases: AdvantageFn
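
To illustrate what the `gamma` and `lam` parameters control, here is a minimal sketch of the standard GAE recursion for a single trajectory. It is not the code from this module or from the referenced verl file; the function name `gae_advantages`, the plain-list inputs, and the assumption that the value after the final step is zero are all hypothetical choices made for illustration. The real implementation may work on batched tensors with masking and normalization.

```python
from typing import List, Sequence, Tuple


def gae_advantages(
    rewards: Sequence[float],
    values: Sequence[float],
    gamma: float = 1.0,
    lam: float = 1.0,
) -> Tuple[List[float], List[float]]:
    """Illustrative GAE for one trajectory (hypothetical helper).

    rewards[t] and values[t] are the reward and value estimate at step t.
    Assumption: the value after the last step is 0 (the episode terminates).
    """
    if len(rewards) != len(values):
        raise ValueError("rewards and values must have the same length")

    advantages = [0.0] * len(rewards)
    last_gae = 0.0
    # Walk backwards so each step can reuse the accumulated future advantage.
    for t in reversed(range(len(rewards))):
        next_value = values[t + 1] if t + 1 < len(values) else 0.0
        # One-step TD residual: r_t + gamma * V(s_{t+1}) - V(s_t)
        delta = rewards[t] + gamma * next_value - values[t]
        # GAE recursion: A_t = delta_t + gamma * lam * A_{t+1}
        last_gae = delta + gamma * lam * last_gae
        advantages[t] = last_gae

    # Return targets for the value function: A_t + V(s_t)
    returns = [a + v for a, v in zip(advantages, values)]
    return advantages, returns


if __name__ == "__main__":
    adv, ret = gae_advantages([0.0, 0.0, 1.0], [0.5, 0.5, 0.5])
    print(adv, ret)
```

With the defaults `gamma=1.0` and `lam=1.0`, each advantage reduces to the undiscounted sum of future rewards minus the value estimate at that step. Setting `lam=0.0` instead keeps only the one-step TD residual, which lowers variance at the cost of more bias.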