trinity.algorithm.advantage_fn.ppo_advantage module#
PPO’s GAE advantage computation
Ref: volcengine/verl
- class trinity.algorithm.advantage_fn.ppo_advantage.PPOAdvantageFn(gamma: float = 1.0, lam: float = 1.0)[source]#
Bases:
AdvantageFn
PPO’s GAE advantage computation
Ref: volcengine/verl
Bases: AdvantageFn