trinity.algorithm.advantage_fn.rloo_advantage module
RLOO advantage computation
Ref: https://github.com/volcengine/verl/blob/main/verl/trainer/ppo/core_algos.py
RLOO advantage computation
Ref: https://github.com/volcengine/verl/blob/main/verl/trainer/ppo/core_algos.py