trinity.algorithm.advantage_fn.remax_advantage module
REMAX advantage computation
Ref: https://github.com/volcengine/verl/blob/main/verl/trainer/ppo/core_algos.py
REMAX advantage computation
Ref: https://github.com/volcengine/verl/blob/main/verl/trainer/ppo/core_algos.py