trinity.algorithm.advantage_fn.rloo_advantage module

RLOO (REINFORCE Leave-One-Out) advantage computation.

Ref: https://github.com/volcengine/verl/blob/main/verl/trainer/ppo/core_algos.py
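
As a rough illustration of the leave-one-out idea, the sketch below baselines each response's reward against the mean reward of the other responses sampled for the same prompt. It is not the code in this module or in the referenced file: the function name `rloo_advantages`, its arguments, and the choice to give a lone response an advantage of 0 are all assumptions made for the example.

```python
from typing import Dict, Hashable, List, Sequence


def rloo_advantages(
    rewards: Sequence[float], group_ids: Sequence[Hashable]
) -> List[float]:
    """Hypothetical helper: leave-one-out advantages per prompt group.

    rewards[i] is the scalar reward of response i, and group_ids[i]
    identifies the prompt that response i was sampled for.
    """
    # First pass: accumulate the reward sum and response count per group.
    totals: Dict[Hashable, float] = {}
    counts: Dict[Hashable, int] = {}
    for reward, group in zip(rewards, group_ids):
        totals[group] = totals.get(group, 0.0) + reward
        counts[group] = counts.get(group, 0) + 1

    # Second pass: subtract the mean reward of the *other* group members.
    advantages: List[float] = []
    for reward, group in zip(rewards, group_ids):
        n = counts[group]
        if n > 1:
            baseline = (totals[group] - reward) / (n - 1)
            advantages.append(reward - baseline)
        else:
            # Assumption: with a single response there is no baseline,
            # so the advantage is set to 0.
            advantages.append(0.0)
    return advantages


if __name__ == "__main__":
    # Three responses to prompt "a", one response to prompt "b".
    print(rloo_advantages([1.0, 0.0, 2.0, 5.0], ["a", "a", "a", "b"]))
```

Because each baseline excludes the response being scored, the baseline does not depend on that response's own reward, which is the property that distinguishes this from subtracting the plain group mean.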

class trinity.algorithm.advantage_fn.rloo_advantage.RLOOAdvantageFn

Bases: AdvantageFn

__init__() → None

classmethod default_args() → Dict

Returns:

The default init arguments for the advantage function.

Return type:

Dict