trinity.algorithm.policy_loss_fn.sft_loss module#
SFT loss function.
- class trinity.algorithm.policy_loss_fn.sft_loss.SFTLossFn(backend: str = 'verl', loss_agg_mode: str = 'token-mean')[源代码]#
基类:
PolicyLossFn- __init__(backend: str = 'verl', loss_agg_mode: str = 'token-mean') None[源代码]#
Initialize the policy loss function.
- 参数:
backend -- The training framework/backend to use (e.g., "verl")
- classmethod default_args()[源代码]#
Get default initialization arguments for this loss function.
- 返回:
The default init arguments for the policy loss function.
- 返回类型:
Dict
- property select_keys#
Returns parameter keys mapped to the specific training framework's naming convention.