trinity.common.rewards.math_reward module

Math Reward Function Class.

class trinity.common.rewards.math_reward.MathRewardFn(answer_parser=<function simple_answer_parser>, pattern='.*?<think>.*?</think>\\s*<answer>.*?</answer>\\s*$')

Bases: RewardFn

A reward function for math tasks.

DEFAULT_FORMAT_PATTERN = '.*?<think>.*?</think>\\s*<answer>.*?</answer>\\s*$'
DEFAULT_ANSWER_PARSER() -> list[str]
__init__(answer_parser=<function simple_answer_parser>, pattern='.*?<think>.*?</think>\\s*<answer>.*?</answer>\\s*$') -> None
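
The default format pattern can be easier to read with a concrete check. The following is a minimal sketch using only Python's standard `re` module, not the library's own implementation: the helper name `follows_format` is hypothetical, and the use of `re.match` with `re.DOTALL` is an assumption about how such a pattern would typically be applied.

```python
import re

# Regex copied from DEFAULT_FORMAT_PATTERN. The doubled backslashes in the
# rendered signature are repr escaping; the regex itself uses \s.
FORMAT_PATTERN = r".*?<think>.*?</think>\s*<answer>.*?</answer>\s*$"


def follows_format(response: str) -> bool:
    """Hypothetical helper: report whether a response matches the pattern."""
    # Assumption: re.DOTALL lets '.' span newlines inside the tags.
    return re.match(FORMAT_PATTERN, response, re.DOTALL) is not None


# A reasoning block followed by an answer block, with optional whitespace.
good = "<think>2 + 2 = 4</think>\n<answer>4</answer>\n"
# No <think> block at all.
missing_think = "<answer>4</answer>"
# Non-whitespace text after the closing </answer> tag.
trailing_text = "<think>2 + 2 = 4</think><answer>4</answer> extra text"

for sample in (good, missing_think, trailing_text):
    print(follows_format(sample))
```

Under these assumptions, only a response that contains a `<think>...</think>` block followed by an `<answer>...</answer>` block, with nothing but whitespace after the closing tag, satisfies the pattern.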
class trinity.common.rewards.math_reward.MathBoxedRewardFn(**kwargs)

Bases: RewardFn

A reward function for math tasks.

__init__(**kwargs) -> None