trinity.common.rewards.math_reward module

Math Reward Function Class.

class trinity.common.rewards.math_reward.MathRewardFn(answer_parser=<function simple_answer_parser>, pattern='.*?<think>.*?</think>\\s*<answer>.*?</answer>\\s*$')

Bases: RewardFn

A reward function for math tasks.

DEFAULT_FORMAT_PATTERN = '.*?<think>.*?</think>\\s*<answer>.*?</answer>\\s*$'
DEFAULT_ANSWER_PARSER() → str
__init__(answer_parser=<function simple_answer_parser>, pattern='.*?<think>.*?</think>\\s*<answer>.*?</answer>\\s*$') → None
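
The default format pattern expects a `<think>...</think>` block followed by an `<answer>...</answer>` block, with only whitespace allowed between them and after the closing tag. The following is a minimal sketch of how such a pattern behaves, using only the standard `re` module. The helper name `follows_format`, the use of `re.match`, and the `re.DOTALL` flag are assumptions made for illustration; this page does not show how MathRewardFn applies the pattern internally.

```python
import re

# Pattern copied from DEFAULT_FORMAT_PATTERN; the doubled backslashes in the
# listing are string-literal escapes, so the regex itself uses \s.
FORMAT_PATTERN = r".*?<think>.*?</think>\s*<answer>.*?</answer>\s*$"


def follows_format(response: str) -> bool:
    """Hypothetical helper: report whether a response matches the pattern.

    Assumption: re.DOTALL is used so that '.' also matches newlines inside
    the think/answer blocks. The flags actually used by MathRewardFn are
    not documented here.
    """
    return re.match(FORMAT_PATTERN, response, flags=re.DOTALL) is not None


# A response with both blocks, in order, and nothing after </answer>.
good = "<think>2 + 2 = 4</think>\n<answer>4</answer>\n"
# A response that omits the <think> block.
no_think = "<answer>4</answer>"
# A response with extra text after the closing </answer> tag.
trailing = "<think>2 + 2 = 4</think><answer>4</answer> extra text"

print(follows_format(good))
print(follows_format(no_think))
print(follows_format(trailing))
```

Under these assumptions, only the first response satisfies the pattern: the second lacks the `<think>` block, and the third is rejected because the trailing `\s*$` permits nothing but whitespace after `</answer>`.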
class trinity.common.rewards.math_reward.MathBoxedRewardFn(**kwargs)

Bases: RewardFn

A reward function for math tasks.

__init__(**kwargs) → None