trinity.common.rewards.math_reward module
Math Reward Function Class.
- class trinity.common.rewards.math_reward.MathRewardFn(answer_parser=<function simple_answer_parser>, pattern='.*?<think>.*?</think>\\s*<answer>.*?</answer>\\s*$')
Bases: RewardFn
A reward function for math tasks.
- DEFAULT_FORMAT_PATTERN = '.*?<think>.*?</think>\\s*<answer>.*?</answer>\\s*$'
- DEFAULT_ANSWER_PARSER() → str
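To illustrate what the default format pattern accepts, here is a minimal sketch that applies the same regular expression with Python's standard `re` module. The helper name `follows_format` and the use of `re.DOTALL` are assumptions made for this illustration; this page does not state which flags or matching call `MathRewardFn` uses internally.

```python
import re

# Copied from DEFAULT_FORMAT_PATTERN. The doubled backslashes shown in the
# signature are repr escaping, so the actual regex uses a single "\s".
FORMAT_PATTERN = r".*?<think>.*?</think>\s*<answer>.*?</answer>\s*$"


def follows_format(response: str) -> bool:
    """Hypothetical helper: check a response against the format pattern.

    Assumption: re.DOTALL is used so that "." also matches newlines inside
    the <think> and <answer> blocks. The library may use different flags.
    """
    return re.match(FORMAT_PATTERN, response, re.DOTALL) is not None


well_formed = "<think>2 + 2 = 4</think>\n<answer>4</answer>"
missing_tags = "The answer is 4."
trailing_text = "<think>2 + 2 = 4</think><answer>4</answer> extra"

print(follows_format(well_formed))
print(follows_format(missing_tags))
print(follows_format(trailing_text))
```

Under these assumptions, only the first response matches: the pattern requires a `<think>` block followed by an `<answer>` block, with nothing but whitespace after the closing `</answer>` tag.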