trinity.common.rewards.math_reward module

Math Reward Function Class.

class trinity.common.rewards.math_reward.MathRewardFn(answer_parser=<function simple_answer_parser>, pattern='.*?<think>.*?</think>\\s*<answer>.*?</answer>\\s*$')[source]

Bases: RewardFn

A reward function that rewards for math task.

DEFAULT_FORMAT_PATTERN = '.*?<think>.*?</think>\\s*<answer>.*?</answer>\\s*$'
DEFAULT_ANSWER_PARSER() str
__init__(answer_parser=<function simple_answer_parser>, pattern='.*?<think>.*?</think>\\s*<answer>.*?</answer>\\s*$') None[source]
class trinity.common.rewards.math_reward.MathBoxedRewardFn(**kwargs)[source]

Bases: RewardFn

A reward function that rewards for math task.

__init__(**kwargs) None[source]