trinity.common.workflows.rubric_judge_workflow module#

A workflow with LLM-as-a-judge.

class trinity.common.workflows.rubric_judge_workflow.RubricJudgeWorkflow(*, task: Task, model: ModelWrapper, auxiliary_models: List[ModelWrapper] | None = None)[source]#

Bases: SimpleWorkflow

A workflow using LLM-as-a-judge and rubrics to get the reward.

Adapted from https://arxiv.org/pdf/2507.17746

__init__(*, task: Task, model: ModelWrapper, auxiliary_models: List[ModelWrapper] | None = None)[source]#
reset(task: Task)[source]#

Modified from SimpleWorkflow.reset

run() → List[Experience][source]#

Modified from SimpleWorkflow.run

get_judge_reward(response: str, judger: OpenAI) → Tuple[bool, float][source]#

Get rewards with LLM-as-a-judge. The prompts are adapted from the RAR-IMPLICIT method in https://arxiv.org/pdf/2507.17746
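
The following is a minimal, illustrative sketch of how a rubric-based judge reward of this shape might be computed. It is not the Trinity implementation. It assumes that the judge sees all rubric items in a single prompt and returns one holistic rating, and that the Tuple[bool, float] result means (rating parsed successfully, normalized reward). The names build_judge_prompt, parse_judge_reward, judge_reward, and call_judge are hypothetical, the prompt wording is a placeholder rather than the paper's prompt, and the judge is a stub callable instead of a real OpenAI client.

```python
import re
from typing import Callable, List, Tuple


def build_judge_prompt(question: str, response: str, rubric: List[str]) -> str:
    """Assemble one prompt that shows the judge every rubric item at once."""
    rubric_text = "\n".join(f"- {item}" for item in rubric)
    return (
        "Rate the response against the rubric on a scale of 1 to 10.\n\n"
        f"Question:\n{question}\n\n"
        f"Response:\n{response}\n\n"
        f"Rubric:\n{rubric_text}\n\n"
        "Answer with 'Rating: <number>'."
    )


def parse_judge_reward(judge_output: str, max_rating: int = 10) -> Tuple[bool, float]:
    """Extract the rating and scale it to (0, 1].

    Assumption: the first element flags whether a valid rating was found.
    """
    match = re.search(r"Rating:\s*(\d+)", judge_output)
    if match is None:
        return False, 0.0
    rating = int(match.group(1))
    if not 1 <= rating <= max_rating:
        return False, 0.0
    return True, rating / max_rating


def judge_reward(
    question: str,
    response: str,
    rubric: List[str],
    call_judge: Callable[[str], str],  # hypothetical: sends a prompt, returns the judge's text
) -> Tuple[bool, float]:
    """Build the prompt, query the judge, and parse its rating."""
    prompt = build_judge_prompt(question, response, rubric)
    return parse_judge_reward(call_judge(prompt))


def stub_judge(prompt: str) -> str:
    """Stand-in for a real judge model; always answers with the same rating."""
    return "Rating: 8"


ok, reward = judge_reward(
    question="What is 2 + 2?",
    response="4",
    rubric=["States the correct sum", "Is concise"],
    call_judge=stub_judge,
)
```

In a real workflow the stub would be replaced by a call to the judge model passed as judger. Returning a flag alongside the reward lets the caller distinguish an unparseable judge reply from a genuinely low score.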