precise_if
PreciseIFListWiseReward
Bases: BaseHelpfulnessListWiseReward
Precise Instruction Following : Follows precise instructions, such as ‘Answer without the letter u’.
Source code in rm_gallery/gallery/rm/alignment/helpfulness/precise_if.py
20 21 22 23 24 25 26 27 |
|