math
MathListWiseReward
Bases: BaseHelpfulnessListWiseReward
Math: Solves problems at math, on open-ended human prompts ranging from middle school physics and geometry to college-level chemistry, calculus, combinatorics, and more.
Source code in rm_gallery/gallery/rm/alignment/helpfulness/math.py
18 19 20 21 22 23 24 25 |
|