safety
SafetyListWiseReward
Bases: BaseHarmlessnessListWiseReward
Safety: Comply with or refuse prompts related to harmful use cases as well as general compliance behaviors.
Source code in rm_gallery/gallery/rm/alignment/harmlessness/safety.py
19 20 21 22 23 24 25 26 |
|