helpfulness
BaseHelpfulnessListWiseReward
Bases: BaseListWisePrincipleReward
The assistant aims to provide helpful and informative responses to users, responding to their queries with relevant and accurate information.
Source code in rm_gallery/gallery/rm/alignment/base.py
BrainstormingListWiseReward
Bases: BaseHelpfulnessListWiseReward
Brainstorming: Generating text to come up with new ideas or solutions, with an emphasis on creativity and driving thinking.
Source code in rm_gallery/gallery/rm/alignment/helpfulness/brainstorming.py
ChatListWiseReward
Bases: BaseHelpfulnessListWiseReward
Chat: Simulates human conversation and communicates on a variety of topics through text understanding and generation, emphasizing coherence and natural flow of interaction.
Source code in rm_gallery/gallery/rm/alignment/helpfulness/chat.py
ClassificationListWiseReward
Bases: BaseHelpfulnessListWiseReward
Classification: Entails assigning predefined categories or labels to text based on its content.
Source code in rm_gallery/gallery/rm/alignment/helpfulness/classification.py
ClosedQAListWiseReward
Bases: BaseHelpfulnessListWiseReward
Closed QA: Search for direct answers to specific questions in given text sources (i.e. given context, given options).
Source code in rm_gallery/gallery/rm/alignment/helpfulness/closed_qa.py
CodeListWiseReward
Bases: BaseHelpfulnessListWiseReward
Code: Involves generating, understanding, or modifying programming language code within text.
Source code in rm_gallery/gallery/rm/alignment/helpfulness/code.py
FocusListWiseReward
Bases: BaseHelpfulnessListWiseReward
Focus: Detects high-quality, on-topic answers to general user queries.
Source code in rm_gallery/gallery/rm/alignment/helpfulness/focus.py
GenerationListWiseReward
Bases: BaseHelpfulnessListWiseReward
Generation: Creating new textual content, from articles to stories, with an emphasis on originality and creativity.
Source code in rm_gallery/gallery/rm/alignment/helpfulness/generation.py
MathListWiseReward
Bases: BaseHelpfulnessListWiseReward
Math: Solves math problems posed as open-ended human prompts, ranging from middle school physics and geometry to college-level chemistry, calculus, combinatorics, and more.
Source code in rm_gallery/gallery/rm/alignment/helpfulness/math.py
OpenQAListWiseReward
Bases: BaseHelpfulnessListWiseReward
Open QA: Search for answers across a wide range of text sources. The challenge is to process large amounts of information and understand complex questions.
Source code in rm_gallery/gallery/rm/alignment/helpfulness/open_qa.py
PreciseIFListWiseReward
Bases: BaseHelpfulnessListWiseReward
Precise Instruction Following: Follows precise instructions, such as ‘Answer without the letter u’.
Source code in rm_gallery/gallery/rm/alignment/helpfulness/precise_if.py
ReasoningListWiseReward
Bases: BaseHelpfulnessListWiseReward
Reasoning: Involves processing and analyzing text to draw inferences, make predictions, or solve problems, requiring an understanding of underlying concepts and relationships within the text.
Source code in rm_gallery/gallery/rm/alignment/helpfulness/reasoning.py
RewardRegistry
A registry management system for reward modules that maps module names to their corresponding implementation classes.
This class provides a centralized repository for registering and retrieving reward modules by string identifiers. Modules can be registered using decorators and later accessed by their string identifiers.
Attributes:
Name | Type | Description |
---|---|---|
_registry | Dict[str, Type[BaseReward]] | Internal dictionary storing the mapping between reward module names and their classes. |
Source code in rm_gallery/core/reward/registry.py
get(name)
classmethod
Retrieve a registered reward module class by its identifier.
Provides safe access to registered modules without raising errors for missing entries.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
name | str | String identifier of the reward module to retrieve | required |
Returns:
Type | Description |
---|---|
Type[BaseReward] | None | The corresponding BaseReward subclass if found, None otherwise |
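The lookup behaviour described above (return the class if registered, `None` otherwise) can be illustrated with a small stand-alone sketch. It does not import RM-Gallery: `MiniRegistry`, the stand-in `BaseReward`, and `MathReward` are hypothetical names invented for this example, and the use of `dict.get` is an assumption about how such a lookup is typically written, not a copy of the library's source.

```python
from typing import Dict, Optional, Type


class BaseReward:
    """Hypothetical stand-in for the library's BaseReward base class."""


class MathReward(BaseReward):
    """Hypothetical reward class used only for this illustration."""


class MiniRegistry:
    # Mirrors the documented _registry attribute: identifier -> class.
    _registry: Dict[str, Type[BaseReward]] = {"math": MathReward}

    @classmethod
    def get(cls, name: str) -> Optional[Type[BaseReward]]:
        # dict.get returns None for unknown keys instead of raising KeyError.
        return cls._registry.get(name)


found = MiniRegistry.get("math")
missing = MiniRegistry.get("does_not_exist")

# Because a missing entry yields None, callers check before instantiating.
reward = found() if found is not None else None
```

Since a missing identifier does not raise, a typo in the name only surfaces if the caller checks the result for `None`.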
Source code in rm_gallery/core/reward/registry.py
list()
classmethod
Returns:
Type | Description |
---|---|
str | A list of all registered reward modules |
Source code in rm_gallery/core/reward/registry.py
register(name)
classmethod
Create a decorator to register a reward module class with a specified identifier.
The decorator pattern allows classes to be registered while maintaining their original identity.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
name | str | Unique string identifier for the reward module | required |
module | | The BaseReward subclass to be registered | required |
Returns:
Type | Description |
---|---|
A decorator function that registers the module when applied to a class |
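The decorator pattern described above can be sketched as follows. This is a minimal stand-alone illustration rather than RM-Gallery's implementation: `MiniRegistry`, the stand-in `BaseReward`, `ToyReward`, and the identifier `"toy_reward"` are all hypothetical. The sketch also assumes that registering an existing name silently overwrites the earlier entry; the library's actual handling of duplicates is not stated in this reference.

```python
from typing import Callable, Dict, Type


class BaseReward:
    """Hypothetical stand-in for the library's BaseReward base class."""


class MiniRegistry:
    # Mirrors the documented _registry attribute: identifier -> class.
    _registry: Dict[str, Type[BaseReward]] = {}

    @classmethod
    def register(cls, name: str) -> Callable[[Type[BaseReward]], Type[BaseReward]]:
        # register(name) builds and returns the decorator itself.
        def decorator(module: Type[BaseReward]) -> Type[BaseReward]:
            # Assumption: a duplicate name overwrites the previous entry.
            cls._registry[name] = module
            # Return the class unchanged so it keeps its original identity.
            return module

        return decorator


@MiniRegistry.register("toy_reward")
class ToyReward(BaseReward):
    """Hypothetical reward class registered under 'toy_reward'."""
```

In this sketch the `module` argument is the decorated class, which is passed to the returned decorator rather than to `register` itself. Returning `module` unchanged means `ToyReward` can still be imported and used directly, independent of the registry.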
Source code in rm_gallery/core/reward/registry.py
RewriteListWiseReward
Bases: BaseHelpfulnessListWiseReward
Rewrite: The assistant aims to modify existing text to alter its style while preserving the original information and intent.
Source code in rm_gallery/gallery/rm/alignment/helpfulness/rewrite.py
RolePlayingListWiseReward
Bases: BaseHelpfulnessListWiseReward
Role Playing: Entails adopting specific characters or personas within text-based scenarios, engaging in dialogues or actions that reflect the assigned roles.
Source code in rm_gallery/gallery/rm/alignment/helpfulness/role_playing.py
SummarizationListWiseReward
Bases: BaseHelpfulnessListWiseReward
Summarization: Compressing text into a short form that retains the main information, either by extraction (selecting directly from the original text) or production (rewriting the information).
Source code in rm_gallery/gallery/rm/alignment/helpfulness/summarization.py
TranslationListWiseReward
Bases: BaseHelpfulnessListWiseReward
Translation: Converting text from one language to another.
Source code in rm_gallery/gallery/rm/alignment/helpfulness/translation.py