cumulative
IterableCumulativePrincipleGenerator
Bases: IterativePrincipleGenerator
Iterative principle generator that combines evaluation, generation, and clustering.
Attributes:
Name | Type | Description |
---|---|---|
reward |
BaseListWisePrincipleReward
|
Reward module for principle-based evaluation |
max_epochs |
int
|
Maximum number of iteration cycles |
Source code in rm_gallery/core/reward/principle/cumulative.py
69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 |
|
run_batch(samples, thread_pool, principles=None)
Executes the iterative principle generation pipeline.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
samples
|
List[DataSample]
|
List of initial data samples |
required |
thread_pool
|
ThreadPoolExecutor
|
Executor for parallel processing |
required |
Returns:
Type | Description |
---|---|
Dict[str, str]
|
Final optimized principles dictionary after iterations |
Source code in rm_gallery/core/reward/principle/cumulative.py
87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 |
|
PrincipleClusterTemplate
Bases: BaseGeneratorTemplate
Template class for clustering and organizing evaluation principles.
Methods:
Name | Description |
---|---|
format |
Formats a prompt for principle clustering and optimization. |
Source code in rm_gallery/core/reward/principle/cumulative.py
13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 |
|
format(examples, scenario, number, principles, **kwargs)
classmethod
Generates a structured prompt for principle clustering analysis.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
examples
|
str
|
Pre-generated example principles for reference |
required |
scenario
|
str
|
Contextual description of the evaluation scenario |
required |
number
|
int
|
Maximum number of clustered principles to generate |
required |
principles
|
Raw principles to be clustered and optimized |
required | |
**kwargs
|
Additional formatting parameters |
{}
|
Returns:
Type | Description |
---|---|
str
|
Formatted prompt string for principle clustering |
Source code in rm_gallery/core/reward/principle/cumulative.py
21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 |
|