data_juicer.ops.mapper.mllm_mapper module¶
- class data_juicer.ops.mapper.mllm_mapper.MllmMapper(hf_model: str = 'llava-hf/llava-v1.6-vicuna-7b-hf', max_new_tokens=256, temperature=0.2, top_p=None, num_beams=1, *args, **kwargs)[源代码]¶
基类:
Mapper
Mapper to use MLLMs for visual question answering tasks. Recommended model list: [
llava-hf/llava-v1.6-vicuna-7b-hf, Qwen/Qwen2-VL-7B-Instruct,
]
- __init__(hf_model: str = 'llava-hf/llava-v1.6-vicuna-7b-hf', max_new_tokens=256, temperature=0.2, top_p=None, num_beams=1, *args, **kwargs)[源代码]¶
Initialization method. :param hf_model: hugginface model id. :param max_new_tokens: the maximum number of new tokens
generated by the model.
- 参数:
temperature -- used to control the randomness of generated text. The higher the temperature, the more random and creative the generated text will be.
top_p -- randomly select the next word from the group of words whose cumulative probability reaches p.
num_beams -- the larger the beam search size, the higher the quality of the generated text.
args -- extra args
kwargs -- extra args