data_juicer.ops.mapper.clean_html_mapper module¶ class data_juicer.ops.mapper.clean_html_mapper.CleanHtmlMapper(*args, **kwargs)[源代码]¶ 基类:Mapper Mapper to clean html code in text samples. __init__(*args, **kwargs)[源代码]¶ Initialization method. 参数: args -- extra args kwargs -- extra args process_batched(samples)[源代码]¶