data_juicer.ops.mapper.remove_non_chinese_character_mapper module¶
- class data_juicer.ops.mapper.remove_non_chinese_character_mapper.RemoveNonChineseCharacterlMapper(keep_alphabet: bool = True, keep_number: bool = True, keep_punc: bool = True, *args, **kwargs)[源代码]¶
基类:
Mapper
Mapper to remove non chinese Character in text samples.