data_juicer.ops.mapper.clean_copyright_mapper module¶ class data_juicer.ops.mapper.clean_copyright_mapper.CleanCopyrightMapper(*args, **kwargs)[source]¶ Bases: Mapper Mapper to clean copyright comments at the beginning of the text samples. __init__(*args, **kwargs)[source]¶ Initialization method. Parameters: args – extra args kwargs – extra args process_batched(samples)[source]¶