data_juicer.ops.mapper.clean_links_mapper module
- class data_juicer.ops.mapper.clean_links_mapper.CleanLinksMapper(pattern: str | None = None, repl: str = '', *args, **kwargs)
Bases: Mapper
Mapper that cleans links, such as http, https, and ftp URLs, from text samples.
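As a rough illustration of what a link-cleaning step with a `pattern` and a `repl` argument might do, the sketch below uses only Python's standard `re` module. The function `clean_links` and the regex `ASSUMED_LINK_PATTERN` are hypothetical stand-ins written for this example; they are not the mapper's actual implementation or its default pattern, which may well be stricter.

```python
import re

# Hypothetical pattern for illustration only: matches http://, https://,
# and ftp:// URLs up to the next whitespace character. This is an
# assumption, not the regex that CleanLinksMapper uses by default.
ASSUMED_LINK_PATTERN = r"(?:https?|ftp)://\S+"


def clean_links(text, pattern=None, repl=""):
    """Replace every link matched by `pattern` with `repl`.

    Mirrors the shape of the (pattern, repl) signature above: when
    `pattern` is None, fall back to the assumed default pattern.
    """
    regex = re.compile(pattern if pattern is not None else ASSUMED_LINK_PATTERN)
    return regex.sub(repl, text)


sample = "See https://example.com/docs and ftp://files.example.org/a.txt for details."

# Default behaviour in this sketch: links are replaced with an empty string.
cleaned = clean_links(sample)

# A non-empty `repl` substitutes a placeholder instead of deleting the link.
masked = clean_links(sample, repl="<LINK>")
```

Note that replacing a link with an empty string leaves the surrounding whitespace untouched, so `cleaned` contains doubled spaces where the links used to be; a separate whitespace-normalization step would be needed to tidy that up.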