data_juicer.ops.mapper.video_ffmpeg_wrapped_mapper module¶
- class data_juicer.ops.mapper.video_ffmpeg_wrapped_mapper.VideoFFmpegWrappedMapper(filter_name: str | None = None, filter_kwargs: Dict | None = None, global_args: List[str] | None = None, capture_stderr: bool = True, overwrite_output: bool = True, save_dir: str = None, *args, **kwargs)[source]¶
Bases:
Mapper
Wraps FFmpeg video filters for processing video files in a dataset.
This operator applies a specified FFmpeg video filter to each video file in the dataset. It supports passing keyword arguments to the filter and global arguments to the FFmpeg command line. The processed videos are saved in a specified directory or the same directory as the input files. If no filter name is provided, the videos remain unmodified. The operator updates the source file paths in the dataset to reflect any changes.
- __init__(filter_name: str | None = None, filter_kwargs: Dict | None = None, global_args: List[str] | None = None, capture_stderr: bool = True, overwrite_output: bool = True, save_dir: str = None, *args, **kwargs)[source]¶
Initialization method.
- Parameters:
filter_name – ffmpeg video filter name.
filter_kwargs – keyword-arguments passed to ffmpeg filter.
global_args – list-arguments passed to ffmpeg command-line.
capture_stderr – whether to capture stderr.
overwrite_output – whether to overwrite output file.
save_dir – The directory where generated video files will be stored. If not specified, outputs will be saved in the same directory as their corresponding input files. This path can alternatively be defined by setting the DJ_PRODUCED_DATA_DIR environment variable.
args – extra args
kwargs – extra args