video_captioning_from_audio_mapper

Mapper to caption a video according to its audio streams based on Qwen-Audio model.

基于Qwen-Audio模型根据音频流为视频添加字幕的映射器。

Type 算子类型: mapper

Tags 标签: cpu, hf, multimodal

🔧 Parameter Configuration 参数配置

name 参数名

type 类型

default 默认值

desc 说明

keep_original_sample

<class ‘bool’>

True

whether to keep the original sample. If

args

''

extra args

kwargs

''

extra args

📊 Effect demonstration 效果演示

not available 暂无