期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
CLFormer:a cross-lingual transformer framework for temporal forgery localization
1
作者 Haonan Cheng Hanyue Liu +1 位作者 Juanjuan Cai Long Ye 《Visual Intelligence》 2025年第1期183-195,共13页
Temporal forgery localization(TFL)is crucial in deepfake detection.It focuses on identifying subtle temporal manipulations within video content.However,the generalization capabilities of current TFL methods are limite... Temporal forgery localization(TFL)is crucial in deepfake detection.It focuses on identifying subtle temporal manipulations within video content.However,the generalization capabilities of current TFL methods are limited,especially across different languages,which limits their performance in diverse environments.This limitation stems from two key factors.First,most existing datasets are English-centric.Second,there is inadequate learning from multi-modal information,where visual features are often prioritized over audio features.To address this gap,we created the Chinese audio-visual deepfake(CHAV-DF)dataset,which is the first dataset designed for the TFL in the Chinese context.This dataset provides a valuable benchmark for evaluating the TFL methods in cross-lingual settings.Additionally,we introduced a cross-lingual transformer framework(CLFormer),which prioritizes audio features and utilizes a pre-trained multi-lingual Wav2Vec2 to enhance cross-lingual generalization,while incorporating visual features to further refine TFL.Moreover,we incorporated a refinement module into CLFormer to enhance the accuracy of forgery localization.Experiments on the LAV-DF,CHAV-DF,and AV-Deepfake1M datasets demonstrate that CLFormer performs well in both same-language and cross-language settings.Specifically,CLFormer achieves an average precision(AP)of 57.68%at temporal intersection over union(tIoU)of 0.50 when trained on CHAV-DF and tested on LAV-DF,surpassing the state-of-the-art method by 47.59%,and validating its cross-language generalization capability. 展开更多
关键词 Temporal forgery localization(TFL) Cross-lingual Audio feature wav2vec2 Boundary refinement
在线阅读 下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部