面向多源信息聚类与私有特征学习的情感分析

Multimodal Sentiment Analysis Focusing on Multisource Information Clustering and Private Feature Learning

下载PDF

导出

摘要针对目前情感分析模型的研究大多侧重于文本模态处理,而音频和视频模态的处理相对简单,未能充分挖掘其在增强情感信息方面的潜力,并且存在跨模态特征融合中的信息冗余问题。为此,提出了一种名为面向多源信息聚类与私有特征学习的情感分析的模型。引入隐性聚类的思维,通过跨模态注意力机制优化音视频与文本特征的互补能力,将不同模态的特征划分为若干类簇,以减少无关信息对融合过程的干扰。进一步地,通过特征一致性增强机制使用马氏距离度量方法对音视频模态特征进行增强和过滤,从而提升情感信息密度。与此同时,采用自适应权重调控机制,根据类簇的语义一致性来调节音视频模态的融合权重比例,并结合文本模态来消除模态间的语义歧义。此外,模型还引入自监督学习策略,进一步增强单模态的情感预测能力,帮助模型学到各模态的独特特性。实验结果表明,在CMU-MOSEI和CMU-MOSI数据集上,该模型在情感分类任务中的表现显著提升,验证了其在多模态信息融合和冗余信息抑制方面的有效性。 Current sentiment analysis models often focus on text modality processing,while the handling of audio and video modalities remains relatively simple,failing to fully exploit their potential in enhancing emotional information.Additionally,there is the issue of information redundancy in cross-modal feature fusion.To address these challenges,this paper proposes a sentiment analysis model based on multimodal information clustering and private feature learning.By introducing the concept of latent clustering thinking,the model optimizes the complementarity of audio,vision,and text features through a cross-modal attention mechanism,dividing the features of different modalities into several clusters to reduce the interference of irrelevant information during the fusion process.Furthermore,a feature consistency enhancement mechanism using Mahalanobis distance is employed to enhance and filter audio and video modality features,thereby increasing the density of emotional information.Simultaneously,an adaptive weight adjustment mechanism is applied,which adjusts the fusion weight ratio of the audio and video modalities based on the semantic consistency of the clusters and combines them with the text modality to eliminate semantic ambiguity between modalities.Additionally,the model incorporates a self-supervised learning strategy to further enhance the emotional prediction ability of each modality,helping the model learn the unique characteristics of each modality.Experimental results on the CMU-MOSEI and CMU-MOSI datasets show significant improvements in sentiment classification performance,validating the effectiveness of the model in multimodal information fusion and redundancy suppression.

作者钟婷冯广林健忠杨燕茹周垣桦郑润庭刘天翔 ZHONG Ting;FENG Guang;LIN Jianzhong;YANG Yanru;ZHOU Yuanhua;ZHENG Runting;LIU Tianxiang(School of Automation,Guangdong University of Technology,Guangzhou 510006,China;School of Computer Science,Guangdong University of Technology,Guangzhou 510006,China)

机构地区广东工业大学自动化学院广东工业大学计算机学院

出处《计算机工程与应用》北大核心 2025年第24期176-186,共11页 Computer Engineering and Applications

基金国家自然科学基金重点项目(62237001) 广东省哲学社会科学青年项目(GD23YJY08)。

关键词多模态情感分析注意力机制隐性聚类马氏距离自监督学习 multimodal sentiment analysis attention mechanism latent clustering Mahalanobis distance self-supervised learning

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1刘瑞圣,陈忠玉,孙士强,纪磊.智能变电站SV-GOOSE信息实时转换技术研究[J].电工技术,2025(22):208-210.
2王秀芝.中学英语教学中多模态互动教学模式的探索与应用[J].成长,2025(20):26-28.
3杜佳敏,樊迪,李陈,毕丽雅,杨薇.基于大语言模型的人形机器人头部共情响应对话模型研究[J].智能感知工程,2025,2(4):32-37.
4陈馨,吴子炜,周素茵,夏芳.基于改进YOLOv8n-pose的巨峰葡萄采摘定位方法[J].华南农业大学学报,2026,47(1):118-127.
5黄盼,李丹丹.血小板反应蛋白-1(THBS1)通过Hippo通路促进胃癌转移[J].湖北医药学院学报,2025,44(6):655-662.
6朱纲要.浅议美育在“快乐成长工程”中的实践探索[J].画刊(美育),2025(7):203-205.
7杨江华,杨思宇.青年网络社交圈群特征与政治参与[J].复印报刊资料(青少年导刊),2024(2):34-43.
8王莹.生成式人工智能技术对新媒体文案创意表现力的影响[J].新闻文化建设,2025(21):71-73.
9李洋洋.大牛地气田大12井区含凝析油气井泡排工艺研究[J].石油化工应用,2025,44(11):15-18.
10贺艳芳,周雅芝,唐智彬.百年来中国共产党农民教育探索与当下启示[J].复印报刊资料(成人教育学刊),2024(4):62-69.

计算机工程与应用

2025年第24期

浏览历史

内容加载中请稍等...

面向多源信息聚类与私有特征学习的情感分析

相关作者

相关机构

相关主题

浏览历史