期刊文献+
共找到206篇文章
< 1 2 11 >
每页显示 20 50 100
Text-Image Feature Fine-Grained Learning for Joint Multimodal Aspect-Based Sentiment Analysis
1
作者 Tianzhi Zhang Gang Zhou +4 位作者 Shuang Zhang Shunhang Li Yepeng Sun Qiankun Pi Shuo Liu 《Computers, Materials & Continua》 SCIE EI 2025年第1期279-305,共27页
Joint Multimodal Aspect-based Sentiment Analysis(JMASA)is a significant task in the research of multimodal fine-grained sentiment analysis,which combines two subtasks:Multimodal Aspect Term Extraction(MATE)and Multimo... Joint Multimodal Aspect-based Sentiment Analysis(JMASA)is a significant task in the research of multimodal fine-grained sentiment analysis,which combines two subtasks:Multimodal Aspect Term Extraction(MATE)and Multimodal Aspect-oriented Sentiment Classification(MASC).Currently,most existing models for JMASA only perform text and image feature encoding from a basic level,but often neglect the in-depth analysis of unimodal intrinsic features,which may lead to the low accuracy of aspect term extraction and the poor ability of sentiment prediction due to the insufficient learning of intra-modal features.Given this problem,we propose a Text-Image Feature Fine-grained Learning(TIFFL)model for JMASA.First,we construct an enhanced adjacency matrix of word dependencies and adopt graph convolutional network to learn the syntactic structure features for text,which addresses the context interference problem of identifying different aspect terms.Then,the adjective-noun pairs extracted from image are introduced to enable the semantic representation of visual features more intuitive,which addresses the ambiguous semantic extraction problem during image feature learning.Thereby,the model performance of aspect term extraction and sentiment polarity prediction can be further optimized and enhanced.Experiments on two Twitter benchmark datasets demonstrate that TIFFL achieves competitive results for JMASA,MATE and MASC,thus validating the effectiveness of our proposed methods. 展开更多
关键词 multimodal sentiment analysis aspect-based sentiment analysis feature fine-grained learning graph convolutional network adjective-noun pairs
在线阅读 下载PDF
A review of multimodal aspect-based sentiment analysis
2
作者 Tian’ang Chen 《Advances in Engineering Innovation》 2025年第7期43-51,共9页
In the era of digital communication,the exponential growth of user-generated content across social media and online platforms has intensified the demand for effective emotion analysis tools.Traditional text-based sent... In the era of digital communication,the exponential growth of user-generated content across social media and online platforms has intensified the demand for effective emotion analysis tools.Traditional text-based sentiment analysis methods,however,often fall short in accurately capturing the nuances of human emotions due to their reliance on a single modality.Motivated by the need for more comprehensive and context-aware emotion recognition,this study systematically reviews the literature on both unimodal and multimodal aspect-level sentiment analysis.By comparing different approaches within the multimodal domain,we identify existing challenges and emerging trends in this research area.Our findings highlight the potential of integrating multiple modalities—such as text,images,and audio—to enhance the precision of sentiment detection and suggest future directions for advancing multimodal sentiment analysis. 展开更多
关键词 aspect-based sentiment analysis multimodal aspect-based sentiment analysis Large Language Models(LLMs)
在线阅读 下载PDF
A Multimodal Sentiment Analysis Method Based on Multi-Granularity Guided Fusion
3
作者 Zilin Zhang Yan Liu +3 位作者 Jia Liu Senbao Hou Yuping Zhang Chenyuan Wang 《Computers, Materials & Continua》 2026年第2期1228-1241,共14页
With the growing demand formore comprehensive and nuanced sentiment understanding,Multimodal Sentiment Analysis(MSA)has gained significant traction in recent years and continues to attract widespread attention in the ... With the growing demand formore comprehensive and nuanced sentiment understanding,Multimodal Sentiment Analysis(MSA)has gained significant traction in recent years and continues to attract widespread attention in the academic community.Despite notable advances,existing approaches still face critical challenges in both information modeling and modality fusion.On one hand,many current methods rely heavily on encoders to extract global features from each modality,which limits their ability to capture latent fine-grained emotional cues within modalities.On the other hand,prevailing fusion strategies often lack mechanisms to model semantic discrepancies across modalities and to adaptively regulate modality interactions.To address these limitations,we propose a novel framework for MSA,termed Multi-Granularity Guided Fusion(MGGF).The proposed framework consists of three core components:(i)Multi-Granularity Feature Extraction Module,which simultaneously captures both global and local emotional features within each modality,and integrates them to construct richer intra-modal representations;(ii)Cross-ModalGuidance Learning Module(CMGL),which introduces a cross-modal scoring mechanism to quantify the divergence and complementarity betweenmodalities.These scores are then used as guiding signals to enable the fusion strategy to adaptively respond to scenarios of modality agreement or conflict;(iii)Cross-Modal Fusion Module(CMF),which learns the semantic dependencies among modalities and facilitates deep-level emotional feature interaction,thereby enhancing sentiment prediction with complementary information.We evaluate MGGF on two benchmark datasets:MVSA-Single and MVSA-Multiple.Experimental results demonstrate that MGGF outperforms the current state-of-the-art model CLMLF on MVSA-Single by achieving a 2.32% improvement in F1 score.On MVSA-Multiple,it surpasses MGNNS with a 0.26% increase in accuracy.These results substantiate the effectiveness ofMGGFin addressing two major limitations of existing methods—insufficient intra-modal fine-grained sentiment modeling and inadequate cross-modal semantic fusion. 展开更多
关键词 multimodal sentiment analysis cross-modal fusion cross-modal guided learning
在线阅读 下载PDF
Enhanced Multimodal Sentiment Analysis via Integrated Spatial Position Encoding and Fusion Embedding
4
作者 Chenquan Gan Xu Liu +3 位作者 Yu Tang Xianrong Yu Qingyi Zhu Deepak Kumar Jain 《Computers, Materials & Continua》 2025年第12期5399-5421,共23页
Multimodal sentiment analysis aims to understand emotions from text,speech,and video data.However,current methods often overlook the dominant role of text and suffer from feature loss during integration.Given the vary... Multimodal sentiment analysis aims to understand emotions from text,speech,and video data.However,current methods often overlook the dominant role of text and suffer from feature loss during integration.Given the varying importance of each modality across different contexts,a central and pressing challenge in multimodal sentiment analysis lies in maximizing the use of rich intra-modal features while minimizing information loss during the fusion process.In response to these critical limitations,we propose a novel framework that integrates spatial position encoding and fusion embedding modules to address these issues.In our model,text is treated as the core modality,while speech and video features are selectively incorporated through a unique position-aware fusion process.The spatial position encoding strategy preserves the internal structural information of speech and visual modalities,enabling the model to capture localized intra-modal dependencies that are often overlooked.This design enhances the richness and discriminative power of the fused representation,enabling more accurate and context-aware sentiment prediction.Finally,we conduct comprehensive evaluations on two widely recognized standard datasets in the field—CMU-MOSI and CMU-MOSEI to validate the performance of the proposed model.The experimental results demonstrate that our model exhibits good performance and effectiveness for sentiment analysis tasks. 展开更多
关键词 multimodal sentiment analysis spatial position encoding fusion embedding feature loss reduction
在线阅读 下载PDF
Multimodal sentiment analysis for social media contents during public emergencies 被引量:1
5
作者 Tao Fan Hao Wang +2 位作者 Peng Wu Chen Ling Milad Taleby Ahvanooey 《Journal of Data and Information Science》 CSCD 2023年第3期61-87,共27页
Purpose:Nowadays,public opinions during public emergencies involve not only textual contents but also contain images.However,the existing works mainly focus on textual contents and they do not provide a satisfactory a... Purpose:Nowadays,public opinions during public emergencies involve not only textual contents but also contain images.However,the existing works mainly focus on textual contents and they do not provide a satisfactory accuracy of sentiment analysis,lacking the combination of multimodal contents.In this paper,we propose to combine texts and images generated in the social media to perform sentiment analysis.Design/methodology/approach:We propose a Deep Multimodal Fusion Model(DMFM),which combines textual and visual sentiment analysis.We first train word2vec model on a large-scale public emergency corpus to obtain semantic-rich word vectors as the input of textual sentiment analysis.BiLSTM is employed to generate encoded textual embeddings.To fully excavate visual information from images,a modified pretrained VGG16-based sentiment analysis network is used with the best-performed fine-tuning strategy.A multimodal fusion method is implemented to fuse textual and visual embeddings completely,producing predicted labels.Findings:We performed extensive experiments on Weibo and Twitter public emergency datasets,to evaluate the performance of our proposed model.Experimental results demonstrate that the DMFM provides higher accuracy compared with baseline models.The introduction of images can boost the performance of sentiment analysis during public emergencies.Research limitations:In the future,we will test our model in a wider dataset.We will also consider a better way to learn the multimodal fusion information.Practical implications:We build an efficient multimodal sentiment analysis model for the social media contents during public emergencies.Originality/value:We consider the images posted by online users during public emergencies on social platforms.The proposed method can present a novel scope for sentiment analysis during public emergencies and provide the decision support for the government when formulating policies in public emergencies. 展开更多
关键词 Public emergency multimodal sentiment analysis Social platform Textual sentiment analysis Visual sentiment analysis
在线阅读 下载PDF
A Robust Framework for Multimodal Sentiment Analysis with Noisy Labels Generated from Distributed Data Annotation 被引量:1
6
作者 Kai Jiang Bin Cao Jing Fan 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第6期2965-2984,共20页
Multimodal sentiment analysis utilizes multimodal data such as text,facial expressions and voice to detect people’s attitudes.With the advent of distributed data collection and annotation,we can easily obtain and sha... Multimodal sentiment analysis utilizes multimodal data such as text,facial expressions and voice to detect people’s attitudes.With the advent of distributed data collection and annotation,we can easily obtain and share such multimodal data.However,due to professional discrepancies among annotators and lax quality control,noisy labels might be introduced.Recent research suggests that deep neural networks(DNNs)will overfit noisy labels,leading to the poor performance of the DNNs.To address this challenging problem,we present a Multimodal Robust Meta Learning framework(MRML)for multimodal sentiment analysis to resist noisy labels and correlate distinct modalities simultaneously.Specifically,we propose a two-layer fusion net to deeply fuse different modalities and improve the quality of the multimodal data features for label correction and network training.Besides,a multiple meta-learner(label corrector)strategy is proposed to enhance the label correction approach and prevent models from overfitting to noisy labels.We conducted experiments on three popular multimodal datasets to verify the superiority of ourmethod by comparing it with four baselines. 展开更多
关键词 Distributed data collection multimodal sentiment analysis meta learning learn with noisy labels
在线阅读 下载PDF
GLAMSNet:A Gated-Linear Aspect-Aware Multimodal Sentiment Network with Alignment Supervision and External Knowledge Guidance
7
作者 Dan Wang Zhoubin Li +1 位作者 Yuze Xia Zhenhua Yu 《Computers, Materials & Continua》 2025年第12期5823-5845,共23页
Multimodal Aspect-Based Sentiment Analysis(MABSA)aims to detect sentiment polarity toward specific aspects by leveraging both textual and visual inputs.However,existing models suffer from weak aspectimage alignment,mo... Multimodal Aspect-Based Sentiment Analysis(MABSA)aims to detect sentiment polarity toward specific aspects by leveraging both textual and visual inputs.However,existing models suffer from weak aspectimage alignment,modality imbalance dominated by textual signals,and limited reasoning for implicit or ambiguous sentiments requiring external knowledge.To address these issues,we propose a unified framework named Gated-Linear Aspect-Aware Multimodal Sentiment Network(GLAMSNet).First of all,an input encoding module is employed to construct modality-specific and aspect-aware representations.Subsequently,we introduce an image–aspect correlation matching module to provide hierarchical supervision for visual-textual alignment.Building upon these components,we further design a Gated-Linear Aspect-Aware Fusion(GLAF)module to enhance aspect-aware representation learning by adaptively filtering irrelevant textual information and refining semantic alignment under aspect guidance.Additionally,an External Language Model Knowledge-Guided mechanism is integrated to incorporate sentimentaware prior knowledge from GPT-4o,enabling robust semantic reasoning especially under noisy or ambiguous inputs.Experimental studies conducted based on Twitter-15 and Twitter-17 datasets demonstrate that the proposed model outperforms most state-of-the-art methods,achieving 79.36%accuracy and 74.72%F1-score,and 74.31%accuracy and 72.01%F1-score,respectively. 展开更多
关键词 sentiment analysis multimodal aspect-based sentiment analysis cross-modal alignment multimodal sentiment classification large language model
在线阅读 下载PDF
Leveraging Vision-Language Pre-Trained Model and Contrastive Learning for Enhanced Multimodal Sentiment Analysis
8
作者 Jieyu An Wan Mohd Nazmee Wan Zainon Binfen Ding 《Intelligent Automation & Soft Computing》 SCIE 2023年第8期1673-1689,共17页
Multimodal sentiment analysis is an essential area of research in artificial intelligence that combines multiple modes,such as text and image,to accurately assess sentiment.However,conventional approaches that rely on... Multimodal sentiment analysis is an essential area of research in artificial intelligence that combines multiple modes,such as text and image,to accurately assess sentiment.However,conventional approaches that rely on unimodal pre-trained models for feature extraction from each modality often overlook the intrinsic connections of semantic information between modalities.This limitation is attributed to their training on unimodal data,and necessitates the use of complex fusion mechanisms for sentiment analysis.In this study,we present a novel approach that combines a vision-language pre-trained model with a proposed multimodal contrastive learning method.Our approach harnesses the power of transfer learning by utilizing a vision-language pre-trained model to extract both visual and textual representations in a unified framework.We employ a Transformer architecture to integrate these representations,thereby enabling the capture of rich semantic infor-mation in image-text pairs.To further enhance the representation learning of these pairs,we introduce our proposed multimodal contrastive learning method,which leads to improved performance in sentiment analysis tasks.Our approach is evaluated through extensive experiments on two publicly accessible datasets,where we demonstrate its effectiveness.We achieve a significant improvement in sentiment analysis accuracy,indicating the supe-riority of our approach over existing techniques.These results highlight the potential of multimodal sentiment analysis and underscore the importance of considering the intrinsic semantic connections between modalities for accurate sentiment assessment. 展开更多
关键词 multimodal sentiment analysis vision–language pre-trained model contrastive learning sentiment classification
在线阅读 下载PDF
Improving Targeted Multimodal Sentiment Classification with Semantic Description of Images 被引量:1
9
作者 Jieyu An Wan Mohd Nazmee Wan Zainon Zhang Hao 《Computers, Materials & Continua》 SCIE EI 2023年第6期5801-5815,共15页
Targeted multimodal sentiment classification(TMSC)aims to identify the sentiment polarity of a target mentioned in a multimodal post.The majority of current studies on this task focus on mapping the image and the text... Targeted multimodal sentiment classification(TMSC)aims to identify the sentiment polarity of a target mentioned in a multimodal post.The majority of current studies on this task focus on mapping the image and the text to a high-dimensional space in order to obtain and fuse implicit representations,ignoring the rich semantic information contained in the images and not taking into account the contribution of the visual modality in the multimodal fusion representation,which can potentially influence the results of TMSC tasks.This paper proposes a general model for Improving Targeted Multimodal Sentiment Classification with Semantic Description of Images(ITMSC)as a way to tackle these issues and improve the accu-racy of multimodal sentiment analysis.Specifically,the ITMSC model can automatically adjust the contribution of images in the fusion representation through the exploitation of semantic descriptions of images and text similarity relations.Further,we propose a target-based attention module to capture the target-text relevance,an image-based attention module to capture the image-text relevance,and a target-image matching module based on the former two modules to properly align the target with the image so that fine-grained semantic information can be extracted.Our experimental results demonstrate that our model achieves comparable performance with several state-of-the-art approaches on two multimodal sentiment datasets.Our findings indicate that incorporating semantic descriptions of images can enhance our understanding of multimodal content and lead to improved sentiment analysis performance. 展开更多
关键词 Targeted sentiment analysis multimodal sentiment classification visual sentiment textual sentiment social media
在线阅读 下载PDF
End-to-end aspect category sentiment analysis based on type graph convolutional networks
10
作者 邵清 ZHANG Wenshuang WANG Shaojun 《High Technology Letters》 EI CAS 2023年第3期325-334,共10页
For the existing aspect category sentiment analysis research,most of the aspects are given for sentiment extraction,and this pipeline method is prone to error accumulation,and the use of graph convolutional neural net... For the existing aspect category sentiment analysis research,most of the aspects are given for sentiment extraction,and this pipeline method is prone to error accumulation,and the use of graph convolutional neural network for aspect category sentiment analysis does not fully utilize the dependency type information between words,so it cannot enhance feature extraction.This paper proposes an end-to-end aspect category sentiment analysis(ETESA)model based on type graph convolutional networks.The model uses the bidirectional encoder representation from transformers(BERT)pretraining model to obtain aspect categories and word vectors containing contextual dynamic semantic information,which can solve the problem of polysemy;when using graph convolutional network(GCN)for feature extraction,the fusion operation of word vectors and initialization tensor of dependency types can obtain the importance values of different dependency types and enhance the text feature representation;by transforming aspect category and sentiment pair extraction into multiple single-label classification problems,aspect category and sentiment can be extracted simultaneously in an end-to-end way and solve the problem of error accumulation.Experiments are tested on three public datasets,and the results show that the ETESA model can achieve higher Precision,Recall and F1 value,proving the effectiveness of the model. 展开更多
关键词 aspect-based sentiment analysis(ABSA) bidirectional encoder representation from transformers(BERT) type graph convolutional network(TGCN) aspect category and senti-ment pair extraction
在线阅读 下载PDF
Multimodal sentiment analysis based on contrastive learning and cross-modal guided fusion
11
作者 Liu Huanqi Sun Jingtao +1 位作者 Zhang Fengling Hou Wenyan 《The Journal of China Universities of Posts and Telecommunications》 2025年第4期18-33,共16页
Multimodal sentiment analysis,which integrates text,speech,and image modalities,has emerged as a prominent research direction in artificial intelligence for precise emotion assessment.However,current techniques experi... Multimodal sentiment analysis,which integrates text,speech,and image modalities,has emerged as a prominent research direction in artificial intelligence for precise emotion assessment.However,current techniques experience difficulties in efficiently managing redundancy and inconsistency across features from different modalities,compromising sentiment analysis accuracy.Additionally,while the analysis of intraclass emotional features has garnered substantial attention,studies of interclass relationships have been neglected.To address these challenges,a multimodal sentiment analysis method based on contrastive learning and cross-modal guided fusion(CLCGF)is proposed.This method encodes text and images to derive latent representations and employs a cross-modal guided module with sparse attention mechanisms to effectively integrate textual and visual features,thereby mitigating redundancy issues within each modality's features.In addition to the sentiment classification task,a supervised contrastive learning task is incorporated to aid the model in learning effective features from multimodal data related to emotions.To assess the efficacy of the CLCGF method,experiments were conducted on three public datasets:MVSA-Single,MVSA-Multiple and HFM.The experimental results indicate that CLCGF significantly improves sentiment analysis accuracy compared with traditional methods. 展开更多
关键词 multimodal sentiment analysis sparse attention contrastive learning
原文传递
用于多模态方面级情感分析的多通道信息增强网络
12
作者 夏敏捷 郑海涛 樊银亭 《计算机工程与设计》 北大核心 2026年第3期895-900,F0003,共7页
针对现有的多模态方面级情感分析忽略模态间存在的语义相关性以及未能对情感信息进行动态捕获的问题,提出了一种多通道信息增强网络。该模型通过集成多级注意力机制,深度挖掘文本与图像模态间的语义相关性,为解决序列化数据中情感信息... 针对现有的多模态方面级情感分析忽略模态间存在的语义相关性以及未能对情感信息进行动态捕获的问题,提出了一种多通道信息增强网络。该模型通过集成多级注意力机制,深度挖掘文本与图像模态间的语义相关性,为解决序列化数据中情感信息的动态变化问题,设计动态情感感知模块,有效增强了方面与文本、方面与图像之间的信息交互。在两个基准数据集上开展实验,实验结果验证了所提模型在多模态情感分析任务中具有良好的表现。 展开更多
关键词 多模态 方面级情感分析 多通道 注意力机制 语义相关性 动态情感感知 信息交互
在线阅读 下载PDF
面向方面级多模态情感分析的双交互异构图神经网络模型
13
作者 杨力 文腾 +1 位作者 廖远 姜春雨 《计算机科学与探索》 北大核心 2026年第4期1147-1158,共12页
当前方面级多模态情感分析(MABSA)中普遍存在图像特征利用不足、模态间交互机制薄弱的问题。为此,提出一种新颖的图像双交互异构图神经网络模型(DIGN),以提升图像语义理解能力与跨模态特征融合效率。使用ResNet50和RoBERTa分别提取图像... 当前方面级多模态情感分析(MABSA)中普遍存在图像特征利用不足、模态间交互机制薄弱的问题。为此,提出一种新颖的图像双交互异构图神经网络模型(DIGN),以提升图像语义理解能力与跨模态特征融合效率。使用ResNet50和RoBERTa分别提取图像与文本的初始语义特征,并结合多头注意力机制增强文本的上下文表达;设计图卷积双交互模块,构建图像-文本与图像-方面两个异构图结构,利用GCN分别建模其语义关系,深入挖掘图像的局部与目标相关语义信息;引入注意力感知辅助模块,将两种交互特征通过多头注意力机制进行融合,并加入残差连接和归一化增强特征一致性与稳定性;提出多模态语义融合模块,结合全局注意力机制(GAM)与多层感知机(MLP),动态调控通道和特征维度权重,实现交互特征与原始方面特征的深层融合;通过Softmax函数完成情感分类任务。在Twitter-2015与Twitter-2017两个数据集上进行的对比实验和消融实验验证了该模型在精确度、鲁棒性及泛化能力上的显著提升。 展开更多
关键词 情感分析 多模态 方面 图卷积神经网络 注意力机制
在线阅读 下载PDF
结合自适应特征加权与权值优化策略的多模态情感分析
14
作者 冯广 周垣桦 +5 位作者 钟婷 杨燕茹 黄荣灿 盘皓然 林健忠 周科栋 《计算机工程与应用》 北大核心 2026年第6期194-204,共11页
多模态情感分析在智慧教育领域中发挥着关键作用,通过对课堂中产生的音视频流媒体数据进行实时分析,可以更精准地挖掘学生的情感状态。当前多模态情感分析中的编码方法普遍忽视了不同模态之间信息密度的差异以及模态特有信息之间的不兼... 多模态情感分析在智慧教育领域中发挥着关键作用,通过对课堂中产生的音视频流媒体数据进行实时分析,可以更精准地挖掘学生的情感状态。当前多模态情感分析中的编码方法普遍忽视了不同模态之间信息密度的差异以及模态特有信息之间的不兼容性,这在融合过程中可能引入噪声或导致信息冗余。为解决这一问题,提出了一种结合自适应特征加权与权值优化策略的多模态情感分析模型。在特征优化层,作为低级特征的音频与视频通过交叉注意力进行交互,从而提高信息密度,并通过自适应加权与权值优化策略对交互结果进行动态校正。在特征融合层,利用交叉注意力模块实现文本特征与音视频特征的有效融合,通过由特征加权过滤与权值优化约束实现的互补策略增强特征表征能力。在公开数据集MOSI和MOSEI上的实验结果表明,提出的模型在特征加权调优下显著提升了情感预测性能,在大多数评价指标上实现了较先进或具竞争力的表现。 展开更多
关键词 多模态情感分析 跨模态融合 交叉注意力 权值优化
在线阅读 下载PDF
跨模态特征增强与层次化MLP通信的多模态情感分析
15
作者 王旭阳 马瑾 《广西师范大学学报(自然科学版)》 北大核心 2026年第1期91-101,共11页
在多模态情感分析任务中,由于非语言模态信息利用不充分、跨模态交互缺乏细粒度关联建模以及层次化语义融合机制不完善,导致不同模态之间的情感信息难以实现有效融合。为此,本文提出一种跨模态特征增强与层次化MLP通信的多模态情感分析... 在多模态情感分析任务中,由于非语言模态信息利用不充分、跨模态交互缺乏细粒度关联建模以及层次化语义融合机制不完善,导致不同模态之间的情感信息难以实现有效融合。为此,本文提出一种跨模态特征增强与层次化MLP通信的多模态情感分析方法。该方法构建渐进式融合架构,首先通过跨模态注意力机制增强非语言模态信息,捕捉多对多的跨模态细粒度交互;继而使用层次化MLP通信模块,在模态融合维度与时间建模维度上分别设计并行与堆叠的MLP模块,实现水平与垂直方向的层次化特征交互,有效提升情感理解的准确性与表达能力。实验结果表明,本文模型在CMU-MOSI上,Acc2和F_(1)值较次优模型分别提升0.89和0.77个百分点,在CMU-MOSEI上对比实验各项指标均优于基准模型,Acc2、F_(1)值分别达到86.34%、86.25%。 展开更多
关键词 多模态 情感分析 跨模态注意力 层次化MLP通信 门控单元
在线阅读 下载PDF
模态缺失场景下基于生成重构和交互式自挖掘的多模态情感分析
16
作者 冯广 周科栋 +5 位作者 伍文燕 黄俊辉 林忆宝 刘馨婷 赵志文 苏旭 《计算机应用研究》 北大核心 2026年第3期664-671,共8页
在多模态情感分析任务中,真实应用常出现模态缺失现象,而现有缺失模态生成方法普遍过度依赖自动生成的模态表示,易导致生成误差放大和泛化能力不足等问题。为此,提出一种提示生成-双模态重构-自挖掘的多模态情感分析框架(prompt-reconst... 在多模态情感分析任务中,真实应用常出现模态缺失现象,而现有缺失模态生成方法普遍过度依赖自动生成的模态表示,易导致生成误差放大和泛化能力不足等问题。为此,提出一种提示生成-双模态重构-自挖掘的多模态情感分析框架(prompt-reconstruct-mining,PRM)。针对单模态和双模态缺失场景,该框架首先利用生成式提示与已有模态信息对缺失模态进行初步估计,随后设计了双模态支撑重构机制,有效降低了单源生成误差;在融合阶段创新性地引入自挖掘算子(self-mining operator)显式学习未缺失模态的深层语义特征,并提出零向量位插入(zero-slot insertion)策略聚合全局上下文信息。实验结果表明,在CMU-MOSI和CMU-MOSEI数据集的单模态和双模态缺失场景下,PRM模型的accuracy和F 1分别平均提升约1%~3%,并且在动态缺失与跨数据集迁移实验中仍表现出稳健的泛化能力,验证了模型在复杂缺失情境下的有效性和鲁棒性。 展开更多
关键词 多模态情感分析 模态缺失 提示生成 重构 自挖掘 零向量位
在线阅读 下载PDF
基于提示学习和注意力机制的多模态方面级情感分析
17
作者 曾桢 廖茂熙 +2 位作者 杨楠 罗燕 杨超 《科学技术与工程》 北大核心 2026年第2期708-716,共9页
针对现有多模态方面级情感分析模型存在不同模态之间关联性不强,难以有效利用它们之间相关信息的问题,提出了一种基于提示学习和注意力机制的多模态方面级情感分析模型。首先,在文本和图像特征分别输入多层Transformer层之前,对文本特... 针对现有多模态方面级情感分析模型存在不同模态之间关联性不强,难以有效利用它们之间相关信息的问题,提出了一种基于提示学习和注意力机制的多模态方面级情感分析模型。首先,在文本和图像特征分别输入多层Transformer层之前,对文本特征添加提示信息,并通过耦合函数投影到图像特征中;其次,使用跨模态注意力机制进行特征融合,并通过双向门控循环单元(bidirectional gated recurrent unit,BiGRU)过滤融合过程中的无关信息;然后,构建了一个由方面信息引导的注意力模块,以加强对方面信息的关注度;最后,通过全连接层和softmax分类层得到情感分类结果。该模型分别在公开的Twitter-15和Twitter-17数据集上进行实验,结果显示与多个基线模型相比,所提模型的准确率和F1均有所提升,证明了所提模型在多模态方面级情感分析任务中的有效性。 展开更多
关键词 方面级情感分析 多模态 提示学习 特征提取 注意力机制 自然语言处理
在线阅读 下载PDF
Fusing Syntactic Structure Information and Lexical Semantic Information for End-to-End Aspect-Based Sentiment Analysis 被引量:3
18
作者 Yong Bie Yan Yang Yiling Zhang 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2023年第2期230-243,共14页
The aspect-based sentiment analysis(ABSA)consists of two subtasksaspect term extraction and aspect sentiment prediction.Most methods conduct the ABSA task by handling the subtasks in a pipeline manner,whereby problems... The aspect-based sentiment analysis(ABSA)consists of two subtasksaspect term extraction and aspect sentiment prediction.Most methods conduct the ABSA task by handling the subtasks in a pipeline manner,whereby problems in performance and real application emerge.In this study,we propose an end-to-end ABSA model,namely,SSi-LSi,which fuses the syntactic structure information and the lexical semantic information,to address the limitation that existing end-to-end methods do not fully exploit the text information.Through two network branches,the model extracts syntactic structure information and lexical semantic information,which integrates the part of speech,sememes,and context,respectively.Then,on the basis of an attention mechanism,the model further realizes the fusion of the syntactic structure information and the lexical semantic information to obtain higher quality ABSA results,in which way the text information is fully used.Subsequent experiments demonstrate that the SSi-LSi model has certain advantages in using different text information. 展开更多
关键词 deep learning natural language processing aspect-based sentiment analysis graph convolutional
原文传递
基于统一对齐与多阶段融合机制的多模态情感分析模型
19
作者 冯广 刘馨婷 +4 位作者 林忆宝 赵志文 肖俊鸿 周科栋 黄俊辉 《计算机应用研究》 北大核心 2026年第2期342-352,共11页
针对多模态情感分析中模态异构、贡献动态与语义抽象不足等问题,提出一种三阶段闭环融合模型MIFA,路径包含“统一对齐-动态融合调控-高阶语义抽象”。方法上,首先以统一语义对齐实现异构模态在共享空间的一致表达;继而通过上下文门控与... 针对多模态情感分析中模态异构、贡献动态与语义抽象不足等问题,提出一种三阶段闭环融合模型MIFA,路径包含“统一对齐-动态融合调控-高阶语义抽象”。方法上,首先以统一语义对齐实现异构模态在共享空间的一致表达;继而通过上下文门控与通道调制联合估计模态/通道权重;最终以分层残差语义增强实现高阶抽象与判别强化。在CMU-MOSI与CMU-MOSEI数据集上的实验表明,二分类Acc2与F_(1)分别达到86.43%/86.03%和86.42%/85.81%,七分类Acc7为45.04%/50.41%,回归任务中MAE为0.689/0.532,总体优于主流模型。验证了该方法能够稳定对齐并自适应调控信息流,提升情感分类与强度回归性能,具备在复杂跨模态场景中的应用潜力。 展开更多
关键词 多模态情感分析 跨模态特征融合 统一语义对齐 动态融合调控 分层残差机制 跨模态鲁棒性
在线阅读 下载PDF
A Multitask Multiview Neural Network for End-to-End Aspect-Based Sentiment Analysis 被引量:6
20
作者 Yong Bie Yan Yang 《Big Data Mining and Analytics》 EI 2021年第3期195-207,共13页
The aspect-based sentiment analysis(ABSA) consists of two subtasks—aspect term extraction and aspect sentiment prediction. Existing methods deal with both subtasks one by one in a pipeline manner, in which there lies... The aspect-based sentiment analysis(ABSA) consists of two subtasks—aspect term extraction and aspect sentiment prediction. Existing methods deal with both subtasks one by one in a pipeline manner, in which there lies some problems in performance and real application. This study investigates the end-to-end ABSA and proposes a novel multitask multiview network(MTMVN) architecture. Specifically, the architecture takes the unified ABSA as the main task with the two subtasks as auxiliary tasks. Meanwhile, the representation obtained from the branch network of the main task is regarded as the global view, whereas the representations of the two subtasks are considered two local views with different emphases. Through multitask learning, the main task can be facilitated by additional accurate aspect boundary information and sentiment polarity information. By enhancing the correlations between the views under the idea of multiview learning, the representation of the global view can be optimized to improve the overall performance of the model. The experimental results on three benchmark datasets show that the proposed method exceeds the existing pipeline methods and end-to-end methods, proving the superiority of our MTMVN architecture. 展开更多
关键词 deep learning multitask learning multiview learning natural language processing aspect-based sentiment analysis
原文传递
上一页 1 2 11 下一页 到第
使用帮助 返回顶部