基于Transformer架构的跨模态语义理解研究

Research on cross modal semantic understanding based on transformer architecture

下载PDF

导出

摘要面对多媒体数据的爆炸式增长,跨模态语义理解已成为人工智能领域的核心挑战与前沿方向。Transformer架构凭借在自然语言处理中展现出的卓越能力,为解决这一难题提供了关键范式。文章系统性地研究了基于Transformer的跨模态语义理解方法,重点探讨了该架构在语义对齐、信息融合与深层理解3个关键环节的创新应用。 In the face of the explosive growth of multimedia data,cross-modal semantic understanding has become a core challenge and frontier direction in the field of artificial intelligence.The Transformer architecture offers a key paradigm for addressing this challenge,thanks to its outstanding capabilities demonstrated in natural language processing.This paper systematically studies the Transformer-based cross-modal semantic understanding method,and focuses on the innovative application of this architecture in three key links:semantic alignment,information fusion and deep understanding.

作者蒋毅 JIANG Yi(Electromechanical and Information Engineering Department,Changde Vocational Technical College,Changde,Hunan 415000,China)

机构地区常德职业技术学院机电与信息工程系

出处《计算机应用文摘》 2025年第24期246-248,共3页

关键词 Transformer架构跨模态语义理解语义对齐数据融合 transformer architecture cross modal semantic understanding semantic alignment data fusion

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1毕向阳.用“计算理解”融合质性与量化分析——基于Transformer架构的评教大数据研究[J].智能社会研究,2025(3):154-179. 被引量：1
2程齐凯,刘富康,石湘,黄永,陆伟.基于视觉注意力模拟的交互式科学图表理解研究[J].情报科学,2025,43(9):109-121.

计算机应用文摘

2025年第24期

浏览历史

内容加载中请稍等...

基于Transformer架构的跨模态语义理解研究

相关作者

相关机构

相关主题

浏览历史