低秩注意力与对比对齐的深度多视图聚类

Low-Rank Attention and Contrastive Alignment for Deep Multi-view Clustering

下载PDF

导出

摘要多视图聚类(MVC)作为无监督学习的一个关键研究方向,在处理大规模数据时,结合深度学习可以显示出其独特的优势。然而,三个关键挑战仍未得到解决:(1)如何有效地从海量数据中提取判别特征,同时减少过多的训练开销;(2)如何协调视图之间的异构特征表示;(3)如何保持多种模态之间的语义一致性。为了应对这些挑战,提出了一种基于线性注意力与簇对齐的框架(LRACA)。开发了一种标签驱动的锚点采样策略,其中类别感知K-means算法选择交叉视图对齐的锚点来指导训练方向的确定。为特征预训练设计了特定于视图的自编码器,并引入了一种动态低秩注意力机制,将键/值矩阵投影到线性子空间中,将注意力操作的计算复杂度从O(N^(2))降低到了O(N),同时显著增强特征区分性,并更新伪标签。提出了一种集群级对比学习范式,以聚类伪标签为纽带强化跨视图语义一致性,稳定和巩固了整个框架的聚类表现。对六个基准数据集的广泛实验表明,LRACA在聚类准确性、纯度和归一化互信息方面优于八个最先进的MVC基线,验证了其有效性和效率。 Multi-view clustering(MVC),as a pivotal research direction in unsupervised learning,when dealing with largescale data,incorporating deep learning can show its unique advantages.However,three critical challenges remain unresolved:(1)how to efficiently extract discriminative features from massive data while mitigating excessive training overhead,(2)how to harmonize heterogeneous feature representations across views,and(3)how to maintain semantic consistency among multiple modalities.To address these challenges,this paper proposes a deep multi-view clustering alignment guided by low-rank attention(LRACA).The technical contributions are threefold:firstly,a label-driven anchor sampling strategy is developed,where category-aware K-means algorithm selects cross-view aligned anchors to guide training direction determination.Secondly,this paper designs view-specific autoencoders for feature pretraining and introduces a low-rank attention mechanism that projects key/value matrices into linear subspaces,reducing the computational complexity of attention operations from O(N^(2))to O(N),while significantly enhancing feature discriminability and updating pseudo-labels.Thirdly,a cluster-level contrastive learning paradigm is proposed,leveraging clustering pseudo-labels as a bridge to strengthen cross-view semantic consistency,thereby stabilizing and consolidating the clustering performance of the entire framework.Extensive experiments on six benchmark datasets demonstrate that LRACA outperforms eight state-of-the-art MVC baselines in terms of clustering accuracy,purity and normalized mutual information,validating its effectiveness and efficiency.

作者温珍平孙颖慧李杏峰孙权森 WEN Zhenping;SUN Yinghui;LI Xingfeng;SUN Quansen(College of Computer Science and Engineering,Nanjing University of Science and Technology,Nanjing 210094,China;College of Computer Science and Engineering,Southeast University,Nanjing 210096,China)

机构地区南京理工大学计算机科学与工程学院东南大学计算机科学与工程学院

出处《计算机科学与探索》北大核心 2026年第4期1103-1114,共12页 Journal of Frontiers of Computer Science and Technology

基金国家自然科学基金(62372235,62406069) 中国博士后科学基金(2024M750425)。

关键词多视图聚类低秩注意力对比学习深度聚类跨视图对齐 multi-view clustering low-rank attention contrastive learning deep clustering cross-view alignment

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1张亚海.AI赋能下的小学语文阅读教学模式革新[J].成长,2025(22):166-168.
2漆涛,张永琴,田敬峰.教师与教研员何以联合:教研共同体的生成与运作[J].复印报刊资料(中小学教育),2024(10):21-27.
3刘娟,彭金锋.大数据背景下的高校专业满意度模型构建研究[J].中国科技投资,2025(23):131-133.
4王悦.纵向联邦学习中的无目标中毒攻击[J].智能计算机与应用,2026,16(1):103-109.
5高书涵,周超,荣梦琪,刘养东,刘辉,沈浩,贾然,刘传彬,张洋,刘嵘,申抒含.面向高压输电线路的三维点云语义分割[J].测绘通报,2026(2):156-160.
6陈晓娟.技术技能型人才能力图谱的研究与构建——以电气自动化技术专业为例[J].教育观察,2025,14(34):78-81. 被引量：1
7李文亮,张浩,陈鸣睿,梁晨,翟持.基于NMI-GBR-SHAP的甲醇制烯烃催化剂积炭过程解释性分析[J].化工学报,2026,77(2):791-802.
8张天啸,张永,刘文哲.基于增强低秩的多视图子空间聚类算法[J].电信科学,2026,42(3):81-96.
9贾丽娜.初中体育项目化教育的策略探索[J].小樱桃,2024(47):125-126.

计算机科学与探索

2026年第4期

浏览历史

内容加载中请稍等...

低秩注意力与对比对齐的深度多视图聚类

相关作者

相关机构

相关主题

浏览历史