摘要
在多标签学习中,人工标注标签的主观性和不稳定性往往造成标签缺失,无法形成完备的标签空间,从而对监督学习算法的训练产生误导.标签相关性可在一定程度上弥补缺失标签对算法分类性能造成的不利影响.但缺失标签也会导致对标签相关性的估计不准确.针对该问题,提出一种增强标签相关性矩阵的不完备多标签学习(multi-label learning with incomplete labels via augmented label correlation matrix,ML-ALC)方法.首先,通过拉普拉斯映射构造数据的低维流形;然后,使用标签向量计算原始标签相关矩阵;接着,构造一个校正矩阵对原始标签相关矩阵进行增强,并通过回归系数矩阵和增强标签相关性矩阵将原始特征空间和标签空间分别映射到低维流形;最后,经过迭代学习获得优化的回归系数矩阵和增强标签相关性矩阵,并应用于多标签分类.实验结果表明,ML-ALC方法的分类性能优于其他针对缺失标签的多标签分类方法.
In multi-label learning,the subjectivity and instability of manual labeling often result in the absence of partial class labels and an incomplete label space.Missing labels are likely to be misleading in the training of supervised learning algorithms.The use of label correlation can alleviate the negative efiect of missing labels on the algorithm classiflcation performance to a certain extent.However,missing labels can lead to inaccurate estimations of label correlations.To address this problem,a method called multi-label learning with incomplete labels using an augmented label correlation matrix(ML-ALC)was proposed.First,a low-dimensional manifold of the data is constructed using Laplace mapping,and the original label correlation matrix is calculated using the label vectors.Subsequently,a correction matrix is constructed to augment the original label correlation matrix,and the original feature space and label space are mapped to the low-dimensional manifold through a regression coe–cient matrix and augmented label correlation matrix,respectively.Finally,the optimized regression coe–cient matrix and augmented label correlation matrix are obtained through iterative learning and applied to multi-label classiflcation.The experimental results demonstrate that the proposed algorithm performs better than other state-of-the-art methods in multi-label learning with incomplete labels.
作者
许智磊
黄睿
XU Zhilei;HUANG Rui(School of Communication and Information Engineering,Shanghai University,Shanghai 200444,China)
出处
《上海大学学报(自然科学版)》
北大核心
2025年第6期915-930,共16页
Journal of Shanghai University:Natural Science Edition
关键词
不完备多标签学习
缺失标签
标签相关性
低维流形
multi-label learning with incomplete labels
missing labels
label correlation
low-dimensional manifold