Abstract
Multi-label feature selection improves the performance of learning models by eliminating irrelevant features. However, most existing methods assume that the labels in the training set take only simple logical values and that all relevant labels describe an instance equally. In practical applications, the influence of different labels on an instance may vary. To address this, the paper proposes a feature selection method based on fuzzy neighborhood information entropy and a mutual discriminant index. First, label enhancement is used to transform the original multi-label datasets into label distribution datasets. Next, neighborhood information entropy quantifies the similarity relationships between samples in the label space. Finally, a fuzzy neighborhood mutual discriminant index combines the feature space with the label space to identify a feature subset with strong discriminative ability. Experiments on six datasets show that the algorithm achieves better classification performance than the other algorithms compared.
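The abstract does not give the paper's formulas, so the following is only a minimal sketch of the second step, assuming the commonly used definition of neighborhood information entropy, NH = -(1/n) Σ log2(|δ(x_i)| / n), where δ(x_i) is the set of samples within radius δ of x_i. The function name `neighborhood_entropy`, the Euclidean distance, and the default radius are illustrative assumptions, not the authors' implementation.

```python
import numpy as np


def neighborhood_entropy(label_dist, delta=0.2):
    """Neighborhood information entropy of samples in a label-distribution space.

    label_dist: (n_samples, n_labels) array of label description degrees,
                e.g. the output of a label enhancement step (assumed input).
    delta:      neighborhood radius (hypothetical default, not from the paper).
    """
    D = np.asarray(label_dist, dtype=float)
    n = D.shape[0]
    # Pairwise Euclidean distances between label-distribution rows.
    diff = D[:, None, :] - D[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=2))
    # Size of each sample's delta-neighborhood (always includes the sample itself).
    sizes = (dist <= delta).sum(axis=1)
    # Entropy is 0 when all samples are mutual neighbors and log2(n) when none are.
    return float(-np.mean(np.log2(sizes / n)))


if __name__ == "__main__":
    # Toy label distributions: two tight groups of two samples each.
    toy = np.array([[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]])
    print(neighborhood_entropy(toy, delta=0.2))
```

A smaller value means samples are more similar to one another in the label space. A fuzzy variant would replace the hard threshold with a graded similarity, which this sketch does not attempt.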
Authors
吴立胜
鄂晨
WU Li-sheng; E Chen (Jiangxi University of Technology, Nanchang 330098, China)
Source
《电脑与电信》
2025, No. 4, pp. 17-22 (6 pages)
Computer & Telecommunication
Keywords
feature selection
fuzzy neighborhood
multi-label learning