期刊文献+
共找到2篇文章
< 1 >
每页显示 20 50 100
融合弱监督目标定位的细粒度小样本学习 被引量:7
1
作者 贺小箭 林金福 《中国图象图形学报》 CSCD 北大核心 2022年第7期2226-2239,共14页
目的小样本学习旨在通过一幅或几幅图像来学习全新的类别。目前许多小样本学习方法基于图像的全局表征,可以很好地实现常规小样本图像分类任务。但是,细粒度图像分类需要依赖局部的图像特征,而基于全局表征的方法无法有效地获取图像的... 目的小样本学习旨在通过一幅或几幅图像来学习全新的类别。目前许多小样本学习方法基于图像的全局表征,可以很好地实现常规小样本图像分类任务。但是,细粒度图像分类需要依赖局部的图像特征,而基于全局表征的方法无法有效地获取图像的局部特征,导致很多小样本学习方法不能很好地处理细粒度小样本图像分类问题。为此,提出一种融合弱监督目标定位的细粒度小样本学习方法。方法在数据量有限的情况下,目标定位是一个有效的方法,能直接提供最具区分性的区域。受此启发,提出了一个基于自注意力的互补定位模块来实现弱监督目标定位,生成筛选掩膜进行特征描述子的筛选。基于筛选的特征描述子,设计了一种语义对齐距离来度量图像最具区分性区域的相关性,进而完成细粒度小样本图像分类。结果在mini Image Net数据集上,本文方法在1-shot和5-shot下的分类精度相较性能第2的方法高出0.56%和5.02%。在细粒度数据集Stanford Dogs和Stanford Cars数据集上,本文方法在1-shot和5-shot下的分类精度相较性能第2的方法分别提高了4.18%,7.49%和16.13,5.17%。在CUB 200-2011(Caltech-UCSD birds)数据集中,本文方法在5-shot下的分类精度相较性能第2的方法提升了1.82%。泛化性实验也显示出本文方法可以更好地同时处理常规小样本学习和细粒度小样本学习。此外,可视化结果显示出所提出的弱监督目标定位模块可以更完整地定位出目标。结论融合弱监督目标定位的细粒度小样本学习方法显著提高了细粒度小样本图像分类的性能,而且可以同时处理常规的和细粒度的小样本图像分类。 展开更多
关键词 弱监督目标定位(wsol) 小样本学习(FSL) 细粒度图像分类 细粒度小样本学习 特征描述子
原文传递
SSA: semantic structure aware inference on CNN networks for weakly pixel-wise dense predictions without cost
2
作者 Yanpeng SUN Zechao LI 《Frontiers of Computer Science》 2025年第2期1-10,共10页
The pixel-wise dense prediction tasks based on weakly supervisions currently use Class Attention Maps(CAMs)to generate pseudo masks as ground-truth.However,existing methods often incorporate trainable modules to expan... The pixel-wise dense prediction tasks based on weakly supervisions currently use Class Attention Maps(CAMs)to generate pseudo masks as ground-truth.However,existing methods often incorporate trainable modules to expand the immature class activation maps,which can result in significant computational overhead and complicate the training process.In this work,we investigate the semantic structure information concealed within the CNN network,and propose a semantic structure aware inference(SSA)method that utilizes this information to obtain high-quality CAM without any additional training costs.Specifically,the semantic structure modeling module(SSM)is first proposed to generate the classagnostic semantic correlation representation,where each item denotes the affinity degree between one category of objects and all the others.Then,the immature CAM are refined through a dot product operation that utilizes semantic structure information.Finally,the polished CAMs from different backbone stages are fused as the output.The advantage of SSA lies in its parameter-free nature and the absence of additional training costs,which makes it suitable for various weakly supervised pixel-dense prediction tasks.We conducted extensive experiments on weakly supervised object localization and weakly supervised semantic segmentation,and the results confirm the effectiveness of SSA. 展开更多
关键词 class attention maps semantic structure weaklysupervised object localization weakly-supervised semantic segmentation
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部