期刊文献+
共找到73篇文章
< 1 2 4 >
每页显示 20 50 100
Generalized Prototype-Based Few-Shot Semantic Segmentation Network
1
作者 Hang Zhou Guanglu Sun 《国际计算机前沿大会会议论文集》 2025年第1期307-324,共18页
Few-shot image semantic segmentation aims to achieve pixel-level classification for novel classes using only a few labeled examples.The method first trains the segmentation model on base classes,and then adapts it to ... Few-shot image semantic segmentation aims to achieve pixel-level classification for novel classes using only a few labeled examples.The method first trains the segmentation model on base classes,and then adapts it to novel classes.Although existing methods have achieved remarkable performance in few-shot image semantic segmentation,they still face the following challenges.Traditional methods typically rely on mask average pooling to generate single-category prototype vectors and perform feature matching via metric learning,but they exhibit significant limitations in modeling inter-category relationships and addressing complex background interference.Inspired by the analogy-based transfer mechanisms in cognitive psychology,we propose a Generalized Prototype Network(GPNet)to enhance the model's generalization ability for unseen categories and improve robustness in feature matching.GPNet consists of two key modules.The first is a generalized prototype enhancement module,which explores potential inter-category relationships to construct more discriminative category prototype representations.The second is a multi-scale feature alignment module,which dynamically aligns support and query features across multiple scales using an attention mechanism,thus mitigating background interference in complex scenarios.Experimental results demonstrate that the proposed method significantly outperforms existing state-of-the-art approaches on several few-shot semantic segmentation benchmarks,validating its effectiveness and generalization capabilities. 展开更多
关键词 semantic segmentation few-shot semantic segmentation PROTOTYPE semantic alignment few-shot learning
原文传递
CAMSNet:Few-Shot Semantic Segmentation via Class Activation Map and Self-Cross Attention Block
2
作者 Jingjing Yan Xuyang Zhuang +2 位作者 Xuezhuan Zhao Xiaoyan Shao Jiaqi Han 《Computers, Materials & Continua》 2025年第3期5363-5386,共24页
The key to the success of few-shot semantic segmentation(FSS)depends on the efficient use of limited annotated support set to accurately segment novel classes in the query set.Due to the few samples in the support set... The key to the success of few-shot semantic segmentation(FSS)depends on the efficient use of limited annotated support set to accurately segment novel classes in the query set.Due to the few samples in the support set,FSS faces challenges such as intra-class differences,background(BG)mismatches between query and support sets,and ambiguous segmentation between the foreground(FG)and BG in the query set.To address these issues,The paper propose a multi-module network called CAMSNet,which includes four modules:the General Information Module(GIM),the Class Activation Map Aggregation(CAMA)module,the Self-Cross Attention(SCA)Block,and the Feature Fusion Module(FFM).In CAMSNet,The GIM employs an improved triplet loss,which concatenates word embedding vectors and support prototypes as anchors,and uses local support features of FG and BG as positive and negative samples to help solve the problem of intra-class differences.Then for the first time,the Class Activation Map(CAM)from the Weakly Supervised Semantic Segmentation(WSSS)is applied to FSS within the CAMA module.This method replaces the traditional use of cosine similarity to locate query information.Subsequently,the SCA Block processes the support and query features aggregated by the CAMA module,significantly enhancing the understanding of input information,leading to more accurate predictions and effectively addressing BG mismatch and ambiguous FG-BG segmentation.Finally,The FFM combines general class information with the enhanced query information to achieve accurate segmentation of the query image.Extensive Experiments on PASCAL and COCO demonstrate that-5i-20ithe CAMSNet yields superior performance and set a state-of-the-art. 展开更多
关键词 few-shot semantic segmentation semantic segmentation meta learning
在线阅读 下载PDF
Two-Stage Category-Guided Frequency Modulation for Few-Shot Semantic Segmentation
3
作者 Yiming Tang Yanqiu Chen 《Computers, Materials & Continua》 2025年第5期1707-1726,共20页
Semantic segmentation of novel object categories with limited labeled data remains a challenging problem in computer vision.Few-shot segmentation methods aim to address this problem by recognizing objects from specifi... Semantic segmentation of novel object categories with limited labeled data remains a challenging problem in computer vision.Few-shot segmentation methods aim to address this problem by recognizing objects from specific target classes with a few provided examples.Previous approaches for few-shot semantic segmentation typically represent target classes using class prototypes.These prototypes are matched with the features of the query set to get segmentation results.However,class prototypes are usually obtained by applying global average pooling on masked support images.Global pooling discards much structural information,which may reduce the accuracy of model predictions.To address this issue,we propose a Category-Guided Frequency Modulation(CGFM)method.CGFM is designed to learn category-specific information in the frequency space and leverage it to provide a twostage guidance for the segmentation process.First,to self-adaptively activate class-relevant frequency bands while suppressing irrelevant ones,we leverage the Dual-Perception Gaussian Band Pre-activation(DPGBP)module to generate Gaussian filters using class embedding vectors.Second,to further enhance category-relevant frequency components in activated bands,we design a Support-Guided Category Response Enhancement(SGCRE)module to effectively introduce support frequency components into the modulation of query frequency features.Experiments on the PASCAL-5^(i) and COCO-20^(i) datasets demonstrate the promising performance of our model. 展开更多
关键词 few-shot semantic segmentation frequency feature category representation
在线阅读 下载PDF
Axial Assembled Correspondence Network for Few-Shot Semantic Segmentation 被引量:3
4
作者 Yu Liu Bin Jiang Jiaming Xu 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2023年第3期711-721,共11页
Few-shot semantic segmentation aims at training a model that can segment novel classes in a query image with only a few densely annotated support exemplars.It remains a challenge because of large intra-class variation... Few-shot semantic segmentation aims at training a model that can segment novel classes in a query image with only a few densely annotated support exemplars.It remains a challenge because of large intra-class variations between the support and query images.Existing approaches utilize 4D convolutions to mine semantic correspondence between the support and query images.However,they still suffer from heavy computation,sparse correspondence,and large memory.We propose axial assembled correspondence network(AACNet)to alleviate these issues.The key point of AACNet is the proposed axial assembled 4D kernel,which constructs the basic block for semantic correspondence encoder(SCE).Furthermore,we propose the deblurring equations to provide more robust correspondence for the aforementioned SCE and design a novel fusion module to mix correspondences in a learnable manner.Experiments on PASCAL-5~i reveal that our AACNet achieves a mean intersection-over-union score of 65.9%for 1-shot segmentation and 70.6%for 5-shot segmentation,surpassing the state-of-the-art method by 5.8%and 5.0%respectively. 展开更多
关键词 Artificial intelligence computer vision deep convolutional neural network few-shot semantic segmentation
在线阅读 下载PDF
Part-Whole Relational Few-Shot 3D Point Cloud Semantic Segmentation 被引量:1
5
作者 Shoukun Xu Lujun Zhang +2 位作者 Guangqi Jiang Yining Hua Yi Liu 《Computers, Materials & Continua》 SCIE EI 2024年第3期3021-3039,共19页
This paper focuses on the task of few-shot 3D point cloud semantic segmentation.Despite some progress,this task still encounters many issues due to the insufficient samples given,e.g.,incomplete object segmentation an... This paper focuses on the task of few-shot 3D point cloud semantic segmentation.Despite some progress,this task still encounters many issues due to the insufficient samples given,e.g.,incomplete object segmentation and inaccurate semantic discrimination.To tackle these issues,we first leverage part-whole relationships into the task of 3D point cloud semantic segmentation to capture semantic integrity,which is empowered by the dynamic capsule routing with the module of 3D Capsule Networks(CapsNets)in the embedding network.Concretely,the dynamic routing amalgamates geometric information of the 3D point cloud data to construct higher-level feature representations,which capture the relationships between object parts and their wholes.Secondly,we designed a multi-prototype enhancement module to enhance the prototype discriminability.Specifically,the single-prototype enhancement mechanism is expanded to the multi-prototype enhancement version for capturing rich semantics.Besides,the shot-correlation within the category is calculated via the interaction of different samples to enhance the intra-category similarity.Ablation studies prove that the involved part-whole relations and proposed multi-prototype enhancement module help to achieve complete object segmentation and improve semantic discrimination.Moreover,under the integration of these two modules,quantitative and qualitative experiments on two public benchmarks,including S3DIS and ScanNet,indicate the superior performance of the proposed framework on the task of 3D point cloud semantic segmentation,compared to some state-of-the-art methods. 展开更多
关键词 few-shot point cloud semantic segmentation CapsNets
在线阅读 下载PDF
Filter Bank Networks for Few-Shot Class-Incremental Learning
6
作者 Yanzhao Zhou Binghao Liu +1 位作者 Yiran Liu Jianbin Jiao 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第10期647-668,共22页
Deep Convolution Neural Networks(DCNNs)can capture discriminative features from large datasets.However,how to incrementally learn new samples without forgetting old ones and recognize novel classes that arise in the d... Deep Convolution Neural Networks(DCNNs)can capture discriminative features from large datasets.However,how to incrementally learn new samples without forgetting old ones and recognize novel classes that arise in the dynamically changing world,e.g.,classifying newly discovered fish species,remains an open problem.We address an even more challenging and realistic setting of this problem where new class samples are insufficient,i.e.,Few-Shot Class-Incremental Learning(FSCIL).Current FSCIL methods augment the training data to alleviate the overfitting of novel classes.By contrast,we propose Filter Bank Networks(FBNs)that augment the learnable filters to capture fine-detailed features for adapting to future new classes.In the forward pass,FBNs augment each convolutional filter to a virtual filter bank containing the canonical one,i.e.,itself,and multiple transformed versions.During back-propagation,FBNs explicitly stimulate fine-detailed features to emerge and collectively align all gradients of each filter bank to learn the canonical one.FBNs capture pattern variants that do not yet exist in the pretraining session,thus making it easy to incorporate new classes in the incremental learning phase.Moreover,FBNs introduce model-level prior knowledge to efficiently utilize the limited few-shot data.Extensive experiments on MNIST,CIFAR100,CUB200,andMini-ImageNet datasets show that FBNs consistently outperformthe baseline by a significantmargin,reporting new state-of-the-art FSCIL results.In addition,we contribute a challenging FSCIL benchmark,Fishshot1K,which contains 8261 underwater images covering 1000 ocean fish species.The code is included in the supplementary materials. 展开更多
关键词 Deep learning incremental learning few-shot learning Filter Bank Networks
在线阅读 下载PDF
基于高精度三维扫描的钢梁智能调位一体化控制技术研究及应用
7
作者 王敏 黄涛 樊健生 《桥梁建设》 北大核心 2026年第1期155-161,共7页
为提高装配式钢桥节段钢梁顶推施工焊接前调位精度与效率,提出基于高精度三维扫描的钢梁智能调位一体化控制技术。该技术从减小钢梁调位误差的角度出发,基于三维扫描技术得到高精度点云数据,采用表面重建算法得到带制造误差的带力模型,... 为提高装配式钢桥节段钢梁顶推施工焊接前调位精度与效率,提出基于高精度三维扫描的钢梁智能调位一体化控制技术。该技术从减小钢梁调位误差的角度出发,基于三维扫描技术得到高精度点云数据,采用表面重建算法得到带制造误差的带力模型,模拟钢梁“预拼-调位-拼装”全过程挠度变化,计算得到由制造误差、环境温度变化、支撑条件改变等带来的调位误差,修正调位指令,以减小系统误差;提出以最优化法为核心的智能调位算法,建立梁面测点数据与梁底三向千斤顶调整量的微分运动关系,采用智能调位系统取代人工进行控制点坐标采集和调位控制操作,以减小调位过程中的人为操作误差。为验证该技术的可行性,依托某黄河特大桥进行实际应用。结果表明:采用该智能调位一体化控制技术,焊接面匹配精度为±1 mm,可在5 min内将调位误差控制在±2 mm,较传统方法,该技术不仅提高了施工效率及调位精度,同时提高了焊接平顺度。 展开更多
关键词 装配式钢桥 节段钢梁 顶推施工 三维扫描 带力模型 智能调位系统 钢梁调位
在线阅读 下载PDF
多柱式桥墩超宽节段拼装盖梁抗震性能研究
8
作者 杨靖宁 邓开来 +2 位作者 付晓 许宇新 徐腾飞 《振动与冲击》 北大核心 2026年第4期259-264,285,共7页
构件轻型化、设备便携化是预制装配式桥梁重要的发展方向。为实现超宽市政桥梁的轻量化建造,提出了多柱式桥墩的盖梁横向分段装配建造方案,将跨度40 m的超宽盖梁分为3个预制节段,之间采用C50后浇带连接,由此实现了40 m超宽盖梁的轻量化... 构件轻型化、设备便携化是预制装配式桥梁重要的发展方向。为实现超宽市政桥梁的轻量化建造,提出了多柱式桥墩的盖梁横向分段装配建造方案,将跨度40 m的超宽盖梁分为3个预制节段,之间采用C50后浇带连接,由此实现了40 m超宽盖梁的轻量化装配建造。为探究地震下多柱式桥墩超宽盖梁的性能,建立该盖梁的精细化数值模型,通过增量动力分析,得到了盖梁预制节段与后浇带间界面的结构行为,发现新旧混凝土界面依然是拼装盖梁的损伤薄弱部位,界面开裂降低了盖梁刚度,墩柱之间耦合作用减弱。大震作用下,界面开合宽度不超过1.0 mm,且震后界面自动闭合。此外,预制拼装盖梁方案下的墩顶位移、支座变形等宏观响应与现浇桥墩较为接近,墩底剪力、墩身应变有部分降低,表明节段拼装盖梁方案基本达到“等同现浇”的性能目标。 展开更多
关键词 节段拼装盖梁 后浇带 界面开裂 增量动力分析 等同现浇
在线阅读 下载PDF
基于改进增量模型的汽车行驶工况构建
9
作者 陈德启 张淑慧 +1 位作者 张文会 王宪彬 《河南城建学院学报》 2026年第1期31-37,44,共8页
为精准匹配轻型汽车的实际行驶特征,基于真实道路车辆行驶轨迹数据,提出一种多源异构数据融合的行驶工况构建方法。构建多阶段数据处理框架,生成高精度训练数据集;依据明确定义的运动学片段及其速度约束条件,提取有效片段并确定16个关... 为精准匹配轻型汽车的实际行驶特征,基于真实道路车辆行驶轨迹数据,提出一种多源异构数据融合的行驶工况构建方法。构建多阶段数据处理框架,生成高精度训练数据集;依据明确定义的运动学片段及其速度约束条件,提取有效片段并确定16个关键特征参数。采用主成分分析与聚类分析的耦合策略,通过Kaiser准则提取主成分,并利用改进的增量模型构建表征性显著的行驶工况曲线。结果表明,改进后增量模型的累积贡献方差值降至0.122,验证了所建行驶工况模型具有较好的准确性与合理性。 展开更多
关键词 汽车行驶工况 运动学片段 主成分分析 改进增量模型
在线阅读 下载PDF
Parallel prototype filter and feature refinement for few-shot medical image segmentation
10
作者 Haoxiang ZHU Houjin CHEN +3 位作者 Yanfeng LI Jia SUN Ziwei CHEN Jiaxin LI 《Frontiers of Information Technology & Electronic Engineering》 2025年第11期2143-2158,共16页
Medical image segmentation is critical for clinical diagnosis,but the scarcity of annotated data limits robust model training,making few-shot learning indispensable.Existing methods often suffer from two issues—perfo... Medical image segmentation is critical for clinical diagnosis,but the scarcity of annotated data limits robust model training,making few-shot learning indispensable.Existing methods often suffer from two issues—performance degradation due to significant inter-class variations in pathological structures,and overreliance on attention mechanisms with high computational complexity(O(n2)),which hinders the efficient modeling of long-range dependencies.In contrast,the state space model(SSM)offers linear complexity(O(n))and superior efficiency,making it a key solution.To address these challenges,we propose PPFFR(parallel prototype filter and feature refinement)for few-shot medical image segmentation.The proposed framework comprises three key modules.First,we propose the prototype refinement(PR)module to construct refined class subgraphs from encoder-extracted features of both support and query images,which generates support prototypes with minimized inter-class variation.We then propose the parallel prototype filter(PPF)module to suppress background interference and enhance the correlation between support and query prototypes.Finally,we implement the feature refinement(FR)module to further enhance segmentation accuracy and accelerate model convergence with SSM’s robust long-range dependency modeling capability,integrated with multi-head attention(MHA)to preserve spatial details.Experimental results on the Abd-MRI dataset demonstrate that FR with MHA outperforms FR alone in segmenting the left kidney,right kidney,liver,and spleen,and in terms of mean accuracy,confirming MHA’s role in improving precision.In extensive experiments conducted on three public datasets under the 1-way 1-shot setting,PPFFR achieves Dice scores of 87.62%,86.74%,and 79.71%separately,consistently surpassing state-of-the-art few-shot medical image segmentation methods.As the critical component,SSM ensures that PPFFR balances performance with efficiency.Ablation studies validate the effectiveness of the PR,PPF,and FR modules.The results indicate that explicit inter-class variation reduction and SSM-based feature refinement can enhance accuracy without heavy computational overhead.In conclusion,PPFFR effectively enhances inter-class consistency and computational efficiency for few-shot medical image segmentation.This work provides insights for few-shot learning in medical imaging and inspires lightweight architecture designs for clinical deployment. 展开更多
关键词 few-shot learning Medical image segmentation Prototype filter State space model
原文传递
Leveraging on few-shot learning for tire pattern classification in forensics
11
作者 Lijun Jiang Syed Ariff Syed Hesham +1 位作者 Keng Pang Lim Changyun Wen 《Journal of Automation and Intelligence》 2023年第3期146-151,共6页
This paper presents a novel approach for tire-pattern classification,aimed at conducting forensic analysis on tire marks discovered at crime scenes.The classification model proposed in this study accounts for the intr... This paper presents a novel approach for tire-pattern classification,aimed at conducting forensic analysis on tire marks discovered at crime scenes.The classification model proposed in this study accounts for the intricate and dynamic nature of tire prints found in real-world scenarios,including accident sites.To address this complexity,the classifier model was developed to harness the meta-learning capabilities of few-shot learning algorithms(learning-to-learn).The model is meticulously designed and optimized to effectively classify both tire patterns exhibited on wheels and tire-indentation marks visible on surfaces due to friction.This is achieved by employing a semantic segmentation model to extract the tire pattern marks within the image.These marks are subsequently used as a mask channel,combined with the original image,and fed into the classifier to perform classification.Overall,The proposed model follows a three-step process:(i)the Bilateral Segmentation Network is employed to derive the semantic segmentation of the tire pattern within a given image.(ii)utilizing the semantic image in conjunction with the original image,the model learns and clusters groups to generate vectors that define the relative position of the image in the test set.(iii)the model performs predictions based on these learned features.Empirical verification demonstrates usage of semantic model to extract the tire patterns before performing classification increases the overall accuracy of classification by∼4%. 展开更多
关键词 META-LEARNING few-shot classification Semantic segmentation
在线阅读 下载PDF
基于特征记忆库的三维点云域自适应语义分割
12
作者 陈子宜 叶锋 《福建师范大学学报(自然科学版)》 北大核心 2025年第2期35-42,共8页
由于城市车载激光点云应用场景的复杂性,导致基于深度学习的语义分割模型面临着目标语义层面的域迁移现象,通常需要重新训练完整模型才能处理新增的语义类别。然而城市车载激光点云通常包括极大量的点,重新训练语义分割模型会浪费大量... 由于城市车载激光点云应用场景的复杂性,导致基于深度学习的语义分割模型面临着目标语义层面的域迁移现象,通常需要重新训练完整模型才能处理新增的语义类别。然而城市车载激光点云通常包括极大量的点,重新训练语义分割模型会浪费大量的资源。通过基于特征记忆库的城市车载激光点云域自适应语义分割方法,解决城市车载激光点云之间的目标语义域迁移现象,使得新增语义类别数据时,只需提取新增语义类别的特征,而无需重新训练完整的语义分割模型,得到的语义分割性能对比重新训练完整模型仅有较小的损失。 展开更多
关键词 三维点云 车载激光 语义分割 域自适应 增量学习 计算机视觉
在线阅读 下载PDF
视觉感知中的小样本类别增量学习技术
13
作者 冉航 乔树山 《智能感知工程》 2025年第1期49-67,共19页
在动态开放环境中,智能视觉感知系统需要不断学习新类别的视觉概念。然而,由于数据获取和标注成本高昂,新类别通常仅有少量标注样本可用。小样本类别增量学习(Few-shot Class-incremental Learning,FSCIL)旨在解决这一问题,使模型在有... 在动态开放环境中,智能视觉感知系统需要不断学习新类别的视觉概念。然而,由于数据获取和标注成本高昂,新类别通常仅有少量标注样本可用。小样本类别增量学习(Few-shot Class-incremental Learning,FSCIL)旨在解决这一问题,使模型在有限的新类别样本下实现高效学习,同时避免对旧类别的遗忘。首先,概述小样本类别增量学习技术,强调其在智能感知领域的应用潜力;其次,根据不同视觉任务,对主流方法及其性能表现进行系统性分析;最后,对该领域的未来发展趋势进行展望,包括理论技术发展和问题设置扩展等。 展开更多
关键词 小样本类别增量学习 视觉感知 增量小样本分割 增量小样本目标检测
在线阅读 下载PDF
基于YOLOv8的实例分割增量学习方法
14
作者 卓运航 朱秋煜 《计量与测试技术》 2025年第11期51-57,共7页
为解决在不断引入新类别任务时模型遗忘旧类别知识的问题,本文提出一种基于YOLOv8的实例分割类增量学习方法,提升新旧类别权重的平衡性。同时,为缓解灾难性遗忘,设计了多种知识蒸馏策略,包括类别概率蒸馏、回归框蒸馏、原型掩码蒸馏及... 为解决在不断引入新类别任务时模型遗忘旧类别知识的问题,本文提出一种基于YOLOv8的实例分割类增量学习方法,提升新旧类别权重的平衡性。同时,为缓解灾难性遗忘,设计了多种知识蒸馏策略,包括类别概率蒸馏、回归框蒸馏、原型掩码蒸馏及掩码系数蒸馏,从而有效保留旧知识。此外,结合记忆回放与伪掩码生成策略进一步增强模型对旧类别的保持能力。结果表明,该方法能在保持旧类识别能力的同时,具备良好的新类适应性能。 展开更多
关键词 实例分割 类增量学习 YOLOv8 知识蒸馏
在线阅读 下载PDF
Component recognition of ISAR targets via multimodal feature fusion
15
作者 Chenxuan LI Weigang ZHU +2 位作者 Wei QU Fanyin MA Rundong WANG 《Chinese Journal of Aeronautics》 2025年第2期256-273,共18页
Inverse Synthetic Aperture Radar(ISAR)images of complex targets have a low Signal-to-Noise Ratio(SNR)and contain fuzzy edges and large differences in scattering intensity,which limits the recognition performance of IS... Inverse Synthetic Aperture Radar(ISAR)images of complex targets have a low Signal-to-Noise Ratio(SNR)and contain fuzzy edges and large differences in scattering intensity,which limits the recognition performance of ISAR systems.Also,data scarcity poses a greater challenge to the accurate recognition of components.To address the issues of component recognition in complex ISAR targets,this paper adopts semantic segmentation and proposes a few-shot semantic segmentation framework fusing multimodal features.The scarcity of available data is mitigated by using a two-branch scattering feature encoding structure.Then,the high-resolution features are obtained by fusing the ISAR image texture features and scattering quantization information of complex-valued echoes,thereby achieving significantly higher structural adaptability.Meanwhile,the scattering trait enhancement module and the statistical quantification module are designed.The edge texture is enhanced based on the scatter quantization property,which alleviates the segmentation challenge of edge blurring under low SNR conditions.The coupling of query/support samples is enhanced through four-dimensional convolution.Additionally,to overcome fusion challenges caused by information differences,multimodal feature fusion is guided by equilibrium comprehension loss.In this way,the performance potential of the fusion framework is fully unleashed,and the decision risk is effectively reduced.Experiments demonstrate the great advantages of the proposed framework in multimodal feature fusion,and it still exhibits great component segmentation capability under low SNR/edge blurring conditions. 展开更多
关键词 few-shot Semantic segmentation Inverse Synthetic Aperture Radar(ISAR) SCATTERING Multimodal fusion
原文传递
使用动态盘码的深度交互式分割方法
16
作者 杨亮喆 彭双 +2 位作者 杜春 黄亚哲 陈浩 《小型微型计算机系统》 北大核心 2025年第3期636-644,共9页
目前,主流的基于点击的交互式分割方法对所有用户点击进行无差异的编码.这样的编码方法意味着用户的交互只能给神经网络提供目标的位置信息,且每次点击的影响力是相同的.然而,不同阶段的点击的影响力是不同的.早期的交互用于目标轮廓的... 目前,主流的基于点击的交互式分割方法对所有用户点击进行无差异的编码.这样的编码方法意味着用户的交互只能给神经网络提供目标的位置信息,且每次点击的影响力是相同的.然而,不同阶段的点击的影响力是不同的.早期的交互用于目标轮廓的选择,中后期的交互则偏向于对分割结果的局部细节进行微调.因此,应该适当扩大早期点击的影响力,以便更快地获得目标轮廓,同时削弱中后期点击的影响力,以防止因为超调或歧义而影响交互式分割的收敛性.1)本文提出了一种动态盘码(Dynamic Disk Coding,DDC)算法,该算法将用户的每个点击都编码成一个特定半径的圆盘,以此添加关于点击影响力的先验信息;2)本文提出了一个交互式分割网络DDC-Net,通过交互信息预处理模块加强交互信息,并在分割网络的浅层和深层将交互式信息与语义信息进行混合,缓解交互信息随着网络加深而逐渐衰减的问题;3)本文提出了一种改进的模拟训练策略,使得网络在训练时能够充分学习不同编码半径的点击所具备的不同影响力,从而使得提出的方法兼顾收敛速度和收敛性.通过实验表明,本文提出的使用动态盘码的深度交互式分割方法具有科学性和有效性,相较于基线方法,和分别平均取得3.63%和2.44%的提升. 展开更多
关键词 交互式分割 深度学习 动态盘码 增量学习 特征融合技术
在线阅读 下载PDF
基于增量学习的结直肠息肉分割方法
17
作者 逯暄 景路琪 彭甫镕 《计算机工程》 北大核心 2025年第7期284-293,共10页
结直肠内窥镜图像在设备之间的特征分布不同,导致训练模型在新设备上的分割性能降低。为缓解模型对新设备的适应性问题,提出一种基于增量学习的微调方法,以及一种改进的结直肠息肉分割网络CPSegNet。增量学习方法包含预训练和新设备微调... 结直肠内窥镜图像在设备之间的特征分布不同,导致训练模型在新设备上的分割性能降低。为缓解模型对新设备的适应性问题,提出一种基于增量学习的微调方法,以及一种改进的结直肠息肉分割网络CPSegNet。增量学习方法包含预训练和新设备微调2个阶段,预训练使用旧设备的数据对息肉分割网络进行充分训练,微调阶段同时使用新旧设备样本进行训练,并通过采样率和正则化损失函数防止出现灾难性遗忘现象。CPSegNet采用MiT的预训练模型作为骨干网络,多层感知机(MLP)作为解码模块,不确定区域注意力(URA)作为细化模块,对边界模糊区域进行优化。为了验证学习策略对新设备的适应能力,采用Kvasir-SEG、CVC-ClinicDB、CVC-300、CVC-ColonDB、Kvasir-Sessile和ETIS-LaribPolypDB共6个数据集进行实验,其中前2个数据集为训练集,其余4个为新设备的模拟数据。以Dice相似系数和交并比(IoU)作为评价指标。实验结果表明,在无增量学习情况下CPSegNet在新设备上的性能优于主流的算法,以Kvasir-SEG作为源域数据集,将较难分割的ETIS-LaribPolypDB作为目标域数据集时,与ColonFormer算法相比的Dice相似系数提升3百分点,以CVC-ClinicDB作为源域数据集时,提升了6百分点,使用增量学习后CPSegNet和主流算法都能在新设备上获得性能提升,同时保持在旧设备上的分割精度。 展开更多
关键词 息肉分割 增量学习 迁移学习 少样本学习 灾难性遗忘
在线阅读 下载PDF
Distilling base-and-meta network with contrastive learning for few-shot semantic segmentation
18
作者 Xinyue Chen Yueyi Wang +1 位作者 Yingyue Xu Miaojing Shi 《Autonomous Intelligent Systems》 EI 2023年第1期1-11,共11页
Current studies in few-shot semantic segmentation mostly utilize meta-learning frameworks to obtain models that can be generalized to new categories.However,these models trained on base classes with sufficient annotat... Current studies in few-shot semantic segmentation mostly utilize meta-learning frameworks to obtain models that can be generalized to new categories.However,these models trained on base classes with sufficient annotated samples are biased towards these base classes,which results in semantic confusion and ambiguity between base classes and new classes.A strategy is to use an additional base learner to recognize the objects of base classes and then refine the prediction results output by the meta learner.In this way,the interaction between these two learners and the way of combining results from the two learners are important.This paper proposes a new model,namely Distilling Base and Meta(DBAM)network by using self-attention mechanism and contrastive learning to enhance the few-shot segmentation performance.First,the self-attention-based ensemble module(SEM)is proposed to produce a more accurate adjustment factor for improving the fusion of two predictions of the two learners.Second,the prototype feature optimization module(PFOM)is proposed to provide an interaction between the two learners,which enhances the ability to distinguish the base classes from the target class by introducing contrastive learning loss.Extensive experiments have demonstrated that our method improves on the PASCAL-5i under 1-shot and 5-shot settings,respectively. 展开更多
关键词 Semantic segmentation few-shot learning Meta learning Contrastive learning Self-attention
原文传递
鞋样精准级放的主要影响因素研究
19
作者 徐茂松 王维君 +2 位作者 陶然 向为涛 蒙嘉瑞 《西部皮革》 2025年第10期5-7,共3页
基于提升鞋样级放精度的目的,通过分析鞋号与鞋楦标准体系、级放方向控制原则及分段方式等关键技术要素,研究了鞋样级放过程中的参数设定与实际应用问题。研究认为,鞋样级放需结合脚型规律、材料特性及供应链协同等因素,灵活采用长度与... 基于提升鞋样级放精度的目的,通过分析鞋号与鞋楦标准体系、级放方向控制原则及分段方式等关键技术要素,研究了鞋样级放过程中的参数设定与实际应用问题。研究认为,鞋样级放需结合脚型规律、材料特性及供应链协同等因素,灵活采用长度与宽度等差、级放方向匹配、号码分段、距离分段及局部独立参数等策略。 展开更多
关键词 鞋样 级放 等差 方向 分段 鞋跟 楦头型 材料延伸性
在线阅读 下载PDF
开阔海域环境锚碇区钢箱梁顶推施工技术
20
作者 谢书良 吴育剑 +2 位作者 吴琦 吴聪 朱书敏 《城市道桥与防洪》 2025年第6期242-247,共6页
针对海域环境下超重超高钢箱梁施工,通常采用“大节段吊装整体顶推”“小节段吊装滑移”和“超大型浮吊原位架设”等方案进行施工,结合深中通道工程伶仃洋大桥引桥钢箱梁施工,在保证施工进度的前提下,同时为克服锚碇区域场地受限、大型... 针对海域环境下超重超高钢箱梁施工,通常采用“大节段吊装整体顶推”“小节段吊装滑移”和“超大型浮吊原位架设”等方案进行施工,结合深中通道工程伶仃洋大桥引桥钢箱梁施工,在保证施工进度的前提下,同时为克服锚碇区域场地受限、大型浮吊档期难以确定,租赁费用高等缺点,在以上方案的基础上通过多次比选及优化,最终确定采用了“整跨吊装+临时支墩+前、后钢导梁+步履式千斤顶+智能化监控”的大节段整体顶推技术,在国内首次实现了恶劣海况条件下高墩、大跨、超重钢箱梁多点同步连续顶推,智能化联动监控。保证了箱梁顶推过程中梁段线形、应力应变可控,安全、高效地完成了锚碇上方区域钢箱梁的安装。 展开更多
关键词 开阔海域环境 锚碇区域 钢箱梁顶推 大节段吊装
在线阅读 下载PDF
上一页 1 2 4 下一页 到第
使用帮助 返回顶部