期刊文献+
共找到65,691篇文章
< 1 2 250 >
每页显示 20 50 100
基于SAM的水陆两栖环境感知微调策略与应用
1
作者 左哲 蓝鸿 +1 位作者 覃卫 王坤 《北京理工大学学报》 北大核心 2026年第1期20-28,共9页
针对水陆两栖无人平台在不确定环境中面临的高误报率及多感知任务整合困难的问题,本研究提出了一种基于分割一切模型(segment anything model,SAM)的多模型联合环境感知方法,实现了障碍物检测与水陆域分割的统一处理.具体而言,是将U-Net... 针对水陆两栖无人平台在不确定环境中面临的高误报率及多感知任务整合困难的问题,本研究提出了一种基于分割一切模型(segment anything model,SAM)的多模型联合环境感知方法,实现了障碍物检测与水陆域分割的统一处理.具体而言,是将U-Net和YOLOv8与SAM结合,U-Net和YOLOv8负责获取目标的粗略轮廓,而SAM通过其编码−解码结构实现进一步精细分割.此外,设计了专门的微调策略以实现联合训练,进一步提升了模型的性能.本研究还构建了专有数据集USV-Dataset,并开发了数据引擎以提高标注效率.为增强模型的泛化能力,采用了4个公开数据集与USV-Dataset进行混合训练,涵盖了多样化的场景和障碍物类别.实验结果表明,该方法实现了96.8%的mPA分割精度和10 FPS的推理速度,展现出良好的泛化能力,能够满足中低速两栖无人平台的实时环境感知需求. 展开更多
关键词 水陆两栖平台 环境感知 sam 多模型融合
在线阅读 下载PDF
基于MedSAM的高效半监督医学图像病灶分割方法
2
作者 贾熹滨 尹训洁 +1 位作者 范超 杨正汉 《东北大学学报(自然科学版)》 北大核心 2026年第1期1-10,共10页
针对半监督病灶分割中教师网络性能较差,难以指导学生网络进行有效分割的问题,本文提出一种高效的半监督医学图像病灶分割方法.该方法选用特征提取能力更强的MedSAM(medical segment anything model)作为教师网络,构建基于Mamba的轻量... 针对半监督病灶分割中教师网络性能较差,难以指导学生网络进行有效分割的问题,本文提出一种高效的半监督医学图像病灶分割方法.该方法选用特征提取能力更强的MedSAM(medical segment anything model)作为教师网络,构建基于Mamba的轻量级学生网络,通过知识蒸馏提升学生网络分割性能.针对异构网络特征对齐带来的语义失配问题,提出基于扰动一致的跨架构知识蒸馏策略,将教师特征映射到学生特征空间并对齐扰动响应,提升学生网络特征表达能力以优化分割性能.此外,针对病灶形态多样及前景背景对比度低导致的分割一致性差问题,提出基于分布的自监督损失进行优化.在多类医学图像病灶分割数据集上的实验表明,本文方法的分割性能优于现有方法,同时学生网络参数量仅为1.34 M,显著提升了模型效率. 展开更多
关键词 病灶分割 Medsam Mamba 知识蒸馏 自监督损失
在线阅读 下载PDF
基于SAM多尺度标签优化的半监督学习遥感目标检测 被引量:1
3
作者 周洁 方振宇 《微电子学与计算机》 2026年第1期65-74,共10页
针对遥感图像中目标分辨率低、背景复杂且获取高质量旋转框标注费用高、耗时长等问题,提出了一种多尺度标签优化的半监督学习遥感目标检测方法。该方法使用SoftTeacher模型能够充分利用大量未标注且多样化的数据,同时还能发现原始数据... 针对遥感图像中目标分辨率低、背景复杂且获取高质量旋转框标注费用高、耗时长等问题,提出了一种多尺度标签优化的半监督学习遥感目标检测方法。该方法使用SoftTeacher模型能够充分利用大量未标注且多样化的数据,同时还能发现原始数据集中未标注的目标;借助SAM(Segment Anything Model)模型可实现基于深度学习的图像分割,并通过基于掩码的优化生成高质量的标签。通过半监督学习生成伪标注,对伪标注中的标签特征框进行多尺度处理后输入SAM模型进行优化,使用优化后的标注扩充原数据集样本重新用于全监督训练。实验结果表明:所选用的半监督目标检测模型SoftTeacher能够展现出优于全监督目标检测模型的性能,经过优化后的数据集样本能够展现相比原本伪标注数据集更精确的效果。在使用扩充后的数据集进行全监督训练时,原先的平均精度均值(mean Average Precision, mAP, mAP)从51.4%提升到53.5%。此外,全监督训练阶段使用现有的常用目标检测器进行了对比实验,进一步验证了所提方法可以有效提高遥感目标检测在标注不足情况下的准确性。 展开更多
关键词 遥感图像 半监督学习 sam 图像分割
在线阅读 下载PDF
Pre-trained SAM as data augmentation for image segmentation 被引量:1
4
作者 Junjun Wu Yunbo Rao +1 位作者 Shaoning Zeng Bob Zhang 《CAAI Transactions on Intelligence Technology》 2025年第1期268-282,共15页
Data augmentation plays an important role in training deep neural model by expanding the size and diversity of the dataset.Initially,data augmentation mainly involved some simple transformations of images.Later,in ord... Data augmentation plays an important role in training deep neural model by expanding the size and diversity of the dataset.Initially,data augmentation mainly involved some simple transformations of images.Later,in order to increase the diversity and complexity of data,more advanced methods appeared and evolved to sophisticated generative models.However,these methods required a mass of computation of training or searching.In this paper,a novel training-free method that utilises the Pre-Trained Segment Anything Model(SAM)model as a data augmentation tool(PTSAM-DA)is proposed to generate the augmented annotations for images.Without the need for training,it obtains prompt boxes from the original annotations and then feeds the boxes to the pre-trained SAM to generate diverse and improved annotations.In this way,annotations are augmented more ingenious than simple manipulations without incurring huge computation for training a data augmentation model.Multiple comparative experiments on three datasets are conducted,including an in-house dataset,ADE20K and COCO2017.On this in-house dataset,namely Agricultural Plot Segmentation Dataset,maximum improvements of 3.77%and 8.92%are gained in two mainstream metrics,mIoU and mAcc,respectively.Consequently,large vision models like SAM are proven to be promising not only in image segmentation but also in data augmentation. 展开更多
关键词 data augmentation image segmentation large model segment anything model
在线阅读 下载PDF
从通用分割到专用化建筑物提取——SAM在高分遥感影像中的优化策略研究
5
作者 陈秀秀 金永胜 +1 位作者 叶建生 方雷 《中国图象图形学报》 北大核心 2026年第2期642-656,共15页
目的 针对传统高分辨率影像建筑物提取方法的精度瓶颈,SAM(segment anything model)模型虽然具有分割优势,却因训练域差异和人工提示依赖,无法直接应用于大规模遥感影像的自动化提取。为此,提出一种无提示—判别联合模型(SAM-Classifie... 目的 针对传统高分辨率影像建筑物提取方法的精度瓶颈,SAM(segment anything model)模型虽然具有分割优势,却因训练域差异和人工提示依赖,无法直接应用于大规模遥感影像的自动化提取。为此,提出一种无提示—判别联合模型(SAM-Classifier),实现了通用视觉模型向遥感场景的迁移,完成了建筑物的自动化高效提取。方法 本研究采用了一系列实验来系统探究不同提示方式(包括点提示、框提示和掩码提示)在SAM模型指导下的建筑物提取效果,并引入一个无需提示的联合模型——SAM-Classifier,以克服传统SAM模型在语义理解和提示依赖方面的限制。实验基于3个公开可用的数据集进行,以全面评估各种提示策略下SAM模型的表现。此外,为了比较不同解决方案在建筑物提取任务中的性能差异,还特别设计了对比实验,将SAM模型及SAMClassifier的结果与商汤科技开发的遥感大模型(Sense Earth 3.0)进行了详细的对比分析。结果 实验表明,框提示引导下的SAM分割表现最优(WHU数据集F1分数0.945);所提出的SAM-Classifier无需人工提示,Ma数据集F1分数0.717,与对比的先进方法性能相近。结论 本文提出SAM-Classifier,通过融合轻量级分类器实现无需提示的端到端建筑物提取,有效缓解了SAM的语义理解不足与提示依赖问题,为遥感影像的自动化解译提供了新方案。 展开更多
关键词 图像分割 高分辨率影像 建筑物提取 sam(segment anything model) 提示分割 优化策略
原文传递
A medical image segmentation model based on SAM with an integrated local multi-scale feature encoder
6
作者 DI Jing ZHU Yunlong LIANG Chan 《Journal of Measurement Science and Instrumentation》 2025年第3期359-370,共12页
Despite its remarkable performance on natural images,the segment anything model(SAM)lacks domain-specific information in medical imaging.and faces the challenge of losing local multi-scale information in the encoding ... Despite its remarkable performance on natural images,the segment anything model(SAM)lacks domain-specific information in medical imaging.and faces the challenge of losing local multi-scale information in the encoding phase.This paper presents a medical image segmentation model based on SAM with a local multi-scale feature encoder(LMSFE-SAM)to address the issues above.Firstly,based on the SAM,a local multi-scale feature encoder is introduced to improve the representation of features within local receptive field,thereby supplying the Vision Transformer(ViT)branch in SAM with enriched local multi-scale contextual information.At the same time,a multiaxial Hadamard product module(MHPM)is incorporated into the local multi-scale feature encoder in a lightweight manner to reduce the quadratic complexity and noise interference.Subsequently,a cross-branch balancing adapter is designed to balance the local and global information between the local multi-scale feature encoder and the ViT encoder in SAM.Finally,to obtain smaller input image size and to mitigate overlapping in patch embeddings,the size of the input image is reduced from 1024×1024 pixels to 256×256 pixels,and a multidimensional information adaptation component is developed,which includes feature adapters,position adapters,and channel-spatial adapters.This component effectively integrates the information from small-sized medical images into SAM,enhancing its suitability for clinical deployment.The proposed model demonstrates an average enhancement ranging from 0.0387 to 0.3191 across six objective evaluation metrics on BUSI,DDTI,and TN3K datasets compared to eight other representative image segmentation models.This significantly enhances the performance of the SAM on medical images,providing clinicians with a powerful tool in clinical diagnosis. 展开更多
关键词 segment anything model(sam) medical image segmentation ENCODER decoder multiaxial Hadamard product module(MHPM) cross-branch balancing adapter
在线阅读 下载PDF
CableSAM:an efficient automatic segmentation method for aircraft cabin cables
7
作者 LING Aihua WANG Junwen +1 位作者 LU Jiaming LIU Ruyu 《Optoelectronics Letters》 2025年第3期183-187,共5页
Cabin cables,as critical components of an aircraft's electrical system,significantly impact the operational efficiency and safety of the aircraft.The existing cable segmentation methods in civil aviation cabins ar... Cabin cables,as critical components of an aircraft's electrical system,significantly impact the operational efficiency and safety of the aircraft.The existing cable segmentation methods in civil aviation cabins are limited,especially in automation,heavily dependent on large amounts of data and resources,lacking the flexibility to adapt to different scenarios.To address these challenges,this paper introduces a novel image segmentation model,CableSAM,specifically designed for automated segmentation of cabin cables.CableSAM improves segmentation efficiency and accuracy using knowledge distillation and employs a context ensemble strategy.It accurately segments cables in various scenarios with minimal input prompts.Comparative experiments on three cable datasets demonstrate that CableSAM surpasses other advanced cable segmentation methods in performance. 展开更多
关键词 image segmentation aircraft cabin automatic segmentation automated segmentation cabin cablesas civil aviation cabins cable segmentation knowledge distillation
原文传递
CW-HRNet:Constrained Deformable Sampling and Wavelet-Guided Enhancement for Lightweight Crack Segmentation
8
作者 Dewang Ma 《Journal of Electronic Research and Application》 2025年第5期269-280,共12页
This paper presents CW-HRNet,a high-resolution,lightweight crack segmentation network designed to address challenges in complex scenes with slender,deformable,and blurred crack structures.The model incorporates two ke... This paper presents CW-HRNet,a high-resolution,lightweight crack segmentation network designed to address challenges in complex scenes with slender,deformable,and blurred crack structures.The model incorporates two key modules:Constrained Deformable Convolution(CDC),which stabilizes geometric alignment by applying a tanh limiter and learnable scaling factor to the predicted offsets,and the Wavelet Frequency Enhancement Module(WFEM),which decomposes features using Haar wavelets to preserve low-frequency structures while enhancing high-frequency boundaries and textures.Evaluations on the CrackSeg9k benchmark demonstrate CW-HRNet’s superior performance,achieving 82.39%mIoU with only 7.49M parameters and 10.34 GFLOPs,outperforming HrSegNet-B48 by 1.83% in segmentation accuracy with minimal complexity overhead.The model also shows strong cross-dataset generalization,achieving 60.01%mIoU and 66.22%F1 on Asphalt3k without fine-tuning.These results highlight CW-HRNet’s favorable accuracyefficiency trade-off for real-world crack segmentation tasks. 展开更多
关键词 Crack segmentation Lightweight semantic segmentation Deformable convolution Wavelet transform Road infrastructure
在线阅读 下载PDF
Precision organoid segmentation technique(POST):accurate organoid segmentation in challenging bright-field images 被引量:1
9
作者 Xuan Du Yuchen Li +5 位作者 Jiaping Song Zilin Zhang Jing Zhang Yanhui Li Zaozao Chen Zhongze Gu 《Bio-Design and Manufacturing》 2026年第1期80-93,I0013-I0016,共18页
Organoids possess immense potential for unraveling the intricate functions of human tissues and facilitating preclinical disease treatment.Their applications span from high-throughput drug screening to the modeling of... Organoids possess immense potential for unraveling the intricate functions of human tissues and facilitating preclinical disease treatment.Their applications span from high-throughput drug screening to the modeling of complex diseases,with some even achieving clinical translation.Changes in the overall size,shape,boundary,and other morphological features of organoids provide a noninvasive method for assessing organoid drug sensitivity.However,the precise segmentation of organoids in bright-field microscopy images is made difficult by the complexity of the organoid morphology and interference,including overlapping organoids,bubbles,dust particles,and cell fragments.This paper introduces the precision organoid segmentation technique(POST),which is a deep-learning algorithm for segmenting challenging organoids under simple bright-field imaging conditions.Unlike existing methods,POST accurately segments each organoid and eliminates various artifacts encountered during organoid culturing and imaging.Furthermore,it is sensitive to and aligns with measurements of organoid activity in drug sensitivity experiments.POST is expected to be a valuable tool for drug screening using organoids owing to its capability of automatically and rapidly eliminating interfering substances and thereby streamlining the organoid analysis and drug screening process. 展开更多
关键词 Organoid Drug screening Deep learning Image segmentation
暂未订购
基于UAV图像和SAM弱监督学习的黑土区保护性耕作玉米秸秆识别方法
10
作者 赵丽华 张超 +4 位作者 王贝贝 陈畅 武亚楠 杨翠翠 李媛媛 《农业机械学报》 北大核心 2026年第3期87-96,共10页
秸秆覆盖还田是黑土区保护性耕作的重要手段,秸秆识别对于保护性耕作实施效果评估和农业管理决策具有重要意义。针对全监督深度学习秸秆遥感识别方法依赖大量像素级标注标签数据问题,提出一种基于无人机(UAV)图像和Segment anything mod... 秸秆覆盖还田是黑土区保护性耕作的重要手段,秸秆识别对于保护性耕作实施效果评估和农业管理决策具有重要意义。针对全监督深度学习秸秆遥感识别方法依赖大量像素级标注标签数据问题,提出一种基于无人机(UAV)图像和Segment anything model(SAM)的弱监督学习秸秆遥感识别方法。通过Adapter和联合损失函数对SAM进行微调,并利用边界框弱标注生成高质量伪标签,最终训练改进的U-Net分割网络实现秸秆识别。以吉林省梨树县玉米保护性耕作区为研究区进行秸秆提取试验,试验结果表明,微调后SAM的平均交并比和F1分数分别达到81.04%和87.85%,显著优于未微调模型;SAM弱监督结合改进U-Net的模型性能高于其他分割方法,F1分数为90.6%;消融试验验证了联合损失函数和卷积模块可有效提升模型性能。本文为黑土区玉米保护性耕作秸秆遥感识别提供了一种高效、低成本的解决方案。 展开更多
关键词 玉米秸秆识别 无人机图像 保护性耕作 弱监督学习 sam 语义分割
在线阅读 下载PDF
MicroFlowSAM:A motion-prompted instance segmentation approach in microfluidics with zero annotation and training
11
作者 Wenle Xu Lin Sheng +2 位作者 Tong Qiu Kai Wang Guangsheng Luo 《Chinese Journal of Chemical Engineering》 2025年第11期103-114,共12页
Microdispersion technology is crucial for a variety of applications in both the chemical and biomedical fields.The precise and rapid characterization of microdroplets and microbubbles is essential for research as well... Microdispersion technology is crucial for a variety of applications in both the chemical and biomedical fields.The precise and rapid characterization of microdroplets and microbubbles is essential for research as well as for optimizing and controlling industrial processes.Traditional methods often rely on time-consuming manual analysis.Although some deep learning-based computer vision methods have been proposed for automated identification and characterization,these approaches often rely on supervised learning,which requires labeled data for model training.This dependency on labeled data can be time-consuming and expensive,especially when working with large and complex datasets.To address these challenges,we propose Micro Flow SAM,an innovative,motion-prompted,annotation-free,and training-free instance segmentation approach.By utilizing motion of microdroplets and microbubbles as prompts,our method directs large-scale vision models to perform accurate instance segmentation without the need for annotated data or model training.This approach eliminates the need for human intervention in data labeling and reduces computational costs,significantly streamlining the data analysis process.We demonstrate the effectiveness of Micro Flow SAM across 12 diverse datasets,achieving outstanding segmentation results that are competitive with traditional methods.This novel approach not only accelerates the analysis process but also establishes a foundation for efficient process control and optimization in microfluidic applications.Micro Flow SAM represents a breakthrough in reducing the complexities and resource demands of instance segmentation,enabling faster insights and advancements in the microdispersion field. 展开更多
关键词 MICROFLUIDICS Microdispersion Instance segmentation Large vision model Prompt engineering
在线阅读 下载PDF
MSAMamba-UNet:A Lightweight Multi-Scale Adaptive Mamba Network for Skin Lesion Segmentation
12
作者 Shouming Hou Jianchao Hou +2 位作者 Yuteng Pang Aoyu Xia Beibei Hou 《Journal of Bionic Engineering》 2025年第6期3209-3225,共17页
Segmenting skin lesions is critical for early skin cancer detection.Existing CNN and Transformer-based methods face challenges such as high computational complexity and limited adaptability to variations in lesion siz... Segmenting skin lesions is critical for early skin cancer detection.Existing CNN and Transformer-based methods face challenges such as high computational complexity and limited adaptability to variations in lesion sizes.To overcome these limitations,we introduce MSAMamba-UNet,a lightweight model that integrates two novel architectures:Multi-Scale Mamba(MSMamba)and Adaptive Dynamic Gating Block(ADGB).MSMamba utilizes multi-scale decomposition and a parallel hierarchical structure to enhance the delineation of irregular lesion boundaries and sensitivity to small targets.ADGB dynamically selects convolutional kernels with varying receptive fields based on input features,improving the model’s capacity to accommodate diverse lesion textures and scales.Additionally,we introduce a Mix Attention Fusion Block(MAF)to enhance shallow feature representation by integrating parallel channel and pixel attention mechanisms.Extensive evaluation of MSAMamba-UNet on the ISIC 2016,ISIC 2017,and ISIC 2018 datasets demonstrates competitive segmentation accuracy with only 0.056 M parameters and 0.069 GFLOPs.Our experiments revealed that MSAMamba-UNet achieved IoU scores of 85.53%,85.47%,and 82.22%,as well as DSC scores of 92.20%,92.17%,and 90.24%,respectively.These results underscore the lightweight design and effectiveness of MSAMamba-UNet. 展开更多
关键词 TRANSFORMER segmenting skin lesions Mamba Lightweight model MULTI-SCALE
在线阅读 下载PDF
Predictive Value of 3D Radiological Segmentation and Anatomical Parameters for Cochlear Implantation Electrode Insertion Depth Based on a Large Sample of Patients with Inner Ear Malformations
13
作者 Shujin Xue Xingmei Wei +4 位作者 Ying Kong Biao Chen Zhencheng Gao Chunling Ma Yongxin Li 《Journal of Otology》 2025年第4期259-267,共9页
Objective:The aims of this study were to investigate the clinical applicability of 3D segmentation in measuring cochlear anatomical parameters,explore factors that influence the insertion angle of cochlear implant ele... Objective:The aims of this study were to investigate the clinical applicability of 3D segmentation in measuring cochlear anatomical parameters,explore factors that influence the insertion angle of cochlear implant electrodes in patients with inner ear malformations,and determine the value of 3D segmentation in predicting cochlear implant electrode insertion depth by simulating electrode implantation in a reconstructed 3D model.Methods:Data from 208 temporal bone CT scans of patients with a variety of inner ear malformations(including the CH,IP-Ⅰ,IP-Ⅱ,and IP-Ⅲtypes)who underwent cochlear implantation at our center were retrospectively analyzed.Preoperative temporal bone CT data were subjected to three-dimensional(3D)segmentation of the cochlea with a 3D slicer.Results:Cochlear malformation types,including IP typesⅠ(42 ears),Ⅱ(278ears),Ⅲ(20 ears),and CH(65 ears),were diagnosed and measured in 208 preoperative CT datasets.Cochlear anatomical parameters and electrode length were correlated,which partially explained the variations in electrode insertion angle.The mean angle of implantation among the enrolled patients was 564.33°,and the mean implantation angle prediction error in the 3D segmentation was|23.74|°.Conclusion:Three-dimensional segmentation from temporal bone CT is valuable for surgeons,especially in treating patients with inner ear malformation.Such insights will help surgeons understand overall anatomical variations,predict electrode implantation depth,and complete preoperative imaging assessments for cochlear implant insertion depth in patients with inner ear malformations. 展开更多
关键词 Inner ear malformation Cochlear implant Temporal bone CT Three-dimensional segmentation
暂未订购
Detection of co-phasing error in segmented mirror based on extended Young’s interferometry combined with Vision Transformer
14
作者 LIU Yin-ling YAO Chi +3 位作者 OUYANG Shang-tao WAN Yi-rong CHEN Mo LI Bin 《中国光学(中英文)》 北大核心 2026年第1期205-218,共14页
Due to the inability of manufacturing a single monolithic mirror at the 10-meter scales,segmented mirrors have become indispensable tools in modern astronomical research.However,to match the imaging performance of the... Due to the inability of manufacturing a single monolithic mirror at the 10-meter scales,segmented mirrors have become indispensable tools in modern astronomical research.However,to match the imaging performance of the monolithic counterpart,the sub-mirrors must maintain precise co-phasing.Piston error critically degrades segmented mirror imaging quality,necessitating efficient and precise detection.To ad-dress the limitations that the conventional circular-aperture diffraction with two-wavelength algorithm is sus-ceptible to decentration errors,and the traditional convolutional neural networks(CNNs)struggle to capture global features under large-range piston errors due to their restricted local receptive fields,this paper pro-poses a method that integrates extended Young’s interference principles with a Vision Transformer(ViT)to detect piston error.By suppressing decentration error interference through two symmetrically arranged aper-tures and extending the measurement range to±7.95μm via a two-wavelength(589 nm/600 nm)algorithm.This approach exploits ViT’s self-attention mechanism to model global characteristics of interference fringes.Unlike CNNs constrained by local convolutional kernels,the ViT significantly improves sensitivity to inter-ferogram periodicity.The simulation results demonstrate that the proposed method achieves a measurement accuracy of 5 nm(0.0083λ0)across the range of±7.95μm,while maintaining an accuracy exceeding 95%in the presence of Gaussian noise(SNR≥15 dB),Poisson noise(λ≥9 photons/pixel),and sub-mirror gap er-ror(Egap≤0.2)interference.Moreover,the detection speed shows significant improvement compared to the cross-correlation algorithm.This study establishes an accurate,robust framework for segmented mirror error detection,advancing high-precision astronomical observation. 展开更多
关键词 segmented mirror co-phasing piston errors ViT Young’s interference principles
在线阅读 下载PDF
How precise is precise enough?Tree crown segmentation using high resolution close-up multispectral UAV images and its effect on NDVI accuracy in Fraxinus excelsior L.trees
15
作者 Lisa Buchner Anna-Katharina Eisen Susanne Jochner-Oette 《Journal of Forestry Research》 2026年第2期16-30,共15页
Detailed individual tree crown segmentation is highly relevant for the detection and monitoring of Fraxinus excelsior L.trees affected by ash dieback,a major threat to common ash populations across Europe.In this stud... Detailed individual tree crown segmentation is highly relevant for the detection and monitoring of Fraxinus excelsior L.trees affected by ash dieback,a major threat to common ash populations across Europe.In this study,both fine and coarse crown segmentation methods were applied to close-range multispectral UAV imagery.The fine tree crown segmentation method utilized a novel unsupervised machine learning approach based on a blended NIR-NDVI image,whereas the coarse segmentation relied on the segment anything model(SAM).Both methods successfully delineated tree crown outlines,however,only the fine segmentation accurately captured internal canopy gaps.Despite these structural differences,mean NDVI values calculated per tree crown revealed no significant differences between the two approaches,indicating that coarse segmentation is sufficient for mean vegetation index assessments.Nevertheless,the fine segmentation revealed increased heterogeneity in NDVI values in more severely damaged trees,underscoring its value for detailed structural and health analyses.Furthermore,the fine segmentation workflow proved transferable to both individual UAV images and orthophotos from broader UAV surveys.For applications focused on structural integrity and spatial variation in canopy health,the fine segmentation approach is recommended. 展开更多
关键词 Leaf mass segmentation Machine learning segment anything model Ash dieback
在线阅读 下载PDF
An intelligent segmentation method for leakage points in central serous chorioretinopathy based on fluorescein angiography images
16
作者 Jian-Guo Xu Yong-Chi Liu +4 位作者 Fen Zhou Jian-Xin Shen Zhi-Peng Yan Xin-Ya Hu Wei-Hua Yang 《International Journal of Ophthalmology(English edition)》 2026年第3期421-433,共13页
AIM:To construct an intelligent segmentation scheme for precise localization of central serous chorioretinopathy(CSC)leakage points,thereby enabling ophthalmologists to deliver accurate laser treatment without navigat... AIM:To construct an intelligent segmentation scheme for precise localization of central serous chorioretinopathy(CSC)leakage points,thereby enabling ophthalmologists to deliver accurate laser treatment without navigational laser equipment.METHODS:A dataset with dual labels(point-level and pixel-level)was first established based on fundus fluorescein angiography(FFA)images of CSC and subsequently divided into training(102 images),validation(40 images),and test(40 images)datasets.An intelligent segmentation method was then developed,based on the You Only Look Once version 8 Pose Estimation(YOLOv8-Pose)model and segment anything model(SAM),to segment CSC leakage points.Next,the YOLOv8-Pose model was trained for 200 epochs,and the best-performing model was selected to form the optimal combination with SAM.Additionally,the classic five types of U-Net series models[i.e.,U-Net,recurrent residual U-Net(R2U-Net),attention U-Net(AttU-Net),recurrent residual attention U-Net(R2AttUNet),and nested U-Net(UNet^(++))]were initialized with three random seeds and trained for 200 epochs,resulting in a total of 15 baseline models for comparison.Finally,based on the metrics including Dice similarity coefficient(DICE),intersection over union(IoU),precision,recall,precisionrecall(PR)curve,and receiver operating characteristic(ROC)curve,the proposed method was compared with baseline models through quantitative and qualitative experiments for leakage point segmentation,thereby demonstrating its effectiveness.RESULTS:With the increase of training epochs,the mAP50-95,Recall,and precision of the YOLOv8-Pose model showed a significant increase and tended to stabilize,and it achieved a preliminary localization success rate of 90%(i.e.,36 images)for CSC leakage points in 40 test images.Using manually expert-annotated pixel-level labels as the ground truth,the proposed method achieved outcomes with a DICE of 57.13%,an IoU of 45.31%,a precision of 45.91%,a recall of 93.57%,an area under the PR curve(AUC-PR)of 0.78 and an area under the ROC curve(AUC-ROC)of 0.97,which enables more accurate segmentation of CSC leakage points.CONCLUSION:By combining the precise localization capability of the YOLOv8-Pose model with the robust and flexible segmentation ability of SAM,the proposed method not only demonstrates the effectiveness of the YOLOv8-Pose model in detecting keypoint coordinates of CSC leakage points from the perspective of application innovation but also establishes a novel approach for accurate segmentation of CSC leakage points through the“detect-then-segment”strategy,thereby providing a potential auxiliary means for the automatic and precise realtime localization of leakage points during traditional laser photocoagulation for CSC. 展开更多
关键词 You Only Look Once version 8 Pose Estimation segment anything model central serous chorioretinopathy leakage point segmentation
原文传递
RE-UKAN:A Medical Image Segmentation Network Based on Residual Network and Efficient Local Attention
17
作者 Bo Li Jie Jia +2 位作者 Peiwen Tan Xinyan Chen Dongjin Li 《Computers, Materials & Continua》 2026年第3期2184-2200,共17页
Medical image segmentation is of critical importance in the domain of contemporary medical imaging.However,U-Net and its variants exhibit limitations in capturing complex nonlinear patterns and global contextual infor... Medical image segmentation is of critical importance in the domain of contemporary medical imaging.However,U-Net and its variants exhibit limitations in capturing complex nonlinear patterns and global contextual information.Although the subsequent U-KAN model enhances nonlinear representation capabilities,it still faces challenges such as gradient vanishing during deep network training and spatial detail loss during feature downsampling,resulting in insufficient segmentation accuracy for edge structures and minute lesions.To address these challenges,this paper proposes the RE-UKAN model,which innovatively improves upon U-KAN.Firstly,a residual network is introduced into the encoder to effectively mitigate gradient vanishing through cross-layer identity mappings,thus enhancing modelling capabilities for complex pathological structures.Secondly,Efficient Local Attention(ELA)is integrated to suppress spatial detail loss during downsampling,thereby improving the perception of edge structures and minute lesions.Experimental results on four public datasets demonstrate that RE-UKAN outperforms existing medical image segmentation methods across multiple evaluation metrics,with particularly outstanding performance on the TN-SCUI 2020 dataset,achieving IoU of 88.18%and Dice of 93.57%.Compared to the baseline model,it achieves improvements of 3.05%and 1.72%,respectively.These results fully demonstrate RE-UKAN’s superior detail retention capability and boundary recognition accuracy in complex medical image segmentation tasks,providing a reliable solution for clinical precision segmentation. 展开更多
关键词 Image segmentation U-KAN residual network ELA
在线阅读 下载PDF
Context Patch Fusion with Class Token Enhancement for Weakly Supervised Semantic Segmentation
18
作者 Yiyang Fu Hui Li Wangyu Wu 《Computer Modeling in Engineering & Sciences》 2026年第1期1130-1150,共21页
Weakly Supervised Semantic Segmentation(WSSS),which relies only on image-level labels,has attracted significant attention for its cost-effectiveness and scalability.Existing methods mainly enhance inter-class distinct... Weakly Supervised Semantic Segmentation(WSSS),which relies only on image-level labels,has attracted significant attention for its cost-effectiveness and scalability.Existing methods mainly enhance inter-class distinctions and employ data augmentation to mitigate semantic ambiguity and reduce spurious activations.However,they often neglect the complex contextual dependencies among image patches,resulting in incomplete local representations and limited segmentation accuracy.To address these issues,we propose the Context Patch Fusion with Class Token Enhancement(CPF-CTE)framework,which exploits contextual relations among patches to enrich feature repre-sentations and improve segmentation.At its core,the Contextual-Fusion Bidirectional Long Short-Term Memory(CF-BiLSTM)module captures spatial dependencies between patches and enables bidirectional information flow,yield-ing a more comprehensive understanding of spatial correlations.This strengthens feature learning and segmentation robustness.Moreover,we introduce learnable class tokens that dynamically encode and refine class-specific semantics,enhancing discriminative capability.By effectively integrating spatial and semantic cues,CPF-CTE produces richer and more accurate representations of image content.Extensive experiments on PASCAL VOC 2012 and MS COCO 2014 validate that CPF-CTE consistently surpasses prior WSSS methods. 展开更多
关键词 Weakly supervised semantic segmentation context-fusion class enhancement
在线阅读 下载PDF
Semi-Supervised Segmentation Framework for Quantitative Analysis of Material Microstructure Images
19
作者 Yingli Liu Weiyong Tang +2 位作者 Xiao Yang Jiancheng Yin Haihe Zhou 《Computers, Materials & Continua》 2026年第4期596-611,共16页
Quantitative analysis of aluminum-silicon(Al-Si)alloy microstructure is crucial for evaluating and controlling alloy performance.Conventional analysis methods rely on manual segmentation,which is inefficient and subje... Quantitative analysis of aluminum-silicon(Al-Si)alloy microstructure is crucial for evaluating and controlling alloy performance.Conventional analysis methods rely on manual segmentation,which is inefficient and subjective,while fully supervised deep learning approaches require extensive and expensive pixel-level annotated data.Furthermore,existing semi-supervised methods still face challenges in handling the adhesion of adjacent primary silicon particles and effectively utilizing consistency in unlabeled data.To address these issues,this paper proposes a novel semi-supervised framework for Al-Si alloy microstructure image segmentation.First,we introduce a Rotational Uncertainty Correction Strategy(RUCS).This strategy employs multi-angle rotational perturbations andMonte Carlo sampling to assess prediction consistency,generating a pixel-wise confidence weight map.By integrating this map into the loss function,the model dynamically focuses on high-confidence regions,thereby improving generalization ability while reducing manual annotation pressure.Second,we design a Boundary EnhancementModule(BEM)to strengthen boundary feature extraction through erosion difference and multi-scale dilated convolutions.This module guides the model to focus on the boundary regions of adjacent particles,effectively resolving particle adhesion and improving segmentation accuracy.Systematic experiments were conducted on the Aluminum-Silicon Alloy Microstructure Dataset(ASAD).Results indicate that the proposed method performs exceptionally well with scarce labeled data.Specifically,using only 5%labeled data,our method improves the Jaccard index and Adjusted Rand Index(ARI)by 2.84 and 1.57 percentage points,respectively,and reduces the Variation of Information(VI)by 8.65 compared to stateof-the-art semi-supervised models,approaching the performance levels of 10%labeled data.These results demonstrate that the proposed method significantly enhances the accuracy and robustness of quantitative microstructure analysis while reducing annotation costs. 展开更多
关键词 Microstructure alloy semi-supervised segmentation boundary enhancement variation of information
在线阅读 下载PDF
Enhanced BEV Scene Segmentation:De-Noise Channel Attention for Resource-Constrained Environments
20
作者 Argho Dey Yunfei Yin +3 位作者 Zheng Yuan ZhiwenZeng Xianjian Bao Md Minhazul Islam 《Computers, Materials & Continua》 2026年第4期2161-2180,共20页
Autonomous vehicles rely heavily on accurate and efficient scene segmentation for safe navigation and efficient operations.Traditional Bird’s Eye View(BEV)methods on semantic scene segmentation,which leverage multimo... Autonomous vehicles rely heavily on accurate and efficient scene segmentation for safe navigation and efficient operations.Traditional Bird’s Eye View(BEV)methods on semantic scene segmentation,which leverage multimodal sensor fusion,often struggle with noisy data and demand high-performance GPUs,leading to sensor misalignment and performance degradation.This paper introduces an Enhanced Channel Attention BEV(ECABEV),a novel approach designed to address the challenges under insufficient GPU memory conditions.ECABEV integrates camera and radar data through a de-noise enhanced channel attention mechanism,which utilizes global average and max pooling to effectively filter out noise while preserving discriminative features.Furthermore,an improved fusion approach is proposed to efficiently merge categorical data across modalities.To reduce computational overhead,a bilinear interpolation layer normalizationmethod is devised to ensure spatial feature fidelity.Moreover,a scalable crossentropy loss function is further designed to handle the imbalanced classes with less computational efficiency sacrifice.Extensive experiments on the nuScenes dataset demonstrate that ECABEV achieves state-of-the-art performance with an IoU of 39.961,using a lightweight ViT-B/14 backbone and lower resolution(224×224).Our approach highlights its cost-effectiveness and practical applicability,even on low-end devices.The code is publicly available at:https://github.com/YYF-CQU/ECABEV.git. 展开更多
关键词 Autonomous vehicle BEV attention mechanism sensor fusion scene segmentation
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部