期刊文献+
共找到66,341篇文章
< 1 2 250 >
每页显示 20 50 100
Multiple PointMedSAM Prompting for Enhanced Medical Image Segmentation
1
作者 Wasfieh Nazzal Ezequiel López-Rubio +1 位作者 Miguel A.Molina-Cabello Karl Thurnhofer-Hemsi 《Computers, Materials & Continua》 2026年第5期2100-2115,共16页
Automatic and accurate medical image segmentation remains a fundamental task in computer-aided diagnosis and treatment planning.Recent advances in foundation models,such as the medical-focused Segment AnythingModel(Me... Automatic and accurate medical image segmentation remains a fundamental task in computer-aided diagnosis and treatment planning.Recent advances in foundation models,such as the medical-focused Segment AnythingModel(MedSAM),have demonstrated strong performance but face challenges inmanymedical applications due to anatomical complexity and a limited domain-specific prompt.Thiswork introduces amethodology that enhances segmentation robustness and precision by automatically generating multiple informative point prompts,rather than relying on single inputs.The proposed approach randomly samples sets of spatially distributed point prompts based on image features,enabling MedSAM to better capture fine-grained anatomical structures and boundaries.During inference,probability maps are aggregated to reduce local misclassifications without additional model training.Extensive experiments on various computed tomography(CT)and magnetic resonance imaging(MRI)datasets demonstrate improvements in Dice Similarity Coefficient(DSC)and Normalized Surface Dice(NSD)metrics compared to baseline SAM and Scribble Prompt models.A semi-automatic point sampling version based on the ground truth segmentations yielded enhanced results,achieving up to 92.1%DSC and 86.6%NSD,with significant gains in delineating complex organs such as the pancreas,colon,kidney,and brain tumours.The main novelty of our method consists of effectively combining the results of multiple point prompts into the medical segmentation pipeline so that single-point prompt methods are outperformed.Overall,the proposed model offers a straightforward yet effective approach to improve medical image segmentation performance while maintaining computational efficiency. 展开更多
关键词 Medical image segmentation deep learning test-time augmentation point prompt
在线阅读 下载PDF
基于SAM的水陆两栖环境感知微调策略与应用
2
作者 左哲 蓝鸿 +1 位作者 覃卫 王坤 《北京理工大学学报》 北大核心 2026年第1期20-28,共9页
针对水陆两栖无人平台在不确定环境中面临的高误报率及多感知任务整合困难的问题,本研究提出了一种基于分割一切模型(segment anything model,SAM)的多模型联合环境感知方法,实现了障碍物检测与水陆域分割的统一处理.具体而言,是将U-Net... 针对水陆两栖无人平台在不确定环境中面临的高误报率及多感知任务整合困难的问题,本研究提出了一种基于分割一切模型(segment anything model,SAM)的多模型联合环境感知方法,实现了障碍物检测与水陆域分割的统一处理.具体而言,是将U-Net和YOLOv8与SAM结合,U-Net和YOLOv8负责获取目标的粗略轮廓,而SAM通过其编码−解码结构实现进一步精细分割.此外,设计了专门的微调策略以实现联合训练,进一步提升了模型的性能.本研究还构建了专有数据集USV-Dataset,并开发了数据引擎以提高标注效率.为增强模型的泛化能力,采用了4个公开数据集与USV-Dataset进行混合训练,涵盖了多样化的场景和障碍物类别.实验结果表明,该方法实现了96.8%的mPA分割精度和10 FPS的推理速度,展现出良好的泛化能力,能够满足中低速两栖无人平台的实时环境感知需求. 展开更多
关键词 水陆两栖平台 环境感知 sam 多模型融合
在线阅读 下载PDF
基于MedSAM的高效半监督医学图像病灶分割方法
3
作者 贾熹滨 尹训洁 +1 位作者 范超 杨正汉 《东北大学学报(自然科学版)》 北大核心 2026年第1期1-10,共10页
针对半监督病灶分割中教师网络性能较差,难以指导学生网络进行有效分割的问题,本文提出一种高效的半监督医学图像病灶分割方法.该方法选用特征提取能力更强的MedSAM(medical segment anything model)作为教师网络,构建基于Mamba的轻量... 针对半监督病灶分割中教师网络性能较差,难以指导学生网络进行有效分割的问题,本文提出一种高效的半监督医学图像病灶分割方法.该方法选用特征提取能力更强的MedSAM(medical segment anything model)作为教师网络,构建基于Mamba的轻量级学生网络,通过知识蒸馏提升学生网络分割性能.针对异构网络特征对齐带来的语义失配问题,提出基于扰动一致的跨架构知识蒸馏策略,将教师特征映射到学生特征空间并对齐扰动响应,提升学生网络特征表达能力以优化分割性能.此外,针对病灶形态多样及前景背景对比度低导致的分割一致性差问题,提出基于分布的自监督损失进行优化.在多类医学图像病灶分割数据集上的实验表明,本文方法的分割性能优于现有方法,同时学生网络参数量仅为1.34 M,显著提升了模型效率. 展开更多
关键词 病灶分割 Medsam Mamba 知识蒸馏 自监督损失
在线阅读 下载PDF
基于SAM多尺度标签优化的半监督学习遥感目标检测 被引量:1
4
作者 周洁 方振宇 《微电子学与计算机》 2026年第1期65-74,共10页
针对遥感图像中目标分辨率低、背景复杂且获取高质量旋转框标注费用高、耗时长等问题,提出了一种多尺度标签优化的半监督学习遥感目标检测方法。该方法使用SoftTeacher模型能够充分利用大量未标注且多样化的数据,同时还能发现原始数据... 针对遥感图像中目标分辨率低、背景复杂且获取高质量旋转框标注费用高、耗时长等问题,提出了一种多尺度标签优化的半监督学习遥感目标检测方法。该方法使用SoftTeacher模型能够充分利用大量未标注且多样化的数据,同时还能发现原始数据集中未标注的目标;借助SAM(Segment Anything Model)模型可实现基于深度学习的图像分割,并通过基于掩码的优化生成高质量的标签。通过半监督学习生成伪标注,对伪标注中的标签特征框进行多尺度处理后输入SAM模型进行优化,使用优化后的标注扩充原数据集样本重新用于全监督训练。实验结果表明:所选用的半监督目标检测模型SoftTeacher能够展现出优于全监督目标检测模型的性能,经过优化后的数据集样本能够展现相比原本伪标注数据集更精确的效果。在使用扩充后的数据集进行全监督训练时,原先的平均精度均值(mean Average Precision, mAP, mAP)从51.4%提升到53.5%。此外,全监督训练阶段使用现有的常用目标检测器进行了对比实验,进一步验证了所提方法可以有效提高遥感目标检测在标注不足情况下的准确性。 展开更多
关键词 遥感图像 半监督学习 sam 图像分割
在线阅读 下载PDF
Pre-trained SAM as data augmentation for image segmentation 被引量:1
5
作者 Junjun Wu Yunbo Rao +1 位作者 Shaoning Zeng Bob Zhang 《CAAI Transactions on Intelligence Technology》 2025年第1期268-282,共15页
Data augmentation plays an important role in training deep neural model by expanding the size and diversity of the dataset.Initially,data augmentation mainly involved some simple transformations of images.Later,in ord... Data augmentation plays an important role in training deep neural model by expanding the size and diversity of the dataset.Initially,data augmentation mainly involved some simple transformations of images.Later,in order to increase the diversity and complexity of data,more advanced methods appeared and evolved to sophisticated generative models.However,these methods required a mass of computation of training or searching.In this paper,a novel training-free method that utilises the Pre-Trained Segment Anything Model(SAM)model as a data augmentation tool(PTSAM-DA)is proposed to generate the augmented annotations for images.Without the need for training,it obtains prompt boxes from the original annotations and then feeds the boxes to the pre-trained SAM to generate diverse and improved annotations.In this way,annotations are augmented more ingenious than simple manipulations without incurring huge computation for training a data augmentation model.Multiple comparative experiments on three datasets are conducted,including an in-house dataset,ADE20K and COCO2017.On this in-house dataset,namely Agricultural Plot Segmentation Dataset,maximum improvements of 3.77%and 8.92%are gained in two mainstream metrics,mIoU and mAcc,respectively.Consequently,large vision models like SAM are proven to be promising not only in image segmentation but also in data augmentation. 展开更多
关键词 data augmentation image segmentation large model segment anything model
在线阅读 下载PDF
SAMSNet:融合分散注意力与多尺度通道注意力的遥感道路提取网络
6
作者 魏德宾 徐永强 +1 位作者 李品儒 解鸿基 《遥感学报》 北大核心 2026年第2期371-384,共14页
从遥感图像中自动提取道路在智慧城市、智慧交通和自动驾驶等领域有着广泛的应用前景。然而,从高分辨率遥感图像中自动提取的道路存在碎片化、连通性差等问题,提取完整的道路仍然具有挑战性。为此,本文提出一种改进的编码器—解码器网络... 从遥感图像中自动提取道路在智慧城市、智慧交通和自动驾驶等领域有着广泛的应用前景。然而,从高分辨率遥感图像中自动提取的道路存在碎片化、连通性差等问题,提取完整的道路仍然具有挑战性。为此,本文提出一种改进的编码器—解码器网络SAMSNet(Split-Attention and Multi-Scale Attention Network)。首先,采用Split-Attention Network(ResNeSt-50)作为编码器,通过跨通道提取图像的语义信息以实现高质量的特征表示;其次,引入级联并行的空洞卷积块,在扩大感受野的同时提高网络对多尺度上下文信息的感知能力;最后,在跳跃连接部分引入多尺度通道注意力模块MS-CAM(Multi-Scale Channel Attention Module),同时关注分布全局的和局部的道路信息,帮助网络识别和检测极端尺度变化下的道路。并在DeepGlobe Road数据集、Massachusetts Road数据集和GRSet数据集上进行实验验证,将本文提出的SAMSNet与其他9种主流模型进行对比。验证结果表明,SAMSNet在3个公开数据集上的IoU和F1-score等多项评价指标均优于其他对比模型,取得了最优的提取结果。 展开更多
关键词 遥感图像 道路提取 语义分割 ResNeSt-50 分散注意力 多尺度通道注意力 空洞卷积
原文传递
从通用分割到专用化建筑物提取——SAM在高分遥感影像中的优化策略研究
7
作者 陈秀秀 金永胜 +1 位作者 叶建生 方雷 《中国图象图形学报》 北大核心 2026年第2期642-656,共15页
目的 针对传统高分辨率影像建筑物提取方法的精度瓶颈,SAM(segment anything model)模型虽然具有分割优势,却因训练域差异和人工提示依赖,无法直接应用于大规模遥感影像的自动化提取。为此,提出一种无提示—判别联合模型(SAM-Classifie... 目的 针对传统高分辨率影像建筑物提取方法的精度瓶颈,SAM(segment anything model)模型虽然具有分割优势,却因训练域差异和人工提示依赖,无法直接应用于大规模遥感影像的自动化提取。为此,提出一种无提示—判别联合模型(SAM-Classifier),实现了通用视觉模型向遥感场景的迁移,完成了建筑物的自动化高效提取。方法 本研究采用了一系列实验来系统探究不同提示方式(包括点提示、框提示和掩码提示)在SAM模型指导下的建筑物提取效果,并引入一个无需提示的联合模型——SAM-Classifier,以克服传统SAM模型在语义理解和提示依赖方面的限制。实验基于3个公开可用的数据集进行,以全面评估各种提示策略下SAM模型的表现。此外,为了比较不同解决方案在建筑物提取任务中的性能差异,还特别设计了对比实验,将SAM模型及SAMClassifier的结果与商汤科技开发的遥感大模型(Sense Earth 3.0)进行了详细的对比分析。结果 实验表明,框提示引导下的SAM分割表现最优(WHU数据集F1分数0.945);所提出的SAM-Classifier无需人工提示,Ma数据集F1分数0.717,与对比的先进方法性能相近。结论 本文提出SAM-Classifier,通过融合轻量级分类器实现无需提示的端到端建筑物提取,有效缓解了SAM的语义理解不足与提示依赖问题,为遥感影像的自动化解译提供了新方案。 展开更多
关键词 图像分割 高分辨率影像 建筑物提取 sam(segment anything model) 提示分割 优化策略
原文传递
CSG-Net:一种融合域适应与视觉基础模型SAM的遥感影像建筑物足迹提取方法
8
作者 王椰 张新长 +1 位作者 姜明 阮永俭 《测绘通报》 北大核心 2026年第3期57-61,共5页
针对深度学习建筑物足迹提取模型在跨平台与跨分辨率应用中因域间分布不一致导致的泛化能力显著下降问题,本文提出了一种跨尺度几何精炼网络(CSG-Net),构建了一个“概率-几何”串联的伪标签精炼框架,旨在提升模型在无标签目标域中的适... 针对深度学习建筑物足迹提取模型在跨平台与跨分辨率应用中因域间分布不一致导致的泛化能力显著下降问题,本文提出了一种跨尺度几何精炼网络(CSG-Net),构建了一个“概率-几何”串联的伪标签精炼框架,旨在提升模型在无标签目标域中的适应性与提取精度。首先,通过计算模型双预测分支的Jensen-Shannon散度(JSD),实现对伪标签的不确定性度量与概率加权,以软性方式抑制不可靠区域的噪声;然后,引入基于segment anything model(SAM)分割结果的几何先验,通过重叠率分析对初始伪标签的边界进行硬性几何修正,从而生成高质量的训练目标。在跨尺度建筑物提取任务上的试验表明,CSG-Net的交并比(IoU)达到73.05%,显著优于Baseline(52.49%)及其他先进域适应方法,验证了本文框架在提升跨域稳健性和提取精度方面的有效性。 展开更多
关键词 遥感影像 建筑物足迹 语义分割 域适应 segment anything model(sam)
原文传递
A medical image segmentation model based on SAM with an integrated local multi-scale feature encoder
9
作者 DI Jing ZHU Yunlong LIANG Chan 《Journal of Measurement Science and Instrumentation》 2025年第3期359-370,共12页
Despite its remarkable performance on natural images,the segment anything model(SAM)lacks domain-specific information in medical imaging.and faces the challenge of losing local multi-scale information in the encoding ... Despite its remarkable performance on natural images,the segment anything model(SAM)lacks domain-specific information in medical imaging.and faces the challenge of losing local multi-scale information in the encoding phase.This paper presents a medical image segmentation model based on SAM with a local multi-scale feature encoder(LMSFE-SAM)to address the issues above.Firstly,based on the SAM,a local multi-scale feature encoder is introduced to improve the representation of features within local receptive field,thereby supplying the Vision Transformer(ViT)branch in SAM with enriched local multi-scale contextual information.At the same time,a multiaxial Hadamard product module(MHPM)is incorporated into the local multi-scale feature encoder in a lightweight manner to reduce the quadratic complexity and noise interference.Subsequently,a cross-branch balancing adapter is designed to balance the local and global information between the local multi-scale feature encoder and the ViT encoder in SAM.Finally,to obtain smaller input image size and to mitigate overlapping in patch embeddings,the size of the input image is reduced from 1024×1024 pixels to 256×256 pixels,and a multidimensional information adaptation component is developed,which includes feature adapters,position adapters,and channel-spatial adapters.This component effectively integrates the information from small-sized medical images into SAM,enhancing its suitability for clinical deployment.The proposed model demonstrates an average enhancement ranging from 0.0387 to 0.3191 across six objective evaluation metrics on BUSI,DDTI,and TN3K datasets compared to eight other representative image segmentation models.This significantly enhances the performance of the SAM on medical images,providing clinicians with a powerful tool in clinical diagnosis. 展开更多
关键词 segment anything model(sam) medical image segmentation ENCODER decoder multiaxial Hadamard product module(MHPM) cross-branch balancing adapter
在线阅读 下载PDF
CableSAM:an efficient automatic segmentation method for aircraft cabin cables
10
作者 LING Aihua WANG Junwen +1 位作者 LU Jiaming LIU Ruyu 《Optoelectronics Letters》 2025年第3期183-187,共5页
Cabin cables,as critical components of an aircraft's electrical system,significantly impact the operational efficiency and safety of the aircraft.The existing cable segmentation methods in civil aviation cabins ar... Cabin cables,as critical components of an aircraft's electrical system,significantly impact the operational efficiency and safety of the aircraft.The existing cable segmentation methods in civil aviation cabins are limited,especially in automation,heavily dependent on large amounts of data and resources,lacking the flexibility to adapt to different scenarios.To address these challenges,this paper introduces a novel image segmentation model,CableSAM,specifically designed for automated segmentation of cabin cables.CableSAM improves segmentation efficiency and accuracy using knowledge distillation and employs a context ensemble strategy.It accurately segments cables in various scenarios with minimal input prompts.Comparative experiments on three cable datasets demonstrate that CableSAM surpasses other advanced cable segmentation methods in performance. 展开更多
关键词 image segmentation aircraft cabin automatic segmentation automated segmentation cabin cablesas civil aviation cabins cable segmentation knowledge distillation
原文传递
CW-HRNet:Constrained Deformable Sampling and Wavelet-Guided Enhancement for Lightweight Crack Segmentation
11
作者 Dewang Ma 《Journal of Electronic Research and Application》 2025年第5期269-280,共12页
This paper presents CW-HRNet,a high-resolution,lightweight crack segmentation network designed to address challenges in complex scenes with slender,deformable,and blurred crack structures.The model incorporates two ke... This paper presents CW-HRNet,a high-resolution,lightweight crack segmentation network designed to address challenges in complex scenes with slender,deformable,and blurred crack structures.The model incorporates two key modules:Constrained Deformable Convolution(CDC),which stabilizes geometric alignment by applying a tanh limiter and learnable scaling factor to the predicted offsets,and the Wavelet Frequency Enhancement Module(WFEM),which decomposes features using Haar wavelets to preserve low-frequency structures while enhancing high-frequency boundaries and textures.Evaluations on the CrackSeg9k benchmark demonstrate CW-HRNet’s superior performance,achieving 82.39%mIoU with only 7.49M parameters and 10.34 GFLOPs,outperforming HrSegNet-B48 by 1.83% in segmentation accuracy with minimal complexity overhead.The model also shows strong cross-dataset generalization,achieving 60.01%mIoU and 66.22%F1 on Asphalt3k without fine-tuning.These results highlight CW-HRNet’s favorable accuracyefficiency trade-off for real-world crack segmentation tasks. 展开更多
关键词 Crack segmentation Lightweight semantic segmentation Deformable convolution Wavelet transform Road infrastructure
在线阅读 下载PDF
Precision organoid segmentation technique(POST):accurate organoid segmentation in challenging bright-field images 被引量:1
12
作者 Xuan Du Yuchen Li +5 位作者 Jiaping Song Zilin Zhang Jing Zhang Yanhui Li Zaozao Chen Zhongze Gu 《Bio-Design and Manufacturing》 2026年第1期80-93,I0013-I0016,共18页
Organoids possess immense potential for unraveling the intricate functions of human tissues and facilitating preclinical disease treatment.Their applications span from high-throughput drug screening to the modeling of... Organoids possess immense potential for unraveling the intricate functions of human tissues and facilitating preclinical disease treatment.Their applications span from high-throughput drug screening to the modeling of complex diseases,with some even achieving clinical translation.Changes in the overall size,shape,boundary,and other morphological features of organoids provide a noninvasive method for assessing organoid drug sensitivity.However,the precise segmentation of organoids in bright-field microscopy images is made difficult by the complexity of the organoid morphology and interference,including overlapping organoids,bubbles,dust particles,and cell fragments.This paper introduces the precision organoid segmentation technique(POST),which is a deep-learning algorithm for segmenting challenging organoids under simple bright-field imaging conditions.Unlike existing methods,POST accurately segments each organoid and eliminates various artifacts encountered during organoid culturing and imaging.Furthermore,it is sensitive to and aligns with measurements of organoid activity in drug sensitivity experiments.POST is expected to be a valuable tool for drug screening using organoids owing to its capability of automatically and rapidly eliminating interfering substances and thereby streamlining the organoid analysis and drug screening process. 展开更多
关键词 Organoid Drug screening Deep learning Image segmentation
暂未订购
基于UAV图像和SAM弱监督学习的黑土区保护性耕作玉米秸秆识别方法
13
作者 赵丽华 张超 +4 位作者 王贝贝 陈畅 武亚楠 杨翠翠 李媛媛 《农业机械学报》 北大核心 2026年第3期87-96,共10页
秸秆覆盖还田是黑土区保护性耕作的重要手段,秸秆识别对于保护性耕作实施效果评估和农业管理决策具有重要意义。针对全监督深度学习秸秆遥感识别方法依赖大量像素级标注标签数据问题,提出一种基于无人机(UAV)图像和Segment anything mod... 秸秆覆盖还田是黑土区保护性耕作的重要手段,秸秆识别对于保护性耕作实施效果评估和农业管理决策具有重要意义。针对全监督深度学习秸秆遥感识别方法依赖大量像素级标注标签数据问题,提出一种基于无人机(UAV)图像和Segment anything model(SAM)的弱监督学习秸秆遥感识别方法。通过Adapter和联合损失函数对SAM进行微调,并利用边界框弱标注生成高质量伪标签,最终训练改进的U-Net分割网络实现秸秆识别。以吉林省梨树县玉米保护性耕作区为研究区进行秸秆提取试验,试验结果表明,微调后SAM的平均交并比和F1分数分别达到81.04%和87.85%,显著优于未微调模型;SAM弱监督结合改进U-Net的模型性能高于其他分割方法,F1分数为90.6%;消融试验验证了联合损失函数和卷积模块可有效提升模型性能。本文为黑土区玉米保护性耕作秸秆遥感识别提供了一种高效、低成本的解决方案。 展开更多
关键词 玉米秸秆识别 无人机图像 保护性耕作 弱监督学习 sam 语义分割
在线阅读 下载PDF
基于FastSAM模型的猪肌纤维形态快速测量系统
14
作者 陈力 王金勇 +8 位作者 阿超 陈鸿基 候欣华 高庭玥 蒋新 王立刚 唐家森 张龙超 王立贤 《农业工程学报》 北大核心 2026年第3期369-379,共11页
肌纤维形态测量对畜牧业育种和医学领域具有重要意义。传统的方法在效率和准确性方面存在局限性,需要发展先进的计算机辅助分析技术。该研究开发了一种基于FastSAM模型的肌纤维形态学测量方法,利用图像锐化裁切处理,精确和有效地分析肌... 肌纤维形态测量对畜牧业育种和医学领域具有重要意义。传统的方法在效率和准确性方面存在局限性,需要发展先进的计算机辅助分析技术。该研究开发了一种基于FastSAM模型的肌纤维形态学测量方法,利用图像锐化裁切处理,精确和有效地分析肌纤维的特征,能够快速大批量地测定HE染色的肌纤维切片图像中的面积、直径等表型指标。结果表明:与传统方法相比,该方法时间效率更高(单张图像平均处理耗时显著减少,P<0.01);测量精度与可靠性更强,分割Dice系数达0.922,高IoU阈值下平均精确率(AP)为0.514、F1分数为0.830,均优于EdgeSAM、MobileSAM等主流模型;与手动测量结果高度一致,相关系数r>0.99,P<0.01。此外,该方法对猪12个不同骨骼区域及30~170 kg不同生长发育阶段的肌纤维均具有良好适应性。研究结果可为猪分子育种与肉质改良、肌纤维相关疾病的病理机制研究及骨骼肌形态学量化分析标准化方案的建立提供参考。 展开更多
关键词 肌纤维 图像分割 Fastsam HE染色
在线阅读 下载PDF
MicroFlowSAM:A motion-prompted instance segmentation approach in microfluidics with zero annotation and training
15
作者 Wenle Xu Lin Sheng +2 位作者 Tong Qiu Kai Wang Guangsheng Luo 《Chinese Journal of Chemical Engineering》 2025年第11期103-114,共12页
Microdispersion technology is crucial for a variety of applications in both the chemical and biomedical fields.The precise and rapid characterization of microdroplets and microbubbles is essential for research as well... Microdispersion technology is crucial for a variety of applications in both the chemical and biomedical fields.The precise and rapid characterization of microdroplets and microbubbles is essential for research as well as for optimizing and controlling industrial processes.Traditional methods often rely on time-consuming manual analysis.Although some deep learning-based computer vision methods have been proposed for automated identification and characterization,these approaches often rely on supervised learning,which requires labeled data for model training.This dependency on labeled data can be time-consuming and expensive,especially when working with large and complex datasets.To address these challenges,we propose Micro Flow SAM,an innovative,motion-prompted,annotation-free,and training-free instance segmentation approach.By utilizing motion of microdroplets and microbubbles as prompts,our method directs large-scale vision models to perform accurate instance segmentation without the need for annotated data or model training.This approach eliminates the need for human intervention in data labeling and reduces computational costs,significantly streamlining the data analysis process.We demonstrate the effectiveness of Micro Flow SAM across 12 diverse datasets,achieving outstanding segmentation results that are competitive with traditional methods.This novel approach not only accelerates the analysis process but also establishes a foundation for efficient process control and optimization in microfluidic applications.Micro Flow SAM represents a breakthrough in reducing the complexities and resource demands of instance segmentation,enabling faster insights and advancements in the microdispersion field. 展开更多
关键词 MICROFLUIDICS Microdispersion Instance segmentation Large vision model Prompt engineering
在线阅读 下载PDF
MSAMamba-UNet:A Lightweight Multi-Scale Adaptive Mamba Network for Skin Lesion Segmentation
16
作者 Shouming Hou Jianchao Hou +2 位作者 Yuteng Pang Aoyu Xia Beibei Hou 《Journal of Bionic Engineering》 2025年第6期3209-3225,共17页
Segmenting skin lesions is critical for early skin cancer detection.Existing CNN and Transformer-based methods face challenges such as high computational complexity and limited adaptability to variations in lesion siz... Segmenting skin lesions is critical for early skin cancer detection.Existing CNN and Transformer-based methods face challenges such as high computational complexity and limited adaptability to variations in lesion sizes.To overcome these limitations,we introduce MSAMamba-UNet,a lightweight model that integrates two novel architectures:Multi-Scale Mamba(MSMamba)and Adaptive Dynamic Gating Block(ADGB).MSMamba utilizes multi-scale decomposition and a parallel hierarchical structure to enhance the delineation of irregular lesion boundaries and sensitivity to small targets.ADGB dynamically selects convolutional kernels with varying receptive fields based on input features,improving the model’s capacity to accommodate diverse lesion textures and scales.Additionally,we introduce a Mix Attention Fusion Block(MAF)to enhance shallow feature representation by integrating parallel channel and pixel attention mechanisms.Extensive evaluation of MSAMamba-UNet on the ISIC 2016,ISIC 2017,and ISIC 2018 datasets demonstrates competitive segmentation accuracy with only 0.056 M parameters and 0.069 GFLOPs.Our experiments revealed that MSAMamba-UNet achieved IoU scores of 85.53%,85.47%,and 82.22%,as well as DSC scores of 92.20%,92.17%,and 90.24%,respectively.These results underscore the lightweight design and effectiveness of MSAMamba-UNet. 展开更多
关键词 TRANSFORMER segmenting skin lesions Mamba Lightweight model MULTI-SCALE
在线阅读 下载PDF
An enhanced segmentation method for 3D point cloud of tunnel support system using PointNet++t and coverage-voted strategy algorithms 被引量:1
17
作者 Wenju Liu Fuqiang Gao +4 位作者 Shuangyong Dong Xiaoqing Wang Shuwen Cao Wanjie Wang Xiaomin Liu 《Journal of Rock Mechanics and Geotechnical Engineering》 2026年第2期1653-1660,共8页
3D laser scanning technology is widely used in underground openings for high-precision,rapid,and nondestructive structural evaluations.Segmenting large 3D point cloud datasets,particularly in coal mine roadways with m... 3D laser scanning technology is widely used in underground openings for high-precision,rapid,and nondestructive structural evaluations.Segmenting large 3D point cloud datasets,particularly in coal mine roadways with multi-scale targets,remains challenging.This paper proposes an enhanced segmentation method integrating improved PointNet++with a coverage-voted strategy.The coverage-voted strategy reduces data while preserving multi-scale target topology.The segmentation is achieved using an enhanced PointNet++algorithm with a normalization preprocessing head,resulting in a 94%accuracy for common supporting components.Ablation experiments show that the preprocessing head and coverage strategies increase segmentation accuracy by 20%and 2%,respectively,and improve Intersection over Union(IoU)for bearing plate segmentation by 58%and 20%.The accuracy of the current pretraining segmentation model may be affected by variations in surface support components,but it can be readily enhanced through re-optimization with additional labeled point cloud data.This proposed method,combined with a previously developed machine learning model that links rock bolt load and the deformation field of its bearing plate,provides a robust technique for simultaneously measuring the load of multiple rock bolts in a single laser scan. 展开更多
关键词 Point cloud segmentation Improved PointNet++ Tunnel laser scanning Rock bolt automatic recognition
在线阅读 下载PDF
Predictive Value of 3D Radiological Segmentation and Anatomical Parameters for Cochlear Implantation Electrode Insertion Depth Based on a Large Sample of Patients with Inner Ear Malformations
18
作者 Shujin Xue Xingmei Wei +4 位作者 Ying Kong Biao Chen Zhencheng Gao Chunling Ma Yongxin Li 《Journal of Otology》 2025年第4期259-267,共9页
Objective:The aims of this study were to investigate the clinical applicability of 3D segmentation in measuring cochlear anatomical parameters,explore factors that influence the insertion angle of cochlear implant ele... Objective:The aims of this study were to investigate the clinical applicability of 3D segmentation in measuring cochlear anatomical parameters,explore factors that influence the insertion angle of cochlear implant electrodes in patients with inner ear malformations,and determine the value of 3D segmentation in predicting cochlear implant electrode insertion depth by simulating electrode implantation in a reconstructed 3D model.Methods:Data from 208 temporal bone CT scans of patients with a variety of inner ear malformations(including the CH,IP-Ⅰ,IP-Ⅱ,and IP-Ⅲtypes)who underwent cochlear implantation at our center were retrospectively analyzed.Preoperative temporal bone CT data were subjected to three-dimensional(3D)segmentation of the cochlea with a 3D slicer.Results:Cochlear malformation types,including IP typesⅠ(42 ears),Ⅱ(278ears),Ⅲ(20 ears),and CH(65 ears),were diagnosed and measured in 208 preoperative CT datasets.Cochlear anatomical parameters and electrode length were correlated,which partially explained the variations in electrode insertion angle.The mean angle of implantation among the enrolled patients was 564.33°,and the mean implantation angle prediction error in the 3D segmentation was|23.74|°.Conclusion:Three-dimensional segmentation from temporal bone CT is valuable for surgeons,especially in treating patients with inner ear malformation.Such insights will help surgeons understand overall anatomical variations,predict electrode implantation depth,and complete preoperative imaging assessments for cochlear implant insertion depth in patients with inner ear malformations. 展开更多
关键词 Inner ear malformation Cochlear implant Temporal bone CT Three-dimensional segmentation
暂未订购
Efficient Dataset Generation for Stacked Meat Products Instance Segmentation in Food Automation
19
作者 Hoang Minh Pham Anh Dong Le +2 位作者 Pablo Malvido-Fresnillo Saigopal Vasudevan JoséL.Martínez Lastra 《IEEE/CAA Journal of Automatica Sinica》 2026年第1期224-226,共3页
Dear Editor,This letter presents techniques to simplify dataset generation for instance segmentation of raw meat products,a critical step toward automating food production lines.Accurate segmentation is essential for ... Dear Editor,This letter presents techniques to simplify dataset generation for instance segmentation of raw meat products,a critical step toward automating food production lines.Accurate segmentation is essential for addressing challenges such as occlusions,indistinct edges,and stacked configurations,which demand large,diverse datasets.To meet these demands,we propose two complementary approaches:a semi-automatic annotation interface using tools like the segment anything model(SAM)and GrabCut and a synthetic data generation pipeline leveraging 3D-scanned models.These methods reduce reliance on real meat,mitigate food waste,and improve scalability.Experimental results demonstrate that incorporating synthetic data enhances segmentation model performance and,when combined with real data,further boosts accuracy,paving the way for more efficient automation in the food industry. 展开更多
关键词 dataset generation segment anything model sam food automation raw meat productsa automating food production linesaccurate instance segmentation stacked meat products semi automatic annotation
在线阅读 下载PDF
Detection of co-phasing error in segmented mirror based on extended Young’s interferometry combined with Vision Transformer
20
作者 LIU Yin-ling YAO Chi +3 位作者 OUYANG Shang-tao WAN Yi-rong CHEN Mo LI Bin 《中国光学(中英文)》 北大核心 2026年第1期205-218,共14页
Due to the inability of manufacturing a single monolithic mirror at the 10-meter scales,segmented mirrors have become indispensable tools in modern astronomical research.However,to match the imaging performance of the... Due to the inability of manufacturing a single monolithic mirror at the 10-meter scales,segmented mirrors have become indispensable tools in modern astronomical research.However,to match the imaging performance of the monolithic counterpart,the sub-mirrors must maintain precise co-phasing.Piston error critically degrades segmented mirror imaging quality,necessitating efficient and precise detection.To ad-dress the limitations that the conventional circular-aperture diffraction with two-wavelength algorithm is sus-ceptible to decentration errors,and the traditional convolutional neural networks(CNNs)struggle to capture global features under large-range piston errors due to their restricted local receptive fields,this paper pro-poses a method that integrates extended Young’s interference principles with a Vision Transformer(ViT)to detect piston error.By suppressing decentration error interference through two symmetrically arranged aper-tures and extending the measurement range to±7.95μm via a two-wavelength(589 nm/600 nm)algorithm.This approach exploits ViT’s self-attention mechanism to model global characteristics of interference fringes.Unlike CNNs constrained by local convolutional kernels,the ViT significantly improves sensitivity to inter-ferogram periodicity.The simulation results demonstrate that the proposed method achieves a measurement accuracy of 5 nm(0.0083λ0)across the range of±7.95μm,while maintaining an accuracy exceeding 95%in the presence of Gaussian noise(SNR≥15 dB),Poisson noise(λ≥9 photons/pixel),and sub-mirror gap er-ror(Egap≤0.2)interference.Moreover,the detection speed shows significant improvement compared to the cross-correlation algorithm.This study establishes an accurate,robust framework for segmented mirror error detection,advancing high-precision astronomical observation. 展开更多
关键词 segmented mirror co-phasing piston errors ViT Young’s interference principles
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部