期刊文献+
共找到5,064篇文章
< 1 2 250 >
每页显示 20 50 100
A medical image segmentation model based on SAM with an integrated local multi-scale feature encoder
1
作者 DI Jing ZHU Yunlong LIANG Chan 《Journal of Measurement Science and Instrumentation》 2025年第3期359-370,共12页
Despite its remarkable performance on natural images,the segment anything model(SAM)lacks domain-specific information in medical imaging.and faces the challenge of losing local multi-scale information in the encoding ... Despite its remarkable performance on natural images,the segment anything model(SAM)lacks domain-specific information in medical imaging.and faces the challenge of losing local multi-scale information in the encoding phase.This paper presents a medical image segmentation model based on SAM with a local multi-scale feature encoder(LMSFE-SAM)to address the issues above.Firstly,based on the SAM,a local multi-scale feature encoder is introduced to improve the representation of features within local receptive field,thereby supplying the Vision Transformer(ViT)branch in SAM with enriched local multi-scale contextual information.At the same time,a multiaxial Hadamard product module(MHPM)is incorporated into the local multi-scale feature encoder in a lightweight manner to reduce the quadratic complexity and noise interference.Subsequently,a cross-branch balancing adapter is designed to balance the local and global information between the local multi-scale feature encoder and the ViT encoder in SAM.Finally,to obtain smaller input image size and to mitigate overlapping in patch embeddings,the size of the input image is reduced from 1024×1024 pixels to 256×256 pixels,and a multidimensional information adaptation component is developed,which includes feature adapters,position adapters,and channel-spatial adapters.This component effectively integrates the information from small-sized medical images into SAM,enhancing its suitability for clinical deployment.The proposed model demonstrates an average enhancement ranging from 0.0387 to 0.3191 across six objective evaluation metrics on BUSI,DDTI,and TN3K datasets compared to eight other representative image segmentation models.This significantly enhances the performance of the SAM on medical images,providing clinicians with a powerful tool in clinical diagnosis. 展开更多
关键词 segment anything model(sam) medical image segmentation ENCODER decoder multiaxial Hadamard product module(MHPM) cross-branch balancing adapter
在线阅读 下载PDF
Optimizing zero-shot text-based segmentation of remote sensing imagery using SAM and Grounding DINO
2
作者 Mohanad Diab Polychronis Kolokoussis Maria Antonia Brovelli 《Artificial Intelligence in Geosciences》 2025年第1期14-24,共11页
The use of AI technologies in remote sensing(RS)tasks has been the focus of many individuals in both the professional and academic domains.Having more accessible interfaces and tools that allow people of little or no ... The use of AI technologies in remote sensing(RS)tasks has been the focus of many individuals in both the professional and academic domains.Having more accessible interfaces and tools that allow people of little or no experience to intuitively interact with RS data of multiple formats is a potential provided by this integration.However,the use of AI and AI agents to help automate RS-related tasks is still in its infancy stage,with some frameworks and interfaces built on top of well-known vision language models(VLM)such as GPT-4,segment anything model(SAM),and grounding DINO.These tools do promise and draw guidelines on the potentials and limitations of existing solutions concerning the use of said models.In this work,the state of the art AI foundation models(FM)are reviewed and used in a multi-modal manner to ingest RS imagery input and perform zero-shot object detection using natural language.The natural language input is then used to define the classes or labels the model should look for,then,both inputs are fed to the pipeline.The pipeline presented in this work makes up for the shortcomings of the general knowledge FMs by stacking pre-processing and post-processing applications on top of the FMs;these applications include tiling to produce uniform patches of the original image for faster detection,outlier rejection of redundant bounding boxes using statistical and machine learning methods.The pipeline was tested with UAV,aerial and satellite images taken over multiple areas.The accuracy for the semantic segmentation showed improvement from the original 64%to approximately 80%-99%by utilizing the pipeline and techniques proposed in this work.GitHub Repository:MohanadDiab/LangRS. 展开更多
关键词 Foundation models Multi-modal models Vision language models Semantic segmentation segment anything model Earth observation Remote sensing
在线阅读 下载PDF
Pre-trained SAM as data augmentation for image segmentation
3
作者 Junjun Wu Yunbo Rao +1 位作者 Shaoning Zeng Bob Zhang 《CAAI Transactions on Intelligence Technology》 2025年第1期268-282,共15页
Data augmentation plays an important role in training deep neural model by expanding the size and diversity of the dataset.Initially,data augmentation mainly involved some simple transformations of images.Later,in ord... Data augmentation plays an important role in training deep neural model by expanding the size and diversity of the dataset.Initially,data augmentation mainly involved some simple transformations of images.Later,in order to increase the diversity and complexity of data,more advanced methods appeared and evolved to sophisticated generative models.However,these methods required a mass of computation of training or searching.In this paper,a novel training-free method that utilises the Pre-Trained Segment Anything Model(SAM)model as a data augmentation tool(PTSAM-DA)is proposed to generate the augmented annotations for images.Without the need for training,it obtains prompt boxes from the original annotations and then feeds the boxes to the pre-trained SAM to generate diverse and improved annotations.In this way,annotations are augmented more ingenious than simple manipulations without incurring huge computation for training a data augmentation model.Multiple comparative experiments on three datasets are conducted,including an in-house dataset,ADE20K and COCO2017.On this in-house dataset,namely Agricultural Plot Segmentation Dataset,maximum improvements of 3.77%and 8.92%are gained in two mainstream metrics,mIoU and mAcc,respectively.Consequently,large vision models like SAM are proven to be promising not only in image segmentation but also in data augmentation. 展开更多
关键词 data augmentation image segmentation large model segment anything model
在线阅读 下载PDF
Segmentation of CAD models using hybrid representation
4
作者 Claude UWIMANA Shengdi ZHOU +4 位作者 Limei YANG Zhuqing LI Norbelt MUTAGISHA Edouard NIYONGABO Bin ZHOU 《虚拟现实与智能硬件(中英文)》 2025年第2期188-202,共15页
In this paper,we introduce an innovative method for computer-aided design(CAD)segmentation by concatenating meshes and CAD models.Many previous CAD segmentation methods have achieved impressive performance using singl... In this paper,we introduce an innovative method for computer-aided design(CAD)segmentation by concatenating meshes and CAD models.Many previous CAD segmentation methods have achieved impressive performance using single representations,such as meshes,CAD,and point clouds.However,existing methods cannot effectively combine different three-dimensional model types for the direct conversion,alignment,and integrity maintenance of geometric and topological information.Hence,we propose an integration approach that combines the geometric accuracy of CAD data with the flexibility of mesh representations,as well as introduce a unique hybrid representation that combines CAD and mesh models to enhance segmentation accuracy.To combine these two model types,our hybrid system utilizes advanced-neural-network techniques to convert CAD models into mesh models.For complex CAD models,model segmentation is crucial for model retrieval and reuse.In partial retrieval,it aims to segment a complex CAD model into several simple components.The first component of our hybrid system involves advanced mesh-labeling algorithms that harness the digitization of CAD properties to mesh models.The second component integrates labelled face features for CAD segmentation by leveraging the abundant multisemantic information embedded in CAD models.This combination of mesh and CAD not only refines the accuracy of boundary delineation but also provides a comprehensive understanding of the underlying object semantics.This study uses the Fusion 360 Gallery dataset.Experimental results indicate that our hybrid method can segment these models with higher accuracy than other methods that use single representations. 展开更多
关键词 B-RepNet hybrid segmentation CAD models classification MeshCNN MeshCAD-Net
在线阅读 下载PDF
Dual-Stream Attention-Based Classification Network for Tibial Plateau Fractures via Diffusion Model Augmentation and Segmentation Map Integration
5
作者 Yi Xie Zhi-wei Hao +8 位作者 Xin-meng Wang Hong-lin Wang Jia-ming Yang Hong Zhou Xu-dong Wang Jia-yao Zhang Hui-wen Yang Peng-ran Liu Zhe-wei Ye 《Current Medical Science》 2025年第1期57-69,共13页
Objective This study aimed to explore a novel method that integrates the segmentation guidance classification and the dif-fusion model augmentation to realize the automatic classification for tibial plateau fractures(... Objective This study aimed to explore a novel method that integrates the segmentation guidance classification and the dif-fusion model augmentation to realize the automatic classification for tibial plateau fractures(TPFs).Methods YOLOv8n-cls was used to construct a baseline model on the data of 3781 patients from the Orthopedic Trauma Center of Wuhan Union Hospital.Additionally,a segmentation-guided classification approach was proposed.To enhance the dataset,a diffusion model was further demonstrated for data augmentation.Results The novel method that integrated the segmentation-guided classification and diffusion model augmentation sig-nificantly improved the accuracy and robustness of fracture classification.The average accuracy of classification for TPFs rose from 0.844 to 0.896.The comprehensive performance of the dual-stream model was also significantly enhanced after many rounds of training,with both the macro-area under the curve(AUC)and the micro-AUC increasing from 0.94 to 0.97.By utilizing diffusion model augmentation and segmentation map integration,the model demonstrated superior efficacy in identifying SchatzkerⅠ,achieving an accuracy of 0.880.It yielded an accuracy of 0.898 for SchatzkerⅡandⅢand 0.913 for SchatzkerⅣ;for SchatzkerⅤandⅥ,the accuracy was 0.887;and for intercondylar ridge fracture,the accuracy was 0.923.Conclusion The dual-stream attention-based classification network,which has been verified by many experiments,exhibited great potential in predicting the classification of TPFs.This method facilitates automatic TPF assessment and may assist surgeons in the rapid formulation of surgical plans. 展开更多
关键词 Artificial intelligence YOLOv8 Tibial plateau fracture Diffusion model augmentation segmentation map
暂未订购
光影智绘:基于SAM的视频阴影鲁棒抽取
6
作者 陈东 李昌隆 +2 位作者 杜振龙 宋爽 李晓丽 《图学学报》 北大核心 2025年第4期739-745,共7页
针对传统方法对于光照变化和物体遮挡引起复杂的、动态变化阴影处理易致阴影检测的准确率和鲁棒性较低问题,提出了一种基于分割万物模型(SAM)的视频阴影检测方法,对SAM解码器进行微调,使其更适合阴影检测;利用SAM提取关键帧阴影区域,引... 针对传统方法对于光照变化和物体遮挡引起复杂的、动态变化阴影处理易致阴影检测的准确率和鲁棒性较低问题,提出了一种基于分割万物模型(SAM)的视频阴影检测方法,对SAM解码器进行微调,使其更适合阴影检测;利用SAM提取关键帧阴影区域,引入XMem模型,结合感觉记忆、短时记忆和长时记忆联合前后帧信息,给出优化和稳定视频阴影检测结果。实验结果表明:在ViSha数据集的阴影实验结果与传统方法相比,该方法的均值绝对误差降低了约31.8%,交并比提升了约19.7%;定性和定量结果表明本方法不仅提升了视频阴影处理的准确率,并表现出较好的鲁棒性。 展开更多
关键词 阴影检测 语义分割 视频对象分割 sam XMem
在线阅读 下载PDF
SABM:一种蝴蝶生态图像分割的增强SAM模型
7
作者 谢娟英 兰翔 许升全 《陕西师范大学学报(自然科学版)》 北大核心 2025年第6期1-14,共14页
通过分割生态图像中蝴蝶获得蝴蝶掩码是基于生态图像的蝴蝶物种自动化识别的基础,因此研究蝴蝶生态图像分割有重要意义。然而,现有蝴蝶生态图像存在数据集样本量小、蝴蝶拟态、翅膀遮挡等问题,使现有深度网络难以训练出具有良好泛化能... 通过分割生态图像中蝴蝶获得蝴蝶掩码是基于生态图像的蝴蝶物种自动化识别的基础,因此研究蝴蝶生态图像分割有重要意义。然而,现有蝴蝶生态图像存在数据集样本量小、蝴蝶拟态、翅膀遮挡等问题,使现有深度网络难以训练出具有良好泛化能力的分割模型。为此,通过改进SAM(segment anything model)模型,提出一种鲁棒的蝴蝶生态图像分割新模型SABM(segment any butterfly model)。SABM模型通过引入双路卷积模块、蝴蝶词元(butterfly token)及一个3层MLP(multi-layer perceptron)使模型具有更好的特征学习能力。707张蝴蝶生态图像数据集的2折交叉验证实验表明,SABM模型对蝴蝶生态图像的分割能力超越了SAM及其现有的改进SOTA模型。7645张全新蝴蝶生态图像数据集的分割实验测试发现,SABM模型具有非常好的泛化性能,对7645张全新蝴蝶生态图像的蝴蝶实现了非常好的分割。该分割结果为未来的蝴蝶生态图像分割研究提供了10倍于现有数据的大数据集,为野外环境下的蝴蝶物种自动识别提供了更好的可用数据,也为测试聚类算法性能提供了富有挑战性的数据集。另外,还在医学图像数据测试了SABM模型的鲁棒性。 展开更多
关键词 蝴蝶分割 双路卷积 sam SABM 图像分割
在线阅读 下载PDF
基于Stone-SAM的便携式粗集料级配智能检测
8
作者 张鸿 杨俊雅 +2 位作者 刘可心 张益鹏 程雪聪 《建筑材料学报》 北大核心 2025年第6期581-590,共10页
为实现精确的粗集料级配检测,提出了一种便携式粗集料级配智能检测方法。采用知识蒸馏的策略对视觉大模型——分割一切模型(SAM)进行网络结构轻量化,嵌入神经网络分类器PP-HGNetV2为模型提供语义判断的能力,设计粗集料颗粒特征参数数学... 为实现精确的粗集料级配检测,提出了一种便携式粗集料级配智能检测方法。采用知识蒸馏的策略对视觉大模型——分割一切模型(SAM)进行网络结构轻量化,嵌入神经网络分类器PP-HGNetV2为模型提供语义判断的能力,设计粗集料颗粒特征参数数学表征算法,开发移动端应用程序,实现粗集料级配高通量检测。对5种粗集料级配场景进行测试。结果表明:本研究方法对于粗集料颗粒的分割精度高于原始SAM模型,并且能够精确去除背景信息,粗集料颗粒关键参数提取结果准确可靠。 展开更多
关键词 分割一切模型(sam) 粗集料级配 智能检测 移动端 工程检测
在线阅读 下载PDF
针对SAM下游模型脆弱模块的对抗迁移攻击
9
作者 丁熠 林能健 +2 位作者 蒋昀陶 钟宇浩 曹明生 《计算机研究与发展》 北大核心 2025年第10期2455-2467,共13页
SAM(segment anything model)作为一种通用的视觉基础模型,已被广泛应用于多种图像分割任务,但其在对抗性攻击面前表现出脆弱性.提出一种针对SAM下游模型脆弱模块的对抗迁移攻击方法FSGR(fragile section gradient robustness).该方法... SAM(segment anything model)作为一种通用的视觉基础模型,已被广泛应用于多种图像分割任务,但其在对抗性攻击面前表现出脆弱性.提出一种针对SAM下游模型脆弱模块的对抗迁移攻击方法FSGR(fragile section gradient robustness).该方法在无需知晓下游微调细节的前提下,可有效生成对抗样本,实现对SAM下游模型的攻击.该方法运用“脆弱层精准定位+局部强化迁移”策略,通过特征相似度筛选出跨任务共享且最易被激活的模块,针对性地强化攻击效果;同时,引入梯度稳健损失以消除目标模型与下游任务模型间的梯度差异. FSGR方法融合了脆弱层攻击与梯度稳健损失机制,在多个数据集上均实现了相对性能的提升.实验结果表明,FSGR在多种微调模型(如医学分割、阴影分割和伪装分割)的迁移攻击中显著降低了模型性能,证明了其正确性和实用性.与基线方法相比,FSGR不仅在攻击成功率上表现出色,还通过结合脆弱层攻击和梯度稳健损失,实现了相对性能的提升. 展开更多
关键词 图像分割 对抗攻击 迁移攻击 特征相似度 模型鲁棒性
在线阅读 下载PDF
基于SAM优化的饲喂目标实时识别方法
10
作者 张勤 翁凯航 《华南理工大学学报(自然科学版)》 北大核心 2025年第7期60-69,共10页
饲喂辅助机器人是推动畜牧业现代化转型的关键设备,饲喂目标的快速、准确识别是机器人实现智能推料的重要保证。匹配分割精度和运行效率是保证算法综合性能的关键步骤,也是识别算法的重要课题。针对现有奶牛饲喂目标识别方法存在分割精... 饲喂辅助机器人是推动畜牧业现代化转型的关键设备,饲喂目标的快速、准确识别是机器人实现智能推料的重要保证。匹配分割精度和运行效率是保证算法综合性能的关键步骤,也是识别算法的重要课题。针对现有奶牛饲喂目标识别方法存在分割精度和运行效率不匹配的问题,该文提出了一种基于分割大模型(SAM)优化的饲喂目标实时识别方法RTFTR。该方法首先在SAM-det架构基础上,通过轻量化图像编码器和目标检测器的参数,引入缓冲区队列的并行化设计方法来平衡各模块的运行效率,以提升推理速率;然后利用HQ形符增强特征空间的解码能力,优化设计掩码解码器,并采用针对饲喂目标的分阶段训练,以提高分割精度。实验结果表明:所提方法在提高分割精度的前提下保证了推理速率;在奶牛饲喂目标识别中,奶牛分割精度达98.7%,饲料分割精度达96.4%,料槽分割精度达99.2%,整体平均分割精度达98.1%,运行速率为52.9 f/s,满足养殖场复杂环境和机器人计算资源限制下对奶牛饲喂目标识别方法的高精度、高效率的应用需求。 展开更多
关键词 饲喂辅助机器人 分割大模型 奶牛饲喂 目标识别 分割精度
在线阅读 下载PDF
SAM Era:Can It Segment Any Industrial Surface Defects? 被引量:1
11
作者 Kechen Song Wenqi Cui +2 位作者 Han Yu Xingjie Li Yunhui Yan 《Computers, Materials & Continua》 SCIE EI 2024年第3期3953-3969,共17页
Segment Anything Model(SAM)is a cutting-edge model that has shown impressive performance in general object segmentation.The birth of the segment anything is a groundbreaking step towards creating a universal intellige... Segment Anything Model(SAM)is a cutting-edge model that has shown impressive performance in general object segmentation.The birth of the segment anything is a groundbreaking step towards creating a universal intelligent model.Due to its superior performance in general object segmentation,it quickly gained attention and interest.This makes SAM particularly attractive in industrial surface defect segmentation,especially for complex industrial scenes with limited training data.However,its segmentation ability for specific industrial scenes remains unknown.Therefore,in this work,we select three representative and complex industrial surface defect detection scenarios,namely strip steel surface defects,tile surface defects,and rail surface defects,to evaluate the segmentation performance of SAM.Our results show that although SAM has great potential in general object segmentation,it cannot achieve satisfactory performance in complex industrial scenes.Our test results are available at:https://github.com/VDT-2048/SAM-IS. 展开更多
关键词 segment anything sam surface defect detection salient object detection
在线阅读 下载PDF
基于SAM图像处理的堆石料级配计算方法及验证
12
作者 张振伟 蔡可天 +3 位作者 高轩 贺一轩 王建 鲁洋 《水力发电》 2025年第2期80-86,共7页
堆石料级配检测是堆石坝施工过程中质量控制的重要环节,传统方法通常采用现场人工筛分法测量,存在检测样本少、效率低、干扰施工等问题。提出了一种基于图像处理的堆石料级配计算方法,采用国际最新Mata AI开源的通用图像分割大模型Segme... 堆石料级配检测是堆石坝施工过程中质量控制的重要环节,传统方法通常采用现场人工筛分法测量,存在检测样本少、效率低、干扰施工等问题。提出了一种基于图像处理的堆石料级配计算方法,采用国际最新Mata AI开源的通用图像分割大模型Segment Anything Model(SAM)对筑坝堆石料进行自动图像分割,提出堆石长宽比、面积比等堆石形态学几何参数用于提取堆石料图像中的堆石颗粒目标;同时,建立堆石形态数据库、堆石实例分割数据库,并分析参数取值和验证堆石图像级配计算方法的有效性;最后,试验验证结果表明该方法能够有效识别出图像中的堆石颗粒目标,实现级配曲线的智能识别,以及曲率、不均匀系数等级配指标的快速计算。该方法计算获得的级配与真实筛分法测的级配相关性可达0.94,平均绝对误差约5%,能够在堆石坝施工过程中有效辅助检测堆石料的颗粒级配信息,服务堆石坝的施工碾压质量控制。 展开更多
关键词 堆石料 级配 segment anything model(sam) 图像识别 快速检测
在线阅读 下载PDF
轻量级微调SAM的结肠息肉分割方法SAMCP
13
作者 刘娜 封筠 +2 位作者 霍一儒 王弘扬 杨柳 《计算机应用》 北大核心 2025年第10期3390-3398,共9页
在胃肠道内窥镜图像处理中,精准分割结肠息肉具有重要的临床意义。传统分割方法常因细节捕捉不足和对大规模数据的依赖,在应对复杂形态的息肉时表现不佳。尽管分割一切模型(SAM)在自然图像分割中取得显著进展,但由于自然图像与医学图像... 在胃肠道内窥镜图像处理中,精准分割结肠息肉具有重要的临床意义。传统分割方法常因细节捕捉不足和对大规模数据的依赖,在应对复杂形态的息肉时表现不佳。尽管分割一切模型(SAM)在自然图像分割中取得显著进展,但由于自然图像与医学图像存在域差异,现有的SAM方法在结肠息肉分割任务上仍难以取得理想效果。为解决这一问题,基于SAM架构提出一种轻量级微调结肠息肉分割方法(SAMCP)。该方法引入精简适配器模块,重点关注通道维度信息,采用Dice和交并比(IoU)简化联合损失函数,并在训练时冻结原始图像编码器和提示编码器的参数,以低训练成本提升结肠息肉分割性能。在3个公开数据集上与9种先进方法的对比实验结果表明,相较于SAM方法,SAMCP在Kvasir-SEG数据集上的Dice和IoU值分别提高了56.7%和84.5%,在CVC-ClinicDB数据集上的Dice和IoU值分别提高了46.0%和86.0%,在CVC-ColonDB数据集上的Dice和IoU值分别提高了95.3%和122.2%,超过目前SAM-based类方法的最佳性能。在引入点提示的情况下,即使只使用1次点击,SAMCP仍能优于其他SAM-based方法。以上验证了SAMCP在处理复杂形状和局部细节时表现出色,可为医生提供更精确的分割指导。 展开更多
关键词 结肠息肉分割 分割一切模型 适配器 损失函数 轻量级微调
在线阅读 下载PDF
PCB CT Image Element Segmentation Model Optimizing the Semantic Perception of Connectivity Relationship
14
作者 Chen Chen Kai Qiao +2 位作者 Jie Yang Jian Chen Bin Yan 《Computers, Materials & Continua》 SCIE EI 2024年第11期2629-2642,共14页
Computed Tomography(CT)is a commonly used technology in Printed Circuit Boards(PCB)non-destructive testing,and element segmentation of CT images is a key subsequent step.With the development of deep learning,researche... Computed Tomography(CT)is a commonly used technology in Printed Circuit Boards(PCB)non-destructive testing,and element segmentation of CT images is a key subsequent step.With the development of deep learning,researchers began to exploit the“pre-training and fine-tuning”training process for multi-element segmentation,reducing the time spent on manual annotation.However,the existing element segmentation model only focuses on the overall accuracy at the pixel level,ignoring whether the element connectivity relationship can be correctly identified.To this end,this paper proposes a PCB CT image element segmentation model optimizing the semantic perception of connectivity relationship(OSPC-seg).The overall training process adopts a“pre-training and fine-tuning”training process.A loss function that optimizes the semantic perception of circuit connectivity relationship(OSPC Loss)is designed from the aspect of alleviating the class imbalance problem and improving the correct connectivity rate.Also,the correct connectivity rate index(CCR)is proposed to evaluate the model’s connectivity relationship recognition capabilities.Experiments show that mIoU and CCR of OSPC-seg on our datasets are 90.1%and 97.0%,improved by 1.5%and 1.6%respectively compared with the baseline model.From visualization results,it can be seen that the segmentation performance of connection positions is significantly improved,which also demonstrates the effectiveness of OSPC-seg. 展开更多
关键词 Semantic segmentation PCB non-destructive testing mask image modeling connectivity relationship
在线阅读 下载PDF
An Efficient Local Radial Basis Function Method for Image Segmentation Based on the Chan-Vese Model
15
作者 Shupeng Qiu Chujin Lin Wei Zhao 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第4期1119-1134,共16页
In this paper,we consider the Chan–Vese(C-V)model for image segmentation and obtain its numerical solution accurately and efficiently.For this purpose,we present a local radial basis function method based on a Gaussi... In this paper,we consider the Chan–Vese(C-V)model for image segmentation and obtain its numerical solution accurately and efficiently.For this purpose,we present a local radial basis function method based on a Gaussian kernel(GA-LRBF)for spatial discretization.Compared to the standard radial basis functionmethod,this approach consumes less CPU time and maintains good stability because it uses only a small subset of points in the whole computational domain.Additionally,since the Gaussian function has the property of dimensional separation,the GA-LRBF method is suitable for dealing with isotropic images.Finally,a numerical scheme that couples GA-LRBF with the fourth-order Runge–Kutta method is applied to the C-V model,and a comparison of some numerical results demonstrates that this scheme achieves much more reliable image segmentation. 展开更多
关键词 Image segmentation Chan–Vese model local radial basis functionmethod Gaussian kernel Runge–Kuttamethod
在线阅读 下载PDF
基于预训练SAM的提示式三维牙齿分割方法
16
作者 刘复昌 蔡煜晨 +1 位作者 缪永伟 范然 《浙江大学学报(理学版)》 北大核心 2025年第1期59-69,共11页
目前,大多研究采用有监督学习方法在牙齿的三维数据上训练网络,完成分割任务,但在处理缺牙、严重错位或颌部不完整的牙齿时效果不佳,泛化能力较弱。为此,提出了一种基于预训练分割一切模型(segment anything model,SAM)和提示分割技术... 目前,大多研究采用有监督学习方法在牙齿的三维数据上训练网络,完成分割任务,但在处理缺牙、严重错位或颌部不完整的牙齿时效果不佳,泛化能力较弱。为此,提出了一种基于预训练分割一切模型(segment anything model,SAM)和提示分割技术的方法。首先,在2022年国际医学图像计算和计算机辅助干预会议(MICCAI 2022)的三维牙齿公开数据集上微调模型。然后,将三维牙齿模型投影至多个二维视图,利用SAM网络进行图像分割。再将每个像素的标签映射回原始的三维三角形面片,完成三维牙齿分割。在该数据集中,测试了900个较理想的三维上下牙数据,取得了与主流技术相当的结果。对于缺牙、牙齿错位以及上下颚不完整的复杂情况,本文方法表现出显著优于现有技术的效果,展示了更强的泛化能力和稳定性。 展开更多
关键词 口腔正畸 三维牙齿分割 sam 图像分割
在线阅读 下载PDF
基于MSC-LSAM的多尺度交叉超声医学图像分割方法
17
作者 王朝欣 杨汶汶 +3 位作者 戎泽 李铮昱 王行 马磊 《数据采集与处理》 北大核心 2025年第2期469-484,共16页
脑卒中是全球范围内致死致残率最高的疾病之一,颈动脉狭窄和心脏病变是缺血性脑卒中的重要致病因素。超声(Ultrasound,US)是检查由颈动脉狭窄和心脏病变引起的缺血性脑卒中的常用影像学手段,但超声图像噪声多、边界模糊,具有较高的分割... 脑卒中是全球范围内致死致残率最高的疾病之一,颈动脉狭窄和心脏病变是缺血性脑卒中的重要致病因素。超声(Ultrasound,US)是检查由颈动脉狭窄和心脏病变引起的缺血性脑卒中的常用影像学手段,但超声图像噪声多、边界模糊,具有较高的分割难度。本文提出MSC⁃LSAM算法,一种多尺度交叉的双编码器超声图像分割网络,旨在实现颈动脉腔体和心脏腔体的快速、准确分割,辅助医生完成疾病诊断。MSC⁃LSAM在编码器部分并行了分割一切模型(Segment anything model,SAM)的视觉编码器和UNet编码器,在解码器部分采用UNet解码器。本研究首先冻结了预训练的SAM视觉编码器,并在Transformer层中引入高效的适配器(Adapter)块,被称可学习的分割一切模型(Learnable SAM,LSAM)。LSAM在拥有较低参数量的同时,保留学习能力和高度泛化性。然后,在UNet全局网络引入多尺度交叉注意力(Multi⁃scale cross⁃axial attention,MCA),实现多尺度特征的交叉融合,有效提升边缘分割能力,抑制模型过拟合。最后,通过高效通道注意力(Efficient channel attention,ECA)实现双编码器多尺度特征的高效融合,减少模型误分割。结果表明,本研究提出的MSC⁃LSAM在心脏超声公开数据集CAMUS和颈动脉超声自建数据集CAUS上均取得了良好的效果。CAMUS的两心腔(2CH)和四心腔(4CH)数据集分割的平均Dice相似系数(Dice similarity coefficient,DSC)分别达到0.927和0.934;CAUS数据集的平均DSC达到0.917。MSC⁃LSAM在颈动脉腔体和心脏腔体超声图像分割任务上获得了良好的分割准确度,高于主流分割算法,具有良好的应用前景。 展开更多
关键词 缺血性脑卒中 超声图像分割 分割一切模型 多尺度交叉注意力 高效通道注意力
在线阅读 下载PDF
基于伪标签去噪和SAM优化的大规模无监督语义分割
18
作者 杨维静 徐瑞 +3 位作者 顾浩文 陈涛 舒祥波 姚亚洲 《电子学报》 北大核心 2025年第3期716-727,共12页
语义分割技术能够对复杂、多元的场景实现细粒度理解,是促进无人系统高效、智能工作的关键技术之一.大规模无监督语义分割旨在从大规模未标记图像中学习语义分割能力.然而,现有方法由于自学习伪标签存在类别混淆和形状表示欠佳的问题,... 语义分割技术能够对复杂、多元的场景实现细粒度理解,是促进无人系统高效、智能工作的关键技术之一.大规模无监督语义分割旨在从大规模未标记图像中学习语义分割能力.然而,现有方法由于自学习伪标签存在类别混淆和形状表示欠佳的问题,导致最终分割精度较低.为此,本文提出一种伪标签去噪和SAM优化(Pseudo-label Denoising and SAM Optimization,PDSO)方法以解决大规模无监督语义分割问题.本文设计了一种基于去噪的特征微调模块,在基于小损失准则从大规模数据集中筛选出具有干净图像级伪标签的潜在样本后,利用这些干净样本对预训练的主干网络进行微调,使网络获得更稳健的类别表示.为了进一步减少伪标签中的类别噪声,设计了一种基于聚类的样本去噪模块,根据类别占比和样本与聚类中心之间的距离来去除干扰聚类任务的噪声样本,从而提升聚类性能.本文还设计了一种SAM提示优化模块,根据聚类距离识别出图像中的活跃类别,以过滤噪声目标,并将点和框作为SAM的目标提示信息,生成预期的目标掩膜以细化伪标签中目标的边缘.实验结果表明,在大规模语义分割数据集ImageNet-S_(50)、ImageNet-S_(300)和ImageNet-S_(919)的测试集上,本文方法在平均交并比指标上分别达到了45.0%、26.6%和14.5%,显著提高了分割目标的类别准确率和边缘精度. 展开更多
关键词 大规模无监督语义分割 图像级去噪 分割一切模型 伪标签 聚类
在线阅读 下载PDF
基于改进的SAM树冠轮廓分割
19
作者 方王俊 王山东 +1 位作者 郑帅锋 李佳云 《西北林学院学报》 北大核心 2025年第6期148-156,共9页
树冠信息的准确获取是树种分类的基本前提。针对分割一切模型(SAM)在可见光影像树冠轮廓提取时对树冠边界细节的分割不准确,存在漏分、错分等问题,设计了一种融合激光雷达三维点云数据的SAM影像树冠轮廓分割模型,以实现树冠轮廓的精细... 树冠信息的准确获取是树种分类的基本前提。针对分割一切模型(SAM)在可见光影像树冠轮廓提取时对树冠边界细节的分割不准确,存在漏分、错分等问题,设计了一种融合激光雷达三维点云数据的SAM影像树冠轮廓分割模型,以实现树冠轮廓的精细化提取。首先利用SAM分别提取可见光影像树冠轮廓和激光雷达三维点云树冠轮廓,并将合并后的树冠轮廓作为粗分割的结果。然后经过多向剖面分析、局部坐标系重定向、改进的K-means聚类等方法对树冠轮廓粗分割结果精细化。结果表明,改进后的SAM在郁闭度为80%的林区分割精度达到了86.35%,相对改进前的单影像SAM和单点云SAM分别提高了35.09%、51.75%,相对分水岭算法、SVM算法、多尺度分割算法分别提高了32.44%、16.12%、11.14%,能够很好地适应树冠轮廓在高郁闭度林区的分割任务。 展开更多
关键词 树冠轮廓分割 sam 可见光影像 激光雷达三维点云 数据融合
在线阅读 下载PDF
High-Precision Brain Tumor Segmentation using a Progressive Layered U-Net(PLU-Net)with Multi-Scale Data Augmentation and Attention Mechanisms on Multimodal Magnetic Resonance Imaging 被引量:1
20
作者 Noman Ahmed Siddiqui Muhammad Tahir Qadri +1 位作者 Muhammad Ovais Akhter Zain Anwar Ali 《Instrumentation》 2025年第1期77-92,共16页
Brain tumors present significant challenges in medical diagnosis and treatment,where early detection is crucial for reducing morbidity and mortality rates.This research introduces a novel deep learning model,the Progr... Brain tumors present significant challenges in medical diagnosis and treatment,where early detection is crucial for reducing morbidity and mortality rates.This research introduces a novel deep learning model,the Progressive Layered U-Net(PLU-Net),designed to improve brain tumor segmentation accuracy from Magnetic Resonance Imaging(MRI)scans.The PLU-Net extends the standard U-Net architecture by incorporating progressive layering,attention mechanisms,and multi-scale data augmentation.The progressive layering involves a cascaded structure that refines segmentation masks across multiple stages,allowing the model to capture features at different scales and resolutions.Attention gates within the convolutional layers selectively focus on relevant features while suppressing irrelevant ones,enhancing the model's ability to delineate tumor boundaries.Additionally,multi-scale data augmentation techniques increase the diversity of training data and boost the model's generalization capabilities.Evaluated on the BraTS 2021 dataset,the PLU-Net achieved state-of-the-art performance with a dice coefficient of 0.91,specificity of 0.92,sensitivity of 0.89,Hausdorff95 of 2.5,outperforming other modified U-Net architectures in segmentation accuracy.These results underscore the effectiveness of the PLU-Net in improving brain tumor segmentation from MRI scans,supporting clinicians in early diagnosis,treatment planning,and the development of new therapies. 展开更多
关键词 brain tumor segmentation MRI machine learning BraTS deep learning model PLU-Net
原文传递
上一页 1 2 250 下一页 到第
使用帮助 返回顶部