期刊文献+
共找到5,162篇文章
< 1 2 250 >
每页显示 20 50 100
A medical image segmentation model based on SAM with an integrated local multi-scale feature encoder
1
作者 DI Jing ZHU Yunlong LIANG Chan 《Journal of Measurement Science and Instrumentation》 2025年第3期359-370,共12页
Despite its remarkable performance on natural images,the segment anything model(SAM)lacks domain-specific information in medical imaging.and faces the challenge of losing local multi-scale information in the encoding ... Despite its remarkable performance on natural images,the segment anything model(SAM)lacks domain-specific information in medical imaging.and faces the challenge of losing local multi-scale information in the encoding phase.This paper presents a medical image segmentation model based on SAM with a local multi-scale feature encoder(LMSFE-SAM)to address the issues above.Firstly,based on the SAM,a local multi-scale feature encoder is introduced to improve the representation of features within local receptive field,thereby supplying the Vision Transformer(ViT)branch in SAM with enriched local multi-scale contextual information.At the same time,a multiaxial Hadamard product module(MHPM)is incorporated into the local multi-scale feature encoder in a lightweight manner to reduce the quadratic complexity and noise interference.Subsequently,a cross-branch balancing adapter is designed to balance the local and global information between the local multi-scale feature encoder and the ViT encoder in SAM.Finally,to obtain smaller input image size and to mitigate overlapping in patch embeddings,the size of the input image is reduced from 1024×1024 pixels to 256×256 pixels,and a multidimensional information adaptation component is developed,which includes feature adapters,position adapters,and channel-spatial adapters.This component effectively integrates the information from small-sized medical images into SAM,enhancing its suitability for clinical deployment.The proposed model demonstrates an average enhancement ranging from 0.0387 to 0.3191 across six objective evaluation metrics on BUSI,DDTI,and TN3K datasets compared to eight other representative image segmentation models.This significantly enhances the performance of the SAM on medical images,providing clinicians with a powerful tool in clinical diagnosis. 展开更多
关键词 segment anything model(sam) medical image segmentation ENCODER decoder multiaxial Hadamard product module(MHPM) cross-branch balancing adapter
在线阅读 下载PDF
Accelerated optical remote sensing mapping of oil spills in the China Seas using the Segment Anything Model
2
作者 Hang Lv Yingcheng Lu +5 位作者 Lifeng Wang Shuxian Song Wei Zhao Yanlong Chen Yuntao Wang Qingjun Song 《Acta Oceanologica Sinica》 2025年第10期184-197,共14页
Efficient segmentation of oiled pixels in optical remotely sensed images is the precondition of optical identification and classification of different spilled oils,which remains one of the keys to optical remote sensi... Efficient segmentation of oiled pixels in optical remotely sensed images is the precondition of optical identification and classification of different spilled oils,which remains one of the keys to optical remote sensing of oil spills.Optical remotely sensed images of oil spills are inherently multidimensional and embedded with a complex knowledge framework.This complexity often hinders the effectiveness of mechanistic algorithms across varied scenarios.Although optical remote-sensing theory for oil spills has advanced,the scarcity of curated datasets and the difficulty of collecting them limit their usefulness for training deep learning models.This study introduces a data expansion strategy that utilizes the Segment Anything Model(SAM),effectively bridging the gap between traditional mechanism algorithms and emergent self-adaptive deep learning models.Optical dimension reduction is achieved through standardized preprocessing processes that address the decipherable properties of the input image.After preprocessing,SAM can swiftly and accurately segment spilled oil in images.The unified AI-based workflow significantly accelerates labeled-dataset creation and has proven effective for both rapid emergency intelligence during spill incidents and the rapid mapping and classification of oil footprints across China’s coastal waters.Our results show that coupling a remote sensing mechanism with a foundation model enables near-real-time,large-scale monitoring of complex surface slicks and offers guidance for the next generation of detection and quantification algorithms. 展开更多
关键词 marine oil spills optical remote sensing segment anything model extract oil footprint spatiotemporal distribution
在线阅读 下载PDF
基于SAM的水陆两栖环境感知微调策略与应用
3
作者 左哲 蓝鸿 +1 位作者 覃卫 王坤 《北京理工大学学报》 北大核心 2026年第1期20-28,共9页
针对水陆两栖无人平台在不确定环境中面临的高误报率及多感知任务整合困难的问题,本研究提出了一种基于分割一切模型(segment anything model,SAM)的多模型联合环境感知方法,实现了障碍物检测与水陆域分割的统一处理.具体而言,是将U-Net... 针对水陆两栖无人平台在不确定环境中面临的高误报率及多感知任务整合困难的问题,本研究提出了一种基于分割一切模型(segment anything model,SAM)的多模型联合环境感知方法,实现了障碍物检测与水陆域分割的统一处理.具体而言,是将U-Net和YOLOv8与SAM结合,U-Net和YOLOv8负责获取目标的粗略轮廓,而SAM通过其编码−解码结构实现进一步精细分割.此外,设计了专门的微调策略以实现联合训练,进一步提升了模型的性能.本研究还构建了专有数据集USV-Dataset,并开发了数据引擎以提高标注效率.为增强模型的泛化能力,采用了4个公开数据集与USV-Dataset进行混合训练,涵盖了多样化的场景和障碍物类别.实验结果表明,该方法实现了96.8%的mPA分割精度和10 FPS的推理速度,展现出良好的泛化能力,能够满足中低速两栖无人平台的实时环境感知需求. 展开更多
关键词 水陆两栖平台 环境感知 sam 多模型融合
在线阅读 下载PDF
Optimizing zero-shot text-based segmentation of remote sensing imagery using SAM and Grounding DINO
4
作者 Mohanad Diab Polychronis Kolokoussis Maria Antonia Brovelli 《Artificial Intelligence in Geosciences》 2025年第1期14-24,共11页
The use of AI technologies in remote sensing(RS)tasks has been the focus of many individuals in both the professional and academic domains.Having more accessible interfaces and tools that allow people of little or no ... The use of AI technologies in remote sensing(RS)tasks has been the focus of many individuals in both the professional and academic domains.Having more accessible interfaces and tools that allow people of little or no experience to intuitively interact with RS data of multiple formats is a potential provided by this integration.However,the use of AI and AI agents to help automate RS-related tasks is still in its infancy stage,with some frameworks and interfaces built on top of well-known vision language models(VLM)such as GPT-4,segment anything model(SAM),and grounding DINO.These tools do promise and draw guidelines on the potentials and limitations of existing solutions concerning the use of said models.In this work,the state of the art AI foundation models(FM)are reviewed and used in a multi-modal manner to ingest RS imagery input and perform zero-shot object detection using natural language.The natural language input is then used to define the classes or labels the model should look for,then,both inputs are fed to the pipeline.The pipeline presented in this work makes up for the shortcomings of the general knowledge FMs by stacking pre-processing and post-processing applications on top of the FMs;these applications include tiling to produce uniform patches of the original image for faster detection,outlier rejection of redundant bounding boxes using statistical and machine learning methods.The pipeline was tested with UAV,aerial and satellite images taken over multiple areas.The accuracy for the semantic segmentation showed improvement from the original 64%to approximately 80%-99%by utilizing the pipeline and techniques proposed in this work.GitHub Repository:MohanadDiab/LangRS. 展开更多
关键词 Foundation models Multi-modal models Vision language models Semantic segmentation segment anything model Earth observation Remote sensing
在线阅读 下载PDF
Pre-trained SAM as data augmentation for image segmentation
5
作者 Junjun Wu Yunbo Rao +1 位作者 Shaoning Zeng Bob Zhang 《CAAI Transactions on Intelligence Technology》 2025年第1期268-282,共15页
Data augmentation plays an important role in training deep neural model by expanding the size and diversity of the dataset.Initially,data augmentation mainly involved some simple transformations of images.Later,in ord... Data augmentation plays an important role in training deep neural model by expanding the size and diversity of the dataset.Initially,data augmentation mainly involved some simple transformations of images.Later,in order to increase the diversity and complexity of data,more advanced methods appeared and evolved to sophisticated generative models.However,these methods required a mass of computation of training or searching.In this paper,a novel training-free method that utilises the Pre-Trained Segment Anything Model(SAM)model as a data augmentation tool(PTSAM-DA)is proposed to generate the augmented annotations for images.Without the need for training,it obtains prompt boxes from the original annotations and then feeds the boxes to the pre-trained SAM to generate diverse and improved annotations.In this way,annotations are augmented more ingenious than simple manipulations without incurring huge computation for training a data augmentation model.Multiple comparative experiments on three datasets are conducted,including an in-house dataset,ADE20K and COCO2017.On this in-house dataset,namely Agricultural Plot Segmentation Dataset,maximum improvements of 3.77%and 8.92%are gained in two mainstream metrics,mIoU and mAcc,respectively.Consequently,large vision models like SAM are proven to be promising not only in image segmentation but also in data augmentation. 展开更多
关键词 data augmentation image segmentation large model segment anything model
在线阅读 下载PDF
Segmentation of CAD models using hybrid representation
6
作者 Claude UWIMANA Shengdi ZHOU +4 位作者 Limei YANG Zhuqing LI Norbelt MUTAGISHA Edouard NIYONGABO Bin ZHOU 《虚拟现实与智能硬件(中英文)》 2025年第2期188-202,共15页
In this paper,we introduce an innovative method for computer-aided design(CAD)segmentation by concatenating meshes and CAD models.Many previous CAD segmentation methods have achieved impressive performance using singl... In this paper,we introduce an innovative method for computer-aided design(CAD)segmentation by concatenating meshes and CAD models.Many previous CAD segmentation methods have achieved impressive performance using single representations,such as meshes,CAD,and point clouds.However,existing methods cannot effectively combine different three-dimensional model types for the direct conversion,alignment,and integrity maintenance of geometric and topological information.Hence,we propose an integration approach that combines the geometric accuracy of CAD data with the flexibility of mesh representations,as well as introduce a unique hybrid representation that combines CAD and mesh models to enhance segmentation accuracy.To combine these two model types,our hybrid system utilizes advanced-neural-network techniques to convert CAD models into mesh models.For complex CAD models,model segmentation is crucial for model retrieval and reuse.In partial retrieval,it aims to segment a complex CAD model into several simple components.The first component of our hybrid system involves advanced mesh-labeling algorithms that harness the digitization of CAD properties to mesh models.The second component integrates labelled face features for CAD segmentation by leveraging the abundant multisemantic information embedded in CAD models.This combination of mesh and CAD not only refines the accuracy of boundary delineation but also provides a comprehensive understanding of the underlying object semantics.This study uses the Fusion 360 Gallery dataset.Experimental results indicate that our hybrid method can segment these models with higher accuracy than other methods that use single representations. 展开更多
关键词 B-RepNet hybrid segmentation CAD models classification MeshCNN MeshCAD-Net
在线阅读 下载PDF
MSAMamba-UNet:A Lightweight Multi-Scale Adaptive Mamba Network for Skin Lesion Segmentation
7
作者 Shouming Hou Jianchao Hou +2 位作者 Yuteng Pang Aoyu Xia Beibei Hou 《Journal of Bionic Engineering》 2025年第6期3209-3225,共17页
Segmenting skin lesions is critical for early skin cancer detection.Existing CNN and Transformer-based methods face challenges such as high computational complexity and limited adaptability to variations in lesion siz... Segmenting skin lesions is critical for early skin cancer detection.Existing CNN and Transformer-based methods face challenges such as high computational complexity and limited adaptability to variations in lesion sizes.To overcome these limitations,we introduce MSAMamba-UNet,a lightweight model that integrates two novel architectures:Multi-Scale Mamba(MSMamba)and Adaptive Dynamic Gating Block(ADGB).MSMamba utilizes multi-scale decomposition and a parallel hierarchical structure to enhance the delineation of irregular lesion boundaries and sensitivity to small targets.ADGB dynamically selects convolutional kernels with varying receptive fields based on input features,improving the model’s capacity to accommodate diverse lesion textures and scales.Additionally,we introduce a Mix Attention Fusion Block(MAF)to enhance shallow feature representation by integrating parallel channel and pixel attention mechanisms.Extensive evaluation of MSAMamba-UNet on the ISIC 2016,ISIC 2017,and ISIC 2018 datasets demonstrates competitive segmentation accuracy with only 0.056 M parameters and 0.069 GFLOPs.Our experiments revealed that MSAMamba-UNet achieved IoU scores of 85.53%,85.47%,and 82.22%,as well as DSC scores of 92.20%,92.17%,and 90.24%,respectively.These results underscore the lightweight design and effectiveness of MSAMamba-UNet. 展开更多
关键词 TRANSFORMER segmenting skin lesions Mamba Lightweight model MULTI-SCALE
在线阅读 下载PDF
Dual-Stream Attention-Based Classification Network for Tibial Plateau Fractures via Diffusion Model Augmentation and Segmentation Map Integration
8
作者 Yi Xie Zhi-wei Hao +8 位作者 Xin-meng Wang Hong-lin Wang Jia-ming Yang Hong Zhou Xu-dong Wang Jia-yao Zhang Hui-wen Yang Peng-ran Liu Zhe-wei Ye 《Current Medical Science》 2025年第1期57-69,共13页
Objective This study aimed to explore a novel method that integrates the segmentation guidance classification and the dif-fusion model augmentation to realize the automatic classification for tibial plateau fractures(... Objective This study aimed to explore a novel method that integrates the segmentation guidance classification and the dif-fusion model augmentation to realize the automatic classification for tibial plateau fractures(TPFs).Methods YOLOv8n-cls was used to construct a baseline model on the data of 3781 patients from the Orthopedic Trauma Center of Wuhan Union Hospital.Additionally,a segmentation-guided classification approach was proposed.To enhance the dataset,a diffusion model was further demonstrated for data augmentation.Results The novel method that integrated the segmentation-guided classification and diffusion model augmentation sig-nificantly improved the accuracy and robustness of fracture classification.The average accuracy of classification for TPFs rose from 0.844 to 0.896.The comprehensive performance of the dual-stream model was also significantly enhanced after many rounds of training,with both the macro-area under the curve(AUC)and the micro-AUC increasing from 0.94 to 0.97.By utilizing diffusion model augmentation and segmentation map integration,the model demonstrated superior efficacy in identifying SchatzkerⅠ,achieving an accuracy of 0.880.It yielded an accuracy of 0.898 for SchatzkerⅡandⅢand 0.913 for SchatzkerⅣ;for SchatzkerⅤandⅥ,the accuracy was 0.887;and for intercondylar ridge fracture,the accuracy was 0.923.Conclusion The dual-stream attention-based classification network,which has been verified by many experiments,exhibited great potential in predicting the classification of TPFs.This method facilitates automatic TPF assessment and may assist surgeons in the rapid formulation of surgical plans. 展开更多
关键词 Artificial intelligence YOLOv8 Tibial plateau fracture Diffusion model augmentation segmentation map
暂未订购
The detection of keratoconus using a three-dimensional corneal model derived from anterior segment optical coherence tomography
9
作者 Sang Ngoc Tran Isa S.K.Mohammed +1 位作者 Zeshan Tariq Wuqaas M.Munir 《Annals of Eye Science》 2025年第3期73-82,共10页
Background:Traditional imaging approaches to keratoconus(KCN)have thus far failed to produce a standardized approach for diagnosis.While many diagnostic modalities and metrics exist,none have proven robust enough to b... Background:Traditional imaging approaches to keratoconus(KCN)have thus far failed to produce a standardized approach for diagnosis.While many diagnostic modalities and metrics exist,none have proven robust enough to be considered a gold standard.This study aims to introduce novel metrics to differentiate between KCN and healthy corneas using three-dimensional(3D)measurements of surface area and volume.Methods:This retrospective observational study examined KCN patients along with healthy control patients between the ages of 20 and 79 years old at the University of Maryland,Baltimore.The selected patients underwent a nine-line raster scan anterior segment optical coherence tomography(AS-OCT).ImageJ was used to determine the central 6 mm of each image and each corneal image was then divided into six 1 mm segments.Free-D software was then used to render the nine different images into a 3D model to calculate corneal surface area and volume.A two-tailed Mann-Whitney test was used to assess statistical significance when comparing these subsets.Results:Thirty-three eyes with KCN,along with 33 healthy control,were enrolled.There were statistically significant differences between the healthy and KCN groups in the metric of anterior corneal surface area(13.927 vs.13.991 mm^(2),P=0.046),posterior corneal surface area(14.045 vs.14.173 mm^(2),P<0.001),and volume(8.430 vs.7.773 mm3,P<0.001)within the central 6 mm.Conclusions:3D corneal models derived from AS-OCT can be used to measure anterior corneal surface area,posterior corneal surface area,and corneal volume.All three parameters are statistically different between corneas with KCN and healthy corneas.Further study and application of these parameters may yield new methodologies for the detection of KCN. 展开更多
关键词 CORNEA ECTASIA keratoconus(KCN) anterior segment optical coherence tomography(AS-OCT) three-dimensional model(3D model)
暂未订购
Deep Learning for Brain Tumor Segmentation and Classification: A Systematic Review of Methods and Trends
10
作者 Ameer Hamza Robertas Damaševicius 《Computers, Materials & Continua》 2026年第1期132-172,共41页
This systematic review aims to comprehensively examine and compare deep learning methods for brain tumor segmentation and classification using MRI and other imaging modalities,focusing on recent trends from 2022 to 20... This systematic review aims to comprehensively examine and compare deep learning methods for brain tumor segmentation and classification using MRI and other imaging modalities,focusing on recent trends from 2022 to 2025.The primary objective is to evaluate methodological advancements,model performance,dataset usage,and existing challenges in developing clinically robust AI systems.We included peer-reviewed journal articles and highimpact conference papers published between 2022 and 2025,written in English,that proposed or evaluated deep learning methods for brain tumor segmentation and/or classification.Excluded were non-open-access publications,books,and non-English articles.A structured search was conducted across Scopus,Google Scholar,Wiley,and Taylor&Francis,with the last search performed in August 2025.Risk of bias was not formally quantified but considered during full-text screening based on dataset diversity,validation methods,and availability of performance metrics.We used narrative synthesis and tabular benchmarking to compare performance metrics(e.g.,accuracy,Dice score)across model types(CNN,Transformer,Hybrid),imaging modalities,and datasets.A total of 49 studies were included(43 journal articles and 6 conference papers).These studies spanned over 9 public datasets(e.g.,BraTS,Figshare,REMBRANDT,MOLAB)and utilized a range of imaging modalities,predominantly MRI.Hybrid models,especially ResViT and UNetFormer,consistently achieved high performance,with classification accuracy exceeding 98%and segmentation Dice scores above 0.90 across multiple studies.Transformers and hybrid architectures showed increasing adoption post2023.Many studies lacked external validation and were evaluated only on a few benchmark datasets,raising concerns about generalizability and dataset bias.Few studies addressed clinical interpretability or uncertainty quantification.Despite promising results,particularly for hybrid deep learning models,widespread clinical adoption remains limited due to lack of validation,interpretability concerns,and real-world deployment barriers. 展开更多
关键词 Brain tumor segmentation brain tumor classification deep learning vision transformers hybrid models
在线阅读 下载PDF
EFFECTIVE IMAGE SEGMENTATION FRAMEWORK FOR GAUSSIAN MIXTURE MODEL INCORPORATING LOCAL INFORMATION 被引量:3
11
作者 蔡维玲 丁军娣 《Transactions of Nanjing University of Aeronautics and Astronautics》 EI 2008年第4期266-274,共9页
A new two-step framework is proposed for image segmentation. In the first step, the gray-value distribution of the given image is reshaped to have larger inter-class variance and less intra-class variance. In the sec-... A new two-step framework is proposed for image segmentation. In the first step, the gray-value distribution of the given image is reshaped to have larger inter-class variance and less intra-class variance. In the sec- ond step, the discriminant-based methods or clustering-based methods are performed on the reformed distribution. It is focused on the typical clustering methods-Gaussian mixture model (GMM) and its variant to demonstrate the feasibility of the framework. Due to the independence of the first step in its second step, it can be integrated into the pixel-based and the histogram-based methods to improve their segmentation quality. The experiments on artificial and real images show that the framework can achieve effective and robust segmentation results. 展开更多
关键词 pattern recognition image processing image segmentation Gaussian mixture model (GMM) expectation maximization (EM)
在线阅读 下载PDF
基于Stone-SAM的便携式粗集料级配智能检测 被引量:1
12
作者 张鸿 杨俊雅 +2 位作者 刘可心 张益鹏 程雪聪 《建筑材料学报》 北大核心 2025年第6期581-590,共10页
为实现精确的粗集料级配检测,提出了一种便携式粗集料级配智能检测方法。采用知识蒸馏的策略对视觉大模型——分割一切模型(SAM)进行网络结构轻量化,嵌入神经网络分类器PP-HGNetV2为模型提供语义判断的能力,设计粗集料颗粒特征参数数学... 为实现精确的粗集料级配检测,提出了一种便携式粗集料级配智能检测方法。采用知识蒸馏的策略对视觉大模型——分割一切模型(SAM)进行网络结构轻量化,嵌入神经网络分类器PP-HGNetV2为模型提供语义判断的能力,设计粗集料颗粒特征参数数学表征算法,开发移动端应用程序,实现粗集料级配高通量检测。对5种粗集料级配场景进行测试。结果表明:本研究方法对于粗集料颗粒的分割精度高于原始SAM模型,并且能够精确去除背景信息,粗集料颗粒关键参数提取结果准确可靠。 展开更多
关键词 分割一切模型(sam) 粗集料级配 智能检测 移动端 工程检测
在线阅读 下载PDF
光影智绘:基于SAM的视频阴影鲁棒抽取
13
作者 陈东 李昌隆 +2 位作者 杜振龙 宋爽 李晓丽 《图学学报》 北大核心 2025年第4期739-745,共7页
针对传统方法对于光照变化和物体遮挡引起复杂的、动态变化阴影处理易致阴影检测的准确率和鲁棒性较低问题,提出了一种基于分割万物模型(SAM)的视频阴影检测方法,对SAM解码器进行微调,使其更适合阴影检测;利用SAM提取关键帧阴影区域,引... 针对传统方法对于光照变化和物体遮挡引起复杂的、动态变化阴影处理易致阴影检测的准确率和鲁棒性较低问题,提出了一种基于分割万物模型(SAM)的视频阴影检测方法,对SAM解码器进行微调,使其更适合阴影检测;利用SAM提取关键帧阴影区域,引入XMem模型,结合感觉记忆、短时记忆和长时记忆联合前后帧信息,给出优化和稳定视频阴影检测结果。实验结果表明:在ViSha数据集的阴影实验结果与传统方法相比,该方法的均值绝对误差降低了约31.8%,交并比提升了约19.7%;定性和定量结果表明本方法不仅提升了视频阴影处理的准确率,并表现出较好的鲁棒性。 展开更多
关键词 阴影检测 语义分割 视频对象分割 sam XMem
在线阅读 下载PDF
SABM:一种蝴蝶生态图像分割的增强SAM模型
14
作者 谢娟英 兰翔 许升全 《陕西师范大学学报(自然科学版)》 北大核心 2025年第6期1-14,共14页
通过分割生态图像中蝴蝶获得蝴蝶掩码是基于生态图像的蝴蝶物种自动化识别的基础,因此研究蝴蝶生态图像分割有重要意义。然而,现有蝴蝶生态图像存在数据集样本量小、蝴蝶拟态、翅膀遮挡等问题,使现有深度网络难以训练出具有良好泛化能... 通过分割生态图像中蝴蝶获得蝴蝶掩码是基于生态图像的蝴蝶物种自动化识别的基础,因此研究蝴蝶生态图像分割有重要意义。然而,现有蝴蝶生态图像存在数据集样本量小、蝴蝶拟态、翅膀遮挡等问题,使现有深度网络难以训练出具有良好泛化能力的分割模型。为此,通过改进SAM(segment anything model)模型,提出一种鲁棒的蝴蝶生态图像分割新模型SABM(segment any butterfly model)。SABM模型通过引入双路卷积模块、蝴蝶词元(butterfly token)及一个3层MLP(multi-layer perceptron)使模型具有更好的特征学习能力。707张蝴蝶生态图像数据集的2折交叉验证实验表明,SABM模型对蝴蝶生态图像的分割能力超越了SAM及其现有的改进SOTA模型。7645张全新蝴蝶生态图像数据集的分割实验测试发现,SABM模型具有非常好的泛化性能,对7645张全新蝴蝶生态图像的蝴蝶实现了非常好的分割。该分割结果为未来的蝴蝶生态图像分割研究提供了10倍于现有数据的大数据集,为野外环境下的蝴蝶物种自动识别提供了更好的可用数据,也为测试聚类算法性能提供了富有挑战性的数据集。另外,还在医学图像数据测试了SABM模型的鲁棒性。 展开更多
关键词 蝴蝶分割 双路卷积 sam SABM 图像分割
在线阅读 下载PDF
针对SAM下游模型脆弱模块的对抗迁移攻击
15
作者 丁熠 林能健 +2 位作者 蒋昀陶 钟宇浩 曹明生 《计算机研究与发展》 北大核心 2025年第10期2455-2467,共13页
SAM(segment anything model)作为一种通用的视觉基础模型,已被广泛应用于多种图像分割任务,但其在对抗性攻击面前表现出脆弱性.提出一种针对SAM下游模型脆弱模块的对抗迁移攻击方法FSGR(fragile section gradient robustness).该方法... SAM(segment anything model)作为一种通用的视觉基础模型,已被广泛应用于多种图像分割任务,但其在对抗性攻击面前表现出脆弱性.提出一种针对SAM下游模型脆弱模块的对抗迁移攻击方法FSGR(fragile section gradient robustness).该方法在无需知晓下游微调细节的前提下,可有效生成对抗样本,实现对SAM下游模型的攻击.该方法运用“脆弱层精准定位+局部强化迁移”策略,通过特征相似度筛选出跨任务共享且最易被激活的模块,针对性地强化攻击效果;同时,引入梯度稳健损失以消除目标模型与下游任务模型间的梯度差异. FSGR方法融合了脆弱层攻击与梯度稳健损失机制,在多个数据集上均实现了相对性能的提升.实验结果表明,FSGR在多种微调模型(如医学分割、阴影分割和伪装分割)的迁移攻击中显著降低了模型性能,证明了其正确性和实用性.与基线方法相比,FSGR不仅在攻击成功率上表现出色,还通过结合脆弱层攻击和梯度稳健损失,实现了相对性能的提升. 展开更多
关键词 图像分割 对抗攻击 迁移攻击 特征相似度 模型鲁棒性
在线阅读 下载PDF
Multi-resolution image segmentation based on Gaussian mixture model 被引量:5
16
作者 Tang Yinggan Liu Dong Guan Xinping 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2006年第4期870-874,共5页
Mixture model based image segmentation method, which assumes that image pixels are independent and do not consider the position relationship between pixels, is not robust to noise and usually leads to misclassificatio... Mixture model based image segmentation method, which assumes that image pixels are independent and do not consider the position relationship between pixels, is not robust to noise and usually leads to misclassification. A new segmentation method, called multi-resolution Ganssian mixture model method, is proposed. First, an image pyramid is constructed and son-father link relationship is built between each level of pyramid. Then the mixture model segmentation method is applied to the top level. The segmentation result on the top level is passed top-down to the bottom level according to the son-father link relationship between levels. The proposed method considers not only local but also global information of image, it overcomes the effect of noise and can obtain better segmentation result. Experimental result demonstrates its effectiveness. 展开更多
关键词 image segmentation MULTI-RESOLUTION Ganssian mixture model.
在线阅读 下载PDF
Color Image Segmentation Based on HSI Model 被引量:6
17
作者 章毓晋 《High Technology Letters》 EI CAS 1998年第1期30-33,共4页
he objective of the research is to develop a fast procedure for segmenting typical videophone images. In this paper, a new approach to color image segmentation based on HSI(Hue, Saturation, Intensity) color model is r... he objective of the research is to develop a fast procedure for segmenting typical videophone images. In this paper, a new approach to color image segmentation based on HSI(Hue, Saturation, Intensity) color model is reported. It is in contrast to the conventional approaches by using the three components of HSI color model in succession. This strategy makes the segmentation procedure much fast and effective. Experimental results with typical “headandshoulders” real images taken from videophone sequences show that the new appproach can fulfill the application requirements. 展开更多
关键词 modelbased CODING HSI COLOR model COLOR transformation IMAGE segmentATION
在线阅读 下载PDF
基于SAM优化的饲喂目标实时识别方法
18
作者 张勤 翁凯航 《华南理工大学学报(自然科学版)》 北大核心 2025年第7期60-69,共10页
饲喂辅助机器人是推动畜牧业现代化转型的关键设备,饲喂目标的快速、准确识别是机器人实现智能推料的重要保证。匹配分割精度和运行效率是保证算法综合性能的关键步骤,也是识别算法的重要课题。针对现有奶牛饲喂目标识别方法存在分割精... 饲喂辅助机器人是推动畜牧业现代化转型的关键设备,饲喂目标的快速、准确识别是机器人实现智能推料的重要保证。匹配分割精度和运行效率是保证算法综合性能的关键步骤,也是识别算法的重要课题。针对现有奶牛饲喂目标识别方法存在分割精度和运行效率不匹配的问题,该文提出了一种基于分割大模型(SAM)优化的饲喂目标实时识别方法RTFTR。该方法首先在SAM-det架构基础上,通过轻量化图像编码器和目标检测器的参数,引入缓冲区队列的并行化设计方法来平衡各模块的运行效率,以提升推理速率;然后利用HQ形符增强特征空间的解码能力,优化设计掩码解码器,并采用针对饲喂目标的分阶段训练,以提高分割精度。实验结果表明:所提方法在提高分割精度的前提下保证了推理速率;在奶牛饲喂目标识别中,奶牛分割精度达98.7%,饲料分割精度达96.4%,料槽分割精度达99.2%,整体平均分割精度达98.1%,运行速率为52.9 f/s,满足养殖场复杂环境和机器人计算资源限制下对奶牛饲喂目标识别方法的高精度、高效率的应用需求。 展开更多
关键词 饲喂辅助机器人 分割大模型 奶牛饲喂 目标识别 分割精度
在线阅读 下载PDF
Fully Automatic Segmentation of Gynaecological Abnormality Using a New Viola–Jones Model 被引量:6
19
作者 Ihsan Jasim Hussein M.A.Burhanuddin +4 位作者 Mazin Abed Mohammed Mohamed Elhoseny Begonya Garcia-Zapirain Marwah Suliman Maashi Mashael S.Maashi 《Computers, Materials & Continua》 SCIE EI 2021年第3期3161-3182,共22页
One of the most complex tasks for computer-aided diagnosis(Intelligent decision support system)is the segmentation of lesions.Thus,this study proposes a new fully automated method for the segmentation of ovarian and b... One of the most complex tasks for computer-aided diagnosis(Intelligent decision support system)is the segmentation of lesions.Thus,this study proposes a new fully automated method for the segmentation of ovarian and breast ultrasound images.The main contributions of this research is the development of a novel Viola–James model capable of segmenting the ultrasound images of breast and ovarian cancer cases.In addition,proposed an approach that can efficiently generate region-of-interest(ROI)and new features that can be used in characterizing lesion boundaries.This study uses two databases in training and testing the proposed segmentation approach.The breast cancer database contains 250 images,while that of the ovarian tumor has 100 images obtained from several hospitals in Iraq.Results of the experiments showed that the proposed approach demonstrates better performance compared with those of other segmentation methods used for segmenting breast and ovarian ultrasound images.The segmentation result of the proposed system compared with the other existing techniques in the breast cancer data set was 78.8%.By contrast,the segmentation result of the proposed system in the ovarian tumor data set was 79.2%.In the classification results,we achieved 95.43%accuracy,92.20%sensitivity,and 97.5%specificity when we used the breast cancer data set.For the ovarian tumor data set,we achieved 94.84%accuracy,96.96%sensitivity,and 90.32%specificity. 展开更多
关键词 Viola-Jones model breast cancer segmentation ovarian tumor ovarian tumor segmentation breast cancer ultrasound images active contour cascade model
在线阅读 下载PDF
Color-texture segmentation using JSEG based on Gaussian mixture modeling 被引量:4
20
作者 Wang Yuzhong Yang Jie Zhou Yue 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2006年第1期24-29,共6页
An improved approach for J-value segmentation (JSEG) is presented for unsupervised color image segmentation. Instead of color quantization algorithm, an automatic classification method based on adaptive mean shift ... An improved approach for J-value segmentation (JSEG) is presented for unsupervised color image segmentation. Instead of color quantization algorithm, an automatic classification method based on adaptive mean shift (AMS) based clustering is used for nonparametric clustering of image data set. The clustering results are used to construct Gaussian mixture modelling (GMM) of image data for the calculation of soft J value. The region growing algorithm used in JSEG is then applied in segmenting the image based on the multiscale soft J-images. Experiments show that the synergism of JSEG and the soft classification based on AMS based clustering and GMM overcomes the limitations of JSEG successfully and is more robust. 展开更多
关键词 color image segmentation JSEG adaptive mean shift based dustering Gaussian mixture modeling soft J-value.
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部