23 articles found
Automated labeling and segmentation based on segment anything model: Quantitative analysis of bubbles in gas-liquid flow
1
Authors: Jia-Bin Shi, Li-Jun You, Jia-Chen Dang, Yi-Jun Wang, Wei Gong, Bo Peng. Petroleum Science, 2025, Issue 12, pp. 5212-5227 (16 pages)
The quantitative analysis of dispersed phases (bubbles, droplets, and particles) in multiphase flow systems represents a persistent technological challenge in petroleum engineering applications, including CO2-enhanced oil recovery, foam flooding, and unconventional reservoir development. Current characterization methods remain constrained by labor-intensive manual workflows and limited dynamic analysis capabilities, particularly for processing large-scale microscopy data and video sequences that capture critical transient behavior like gas cluster migration and droplet coalescence. These limitations hinder the establishment of robust correlations between pore-scale flow patterns and reservoir-scale production performance. This study introduces a novel computer vision framework that integrates foundation models with lightweight neural networks to address these industry challenges. Leveraging the segment anything model's zero-shot learning capability, we developed an automated workflow that achieves an efficiency improvement of approximately 29 times in bubble labeling compared to manual methods while maintaining less than 2% deviation from expert annotations. Engineering-oriented optimization ensures lightweight deployment with 94% segmentation accuracy, while the integrated quantification system precisely resolves gas saturation, shape factors, and interfacial dynamics, parameters critical for optimizing gas injection strategies and predicting phase redistribution patterns. Validated through microfluidic gas-liquid displacement experiments for discontinuous phase segmentation accuracy, this methodology enables precise bubble morphology quantification with broad application potential in multiphase systems, including emulsion droplet dynamics characterization and particle transport behavior analysis. This work bridges the critical gap between pore-scale dynamics characterization and reservoir-scale simulation requirements, providing a foundational framework for intelligent flow diagnostics and predictive modeling in next-generation digital oilfield systems.
Keywords: Dispersed phases; Bubble segmentation; Microfluidic system; Segment anything model; Gas-liquid flow; Artificial intelligence
Accelerated optical remote sensing mapping of oil spills in the China Seas using the Segment Anything Model
2
Authors: Hang Lv, Yingcheng Lu, Lifeng Wang, Shuxian Song, Wei Zhao, Yanlong Chen, Yuntao Wang, Qingjun Song. Acta Oceanologica Sinica, 2025, Issue 10, pp. 184-197 (14 pages)
Efficient segmentation of oiled pixels in optical remotely sensed images is the precondition of optical identification and classification of different spilled oils, which remains one of the keys to optical remote sensing of oil spills. Optical remotely sensed images of oil spills are inherently multidimensional and embedded with a complex knowledge framework. This complexity often hinders the effectiveness of mechanistic algorithms across varied scenarios. Although optical remote-sensing theory for oil spills has advanced, the scarcity of curated datasets and the difficulty of collecting them limit their usefulness for training deep learning models. This study introduces a data expansion strategy that utilizes the Segment Anything Model (SAM), effectively bridging the gap between traditional mechanism algorithms and emergent self-adaptive deep learning models. Optical dimension reduction is achieved through standardized preprocessing that addresses the decipherable properties of the input image. After preprocessing, SAM can swiftly and accurately segment spilled oil in images. The unified AI-based workflow significantly accelerates labeled-dataset creation and has proven effective for both rapid emergency intelligence during spill incidents and the rapid mapping and classification of oil footprints across China’s coastal waters. Our results show that coupling a remote sensing mechanism with a foundation model enables near-real-time, large-scale monitoring of complex surface slicks and offers guidance for the next generation of detection and quantification algorithms.
Keywords: marine oil spills; optical remote sensing; segment anything model; extract oil footprint; spatiotemporal distribution
PASS-SAM: Integration of Segment Anything Model for Large-Scale Unsupervised Semantic Segmentation
3
Authors: Yin Tang, Rui Chen, Gensheng Pei, Qiong Wang. Computational Visual Media, 2025, Issue 3, pp. 669-674 (6 pages)
Large-scale unsupervised semantic segmentation (LUSS) is a sophisticated process that aims to segment similar areas within an image without relying on labeled training data. While existing methodologies have made substantial progress in this area, there is ample scope for enhancement. We thus introduce the PASS-SAM model, a comprehensive solution that amalgamates the benefits of various models to improve segmentation performance.
Keywords: segmentation performance; amalgamates benefits; various models; segment anything model; PASS-SAM model; segment similar areas; large-scale unsupervised semantic segmentation
When segment anything model meets inventorying of roadway assets
4
Authors: Chenyu Zhang, Sam Huang, Ruwen Qin. International Journal of Transportation Science and Technology, 2025, Issue 4, pp. 1-14 (14 pages)
Automating the identification, localization, and monitoring of roadway assets distributed widely in the roadway network is critical for the traffic management system. It can efficiently provide up-to-date information in supporting transportation asset management (TAM). Collecting videos with vehicle-mounted cameras and processing the data with computer vision (CV)-based deep learning methods is garnering increased attention from transportation agencies. While promising, challenges arise due to the lack of high-quality annotations for roadway assets in images, difficulties in identifying these assets, and limited solutions. The segment anything model (SAM), a visual foundation model, demonstrates robust zero-shot capability for general image segmentation under various prompts. This study evaluates SAM’s applicability and efficiency in extracting roadway assets from images. Specifically, it examines the impacts of model size and prompt quality on SAM’s performance in segmenting roadway assets. Five state-of-the-art semantic segmentation models are trained and compared with SAM. Results show that a lightweight SAM with human-rendered prompts outperforms the five semantic segmentation models. Based on the evaluation results, future work will explore incorporating SAM into transportation asset management applications, promoting collaboration between human experts and artificial intelligence.
Keywords: Roadway asset; Transportation asset management (TAM); Segment anything model (SAM); Foundation model; Semantic segmentation
A medical image segmentation model based on SAM with an integrated local multi-scale feature encoder
5
Authors: DI Jing, ZHU Yunlong, LIANG Chan. Journal of Measurement Science and Instrumentation, 2025, Issue 3, pp. 359-370 (12 pages)
Despite its remarkable performance on natural images, the segment anything model (SAM) lacks domain-specific information in medical imaging and faces the challenge of losing local multi-scale information in the encoding phase. This paper presents a medical image segmentation model based on SAM with a local multi-scale feature encoder (LMSFE-SAM) to address the issues above. Firstly, based on the SAM, a local multi-scale feature encoder is introduced to improve the representation of features within the local receptive field, thereby supplying the Vision Transformer (ViT) branch in SAM with enriched local multi-scale contextual information. At the same time, a multiaxial Hadamard product module (MHPM) is incorporated into the local multi-scale feature encoder in a lightweight manner to reduce the quadratic complexity and noise interference. Subsequently, a cross-branch balancing adapter is designed to balance the local and global information between the local multi-scale feature encoder and the ViT encoder in SAM. Finally, to obtain a smaller input image size and to mitigate overlapping in patch embeddings, the size of the input image is reduced from 1024×1024 pixels to 256×256 pixels, and a multidimensional information adaptation component is developed, which includes feature adapters, position adapters, and channel-spatial adapters. This component effectively integrates the information from small-sized medical images into SAM, enhancing its suitability for clinical deployment. The proposed model demonstrates an average enhancement ranging from 0.0387 to 0.3191 across six objective evaluation metrics on the BUSI, DDTI, and TN3K datasets compared to eight other representative image segmentation models. This significantly enhances the performance of the SAM on medical images, providing clinicians with a powerful tool in clinical diagnosis.
Keywords: segment anything model (SAM); medical image segmentation; encoder; decoder; multiaxial Hadamard product module (MHPM); cross-branch balancing adapter
An intelligent segmentation method for leakage points in central serous chorioretinopathy based on fluorescein angiography images
6
Authors: Jian-Guo Xu, Yong-Chi Liu, Fen Zhou, Jian-Xin Shen, Zhi-Peng Yan, Xin-Ya Hu, Wei-Hua Yang. International Journal of Ophthalmology (English edition), 2026, Issue 3, pp. 421-433 (13 pages)
AIM: To construct an intelligent segmentation scheme for precise localization of central serous chorioretinopathy (CSC) leakage points, thereby enabling ophthalmologists to deliver accurate laser treatment without navigational laser equipment. METHODS: A dataset with dual labels (point-level and pixel-level) was first established based on fundus fluorescein angiography (FFA) images of CSC and subsequently divided into training (102 images), validation (40 images), and test (40 images) datasets. An intelligent segmentation method was then developed, based on the You Only Look Once version 8 Pose Estimation (YOLOv8-Pose) model and segment anything model (SAM), to segment CSC leakage points. Next, the YOLOv8-Pose model was trained for 200 epochs, and the best-performing model was selected to form the optimal combination with SAM. Additionally, five classic U-Net series models [i.e., U-Net, recurrent residual U-Net (R2U-Net), attention U-Net (AttU-Net), recurrent residual attention U-Net (R2AttUNet), and nested U-Net (UNet++)] were initialized with three random seeds and trained for 200 epochs, resulting in a total of 15 baseline models for comparison. Finally, based on metrics including Dice similarity coefficient (DICE), intersection over union (IoU), precision, recall, precision-recall (PR) curve, and receiver operating characteristic (ROC) curve, the proposed method was compared with baseline models through quantitative and qualitative experiments for leakage point segmentation, thereby demonstrating its effectiveness. RESULTS: With the increase of training epochs, the mAP50-95, recall, and precision of the YOLOv8-Pose model showed a significant increase and tended to stabilize, and it achieved a preliminary localization success rate of 90% (i.e., 36 images) for CSC leakage points in 40 test images. Using expert-annotated pixel-level labels as the ground truth, the proposed method achieved a DICE of 57.13%, an IoU of 45.31%, a precision of 45.91%, a recall of 93.57%, an area under the PR curve (AUC-PR) of 0.78, and an area under the ROC curve (AUC-ROC) of 0.97, which enables more accurate segmentation of CSC leakage points. CONCLUSION: By combining the precise localization capability of the YOLOv8-Pose model with the robust and flexible segmentation ability of SAM, the proposed method not only demonstrates the effectiveness of the YOLOv8-Pose model in detecting keypoint coordinates of CSC leakage points from the perspective of application innovation but also establishes a novel approach for accurate segmentation of CSC leakage points through the “detect-then-segment” strategy, thereby providing a potential auxiliary means for the automatic and precise real-time localization of leakage points during traditional laser photocoagulation for CSC.
Keywords: You Only Look Once version 8 Pose Estimation; segment anything model; central serous chorioretinopathy; leakage point segmentation
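Several abstracts in this list report Dice and IoU scores for predicted masks. As a reading aid, here is a minimal sketch of how the two overlap metrics are commonly computed for a single pair of binary masks. The function name `dice_and_iou` and the toy masks are illustrative assumptions and are not taken from any of the listed papers, whose averaging and thresholding choices are not described here.

```python
def dice_and_iou(pred, truth):
    """Return (Dice, IoU) for two same-sized binary masks given as nested lists of 0/1."""
    intersection = pred_area = truth_area = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            intersection += 1 if (p and t) else 0
            pred_area += 1 if p else 0
            truth_area += 1 if t else 0
    union = pred_area + truth_area - intersection
    if union == 0:
        # Both masks are empty; treating this as perfect agreement is a convention.
        return 1.0, 1.0
    dice = 2 * intersection / (pred_area + truth_area)
    iou = intersection / union
    return dice, iou


# Toy 3x3 masks (hypothetical data): 4 predicted pixels, 4 true pixels, 3 shared.
pred = [[0, 1, 1],
        [0, 1, 1],
        [0, 0, 0]]
truth = [[0, 0, 1],
         [0, 1, 1],
         [0, 0, 1]]

dice, iou = dice_and_iou(pred, truth)
print(dice, iou)  # Dice = 0.75, IoU = 0.6
```

For one mask pair, Dice and IoU are tied by Dice = 2·IoU / (1 + IoU); scores averaged over many images, as usually reported, need not satisfy that relation.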
How precise is precise enough? Tree crown segmentation using high resolution close-up multispectral UAV images and its effect on NDVI accuracy in Fraxinus excelsior L. trees
7
Authors: Lisa Buchner, Anna-Katharina Eisen, Susanne Jochner-Oette. Journal of Forestry Research, 2026, Issue 2, pp. 16-30 (15 pages)
Detailed individual tree crown segmentation is highly relevant for the detection and monitoring of Fraxinus excelsior L. trees affected by ash dieback, a major threat to common ash populations across Europe. In this study, both fine and coarse crown segmentation methods were applied to close-range multispectral UAV imagery. The fine tree crown segmentation method utilized a novel unsupervised machine learning approach based on a blended NIR-NDVI image, whereas the coarse segmentation relied on the segment anything model (SAM). Both methods successfully delineated tree crown outlines; however, only the fine segmentation accurately captured internal canopy gaps. Despite these structural differences, mean NDVI values calculated per tree crown revealed no significant differences between the two approaches, indicating that coarse segmentation is sufficient for mean vegetation index assessments. Nevertheless, the fine segmentation revealed increased heterogeneity in NDVI values in more severely damaged trees, underscoring its value for detailed structural and health analyses. Furthermore, the fine segmentation workflow proved transferable to both individual UAV images and orthophotos from broader UAV surveys. For applications focused on structural integrity and spatial variation in canopy health, the fine segmentation approach is recommended.
Keywords: Leaf mass segmentation; Machine learning; Segment anything model; Ash dieback
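The entry above compares mean NDVI per tree crown under a coarse and a fine mask. The sketch below shows the standard per-pixel definition, NDVI = (NIR − Red) / (NIR + Red), averaged over whichever pixels a mask selects. The function name `mean_ndvi` and all band values are made-up illustrations; they do not reproduce the paper's data or its finding.

```python
def mean_ndvi(nir, red, mask):
    """Average NDVI = (NIR - Red) / (NIR + Red) over pixels where mask is 1."""
    values = []
    for nir_row, red_row, mask_row in zip(nir, red, mask):
        for n, r, m in zip(nir_row, red_row, mask_row):
            if m and (n + r) != 0:  # skip unmasked pixels and zero denominators
                values.append((n - r) / (n + r))
    return sum(values) / len(values) if values else float("nan")


# Hypothetical 2x2 band values; the lower-left pixel stands for a canopy gap.
nir = [[80, 60],
       [50, 40]]
red = [[20, 20],
       [50, 10]]
coarse_mask = [[1, 1],
               [1, 1]]  # whole crown outline, gap included
fine_mask = [[1, 1],
             [0, 1]]    # same outline with the gap excluded

print(round(mean_ndvi(nir, red, coarse_mask), 3))
print(round(mean_ndvi(nir, red, fine_mask), 3))
```

In this toy case the gap pixel pulls the coarse mean down; how much a real crown mean shifts depends on how much of the crown the gaps occupy.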
Tunnel SAM adapter: Adapting segment anything model for tunnel water leakage inspection
8
Authors: Junxin Chen, Xiaojie Yu, Shichang Liu, Tao Chen, Wei Wang, Gwanggil Jeon, Benguo He. Geohazard Mechanics, 2024, Issue 1, pp. 29-36 (8 pages)
Water leakage inspection in tunnels is a critical engineering task that has attracted increasing concern. Detecting leakage areas by manual inspection is time-consuming and may produce unreliable findings, so automated techniques are needed to increase reliability and efficiency. Foundation segmentation models pre-trained on large datasets have attracted great interest recently. This paper proposes a novel SAM-based network for accurate automated water leakage inspection. The contributions of this paper include the efficient adaptation of the SAM (Segment Anything Model) for shield tunnel water leakage segmentation and the demonstration of its effectiveness through data experiments. Tunnel SAM Adapter has satisfactory performance, achieving 76.2% mIoU and 77.5% Dice. Experimental results demonstrate that our approach has advantages over peer studies and guarantees the integrity and safety of these vital assets while streamlining tunnel maintenance.
Keywords: Water leakage segmentation; Segment anything model; SAM-Adapter; Smart engineering
Pre-trained SAM as data augmentation for image segmentation (Cited by: 1)
9
Authors: Junjun Wu, Yunbo Rao, Shaoning Zeng, Bob Zhang. CAAI Transactions on Intelligence Technology, 2025, Issue 1, pp. 268-282 (15 pages)
Data augmentation plays an important role in training deep neural models by expanding the size and diversity of the dataset. Initially, data augmentation mainly involved some simple transformations of images. Later, in order to increase the diversity and complexity of data, more advanced methods appeared and evolved into sophisticated generative models. However, these methods required a large amount of computation for training or searching. In this paper, a novel training-free method that utilises the Pre-Trained Segment Anything Model (SAM) as a data augmentation tool (PTSAM-DA) is proposed to generate augmented annotations for images. Without the need for training, it obtains prompt boxes from the original annotations and then feeds the boxes to the pre-trained SAM to generate diverse and improved annotations. In this way, annotations are augmented more ingeniously than by simple manipulations, without incurring huge computation for training a data augmentation model. Multiple comparative experiments on three datasets are conducted, including an in-house dataset, ADE20K and COCO2017. On the in-house dataset, namely the Agricultural Plot Segmentation Dataset, maximum improvements of 3.77% and 8.92% are gained in two mainstream metrics, mIoU and mAcc, respectively. Consequently, large vision models like SAM are proven to be promising not only in image segmentation but also in data augmentation.
Keywords: data augmentation; image segmentation; large model; segment anything model
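The abstract above says prompt boxes are obtained from the original annotations and then fed to the pre-trained SAM. One plausible way to derive such a box, sketched here purely as an assumption, is the tight bounding box of an existing mask annotation; the paper's actual procedure may differ. The helper name `mask_to_box` is hypothetical. The corner order (x_min, y_min, x_max, y_max) matches the XYXY box format that the open-source `segment_anything` package documents for `SamPredictor.predict(box=...)`; that call is not made here.

```python
def mask_to_box(mask):
    """Return the tight box (x_min, y_min, x_max, y_max) around the 1-pixels of a
    binary mask given as nested lists, or None if the mask is empty."""
    xs, ys = [], []
    for y, row in enumerate(mask):
        for x, value in enumerate(row):
            if value:
                xs.append(x)
                ys.append(y)
    if not xs:
        return None
    return (min(xs), min(ys), max(xs), max(ys))


# Hypothetical 4x5 annotation mask for a single object.
annotation = [[0, 0, 0, 0, 0],
              [0, 1, 1, 0, 0],
              [0, 1, 1, 1, 0],
              [0, 0, 0, 0, 0]]

print(mask_to_box(annotation))  # (1, 1, 3, 2)
```

Coordinates are pixel indices with x as the column and y as the row, so the box is inclusive of its corner pixels.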
Intelligent evaluation of sandstone rock structure based on a visual large model
10
Authors: REN Yili, ZENG Changmin, LI Xin, LIU Xi, HU Yanxu, SU Qianxiao, WANG Xiaoming, LIN Zhiwei, ZHOU Yixiao, ZHENG Zilu, HU Huiying, YANG Yanning, HUI Fang. Petroleum Exploration and Development, 2025, Issue 2, pp. 548-558 (11 pages)
Existing sandstone rock structure evaluation methods rely on visual inspection, with low efficiency, semi-quantitative analysis of roundness, and inability to perform classified statistics in particle size analysis. This study presents an intelligent evaluation method for sandstone rock structure based on the Segment Anything Model (SAM). By developing a lightweight SAM fine-tuning method with rank-decomposition matrix adapters, a multispectral rock particle segmentation model named CoreSAM is constructed, which achieves rock particle edge extraction and type identification. Building upon this, we propose a comprehensive quantitative evaluation system for rock structure, assessing parameters including particle size, sorting, roundness, particle contact and cementation types. The experimental results demonstrate that CoreSAM outperforms existing methods in rock particle segmentation accuracy while showing excellent generalization across different image types such as CT scans and core photographs. The proposed method enables full-sample, classified particle size analysis and quantitative characterization of parameters like roundness, advancing reservoir evaluation towards more precise, quantitative, intuitive, and comprehensive development.
Keywords: sandstone; rock structure; intelligent evaluation; segment anything model; fine-tuning; particle edge extraction; type identification
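From general-purpose segmentation to specialized building extraction: optimization strategies for SAM in high-resolution remote sensing imagery
11
Authors: 陈秀秀, 金永胜, 叶建生, 方雷. 《中国图象图形学报》 (PKU Core), 2026, Issue 2, pp. 642-656 (15 pages)
Objective: Traditional building extraction from high-resolution imagery faces an accuracy bottleneck. The segment anything model (SAM) has strong segmentation ability, but because of the gap between its training domain and remote sensing data and its reliance on manual prompts, it cannot be applied directly to automated extraction from large-scale remote sensing imagery. To address this, a prompt-free joint segmentation-discrimination model (SAM-Classifier) is proposed, which transfers a general-purpose vision model to remote sensing scenes and extracts buildings automatically and efficiently. Methods: A series of experiments systematically examines building extraction with SAM under different prompt types (point, box, and mask prompts), and a prompt-free joint model, SAM-Classifier, is introduced to overcome the limitations of the original SAM in semantic understanding and prompt dependence. The experiments use three publicly available datasets to evaluate SAM under each prompting strategy. In addition, to compare how different solutions perform on building extraction, comparative experiments were designed in which the results of SAM and SAM-Classifier are compared in detail with those of Sense Earth 3.0, a remote sensing large model developed by SenseTime. Results: Box-prompted SAM segmentation performs best (F1 score of 0.945 on the WHU dataset). The proposed SAM-Classifier requires no manual prompts and reaches an F1 score of 0.717 on the Ma dataset, close to the advanced methods it is compared with. Conclusion: SAM-Classifier integrates a lightweight classifier to achieve prompt-free, end-to-end building extraction, effectively alleviating SAM's limited semantic understanding and prompt dependence and offering a new approach to automated interpretation of remote sensing imagery.
Keywords: image segmentation; high-resolution imagery; building extraction; SAM (segment anything model); prompt-based segmentation; optimization strategy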
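Application of an improved YOLOv8 algorithm to Camellia oleifera fruit sorting (Cited by: 1)
12
Authors: 刘姜毅, 高自成, 刘怀粤, 尹浇钦, 罗媛尹. 《林业工程学报》 (PKU Core), 2025, Issue 1, pp. 120-127 (8 pages)
Existing Camellia oleifera fruit sorting systems rely on algorithms such as YOLO, whose object detection and instance segmentation are not robust for small and densely packed samples, so the robotic arm often grasps branches and leaves, grips insecurely, or drops the fruit. Most systems use object detection, which cannot accurately recover the contour of each fruit and therefore cannot sort fruit by size. To address this, the study proposes the YOWNet model for the small-target, high-density recognition task in Camellia oleifera fruit sorting. First, an automated edge-annotation script was developed; it calls the zero-shot Segment Anything framework to extract regions of interest from the existing annotated detection boxes and converts them automatically into edge annotations. Second, to improve recognition of small targets, the study replaces the existing convolution modules, which have fixed receptive fields, with a three-dimensional attention dynamic convolution module designed for the characteristics of Camellia oleifera fruit to capture key information in the feature maps. Finally, the study uses the Wise-IoU loss function, a bounding-box loss based on a dynamic non-monotonic focusing mechanism, to improve bounding-box regression accuracy. The overall network is named YOWNet. In ablation experiments against YOLOv8 on Camellia oleifera fruit, the results show that YOWNet identifies fruit instances quickly and accurately; on a private dataset, accuracy and Box_loss reach 89.90% and 0.523, respectively.
Keywords: Camellia oleifera fruit; three-dimensional dynamic convolution; instance segmentation; YOLOv8; segment anything model; Wise-IoU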
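A SAM-based image processing method for calculating rockfill gradation and its validation (Cited by: 1)
13
Authors: 张振伟, 蔡可天, 高轩, 贺一轩, 王建, 鲁洋. 《水力发电》, 2025, Issue 2, pp. 80-86 (7 pages)
Gradation testing of rockfill is an important part of quality control during rockfill dam construction. The traditional approach is manual on-site sieving, which tests few samples, is inefficient, and interferes with construction. An image-processing-based method for calculating rockfill gradation is proposed. It uses the Segment Anything Model (SAM), the general-purpose image segmentation large model open-sourced by Meta AI, to segment images of dam rockfill automatically, and proposes morphological geometric parameters such as the rock aspect ratio and area ratio to extract rockfill particle targets from the images. A rockfill morphology database and a rockfill instance segmentation database are also built to analyze parameter values and verify the image-based gradation calculation. Finally, validation tests show that the method effectively identifies rockfill particles in images, recognizes the gradation curve automatically, and quickly computes gradation indices such as the coefficient of curvature and the coefficient of uniformity. The correlation between the computed gradation and the gradation measured by actual sieving reaches 0.94, with a mean absolute error of about 5%, so the method can effectively assist in measuring rockfill particle gradation during construction and support compaction quality control of rockfill dams.
Keywords: rockfill; gradation; segment anything model (SAM); image recognition; rapid detection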
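The entry above mentions computing the coefficient of uniformity and the coefficient of curvature from a gradation curve. The sketch below uses the standard soil-mechanics definitions, Cu = d60 / d10 and Cc = d30² / (d10 · d60), where dN is the particle size at which N percent of the material passes. How the paper builds its curve from segmented particles is not described in the abstract, so the curve here is made-up data, the function names are hypothetical, and the linear interpolation between curve points is an assumption (engineering practice often interpolates on a logarithmic size axis).

```python
def characteristic_size(sizes_mm, percent_passing, target):
    """Linearly interpolate the size (mm) at which `target` percent passes.
    Both input lists must be sorted in increasing order."""
    points = list(zip(percent_passing, sizes_mm))
    for (p_lo, d_lo), (p_hi, d_hi) in zip(points, points[1:]):
        if p_lo <= target <= p_hi:
            if p_hi == p_lo:
                return float(d_lo)
            return d_lo + (target - p_lo) * (d_hi - d_lo) / (p_hi - p_lo)
    raise ValueError("target percentage lies outside the gradation curve")


def gradation_coefficients(sizes_mm, percent_passing):
    """Return (Cu, Cc) with Cu = d60/d10 and Cc = d30^2 / (d10*d60)."""
    d10 = characteristic_size(sizes_mm, percent_passing, 10)
    d30 = characteristic_size(sizes_mm, percent_passing, 30)
    d60 = characteristic_size(sizes_mm, percent_passing, 60)
    return d60 / d10, d30 ** 2 / (d10 * d60)


# Hypothetical gradation curve: cumulative percent passing at each size.
sizes_mm = [5, 10, 20, 40, 80, 160]
percent_passing = [0, 10, 30, 60, 90, 100]

cu, cc = gradation_coefficients(sizes_mm, percent_passing)
print(cu, cc)  # 4.0 1.0
```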
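SAY-SOD: A small-object detection framework for high-definition remote sensing images optimized with a large model (Cited by: 1)
14
Authors: 曾文龙, 贾海涛, 周昊哲, 程卓尔. 《网络安全与数据治理》, 2025, Issue S1, pp. 90-97 (8 pages)
As remote sensing technology develops, small-object detection in remote sensing images faces challenges such as complex backgrounds, small target sizes, and limited pixel information, and traditional detection algorithms have limitations in this area. A small-object detection framework based on the SAM large model and an improved YOLOv8 is proposed. First, SAM extracts and segments regions of interest from the original remote sensing images; the segmented images then undergo multi-scale enhancement to make small objects more salient. The enhanced images, together with the index and location information of the original images, form the dataset used to train the improved YOLOv8 model. The improvements include an optimized feature pyramid network, an added attention mechanism, and a redesigned loss function. Experimental results show that the SAY-SOD framework improves the detection accuracy and robustness for small remote sensing objects against complex backgrounds, and performs especially well under changes in scale and background.
Keywords: remote sensing images; small-object detection; segment anything model; YOLOv8; feature pyramid network; data augmentation; attention mechanism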
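Cropland parcel extraction from high-resolution remote sensing imagery using a segment anything model with adaptive parameter selection
15
Authors: 马海荣, 蔡端午. 《中南农业科技》, 2025, Issue 11, pp. 114-117, 131 (5 pages)
Parameter settings for the segment anything model (SAM) in cropland parcel extraction from high-resolution remote sensing imagery depend on manual experience and lack theoretical guidance. To address this, an adaptive parameter selection method based on unsupervised segmentation quality evaluation is proposed. A six-dimensional evaluation index system is built that combines geometric features (edge smoothness, convexity), spectral features (mean brightness, spectral variance), and texture features (homogeneity, contrast), and a composite scoring mechanism is established to select SAM's key segmentation parameters dynamically and adaptively. The results show that the method effectively extracts cropland parcels in complex scenes, improving boundary accuracy while preserving object integrity, and provides reliable technical support for precise agricultural resource management and smart agriculture.
Keywords: cropland parcel extraction; segment anything model (SAM); adaptive parameter selection; unsupervised evaluation; high-resolution remote sensing imagery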
Efficient Dataset Generation for Stacked Meat Products Instance Segmentation in Food Automation
16
Authors: Hoang Minh Pham, Anh Dong Le, Pablo Malvido-Fresnillo, Saigopal Vasudevan, José L. Martínez Lastra. IEEE/CAA Journal of Automatica Sinica, 2026, Issue 1, pp. 224-226 (3 pages)
Dear Editor, This letter presents techniques to simplify dataset generation for instance segmentation of raw meat products, a critical step toward automating food production lines. Accurate segmentation is essential for addressing challenges such as occlusions, indistinct edges, and stacked configurations, which demand large, diverse datasets. To meet these demands, we propose two complementary approaches: a semi-automatic annotation interface using tools like the segment anything model (SAM) and GrabCut, and a synthetic data generation pipeline leveraging 3D-scanned models. These methods reduce reliance on real meat, mitigate food waste, and improve scalability. Experimental results demonstrate that incorporating synthetic data enhances segmentation model performance and, when combined with real data, further boosts accuracy, paving the way for more efficient automation in the food industry.
Keywords: dataset generation; segment anything model (SAM); food automation; raw meat products; automating food production lines; accurate instance segmentation; stacked meat products; semi-automatic annotation
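Exposed rock ratio on the layer surfaces of rock-filled concrete dams based on SAM & ImageJ image processing (Cited by: 5)
17
Authors: 安宇, 徐小蓉, 尹志刚, 金峰, 张喜喜. 《水资源与水工程学报》 (CSCD, PKU Core), 2024, Issue 1, pp. 154-161 (8 pages)
Exposed rocks on the layer surfaces of a rock-filled concrete dam provide important interlocking between the upper and lower layers, and their projected area ratio is an important index for scientifically evaluating interlayer shear performance. The segment anything model (SAM) from Meta AI is used to segment images of exposed rockfill on layer surfaces automatically, and ImageJ is used to post-process the SAM output and perform image calculations: smoothing, a difference algorithm, median filtering, and related methods delineate the exposed rocks precisely, and the exposed rock ratio of the layer surface is computed after binarization. The results show that SAM pre-segmentation identifies about 90% of the exposed rocks; secondary processing in ImageJ improves the recognition accuracy for small rocks, and the error relative to manual annotation is within ±3%. Taking engineering applications at two reservoirs in Guizhou Province as examples, the pouring surfaces were partitioned into zones for preprocessing. The exposed rock ratio differs considerably among zones near the upstream side, the middle, and the downstream side; most computed ratios fall between 10% and 30%, and the ratio is lower in the zone used as the transport route for placing rockfill. The work and its conclusions provide a reference for research on the interlayer shear behavior of rock-filled concrete structures and on the safety and stability of dams during impoundment.
Keywords: rock-filled concrete dam; segment anything model (SAM); image processing; exposed rock ratio; interlayer shear performance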
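Extraction of salt marsh vegetation "fairy circles" from UAV images by combining the SAM visual segmentation model with random forest machine learning (Cited by: 1)
18
Authors: 周若彤, 谭凯, 杨建儒, 韩江涛, 张卫国. 《海洋学报》 (CAS, CSCD, PKU Core), 2024, Issue 5, pp. 116-126 (11 pages)
"Fairy circles" are a spatially self-organized structure in coastal salt marsh vegetation ecosystems and strongly influence the productivity, stability, and resilience of salt marsh wetlands. UAV imagery is an important data source for locating fairy circles with high accuracy and interpreting their spatiotemporal evolution, but fairy-circle pixels differ little from background pixels in color and shape, so identifying fairy-circle pixels intelligently and accurately in 2D images and grouping the identified pixels into individual fairy circles remains a technical difficulty. This paper proposes a method for segmenting and classifying fairy circles in UAV images that combines the Segment Anything Model (SAM) with random forest machine learning, and achieves identification and extraction of individual fairy circles. First, using the Sørensen-Dice coefficient (Dice) and intersection over union (IoU) as evaluation metrics, a pre-trained model is selected from SAM and its parameters are optimized to achieve fully automatic image segmentation, yielding segmentation masks (segments) without attribute information. Then, the red, green, and blue (RGB) channel information and 2D spatial coordinates are used to match the masks to the original image and construct feature indices for each mask, and the features are analyzed and screened according to the decrease in out-of-bag (OOB) error and the feature distributions. Finally, the selected features are used to train a random forest model that automatically identifies and classifies fairy-circle vegetation, ordinary vegetation, and bare tidal flat. Experiments show that the method achieves an average correct extraction rate of 96.1% and an average false extraction rate of 9.5%, providing methods and technical support for accurately characterizing the spatiotemporal pattern of fairy circles and for processing UAV remote sensing images of coastal zones.
Keywords: salt marsh vegetation; fairy circle; segment anything model (SAM); UAV imagery; machine learning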
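A method for estimating building height from street view images (Cited by: 1)
19
Authors: 戈士博, 刘纪平, 王勇, 车向红. 《遥感信息》 (CSCD, PKU Core), 2024, Issue 3, pp. 1-6 (6 pages)
Building height is basic data for 3D city modeling, but existing building height estimation studies mostly rely on remote sensing data such as LiDAR and SAR. With the rapid development of computing and the internet, street view data, which are easy and cheap to collect, have become an emerging data source for building height estimation. This paper proposes a method for estimating building height from street view images. First, the segment anything model is used to extract the pixel height of the building in the image. Then, image metadata and electronic map data are used to obtain the distance between the building and the camera and the image focal length, and the pinhole camera model is improved according to the geometric relationship between the street view image and the physical building to construct the height estimation method. Finally, Mapillary street view images of Beijing and Berlin are used for experimental validation. The results show that the improved pinhole camera model clearly improves height estimation accuracy compared with the unimproved model: RMSE decreases by 11.31 m and R² increases by 0.4, indicating practical value.
Keywords: street view images; building height estimation; pinhole camera model; segment anything model; Mapillary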
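The entry above starts from the pinhole camera model before improving it. As background only, the textbook similar-triangles relation is H ≈ h · D / f, where h is the building's height in pixels, D the camera-to-building distance, and f the focal length expressed in pixels. The sketch below implements that baseline relation, not the paper's improved model, whose correction terms are not given in the abstract. It assumes a level camera and a facade parallel to the image plane; the function name and the numbers are hypothetical.

```python
def building_height_m(pixel_height, distance_m, focal_length_px):
    """Basic pinhole estimate H = h * D / f.

    pixel_height    -- building height measured in the image, in pixels
    distance_m      -- camera-to-building distance, in metres
    focal_length_px -- focal length expressed in pixels
    """
    if focal_length_px <= 0:
        raise ValueError("focal length must be positive")
    return pixel_height * distance_m / focal_length_px


# Hypothetical numbers: a facade spanning 600 px, seen from 50 m with f = 1000 px.
print(building_height_m(600, 50, 1000))  # 30.0
```

Errors in the distance or the focal length scale the estimate proportionally, which is one reason a plain pinhole estimate can be far off on real street view imagery.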
Segment Anything Is Not Always Perfect: An Investigation of SAM on Different Real-world Applications (Cited by: 4)
20
Authors: Wei Ji, Jingjing Li, Qi Bi, Tingwei Liu, Wenbo Li, Li Cheng. Machine Intelligence Research (EI, CSCD), 2024, Issue 4, pp. 617-630 (14 pages)
Recently, Meta AI Research introduced a general, promptable segment anything model (SAM) pre-trained on an unprecedentedly large segmentation dataset (SA-1B). Without a doubt, the emergence of SAM will yield significant benefits for a wide array of practical image segmentation applications. In this study, we conduct a series of intriguing investigations into the performance of SAM across various applications, particularly in the fields of natural images, agriculture, manufacturing, remote sensing and healthcare. We analyze and discuss the benefits and limitations of SAM, while also presenting an outlook on its future development in segmentation tasks. By doing so, we aim to give a comprehensive understanding of SAM's practical applications. This work is expected to provide insights that facilitate future research activities toward generic segmentation. Source code is publicly available at https://github.com/LiuTingWed/SAM-Not-Perfect.
Keywords: segment anything model (SAM); visual perception; segmentation foundational model; computer vision