期刊文献+
共找到48篇文章
< 1 2 3 >
每页显示 20 50 100
A medical image segmentation model based on SAM with an integrated local multi-scale feature encoder
1
作者 DI Jing ZHU Yunlong LIANG Chan 《Journal of Measurement Science and Instrumentation》 2025年第3期359-370,共12页
Despite its remarkable performance on natural images,the segment anything model(SAM)lacks domain-specific information in medical imaging.and faces the challenge of losing local multi-scale information in the encoding ... Despite its remarkable performance on natural images,the segment anything model(SAM)lacks domain-specific information in medical imaging.and faces the challenge of losing local multi-scale information in the encoding phase.This paper presents a medical image segmentation model based on SAM with a local multi-scale feature encoder(LMSFE-SAM)to address the issues above.Firstly,based on the SAM,a local multi-scale feature encoder is introduced to improve the representation of features within local receptive field,thereby supplying the Vision Transformer(ViT)branch in SAM with enriched local multi-scale contextual information.At the same time,a multiaxial Hadamard product module(MHPM)is incorporated into the local multi-scale feature encoder in a lightweight manner to reduce the quadratic complexity and noise interference.Subsequently,a cross-branch balancing adapter is designed to balance the local and global information between the local multi-scale feature encoder and the ViT encoder in SAM.Finally,to obtain smaller input image size and to mitigate overlapping in patch embeddings,the size of the input image is reduced from 1024×1024 pixels to 256×256 pixels,and a multidimensional information adaptation component is developed,which includes feature adapters,position adapters,and channel-spatial adapters.This component effectively integrates the information from small-sized medical images into SAM,enhancing its suitability for clinical deployment.The proposed model demonstrates an average enhancement ranging from 0.0387 to 0.3191 across six objective evaluation metrics on BUSI,DDTI,and TN3K datasets compared to eight other representative image segmentation models.This significantly enhances the performance of the SAM on medical images,providing clinicians with a powerful tool in clinical diagnosis. 展开更多
关键词 segment anything model(SAM) medical image segmentation ENCODER decoder multiaxial Hadamard product module(MHPM) cross-branch balancing adapter
在线阅读 下载PDF
How precise is precise enough?Tree crown segmentation using high resolution close-up multispectral UAV images and its effect on NDVI accuracy in Fraxinus excelsior L.trees
2
作者 Lisa Buchner Anna-Katharina Eisen Susanne Jochner-Oette 《Journal of Forestry Research》 2026年第2期16-30,共15页
Detailed individual tree crown segmentation is highly relevant for the detection and monitoring of Fraxinus excelsior L.trees affected by ash dieback,a major threat to common ash populations across Europe.In this stud... Detailed individual tree crown segmentation is highly relevant for the detection and monitoring of Fraxinus excelsior L.trees affected by ash dieback,a major threat to common ash populations across Europe.In this study,both fine and coarse crown segmentation methods were applied to close-range multispectral UAV imagery.The fine tree crown segmentation method utilized a novel unsupervised machine learning approach based on a blended NIR-NDVI image,whereas the coarse segmentation relied on the segment anything model(SAM).Both methods successfully delineated tree crown outlines,however,only the fine segmentation accurately captured internal canopy gaps.Despite these structural differences,mean NDVI values calculated per tree crown revealed no significant differences between the two approaches,indicating that coarse segmentation is sufficient for mean vegetation index assessments.Nevertheless,the fine segmentation revealed increased heterogeneity in NDVI values in more severely damaged trees,underscoring its value for detailed structural and health analyses.Furthermore,the fine segmentation workflow proved transferable to both individual UAV images and orthophotos from broader UAV surveys.For applications focused on structural integrity and spatial variation in canopy health,the fine segmentation approach is recommended. 展开更多
关键词 Leaf mass segmentation Machine learning Segment anything model Ash dieback
在线阅读 下载PDF
An intelligent segmentation method for leakage points in central serous chorioretinopathy based on fluorescein angiography images
3
作者 Jian-Guo Xu Yong-Chi Liu +4 位作者 Fen Zhou Jian-Xin Shen Zhi-Peng Yan Xin-Ya Hu Wei-Hua Yang 《International Journal of Ophthalmology(English edition)》 2026年第3期421-433,共13页
AIM:To construct an intelligent segmentation scheme for precise localization of central serous chorioretinopathy(CSC)leakage points,thereby enabling ophthalmologists to deliver accurate laser treatment without navigat... AIM:To construct an intelligent segmentation scheme for precise localization of central serous chorioretinopathy(CSC)leakage points,thereby enabling ophthalmologists to deliver accurate laser treatment without navigational laser equipment.METHODS:A dataset with dual labels(point-level and pixel-level)was first established based on fundus fluorescein angiography(FFA)images of CSC and subsequently divided into training(102 images),validation(40 images),and test(40 images)datasets.An intelligent segmentation method was then developed,based on the You Only Look Once version 8 Pose Estimation(YOLOv8-Pose)model and segment anything model(SAM),to segment CSC leakage points.Next,the YOLOv8-Pose model was trained for 200 epochs,and the best-performing model was selected to form the optimal combination with SAM.Additionally,the classic five types of U-Net series models[i.e.,U-Net,recurrent residual U-Net(R2U-Net),attention U-Net(AttU-Net),recurrent residual attention U-Net(R2AttUNet),and nested U-Net(UNet^(++))]were initialized with three random seeds and trained for 200 epochs,resulting in a total of 15 baseline models for comparison.Finally,based on the metrics including Dice similarity coefficient(DICE),intersection over union(IoU),precision,recall,precisionrecall(PR)curve,and receiver operating characteristic(ROC)curve,the proposed method was compared with baseline models through quantitative and qualitative experiments for leakage point segmentation,thereby demonstrating its effectiveness.RESULTS:With the increase of training epochs,the mAP50-95,Recall,and precision of the YOLOv8-Pose model showed a significant increase and tended to stabilize,and it achieved a preliminary localization success rate of 90%(i.e.,36 images)for CSC leakage points in 40 test images.Using manually expert-annotated pixel-level labels as the ground truth,the proposed method achieved outcomes with a DICE of 57.13%,an IoU of 45.31%,a precision of 45.91%,a recall of 93.57%,an area under the PR curve(AUC-PR)of 0.78 and an area under the ROC curve(AUC-ROC)of 0.97,which enables more accurate segmentation of CSC leakage points.CONCLUSION:By combining the precise localization capability of the YOLOv8-Pose model with the robust and flexible segmentation ability of SAM,the proposed method not only demonstrates the effectiveness of the YOLOv8-Pose model in detecting keypoint coordinates of CSC leakage points from the perspective of application innovation but also establishes a novel approach for accurate segmentation of CSC leakage points through the“detect-then-segment”strategy,thereby providing a potential auxiliary means for the automatic and precise realtime localization of leakage points during traditional laser photocoagulation for CSC. 展开更多
关键词 You Only Look Once version 8 Pose Estimation segment anything model central serous chorioretinopathy leakage point segmentation
原文传递
Pre-trained SAM as data augmentation for image segmentation 被引量:1
4
作者 Junjun Wu Yunbo Rao +1 位作者 Shaoning Zeng Bob Zhang 《CAAI Transactions on Intelligence Technology》 2025年第1期268-282,共15页
Data augmentation plays an important role in training deep neural model by expanding the size and diversity of the dataset.Initially,data augmentation mainly involved some simple transformations of images.Later,in ord... Data augmentation plays an important role in training deep neural model by expanding the size and diversity of the dataset.Initially,data augmentation mainly involved some simple transformations of images.Later,in order to increase the diversity and complexity of data,more advanced methods appeared and evolved to sophisticated generative models.However,these methods required a mass of computation of training or searching.In this paper,a novel training-free method that utilises the Pre-Trained Segment Anything Model(SAM)model as a data augmentation tool(PTSAM-DA)is proposed to generate the augmented annotations for images.Without the need for training,it obtains prompt boxes from the original annotations and then feeds the boxes to the pre-trained SAM to generate diverse and improved annotations.In this way,annotations are augmented more ingenious than simple manipulations without incurring huge computation for training a data augmentation model.Multiple comparative experiments on three datasets are conducted,including an in-house dataset,ADE20K and COCO2017.On this in-house dataset,namely Agricultural Plot Segmentation Dataset,maximum improvements of 3.77%and 8.92%are gained in two mainstream metrics,mIoU and mAcc,respectively.Consequently,large vision models like SAM are proven to be promising not only in image segmentation but also in data augmentation. 展开更多
关键词 data augmentation image segmentation large model segment anything model
在线阅读 下载PDF
Intelligent evaluation of sandstone rock structure based on a visual large model
5
作者 REN Yili ZENG Changmin +10 位作者 LI Xin LIU Xi HU Yanxu SU Qianxiao WANG Xiaoming LIN Zhiwei ZHOU Yixiao ZHENG Zilu HU Huiying YANG Yanning HUI Fang 《Petroleum Exploration and Development》 2025年第2期548-558,共11页
Existing sandstone rock structure evaluation methods rely on visual inspection,with low efficiency,semi-quantitative analysis of roundness,and inability to perform classified statistics in particle size analysis.This ... Existing sandstone rock structure evaluation methods rely on visual inspection,with low efficiency,semi-quantitative analysis of roundness,and inability to perform classified statistics in particle size analysis.This study presents an intelligent evaluation method for sandstone rock structure based on the Segment Anything Model(SAM).By developing a lightweight SAM fine-tuning method with rank-decomposition matrix adapters,a multispectral rock particle segmentation model named CoreSAM is constructed,which achieves rock particle edge extraction and type identification.Building upon this,we propose a comprehensive quantitative evaluation system for rock structure,assessing parameters including particle size,sorting,roundness,particle contact and cementation types.The experimental results demonstrate that CoreSAM outperforms existing methods in rock particle segmentation accuracy while showing excellent generalization across different image types such as CT scans and core photographs.The proposed method enables full-sample,classified particle size analysis and quantitative characterization of parameters like roundness,advancing reservoir evaluation towards more precise,quantitative,intuitive,and comprehensive development. 展开更多
关键词 SANDSTONE rock structure intelligent evaluation Segment Anything model fine-tuning particle edge extraction type identification
在线阅读 下载PDF
Accelerated optical remote sensing mapping of oil spills in the China Seas using the Segment Anything Model
6
作者 Hang Lv Yingcheng Lu +5 位作者 Lifeng Wang Shuxian Song Wei Zhao Yanlong Chen Yuntao Wang Qingjun Song 《Acta Oceanologica Sinica》 2025年第10期184-197,共14页
Efficient segmentation of oiled pixels in optical remotely sensed images is the precondition of optical identification and classification of different spilled oils,which remains one of the keys to optical remote sensi... Efficient segmentation of oiled pixels in optical remotely sensed images is the precondition of optical identification and classification of different spilled oils,which remains one of the keys to optical remote sensing of oil spills.Optical remotely sensed images of oil spills are inherently multidimensional and embedded with a complex knowledge framework.This complexity often hinders the effectiveness of mechanistic algorithms across varied scenarios.Although optical remote-sensing theory for oil spills has advanced,the scarcity of curated datasets and the difficulty of collecting them limit their usefulness for training deep learning models.This study introduces a data expansion strategy that utilizes the Segment Anything Model(SAM),effectively bridging the gap between traditional mechanism algorithms and emergent self-adaptive deep learning models.Optical dimension reduction is achieved through standardized preprocessing processes that address the decipherable properties of the input image.After preprocessing,SAM can swiftly and accurately segment spilled oil in images.The unified AI-based workflow significantly accelerates labeled-dataset creation and has proven effective for both rapid emergency intelligence during spill incidents and the rapid mapping and classification of oil footprints across China’s coastal waters.Our results show that coupling a remote sensing mechanism with a foundation model enables near-real-time,large-scale monitoring of complex surface slicks and offers guidance for the next generation of detection and quantification algorithms. 展开更多
关键词 marine oil spills optical remote sensing segment anything model extract oil footprint spatiotemporal distribution
在线阅读 下载PDF
A novel deep learning-based framework for forecasting
7
作者 Congqi Cao Ze Sun +2 位作者 Lanshu Hu Liujie Pan Yanning Zhang 《Atmospheric and Oceanic Science Letters》 2026年第1期22-26,共5页
Deep learning-based methods have become alternatives to traditional numerical weather prediction systems,offering faster computation and the ability to utilize large historical datasets.However,the application of deep... Deep learning-based methods have become alternatives to traditional numerical weather prediction systems,offering faster computation and the ability to utilize large historical datasets.However,the application of deep learning to medium-range regional weather forecasting with limited data remains a significant challenge.In this work,three key solutions are proposed:(1)motivated by the need to improve model performance in data-scarce regional forecasting scenarios,the authors innovatively apply semantic segmentation models,to better capture spatiotemporal features and improve prediction accuracy;(2)recognizing the challenge of overfitting and the inability of traditional noise-based data augmentation methods to effectively enhance model robustness,a novel learnable Gaussian noise mechanism is introduced that allows the model to adaptively optimize perturbations for different locations,ensuring more effective learning;and(3)to address the issue of error accumulation in autoregressive prediction,as well as the challenge of learning difficulty and the lack of intermediate data utilization in one-shot prediction,the authors propose a cascade prediction approach that effectively resolves these problems while significantly improving model forecasting performance.The method achieves a competitive result in The East China Regional AI Medium Range Weather Forecasting Competition.Ablation experiments further validate the effectiveness of each component,highlighting their contributions to enhancing prediction performance. 展开更多
关键词 Weather forecasting Deep learning Semantic segmentation models Learnable Gaussian noise Cascade prediction
在线阅读 下载PDF
FTIR STUDIES ON THE MODEL POLYURETHANE HARD SEGMENTS BASED ON A NEW WATERBORNE CHAIN EXTENDER DIMETHYLOL BUTANOIC ACID (DMBA) 被引量:3
8
作者 马德柱 《Chinese Journal of Polymer Science》 SCIE CAS CSCD 2004年第3期225-230,共6页
Three model polyurethane hard segments based on dimethylol butanoic acid (DMBA) and 1,6-hexane diisocyanate (HDI), toluene diisocyanate (TDI) and 4,4'-diphenylmethane diisocyanate (MDI) were prepared by the soluti... Three model polyurethane hard segments based on dimethylol butanoic acid (DMBA) and 1,6-hexane diisocyanate (HDI), toluene diisocyanate (TDI) and 4,4'-diphenylmethane diisocyanate (MDI) were prepared by the solution method. Fourier Infrared (FTIR) spectroscopy was employed to study the H-bonds in these model polyurethanes. The model polyurethane hard segment prepared from HDI and 1,4-butanodiol (BDO) was used for comparison. It was found that the incorporation of the pendent carboxyl through DMBA into the model hard segments weakens the original NH…O = C H-bond but gives more H-bond patterns based on the two H-bond donors, urethane NH and carboxylic OH. The carboxylic dimer is one of the main H-bond types and is stronger than another main H-bond type NH…O=C. In addition, the H-bond in aromatic model hard segments is stronger than that of aliphatic hard segments. The appearance of the free C=O and the fact that almost all N—H is H-bonded suggest that there possibly exist either the third H-bond acceptor or the H-bond formed by one acceptor with two donors. 展开更多
关键词 model hard segment H-BOND Polyurethane with carboxyl FTIR spectroscopy
在线阅读 下载PDF
A New Conception of Image Texture and Remote Sensing Image Segmentation Based on Markov Random Field 被引量:1
9
作者 GONG Yan SHU Ning +2 位作者 LI Jili LIN Liqun LI Xue 《Geo-Spatial Information Science》 2010年第1期16-23,共8页
The texture analysis is often discussed in image processing domain, but most methods are limited within gray-level image or color image, and the present conception of texture is defined mainly based on gray-level imag... The texture analysis is often discussed in image processing domain, but most methods are limited within gray-level image or color image, and the present conception of texture is defined mainly based on gray-level image of single band. One of the essential characters of remote sensing image is multidimensional or even high-dimensional, and the traditional texture conception cannot contain enough information for these. Therefore, it is necessary to pursuit a proper texture definition based on remote sensing images, which is the first discussion in this paper. This paper describes the mapping model of spectral vector in two-dimensional image space using Markov random field (MRF), establishes a texture model of multiband remote sensing image based on MRF, and analyzes the calculations of Gibbs potential energy and Gibbs parameters. Further, this paper also analyzes the limitations of the traditional Gibbs model, prefers a new Gibbs model avoiding estimation of parameters, and presents a new texture segmentation algorithm for hy-perspectral remote sensing image later. 展开更多
关键词 hyperspectral multispectral MRF Gibbs model texture segmentation
原文传递
The use of the greater trochanter marker in the thigh segment model:Implications for hip and knee frontal and transverse plane motion
10
作者 Valentina Graci Gretchen B.Salsich 《Journal of Sport and Health Science》 SCIE 2016年第1期95-100,共6页
Background:The greater trochanter marker is commonly used in 3-dimensional(3D) models;however,its influence on hip and knee kinematics during gait is unclear.Understanding the influence of the greater trochanter marke... Background:The greater trochanter marker is commonly used in 3-dimensional(3D) models;however,its influence on hip and knee kinematics during gait is unclear.Understanding the influence of the greater trochanter marker is important when quantifying frontal and transverse plane hip and knee kinematics,parameters which are particularly relevant to investigate in individuals with conditions such as patellofemoral pain,knee osteoarthritis,anterior cruciate ligament(ACL) injury,and hip pain.The aim of this study was to evaluate the effect of including the greater trochanter in the construction of the thigh segment on hip and knee kinematics during gait.Methods:3D kinematics were collected in 19 healthy subjects during walking using a surface marker system.Hip and knee angles were compared across two thigh segment definitions(with and without greater trochanter) at two time points during stance:peak knee flexion(PKF) and minimum knee flexion(Min KF).Results:Hip and knee angles differed in magnitude and direction in the transverse plane at both time points.In the thigh model with the greater trochanter the hip was more externally rotated than in the thigh model without the greater trochanter(PKF:-9.34°± 5.21° vs.1.40°± 5.22°,Min KF:-5.68°± 4.24° vs.5.01°± 4.86°;p < 0.001).In the thigh model with the greater trochanter,the knee angle was more internally rotated compared to the knee angle calculated using the thigh definition without the greater trochanter(PKF:14.67°± 6.78° vs.4.33°± 4.18°,Min KF:10.54°± 6.71° vs.-0.01°± 2.69°;p < 0.001).Small but significant differences were detected in the sagittal and frontal plane angles at both time points(p < 0.001).Conclusion:Hip and knee kinematics differed across different segment definitions including or excluding the greater trochanter marker,especially in the transverse plane.Therefore when considering whether to include the greater trochanter in the thigh segment model when using a surface markers to calculate 3D kinematics for movement assessment,it is important to have a clear understanding of the effect of different marker sets and segment models in use. 展开更多
关键词 3D motion analysis Thigh segment model Transverse plane motion
暂未订购
Automated labeling and segmentation based on segment anything model:Quantitative analysis of bubbles in gas-liquid flow
11
作者 Jia-Bin Shi Li-Jun You +3 位作者 Jia-Chen Dang Yi-Jun Wang Wei Gong Bo Peng 《Petroleum Science》 2025年第12期5212-5227,共16页
The quantitative analysis of dispersed phases(bubbles,droplets,and particles)in multiphase flow systems represents a persistent technological challenge in petroleum engineering applications,including CO2-enhanced oil ... The quantitative analysis of dispersed phases(bubbles,droplets,and particles)in multiphase flow systems represents a persistent technological challenge in petroleum engineering applications,including CO2-enhanced oil recovery,foam flooding,and unconventional reservoir development.Current characterization methods remain constrained by labor-intensive manual workflows and limited dynamic analysis capabilities,particularly for processing large-scale microscopy data and video sequences that capture critical transient behavior like gas cluster migration and droplet coalescence.These limitations hinder the establishment of robust correlations between pore-scale flow patterns and reservoir-scale production performance.This study introduces a novel computer vision framework that integrates foundation models with lightweight neural networks to address these industry challenges.Leveraging the segment anything model's zero-shot learning capability,we developed an automated workflow that achieves an efficiency improvement of approximately 29 times in bubble labeling compared to manual methods while maintaining less than 2%deviation from expert annotations.Engineering-oriented optimization ensures lightweight deployment with 94%segmentation accuracy,while the integrated quantification system precisely resolves gas saturation,shape factors,and interfacial dynamics,parameters critical for optimizing gas injection strategies and predicting phase redistribution patterns.Validated through microfluidic gas-liquid displacement experiments for discontinuous phase segmentation accuracy,this methodology enables precise bubble morphology quantification with broad application potential in multiphase systems,including emulsion droplet dynamics characterization and particle transport behavior analysis.This work bridges the critical gap between pore-scale dynamics characterization and reservoir-scale simulation requirements,providing a foundational framework for intelligent flow diagnostics and predictive modeling in next-generation digital oilfield systems. 展开更多
关键词 Dispersed phases Bubble segmentation Microfluidic system Segment anything model Gas-liquid flow Artificial intelligence
原文传递
YOLOv8改进算法在油茶果分拣中的应用 被引量:1
12
作者 刘姜毅 高自成 +2 位作者 刘怀粤 尹浇钦 罗媛尹 《林业工程学报》 北大核心 2025年第1期120-127,共8页
现有的油茶果分拣系统所依赖的YOLO等算法的目标检测、实例分割在低尺寸及密集型样本中鲁棒性较差,存在机械臂常抓取到枝叶、抓取不牢固、易脱落等问题。大部分系统使用目标识别,无法准确识别油茶果具体轮廓信息,不能对油茶果进行大小... 现有的油茶果分拣系统所依赖的YOLO等算法的目标检测、实例分割在低尺寸及密集型样本中鲁棒性较差,存在机械臂常抓取到枝叶、抓取不牢固、易脱落等问题。大部分系统使用目标识别,无法准确识别油茶果具体轮廓信息,不能对油茶果进行大小分类。针对这一问题,研究提出了YOWNet模型应对油茶果分拣的小目标、高密度识别任务。首先,研究了自动化边缘标注脚本,脚本调用零样本Segment Anything框架对原有已标注的油茶果目标检测框提取兴趣区间,将其自动转化为边缘标注信息;其次,为了提高模型对小目标的识别能力,研究摒弃了现有的固定感受野的卷积模块,针对油茶果特性提出三维注意力动态卷积模块用于捕捉特征图中的关键信息;最后,研究通过使用Wise⁃IoU损失函数,基于动态非单调聚焦机制的边界框损失,提升边框回归精度。总体网络模型命名为YOWNet,通过与YOLOv8在油茶果上的消融实验对比,试验结果表明:YOWNet模型能够快速准确地识别油茶果实例,在私有数据集上,准确度、Box_loss可达89.90%和0.523。 展开更多
关键词 油茶果 三维动态卷积 实例分割 YOLOv8 Segment Anything model Wise⁃IoU
在线阅读 下载PDF
基于SAM图像处理的堆石料级配计算方法及验证 被引量:1
13
作者 张振伟 蔡可天 +3 位作者 高轩 贺一轩 王建 鲁洋 《水力发电》 2025年第2期80-86,共7页
堆石料级配检测是堆石坝施工过程中质量控制的重要环节,传统方法通常采用现场人工筛分法测量,存在检测样本少、效率低、干扰施工等问题。提出了一种基于图像处理的堆石料级配计算方法,采用国际最新Mata AI开源的通用图像分割大模型Segme... 堆石料级配检测是堆石坝施工过程中质量控制的重要环节,传统方法通常采用现场人工筛分法测量,存在检测样本少、效率低、干扰施工等问题。提出了一种基于图像处理的堆石料级配计算方法,采用国际最新Mata AI开源的通用图像分割大模型Segment Anything Model(SAM)对筑坝堆石料进行自动图像分割,提出堆石长宽比、面积比等堆石形态学几何参数用于提取堆石料图像中的堆石颗粒目标;同时,建立堆石形态数据库、堆石实例分割数据库,并分析参数取值和验证堆石图像级配计算方法的有效性;最后,试验验证结果表明该方法能够有效识别出图像中的堆石颗粒目标,实现级配曲线的智能识别,以及曲率、不均匀系数等级配指标的快速计算。该方法计算获得的级配与真实筛分法测的级配相关性可达0.94,平均绝对误差约5%,能够在堆石坝施工过程中有效辅助检测堆石料的颗粒级配信息,服务堆石坝的施工碾压质量控制。 展开更多
关键词 堆石料 级配 Segment Anything model(SAM) 图像识别 快速检测
在线阅读 下载PDF
SAY-SOD:基于大模型优化的高清遥感图像小目标检测框架 被引量:1
14
作者 曾文龙 贾海涛 +1 位作者 周昊哲 程卓尔 《网络安全与数据治理》 2025年第S1期90-97,共8页
随着遥感技术的不断发展,遥感图像中小目标检测面临着背景复杂、目标尺寸小、像素信息少等挑战,传统检测算法在这一领域的表现存在一定局限。提出了一种基于SAM大模型和改进YOLOv8的小目标检测框架。首先,利用SAM对原始遥感图像进行感... 随着遥感技术的不断发展,遥感图像中小目标检测面临着背景复杂、目标尺寸小、像素信息少等挑战,传统检测算法在这一领域的表现存在一定局限。提出了一种基于SAM大模型和改进YOLOv8的小目标检测框架。首先,利用SAM对原始遥感图像进行感兴趣区域的提取和分割,随后对分割后的图像进行多尺度增强,以提高小目标的显著性。增强后的图像与原图的编号和定位信息一起构建数据集,用于训练改进的YOLOv8模型。改进措施包括特征金字塔网络的优化、引入注意力机制、重新设计损失函数。实验结果表明,SAY-SOD框架在复杂背景下有效提升了遥感小目标的检测精度和鲁棒性,尤其在面对不同尺度和背景变化时表现出色。 展开更多
关键词 遥感图像 小目标检测 Segment Anything model YOLOv8 特征金字塔网络 数据增强 注意力机制
在线阅读 下载PDF
Study on an improved saturation parameter method based on joint inversion of NMR and resistivity data in porous media
15
作者 Peng-Ji Zhang Bao-Zhi Pan +5 位作者 Yu-Hang Guo Li-Hua Zhang Zhao-Wei Si Feng Xu Ming-Yue Zhu Yan Li 《Petroleum Science》 2025年第6期2312-2324,共13页
CO_(2) storage capacity is significantly influenced by the saturation levels of reservoir rocks,with underground fluid saturation typically evaluated using resistivity data.The conductive pathways of fluids in various... CO_(2) storage capacity is significantly influenced by the saturation levels of reservoir rocks,with underground fluid saturation typically evaluated using resistivity data.The conductive pathways of fluids in various states within rock pores differ,alongside variations in conductive mechanisms.To clarify the conductivity of water in rocks across different states,this study employed a three-pore segment saturation model,which corrected for the additional conductivity of clay by categorizing water into large-pore segment,medium-pore segment,and small-pore segment types.Addressing the heterogeneity of tight sandstone reservoirs,we classified distinct pore structures and inverted Archie equation parameters from NMR logging data using a segmented characterization approach,yielding dynamic Archie parameters that vary with depth.Ultimately,we established an improved saturation parameter method based on joint inversion of NMR and resistivity data,which was validated through laboratory experiments and practical downhole applications.The results indicate that this saturation parameter inversion method has been effectively applied in both settings.Furthermore,we discussed the varying conductive behaviors of fluids in large and medium pore segment under saturated and drained states.Lastly,we proposed a workflow for inverting saturation based on downhole data,providing a robust foundation for CO_(2) storage and predicting underground fluid saturation. 展开更多
关键词 NMR T_(2)spectrum Fluid distribution Tight sandstone Groundwater saturation Three-pore segment saturation model Rock pore structure
原文传递
PASS-SAM:Integration of Segment Anything Model for Large-Scale Unsupervised Semantic Segmentation
16
作者 Yin Tang Rui Chen +1 位作者 Gensheng Pei Qiong Wang 《Computational Visual Media》 2025年第3期669-674,共6页
Large-scale unsupervised semantic segmentation(LUSS)is a sophisticated process that aims to segment similar areas within an image without relying on labeled training data.While existing methodologies have made substan... Large-scale unsupervised semantic segmentation(LUSS)is a sophisticated process that aims to segment similar areas within an image without relying on labeled training data.While existing methodologies have made substantial progress in this area,there is ample scope for enhancement.We thus introduce the PASS-SAM model,a comprehensive solution that amalgamates the benefits of various models to improve segmentation performance. 展开更多
关键词 segmentation performance amalgamates benefits various models segment anything model pass sam model segment similar areas large scale unsupervised semantic segmentation
原文传递
基于自适应参数优选分割一切模型的高分辨率遥感影像耕地地块提取
17
作者 马海荣 蔡端午 《中南农业科技》 2025年第11期114-117,131,共5页
针对分割一切模型(Segment anything model,SAM)在高分辨率遥感影像耕地地块提取中参数设置依赖人工经验、缺乏理论指导的问题,提出基于无监督分割质量评价的自适应参数优选方法。通过构建融合几何特征(边缘平滑度、凸度)、光谱特征(平... 针对分割一切模型(Segment anything model,SAM)在高分辨率遥感影像耕地地块提取中参数设置依赖人工经验、缺乏理论指导的问题,提出基于无监督分割质量评价的自适应参数优选方法。通过构建融合几何特征(边缘平滑度、凸度)、光谱特征(平均亮度、光谱方差)和纹理特征(同质性、对比度)的六维评价指标体系,建立综合评分机制,实现SAM关键分割参数的动态自适应优选。结果表明,该方法能够有效提取复杂场景下的耕地地块,在保证对象完整性的同时提升边界提取精度,为农业资源精准管理和智慧农业发展提供了可靠的技术支持。 展开更多
关键词 耕地地块提取 分割一切模型(Segment anything model SAM) 参数自适应优选 无监督评价 高分辨率遥感影像
在线阅读 下载PDF
APPROXIMATION TECHNIQUES FOR APPLICATION OF GENETIC ALGORITHMS TO STRUCTURAL OPTIMIZATION 被引量:1
18
作者 金海波 丁运亮 《Transactions of Nanjing University of Aeronautics and Astronautics》 EI 2003年第2期147-154,共8页
Although the genetic algorithm (GA) has very powerful robustness and fitness, it needs a large size of population and a large number of iterations to reach the optimum result. Especially when GA is used in complex str... Although the genetic algorithm (GA) has very powerful robustness and fitness, it needs a large size of population and a large number of iterations to reach the optimum result. Especially when GA is used in complex structural optimization problems, if the structural reanalysis technique is not adopted, the more the number of finite element analysis (FEA) is, the more the consuming time is. In the conventional structural optimization the number of FEA can be reduced by the structural reanalysis technique based on the approximation techniques and sensitivity analysis. With these techniques, this paper provides a new approximation model-segment approximation model, adopted for the GA application. This segment approximation model can decrease the number of FEA and increase the convergence rate of GA. So it can apparently decrease the computation time of GA. Two examples demonstrate the availability of the new segment approximation model. 展开更多
关键词 approximation techniques segment approximation model genetic algorithms structural optimization sensitivity analysis
在线阅读 下载PDF
Analysis and comparison of retinal vascular parameters under different glucose metabolic status based on deep learning 被引量:1
19
作者 Yan Jiang Di Gong +7 位作者 Xiao-Hong Chen Lin Yang Jing-Jing Xu Qi-Jie Wei Bin-Bin Chen Yong-Jiang Cai Wen-Qun Xi Zhe Zhang 《International Journal of Ophthalmology(English edition)》 SCIE CAS 2024年第9期1581-1591,共11页
AIM:To develop a deep learning-based model for automatic retinal vascular segmentation,analyzing and comparing parameters under diverse glucose metabolic status(normal,prediabetes,diabetes)and to assess the potential ... AIM:To develop a deep learning-based model for automatic retinal vascular segmentation,analyzing and comparing parameters under diverse glucose metabolic status(normal,prediabetes,diabetes)and to assess the potential of artificial intelligence(AI)in image segmentation and retinal vascular parameters for predicting prediabetes and diabetes.METHODS:Retinal fundus photos from 200 normal individuals,200 prediabetic patients,and 200 diabetic patients(600 eyes in total)were used.The U-Net network served as the foundational architecture for retinal arteryvein segmentation.An automatic segmentation and evaluation system for retinal vascular parameters was trained,encompassing 26 parameters.RESULTS:Significant differences were found in retinal vascular parameters across normal,prediabetes,and diabetes groups,including artery diameter(P=0.008),fractal dimension(P=0.000),vein curvature(P=0.003),C-zone artery branching vessel count(P=0.049),C-zone vein branching vessel count(P=0.041),artery branching angle(P=0.005),vein branching angle(P=0.001),artery angle asymmetry degree(P=0.003),vessel length density(P=0.000),and vessel area density(P=0.000),totaling 10 parameters.CONCLUSION:The deep learning-based model facilitates retinal vascular parameter identification and quantification,revealing significant differences.These parameters exhibit potential as biomarkers for prediabetes and diabetes. 展开更多
关键词 deep learning retinal vascular parameters segmentation model DIABETES PREDIABETES
原文传递
Efficient Dataset Generation for Stacked Meat Products Instance Segmentation in Food Automation
20
作者 Hoang Minh Pham Anh Dong Le +2 位作者 Pablo Malvido-Fresnillo Saigopal Vasudevan JoséL.Martínez Lastra 《IEEE/CAA Journal of Automatica Sinica》 2026年第1期224-226,共3页
Dear Editor,This letter presents techniques to simplify dataset generation for instance segmentation of raw meat products,a critical step toward automating food production lines.Accurate segmentation is essential for ... Dear Editor,This letter presents techniques to simplify dataset generation for instance segmentation of raw meat products,a critical step toward automating food production lines.Accurate segmentation is essential for addressing challenges such as occlusions,indistinct edges,and stacked configurations,which demand large,diverse datasets.To meet these demands,we propose two complementary approaches:a semi-automatic annotation interface using tools like the segment anything model(SAM)and GrabCut and a synthetic data generation pipeline leveraging 3D-scanned models.These methods reduce reliance on real meat,mitigate food waste,and improve scalability.Experimental results demonstrate that incorporating synthetic data enhances segmentation model performance and,when combined with real data,further boosts accuracy,paving the way for more efficient automation in the food industry. 展开更多
关键词 dataset generation segment anything model sam food automation raw meat productsa automating food production linesaccurate instance segmentation stacked meat products semi automatic annotation
在线阅读 下载PDF
上一页 1 2 3 下一页 到第
使用帮助 返回顶部