期刊文献+
共找到65,756篇文章
< 1 2 250 >
每页显示 20 50 100
Enhanced Cutaneous Melanoma Segmentation in Dermoscopic Images Using a Dual U-Net Framework with Multi-Path Convolution Block Attention Module and SE-Res-Conv
1
作者 Kun Lan Feiyang Gao +2 位作者 Xiaoliang Jiang Jianzhen Cheng Simon Fong 《Computers, Materials & Continua》 2025年第9期4805-4824,共20页
With the continuous development of artificial intelligence and machine learning techniques,there have been effective methods supporting the work of dermatologist in the field of skin cancer detection.However,object si... With the continuous development of artificial intelligence and machine learning techniques,there have been effective methods supporting the work of dermatologist in the field of skin cancer detection.However,object significant challenges have been presented in accurately segmenting melanomas in dermoscopic images due to the objects that could interfere human observations,such as bubbles and scales.To address these challenges,we propose a dual U-Net network framework for skin melanoma segmentation.In our proposed architecture,we introduce several innovative components that aim to enhance the performance and capabilities of the traditional U-Net.First,we establish a novel framework that links two simplified U-Nets,enabling more comprehensive information exchange and feature integration throughout the network.Second,after cascading the second U-Net,we introduce a skip connection between the decoder and encoder networks,and incorporate a modified receptive field block(MRFB),which is designed to capture multi-scale spatial information.Third,to further enhance the feature representation capabilities,we add a multi-path convolution block attention module(MCBAM)to the first two layers of the first U-Net encoding,and integrate a new squeeze-and-excitation(SE)mechanism with residual connections in the second U-Net.To illustrate the performance of our proposed model,we conducted comprehensive experiments on widely recognized skin datasets.On the ISIC-2017 dataset,the IoU value of our proposed model increased from 0.6406 to 0.6819 and the Dice coefficient increased from 0.7625 to 0.8023.On the ISIC-2018 dataset,the IoU value of proposed model also improved from 0.7138 to 0.7709,while the Dice coefficient increased from 0.8285 to 0.8665.Furthermore,the generalization experiments conducted on the jaw cyst dataset from Quzhou People’s Hospital further verified the outstanding segmentation performance of the proposed model.These findings collectively affirm the potential of our approach as a valuable tool in supporting clinical decision-making in the field of skin cancer detection,as well as advancing research in medical image analysis. 展开更多
关键词 Dual U-Net skin lesion segmentation squeeze-and-excitation modified receptive field block multi-path convolution block attention module
在线阅读 下载PDF
不同栅氧退火工艺下的SiC MOS电容及其界面电学特性
2
作者 付兴中 刘俊哲 +4 位作者 薛建红 尉升升 谭永亮 王德君 张力江 《半导体技术》 CAS 北大核心 2025年第1期32-38,共7页
SiC MOS电容的电学特性和栅氧界面特性是评价和改进SiC MOS器件制造工艺的重要依据。通过依次测试SiC MOS器件的氧化物绝缘特性、界面态密度、偏压温度应力不稳定性(偏压温度应力条件下的平带电压漂移)、氧化物陷阱电荷密度和可动电荷... SiC MOS电容的电学特性和栅氧界面特性是评价和改进SiC MOS器件制造工艺的重要依据。通过依次测试SiC MOS器件的氧化物绝缘特性、界面态密度、偏压温度应力不稳定性(偏压温度应力条件下的平带电压漂移)、氧化物陷阱电荷密度和可动电荷密度的方法,对分别经过氮等离子体氧化后退火(POA)处理、氮氢混合等离子体POA处理、氮氢氧混合等离子体POA处理、氮氢氯混合等离子体POA处理的SiC MOS电容的电学特性和栅氧界面特性进行了分析。结果表明,该方法可以系统地评价SiC MOS电容电学特性和栅氧界面特性,经过氮氢氯混合等离子体POA处理的SiC MOS电容可以满足高温、大场强下长期运行的性能指标。 展开更多
关键词 SiC mos电容 氧化后退火(POA) 平带电压漂移 氧化物陷阱电荷 可动电荷
原文传递
高压MOS器件SPICE建模研究
3
作者 顾祥 彭宏伟 +1 位作者 纪旭明 李金航 《微处理机》 2025年第2期9-13,共5页
针对BSIM3v3模型在高压集成电路中高压MOS器件建模上的局限性,通过深入分析栅源电压和衬源电压对源漏电阻的影响,以及电流准饱和效应、碰撞电离电流、高压寄生管和自热效应等关键因素,提出一种基于BSIM3v3的高压MOS模型改进方法。该方... 针对BSIM3v3模型在高压集成电路中高压MOS器件建模上的局限性,通过深入分析栅源电压和衬源电压对源漏电阻的影响,以及电流准饱和效应、碰撞电离电流、高压寄生管和自热效应等关键因素,提出一种基于BSIM3v3的高压MOS模型改进方法。该方法通过引入不同的压控电阻(VCR)、受控电压源(VCVS)和受控电流源(CCCS)等子电路,构建了一个用于精确模拟高压MOS器件的SPICE Macro模型。这一改进显著提升了高压MOS器件的建模精度,对高压集成电路的设计与仿真具有重要的实际意义和应用价值。 展开更多
关键词 高压mos SPICE模型 参数提取
在线阅读 下载PDF
一种单边高压MOS器件子电路模型搭建
4
作者 李艳艳 任庆宝 《中国集成电路》 2025年第10期30-34,共5页
与通用低压MOS器件相比,高压MOS器件的结构更为复杂。目前基于高压MOS器件的电路设计仿真,业界并没有标准的器件模型。本文主要研究了一种10V高压MOS器件子电路模型的建立。针对高压MOS器件的物理特征,分析了漏端电阻受栅源、栅漏电压... 与通用低压MOS器件相比,高压MOS器件的结构更为复杂。目前基于高压MOS器件的电路设计仿真,业界并没有标准的器件模型。本文主要研究了一种10V高压MOS器件子电路模型的建立。针对高压MOS器件的物理特征,分析了漏端电阻受栅源、栅漏电压的影响,在标准模型BSIM3V3的基础上对漏端电阻的描述进行了改进。结果表明,改进后的模型对测量数据的拟合度较高,大大提高了BSIM3V3 Ⅰ-Ⅴ模型模拟高压MOS器件的精确度。因此,本文对高压集成电路的设计仿真具有一定的指导意义。 展开更多
关键词 BSIM3V3模型 高压mos器件 子电路模型
在线阅读 下载PDF
MG-SLAM: RGB-D SLAM Based on Semantic Segmentation for Dynamic Environment in the Internet of Vehicles 被引量:1
5
作者 Fengju Zhang Kai Zhu 《Computers, Materials & Continua》 2025年第2期2353-2372,共20页
The Internet of Vehicles (IoV) has become an important direction in the field of intelligent transportation, in which vehicle positioning is a crucial part. SLAM (Simultaneous Localization and Mapping) technology play... The Internet of Vehicles (IoV) has become an important direction in the field of intelligent transportation, in which vehicle positioning is a crucial part. SLAM (Simultaneous Localization and Mapping) technology plays a crucial role in vehicle localization and navigation. Traditional Simultaneous Localization and Mapping (SLAM) systems are designed for use in static environments, and they can result in poor performance in terms of accuracy and robustness when used in dynamic environments where objects are in constant movement. To address this issue, a new real-time visual SLAM system called MG-SLAM has been developed. Based on ORB-SLAM2, MG-SLAM incorporates a dynamic target detection process that enables the detection of both known and unknown moving objects. In this process, a separate semantic segmentation thread is required to segment dynamic target instances, and the Mask R-CNN algorithm is applied on the Graphics Processing Unit (GPU) to accelerate segmentation. To reduce computational cost, only key frames are segmented to identify known dynamic objects. Additionally, a multi-view geometry method is adopted to detect unknown moving objects. The results demonstrate that MG-SLAM achieves higher precision, with an improvement from 0.2730 m to 0.0135 m in precision. Moreover, the processing time required by MG-SLAM is significantly reduced compared to other dynamic scene SLAM algorithms, which illustrates its efficacy in locating objects in dynamic scenes. 展开更多
关键词 Visual SLAM dynamic scene semantic segmentation GPU acceleration key segmentation frame
在线阅读 下载PDF
High-Precision Brain Tumor Segmentation using a Progressive Layered U-Net(PLU-Net)with Multi-Scale Data Augmentation and Attention Mechanisms on Multimodal Magnetic Resonance Imaging 被引量:1
6
作者 Noman Ahmed Siddiqui Muhammad Tahir Qadri +1 位作者 Muhammad Ovais Akhter Zain Anwar Ali 《Instrumentation》 2025年第1期77-92,共16页
Brain tumors present significant challenges in medical diagnosis and treatment,where early detection is crucial for reducing morbidity and mortality rates.This research introduces a novel deep learning model,the Progr... Brain tumors present significant challenges in medical diagnosis and treatment,where early detection is crucial for reducing morbidity and mortality rates.This research introduces a novel deep learning model,the Progressive Layered U-Net(PLU-Net),designed to improve brain tumor segmentation accuracy from Magnetic Resonance Imaging(MRI)scans.The PLU-Net extends the standard U-Net architecture by incorporating progressive layering,attention mechanisms,and multi-scale data augmentation.The progressive layering involves a cascaded structure that refines segmentation masks across multiple stages,allowing the model to capture features at different scales and resolutions.Attention gates within the convolutional layers selectively focus on relevant features while suppressing irrelevant ones,enhancing the model's ability to delineate tumor boundaries.Additionally,multi-scale data augmentation techniques increase the diversity of training data and boost the model's generalization capabilities.Evaluated on the BraTS 2021 dataset,the PLU-Net achieved state-of-the-art performance with a dice coefficient of 0.91,specificity of 0.92,sensitivity of 0.89,Hausdorff95 of 2.5,outperforming other modified U-Net architectures in segmentation accuracy.These results underscore the effectiveness of the PLU-Net in improving brain tumor segmentation from MRI scans,supporting clinicians in early diagnosis,treatment planning,and the development of new therapies. 展开更多
关键词 brain tumor segmentation MRI machine learning BraTS deep learning model PLU-Net
原文传递
基于多层MoS_(2)材料的MOSFET器件退火效应研究
7
作者 韩涛 廖乃镘 +1 位作者 孙艳敏 丁劲松 《今日制造与升级》 2025年第1期48-50,共3页
文章采用机械剥离法在SiO_(2)/Si衬底制备多层二硫化钼(MoS_(2))材料,通过电子束光刻和蒸发工艺制备MOSFET器件。使用光学显微镜、拉曼光谱仪和扫描电镜对多层MoS_(2)材料进行测试表征,并使用半导体参数分析仪对MOSFET器件的电学性能进... 文章采用机械剥离法在SiO_(2)/Si衬底制备多层二硫化钼(MoS_(2))材料,通过电子束光刻和蒸发工艺制备MOSFET器件。使用光学显微镜、拉曼光谱仪和扫描电镜对多层MoS_(2)材料进行测试表征,并使用半导体参数分析仪对MOSFET器件的电学性能进行测试分析。探索退火效应对器件电学性能的影响。 展开更多
关键词 mos2材料 mosFET器件 测试表征 退火效应 电学性能
在线阅读 下载PDF
MOS管在全固态中波发射机功放模块中的应用分析 被引量:1
8
作者 甘时伟 《中国宽带》 2025年第6期169-171,共3页
为研究MOS管在全固态中波发射机功放模块中的应用,本文详细分析了MOS管的结构特性、工作区域特性和温度特性,并探讨了其在全固态中波发射机功放模块中的具体应用。文章提出了基于IRFP350 MOS管的桥式开关放大架构设计,并通过驱动优化、... 为研究MOS管在全固态中波发射机功放模块中的应用,本文详细分析了MOS管的结构特性、工作区域特性和温度特性,并探讨了其在全固态中波发射机功放模块中的具体应用。文章提出了基于IRFP350 MOS管的桥式开关放大架构设计,并通过驱动优化、故障诊断与状态监测机制等多方面的改进,实现了高效率和高可靠性的功放模块。实验结果验证了该方案的有效性,显著提升了功放模块的效率和稳定性,降低了维护成本。 展开更多
关键词 mos 全固态中波发射机 功放模块 IRFP350
在线阅读 下载PDF
Stochastic Augmented-Based Dual-Teaching for Semi-Supervised Medical Image Segmentation
9
作者 Hengyang Liu Yang Yuan +2 位作者 Pengcheng Ren Chengyun Song Fen Luo 《Computers, Materials & Continua》 SCIE EI 2025年第1期543-560,共18页
Existing semi-supervisedmedical image segmentation algorithms use copy-paste data augmentation to correct the labeled-unlabeled data distribution mismatch.However,current copy-paste methods have three limitations:(1)t... Existing semi-supervisedmedical image segmentation algorithms use copy-paste data augmentation to correct the labeled-unlabeled data distribution mismatch.However,current copy-paste methods have three limitations:(1)training the model solely with copy-paste mixed pictures from labeled and unlabeled input loses a lot of labeled information;(2)low-quality pseudo-labels can cause confirmation bias in pseudo-supervised learning on unlabeled data;(3)the segmentation performance in low-contrast and local regions is less than optimal.We design a Stochastic Augmentation-Based Dual-Teaching Auxiliary Training Strategy(SADT),which enhances feature diversity and learns high-quality features to overcome these problems.To be more precise,SADT trains the Student Network by using pseudo-label-based training from Teacher Network 1 and supervised learning with labeled data,which prevents the loss of rare labeled data.We introduce a bi-directional copy-pastemask with progressive high-entropy filtering to reduce data distribution disparities and mitigate confirmation bias in pseudo-supervision.For the mixed images,Deep-Shallow Spatial Contrastive Learning(DSSCL)is proposed in the feature spaces of Teacher Network 2 and the Student Network to improve the segmentation capabilities in low-contrast and local areas.In this procedure,the features retrieved by the Student Network are subjected to a random feature perturbation technique.On two openly available datasets,extensive trials show that our proposed SADT performs much better than the state-ofthe-art semi-supervised medical segmentation techniques.Using only 10%of the labeled data for training,SADT was able to acquire a Dice score of 90.10%on the ACDC(Automatic Cardiac Diagnosis Challenge)dataset. 展开更多
关键词 SEMI-SUPERVISED medical image segmentation contrastive learning stochastic augmented
在线阅读 下载PDF
深层放电脉冲特征模拟及对MOS器件损伤试验
10
作者 陈益峰 冯娜 +2 位作者 王金晓 高志良 邵焜 《真空与低温》 2025年第5期619-625,共7页
深层带电效应已经成为威胁卫星在轨安全的主要空间环境效应因素。为准确模拟深层放电脉冲特征,实现星用器件抗深层放电的性能评价,提出了采用RLC优化电路与商用静电放电发生器相结合的方法。建立了一种能够定量研究深层放电对电子器件... 深层带电效应已经成为威胁卫星在轨安全的主要空间环境效应因素。为准确模拟深层放电脉冲特征,实现星用器件抗深层放电的性能评价,提出了采用RLC优化电路与商用静电放电发生器相结合的方法。建立了一种能够定量研究深层放电对电子器件损伤效应的测试装置,并对MOS器件开展了试验测试,分析了其损伤机制。研究结果表明,经RLC电路优化后的脉冲波形、电流幅值、持续时间等参数均符合深层放电特征,实现了深层放电脉冲的有效模拟。当脉冲电流幅值增加至9 A时,放电脉冲导致MOS器件的SiO2绝缘层击穿,使MOS器件发生不可恢复的“硬损伤”现象;同时研究发现,放电电流低于损伤阈值时放电脉冲同样会造成绝缘层损伤,且该损伤可通过多次累积最终导致绝缘层彻底击穿。 展开更多
关键词 空间辐射环境 深层带电效应 脉冲特征 mos器件 损伤机制 累积效应
在线阅读 下载PDF
Semantic Segmentation of Lumbar Vertebrae Using Meijering U-Net(MU-Net)on Spine Magnetic Resonance Images
11
作者 Lakshmi S V V Shiloah Elizabeth Darmanayagam Sunil Retmin Raj Cyril 《Computer Modeling in Engineering & Sciences》 SCIE EI 2025年第1期733-757,共25页
Lower back pain is one of the most common medical problems in the world and it is experienced by a huge percentage of people everywhere.Due to its ability to produce a detailed view of the soft tissues,including the s... Lower back pain is one of the most common medical problems in the world and it is experienced by a huge percentage of people everywhere.Due to its ability to produce a detailed view of the soft tissues,including the spinal cord,nerves,intervertebral discs,and vertebrae,Magnetic Resonance Imaging is thought to be the most effective method for imaging the spine.The semantic segmentation of vertebrae plays a major role in the diagnostic process of lumbar diseases.It is difficult to semantically partition the vertebrae in Magnetic Resonance Images from the surrounding variety of tissues,including muscles,ligaments,and intervertebral discs.U-Net is a powerful deep-learning architecture to handle the challenges of medical image analysis tasks and achieves high segmentation accuracy.This work proposes a modified U-Net architecture namely MU-Net,consisting of the Meijering convolutional layer that incorporates the Meijering filter to perform the semantic segmentation of lumbar vertebrae L1 to L5 and sacral vertebra S1.Pseudo-colour mask images were generated and used as ground truth for training the model.The work has been carried out on 1312 images expanded from T1-weighted mid-sagittal MRI images of 515 patients in the Lumbar Spine MRI Dataset publicly available from Mendeley Data.The proposed MU-Net model for the semantic segmentation of the lumbar vertebrae gives better performance with 98.79%of pixel accuracy(PA),98.66%of dice similarity coefficient(DSC),97.36%of Jaccard coefficient,and 92.55%mean Intersection over Union(mean IoU)metrics using the mentioned dataset. 展开更多
关键词 Computer aided diagnosis(CAD) magnetic resonance imaging(MRI) semantic segmentation lumbar vertebrae deep learning U-Net model
在线阅读 下载PDF
Dual encoding feature filtering generalized attention UNET for retinal vessel segmentation
12
作者 ISLAM Md Tauhidul WU Da-Wen +6 位作者 TANG Qing-Qing ZHAO Kai-Yang YIN Teng LI Yan-Fei SHANG Wen-Yi LIU Jing-Yu ZHANG Hai-Xian 《四川大学学报(自然科学版)》 北大核心 2025年第1期79-95,共17页
Retinal blood vessel segmentation is crucial for diagnosing ocular and cardiovascular diseases.Although the introduction of U-Net in 2015 by Olaf Ronneberger significantly advanced this field,yet issues like limited t... Retinal blood vessel segmentation is crucial for diagnosing ocular and cardiovascular diseases.Although the introduction of U-Net in 2015 by Olaf Ronneberger significantly advanced this field,yet issues like limited training data,imbalance data distribution,and inadequate feature extraction persist,hindering both the segmentation performance and optimal model generalization.Addressing these critical issues,the DEFFA-Unet is proposed featuring an additional encoder to process domain-invariant pre-processed inputs,thereby improving both richer feature encoding and enhanced model generalization.A feature filtering fusion module is developed to ensure the precise feature filtering and robust hybrid feature fusion.In response to the task-specific need for higher precision where false positives are very costly,traditional skip connections are replaced with the attention-guided feature reconstructing fusion module.Additionally,innovative data augmentation and balancing methods are proposed to counter data scarcity and distribution imbalance,further boosting the robustness and generalization of the model.With a comprehensive suite of evaluation metrics,extensive validations on four benchmark datasets(DRIVE,CHASEDB1,STARE,and HRF)and an SLO dataset(IOSTAR),demonstrate the proposed method’s superiority over both baseline and state-of-the-art models.Particularly the proposed method significantly outperforms the compared methods in cross-validation model generalization. 展开更多
关键词 Vessel segmentation Data balancing Data augmentation Dual encoder Attention Mechanism Model generalization
在线阅读 下载PDF
CableSAM:an efficient automatic segmentation method for aircraft cabin cables
13
作者 LING Aihua WANG Junwen +1 位作者 LU Jiaming LIU Ruyu 《Optoelectronics Letters》 2025年第3期183-187,共5页
Cabin cables,as critical components of an aircraft's electrical system,significantly impact the operational efficiency and safety of the aircraft.The existing cable segmentation methods in civil aviation cabins ar... Cabin cables,as critical components of an aircraft's electrical system,significantly impact the operational efficiency and safety of the aircraft.The existing cable segmentation methods in civil aviation cabins are limited,especially in automation,heavily dependent on large amounts of data and resources,lacking the flexibility to adapt to different scenarios.To address these challenges,this paper introduces a novel image segmentation model,CableSAM,specifically designed for automated segmentation of cabin cables.CableSAM improves segmentation efficiency and accuracy using knowledge distillation and employs a context ensemble strategy.It accurately segments cables in various scenarios with minimal input prompts.Comparative experiments on three cable datasets demonstrate that CableSAM surpasses other advanced cable segmentation methods in performance. 展开更多
关键词 image segmentation aircraft cabin automatic segmentation automated segmentation cabin cablesas civil aviation cabins cable segmentation knowledge distillation
原文传递
Optimized algorithm for image semantic segmentation compression algorithm in video surveillance scenarios
14
作者 ZHANG Yangmei ZHANG Xishan +1 位作者 ZHANG Shuo LI Jintao 《High Technology Letters》 2025年第2期194-203,共10页
In recent years,video coding has been widely applied in the field of video image processing to remove redundant information and improve data transmission efficiency.However,during the video coding process,irrelevant o... In recent years,video coding has been widely applied in the field of video image processing to remove redundant information and improve data transmission efficiency.However,during the video coding process,irrelevant objects such as background elements are often encoded due to environmental disturbances,resulting in the wastage of computational resources.Existing research on video coding efficiency optimization primarily focuses on optimizing encoding units during intra-frame or inter frame prediction after the generation of coding units,neglecting the optimization of video images before coding unit generation.To address this challenge,This work proposes an image semantic segmentation compression algorithm based on macroblock encoding,called image semantic segmentation compression algorithm based on macroblock encoding(ISSC-ME),which consists of three modules.(1)The semantic label generation module generates interesting object labels using a grid-based approach to reduce redundant coding of consecutive frames.(2)The image segmentation network module generates a semantic segmentation image using U-Net.(3)The macroblock coding module,is a block segmentation-based video encoding and decoding algorithm used to compress images and improve video transmission efficiency.Experimental results show that the proposed image semantic segmentation optimization algorithm can reduce the computational costs,and improve the overall accuracy by 1.00%and the mean intersection over union(IoU)by 1.20%.In addition,the proposed compression algorithm utilizes macroblock fusion,resulting in the image compression rate achieving 80.64%.It has been proven that the proposed algorithm greatly reduces data storage and transmission,and enables fast image compression processing at the millisecond level. 展开更多
关键词 macroblock encoding semantic segmentation segmentation compression
在线阅读 下载PDF
U-Net-Based Medical Image Segmentation:A Comprehensive Analysis and Performance Review
15
作者 Aliyu Abdulfatah Zhang Sheng Yirga Eyasu Tenawerk 《Journal of Electronic Research and Application》 2025年第1期202-208,共7页
Medical image segmentation has become a cornerstone for many healthcare applications,allowing for the automated extraction of critical information from images such as Computed Tomography(CT)scans,Magnetic Resonance Im... Medical image segmentation has become a cornerstone for many healthcare applications,allowing for the automated extraction of critical information from images such as Computed Tomography(CT)scans,Magnetic Resonance Imaging(MRIs),and X-rays.The introduction of U-Net in 2015 has significantly advanced segmentation capabilities,especially for small datasets commonly found in medical imaging.Since then,various modifications to the original U-Net architecture have been proposed to enhance segmentation accuracy and tackle challenges like class imbalance,data scarcity,and multi-modal image processing.This paper provides a detailed review and comparison of several U-Net-based architectures,focusing on their effectiveness in medical image segmentation tasks.We evaluate performance metrics such as Dice Similarity Coefficient(DSC)and Intersection over Union(IoU)across different U-Net variants including HmsU-Net,CrossU-Net,mResU-Net,and others.Our results indicate that architectural enhancements such as transformers,attention mechanisms,and residual connections improve segmentation performance across diverse medical imaging applications,including tumor detection,organ segmentation,and lesion identification.The study also identifies current challenges in the field,including data variability,limited dataset sizes,and issues with class imbalance.Based on these findings,the paper suggests potential future directions for improving the robustness and clinical applicability of U-Net-based models in medical image segmentation. 展开更多
关键词 U-Net architecture Medical image segmentation DSC IOU Transformer-based segmentation
在线阅读 下载PDF
Pre-trained SAM as data augmentation for image segmentation
16
作者 Junjun Wu Yunbo Rao +1 位作者 Shaoning Zeng Bob Zhang 《CAAI Transactions on Intelligence Technology》 2025年第1期268-282,共15页
Data augmentation plays an important role in training deep neural model by expanding the size and diversity of the dataset.Initially,data augmentation mainly involved some simple transformations of images.Later,in ord... Data augmentation plays an important role in training deep neural model by expanding the size and diversity of the dataset.Initially,data augmentation mainly involved some simple transformations of images.Later,in order to increase the diversity and complexity of data,more advanced methods appeared and evolved to sophisticated generative models.However,these methods required a mass of computation of training or searching.In this paper,a novel training-free method that utilises the Pre-Trained Segment Anything Model(SAM)model as a data augmentation tool(PTSAM-DA)is proposed to generate the augmented annotations for images.Without the need for training,it obtains prompt boxes from the original annotations and then feeds the boxes to the pre-trained SAM to generate diverse and improved annotations.In this way,annotations are augmented more ingenious than simple manipulations without incurring huge computation for training a data augmentation model.Multiple comparative experiments on three datasets are conducted,including an in-house dataset,ADE20K and COCO2017.On this in-house dataset,namely Agricultural Plot Segmentation Dataset,maximum improvements of 3.77%and 8.92%are gained in two mainstream metrics,mIoU and mAcc,respectively.Consequently,large vision models like SAM are proven to be promising not only in image segmentation but also in data augmentation. 展开更多
关键词 data augmentation image segmentation large model segment anything model
在线阅读 下载PDF
CW-HRNet:Constrained Deformable Sampling and Wavelet-Guided Enhancement for Lightweight Crack Segmentation
17
作者 Dewang Ma 《Journal of Electronic Research and Application》 2025年第5期269-280,共12页
This paper presents CW-HRNet,a high-resolution,lightweight crack segmentation network designed to address challenges in complex scenes with slender,deformable,and blurred crack structures.The model incorporates two ke... This paper presents CW-HRNet,a high-resolution,lightweight crack segmentation network designed to address challenges in complex scenes with slender,deformable,and blurred crack structures.The model incorporates two key modules:Constrained Deformable Convolution(CDC),which stabilizes geometric alignment by applying a tanh limiter and learnable scaling factor to the predicted offsets,and the Wavelet Frequency Enhancement Module(WFEM),which decomposes features using Haar wavelets to preserve low-frequency structures while enhancing high-frequency boundaries and textures.Evaluations on the CrackSeg9k benchmark demonstrate CW-HRNet’s superior performance,achieving 82.39%mIoU with only 7.49M parameters and 10.34 GFLOPs,outperforming HrSegNet-B48 by 1.83% in segmentation accuracy with minimal complexity overhead.The model also shows strong cross-dataset generalization,achieving 60.01%mIoU and 66.22%F1 on Asphalt3k without fine-tuning.These results highlight CW-HRNet’s favorable accuracyefficiency trade-off for real-world crack segmentation tasks. 展开更多
关键词 Crack segmentation Lightweight semantic segmentation Deformable convolution Wavelet transform Road infrastructure
在线阅读 下载PDF
Global-Local Hybrid Modulation Network for Retinal Vessel and Coronary Angiograph Segmentation
18
作者 Pengfei Cai Biyuan Li +2 位作者 Jinying Ma Xiao Tian Jun Yan 《Journal of Bionic Engineering》 2025年第4期2050-2074,共25页
The segmentation of retinal vessels and coronary angiographs is essential for diagnosing conditions such as glaucoma,diabetes,hypertension,and coronary artery disease.However,retinal vessels and coronary angiographs a... The segmentation of retinal vessels and coronary angiographs is essential for diagnosing conditions such as glaucoma,diabetes,hypertension,and coronary artery disease.However,retinal vessels and coronary angiographs are characterized by low contrast and complex structures,posing challenges for vessel segmentation.Moreover,CNN-based approaches are limited in capturing long-range pixel relationships due to their focus on local feature extraction,while ViT-based approaches struggle to capture fine local details,impacting tasks like vessel segmentation that require precise boundary detection.To address these issues,in this paper,we propose a Global–Local Hybrid Modulation Network(GLHM-Net),a dual-encoder architecture that combines the strengths of CNNs and ViTs for vessel segmentation.First,the Hybrid Non-Local Transformer Block(HNLTB)is proposed to efficiently consolidate long-range spatial dependencies into a compact feature representation,providing a global perspective while significantly reducing computational overhead.Second,the Collaborative Attention Fusion Block(CAFB)is proposed to more effectively integrate local and global vessel features at the same hierarchical level during the encoding phase.Finally,the proposed Feature Cross-Modulation Block(FCMB)better complements the local and global features in the decoding stage,effectively enhancing feature learning and minimizing information loss.The experiments conducted on the DRIVE,CHASEDB1,DCA1,and XCAD datasets,achieving AUC values of 0.9811,0.9864,0.9915,and 0.9919,F1 scores of 0.8288,0.8202,0.8040,and 0.8150,and IOU values of 0.7076,0.6952,0.6723,and 0.6878,respectively,demonstrate the strong performance of our proposed network for vessel segmentation. 展开更多
关键词 Non-local transformer Feature fusion Collaborative attention Retinal vessel segmentation Coronary angiograph segmentation
在线阅读 下载PDF
CAMSNet:Few-Shot Semantic Segmentation via Class Activation Map and Self-Cross Attention Block
19
作者 Jingjing Yan Xuyang Zhuang +2 位作者 Xuezhuan Zhao Xiaoyan Shao Jiaqi Han 《Computers, Materials & Continua》 2025年第3期5363-5386,共24页
The key to the success of few-shot semantic segmentation(FSS)depends on the efficient use of limited annotated support set to accurately segment novel classes in the query set.Due to the few samples in the support set... The key to the success of few-shot semantic segmentation(FSS)depends on the efficient use of limited annotated support set to accurately segment novel classes in the query set.Due to the few samples in the support set,FSS faces challenges such as intra-class differences,background(BG)mismatches between query and support sets,and ambiguous segmentation between the foreground(FG)and BG in the query set.To address these issues,The paper propose a multi-module network called CAMSNet,which includes four modules:the General Information Module(GIM),the Class Activation Map Aggregation(CAMA)module,the Self-Cross Attention(SCA)Block,and the Feature Fusion Module(FFM).In CAMSNet,The GIM employs an improved triplet loss,which concatenates word embedding vectors and support prototypes as anchors,and uses local support features of FG and BG as positive and negative samples to help solve the problem of intra-class differences.Then for the first time,the Class Activation Map(CAM)from the Weakly Supervised Semantic Segmentation(WSSS)is applied to FSS within the CAMA module.This method replaces the traditional use of cosine similarity to locate query information.Subsequently,the SCA Block processes the support and query features aggregated by the CAMA module,significantly enhancing the understanding of input information,leading to more accurate predictions and effectively addressing BG mismatch and ambiguous FG-BG segmentation.Finally,The FFM combines general class information with the enhanced query information to achieve accurate segmentation of the query image.Extensive Experiments on PASCAL and COCO demonstrate that-5i-20ithe CAMSNet yields superior performance and set a state-of-the-art. 展开更多
关键词 Few-shot semantic segmentation semantic segmentation meta learning
在线阅读 下载PDF
A 3D semantic segmentation network for accurate neuronal soma segmentation
20
作者 Li Ma Qi Zhong +2 位作者 Yezi Wang Xiaoquan Yang Qian Du 《Journal of Innovative Optical Health Sciences》 2025年第1期67-83,共17页
Neuronal soma segmentation plays a crucial role in neuroscience applications.However,the fine structure,such as boundaries,small-volume neuronal somata and fibers,are commonly present in cell images,which pose a chall... Neuronal soma segmentation plays a crucial role in neuroscience applications.However,the fine structure,such as boundaries,small-volume neuronal somata and fibers,are commonly present in cell images,which pose a challenge for accurate segmentation.In this paper,we propose a 3D semantic segmentation network for neuronal soma segmentation to address this issue.Using an encoding-decoding structure,we introduce a Multi-Scale feature extraction and Adaptive Weighting fusion module(MSAW)after each encoding block.The MSAW module can not only emphasize the fine structures via an upsampling strategy,but also provide pixel-wise weights to measure the importance of the multi-scale features.Additionally,a dynamic convolution instead of normal convolution is employed to better adapt the network to input data with different distributions.The proposed MSAW-based semantic segmentation network(MSAW-Net)was evaluated on three neuronal soma images from mouse brain and one neuronal soma image from macaque brain,demonstrating the efficiency of the proposed method.It achieved an F1 score of 91.8%on Fezf2-2A-CreER dataset,97.1%on LSL-H2B-GFP dataset,82.8%on Thy1-EGFP-Mline dataset,and 86.9%on macaque dataset,achieving improvements over the 3D U-Net model by 3.1%,3.3%,3.9%,and 2.3%,respectively. 展开更多
关键词 Neuronal soma segmentation semantic segmentation network multi-scale feature extraction adaptive weighting fusion
原文传递
上一页 1 2 250 下一页 到第
使用帮助 返回顶部