期刊文献+
共找到63,807篇文章
< 1 2 250 >
每页显示 20 50 100
How precise is precise enough?Tree crown segmentation using high resolution close-up multispectral UAV images and its effect on NDVI accuracy in Fraxinus excelsior L.trees
1
作者 Lisa Buchner Anna-Katharina Eisen Susanne Jochner-Oette 《Journal of Forestry Research》 2026年第2期16-30,共15页
Detailed individual tree crown segmentation is highly relevant for the detection and monitoring of Fraxinus excelsior L.trees affected by ash dieback,a major threat to common ash populations across Europe.In this stud... Detailed individual tree crown segmentation is highly relevant for the detection and monitoring of Fraxinus excelsior L.trees affected by ash dieback,a major threat to common ash populations across Europe.In this study,both fine and coarse crown segmentation methods were applied to close-range multispectral UAV imagery.The fine tree crown segmentation method utilized a novel unsupervised machine learning approach based on a blended NIR-NDVI image,whereas the coarse segmentation relied on the segment anything model(SAM).Both methods successfully delineated tree crown outlines,however,only the fine segmentation accurately captured internal canopy gaps.Despite these structural differences,mean NDVI values calculated per tree crown revealed no significant differences between the two approaches,indicating that coarse segmentation is sufficient for mean vegetation index assessments.Nevertheless,the fine segmentation revealed increased heterogeneity in NDVI values in more severely damaged trees,underscoring its value for detailed structural and health analyses.Furthermore,the fine segmentation workflow proved transferable to both individual UAV images and orthophotos from broader UAV surveys.For applications focused on structural integrity and spatial variation in canopy health,the fine segmentation approach is recommended. 展开更多
关键词 Leaf mass segmentation Machine learning segment anything model Ash dieback
在线阅读 下载PDF
An intelligent segmentation method for leakage points in central serous chorioretinopathy based on fluorescein angiography images
2
作者 Jian-Guo Xu Yong-Chi Liu +4 位作者 Fen Zhou Jian-Xin Shen Zhi-Peng Yan Xin-Ya Hu Wei-Hua Yang 《International Journal of Ophthalmology(English edition)》 2026年第3期421-433,共13页
AIM:To construct an intelligent segmentation scheme for precise localization of central serous chorioretinopathy(CSC)leakage points,thereby enabling ophthalmologists to deliver accurate laser treatment without navigat... AIM:To construct an intelligent segmentation scheme for precise localization of central serous chorioretinopathy(CSC)leakage points,thereby enabling ophthalmologists to deliver accurate laser treatment without navigational laser equipment.METHODS:A dataset with dual labels(point-level and pixel-level)was first established based on fundus fluorescein angiography(FFA)images of CSC and subsequently divided into training(102 images),validation(40 images),and test(40 images)datasets.An intelligent segmentation method was then developed,based on the You Only Look Once version 8 Pose Estimation(YOLOv8-Pose)model and segment anything model(SAM),to segment CSC leakage points.Next,the YOLOv8-Pose model was trained for 200 epochs,and the best-performing model was selected to form the optimal combination with SAM.Additionally,the classic five types of U-Net series models[i.e.,U-Net,recurrent residual U-Net(R2U-Net),attention U-Net(AttU-Net),recurrent residual attention U-Net(R2AttUNet),and nested U-Net(UNet^(++))]were initialized with three random seeds and trained for 200 epochs,resulting in a total of 15 baseline models for comparison.Finally,based on the metrics including Dice similarity coefficient(DICE),intersection over union(IoU),precision,recall,precisionrecall(PR)curve,and receiver operating characteristic(ROC)curve,the proposed method was compared with baseline models through quantitative and qualitative experiments for leakage point segmentation,thereby demonstrating its effectiveness.RESULTS:With the increase of training epochs,the mAP50-95,Recall,and precision of the YOLOv8-Pose model showed a significant increase and tended to stabilize,and it achieved a preliminary localization success rate of 90%(i.e.,36 images)for CSC leakage points in 40 test images.Using manually expert-annotated pixel-level labels as the ground truth,the proposed method achieved outcomes with a DICE of 57.13%,an IoU of 45.31%,a precision of 45.91%,a recall of 93.57%,an area under the PR curve(AUC-PR)of 0.78 and an area under the ROC curve(AUC-ROC)of 0.97,which enables more accurate segmentation of CSC leakage points.CONCLUSION:By combining the precise localization capability of the YOLOv8-Pose model with the robust and flexible segmentation ability of SAM,the proposed method not only demonstrates the effectiveness of the YOLOv8-Pose model in detecting keypoint coordinates of CSC leakage points from the perspective of application innovation but also establishes a novel approach for accurate segmentation of CSC leakage points through the“detect-then-segment”strategy,thereby providing a potential auxiliary means for the automatic and precise realtime localization of leakage points during traditional laser photocoagulation for CSC. 展开更多
关键词 You Only Look Once version 8 Pose Estimation segment anything model central serous chorioretinopathy leakage point segmentation
原文传递
Precision organoid segmentation technique(POST):accurate organoid segmentation in challenging bright-field images 被引量:1
3
作者 Xuan Du Yuchen Li +5 位作者 Jiaping Song Zilin Zhang Jing Zhang Yanhui Li Zaozao Chen Zhongze Gu 《Bio-Design and Manufacturing》 2026年第1期80-93,I0013-I0016,共18页
Organoids possess immense potential for unraveling the intricate functions of human tissues and facilitating preclinical disease treatment.Their applications span from high-throughput drug screening to the modeling of... Organoids possess immense potential for unraveling the intricate functions of human tissues and facilitating preclinical disease treatment.Their applications span from high-throughput drug screening to the modeling of complex diseases,with some even achieving clinical translation.Changes in the overall size,shape,boundary,and other morphological features of organoids provide a noninvasive method for assessing organoid drug sensitivity.However,the precise segmentation of organoids in bright-field microscopy images is made difficult by the complexity of the organoid morphology and interference,including overlapping organoids,bubbles,dust particles,and cell fragments.This paper introduces the precision organoid segmentation technique(POST),which is a deep-learning algorithm for segmenting challenging organoids under simple bright-field imaging conditions.Unlike existing methods,POST accurately segments each organoid and eliminates various artifacts encountered during organoid culturing and imaging.Furthermore,it is sensitive to and aligns with measurements of organoid activity in drug sensitivity experiments.POST is expected to be a valuable tool for drug screening using organoids owing to its capability of automatically and rapidly eliminating interfering substances and thereby streamlining the organoid analysis and drug screening process. 展开更多
关键词 Organoid Drug screening Deep learning Image segmentation
暂未订购
Detection of co-phasing error in segmented mirror based on extended Young’s interferometry combined with Vision Transformer
4
作者 LIU Yin-ling YAO Chi +3 位作者 OUYANG Shang-tao WAN Yi-rong CHEN Mo LI Bin 《中国光学(中英文)》 北大核心 2026年第1期205-218,共14页
Due to the inability of manufacturing a single monolithic mirror at the 10-meter scales,segmented mirrors have become indispensable tools in modern astronomical research.However,to match the imaging performance of the... Due to the inability of manufacturing a single monolithic mirror at the 10-meter scales,segmented mirrors have become indispensable tools in modern astronomical research.However,to match the imaging performance of the monolithic counterpart,the sub-mirrors must maintain precise co-phasing.Piston error critically degrades segmented mirror imaging quality,necessitating efficient and precise detection.To ad-dress the limitations that the conventional circular-aperture diffraction with two-wavelength algorithm is sus-ceptible to decentration errors,and the traditional convolutional neural networks(CNNs)struggle to capture global features under large-range piston errors due to their restricted local receptive fields,this paper pro-poses a method that integrates extended Young’s interference principles with a Vision Transformer(ViT)to detect piston error.By suppressing decentration error interference through two symmetrically arranged aper-tures and extending the measurement range to±7.95μm via a two-wavelength(589 nm/600 nm)algorithm.This approach exploits ViT’s self-attention mechanism to model global characteristics of interference fringes.Unlike CNNs constrained by local convolutional kernels,the ViT significantly improves sensitivity to inter-ferogram periodicity.The simulation results demonstrate that the proposed method achieves a measurement accuracy of 5 nm(0.0083λ0)across the range of±7.95μm,while maintaining an accuracy exceeding 95%in the presence of Gaussian noise(SNR≥15 dB),Poisson noise(λ≥9 photons/pixel),and sub-mirror gap er-ror(Egap≤0.2)interference.Moreover,the detection speed shows significant improvement compared to the cross-correlation algorithm.This study establishes an accurate,robust framework for segmented mirror error detection,advancing high-precision astronomical observation. 展开更多
关键词 segmented mirror co-phasing piston errors ViT Young’s interference principles
在线阅读 下载PDF
Different approaches of laparoscopic anatomic hepatectomy of segment 7 for hepatocellular carcinoma:A multicenter study
5
作者 Xing-Ru Wang Qi-Fan Zhang +6 位作者 Wei Cheng Xiao Liang Jun Cao Yong-Gang Wei Jian-Wei Li Hong-Guang Wang Chinese Research Group for Minimally Invasive Anatomical Liver Resection(The Workshop of Liver Future 《Hepatobiliary & Pancreatic Diseases International》 2026年第1期42-51,共10页
Background:Laparoscopic anatomic hepatectomy of segment 7(LAH-S7)is a challenging surgery.In this study we aimed to investigate surgical and oncological outcomes of various approaches of LAH-S7 in patients with hepato... Background:Laparoscopic anatomic hepatectomy of segment 7(LAH-S7)is a challenging surgery.In this study we aimed to investigate surgical and oncological outcomes of various approaches of LAH-S7 in patients with hepatocellular carcinoma(HCC).A particular focus was placed on identifying the Glissonean pedicle of segment 7(G7)and the intersegmental plane.Given the scarcity of comprehensive reviews or comparative studies on clinical outcomes,we also sought to analyze the experiences and advantages associated with different approaches in relation to the anatomic variations of G7.Methods:The clinical data of 124 patients who underwent LAH-S7 for HCC across seven tertiary referral medical centers in China were retrospectively analyzed.Three surgical approaches were categorized based on the procedures used for G7 identification:the indocyanine green(ICG)fluorescence positive staining approach(IFPA),the Glissonean approach(GA),and the hepatic vein-guided approach(HVGA).Subsequently,the postoperative short-term results and oncological outcomes of the three different approaches were compared.Results:The distribution of surgical approaches among the patients was as follows:IFPA in 16(12.9%),GA in 62(50.0%),and HVGA in 46(37.1%)patients.Complications were observed in 27(21.8%)patients.The 1-,3-,and 5-year overall survival(OS)rates were 99.1%,89.2%,and 84.7%,respectively.The 1-,3-,and 5-year recurrence-free survival(RFS)rates were 99.0%,84.7%,and 69.3%,respectively.The OS and RFS rates were comparable across the three approaches.Conclusions:Following a standardized surgical procedure,LAH-S7 is demonstrated to be safe and yields favorable oncological outcomes.Surgeons performing LAH-S7 should select the appropriate surgical approach based on the anatomical characteristics and variations of G7. 展开更多
关键词 Hepatocellular carcinoma Liver neoplasms HEPATECTOMY LAPAROSCOPY Indocyanine green segment 7
暂未订购
Context Patch Fusion with Class Token Enhancement for Weakly Supervised Semantic Segmentation
6
作者 Yiyang Fu Hui Li Wangyu Wu 《Computer Modeling in Engineering & Sciences》 2026年第1期1130-1150,共21页
Weakly Supervised Semantic Segmentation(WSSS),which relies only on image-level labels,has attracted significant attention for its cost-effectiveness and scalability.Existing methods mainly enhance inter-class distinct... Weakly Supervised Semantic Segmentation(WSSS),which relies only on image-level labels,has attracted significant attention for its cost-effectiveness and scalability.Existing methods mainly enhance inter-class distinctions and employ data augmentation to mitigate semantic ambiguity and reduce spurious activations.However,they often neglect the complex contextual dependencies among image patches,resulting in incomplete local representations and limited segmentation accuracy.To address these issues,we propose the Context Patch Fusion with Class Token Enhancement(CPF-CTE)framework,which exploits contextual relations among patches to enrich feature repre-sentations and improve segmentation.At its core,the Contextual-Fusion Bidirectional Long Short-Term Memory(CF-BiLSTM)module captures spatial dependencies between patches and enables bidirectional information flow,yield-ing a more comprehensive understanding of spatial correlations.This strengthens feature learning and segmentation robustness.Moreover,we introduce learnable class tokens that dynamically encode and refine class-specific semantics,enhancing discriminative capability.By effectively integrating spatial and semantic cues,CPF-CTE produces richer and more accurate representations of image content.Extensive experiments on PASCAL VOC 2012 and MS COCO 2014 validate that CPF-CTE consistently surpasses prior WSSS methods. 展开更多
关键词 Weakly supervised semantic segmentation context-fusion class enhancement
在线阅读 下载PDF
Advances in deep learning for bacterial image segmentation in optical microscopy
7
作者 Zhijun Tan Yang Ding +6 位作者 Huibin Ma Jintao Li Danrou Zheng Hua Bai Weini Xin Lin Li Bo Peng 《Journal of Innovative Optical Health Sciences》 2026年第1期30-44,共15页
Microscopy imaging is fundamental in analyzing bacterial morphology and dynamics,offering critical insights into bacterial physiology and pathogenicity.Image segmentation techniques enable quantitative analysis of bac... Microscopy imaging is fundamental in analyzing bacterial morphology and dynamics,offering critical insights into bacterial physiology and pathogenicity.Image segmentation techniques enable quantitative analysis of bacterial structures,facilitating precise measurement of morphological variations and population behaviors at single-cell resolution.This paper reviews advancements in bacterial image segmentation,emphasizing the shift from traditional thresholding and watershed methods to deep learning-driven approaches.Convolutional neural networks(CNNs),U-Net architectures,and three-dimensional(3D)frameworks excel at segmenting dense biofilms and resolving antibiotic-induced morphological changes.These methods combine automated feature extraction with physics-informed postprocessing.Despite progress,challenges persist in computational efficiency,cross-species generalizability,and integration with multimodal experimental workflows.Future progress will depend on improving model robustness across species and imaging modalities,integrating multimodal data for phenotype-function mapping,and developing standard pipelines that link computational tools with clinical diagnostics.These innovations will expand microbial phenotyping beyond structural analysis,enabling deeper insights into bacterial physiology and ecological interactions. 展开更多
关键词 Bacterial image deep learning optical microscopy image segmentation artificial intelligence
原文传递
Intelligent Semantic Segmentation with Vision Transformers for Aerial Vehicle Monitoring
8
作者 Moneerah Alotaibi 《Computers, Materials & Continua》 2026年第1期1629-1648,共20页
Advanced traffic monitoring systems encounter substantial challenges in vehicle detection and classification due to the limitations of conventional methods,which often demand extensive computational resources and stru... Advanced traffic monitoring systems encounter substantial challenges in vehicle detection and classification due to the limitations of conventional methods,which often demand extensive computational resources and struggle with diverse data acquisition techniques.This research presents a novel approach for vehicle classification and recognition in aerial image sequences,integrating multiple advanced techniques to enhance detection accuracy.The proposed model begins with preprocessing using Multiscale Retinex(MSR)to enhance image quality,followed by Expectation-Maximization(EM)Segmentation for precise foreground object identification.Vehicle detection is performed using the state-of-the-art YOLOv10 framework,while feature extraction incorporates Maximally Stable Extremal Regions(MSER),Dense Scale-Invariant Feature Transform(Dense SIFT),and Zernike Moments Features to capture distinct object characteristics.Feature optimization is further refined through a Hybrid Swarm-based Optimization algorithm,ensuring optimal feature selection for improved classification performance.The final classification is conducted using a Vision Transformer,leveraging its robust learning capabilities for enhanced accuracy.Experimental evaluations on benchmark datasets,including UAVDT and the Unmanned Aerial Vehicle Intruder Dataset(UAVID),demonstrate the superiority of the proposed approach,achieving an accuracy of 94.40%on UAVDT and 93.57%on UAVID.The results highlight the efficacy of the model in significantly enhancing vehicle detection and classification in aerial imagery,outperforming existing methodologies and offering a statistically validated improvement for intelligent traffic monitoring systems compared to existing approaches. 展开更多
关键词 Machine learning semantic segmentation remote sensors deep learning object monitoring system
在线阅读 下载PDF
SwinHCAD: A Robust Multi-Modality Segmentation Model for Brain Tumors Using Transformer and Channel-Wise Attention
9
作者 Seyong Jin Muhammad Fayaz +2 位作者 L.Minh Dang Hyoung-Kyu Song Hyeonjoon Moon 《Computers, Materials & Continua》 2026年第1期511-533,共23页
Brain tumors require precise segmentation for diagnosis and treatment plans due to their complex morphology and heterogeneous characteristics.While MRI-based automatic brain tumor segmentation technology reduces the b... Brain tumors require precise segmentation for diagnosis and treatment plans due to their complex morphology and heterogeneous characteristics.While MRI-based automatic brain tumor segmentation technology reduces the burden on medical staff and provides quantitative information,existing methodologies and recent models still struggle to accurately capture and classify the fine boundaries and diverse morphologies of tumors.In order to address these challenges and maximize the performance of brain tumor segmentation,this research introduces a novel SwinUNETR-based model by integrating a new decoder block,the Hierarchical Channel-wise Attention Decoder(HCAD),into a powerful SwinUNETR encoder.The HCAD decoder block utilizes hierarchical features and channelspecific attention mechanisms to further fuse information at different scales transmitted from the encoder and preserve spatial details throughout the reconstruction phase.Rigorous evaluations on the recent BraTS GLI datasets demonstrate that the proposed SwinHCAD model achieved superior and improved segmentation accuracy on both the Dice score and HD95 metrics across all tumor subregions(WT,TC,and ET)compared to baseline models.In particular,the rationale and contribution of the model design were clarified through ablation studies to verify the effectiveness of the proposed HCAD decoder block.The results of this study are expected to greatly contribute to enhancing the efficiency of clinical diagnosis and treatment planning by increasing the precision of automated brain tumor segmentation. 展开更多
关键词 Attention mechanism brain tumor segmentation channel-wise attention decoder deep learning medical imaging MRI TRANSFORMER U-Net
在线阅读 下载PDF
GLMCNet: A Global-Local Multiscale Context Network for High-Resolution Remote Sensing Image Semantic Segmentation
10
作者 Yanting Zhang Qiyue Liu +4 位作者 Chuanzhao Tian Xuewen Li Na Yang Feng Zhang Hongyue Zhang 《Computers, Materials & Continua》 2026年第1期2086-2110,共25页
High-resolution remote sensing images(HRSIs)are now an essential data source for gathering surface information due to advancements in remote sensing data capture technologies.However,their significant scale changes an... High-resolution remote sensing images(HRSIs)are now an essential data source for gathering surface information due to advancements in remote sensing data capture technologies.However,their significant scale changes and wealth of spatial details pose challenges for semantic segmentation.While convolutional neural networks(CNNs)excel at capturing local features,they are limited in modeling long-range dependencies.Conversely,transformers utilize multihead self-attention to integrate global context effectively,but this approach often incurs a high computational cost.This paper proposes a global-local multiscale context network(GLMCNet)to extract both global and local multiscale contextual information from HRSIs.A detail-enhanced filtering module(DEFM)is proposed at the end of the encoder to refine the encoder outputs further,thereby enhancing the key details extracted by the encoder and effectively suppressing redundant information.In addition,a global-local multiscale transformer block(GLMTB)is proposed in the decoding stage to enable the modeling of rich multiscale global and local information.We also design a stair fusion mechanism to transmit deep semantic information from deep to shallow layers progressively.Finally,we propose the semantic awareness enhancement module(SAEM),which further enhances the representation of multiscale semantic features through spatial attention and covariance channel attention.Extensive ablation analyses and comparative experiments were conducted to evaluate the performance of the proposed method.Specifically,our method achieved a mean Intersection over Union(mIoU)of 86.89%on the ISPRS Potsdam dataset and 84.34%on the ISPRS Vaihingen dataset,outperforming existing models such as ABCNet and BANet. 展开更多
关键词 Multiscale context attention mechanism remote sensing images semantic segmentation
在线阅读 下载PDF
Deep Learning for Brain Tumor Segmentation and Classification: A Systematic Review of Methods and Trends
11
作者 Ameer Hamza Robertas Damaševicius 《Computers, Materials & Continua》 2026年第1期132-172,共41页
This systematic review aims to comprehensively examine and compare deep learning methods for brain tumor segmentation and classification using MRI and other imaging modalities,focusing on recent trends from 2022 to 20... This systematic review aims to comprehensively examine and compare deep learning methods for brain tumor segmentation and classification using MRI and other imaging modalities,focusing on recent trends from 2022 to 2025.The primary objective is to evaluate methodological advancements,model performance,dataset usage,and existing challenges in developing clinically robust AI systems.We included peer-reviewed journal articles and highimpact conference papers published between 2022 and 2025,written in English,that proposed or evaluated deep learning methods for brain tumor segmentation and/or classification.Excluded were non-open-access publications,books,and non-English articles.A structured search was conducted across Scopus,Google Scholar,Wiley,and Taylor&Francis,with the last search performed in August 2025.Risk of bias was not formally quantified but considered during full-text screening based on dataset diversity,validation methods,and availability of performance metrics.We used narrative synthesis and tabular benchmarking to compare performance metrics(e.g.,accuracy,Dice score)across model types(CNN,Transformer,Hybrid),imaging modalities,and datasets.A total of 49 studies were included(43 journal articles and 6 conference papers).These studies spanned over 9 public datasets(e.g.,BraTS,Figshare,REMBRANDT,MOLAB)and utilized a range of imaging modalities,predominantly MRI.Hybrid models,especially ResViT and UNetFormer,consistently achieved high performance,with classification accuracy exceeding 98%and segmentation Dice scores above 0.90 across multiple studies.Transformers and hybrid architectures showed increasing adoption post2023.Many studies lacked external validation and were evaluated only on a few benchmark datasets,raising concerns about generalizability and dataset bias.Few studies addressed clinical interpretability or uncertainty quantification.Despite promising results,particularly for hybrid deep learning models,widespread clinical adoption remains limited due to lack of validation,interpretability concerns,and real-world deployment barriers. 展开更多
关键词 Brain tumor segmentation brain tumor classification deep learning vision transformers hybrid models
在线阅读 下载PDF
Influence of Soft Segment Length on the Dielectric Polarization Behavior of Ketal-containing Polyurethane Elastomer
12
作者 Xue Mei Wan Lu +6 位作者 Jian-Rong Dong Ke-Heng Pan Jun-Jie Tan Hong-Ye Yan Jun Wu Yu Zhou Hong-Xiang Chen 《Chinese Journal of Polymer Science》 2026年第1期189-197,I0015,共10页
Polyurethane elastomers exhibit high dielectric constants owing to their polar groups,and can be used as energy storage capacitors.Energy storage depends not only on the dielectric constant but also on the dielectric ... Polyurethane elastomers exhibit high dielectric constants owing to their polar groups,and can be used as energy storage capacitors.Energy storage depends not only on the dielectric constant but also on the dielectric loss.However,the relationship between chain structure and dielectric properties is not yet clear.Ketal-containing crosslinked polyurethane elastomers were prepared using cyclic ketal diol as a chain extender.The effect of the soft segment length on the dielectric properties and energy storage was investigated.The cause of the change in the dipolar polarization with the soft segment length was analyzed.As the soft segment length increased,the hard-soft hydrogen bonding decreased,whereas the hard-hard hydrogen bonding increased.Under the action of an electric field,the polar bonds in the ketal-containing polyurethane elastomer overcome the hydrogen bonding between hard-soft segments to produce polarization;meanwhile,they also experience crankshaft motions to generate polarization.The former has a relatively high relaxation activation energy of approximately 10-20 k J·mol^(-1),resulting in a large dielectric loss.The latter has a relatively low relaxation activation energy,approximately 0.7-1.7 kJ·mol^(-1),leading to low dielectric loss.As a result,the dielectric constant showed a decreasing trend,and the dielectric loss gradually decreased.This study provides a theoretical foundation for improving the dielectric properties of polyurethane elastomers. 展开更多
关键词 Dielectric polarization Polyurethane elastomer Soft segment length Intermolecular interaction
原文传递
An enhanced segmentation method for 3D point cloud of tunnel support system using PointNet++t and coverage-voted strategy algorithms
13
作者 Wenju Liu Fuqiang Gao +4 位作者 Shuangyong Dong Xiaoqing Wang Shuwen Cao Wanjie Wang Xiaomin Liu 《Journal of Rock Mechanics and Geotechnical Engineering》 2026年第2期1653-1660,共8页
3D laser scanning technology is widely used in underground openings for high-precision,rapid,and nondestructive structural evaluations.Segmenting large 3D point cloud datasets,particularly in coal mine roadways with m... 3D laser scanning technology is widely used in underground openings for high-precision,rapid,and nondestructive structural evaluations.Segmenting large 3D point cloud datasets,particularly in coal mine roadways with multi-scale targets,remains challenging.This paper proposes an enhanced segmentation method integrating improved PointNet++with a coverage-voted strategy.The coverage-voted strategy reduces data while preserving multi-scale target topology.The segmentation is achieved using an enhanced PointNet++algorithm with a normalization preprocessing head,resulting in a 94%accuracy for common supporting components.Ablation experiments show that the preprocessing head and coverage strategies increase segmentation accuracy by 20%and 2%,respectively,and improve Intersection over Union(IoU)for bearing plate segmentation by 58%and 20%.The accuracy of the current pretraining segmentation model may be affected by variations in surface support components,but it can be readily enhanced through re-optimization with additional labeled point cloud data.This proposed method,combined with a previously developed machine learning model that links rock bolt load and the deformation field of its bearing plate,provides a robust technique for simultaneously measuring the load of multiple rock bolts in a single laser scan. 展开更多
关键词 Point cloud segmentation Improved PointNet++ Tunnel laser scanning Rock bolt automatic recognition
在线阅读 下载PDF
Deep Learning-Based Toolkit Inspection:Object Detection and Segmentation in Assembly Lines
14
作者 Arvind Mukundan Riya Karmakar +1 位作者 Devansh Gupta Hsiang-Chen Wang 《Computers, Materials & Continua》 2026年第1期1255-1277,共23页
Modern manufacturing processes have become more reliant on automation because of the accelerated transition from Industry 3.0 to Industry 4.0.Manual inspection of products on assembly lines remains inefficient,prone t... Modern manufacturing processes have become more reliant on automation because of the accelerated transition from Industry 3.0 to Industry 4.0.Manual inspection of products on assembly lines remains inefficient,prone to errors and lacks consistency,emphasizing the need for a reliable and automated inspection system.Leveraging both object detection and image segmentation approaches,this research proposes a vision-based solution for the detection of various kinds of tools in the toolkit using deep learning(DL)models.Two Intel RealSense D455f depth cameras were arranged in a top down configuration to capture both RGB and depth images of the toolkits.After applying multiple constraints and enhancing them through preprocessing and augmentation,a dataset consisting of 3300 annotated RGB-D photos was generated.Several DL models were selected through a comprehensive assessment of mean Average Precision(mAP),precision-recall equilibrium,inference latency(target≥30 FPS),and computational burden,resulting in a preference for YOLO and Region-based Convolutional Neural Networks(R-CNN)variants over ViT-based models due to the latter’s increased latency and resource requirements.YOLOV5,YOLOV8,YOLOV11,Faster R-CNN,and Mask R-CNN were trained on the annotated dataset and evaluated using key performance metrics(Recall,Accuracy,F1-score,and Precision).YOLOV11 demonstrated balanced excellence with 93.0%precision,89.9%recall,and a 90.6%F1-score in object detection,as well as 96.9%precision,95.3%recall,and a 96.5%F1-score in instance segmentation with an average inference time of 25 ms per frame(≈40 FPS),demonstrating real-time performance.Leveraging these results,a YOLOV11-based windows application was successfully deployed in a real-time assembly line environment,where it accurately processed live video streams to detect and segment tools within toolkits,demonstrating its practical effectiveness in industrial automation.The application is capable of precisely measuring socket dimensions by utilising edge detection techniques on YOLOv11 segmentation masks,in addition to detection and segmentation.This makes it possible to do specification-level quality control right on the assembly line,which improves the ability to examine things in real time.The implementation is a big step forward for intelligent manufacturing in the Industry 4.0 paradigm.It provides a scalable,efficient,and accurate way to do automated inspection and dimensional verification activities. 展开更多
关键词 Tool detection image segmentation object detection assembly line automation Industry 4.0 Intel RealSense deep learning toolkit verification RGB-D imaging quality assurance
在线阅读 下载PDF
Probabilistic seismic hazard analysis for the northern segment of the North-South Seismic Belt in China based on improved spatial smoothing and fault source model integration
15
作者 Yaohu Zhang Hua Pan +1 位作者 Meng Zhang Ying Shi 《Earthquake Science》 2026年第1期1-31,共31页
The northern segment of the North-South Seismic Belt is characterized by intense crustal deformation,well-developed active tectonics,and frequent occurrences of strong earthquakes.Therefore,conducting a Probabilistic ... The northern segment of the North-South Seismic Belt is characterized by intense crustal deformation,well-developed active tectonics,and frequent occurrences of strong earthquakes.Therefore,conducting a Probabilistic Seismic Hazard Analysis(PSHA)for this region is of significant importance for supporting seismic fortification in major engineering projects and formulating disaster prevention and mitigation policies.In this study,a composite seismic source model was constructed by integrating data on historical earthquakes,active faults,and paleoseismicity.Furthermore,a logic tree framework was employed to quantify epistemic uncertainties,enabling a systematic seismic hazard assessment of the region.To more accurately characterize the spatial heterogeneity of seismic activity,improvements were made to both the Circular Spatial Smoothing Model(CSSM)with a fixed radius and the Adaptive Spatial Smoothing Model(ASSM),with full consideration given to the spatiotemporal completeness of historical earthquake magnitudes.Regarding the CSSM,for scenarios involving small sample sizes in earthquake catalogs,the cross-validation method proposed in this study demonstrated higher robustness than the maximum likelihood method in determining the optimal correlation distance.Performance evaluation results indicate that while both models effectively characterize seismic activity,the ASSM exhibits superior overall predictive performance compared to the CSSM,owing to its ability to adaptively adjust the smoothing radius according to seismic density.Significant discrepancies were observed in the Peak Ground Acceleration(PGA)results calculated with a 10%probability of exceedance in 50 years across different combinations of seismic source models.The single spatially smoothed point-source model yielded a maximum PGA of approximately 0.52 g,with high-value areas concentrated near historical epicenters,thereby significantly underestimating the hazard associated with major fault zones.When combined with the simple fault-source model,the maximum PGA increased to 0.8 g,with high-value zones exhibiting a zonal distribution along faults;however,the risk remained underestimated for faults with low slip rates that are nevertheless approaching their recurrence cycles.Following the introduction of the time-dependent characteristic fault-source model,local PGA values for faults in the middle-to-late stages of their recurrence cycles increased by a factor of 2 to 7 compared to the single model.These results demonstrate that the characteristic fault-source model reasonably delineates the time-dependence of large earthquake recurrence,thereby providing a more accurate assessment of imminent seismic risks.By comprehensively applying the improved spatially smoothed pointsource model,the simple fault-source model,and the characteristic fault-source model,the following faults within the region were identified as having high seismic hazard:the Huangxianggou,Zhangxian,and Tianshui segments of the Xiqinling northern edge fault;the Maqin-Maqu segment of the Dongkunlun fault;the Longriqu fault;the Maoergai fault;the Elashan fault;the Riyueshan fault;the eastern segment of the Lenglongling fault;the Maxianshan segment of the Maxianshan northern Margin fault;and the Maomaoshan-Jinqianghe segment of the Laohushan-Maomaoshan fault.As these faults are located within seismic gaps or are approaching the recurrence periods of large earthquakes,they should be prioritized for current and future seismic monitoring as well as disaster prevention and mitigation efforts. 展开更多
关键词 northern segment of the North-South Seismic Belt fault-source characteristic earthquake spatial smoothing model
在线阅读 下载PDF
CableSAM:an efficient automatic segmentation method for aircraft cabin cables
16
作者 LING Aihua WANG Junwen +1 位作者 LU Jiaming LIU Ruyu 《Optoelectronics Letters》 2025年第3期183-187,共5页
Cabin cables,as critical components of an aircraft's electrical system,significantly impact the operational efficiency and safety of the aircraft.The existing cable segmentation methods in civil aviation cabins ar... Cabin cables,as critical components of an aircraft's electrical system,significantly impact the operational efficiency and safety of the aircraft.The existing cable segmentation methods in civil aviation cabins are limited,especially in automation,heavily dependent on large amounts of data and resources,lacking the flexibility to adapt to different scenarios.To address these challenges,this paper introduces a novel image segmentation model,CableSAM,specifically designed for automated segmentation of cabin cables.CableSAM improves segmentation efficiency and accuracy using knowledge distillation and employs a context ensemble strategy.It accurately segments cables in various scenarios with minimal input prompts.Comparative experiments on three cable datasets demonstrate that CableSAM surpasses other advanced cable segmentation methods in performance. 展开更多
关键词 image segmentation aircraft cabin automatic segmentation automated segmentation cabin cablesas civil aviation cabins cable segmentation knowledge distillation
原文传递
Improved SE-UNet network-based semantic segmentation and extraction of hidden geological significance in geological maps
17
作者 Kai Ma Jun-jie Liu +5 位作者 Si-qi Lu Ze-hua Huang Miao Tian Jun-yuan Deng Zhong Xie Qin-jun Qiu 《China Geology》 2025年第4期643-660,共18页
Automatic segmentation and recognition of content and element information in color geological map are of great significance for researchers to analyze the distribution of mineral resources and predict disaster informa... Automatic segmentation and recognition of content and element information in color geological map are of great significance for researchers to analyze the distribution of mineral resources and predict disaster information.This article focuses on color planar raster geological map(geological maps include planar geological maps,columnar maps,and profiles).While existing deep learning approaches are often used to segment general images,their performance is limited due to complex elements,diverse regional features,and complicated backgrounds for color geological map in the domain of geoscience.To address the issue,a color geological map segmentation model is proposed that combines the Felz clustering algorithm and an improved SE-UNet deep learning network(named GeoMSeg).Firstly,a symmetrical encoder-decoder structure backbone network based on UNet is constructed,and the channel attention mechanism SENet has been incorporated to augment the network’s capacity for feature representation,enabling the model to purposefully extract map information.The SE-UNet network is employed for feature extraction from the geological map and obtain coarse segmentation results.Secondly,the Felz clustering algorithm is used for super pixel pre-segmentation of geological maps.The coarse segmentation results are refined and modified based on the super pixel pre-segmentation results to obtain the final segmentation results.This study applies GeoMSeg to the constructed dataset,and the experimental results show that the algorithm proposed in this paper has superior performance compared to other mainstream map segmentation models,with an accuracy of 91.89%and a MIoU of 71.91%. 展开更多
关键词 Geological map UNet model Image segmentation Semantic segmentation Pixel pre-segmentation Clustering algorithm Attention mechanism Deep learning Artificial intelligence Geological survey engineering
在线阅读 下载PDF
MG-SLAM: RGB-D SLAM Based on Semantic Segmentation for Dynamic Environment in the Internet of Vehicles 被引量:1
18
作者 Fengju Zhang Kai Zhu 《Computers, Materials & Continua》 2025年第2期2353-2372,共20页
The Internet of Vehicles (IoV) has become an important direction in the field of intelligent transportation, in which vehicle positioning is a crucial part. SLAM (Simultaneous Localization and Mapping) technology play... The Internet of Vehicles (IoV) has become an important direction in the field of intelligent transportation, in which vehicle positioning is a crucial part. SLAM (Simultaneous Localization and Mapping) technology plays a crucial role in vehicle localization and navigation. Traditional Simultaneous Localization and Mapping (SLAM) systems are designed for use in static environments, and they can result in poor performance in terms of accuracy and robustness when used in dynamic environments where objects are in constant movement. To address this issue, a new real-time visual SLAM system called MG-SLAM has been developed. Based on ORB-SLAM2, MG-SLAM incorporates a dynamic target detection process that enables the detection of both known and unknown moving objects. In this process, a separate semantic segmentation thread is required to segment dynamic target instances, and the Mask R-CNN algorithm is applied on the Graphics Processing Unit (GPU) to accelerate segmentation. To reduce computational cost, only key frames are segmented to identify known dynamic objects. Additionally, a multi-view geometry method is adopted to detect unknown moving objects. The results demonstrate that MG-SLAM achieves higher precision, with an improvement from 0.2730 m to 0.0135 m in precision. Moreover, the processing time required by MG-SLAM is significantly reduced compared to other dynamic scene SLAM algorithms, which illustrates its efficacy in locating objects in dynamic scenes. 展开更多
关键词 Visual SLAM dynamic scene semantic segmentation GPU acceleration key segmentation frame
在线阅读 下载PDF
Pre-trained SAM as data augmentation for image segmentation 被引量:1
19
作者 Junjun Wu Yunbo Rao +1 位作者 Shaoning Zeng Bob Zhang 《CAAI Transactions on Intelligence Technology》 2025年第1期268-282,共15页
Data augmentation plays an important role in training deep neural model by expanding the size and diversity of the dataset.Initially,data augmentation mainly involved some simple transformations of images.Later,in ord... Data augmentation plays an important role in training deep neural model by expanding the size and diversity of the dataset.Initially,data augmentation mainly involved some simple transformations of images.Later,in order to increase the diversity and complexity of data,more advanced methods appeared and evolved to sophisticated generative models.However,these methods required a mass of computation of training or searching.In this paper,a novel training-free method that utilises the Pre-Trained Segment Anything Model(SAM)model as a data augmentation tool(PTSAM-DA)is proposed to generate the augmented annotations for images.Without the need for training,it obtains prompt boxes from the original annotations and then feeds the boxes to the pre-trained SAM to generate diverse and improved annotations.In this way,annotations are augmented more ingenious than simple manipulations without incurring huge computation for training a data augmentation model.Multiple comparative experiments on three datasets are conducted,including an in-house dataset,ADE20K and COCO2017.On this in-house dataset,namely Agricultural Plot Segmentation Dataset,maximum improvements of 3.77%and 8.92%are gained in two mainstream metrics,mIoU and mAcc,respectively.Consequently,large vision models like SAM are proven to be promising not only in image segmentation but also in data augmentation. 展开更多
关键词 data augmentation image segmentation large model segment anything model
在线阅读 下载PDF
Multi-Robot Collaborative Complex Indoor Scene Segmentation via Multiplex Interactive Learning
20
作者 Jinfu Liu Zhongzien Jiang +3 位作者 Xinhua Xu Wenhao Li Mengyuan Liu Hong Liu 《CAAI Transactions on Intelligence Technology》 2025年第6期1646-1660,共15页
Indoor scene semantic segmentation is essential for enabling robots to understand and interact with their environments effectively.However,numerous challenges remain unresolved,particularly in single-robot systems,whi... Indoor scene semantic segmentation is essential for enabling robots to understand and interact with their environments effectively.However,numerous challenges remain unresolved,particularly in single-robot systems,which often struggle with the complexity and variability of indoor scenes.To address these limitations,we introduce a novel multi-robot collaborative framework based on multiplex interactive learning(MPIL)in which each robot specialises in a distinct visual task within a unified multitask architecture.During training,the framework employs task-specific decoders and cross-task feature sharing to enhance collaborative optimisation.At inference time,robots operate independently with optimised models,enabling scalable,asynchronous and efficient deployment in real-world scenarios.Specifically,MPIL employs specially designed modules that integrate RGB and depth data,refine feature representations and facilitate the simultaneous execution of multiple tasks,such as instance segmentation,scene classification and semantic segmentation.By leveraging these modules,distinct agents within multi-robot systems can effectively handle specialised tasks,thereby enhancing the overall system's flexibility and adaptability.This collaborative effort maximises the strengths of each robot,resulting in a more comprehensive understanding of environments.Extensive experiments on two public benchmark datasets demonstrate MPIL's competitive performance compared to state-of-the-art approaches,highlighting the effectiveness and robustness of our multi-robot system in complex indoor environments. 展开更多
关键词 cross-task interactive learning(artificial intelligence) MULTI-MODAL multiplex interactive learning multitask object segmentation semantic segmentation
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部