期刊文献+
共找到506篇文章
< 1 2 26 >
每页显示 20 50 100
Semantic segmentation method of road scene based on Deeplabv3+ and attention mechanism 被引量:7
1
作者 BAI Yanqiong ZHENG Yufu TIAN Hong 《Journal of Measurement Science and Instrumentation》 CAS CSCD 2021年第4期412-422,共11页
In the study of automatic driving,understanding the road scene is a key to improve driving safety.The semantic segmentation method could divide the image into different areas associated with semantic categories in acc... In the study of automatic driving,understanding the road scene is a key to improve driving safety.The semantic segmentation method could divide the image into different areas associated with semantic categories in accordance with the pixel level,so as to help vehicles to perceive and obtain the surrounding road environment information,which would improve driving safety.Deeplabv3+is the current popular semantic segmentation model.There are phenomena that small targets are missed and similar objects are easily misjudged during its semantic segmentation tasks,which leads to rough segmentation boundary and reduces semantic accuracy.This study focuses on the issue,based on the Deeplabv3+network structure and combined with the attention mechanism,to increase the weight of the segmentation area,and then proposes an improved Deeplabv3+fusion attention mechanism for road scene semantic segmentation method.First,a group of parallel position attention module and channel attention module are introduced on the Deeplabv3+encoding end to capture more spatial context information and high-level semantic information.Then,an attention mechanism is introduced to restore the spatial detail information,and the data shall be normalized in order to accelerate the convergence speed of the model at the decoding end.The effects of model segmentation with different attention-introducing mechanisms are compared and tested on CamVid and Cityscapes datasets.The experimental results show that the mean Intersection over Unons of the improved model segmentation accuracies on the two datasets are boosted by 6.88%and 2.58%,respectively,which is better than using Deeplabv3+.This method does not significantly increase the amount of network calculation and complexity,and has a good balance of speed and accuracy. 展开更多
关键词 autonomous driving road scene semantic segmentation Deeplabv3+ attention mechanism
在线阅读 下载PDF
Semantic Segmentation-Based Road Marking Detection Using Around View Monitoring System 被引量:1
2
作者 XU Hanqing YANG Ming +4 位作者 DENG Liuyuan LI Hao WANG Chunxiang HAN Weibin YU Yuelong 《Journal of Shanghai Jiaotong university(Science)》 EI 2022年第6期833-843,共11页
Road marking detection is an important branch in autonomous driving,understanding the road information.In recent years,deep-learning-based semantic segmentation methods for road marking detection have been arising sin... Road marking detection is an important branch in autonomous driving,understanding the road information.In recent years,deep-learning-based semantic segmentation methods for road marking detection have been arising since they can generalize detection result well under complicated environments and hold rich pixel-level semantic information.Nevertheless,the previous methods mostly study the training process of the segmentation network,while omitting the time cost of manually annotating pixel-level data.Besides,the pixel-level semantic segmentation results need to be fitted into more reliable and compact models so that geometrical information of road markings can be explicitly obtained.In order to tackle the above problems,this paper describes a semantic segmentation-based road marking detection method using around view monitoring system.A semiautomatic semantic annotation platform is developed,which exploits an auxiliary segmentation graph to speed up the annotation process while guaranteeing the annotation accuracy.A segmentation-based detection module is also described,which models the semantic segmentation results for the more robust and compact analysis.The proposed detection module is composed of three parts:vote-based segmentation fusion filtering,graph-based road marking clustering,and road-marking fitting.Experiments under various scenarios show that the semantic segmentation-based detection method can achieve accurate,robust,and real-time detection performance. 展开更多
关键词 autonomous driving semantic segmentation road marking detection
原文传递
A Lightweight Road Scene Semantic Segmentation Algorithm 被引量:1
3
作者 Jiansheng Peng Qing Yang Yaru Hou 《Computers, Materials & Continua》 SCIE EI 2023年第11期1929-1948,共20页
In recent years,with the continuous deepening of smart city construction,there have been significant changes and improvements in the field of intelligent transportation.The semantic segmentation of road scenes has imp... In recent years,with the continuous deepening of smart city construction,there have been significant changes and improvements in the field of intelligent transportation.The semantic segmentation of road scenes has important practical significance in the fields of automatic driving,transportation planning,and intelligent transportation systems.However,the current mainstream lightweight semantic segmentation models in road scene segmentation face problems such as poor segmentation performance of small targets and insufficient refinement of segmentation edges.Therefore,this article proposes a lightweight semantic segmentation model based on the LiteSeg model improvement to address these issues.The model uses the lightweight backbone network MobileNet instead of the LiteSeg backbone network to reduce the network parameters and computation,and combines the Coordinate Attention(CA)mechanism to help the network capture long-distance dependencies.At the same time,by combining the dependencies of spatial information and channel information,the Spatial and Channel Network(SCNet)attention mechanism is proposed to improve the feature extraction ability of the model.Finally,a multiscale transposed attention encoding(MTAE)module was proposed to obtain features of different resolutions and perform feature fusion.In this paper,the proposed model is verified on the Cityscapes dataset.The experimental results show that the addition of SCNet and MTAE modules increases the mean Intersection over Union(mIoU)of the original LiteSeg model by 4.69%.On this basis,the backbone network is replaced with MobileNet,and the CA model is added at the same time.At the cost of increasing the minimum model parameters and computing costs,the mIoU of the original LiteSeg model is increased by 2.46%.This article also compares the proposed model with some current lightweight semantic segmentation models,and experiments show that the comprehensive performance of the proposed model is the best,especially in achieving excellent results in small object segmentation.Finally,this article will conduct generalization testing on the KITTI dataset for the proposed model,and the experimental results show that the proposed algorithm has a certain degree of generalization. 展开更多
关键词 semantic segmentation LIGHTWEIGHT road scenes multi-scale transposition attention encoding(MTAE)
在线阅读 下载PDF
Automatic Road Tunnel Crack Inspection Based on Crack Area Sensing and Multiscale Semantic Segmentation 被引量:1
4
作者 Dingping Chen Zhiheng Zhu +1 位作者 Jinyang Fu Jilin He 《Computers, Materials & Continua》 SCIE EI 2024年第4期1679-1703,共25页
The detection of crack defects on the walls of road tunnels is a crucial step in the process of ensuring travel safetyand performing routine tunnel maintenance. The automatic and accurate detection of cracks on the su... The detection of crack defects on the walls of road tunnels is a crucial step in the process of ensuring travel safetyand performing routine tunnel maintenance. The automatic and accurate detection of cracks on the surface of roadtunnels is the key to improving the maintenance efficiency of road tunnels. Machine vision technology combinedwith a deep neural network model is an effective means to realize the localization and identification of crackdefects on the surface of road tunnels.We propose a complete set of automatic inspection methods for identifyingcracks on the walls of road tunnels as a solution to the problem of difficulty in identifying cracks during manualmaintenance. First, a set of equipment applied to the real-time acquisition of high-definition images of walls inroad tunnels is designed. Images of walls in road tunnels are acquired based on the designed equipment, whereimages containing crack defects are manually identified and selected. Subsequently, the training and validationsets used to construct the crack inspection model are obtained based on the acquired images, whereas the regionscontaining cracks and the pixels of the cracks are finely labeled. After that, a crack area sensing module is designedbased on the proposed you only look once version 7 model combined with coordinate attention mechanism (CAYOLOV7) network to locate the crack regions in the road tunnel surface images. Only subimages containingcracks are acquired and sent to the multiscale semantic segmentation module for extraction of the pixels to whichthe cracks belong based on the DeepLab V3+ network. The precision and recall of the crack region localizationon the surface of a road tunnel based on our proposed method are 82.4% and 93.8%, respectively. Moreover, themean intersection over union (MIoU) and pixel accuracy (PA) values for achieving pixel-level detection accuracyare 76.84% and 78.29%, respectively. The experimental results on the dataset show that our proposed two-stagedetection method outperforms other state-of-the-art models in crack region localization and detection. Based onour proposedmethod, the images captured on the surface of a road tunnel can complete crack detection at a speed often frames/second, and the detection accuracy can reach 0.25 mm, which meets the requirements for maintenanceof an actual project. The designed CA-YOLO V7 network enables precise localization of the area to which a crackbelongs in images acquired under different environmental and lighting conditions in road tunnels. The improvedDeepLab V3+ network based on lightweighting is able to extract crack morphology in a given region more quicklywhile maintaining segmentation accuracy. The established model combines defect localization and segmentationmodels for the first time, realizing pixel-level defect localization and extraction on the surface of road tunnelsin complex environments, and is capable of determining the actual size of cracks based on the physical coordinatesystemafter camera calibration. The trainedmodelhas highaccuracy andcanbe extendedandapplied to embeddedcomputing devices for the assessment and repair of damaged areas in different types of road tunnels. 展开更多
关键词 road tunnel crack inspection crack area sensing multiscale semantic segmentation CA-YOLO V7 DeepLab V3+
在线阅读 下载PDF
A 3D semantic segmentation network for accurate neuronal soma segmentation
5
作者 Li Ma Qi Zhong +2 位作者 Yezi Wang Xiaoquan Yang Qian Du 《Journal of Innovative Optical Health Sciences》 2025年第1期67-83,共17页
Neuronal soma segmentation plays a crucial role in neuroscience applications.However,the fine structure,such as boundaries,small-volume neuronal somata and fibers,are commonly present in cell images,which pose a chall... Neuronal soma segmentation plays a crucial role in neuroscience applications.However,the fine structure,such as boundaries,small-volume neuronal somata and fibers,are commonly present in cell images,which pose a challenge for accurate segmentation.In this paper,we propose a 3D semantic segmentation network for neuronal soma segmentation to address this issue.Using an encoding-decoding structure,we introduce a Multi-Scale feature extraction and Adaptive Weighting fusion module(MSAW)after each encoding block.The MSAW module can not only emphasize the fine structures via an upsampling strategy,but also provide pixel-wise weights to measure the importance of the multi-scale features.Additionally,a dynamic convolution instead of normal convolution is employed to better adapt the network to input data with different distributions.The proposed MSAW-based semantic segmentation network(MSAW-Net)was evaluated on three neuronal soma images from mouse brain and one neuronal soma image from macaque brain,demonstrating the efficiency of the proposed method.It achieved an F1 score of 91.8%on Fezf2-2A-CreER dataset,97.1%on LSL-H2B-GFP dataset,82.8%on Thy1-EGFP-Mline dataset,and 86.9%on macaque dataset,achieving improvements over the 3D U-Net model by 3.1%,3.3%,3.9%,and 2.3%,respectively. 展开更多
关键词 Neuronal soma segmentation semantic segmentation network multi-scale feature extraction adaptive weighting fusion
原文传递
BSDNet:Semantic Information Distillation-Based for Bilateral-Branch Real-Time Semantic Segmentation on Street Scene Image
6
作者 Huan Zeng Jianxun Zhang +1 位作者 Hongji Chen Xinwei Zhu 《Computers, Materials & Continua》 2025年第11期3879-3896,共18页
Semantic segmentation in street scenes is a crucial technology for autonomous driving to analyze the surrounding environment.In street scenes,issues such as high image resolution caused by a large viewpoints and diffe... Semantic segmentation in street scenes is a crucial technology for autonomous driving to analyze the surrounding environment.In street scenes,issues such as high image resolution caused by a large viewpoints and differences in object scales lead to a decline in real-time performance and difficulties in multi-scale feature extraction.To address this,we propose a bilateral-branch real-time semantic segmentationmethod based on semantic information distillation(BSDNet)for street scene images.The BSDNet consists of a Feature Conversion Convolutional Block(FCB),a Semantic Information Distillation Module(SIDM),and a Deep Aggregation Atrous Convolution Pyramid Pooling(DASP).FCB reduces the semantic gap between the backbone and the semantic branch.SIDM extracts high-quality semantic information fromthe Transformer branch to reduce computational costs.DASP aggregates information lost in atrous convolutions,effectively capturingmulti-scale objects.Extensive experiments conducted on Cityscapes,CamVid,and ADE20K,achieving an accuracy of 81.7% Mean Intersection over Union(mIoU)at 70.6 Frames Per Second(FPS)on Cityscapes,demonstrate that our method achieves a better balance between accuracy and inference speed. 展开更多
关键词 Street scene understanding real-time semantic segmentation knowledge distillation multi-scale feature extraction
在线阅读 下载PDF
A Road Extraction Method for Remote Sensing Image Based on Encoder-Decoder Network 被引量:30
7
作者 Hao HE Shuyang WANG +2 位作者 Shicheng WANG Dongfang YANG Xing LIU 《Journal of Geodesy and Geoinformation Science》 2020年第2期16-25,共10页
According to the characteristics of the road features,an Encoder-Decoder deep semantic segmentation network is designed for the road extraction of remote sensing images.Firstly,as the features of the road target are r... According to the characteristics of the road features,an Encoder-Decoder deep semantic segmentation network is designed for the road extraction of remote sensing images.Firstly,as the features of the road target are rich in local details and simple in semantic features,an Encoder-Decoder network with shallow layers and high resolution is designed to improve the ability to represent detail information.Secondly,as the road area is a small proportion in remote sensing images,the cross-entropy loss function is improved,which solves the imbalance between positive and negative samples in the training process.Experiments on large road extraction datasets show that the proposed method gets the recall rate 83.9%,precision 82.5%and F1-score 82.9%,which can extract the road targets in remote sensing images completely and accurately.The Encoder-Decoder network designed in this paper performs well in the road extraction task and needs less artificial participation,so it has a good application prospect. 展开更多
关键词 remote sensing road extraction deep learning semantic segmentation Encoder-Decoder network
在线阅读 下载PDF
Weakly Supervised Network with Scribble-Supervised and Edge-Mask for Road Extraction from High-Resolution Remote Sensing Images
8
作者 Supeng Yu Fen Huang Chengcheng Fan 《Computers, Materials & Continua》 SCIE EI 2024年第4期549-562,共14页
Significant advancements have been achieved in road surface extraction based on high-resolution remote sensingimage processing. Most current methods rely on fully supervised learning, which necessitates enormous human... Significant advancements have been achieved in road surface extraction based on high-resolution remote sensingimage processing. Most current methods rely on fully supervised learning, which necessitates enormous humaneffort to label the image. Within this field, other research endeavors utilize weakly supervised methods. Theseapproaches aim to reduce the expenses associated with annotation by leveraging sparsely annotated data, such asscribbles. This paper presents a novel technique called a weakly supervised network using scribble-supervised andedge-mask (WSSE-net). This network is a three-branch network architecture, whereby each branch is equippedwith a distinct decoder module dedicated to road extraction tasks. One of the branches is dedicated to generatingedge masks using edge detection algorithms and optimizing road edge details. The other two branches supervise themodel’s training by employing scribble labels and spreading scribble information throughout the image. To addressthe historical flaw that created pseudo-labels that are not updated with network training, we use mixup to blendprediction results dynamically and continually update new pseudo-labels to steer network training. Our solutiondemonstrates efficient operation by simultaneously considering both edge-mask aid and dynamic pseudo-labelsupport. The studies are conducted on three separate road datasets, which consist primarily of high-resolutionremote-sensing satellite photos and drone images. The experimental findings suggest that our methodologyperforms better than advanced scribble-supervised approaches and specific traditional fully supervised methods. 展开更多
关键词 semantic segmentation road extraction weakly supervised learning scribble supervision remote sensing image
在线阅读 下载PDF
ConvNeXt-UperNet-Based Deep Learning Model for Road Extraction from High-Resolution Remote Sensing Images
9
作者 Jing Wang Chen Zhang Tianwen Lin 《Computers, Materials & Continua》 SCIE EI 2024年第8期1907-1925,共19页
When existing deep learning models are used for road extraction tasks from high-resolution images,they are easily affected by noise factors such as tree and building occlusion and complex backgrounds,resulting in inco... When existing deep learning models are used for road extraction tasks from high-resolution images,they are easily affected by noise factors such as tree and building occlusion and complex backgrounds,resulting in incomplete road extraction and low accuracy.We propose the introduction of spatial and channel attention modules to the convolutional neural network ConvNeXt.Then,ConvNeXt is used as the backbone network,which cooperates with the perceptual analysis network UPerNet,retains the detection head of the semantic segmentation,and builds a new model ConvNeXt-UPerNet to suppress noise interference.Training on the open-source DeepGlobe and CHN6-CUG datasets and introducing the DiceLoss on the basis of CrossEntropyLoss solves the problem of positive and negative sample imbalance.Experimental results show that the new network model can achieve the following performance on the DeepGlobe dataset:79.40%for precision(Pre),97.93% for accuracy(Acc),69.28% for intersection over union(IoU),and 83.56% for mean intersection over union(MIoU).On the CHN6-CUG dataset,the model achieves the respective values of 78.17%for Pre,97.63%for Acc,65.4% for IoU,and 81.46% for MIoU.Compared with other network models,the fused ConvNeXt-UPerNet model can extract road information better when faced with the influence of noise contained in high-resolution remote sensing images.It also achieves multiscale image feature information with unified perception,ultimately improving the generalization ability of deep learning technology in extracting complex roads from high-resolution remote sensing images. 展开更多
关键词 Deep learning semantic segmentation remote sensing imagery road extraction
在线阅读 下载PDF
Unstructured Road Extraction in UAV Images based on Lightweight Model 被引量:1
10
作者 Di Zhang Qichao An +3 位作者 Xiaoxue Feng Ronghua Liu Jun Han Feng Pan 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2024年第2期372-384,共13页
There is no unified planning standard for unstructured roads,and the morphological structures of these roads are complex and varied.It is important to maintain a balance between accuracy and speed for unstructured roa... There is no unified planning standard for unstructured roads,and the morphological structures of these roads are complex and varied.It is important to maintain a balance between accuracy and speed for unstructured road extraction models.Unstructured road extraction algorithms based on deep learning have problems such as high model complexity,high computational cost,and the inability to adapt to current edge computing devices.Therefore,it is best to use lightweight network models.Considering the need for lightweight models and the characteristics of unstructured roads with different pattern shapes,such as blocks and strips,a TMB(Triple Multi-Block)feature extraction module is proposed,and the overall structure of the TMBNet network is described.The TMB module was compared with SS-nbt,Non-bottleneck-1D,and other modules via experiments.The feasibility and effectiveness of the TMB module design were proven through experiments and visualizations.The comparison experiment,using multiple convolution kernel categories,proved that the TMB module can improve the segmentation accuracy of the network.The comparison with different semantic segmentation networks demonstrates that the TMBNet network has advantages in terms of unstructured road extraction. 展开更多
关键词 Unstructured road Lightweight model Triple Multi-Block(TMB) semantic segmentation net
在线阅读 下载PDF
MSC-Deep LabV3+:A Segmentation Model for Slender Fabric Roll Seam Detection
11
作者 Weimin Shi Kuntao Lv +1 位作者 Chang Xuan Ji Wu 《Computers, Materials & Continua》 2026年第5期480-498,共19页
The application of deep learning in fabric defect detection has become increasingly widespread.To address false positives and false negatives in fabric roll seam detection,and to improve automation efficiency and prod... The application of deep learning in fabric defect detection has become increasingly widespread.To address false positives and false negatives in fabric roll seam detection,and to improve automation efficiency and product quality,we propose the Multi-scale Context DeepLabV3+(MSC-DeepLabV3+),a semantic segmentation network designed for fabric roll seam detection,based on DeepLabV3+.The model improvements include enhancing the backbone performance through optimization of the UIB-MobileNetV2 network;designing the Dynamic Atrous and Sliding-window Fusion(DASF)module to improve adaptability to multi-scale seam structures with dynamic dilation rates and a sliding-window mechanism;and utilizing the Progressive Low-level Feature Fusion(PLFF)module to progressively restore seam boundary details via shallow feature fusion.Additionally,an enhanced 3-SE attention mechanism is employed,replacing the direct concatenation operation.Experimental results show thatMSCDeepLabV3+outperforms classical and recent segmentation models.Compared to DeepLabV3+with an Xception backbone,MSC-DeepLabV3+achieves a mean intersection over union(mIoU)of 92.30%and the boundary Fscore(BF)of 92.54%,representing improvements of 3.04%and 3.14%,respectively.Moreover,the model complexity is significantly reduced,with the model parameters(params)decreasing to 3.44M and Frames Per Second(FPS)increasing from 101 to 273,demonstrating its potential for deployment in resource-constrained industrial scenarios. 展开更多
关键词 Fabric roll seam detection semantic segmentation deep learning lightweight network multi-scale feature extraction improved attention mechanism
在线阅读 下载PDF
Navigation algorithm based on semantic segmentation in wheat fields using an RGB-D camera 被引量:2
12
作者 Yan Song Feiyang Xu +2 位作者 Qi Yao Jialin Liu Shuai Yang 《Information Processing in Agriculture》 EI CSCD 2023年第4期475-490,共16页
Determining the navigation line is critical for the automatic navigation of agricultural robots in the farmland.In this research,considering a wheat field as the typical scenario,a novel navigation line extraction alg... Determining the navigation line is critical for the automatic navigation of agricultural robots in the farmland.In this research,considering a wheat field as the typical scenario,a novel navigation line extraction algorithm based on semantic segmentation is proposed.The data containing horizontal parallax,height,and grayscale information(HHG)is constructed by combining re-encoded depth data and red-green-blue(RGB)data.The HHG,RGB,and depth data are used to achieve scene recognition and navigation line extraction for a wheat field.The method includes two main steps.First,the semantic segmentation of the wheat,ground,and background are performed using a fully convolutional network(FCN).Second,the navigation line is fitted in the camera coordinate system on the basis of the semantic segmentation result and the principle of camera pinhole imaging.Our segmentation model is trained using 508 randomly selected images from a data set,and the model is tested on 199 images.When labelled data are used as the reference benchmark,the mean intersection over union(mIoU)of the HHG data is greater than 95%,which is the highest among the three types of data.The semantic segmentation methods based on the RGB and HHG data show higher navigation line extraction accuracy rates(with the absolute value of the angle deviation less than 5)than the compared methods.The mean and standard deviation of the angle deviation of the two methods are within 0.1and 2.0,while the mean and standard deviation of the distance deviation are less than 30 mm and 60 mm,respectively.These values meet the basic requirements of agricultural machinery field navigation.The novelty of this work is the proposal of a navigation line extraction algorithm based on semantic segmentation in wheat fields.This method is high in accuracy and robustness to interference from crop occlusion. 展开更多
关键词 Fully convolutional network Navigation line extraction semantic segmentation Visual navigation
原文传递
An automatic extraction method for geothermal radiation sources based on an LST retrieval algorithm and semantic network
13
作者 Ruixi He Lijuan Jia Jinchuan Zhang 《Natural Gas Industry B》 2023年第5期419-435,共17页
Geothermal resources are efficient,renewable and clean energy sources,and their reservoirs are usually closely associated with high-temperature regions of the land surface.Current exploration methods primarily involve... Geothermal resources are efficient,renewable and clean energy sources,and their reservoirs are usually closely associated with high-temperature regions of the land surface.Current exploration methods primarily involve migrating traditional geological techniques,which fail to fully use the unique features of geothermal radiation characteristics.Thermal infrared remote-sensing imaging technology can capture and present areas with distinctive surface thermal radiation features,providing considerable significance as a guide for localization prior tofield exploration.In this study,we propose a deep learningebased method for intelligently identifying and segmenting geothermal radiation sources from thermal infrared remote-sensing images,including data preparation and model training.To improve the localization drift and anomalous interference caused by the high complexity of the Earth's surface environment,this study uses a surface temperature retrieval algorithm to calculate the land surface temperature in the research area.The retrieval results are used to train the semantic segmentation model.In addition,a pixel-level geothermal spatial segmentation network(PGSSNet)is proposed to suppress the diffuse thermal radiation and reduce the broad and blurred white areas of images to exact locations.Once the training is completed,the model directly segments and extracts the actual range of thermal radiation sources from subsequent thermal infrared remote-sensing images without temperature retrieval and/or manual calibration. 展开更多
关键词 GEOTHERMAL Deep learning Automatic extraction LST retrieval semantic segmentation
在线阅读 下载PDF
CW-HRNet:Constrained Deformable Sampling and Wavelet-Guided Enhancement for Lightweight Crack Segmentation
14
作者 Dewang Ma 《Journal of Electronic Research and Application》 2025年第5期269-280,共12页
This paper presents CW-HRNet,a high-resolution,lightweight crack segmentation network designed to address challenges in complex scenes with slender,deformable,and blurred crack structures.The model incorporates two ke... This paper presents CW-HRNet,a high-resolution,lightweight crack segmentation network designed to address challenges in complex scenes with slender,deformable,and blurred crack structures.The model incorporates two key modules:Constrained Deformable Convolution(CDC),which stabilizes geometric alignment by applying a tanh limiter and learnable scaling factor to the predicted offsets,and the Wavelet Frequency Enhancement Module(WFEM),which decomposes features using Haar wavelets to preserve low-frequency structures while enhancing high-frequency boundaries and textures.Evaluations on the CrackSeg9k benchmark demonstrate CW-HRNet’s superior performance,achieving 82.39%mIoU with only 7.49M parameters and 10.34 GFLOPs,outperforming HrSegNet-B48 by 1.83% in segmentation accuracy with minimal complexity overhead.The model also shows strong cross-dataset generalization,achieving 60.01%mIoU and 66.22%F1 on Asphalt3k without fine-tuning.These results highlight CW-HRNet’s favorable accuracyefficiency trade-off for real-world crack segmentation tasks. 展开更多
关键词 Crack segmentation Lightweight semantic segmentation Deformable convolution Wavelet transform road infrastructure
在线阅读 下载PDF
Vector Extraction from Design Drawings for Intelligent 3D Modeling of Transmission Towers
15
作者 Ziqiang Tang Chao Han +5 位作者 Hongwu Li Zhou Fan Ke Sun Yuntian Huang Yuhang Chen Chenxing Wang 《Computers, Materials & Continua》 2025年第2期2813-2829,共17页
Accurate vector extraction from design drawings is required first to automatically create 3D models from pixel-level engineering design drawings. However, this task faces the challenges of complicated design shapes as... Accurate vector extraction from design drawings is required first to automatically create 3D models from pixel-level engineering design drawings. However, this task faces the challenges of complicated design shapes as well as cumbersome and cluttered annotations on drawings, which interfere with the vector extraction heavily. In this article, the transmission tower containing the most complex structure is taken as the research object, and a semantic segmentation network is constructed to first segment the shape masks from the pixel-level drawings. Preprocessing and postprocessing are also proposed to ensure the stability and accuracy of the shape mask segmentation. Then, based on the obtained shape masks, a vector extraction network guided by heatmaps is designed to extract structural vectors by fusing the features from node heatmap and skeleton heatmap, respectively. Compared with the state-of-the-art methods, experiment results illustrate that the proposed semantic segmentation method can effectively eliminate the interference of many elements on drawings to segment the shape masks effectively, meanwhile, the model trained by the proposed vector extraction network can accurately extract the vectors such as nodes and line connections, avoiding redundant vector detection. The proposed method lays a solid foundation for automatic 3D model reconstruction and contributes to technological advancements in relevant fields. 展开更多
关键词 Design drawings semantic segmentation deep learning vector extraction DIGITIZATION 3D modeling
在线阅读 下载PDF
Extraction of Suspected Illegal Buildings from Land Satellite Images Based on Fully Convolutional Networks
16
作者 Yu PEI Xi SHEN +2 位作者 Xianwu YANG Kaiyu FU Qinfang ZHOU 《Meteorological and Environmental Research》 2025年第1期64-69,75,共7页
In the management of land resources and the protection of cultivated land,the law enforcement of land satellite images is often used as one of the main means.In recent years,the policies and regulations of the law enf... In the management of land resources and the protection of cultivated land,the law enforcement of land satellite images is often used as one of the main means.In recent years,the policies and regulations of the law enforcement of land satellite images have become more and more strict and been adjusted increasingly frequently,playing a decisive role in preventing excessive non-agricultural and non-food urbanization.In the process of the law enforcement,the extraction of suspected illegal buildings is the most important and time-consuming content.Compared with the traditional deep learning model,fully convolutional networks(FCN)has a great advantage in remote sensing image processing because its input images are not limited by size,and both convolution and deconvolution are independent of the overall size of images.In this paper,an intelligent extraction model of suspected illegal buildings from land satellite images based on deep learning FCN was built.Kaiyuan City,Yunnan Province was taken as an example.The verification results show that the global accuracy of this model was 86.6%in the process of building extraction,and mean intersection over union(mIoU)was 73.6%.This study can provide reference for the extraction of suspected illegal buildings in the law enforcement work of land satellite images,and reduce the tedious manual operation to a certain extent. 展开更多
关键词 Deep learning Fully convolutional network semantic segmentation Law enforcement of land satellite images extraction of suspected illegal buildings
在线阅读 下载PDF
Automatic Horizontal Road Design Information Extraction from Georeferenced Polygonals: A Brazilian Federal Highway Network Study
17
作者 Alexandre H. Coelho Nataniel P. Borges Jr. +2 位作者 Nicolas P. Borges Marcos D. Gallo Amir M. Valente 《Journal of Civil Engineering and Architecture》 2015年第12期1513-1522,共10页
Road geometric design data are a vital input for diverse transportation studies. This information is usually obtained from the road design project. However, these are not always available and the as-built course of th... Road geometric design data are a vital input for diverse transportation studies. This information is usually obtained from the road design project. However, these are not always available and the as-built course of the road may diverge considerably from its projected one, rendering subsequent studies inaccurate or impossible. Moreover, the systematic acquisition of this data for the entire road network of a country or even a state represents a very challenging and laborious task. This study's goal was the extraction of geometric design data for the paved segments of the Brazilian federal highway network, containing more than 47,000 km of highways. It presents the details of the method's adoption process, the particularities of its application to the dataset and the obtained geometric design information. Additionally, it provides a first overview of the Brazilian federal highway network composition (curves and tangents) and geometry. 展开更多
关键词 Curve identification information extraction geometric design polygonal segmentation road.
在线阅读 下载PDF
面向农业地块提取的边缘-语义协同双分支解码网络
18
作者 杨梅 刘司南 +2 位作者 潘臻 高磊 闵帆 《计算机工程与科学》 北大核心 2026年第3期444-455,共12页
面向农业资源监测的遥感影像农业地块精准提取是实现耕地资源智能化管理的关键技术。针对现有深度学习方法在复杂农田场景中面临的边界模糊、纹理多样及形态异构导致的分割精度不足问题,提出边缘与语义协同优化的多任务神经网络ESDNet,... 面向农业资源监测的遥感影像农业地块精准提取是实现耕地资源智能化管理的关键技术。针对现有深度学习方法在复杂农田场景中面临的边界模糊、纹理多样及形态异构导致的分割精度不足问题,提出边缘与语义协同优化的多任务神经网络ESDNet,通过3种关键策略实现性能提升。首先,在编码器与主解码器间嵌入坐标注意力(CA)模块,通过坐标敏感的注意力权重增强模糊边界的鉴别能力;其次,设计具有多级感受野的特征增强(FE)模块,采用金字塔空洞卷积与自适应特征融合策略提升网络对异质纹理的解析度;最后,构建边界映射、距离映射与掩膜映射的多任务协同优化框架,通过几何约束与语义引导的联合学习策略,强化对复杂形态地块的空间认知。为验证网络普适性,实验选取中国山东、四川及荷兰地区的高分二号、哨兵二号多源遥感影像构建测试集。结果表明,ESDNet在交并比IoU指标上分别提升0.77个百分点、2.17个百分点和2.28个百分点,优于现有最优网络,其展现出的强泛化能力和高精度分割特性,为智慧农业中的耕地资源动态监测提供了可靠的技术支撑。 展开更多
关键词 农业地块提取 遥感 语义分割 神经网络 多任务学习
在线阅读 下载PDF
基于特征融合及多尺度上下文提取的实时语义分割
19
作者 刘伯红 刘磊 《重庆邮电大学学报(自然科学版)》 北大核心 2026年第1期167-176,共10页
针对实时语义分割算法常用的双分支结构存在空间分支和上下文分支特征融合不充分、多尺度上下文信息提取不全面等问题,提出基于特征融合及多尺度上下文提取的实时语义分割网络。设计空间-多尺度双向注意力融合模块,使用空间注意力机制... 针对实时语义分割算法常用的双分支结构存在空间分支和上下文分支特征融合不充分、多尺度上下文信息提取不全面等问题,提出基于特征融合及多尺度上下文提取的实时语义分割网络。设计空间-多尺度双向注意力融合模块,使用空间注意力机制和多尺度特征融合模块实现双分支交互融合,促进空间特征以及语义特征在双分支上的流动;在上下文分支末端设计了串联聚合金字塔池化模块,更精确地捕捉细节信息;聚合空间分支不同阶段特征,增强模型对图像特征的全面理解能力,促进深层特征与浅层特征的深度融合;使用多尺度注意力特征融合模块引导空间分支和上下文分支融合。实验结果表明,构建的网络在Cityscapes数据集上平均交并比(mean intersection over union,MIoU)达到78.0%,推理速度为104.5 Frame/s;在CamVid数据集上,MIoU达到75.9%,推理速度为224.6 Frame/s。 展开更多
关键词 实时语义分割 特征融合 注意力机制 多尺度上下文提取
在线阅读 下载PDF
SAMSNet:融合分散注意力与多尺度通道注意力的遥感道路提取网络
20
作者 魏德宾 徐永强 +1 位作者 李品儒 解鸿基 《遥感学报》 北大核心 2026年第2期371-384,共14页
从遥感图像中自动提取道路在智慧城市、智慧交通和自动驾驶等领域有着广泛的应用前景。然而,从高分辨率遥感图像中自动提取的道路存在碎片化、连通性差等问题,提取完整的道路仍然具有挑战性。为此,本文提出一种改进的编码器—解码器网络... 从遥感图像中自动提取道路在智慧城市、智慧交通和自动驾驶等领域有着广泛的应用前景。然而,从高分辨率遥感图像中自动提取的道路存在碎片化、连通性差等问题,提取完整的道路仍然具有挑战性。为此,本文提出一种改进的编码器—解码器网络SAMSNet(Split-Attention and Multi-Scale Attention Network)。首先,采用Split-Attention Network(ResNeSt-50)作为编码器,通过跨通道提取图像的语义信息以实现高质量的特征表示;其次,引入级联并行的空洞卷积块,在扩大感受野的同时提高网络对多尺度上下文信息的感知能力;最后,在跳跃连接部分引入多尺度通道注意力模块MS-CAM(Multi-Scale Channel Attention Module),同时关注分布全局的和局部的道路信息,帮助网络识别和检测极端尺度变化下的道路。并在DeepGlobe Road数据集、Massachusetts Road数据集和GRSet数据集上进行实验验证,将本文提出的SAMSNet与其他9种主流模型进行对比。验证结果表明,SAMSNet在3个公开数据集上的IoU和F1-score等多项评价指标均优于其他对比模型,取得了最优的提取结果。 展开更多
关键词 遥感图像 道路提取 语义分割 ResNeSt-50 分散注意力 多尺度通道注意力 空洞卷积
原文传递
上一页 1 2 26 下一页 到第
使用帮助 返回顶部