Currently,deep convolutional neural networks have made great progress in the field of semantic segmentation.Because of the fixed convolution kernel geometry,standard convolution neural networks have been limited the a...Currently,deep convolutional neural networks have made great progress in the field of semantic segmentation.Because of the fixed convolution kernel geometry,standard convolution neural networks have been limited the ability to simulate geometric transformations.Therefore,a deformable convolution is introduced to enhance the adaptability of convolutional networks to spatial transformation.Considering that the deep convolutional neural networks cannot adequately segment the local objects at the output layer due to using the pooling layers in neural network architecture.To overcome this shortcoming,the rough prediction segmentation results of the neural network output layer will be processed by fully connected conditional random fields to improve the ability of image segmentation.The proposed method can easily be trained by end-to-end using standard backpropagation algorithms.Finally,the proposed method is tested on the ISPRS dataset.The results show that the proposed method can effectively overcome the influence of the complex structure of the segmentation object and obtain state-of-the-art accuracy on the ISPRS Vaihingen 2D semantic labeling dataset.展开更多
In the textile industry,the presence of defects on the surface of fabric is an essential factor in determining fabric quality.Therefore,identifying fabric defects forms a crucial part of the fabric production process....In the textile industry,the presence of defects on the surface of fabric is an essential factor in determining fabric quality.Therefore,identifying fabric defects forms a crucial part of the fabric production process.Traditional fabric defect detection algorithms can only detect specific materials and specific fabric defect types;in addition,their detection efficiency is low,and their detection results are relatively poor.Deep learning-based methods have many advantages in the field of fabric defect detection,however,such methods are less effective in identifying multiscale fabric defects and defects with complex shapes.Therefore,we propose an effective algorithm,namely multilayer feature extraction combined with deformable convolution(MFDC),for fabric defect detection.In MFDC,multi-layer feature extraction is used to fuse the underlying location features with high-level classification features through a horizontally connected top-down architecture to improve the detection of multi-scale fabric defects.On this basis,a deformable convolution is added to solve the problem of the algorithm’s weak detection ability of irregularly shaped fabric defects.In this approach,Roi Align and Cascade-RCNN are integrated to enhance the adaptability of the algorithm in materials with complex patterned backgrounds.The experimental results show that the MFDC algorithm can achieve good detection results for both multi-scale fabric defects and defects with complex shapes,at the expense of a small increase in detection time.展开更多
This paper presents CW-HRNet,a high-resolution,lightweight crack segmentation network designed to address challenges in complex scenes with slender,deformable,and blurred crack structures.The model incorporates two ke...This paper presents CW-HRNet,a high-resolution,lightweight crack segmentation network designed to address challenges in complex scenes with slender,deformable,and blurred crack structures.The model incorporates two key modules:Constrained Deformable Convolution(CDC),which stabilizes geometric alignment by applying a tanh limiter and learnable scaling factor to the predicted offsets,and the Wavelet Frequency Enhancement Module(WFEM),which decomposes features using Haar wavelets to preserve low-frequency structures while enhancing high-frequency boundaries and textures.Evaluations on the CrackSeg9k benchmark demonstrate CW-HRNet’s superior performance,achieving 82.39%mIoU with only 7.49M parameters and 10.34 GFLOPs,outperforming HrSegNet-B48 by 1.83% in segmentation accuracy with minimal complexity overhead.The model also shows strong cross-dataset generalization,achieving 60.01%mIoU and 66.22%F1 on Asphalt3k without fine-tuning.These results highlight CW-HRNet’s favorable accuracyefficiency trade-off for real-world crack segmentation tasks.展开更多
Deformable medical image registration plays a vital role in medical image applications,such as placing different temporal images at the same time point or different modality images into the same coordinate system.Vari...Deformable medical image registration plays a vital role in medical image applications,such as placing different temporal images at the same time point or different modality images into the same coordinate system.Various strategies have been developed to satisfy the increasing needs of deformable medical image registration.One popular registration method is estimating the displacement field by computing the optical flow between two images.The motion field(flow field)is computed based on either gray-value or handcrafted descriptors such as the scale-invariant feature transform(SIFT).These methods assume that illumination is constant between images.However,medical images may not always satisfy this assumption.In this study,we propose a metric learning-based motion estimation method called Siamese Flow for deformable medical image registration.We train metric learners using a Siamese network,which produces an image patch descriptor that guarantees a smaller feature distance in two similar anatomical structures and a larger feature distance in two dissimilar anatomical structures.In the proposed registration framework,the flow field is computed based on such features and is close to the real deformation field due to the excellent feature representation ability of the Siamese network.Experimental results demonstrate that the proposed method outperforms the Demons,SIFT Flow,Elastix,and VoxelMorph networks regarding registration accuracy and robustness,particularly with large deformations.展开更多
The intensive application of deep learning in medical image processing has facilitated the advancement of automatic retinal vessel segmentation research.To overcome the limitation that traditional U-shaped vessel segm...The intensive application of deep learning in medical image processing has facilitated the advancement of automatic retinal vessel segmentation research.To overcome the limitation that traditional U-shaped vessel segmentation networks fail to extract features in fundus image sufficiently,we propose a novel network(DSeU-net)based on deformable convolution and squeeze excitation residual module.The deformable convolution is utilized to dynamically adjust the receptive field for the feature extraction of retinal vessel.And the squeeze excitation residual module is used to scale the weights of the low-level features so that the network learns the complex relationships of the different feature layers efficiently.We validate the DSeU-net on three public retinal vessel segmentation datasets including DRIVE,CHASEDB1,and STARE,and the experimental results demonstrate the satisfactory segmentation performance of the network.展开更多
Background Exploring correspondences across multiview images is the basis of various computer vision tasks.However,most existing methods have limited accuracy under challenging conditions.Method To learn more robust a...Background Exploring correspondences across multiview images is the basis of various computer vision tasks.However,most existing methods have limited accuracy under challenging conditions.Method To learn more robust and accurate correspondences,we propose DSD-MatchingNet for local feature matching in this study.First,we develop a deformable feature extraction module to obtain multilevel feature maps,which harvest contextual information from dynamic receptive fields.The dynamic receptive fields provided by the deformable convolution network ensure that our method obtains dense and robust correspondence.Second,we utilize sparse-to-dense matching with symmetry of correspondence to implement accurate pixel-level matching,which enables our method to produce more accurate correspondences.Result Experiments show that our proposed DSD-MatchingNet achieves a better performance on the image matching benchmark,as well as on the visual localization benchmark.Specifically,our method achieved 91.3%mean matching accuracy on the HPatches dataset and 99.3%visual localization recalls on the Aachen Day-Night dataset.展开更多
The distinctive fault characteristics of battery energy storage stations(BESSs)significantly affect the reliability of conventional protection methods for transmission lines.In this paper,the three-dimensional(3D)data...The distinctive fault characteristics of battery energy storage stations(BESSs)significantly affect the reliability of conventional protection methods for transmission lines.In this paper,the three-dimensional(3D)data scattergrams are constructed using current data from both sides of the transmission line and their sum.Following a comprehensive analysis of the varying characteristics of 3D data scattergrams under different conditions,a 3D data scattergram image classification based protection method is developed.The depth-wise separable convolution is used to ensure a lightweight convolutional neural network(CNN)structure without compromising performance.In addition,a Bayesian hyperparameter optimization algorithm is used to achieve a hyperparametric search to simplify the training process.Compared with artificial neural networks and CNNs,the depth-wise separable convolution based CNN(DPCNN)achieves a higher recognition accuracy.The 3D data scattergram image classification based protection method using DPCNN can accurately separate internal faults from other disturbances and identify fault phases under different operating states and fault conditions.The proposed protection method also shows first-class tolerability against current transformer(CT)saturation and CT measurement errors.展开更多
One of the most obvious clinical reasons of dementia or The Behavioral and Psychological Symptoms of Dementia(BPSD)are the lack of emotional expression,the increased frequency of negative emotions,and the impermanence...One of the most obvious clinical reasons of dementia or The Behavioral and Psychological Symptoms of Dementia(BPSD)are the lack of emotional expression,the increased frequency of negative emotions,and the impermanence of emotions.Observing the reduction of BPSD in dementia through emotions can be considered effective and widely used in the field of non-pharmacological therapy.At present,this article will verify whether the image recognition artificial intelligence(AI)system can correctly reflect the emotional performance of the elderly with dementia through a questionnaire survey of three professional elderly nursing staff.The ANOVA(sig.=0.50)is used to determine that the judgment given by the nursing staff has no obvious deviation,and then Kendall's test(0.722**)and spearman's test(0.863**)are used to verify the judgment severity of the emotion recognition system and the nursing staff unanimously.This implies the usability of the tool.Additionally,it can be expected to be further applied in the research related to BPSD elderly emotion detection.展开更多
Two actual rocks drilled from a typical ultra-deep hydrocarbon reservoir in the Tarim Basin are selected to conduct in-situ stress-loading micro-focus CT scanning experiments.The gray images of rock microstructure at ...Two actual rocks drilled from a typical ultra-deep hydrocarbon reservoir in the Tarim Basin are selected to conduct in-situ stress-loading micro-focus CT scanning experiments.The gray images of rock microstructure at different stress loading stages are obtained.The U-Net fully convolutional neural network is utilized to achieve fine semantic segmentation of rock skeleton,pore space,and microfractures based on CT slice images of deep rocks.The three-dimensional digital rock models of deformed multiscale fractured-porous media at different stress loading stages are thereafter reconstructed,and the equivalent fracture-pore network models are finally extracted to explore the underlying mechanisms of gas-water two-phase flow at the pore-scale.Results indicate that,in the process of insitu stress loading,both the deep rocks have experienced three stages:linear elastic deformation,nonlinear plastic deformation,and shear failure.The micro-mechanical behavior greatly affects the dynamic deformation of rock microstructure and gas-water two-phase flow.In the linear elastic deformation stage,with the increase in in-situ stress,both the deep rocks are gradually compacted,leading to decreases in average pore radius,pore throat ratio,tortuosity,and water-phase relative permeability,while the coordination number nearly remains unchanged.In the plastic deformation stage,the synergistic influence of rock compaction and existence of micro-fractures typically exert a great effect on pore-throat topological properties and gas-water relative permeability.In the shear failure stage,due to the generation and propagation of micro-fractures inside the deep rock,the topological connectivity becomes better,fluid flow paths increase,and flow conductivity is promoted,thus leading to sharp increases in average pore radius and coordination number,rapid decreases in pore throat ratio and tortuosity,as well as remarkable improvement in relative permeability of gas phase and waterphase.展开更多
为解决由于无人机视角下毛竹林的形状和纹理复杂,现有方法在分割精度和鲁棒性方面表现不佳的问题,提出了一种应用跨领域适应和偏移量引导的毛竹林分割网络——BFSNet。以百山祖国家公园为试验区,利用无人机拍摄周边毛竹林图像构建数据...为解决由于无人机视角下毛竹林的形状和纹理复杂,现有方法在分割精度和鲁棒性方面表现不佳的问题,提出了一种应用跨领域适应和偏移量引导的毛竹林分割网络——BFSNet。以百山祖国家公园为试验区,利用无人机拍摄周边毛竹林图像构建数据集。为增强模型的特征提取能力,提出跨领域适应模块以有效利用源模型的强特征提取能力,并结合自主学习提取适用于毛竹林分割任务的特征,利用两者的优势进行互补。为提高模型对于不同形状毛竹林的识别和定位能力,结合可变形卷积的偏移量引导模块,引入可学习的偏移量参数,以适应不同形状的毛竹林目标。将BFSNet在DeepGlobe Land Cover Classification Challenge和自制数据集上进行模型训练和测试,并与多种主流图像分割方法进行对比。结果表明:BFSNet在交并比、Dice系数、精确率和召回率4项指标上均取得了最优的性能表现,分别获得了76.04%和71.93%的交并比。与多种主流的图像分割模型相比,BFSNet在毛竹林的分割效果方面表现最为出色,对毛竹林形状的精确建模能力能够有效地应对不同形态的毛竹林。展开更多
Extracting implicit anomaly information through deformation monitoring data mining is highly significant to determining dam safety status.As an intelligent singular value diagnostic method for concrete dam deformation...Extracting implicit anomaly information through deformation monitoring data mining is highly significant to determining dam safety status.As an intelligent singular value diagnostic method for concrete dam deformation monitoring, shallow neural network models result in local optima and overfitting, and require manual feature extraction.To obtain an intelligent singular value diagnosis model that can be used for dam safety monitoring, a convolutional neural network (CNN) model that has advantages of deep learning (DL), such as automatic feature extraction, good model fitting, and strong generalizability, was trained in this study.An engineering example shows that the predicted result of the intelligent singular value diagnostic method based on CNN is highly compatible with the confusion matrix, with a precision of 92.41%, receiver operating characteristic (ROC) coordinates of (0.03, 0.97), an area-under-curve (AUC) value of 0.99, and an F1-score of 0.91.Moreover, the performance of the CNN model is better than those of models based on decision tree (DT) and k-nearest neighbor (KNN) methods.Therefore, the intelligent singular value diagnostic method based on CNN is simple to operate, highly intelligent, and highly reliable, and it has a high potential for application in engineering.展开更多
Lane detection is a fundamental necessary task for autonomous driving.The conventional methods mainly treat lane detection as a pixel-wise segmentation problem,which suffers from the challenge of uncontrollable drivin...Lane detection is a fundamental necessary task for autonomous driving.The conventional methods mainly treat lane detection as a pixel-wise segmentation problem,which suffers from the challenge of uncontrollable driving road environments and needs post-processing to abstract the lane parameters.In this work,a series of lines are used to represent traffic lanes and a novel line deformation network(LDNet) is proposed to directly predict the coordinates of lane line points.Inspired by the dynamic behavior of classic snake algorithms,LDNet uses a neural network to iteratively deform an initial lane line to match the lane markings.To capture the long and discontinuous structures of lane lines,1 D convolution in LDNet is used for structured feature learning along the lane lines.Based on LDNet,a two-stage pipeline is developed for lane marking detection:(1) initial lane line proposal to predict a list of lane line candidates,and(2) lane line deformation to obtain the coordinates of lane line points.Experiments show that the proposed approach achieves competitive performances on the TuSimple dataset while being efficient for real-time applications on a GTX 1650 GPU.In particular,the accuracy of LDNet with the annotated starting and ending points is up to99.45%,which indicates the improved initial lane line proposal method can further enhance the performance of LDNet.展开更多
基金National Key Research and Development Program of China(No.2017YFC0405806)。
文摘Currently,deep convolutional neural networks have made great progress in the field of semantic segmentation.Because of the fixed convolution kernel geometry,standard convolution neural networks have been limited the ability to simulate geometric transformations.Therefore,a deformable convolution is introduced to enhance the adaptability of convolutional networks to spatial transformation.Considering that the deep convolutional neural networks cannot adequately segment the local objects at the output layer due to using the pooling layers in neural network architecture.To overcome this shortcoming,the rough prediction segmentation results of the neural network output layer will be processed by fully connected conditional random fields to improve the ability of image segmentation.The proposed method can easily be trained by end-to-end using standard backpropagation algorithms.Finally,the proposed method is tested on the ISPRS dataset.The results show that the proposed method can effectively overcome the influence of the complex structure of the segmentation object and obtain state-of-the-art accuracy on the ISPRS Vaihingen 2D semantic labeling dataset.
基金supported in part by the National Science Foundation of China under Grant 62001236in part by the Natural Science Foundation of the Jiangsu Higher Education Institutions of China under Grant 20KJA520003.
文摘In the textile industry,the presence of defects on the surface of fabric is an essential factor in determining fabric quality.Therefore,identifying fabric defects forms a crucial part of the fabric production process.Traditional fabric defect detection algorithms can only detect specific materials and specific fabric defect types;in addition,their detection efficiency is low,and their detection results are relatively poor.Deep learning-based methods have many advantages in the field of fabric defect detection,however,such methods are less effective in identifying multiscale fabric defects and defects with complex shapes.Therefore,we propose an effective algorithm,namely multilayer feature extraction combined with deformable convolution(MFDC),for fabric defect detection.In MFDC,multi-layer feature extraction is used to fuse the underlying location features with high-level classification features through a horizontally connected top-down architecture to improve the detection of multi-scale fabric defects.On this basis,a deformable convolution is added to solve the problem of the algorithm’s weak detection ability of irregularly shaped fabric defects.In this approach,Roi Align and Cascade-RCNN are integrated to enhance the adaptability of the algorithm in materials with complex patterned backgrounds.The experimental results show that the MFDC algorithm can achieve good detection results for both multi-scale fabric defects and defects with complex shapes,at the expense of a small increase in detection time.
文摘This paper presents CW-HRNet,a high-resolution,lightweight crack segmentation network designed to address challenges in complex scenes with slender,deformable,and blurred crack structures.The model incorporates two key modules:Constrained Deformable Convolution(CDC),which stabilizes geometric alignment by applying a tanh limiter and learnable scaling factor to the predicted offsets,and the Wavelet Frequency Enhancement Module(WFEM),which decomposes features using Haar wavelets to preserve low-frequency structures while enhancing high-frequency boundaries and textures.Evaluations on the CrackSeg9k benchmark demonstrate CW-HRNet’s superior performance,achieving 82.39%mIoU with only 7.49M parameters and 10.34 GFLOPs,outperforming HrSegNet-B48 by 1.83% in segmentation accuracy with minimal complexity overhead.The model also shows strong cross-dataset generalization,achieving 60.01%mIoU and 66.22%F1 on Asphalt3k without fine-tuning.These results highlight CW-HRNet’s favorable accuracyefficiency trade-off for real-world crack segmentation tasks.
基金This study was supported in part by the Sichuan Science and Technology Program(2019YFH0085,2019ZDZX0005,2019YFG0196)in part by the Foundation of Chengdu University of Information Technology(No.KYTZ202008).
文摘Deformable medical image registration plays a vital role in medical image applications,such as placing different temporal images at the same time point or different modality images into the same coordinate system.Various strategies have been developed to satisfy the increasing needs of deformable medical image registration.One popular registration method is estimating the displacement field by computing the optical flow between two images.The motion field(flow field)is computed based on either gray-value or handcrafted descriptors such as the scale-invariant feature transform(SIFT).These methods assume that illumination is constant between images.However,medical images may not always satisfy this assumption.In this study,we propose a metric learning-based motion estimation method called Siamese Flow for deformable medical image registration.We train metric learners using a Siamese network,which produces an image patch descriptor that guarantees a smaller feature distance in two similar anatomical structures and a larger feature distance in two dissimilar anatomical structures.In the proposed registration framework,the flow field is computed based on such features and is close to the real deformation field due to the excellent feature representation ability of the Siamese network.Experimental results demonstrate that the proposed method outperforms the Demons,SIFT Flow,Elastix,and VoxelMorph networks regarding registration accuracy and robustness,particularly with large deformations.
基金Beijing Natural Science Foundation(No.IS23112)Beijing Institute of Technology Research Fund Program for Young Scholars(No.6120220236)。
文摘The intensive application of deep learning in medical image processing has facilitated the advancement of automatic retinal vessel segmentation research.To overcome the limitation that traditional U-shaped vessel segmentation networks fail to extract features in fundus image sufficiently,we propose a novel network(DSeU-net)based on deformable convolution and squeeze excitation residual module.The deformable convolution is utilized to dynamically adjust the receptive field for the feature extraction of retinal vessel.And the squeeze excitation residual module is used to scale the weights of the low-level features so that the network learns the complex relationships of the different feature layers efficiently.We validate the DSeU-net on three public retinal vessel segmentation datasets including DRIVE,CHASEDB1,and STARE,and the experimental results demonstrate the satisfactory segmentation performance of the network.
基金Supported by the National Natural Science Foundation of China under Grants 61872241,62077037 and 62272298in part by Shanghai Municipal Science and Technology Major Project under Grant 2021SHZDZX0102。
文摘Background Exploring correspondences across multiview images is the basis of various computer vision tasks.However,most existing methods have limited accuracy under challenging conditions.Method To learn more robust and accurate correspondences,we propose DSD-MatchingNet for local feature matching in this study.First,we develop a deformable feature extraction module to obtain multilevel feature maps,which harvest contextual information from dynamic receptive fields.The dynamic receptive fields provided by the deformable convolution network ensure that our method obtains dense and robust correspondence.Second,we utilize sparse-to-dense matching with symmetry of correspondence to implement accurate pixel-level matching,which enables our method to produce more accurate correspondences.Result Experiments show that our proposed DSD-MatchingNet achieves a better performance on the image matching benchmark,as well as on the visual localization benchmark.Specifically,our method achieved 91.3%mean matching accuracy on the HPatches dataset and 99.3%visual localization recalls on the Aachen Day-Night dataset.
基金supported by the Fundamental Research Funds for Central Universities(No.2024JCCXJD01).
文摘The distinctive fault characteristics of battery energy storage stations(BESSs)significantly affect the reliability of conventional protection methods for transmission lines.In this paper,the three-dimensional(3D)data scattergrams are constructed using current data from both sides of the transmission line and their sum.Following a comprehensive analysis of the varying characteristics of 3D data scattergrams under different conditions,a 3D data scattergram image classification based protection method is developed.The depth-wise separable convolution is used to ensure a lightweight convolutional neural network(CNN)structure without compromising performance.In addition,a Bayesian hyperparameter optimization algorithm is used to achieve a hyperparametric search to simplify the training process.Compared with artificial neural networks and CNNs,the depth-wise separable convolution based CNN(DPCNN)achieves a higher recognition accuracy.The 3D data scattergram image classification based protection method using DPCNN can accurately separate internal faults from other disturbances and identify fault phases under different operating states and fault conditions.The proposed protection method also shows first-class tolerability against current transformer(CT)saturation and CT measurement errors.
文摘One of the most obvious clinical reasons of dementia or The Behavioral and Psychological Symptoms of Dementia(BPSD)are the lack of emotional expression,the increased frequency of negative emotions,and the impermanence of emotions.Observing the reduction of BPSD in dementia through emotions can be considered effective and widely used in the field of non-pharmacological therapy.At present,this article will verify whether the image recognition artificial intelligence(AI)system can correctly reflect the emotional performance of the elderly with dementia through a questionnaire survey of three professional elderly nursing staff.The ANOVA(sig.=0.50)is used to determine that the judgment given by the nursing staff has no obvious deviation,and then Kendall's test(0.722**)and spearman's test(0.863**)are used to verify the judgment severity of the emotion recognition system and the nursing staff unanimously.This implies the usability of the tool.Additionally,it can be expected to be further applied in the research related to BPSD elderly emotion detection.
基金supported by the National Natural Science Foundation of China(No.52174043)the Beijing Natural Science Foundation(No.3242019)+1 种基金the CNPC Innovation Foundation(No.2022DQ02-0208)the State Key Laboratory of Deep Oil and Gas(No.SKLD0G2024-KFZD-06).
文摘Two actual rocks drilled from a typical ultra-deep hydrocarbon reservoir in the Tarim Basin are selected to conduct in-situ stress-loading micro-focus CT scanning experiments.The gray images of rock microstructure at different stress loading stages are obtained.The U-Net fully convolutional neural network is utilized to achieve fine semantic segmentation of rock skeleton,pore space,and microfractures based on CT slice images of deep rocks.The three-dimensional digital rock models of deformed multiscale fractured-porous media at different stress loading stages are thereafter reconstructed,and the equivalent fracture-pore network models are finally extracted to explore the underlying mechanisms of gas-water two-phase flow at the pore-scale.Results indicate that,in the process of insitu stress loading,both the deep rocks have experienced three stages:linear elastic deformation,nonlinear plastic deformation,and shear failure.The micro-mechanical behavior greatly affects the dynamic deformation of rock microstructure and gas-water two-phase flow.In the linear elastic deformation stage,with the increase in in-situ stress,both the deep rocks are gradually compacted,leading to decreases in average pore radius,pore throat ratio,tortuosity,and water-phase relative permeability,while the coordination number nearly remains unchanged.In the plastic deformation stage,the synergistic influence of rock compaction and existence of micro-fractures typically exert a great effect on pore-throat topological properties and gas-water relative permeability.In the shear failure stage,due to the generation and propagation of micro-fractures inside the deep rock,the topological connectivity becomes better,fluid flow paths increase,and flow conductivity is promoted,thus leading to sharp increases in average pore radius and coordination number,rapid decreases in pore throat ratio and tortuosity,as well as remarkable improvement in relative permeability of gas phase and waterphase.
文摘为解决由于无人机视角下毛竹林的形状和纹理复杂,现有方法在分割精度和鲁棒性方面表现不佳的问题,提出了一种应用跨领域适应和偏移量引导的毛竹林分割网络——BFSNet。以百山祖国家公园为试验区,利用无人机拍摄周边毛竹林图像构建数据集。为增强模型的特征提取能力,提出跨领域适应模块以有效利用源模型的强特征提取能力,并结合自主学习提取适用于毛竹林分割任务的特征,利用两者的优势进行互补。为提高模型对于不同形状毛竹林的识别和定位能力,结合可变形卷积的偏移量引导模块,引入可学习的偏移量参数,以适应不同形状的毛竹林目标。将BFSNet在DeepGlobe Land Cover Classification Challenge和自制数据集上进行模型训练和测试,并与多种主流图像分割方法进行对比。结果表明:BFSNet在交并比、Dice系数、精确率和召回率4项指标上均取得了最优的性能表现,分别获得了76.04%和71.93%的交并比。与多种主流的图像分割模型相比,BFSNet在毛竹林的分割效果方面表现最为出色,对毛竹林形状的精确建模能力能够有效地应对不同形态的毛竹林。
基金supported by the National Natural Science Foundation of China(Grant No.51579207)the Open Foundation of State Key Laboratory Base of Eco-Hydraulic Engineering in Arid Area(Grant No.2016ZZKT-8)the Key Projects of Natural Science Basic Research Program of Shaanxi Province(Grant No.2018JZ5010)
文摘Extracting implicit anomaly information through deformation monitoring data mining is highly significant to determining dam safety status.As an intelligent singular value diagnostic method for concrete dam deformation monitoring, shallow neural network models result in local optima and overfitting, and require manual feature extraction.To obtain an intelligent singular value diagnosis model that can be used for dam safety monitoring, a convolutional neural network (CNN) model that has advantages of deep learning (DL), such as automatic feature extraction, good model fitting, and strong generalizability, was trained in this study.An engineering example shows that the predicted result of the intelligent singular value diagnostic method based on CNN is highly compatible with the confusion matrix, with a precision of 92.41%, receiver operating characteristic (ROC) coordinates of (0.03, 0.97), an area-under-curve (AUC) value of 0.99, and an F1-score of 0.91.Moreover, the performance of the CNN model is better than those of models based on decision tree (DT) and k-nearest neighbor (KNN) methods.Therefore, the intelligent singular value diagnostic method based on CNN is simple to operate, highly intelligent, and highly reliable, and it has a high potential for application in engineering.
基金Supported by the Science and Technology Research Project of Hubei Provincial Department of Education (No.Q20202604)。
文摘Lane detection is a fundamental necessary task for autonomous driving.The conventional methods mainly treat lane detection as a pixel-wise segmentation problem,which suffers from the challenge of uncontrollable driving road environments and needs post-processing to abstract the lane parameters.In this work,a series of lines are used to represent traffic lanes and a novel line deformation network(LDNet) is proposed to directly predict the coordinates of lane line points.Inspired by the dynamic behavior of classic snake algorithms,LDNet uses a neural network to iteratively deform an initial lane line to match the lane markings.To capture the long and discontinuous structures of lane lines,1 D convolution in LDNet is used for structured feature learning along the lane lines.Based on LDNet,a two-stage pipeline is developed for lane marking detection:(1) initial lane line proposal to predict a list of lane line candidates,and(2) lane line deformation to obtain the coordinates of lane line points.Experiments show that the proposed approach achieves competitive performances on the TuSimple dataset while being efficient for real-time applications on a GTX 1650 GPU.In particular,the accuracy of LDNet with the annotated starting and ending points is up to99.45%,which indicates the improved initial lane line proposal method can further enhance the performance of LDNet.