The semantic segmentation methods based on CNN have made great progress,but there are still some shortcomings in the application of remote sensing images segmentation,such as the small receptive field can not effectiv...The semantic segmentation methods based on CNN have made great progress,but there are still some shortcomings in the application of remote sensing images segmentation,such as the small receptive field can not effectively capture global context.In order to solve this problem,this paper proposes a hybrid model based on ResNet50 and swin transformer to directly capture long-range dependence,which fuses features through Cross Feature Modulation Module(CFMM).Experimental results on two publicly available datasets,Vaihingen and Potsdam,are mIoU of 70.27%and 76.63%,respectively.Thus,CFM-UNet can maintain a high segmentation performance compared with other competitive networks.展开更多
The application of transformer networks and feature fusion models in medical image segmentation has aroused considerable attention within the academic circle.Nevertheless,two main obstacles persist:(1)the restrictions...The application of transformer networks and feature fusion models in medical image segmentation has aroused considerable attention within the academic circle.Nevertheless,two main obstacles persist:(1)the restrictions of the Transformer network in dealing with locally detailed features,and(2)the considerable loss of feature information in current feature fusion modules.To solve these issues,this study initially presents a refined feature extraction approach,employing a double-branch feature extraction network to capture complex multi-scale local and global information from images.Subsequently,we proposed a low-loss feature fusion method-Multi-branch Feature Fusion Enhancement Module(MFFEM),which realizes effective feature fusion with minimal loss.Simultaneously,the cross-layer cross-attention fusion module(CLCA)is adopted to further achieve adequate feature fusion by enhancing the interaction between encoders and decoders of various scales.Finally,the feasibility of our method was verified using the Synapse and ACDC datasets,demonstrating its competitiveness.The average DSC(%)was 83.62 and 91.99 respectively,and the average HD95(mm)was reduced to 19.55 and 1.15 respectively.展开更多
Accurate and rapid recognition of weathering degree(WD)and groundwater condition(GC)is essential for evaluating rock mass quality and conducting stability analyses in underground engineering.Conventional WD and GC rec...Accurate and rapid recognition of weathering degree(WD)and groundwater condition(GC)is essential for evaluating rock mass quality and conducting stability analyses in underground engineering.Conventional WD and GC recognition methods often rely on subjective evaluation by field experts,supplemented by field sampling and laboratory testing.These methods are frequently complex and timeconsuming,making it challenging to meet the rapidly evolving demands of underground engineering.Therefore,this study proposes a rock non-geometric parameter classification network(RNPC-net)to rapidly achieve the recognition and mapping ofWD and GC of tunnel faces.The hybrid feature extraction module(HFEM)in RNPC-net can fully extract,fuse,and utilize multi-scale features of images,enhancing the network's classification performance.Moreover,the designed adaptive weighting auxiliary classifier(AC)helps the network learn features more efficiently.Experimental results show that RNPC-net achieved classification accuracies of 0.8756 and 0.8710 for WD and GC,respectively,representing an improvement of approximately 2%e10%compared to other methods.Both quantitative and qualitative experiments confirm the effectiveness and superiority of RNPC-net.Furthermore,for WD and GC mapping,RNPC-net outperformed other methods by achieving the highest mean intersection over union(mIOU)across most tunnel faces.The mapping results closely align with measurements provided by field experts.The application of WD and GC mapping results to the rock mass rating(RMR)system achieved a transition from conventional qualitative to quantitative evaluation.This advancement enables more accurate and reliable rock mass quality evaluations,particularly under critical conditions of RMR.展开更多
An automatic method for classifying frequency shift keying(FSK),minimum shift keying(MSK),phase shift keying(PSK),quadrature amplitude modulation(QAM),and orthogonal frequency division multiplexing(OFDM)is proposed by...An automatic method for classifying frequency shift keying(FSK),minimum shift keying(MSK),phase shift keying(PSK),quadrature amplitude modulation(QAM),and orthogonal frequency division multiplexing(OFDM)is proposed by simultaneously using normality test,spectral analysis,and geometrical characteristics of in-phase-quadrature(I-Q)constellation diagram.Since the extracted features are unique for each modulation,they can be considered as a fingerprint of each modulation.We show that the proposed algorithm outperforms the previously published methods in terms of signal-to-noise ratio(SNR)and success rate.For example,the success rate of the proposed method for 64-QAM modulation at SNR=11 dB is 99%.Another advantage of the proposed method is its wide SNR range;such that the probability of classification for 16-QAM at SNR=3 dB is almost 1.The proposed method also provides a database for geometrical features of I-Q constellation diagram.By comparing and correlating the data of the provided database with the estimated I-Q diagram of the received signal,the processing gain of 4 dB is obtained.Whatever can be mentioned about the preference of the proposed algorithm are low complexity,low SNR,wide range of modulation set,and enhanced recognition at higher-order modulations.展开更多
The unsupervised multi-modal image translation is an emerging domain of computer vision whose goal is to transform an image from the source domain into many diverse styles in the target domain.However,the multi-genera...The unsupervised multi-modal image translation is an emerging domain of computer vision whose goal is to transform an image from the source domain into many diverse styles in the target domain.However,the multi-generator mechanism is employed among the advanced approaches available to model different domain mappings,which results in inefficient training of neural networks and pattern collapse,leading to inefficient generation of image diversity.To address this issue,this paper introduces a multi-modal unsupervised image translation framework that uses a generator to perform multi-modal image translation.Specifically,firstly,the domain code is introduced in this paper to explicitly control the different generation tasks.Secondly,this paper brings in the squeeze-and-excitation(SE)mechanism and feature attention(FA)module.Finally,the model integrates multiple optimization objectives to ensure efficient multi-modal translation.This paper performs qualitative and quantitative experiments on multiple non-paired benchmark image translation datasets while demonstrating the benefits of the proposed method over existing technologies.Overall,experimental results have shown that the proposed method is versatile and scalable.展开更多
Automatic crack detection of cement pavement chiefly benefits from the rapid development of deep learning,with convolutional neural networks(CNN)playing an important role in this field.However,as the performance of cr...Automatic crack detection of cement pavement chiefly benefits from the rapid development of deep learning,with convolutional neural networks(CNN)playing an important role in this field.However,as the performance of crack detection in cement pavement improves,the depth and width of the network structure are significantly increased,which necessitates more computing power and storage space.This limitation hampers the practical implementation of crack detection models on various platforms,particularly portable devices like small mobile devices.To solve these problems,we propose a dual-encoder-based network architecture that focuses on extracting more comprehensive fracture feature information and combines cross-fusion modules and coordinated attention mechanisms formore efficient feature fusion.Firstly,we use small channel convolution to construct shallow feature extractionmodule(SFEM)to extract low-level feature information of cracks in cement pavement images,in order to obtainmore information about cracks in the shallowfeatures of images.In addition,we construct large kernel atrous convolution(LKAC)to enhance crack information,which incorporates coordination attention mechanism for non-crack information filtering,and large kernel atrous convolution with different cores,using different receptive fields to extract more detailed edge and context information.Finally,the three-stage feature map outputs from the shallow feature extraction module is cross-fused with the two-stage feature map outputs from the large kernel atrous convolution module,and the shallow feature and detailed edge feature are fully fused to obtain the final crack prediction map.We evaluate our method on three public crack datasets:DeepCrack,CFD,and Crack500.Experimental results on theDeepCrack dataset demonstrate the effectiveness of our proposed method compared to state-of-the-art crack detection methods,which achieves Precision(P)87.2%,Recall(R)87.7%,and F-score(F1)87.4%.Thanks to our lightweight crack detectionmodel,the parameter count of the model in real-world detection scenarios has been significantly reduced to less than 2M.This advancement also facilitates technical support for portable scene detection.展开更多
The off situ accurate reconstruction of the core neutron field is an important step in realizing real-time reactor monitoring.The existing off situ reconstruction method of the neutron field is only applicable to case...The off situ accurate reconstruction of the core neutron field is an important step in realizing real-time reactor monitoring.The existing off situ reconstruction method of the neutron field is only applicable to cases wherein a single region changes at a specified location of the core.However,when the neutron field changes are complex,the accurate identification of the individual changed regions becomes challenging,which seriously affects the accuracy and stability of the neutron field recon-struction.Therefore,this study proposed a dual-task hybrid network architecture(DTHNet)for off situ reconstruction of the core neutron field,which trained the outermost assembly reconstruction task and the core reconstruction task jointly such that the former could assist the latter in the reconstruction of the core neutron field under core complex changes.Furthermore,to exploit the characteristics of the ex-core detection signals,this study designed a global-local feature upsampling module that efficiently distributed the ex-core detection signals to each reconstruction unit to improve the accuracy and stability of reconstruction.Reconstruction experiments were performed on the simulation datasets of the CLEAR-I reactor to verify the accuracy and stability of the proposed method.The results showed that when the location uncertainty of a single region did not exceed nine and the number of multiple changed regions did not exceed five.Further,the reconstructed ARD was within 2%,RD_(max)was maintained within 17.5%,and the number of RD≥10%was maintained within 10.Furthermore,when the noise interference of the ex-core detection signals was within±2%,although the average number of RD≥10%increased to 16,the average ARD was still within in 2%,and the average RD_(max)was within 22%.Collectively,these results show that,theoretically,the DTHNet can accurately and stably reconstruct most of the neutron field under certain complex core changes.展开更多
Low-light image enhancement methods have limitations in addressing issues such as color distortion,lack of vibrancy,and uneven light distribution and often require paired training data.To address these issues,we propo...Low-light image enhancement methods have limitations in addressing issues such as color distortion,lack of vibrancy,and uneven light distribution and often require paired training data.To address these issues,we propose a two-stage unsupervised low-light image enhancement algorithm called Retinex and Exposure Fusion Network(RFNet),which can overcome the problems of over-enhancement of the high dynamic range and under-enhancement of the low dynamic range in existing enhancement algorithms.This algorithm can better manage the challenges brought about by complex environments in real-world scenarios by training with unpaired low-light images and regular-light images.In the first stage,we design a multi-scale feature extraction module based on Retinex theory,capable of extracting details and structural information at different scales to generate high-quality illumination and reflection images.In the second stage,an exposure image generator is designed through the camera response mechanism function to acquire exposure images containing more dark features,and the generated images are fused with the original input images to complete the low-light image enhancement.Experiments show the effectiveness and rationality of each module designed in this paper.And the method reconstructs the details of contrast and color distribution,outperforms the current state-of-the-art methods in both qualitative and quantitative metrics,and shows excellent performance in the real world.展开更多
Social robot accounts controlled by artificial intelligence or humans are active in social networks,bringing negative impacts to network security and social life.Existing social robot detection methods based on graph ...Social robot accounts controlled by artificial intelligence or humans are active in social networks,bringing negative impacts to network security and social life.Existing social robot detection methods based on graph neural networks suffer from the problem of many social network nodes and complex relationships,which makes it difficult to accurately describe the difference between the topological relations of nodes,resulting in low detection accuracy of social robots.This paper proposes a social robot detection method with the use of an improved neural network.First,social relationship subgraphs are constructed by leveraging the user’s social network to disentangle intricate social relationships effectively.Then,a linear modulated graph attention residual network model is devised to extract the node and network topology features of the social relation subgraph,thereby generating comprehensive social relation subgraph features,and the feature-wise linear modulation module of the model can better learn the differences between the nodes.Next,user text content and behavioral gene sequences are extracted to construct social behavioral features combined with the social relationship subgraph features.Finally,social robots can be more accurately identified by combining user behavioral and relationship features.By carrying out experimental studies based on the publicly available datasets TwiBot-20 and Cresci-15,the suggested method’s detection accuracies can achieve 86.73%and 97.86%,respectively.Compared with the existing mainstream approaches,the accuracy of the proposed method is 2.2%and 1.35%higher on the two datasets.The results show that the method proposed in this paper can effectively detect social robots and maintain a healthy ecological environment of social networks.展开更多
The numerous photos captured by low-price Internet of Things(IoT)sensors are frequently affected by meteorological factors,especially rainfall.It causes varying sizes of white streaks on the image,destroying the image...The numerous photos captured by low-price Internet of Things(IoT)sensors are frequently affected by meteorological factors,especially rainfall.It causes varying sizes of white streaks on the image,destroying the image texture and ruining the performance of the outdoor computer vision system.Existing methods utilise training with pairs of images,which is difficult to cover all scenes and leads to domain gaps.In addition,the network structures adopt deep learning to map rain images to rain-free images,failing to use prior knowledge effectively.To solve these problems,we introduce a single image derain model in edge computing that combines prior knowledge of rain patterns with the learning capability of the neural network.Specifically,the algorithm first uses Residue Channel Prior to filter out the rainfall textural features then it uses the Feature Fusion Module to fuse the original image with the background feature information.This results in a pre-processed image which is fed into Half Instance Net(HINet)to recover a high-quality rain-free image with a clear and accurate structure,and the model does not rely on any rainfall assumptions.Experimental results on synthetic and real-world datasets show that the average peak signal-to-noise ratio of the model decreases by 0.37 dB on the synthetic dataset and increases by 0.43 dB on the real-world dataset,demonstrating that a combined model reduces the gap between synthetic data and natural rain scenes,improves the generalization ability of the derain network,and alleviates the overfitting problem.展开更多
Sputum smear tests are critical for the diagnosis of respiratory diseases. Automatic segmentation of bacteria from spu-tum smear images is important for improving diagnostic efficiency. However, this remains a challen...Sputum smear tests are critical for the diagnosis of respiratory diseases. Automatic segmentation of bacteria from spu-tum smear images is important for improving diagnostic efficiency. However, this remains a challenging task owing to the high interclass similarity among different categories of bacteria and the low contrast of the bacterial edges. To explore more levels of global pattern features to promote the distinguishing ability of bacterial categories and main-tain sufficient local fine-grained features to ensure accurate localization of ambiguous bacteria simultaneously, we propose a novel dual-branch deformable cross-attention fusion network (DB-DCAFN) for accurate bacterial segmen-tation. Specifically, we first designed a dual-branch encoder consisting of multiple convolution and transformer blocks in parallel to simultaneously extract multilevel local and global features. We then designed a sparse and deformable cross-attention module to capture the semantic dependencies between local and global features, which can bridge the semantic gap and fuse features effectively. Furthermore, we designed a feature assignment fusion module to enhance meaningful features using an adaptive feature weighting strategy to obtain more accurate segmentation. We conducted extensive experiments to evaluate the effectiveness of DB-DCAFN on a clinical dataset comprising three bacterial categories: Acinetobacter baumannii, Klebsiella pneumoniae, and Pseudomonas aeruginosa. The experi-mental results demonstrate that the proposed DB-DCAFN outperforms other state-of-the-art methods and is effective at segmenting bacteria from sputum smear images.展开更多
To address the issues of small target miss detection,false positives in complex scenarios,and insufficient real-time performance in maglev train foreign object intrusion detection,this paper proposes a multi-module fu...To address the issues of small target miss detection,false positives in complex scenarios,and insufficient real-time performance in maglev train foreign object intrusion detection,this paper proposes a multi-module fusion improvement algorithm,YOLO11-FADA(Fusion of Augmented Features and Dynamic Attention),based on YOLO11.The model achieves collaborative optimization through three key modules:The Local Feature Augmentation Module(LFAM)enhances small target features and mitigates feature loss during down-sampling through multi-scale feature parallel extraction and attention fusion.The Dynamically Tuned Self-Attention(DTSA)module introduces learnable parameters to adjust attentionweights dynamically,and,in combinationwith convolution,expands the receptive field to suppress complex background interference.TheWeighted Convolution 2D(wConv2D)module optimizes convolution kernel weights using symmetric density functions and sparsification,reducing the parameter count by 30% while retaining core feature extraction capabilities.YOLO11-FADA achieves a mAP@0.5 of 0.907 on a custom maglev train foreign object dataset,improving by 3.0% over the baseline YOLO11 model.The model’s computational complexity is 7.3 GFLOPs,with a detection speed of 118.6 FPS,striking a balance between detection accuracy and real-time performance,thereby offering an efficient solution for rail transit safety monitoring.展开更多
Rubber products have become an important strategic resource in the global economy.However,individual rubber tree segmentation in plantation environments remains challenging due to canopy background interfer-ence and s...Rubber products have become an important strategic resource in the global economy.However,individual rubber tree segmentation in plantation environments remains challenging due to canopy background interfer-ence and significant morphological variations among trees.To address these issues,we propose a high-precision segmentation network,TM-WSNet(Spatial Geometry Enhanced Hybrid Feature Extraction Module-Wavelet Grid Feature Fusion Encoder Segmentation Network).First,we introduce SGTramba,a hybrid feature extraction module combining Grouped Transformer and Mamba architectures,designed to reduce confusion between tree crown boundaries and surrounding vegetation or background elements.Second,we propose the WGMS encoder,which enhances structural feature recognition by applying wavelet-based spatial grid downsampling and mul-tiscale feature fusion,effectively handling variations in canopy shape and tree height.Third,a scale optimization algorithm(SCPO)is developed to adaptively search for the optimal learning rate,addressing uneven learning across different resolution scales.We evaluate TM-WSNet on a self-constructed dataset(RubberTree)and two public datasets(ShapeNetPart and ForestSemantic),where it consistently achieves high segmentation accuracy and robustness.In practical field tests,our method accurately predicts key rubber tree parameters—height,crown width,and diameter at breast height with coefficients of determination(R^(2))of 1.00,0.99,and 0.89,respectively.These results demonstrate TM-WSNet's strong potential for supporting precision rubber yield estimation and health monitoring in complex plantation environments.展开更多
基金Young Innovative Talents Project of Guangdong Ordinary Universities(No.2022KQNCX225)School-level Teaching and Research Project of Guangzhou City Polytechnic(No.2022xky046)。
文摘The semantic segmentation methods based on CNN have made great progress,but there are still some shortcomings in the application of remote sensing images segmentation,such as the small receptive field can not effectively capture global context.In order to solve this problem,this paper proposes a hybrid model based on ResNet50 and swin transformer to directly capture long-range dependence,which fuses features through Cross Feature Modulation Module(CFMM).Experimental results on two publicly available datasets,Vaihingen and Potsdam,are mIoU of 70.27%and 76.63%,respectively.Thus,CFM-UNet can maintain a high segmentation performance compared with other competitive networks.
基金funded by the Henan Science and Technology research project(222103810042)Support by the open project of scientific research platform of grain information processing center of Henan University of Technology(KFJJ-2021-108)+1 种基金Support by the innovative funds plan of Henan University of Technology(2021ZKCJ14)Henan University of Technology Youth Backbone Teacher Program.
文摘The application of transformer networks and feature fusion models in medical image segmentation has aroused considerable attention within the academic circle.Nevertheless,two main obstacles persist:(1)the restrictions of the Transformer network in dealing with locally detailed features,and(2)the considerable loss of feature information in current feature fusion modules.To solve these issues,this study initially presents a refined feature extraction approach,employing a double-branch feature extraction network to capture complex multi-scale local and global information from images.Subsequently,we proposed a low-loss feature fusion method-Multi-branch Feature Fusion Enhancement Module(MFFEM),which realizes effective feature fusion with minimal loss.Simultaneously,the cross-layer cross-attention fusion module(CLCA)is adopted to further achieve adequate feature fusion by enhancing the interaction between encoders and decoders of various scales.Finally,the feasibility of our method was verified using the Synapse and ACDC datasets,demonstrating its competitiveness.The average DSC(%)was 83.62 and 91.99 respectively,and the average HD95(mm)was reduced to 19.55 and 1.15 respectively.
基金supported by the National Natural Science Foundation of China(Grant Nos.42077242 and 42171407)the Graduate Innovation Fund of Jilin University.
文摘Accurate and rapid recognition of weathering degree(WD)and groundwater condition(GC)is essential for evaluating rock mass quality and conducting stability analyses in underground engineering.Conventional WD and GC recognition methods often rely on subjective evaluation by field experts,supplemented by field sampling and laboratory testing.These methods are frequently complex and timeconsuming,making it challenging to meet the rapidly evolving demands of underground engineering.Therefore,this study proposes a rock non-geometric parameter classification network(RNPC-net)to rapidly achieve the recognition and mapping ofWD and GC of tunnel faces.The hybrid feature extraction module(HFEM)in RNPC-net can fully extract,fuse,and utilize multi-scale features of images,enhancing the network's classification performance.Moreover,the designed adaptive weighting auxiliary classifier(AC)helps the network learn features more efficiently.Experimental results show that RNPC-net achieved classification accuracies of 0.8756 and 0.8710 for WD and GC,respectively,representing an improvement of approximately 2%e10%compared to other methods.Both quantitative and qualitative experiments confirm the effectiveness and superiority of RNPC-net.Furthermore,for WD and GC mapping,RNPC-net outperformed other methods by achieving the highest mean intersection over union(mIOU)across most tunnel faces.The mapping results closely align with measurements provided by field experts.The application of WD and GC mapping results to the rock mass rating(RMR)system achieved a transition from conventional qualitative to quantitative evaluation.This advancement enables more accurate and reliable rock mass quality evaluations,particularly under critical conditions of RMR.
文摘An automatic method for classifying frequency shift keying(FSK),minimum shift keying(MSK),phase shift keying(PSK),quadrature amplitude modulation(QAM),and orthogonal frequency division multiplexing(OFDM)is proposed by simultaneously using normality test,spectral analysis,and geometrical characteristics of in-phase-quadrature(I-Q)constellation diagram.Since the extracted features are unique for each modulation,they can be considered as a fingerprint of each modulation.We show that the proposed algorithm outperforms the previously published methods in terms of signal-to-noise ratio(SNR)and success rate.For example,the success rate of the proposed method for 64-QAM modulation at SNR=11 dB is 99%.Another advantage of the proposed method is its wide SNR range;such that the probability of classification for 16-QAM at SNR=3 dB is almost 1.The proposed method also provides a database for geometrical features of I-Q constellation diagram.By comparing and correlating the data of the provided database with the estimated I-Q diagram of the received signal,the processing gain of 4 dB is obtained.Whatever can be mentioned about the preference of the proposed algorithm are low complexity,low SNR,wide range of modulation set,and enhanced recognition at higher-order modulations.
基金the National Natural Science Foundation of China(No.61976080)the Academic Degrees&Graduate Education Reform Project of Henan Province(No.2021SJGLX195Y)+1 种基金the Teaching Reform Research and Practice Project of Henan Undergraduate Universities(No.2022SYJXLX008)the Key Project on Research and Practice of Henan University Graduate Education and Teaching Reform(No.YJSJG2023XJ006)。
文摘The unsupervised multi-modal image translation is an emerging domain of computer vision whose goal is to transform an image from the source domain into many diverse styles in the target domain.However,the multi-generator mechanism is employed among the advanced approaches available to model different domain mappings,which results in inefficient training of neural networks and pattern collapse,leading to inefficient generation of image diversity.To address this issue,this paper introduces a multi-modal unsupervised image translation framework that uses a generator to perform multi-modal image translation.Specifically,firstly,the domain code is introduced in this paper to explicitly control the different generation tasks.Secondly,this paper brings in the squeeze-and-excitation(SE)mechanism and feature attention(FA)module.Finally,the model integrates multiple optimization objectives to ensure efficient multi-modal translation.This paper performs qualitative and quantitative experiments on multiple non-paired benchmark image translation datasets while demonstrating the benefits of the proposed method over existing technologies.Overall,experimental results have shown that the proposed method is versatile and scalable.
基金supported by the National Natural Science Foundation of China(No.62176034)the Science and Technology Research Program of Chongqing Municipal Education Commission(No.KJZD-M202300604)the Natural Science Foundation of Chongqing(Nos.cstc2021jcyj-msxmX0518,2023NSCQ-MSX1781).
文摘Automatic crack detection of cement pavement chiefly benefits from the rapid development of deep learning,with convolutional neural networks(CNN)playing an important role in this field.However,as the performance of crack detection in cement pavement improves,the depth and width of the network structure are significantly increased,which necessitates more computing power and storage space.This limitation hampers the practical implementation of crack detection models on various platforms,particularly portable devices like small mobile devices.To solve these problems,we propose a dual-encoder-based network architecture that focuses on extracting more comprehensive fracture feature information and combines cross-fusion modules and coordinated attention mechanisms formore efficient feature fusion.Firstly,we use small channel convolution to construct shallow feature extractionmodule(SFEM)to extract low-level feature information of cracks in cement pavement images,in order to obtainmore information about cracks in the shallowfeatures of images.In addition,we construct large kernel atrous convolution(LKAC)to enhance crack information,which incorporates coordination attention mechanism for non-crack information filtering,and large kernel atrous convolution with different cores,using different receptive fields to extract more detailed edge and context information.Finally,the three-stage feature map outputs from the shallow feature extraction module is cross-fused with the two-stage feature map outputs from the large kernel atrous convolution module,and the shallow feature and detailed edge feature are fully fused to obtain the final crack prediction map.We evaluate our method on three public crack datasets:DeepCrack,CFD,and Crack500.Experimental results on theDeepCrack dataset demonstrate the effectiveness of our proposed method compared to state-of-the-art crack detection methods,which achieves Precision(P)87.2%,Recall(R)87.7%,and F-score(F1)87.4%.Thanks to our lightweight crack detectionmodel,the parameter count of the model in real-world detection scenarios has been significantly reduced to less than 2M.This advancement also facilitates technical support for portable scene detection.
基金supported by the National Natural Science Foundation of China(No.12305344)the 2023 Anhui university research project of China(No.2023AH052179).
文摘The off situ accurate reconstruction of the core neutron field is an important step in realizing real-time reactor monitoring.The existing off situ reconstruction method of the neutron field is only applicable to cases wherein a single region changes at a specified location of the core.However,when the neutron field changes are complex,the accurate identification of the individual changed regions becomes challenging,which seriously affects the accuracy and stability of the neutron field recon-struction.Therefore,this study proposed a dual-task hybrid network architecture(DTHNet)for off situ reconstruction of the core neutron field,which trained the outermost assembly reconstruction task and the core reconstruction task jointly such that the former could assist the latter in the reconstruction of the core neutron field under core complex changes.Furthermore,to exploit the characteristics of the ex-core detection signals,this study designed a global-local feature upsampling module that efficiently distributed the ex-core detection signals to each reconstruction unit to improve the accuracy and stability of reconstruction.Reconstruction experiments were performed on the simulation datasets of the CLEAR-I reactor to verify the accuracy and stability of the proposed method.The results showed that when the location uncertainty of a single region did not exceed nine and the number of multiple changed regions did not exceed five.Further,the reconstructed ARD was within 2%,RD_(max)was maintained within 17.5%,and the number of RD≥10%was maintained within 10.Furthermore,when the noise interference of the ex-core detection signals was within±2%,although the average number of RD≥10%increased to 16,the average ARD was still within in 2%,and the average RD_(max)was within 22%.Collectively,these results show that,theoretically,the DTHNet can accurately and stably reconstruct most of the neutron field under certain complex core changes.
基金supported by the National Key Research and Development Program Topics(Grant No.2021YFB4000905)the National Natural Science Foundation of China(Grant Nos.62101432 and 62102309)in part by Shaanxi Natural Science Fundamental Research Program Project(No.2022JM-508).
文摘Low-light image enhancement methods have limitations in addressing issues such as color distortion,lack of vibrancy,and uneven light distribution and often require paired training data.To address these issues,we propose a two-stage unsupervised low-light image enhancement algorithm called Retinex and Exposure Fusion Network(RFNet),which can overcome the problems of over-enhancement of the high dynamic range and under-enhancement of the low dynamic range in existing enhancement algorithms.This algorithm can better manage the challenges brought about by complex environments in real-world scenarios by training with unpaired low-light images and regular-light images.In the first stage,we design a multi-scale feature extraction module based on Retinex theory,capable of extracting details and structural information at different scales to generate high-quality illumination and reflection images.In the second stage,an exposure image generator is designed through the camera response mechanism function to acquire exposure images containing more dark features,and the generated images are fused with the original input images to complete the low-light image enhancement.Experiments show the effectiveness and rationality of each module designed in this paper.And the method reconstructs the details of contrast and color distribution,outperforms the current state-of-the-art methods in both qualitative and quantitative metrics,and shows excellent performance in the real world.
基金This work was supported in part by the National Natural Science Foundation of China under Grants 62273272,62303375 and 61873277in part by the Key Research and Development Program of Shaanxi Province under Grant 2023-YBGY-243+2 种基金in part by the Natural Science Foundation of Shaanxi Province under Grants 2022JQ-606 and 2020-JQ758in part by the Research Plan of Department of Education of Shaanxi Province under Grant 21JK0752in part by the Youth Innovation Team of Shaanxi Universities.
文摘Social robot accounts controlled by artificial intelligence or humans are active in social networks,bringing negative impacts to network security and social life.Existing social robot detection methods based on graph neural networks suffer from the problem of many social network nodes and complex relationships,which makes it difficult to accurately describe the difference between the topological relations of nodes,resulting in low detection accuracy of social robots.This paper proposes a social robot detection method with the use of an improved neural network.First,social relationship subgraphs are constructed by leveraging the user’s social network to disentangle intricate social relationships effectively.Then,a linear modulated graph attention residual network model is devised to extract the node and network topology features of the social relation subgraph,thereby generating comprehensive social relation subgraph features,and the feature-wise linear modulation module of the model can better learn the differences between the nodes.Next,user text content and behavioral gene sequences are extracted to construct social behavioral features combined with the social relationship subgraph features.Finally,social robots can be more accurately identified by combining user behavioral and relationship features.By carrying out experimental studies based on the publicly available datasets TwiBot-20 and Cresci-15,the suggested method’s detection accuracies can achieve 86.73%and 97.86%,respectively.Compared with the existing mainstream approaches,the accuracy of the proposed method is 2.2%and 1.35%higher on the two datasets.The results show that the method proposed in this paper can effectively detect social robots and maintain a healthy ecological environment of social networks.
基金supported by the National Natural Science Foundation of China under Grant no.41975183,and Grant no.41875184 and Supported by a grant from State Key Laboratory of Resources and Environmental Information System.
文摘The numerous photos captured by low-price Internet of Things(IoT)sensors are frequently affected by meteorological factors,especially rainfall.It causes varying sizes of white streaks on the image,destroying the image texture and ruining the performance of the outdoor computer vision system.Existing methods utilise training with pairs of images,which is difficult to cover all scenes and leads to domain gaps.In addition,the network structures adopt deep learning to map rain images to rain-free images,failing to use prior knowledge effectively.To solve these problems,we introduce a single image derain model in edge computing that combines prior knowledge of rain patterns with the learning capability of the neural network.Specifically,the algorithm first uses Residue Channel Prior to filter out the rainfall textural features then it uses the Feature Fusion Module to fuse the original image with the background feature information.This results in a pre-processed image which is fed into Half Instance Net(HINet)to recover a high-quality rain-free image with a clear and accurate structure,and the model does not rely on any rainfall assumptions.Experimental results on synthetic and real-world datasets show that the average peak signal-to-noise ratio of the model decreases by 0.37 dB on the synthetic dataset and increases by 0.43 dB on the real-world dataset,demonstrating that a combined model reduces the gap between synthetic data and natural rain scenes,improves the generalization ability of the derain network,and alleviates the overfitting problem.
基金the Natural Science Foundation of Shandong Province,No.ZR2021MH213and in part by the Suzhou Science and Technology Bureau,No.SJC2021023.
文摘Sputum smear tests are critical for the diagnosis of respiratory diseases. Automatic segmentation of bacteria from spu-tum smear images is important for improving diagnostic efficiency. However, this remains a challenging task owing to the high interclass similarity among different categories of bacteria and the low contrast of the bacterial edges. To explore more levels of global pattern features to promote the distinguishing ability of bacterial categories and main-tain sufficient local fine-grained features to ensure accurate localization of ambiguous bacteria simultaneously, we propose a novel dual-branch deformable cross-attention fusion network (DB-DCAFN) for accurate bacterial segmen-tation. Specifically, we first designed a dual-branch encoder consisting of multiple convolution and transformer blocks in parallel to simultaneously extract multilevel local and global features. We then designed a sparse and deformable cross-attention module to capture the semantic dependencies between local and global features, which can bridge the semantic gap and fuse features effectively. Furthermore, we designed a feature assignment fusion module to enhance meaningful features using an adaptive feature weighting strategy to obtain more accurate segmentation. We conducted extensive experiments to evaluate the effectiveness of DB-DCAFN on a clinical dataset comprising three bacterial categories: Acinetobacter baumannii, Klebsiella pneumoniae, and Pseudomonas aeruginosa. The experi-mental results demonstrate that the proposed DB-DCAFN outperforms other state-of-the-art methods and is effective at segmenting bacteria from sputum smear images.
文摘To address the issues of small target miss detection,false positives in complex scenarios,and insufficient real-time performance in maglev train foreign object intrusion detection,this paper proposes a multi-module fusion improvement algorithm,YOLO11-FADA(Fusion of Augmented Features and Dynamic Attention),based on YOLO11.The model achieves collaborative optimization through three key modules:The Local Feature Augmentation Module(LFAM)enhances small target features and mitigates feature loss during down-sampling through multi-scale feature parallel extraction and attention fusion.The Dynamically Tuned Self-Attention(DTSA)module introduces learnable parameters to adjust attentionweights dynamically,and,in combinationwith convolution,expands the receptive field to suppress complex background interference.TheWeighted Convolution 2D(wConv2D)module optimizes convolution kernel weights using symmetric density functions and sparsification,reducing the parameter count by 30% while retaining core feature extraction capabilities.YOLO11-FADA achieves a mAP@0.5 of 0.907 on a custom maglev train foreign object dataset,improving by 3.0% over the baseline YOLO11 model.The model’s computational complexity is 7.3 GFLOPs,with a detection speed of 118.6 FPS,striking a balance between detection accuracy and real-time performance,thereby offering an efficient solution for rail transit safety monitoring.
基金This work was supported by the Hainan Province Science and Technology Special Fund(Grant No.ZDYF2025XDNY113)the Central Public-interest Scientific Institution Basal Research Fund(Grant No.1630032022007)+2 种基金the Special Fund for Hainan Excellent Team“Rubber Genetics and Breeding”(Grant No.20210203)the Hunan Provincial Natural Science Foundation Project(Grant No.2025JJ50385)in part by the National Natural Science Foundation of China(Grant No.62276276).
文摘Rubber products have become an important strategic resource in the global economy.However,individual rubber tree segmentation in plantation environments remains challenging due to canopy background interfer-ence and significant morphological variations among trees.To address these issues,we propose a high-precision segmentation network,TM-WSNet(Spatial Geometry Enhanced Hybrid Feature Extraction Module-Wavelet Grid Feature Fusion Encoder Segmentation Network).First,we introduce SGTramba,a hybrid feature extraction module combining Grouped Transformer and Mamba architectures,designed to reduce confusion between tree crown boundaries and surrounding vegetation or background elements.Second,we propose the WGMS encoder,which enhances structural feature recognition by applying wavelet-based spatial grid downsampling and mul-tiscale feature fusion,effectively handling variations in canopy shape and tree height.Third,a scale optimization algorithm(SCPO)is developed to adaptively search for the optimal learning rate,addressing uneven learning across different resolution scales.We evaluate TM-WSNet on a self-constructed dataset(RubberTree)and two public datasets(ShapeNetPart and ForestSemantic),where it consistently achieves high segmentation accuracy and robustness.In practical field tests,our method accurately predicts key rubber tree parameters—height,crown width,and diameter at breast height with coefficients of determination(R^(2))of 1.00,0.99,and 0.89,respectively.These results demonstrate TM-WSNet's strong potential for supporting precision rubber yield estimation and health monitoring in complex plantation environments.