Landfill leaks pose a serious threat to environmental health,risking the contamination of both groundwater and soil resources.Accurate investigation of these sites is essential for implementing effective prevention an...Landfill leaks pose a serious threat to environmental health,risking the contamination of both groundwater and soil resources.Accurate investigation of these sites is essential for implementing effective prevention and control measures.The self-potential(SP)stands out for its sensitivity to contamination plumes,offering a solution for monitoring and detecting the movement and seepage of subsurface pollutants.However,traditional SP inversion techniques heavily rely on precise subsurface resistivity information.In this study,we propose the Attention U-Net deep learning network for rapid SP inversion.By incorporating an attention mechanism,this algorithm effectively learns the relationship between array-style SP data and the location and extent of subsurface contaminated sources.We designed a synthetic landfill model with a heterogeneous resistivity structure to assess the performance of Attention U-Net deep learning network.Additionally,we conducted further validation using a laboratory model to assess its practical applicability.The results demonstrate that the algorithm is not solely dependent on resistivity information,enabling effective locating of the source distribution,even in models with intricate subsurface structures.Our work provides a promising tool for SP data processing,enhancing the applicability of this method in the field of near-subsurface environmental monitoring.展开更多
Recent advances in deep learning have significantly improved flood detection and segmentation from aerial and satellite imagery.However,conventional convolutional neural networks(CNNs)often struggle in complex flood s...Recent advances in deep learning have significantly improved flood detection and segmentation from aerial and satellite imagery.However,conventional convolutional neural networks(CNNs)often struggle in complex flood scenarios involving reflections,occlusions,or indistinct boundaries due to limited contextual modeling.To address these challenges,we propose a hybrid flood segmentation framework that integrates a Vision Transformer(ViT)encoder with a U-Net decoder,enhanced by a novel Flood-Aware Refinement Block(FARB).The FARB module improves boundary delineation and suppresses noise by combining residual smoothing with spatial-channel attention mechanisms.We evaluate our model on a UAV-acquired flood imagery dataset,demonstrating that the proposed ViTUNet+FARB architecture outperforms existing CNN and Transformer-based models in terms of accuracy and mean Intersection over Union(mIoU).Detailed ablation studies further validate the contribution of each component,confirming that the FARB design significantly enhances segmentation quality.To its better performance and computational efficiency,the proposed framework is well-suited for flood monitoring and disaster response applications,particularly in resource-constrained environments.展开更多
Underwater pipeline inspection plays a vital role in the proactive maintenance and management of critical marine infrastructure and subaquatic systems.However,the inspection of underwater pipelines presents a challeng...Underwater pipeline inspection plays a vital role in the proactive maintenance and management of critical marine infrastructure and subaquatic systems.However,the inspection of underwater pipelines presents a challenge due to factors such as light scattering,absorption,restricted visibility,and ambient noise.The advancement of deep learning has introduced powerful techniques for processing large amounts of unstructured and imperfect data collected from underwater environments.This study evaluated the efficacy of the You Only Look Once(YOLO)algorithm,a real-time object detection and localization model based on convolutional neural networks,in identifying and classifying various types of pipeline defects in underwater settings.YOLOv8,the latest evolution in the YOLO family,integrates advanced capabilities,such as anchor-free detection,a cross-stage partial network backbone for efficient feature extraction,and a feature pyramid network+path aggregation network neck for robust multi-scale object detection,which make it particularly well-suited for complex underwater environments.Due to the lack of suitable open-access datasets for underwater pipeline defects,a custom dataset was captured using a remotely operated vehicle in a controlled environment.This application has the following assets available for use.Extensive experimentation demonstrated that YOLOv8 X-Large consistently outperformed other models in terms of pipe defect detection and classification and achieved a strong balance between precision and recall in identifying pipeline cracks,rust,corners,defective welds,flanges,tapes,and holes.This research establishes the baseline performance of YOLOv8 for underwater defect detection and showcases its potential to enhance the reliability and efficiency of pipeline inspection tasks in challenging underwater environments.展开更多
As deep learning(DL)models are increasingly deployed in sensitive domains(e.g.,healthcare),concerns over privacy and security have intensified.Conventional penetration testing frameworks,such asOWASP and NIST,are effe...As deep learning(DL)models are increasingly deployed in sensitive domains(e.g.,healthcare),concerns over privacy and security have intensified.Conventional penetration testing frameworks,such asOWASP and NIST,are effective for traditional networks and applications but lack the capabilities to address DL-specific threats,such asmodel inversion,membership inference,and adversarial attacks.This review provides a comprehensive analysis of penetration testing for the privacy of DL models,examining the shortfalls of existing frameworks,tools,and testing methodologies.Through systematic evaluation of existing literature and empirical analysis,we identify three major contributions:(i)a critical assessment of traditional penetration testing frameworks’inadequacies when applied to DL-specific privacy vulnerabilities,(ii)a comprehensive evaluation of state-of-the-art privacy-preserving methods and their integration with penetration testing workflows,and(iii)the development of a structured framework that combines reconnaissance,threat modeling,exploitation,and post-exploitation phases specifically tailored for DL privacy assessment.Moreover,this review evaluates popular solutions such as IBMAdversarial Robustness Toolbox and TensorFlowPrivacy,alongside privacy-preserving techniques(e.g.,Differential Privacy,Homomorphic Encryption,and Federated Learning),which we systematically analyze through comparative studies of their effectiveness,computational overhead,and practical deployment constraints.While these techniques offer promising safeguards,their adoption is hindered by accuracy loss,performance overheads,and the rapid evolution of attack strategies.Our findings reveal that no single existing solution provides comprehensive protection,which leads us to propose a hybrid approach that strategically combines multiple privacy-preserving mechanisms.The findings of this survey underscore an urgent need for automated,regulationcompliant penetration testing frameworks specifically tailored to DL systems.We argue for hybrid privacy solutions that combinemultiple protectivemechanisms to ensure bothmodel accuracy and privacy.Building on our analysis,we present actionable recommendations for developing adaptive penetration testing strategies that incorporate automated vulnerability assessment,continuous monitoring,and regulatory compliance verification.展开更多
As urbanization continues to accelerate,the challenges associated with managing transportation in metropolitan areas become increasingly complex.The surge in population density contributes to traffic congestion,impact...As urbanization continues to accelerate,the challenges associated with managing transportation in metropolitan areas become increasingly complex.The surge in population density contributes to traffic congestion,impacting travel experiences and posing safety risks.Smart urban transportation management emerges as a strategic solution,conceptualized here as a multidimensional big data problem.The success of this strategy hinges on the effective collection of information from diverse,extensive,and heterogeneous data sources,necessitating the implementation of full⁃stack Information and Communication Technology(ICT)solutions.The main idea of the work is to investigate the current technologies of Intelligent Transportation Systems(ITS)and enhance the safety of urban transportation systems.Machine learning models,trained on historical data,can predict traffic congestion,allowing for the implementation of preventive measures.Deep learning architectures,with their ability to handle complex data representations,further refine traffic predictions,contributing to more accurate and dynamic transportation management.The background of this research underscores the challenges posed by traffic congestion in metropolitan areas and emphasizes the need for advanced technological solutions.By integrating GPS and GIS technologies with machine learning algorithms,this work aims to pay attention to the development of intelligent transportation systems that not only address current challenges but also pave the way for future advancements in urban transportation management.展开更多
With the proliferation of Internet of Things(IoT)devices,securing these interconnected systems against cyberattacks has become a critical challenge.Traditional security paradigms often fail to cope with the scale and ...With the proliferation of Internet of Things(IoT)devices,securing these interconnected systems against cyberattacks has become a critical challenge.Traditional security paradigms often fail to cope with the scale and diversity of IoT network traffic.This paper presents a comparative benchmark of classic machine learning(ML)and state-of-the-art deep learning(DL)algorithms for IoT intrusion detection.Our methodology employs a twophased approach:a preliminary pilot study using a custom-generated dataset to establish baselines,followed by a comprehensive evaluation on the large-scale CICIoTDataset2023.We benchmarked algorithms including Random Forest,XGBoost,CNN,and StackedLSTM.The results indicate that while top-performingmodels frombothcategories achieve over 99%classification accuracy,this metric masks a crucial performance trade-off.We demonstrate that treebased ML ensembles exhibit superior precision(91%)in identifying benign traffic,making them effective at reducing false positives.Conversely,DL models demonstrate superior recall(96%),making them better suited for minimizing the interruption of legitimate traffic.We conclude that the selection of an optimal model is not merely a matter of maximizing accuracy but is a strategic choice dependent on the specific security priority either minimizing false alarms or ensuring service availability.Thiswork provides a practical framework for deploying context-aware security solutions in diverse IoT environments.展开更多
Image registration within a solar photosphere sequence is crucial for observational solar physics studies requiring high spatial and temporal resolutions.Previously,we identified residual large-scale nonrigid distorti...Image registration within a solar photosphere sequence is crucial for observational solar physics studies requiring high spatial and temporal resolutions.Previously,we identified residual large-scale nonrigid distortions in high-resolution solar photosphere images from ground-based telescopes after high-resolution reconstruction.Because these distortions are not eliminated by conventional sequence correlation alignment,they can affect the analysis of small-scale activity in the solar photosphere.Here,we implemented an image registration model using deep learning(HCAM-Net)to solve the problem.Within an encoder-decoder framework,we introduced a hybrid attention mechanism to improve context information capture and extract accurate deformation fields.Analyzing solar photosphere images acquired by the New Vacuum Solar Telescope,we demonstrated that the proposed model effectively achieved highly accurate nonrigid image registration.Evaluation metrics and visualization results indicated that our model outperformed current state-of-the-art models,such as VoxelMorph and TransMorph,for nonrigid registration of solar photosphere images,with a structural similarity index measure of 0.965 and a coefficient of determination of 0.976.展开更多
This study aimed to integrate Monte Carlo(MC)simulation with deep learning(DL)-based denoising techniques to achieve fast and accurate prediction of high-quality electronic portal imaging device(EPID)transmission dose...This study aimed to integrate Monte Carlo(MC)simulation with deep learning(DL)-based denoising techniques to achieve fast and accurate prediction of high-quality electronic portal imaging device(EPID)transmission dose(TD)for patientspecific quality assurance(PSQA).A total of 100 lung cases were used to obtain the noisy EPID TD by the ARCHER MC code under four kinds of particle numbers(1×10^(6),1×10^(7),1×10^(8)and 1×10^(9)),and the original EPID TD was denoised by the SUNet neural network.The denoised EPID TD was assessed both qualitatively and quantitatively using the structural similarity(SSIM),peak signal-to-noise ratio(PSNR),and gamma passing rate(GPR)with respect to 1×10^(9)as a reference.The computation times for both the MC simulation and DL-based denoising were recorded.As the number of particles increased,both the quality of the noisy EPID TD and computation time increased significantly(1×10^(6):1.12 s,1×10^(7):1.72 s,1×10^(8):8.62 s,and 1×10^(9):73.89 s).In contrast,the DL-based denoising time remained at 0.13-0.16 s.The denoised EPID TD shows a smoother visual appearance and profile curves,but differences between 1×10^(6)and 1×10^(9)still remain.SSIM improves from 0.61 to 0.95 for 1×10^(6),0.70 to 0.96 for 1×10^(7),and 0.90 to 0.97 for 1×10^(8).PSNR increases by>20%for 1×10^(6)and 1×10^(7),and>10%for 1×10^(8).GPR improves from 48.47%to 89.10%for 1×10^(6),61.04%to 94.35%for 1×10^(7),and 91.88%to 99.55%for 1×10^(8).The method that combines MC simulation with DL-based denoising for EPID TD generation can accelerate TD prediction and maintain high accuracy,offering a promising solution for efficient PSQA.展开更多
The solar cycle(SC),a phenomenon caused by the quasi-periodic regular activities in the Sun,occurs approximately every 11 years.Intense solar activity can disrupt the Earth’s ionosphere,affecting communication and na...The solar cycle(SC),a phenomenon caused by the quasi-periodic regular activities in the Sun,occurs approximately every 11 years.Intense solar activity can disrupt the Earth’s ionosphere,affecting communication and navigation systems.Consequently,accurately predicting the intensity of the SC holds great significance,but predicting the SC involves a long-term time series,and many existing time series forecasting methods have fallen short in terms of accuracy and efficiency.The Time-series Dense Encoder model is a deep learning solution tailored for long time series prediction.Based on a multi-layer perceptron structure,it outperforms the best previously existing models in accuracy,while being efficiently trainable on general datasets.We propose a method based on this model for SC forecasting.Using a trained model,we predict the test set from SC 19 to SC 25 with an average mean absolute percentage error of 32.02,root mean square error of 30.3,mean absolute error of 23.32,and R^(2)(coefficient of determination)of 0.76,outperforming other deep learning models in terms of accuracy and training efficiency on sunspot number datasets.Subsequently,we use it to predict the peaks of SC 25 and SC 26.For SC 25,the peak time has ended,but a stronger peak is predicted for SC 26,of 199.3,within a range of 170.8-221.9,projected to occur during April 2034.展开更多
Wearable sensors integrated with deep learning techniques have the potential to revolutionize seamless human-machine interfaces for real-time health monitoring,clinical diagnosis,and robotic applications.Nevertheless,...Wearable sensors integrated with deep learning techniques have the potential to revolutionize seamless human-machine interfaces for real-time health monitoring,clinical diagnosis,and robotic applications.Nevertheless,it remains a critical challenge to simultaneously achieve desirable mechanical and electrical performance along with biocompatibility,adhesion,self-healing,and environmental robustness with excellent sensing metrics.Herein,we report a multifunctional,anti-freezing,selfadhesive,and self-healable organogel pressure sensor composed of cobalt nanoparticle encapsulated nitrogen-doped carbon nanotubes(CoN CNT)embedded in a polyvinyl alcohol-gelatin(PVA/GLE)matrix.Fabricated using a binary solvent system of water and ethylene glycol(EG),the CoN CNT/PVA/GLE organogel exhibits excellent flexibility,biocompatibility,and temperature tolerance with remarkable environmental stability.Electrochemical impedance spectroscopy confirms near-stable performance across a broad humidity range(40%-95%RH).Freeze-tolerant conductivity under sub-zero conditions(-20℃)is attributed to the synergistic role of CoN CNT and EG,preserving mobility and network integrity.The Co N CNT/PVA/GLE organogel sensor exhibits high sensitivity of 5.75 k Pa^(-1)in the detection range from 0 to 20 k Pa,ideal for subtle biomechanical motion detection.A smart human-machine interface for English letter recognition using deep learning achieved 98%accuracy.The organogel sensor utility was extended to detect human gestures like finger bending,wrist motion,and throat vibration during speech.展开更多
Test case prioritization and ranking play a crucial role in software testing by improving fault detection efficiency and ensuring software reliability.While prioritization selects the most relevant test cases for opti...Test case prioritization and ranking play a crucial role in software testing by improving fault detection efficiency and ensuring software reliability.While prioritization selects the most relevant test cases for optimal coverage,ranking further refines their execution order to detect critical faults earlier.This study investigates machine learning techniques to enhance both prioritization and ranking,contributing to more effective and efficient testing processes.We first employ advanced feature engineering alongside ensemble models,including Gradient Boosted,Support Vector Machines,Random Forests,and Naive Bayes classifiers to optimize test case prioritization,achieving an accuracy score of 0.98847 and significantly improving the Average Percentage of Fault Detection(APFD).Subsequently,we introduce a deep Q-learning framework combined with a Genetic Algorithm(GA)to refine test case ranking within priority levels.This approach achieves a rank accuracy of 0.9172,demonstrating robust performance despite the increasing computational demands of specialized variation operators.Our findings highlight the effectiveness of stacked ensemble learning and reinforcement learning in optimizing test case prioritization and ranking.This integrated approach improves testing efficiency,reduces late-stage defects,and improves overall software stability.The study provides valuable information for AI-driven testing frameworks,paving the way for more intelligent and adaptive software quality assurance methodologies.展开更多
Accurate streamflow prediction is essential for flood warning,reservoir operation,irrigation scheduling,hydropower planning,and sustainable water management,yet remains challenging due to the complexity of hydrologica...Accurate streamflow prediction is essential for flood warning,reservoir operation,irrigation scheduling,hydropower planning,and sustainable water management,yet remains challenging due to the complexity of hydrological processes.Although data-driven models often outperform conventional physics-based hydrological modelling approaches,their real-world deployment is limited by cost,infrastructure demands,and the interdisciplinary expertise required.To bridge this gap,this study developed QPred,a regional,lightweight,cost-effective,web-delivered application for daily streamflow forecasting.The study executed an end-to-end workflow,from field data acquisition to accessible web-based deployment for on-demand forecasting.High-resolution rainfall data were recorded with tippingbucket gauges and loggers,while river water depth in the Aglar and Paligaad watersheds was converted to discharge using site-specific rating curves,resulting in a daily dataset of precipitation,river water level and discharge.Four DL architectures were trained,including vanilla Long Short-Term Memory(LSTM),stacked LSTM,bidirectional LSTM,and Gated Recurrent Unit(GRU),and evaluated using Nash-Sutcliffe Efficiency(NSE),Coefficient of Determination(R2),Root-Mean-Square-Error-Standard-Deviation Ratio(RSR),and Percentage Bias(PBIAS)metrics.Performance was watershed-specific,as the vanilla LSTM demonstrated the best generalisation for the Aglar watershed(R2=0.88,NSE=0.82,RMSE=0.12 during validation),while the GRU achieved the highest validation accuracy in Paligaad(R2=0.88,NSE=0.88,RMSE=0.49).All models achieved satisfactory to excellent performance during calibration(R2>0.91,NSE>0.91 for both watersheds),demonstrating strong capability to capture streamflow dynamics.The highest performing models were selected and embedded into the QPred application.QPred was developed as a lightweight web pipeline,utilising Google Colab as the primary execution environment,Flask as the backend inference framework,Google Drive for artefact storage,andNgrok for secureHTTPS tunnelling.Auser-friendly front end utilises range sliders(bounded by observed minima and maxima)to gather inputs and provides discharge data along with metadata,thereby enhancing transparency.This work demonstrates that accurate,context-aware deep learningmodels can be delivered through low-cost,web-based platforms,providing a reproducible and scalable pipeline for hydrological applications in other watersheds and for practitioners.展开更多
Accurate detection of driver fatigue is essential for improving road safety.This study investigates the effectiveness of using multimodal physiological signals for fatigue detection while incorporating uncertainty qua...Accurate detection of driver fatigue is essential for improving road safety.This study investigates the effectiveness of using multimodal physiological signals for fatigue detection while incorporating uncertainty quantification to enhance the reliability of predictions.Physiological signals,including Electrocardiogram(ECG),Galvanic Skin Response(GSR),and Electroencephalogram(EEG),were transformed into image representations and analyzed using pretrained deep neu-ral networks.The extracted features were classified through a feedforward neural network,and prediction reliability was assessed using uncertainty quantification techniques such as Monte Carlo Dropout(MCD),model ensembles,and combined approaches.Evaluation metrics included standard measures(sensitivity,specificity,precision,and accuracy)along with uncertainty-aware metrics such as uncertainty sensitivity and uncertainty precision.Across all evaluations,ECG-based models consistently demonstrated strong performance.The findings indicate that combining multimodal physi-ological signals,Transfer Learning(TL),and uncertainty quantification can significantly improve both the accuracy and trustworthiness of fatigue detection systems.This approach supports the development of more reliable driver assistance technologies aimed at preventing fatigue-related accidents.展开更多
Although machine learning models have achieved high enough accuracy in predicting shield position deviations,their“black box”nature makes the prediction mechanisms and decision-making processes opaque,leading to wea...Although machine learning models have achieved high enough accuracy in predicting shield position deviations,their“black box”nature makes the prediction mechanisms and decision-making processes opaque,leading to weaker explanations and practicability.This study introduces a novel explainable deep learning framework comprising the Informer model with enhanced attention mechanisms(EAMInfor)and deep learning important features(DeepLIFT),aimed at improving the prediction accuracy of shield position deviations and providing interpretability for predictive results.The EAMInfor model attempts to integrate channel attention,spatial attention,and simple attention modules to improve the Informer model's performance.The framework is tested with the four different geological conditions datasets generated from the Xiamen metro line 3,China.Results show that the EAMInfor model outperforms the traditional Informer and comparison models.The analysis with the DeepLIFT method indicates that the push thrust of push cylinder and the earth chamber pressure are the most significant features,while the stroke length of the push cylinder demonstrated lower importance.Furthermore,the variation trends in the significance of data points within input sequences exhibit substantial differences between single and composite strata.This framework not only improves predictive accuracy but also strengthens the credibility and reliability of the results.展开更多
At present,energy consumption is one of the main bottlenecks in autonomous mobile robot development.To address the challenge of high energy consumption in path planning for autonomous mobile robots navigating unknown ...At present,energy consumption is one of the main bottlenecks in autonomous mobile robot development.To address the challenge of high energy consumption in path planning for autonomous mobile robots navigating unknown and complex environments,this paper proposes an Attention-Enhanced Dueling Deep Q-Network(ADDueling DQN),which integrates a multi-head attention mechanism and a prioritized experience replay strategy into a Dueling-DQN reinforcement learning framework.A multi-objective reward function,centered on energy efficiency,is designed to comprehensively consider path length,terrain slope,motion smoothness,and obstacle avoidance,enabling optimal low-energy trajectory generation in 3D space from the source.The incorporation of a multihead attention mechanism allows the model to dynamically focus on energy-critical state features—such as slope gradients and obstacle density—thereby significantly improving its ability to recognize and avoid energy-intensive paths.Additionally,the prioritized experience replay mechanism accelerates learning from key decision-making experiences,suppressing inefficient exploration and guiding the policy toward low-energy solutions more rapidly.The effectiveness of the proposed path planning algorithm is validated through simulation experiments conducted in multiple off-road scenarios.Results demonstrate that AD-Dueling DQN consistently achieves the lowest average energy consumption across all tested environments.Moreover,the proposed method exhibits faster convergence and greater training stability compared to baseline algorithms,highlighting its global optimization capability under energy-aware objectives in complex terrains.This study offers an efficient and scalable intelligent control strategy for the development of energy-conscious autonomous navigation systems.展开更多
The stator of the maglev track plays a crucial role in the operation of the maglev system.Currently,the efficiency of maglev track inspection is limited by several factors,including the large span of elevated structur...The stator of the maglev track plays a crucial role in the operation of the maglev system.Currently,the efficiency of maglev track inspection is limited by several factors,including the large span of elevated structures,manual visual inspection,short inspection window times,and limited GPS positioning accuracy.To address these issues,this paper proposes a deep learning-based method for detecting and locating stator surface damage.This study establishes a maglev track stator surface image dataset,trains different object detection models,and compares their performance.Ultimately,YOLO and ByteTrack object tracking algorithms were chosen as the basic framework and enhanced to achieve automatic identification of high-speed maglev track stator surface damage images and track and count stator surface localization feature images.By matching the identified damaged images with their corresponding stator segment and beam segment sequence numbers,the location of the damage is pinpointed to the corresponding stator segment,enabling rapid and accurate identification and localization of complex damage to the maglev track stator surface.展开更多
Intelligent inspection of transmission lines enables efficient automated fault detection by integrating artificial intelligence,robotics,and other related technologies.It plays a key role in ensuring power grid safety...Intelligent inspection of transmission lines enables efficient automated fault detection by integrating artificial intelligence,robotics,and other related technologies.It plays a key role in ensuring power grid safety,reducing operation and maintenance costs,driving the digital transformation of the power industry,and facilitating the achievement of the dual-carbon goals.This review focuses on vision-based power line inspection,with deep learning as the core perspective to systematically analyze the latest research advancements in this field.Firstly,at the technical foundation level,it elaborates on deep learning algorithms for intelligent transmission line inspection based on image perception,covering object detection algorithms,semantic segmentation algorithms,and other relevant methodologies.Secondly,in application practice,it summarizes deep learning-based intelligent inspection applications across six dimensions—including detection of power insulators and their defects,transmission tower detection,power line feature extraction,metal fitting and defect detection,thermal fault diagnosis of power components,and safety hazard detection in power scenarios,and further lists relevant public datasets.Finally,in response to current challenges,it identifies five key future research directions,such as the deep integration of multiple learning paradigms,multi-modal data fusion,collaborative application of large and small models,cloud-edge-end collaborative integration,and multi-agent cluster control.This paper reviews and analyzes numerous deep learning-based intelligent detectionmethods for aerial images,comprehensively explores the application of deep learning in Unmanned Aerial Vehicle(UAV)inspection scenarios,and thus provides valuable theoretical and practical references for scholars engaged in smart grid automated inspection research.展开更多
Nondestructive measurement technology of phenotype can provide substantial phenotypic data support for applications such as seedling breeding,management,and quality testing.The current method of measuring seedling phe...Nondestructive measurement technology of phenotype can provide substantial phenotypic data support for applications such as seedling breeding,management,and quality testing.The current method of measuring seedling phenotypes mainly relies on manual measurement which is inefficient,subjective and destroys samples.Therefore,the paper proposes a nondestructive measurement method for the canopy phenotype of the watermelon plug seedlings based on deep learning.The Azure Kinect was used to shoot canopy color images,depth images,and RGB-D images of the watermelon plug seedlings.The Mask-RCNN network was used to classify,segment,and count the canopy leaves of the watermelon plug seedlings.To reduce the error of leaf area measurement caused by mutual occlusion of leaves,the leaves were repaired by CycleGAN,and the depth images were restored by image processing.Then,the Delaunay triangulation was adopted to measure the leaf area in the leaf point cloud.The YOLOX target detection network was used to identify the growing point position of each seedling on the plug tray.Then the depth differences between the growing point and the upper surface of the plug tray were calculated to obtain plant height.The experiment results show that the nondestructive measurement algorithm proposed in this paper achieves good measurement performance for the watermelon plug seedlings from the 1 true-leaf to 3 true-leaf stages.The average relative error of measurement is 2.33%for the number of true leaves,4.59%for the number of cotyledons,8.37%for the leaf area,and 3.27%for the plant height.The experiment results demonstrate that the proposed algorithm in this paper provides an effective solution for the nondestructive measurement of the canopy phenotype of the plug seedlings.展开更多
Deep learning attentionmechanisms have achieved remarkable progress in computer vision,but still face limitations when handling images with ambiguous boundaries and uncertain feature representations.Conventional atten...Deep learning attentionmechanisms have achieved remarkable progress in computer vision,but still face limitations when handling images with ambiguous boundaries and uncertain feature representations.Conventional attention modules such as SE-Net,CBAM,ECA-Net,and CA adopt a deterministic paradigm,assigning fixed scalar weights to features without modeling ambiguity or confidence.To overcome these limitations,this paper proposes the Fuzzy Attention Network Layer(FANL),which integrates intuitionistic fuzzy set theory with convolutional neural networks to explicitly represent feature uncertainty through membership(μ),non-membership(ν),and hesitation(π)degrees.FANLconsists of four coremodules:(1)feature dimensionality reduction via global pooling,(2)fuzzymodeling using learnable clustering centers,(3)adaptive attention generation through weighted fusion of fuzzy components,and(4)feature refinement through residual connections.A cross-layer guidance mechanism is further introduced to enhance hierarchical feature propagation,allowing high-level semantic features to incorporate fine-grained texture information from shallow layers.Comprehensive experiments on three benchmark datasets—PathMNIST-30000,full PathMNIST,and Blood MNIST—demonstrate the effectiveness and generalizability of FANL.The model achieves 84.41±0.56%accuracy and a 1.69%improvement over the baseline CNN while maintaining lightweight computational complexity.Ablation studies show that removing any component causes a 1.7%–2.0%performance drop,validating the synergistic contribution of each module.Furthermore,FANL provides superior uncertainty calibration(ECE=0.0452)and interpretable selective prediction under uncertainty.Overall,FANL presents an efficient and uncertaintyaware attention framework that improves both accuracy and reliability,offering a promising direction for robust visual recognition under ambiguous or noisy conditions.展开更多
In recent years,the use of deep learning to replace traditional numerical methods for electromagnetic propagation has shown tremendous potential in the rapid design of photonic devices.However,most research on deep le...In recent years,the use of deep learning to replace traditional numerical methods for electromagnetic propagation has shown tremendous potential in the rapid design of photonic devices.However,most research on deep learning has focused on single-layer grating couplers,and the accuracy of multi-layer grating couplers has not yet reached a high level.This paper proposes and demonstrates a novel deep learning network-assisted strategy for inverse design.The network model is based on a multi-layer perceptron(MLP)and incorporates convolutional neural networks(CNNs)and transformers.Through the stacking of multiple layers,it achieves a high-precision design for both multi-layer and single-layer raster couplers with various functionalities.The deep learning network exhibits exceptionally high predictive accuracy,with an average absolute error across the full wavelength range of 1300–1700 nm being only 0.17%,and an even lower predictive absolute error below 0.09%at the specific wavelength of 1550 nm.By combining the deep learning network with the genetic algorithm,we can efficiently design grating couplers that perform different functions.Simulation results indicate that the designed single-wavelength grating couplers achieve coupling efficiencies exceeding 80%at central wavelengths of 1550 nm and 1310 nm.The performance of designed dual-wavelength and broadband grating couplers also reaches high industry standards.Furthermore,the network structure and inverse design method are highly scalable and can be applied not only to multi-layer grating couplers but also directly to the prediction and design of single-layer grating couplers,providing a new perspective for the innovative development of photonic devices.展开更多
基金Projects(42174170,41874145,72088101)supported by the National Natural Science Foundation of ChinaProject(CX20200228)supported by the Hunan Provincial Innovation Foundation for Postgraduate,China。
文摘Landfill leaks pose a serious threat to environmental health,risking the contamination of both groundwater and soil resources.Accurate investigation of these sites is essential for implementing effective prevention and control measures.The self-potential(SP)stands out for its sensitivity to contamination plumes,offering a solution for monitoring and detecting the movement and seepage of subsurface pollutants.However,traditional SP inversion techniques heavily rely on precise subsurface resistivity information.In this study,we propose the Attention U-Net deep learning network for rapid SP inversion.By incorporating an attention mechanism,this algorithm effectively learns the relationship between array-style SP data and the location and extent of subsurface contaminated sources.We designed a synthetic landfill model with a heterogeneous resistivity structure to assess the performance of Attention U-Net deep learning network.Additionally,we conducted further validation using a laboratory model to assess its practical applicability.The results demonstrate that the algorithm is not solely dependent on resistivity information,enabling effective locating of the source distribution,even in models with intricate subsurface structures.Our work provides a promising tool for SP data processing,enhancing the applicability of this method in the field of near-subsurface environmental monitoring.
基金supported by the National Research Foundation of Korea(NRF)grant funded by theKorea government(MSIT)(No.RS-2024-00405278)partially supported by the Jeju Industry-University Convergence District Project for Promoting Industry-Campus Cooperationfunded by the Ministry of Trade,Industry and Energy(MOTIE,Korea)[Project Name:Jeju Industry-University Convergence District Project for Promoting Industry-Campus Cooperation/Project Number:P0029950].
文摘Recent advances in deep learning have significantly improved flood detection and segmentation from aerial and satellite imagery.However,conventional convolutional neural networks(CNNs)often struggle in complex flood scenarios involving reflections,occlusions,or indistinct boundaries due to limited contextual modeling.To address these challenges,we propose a hybrid flood segmentation framework that integrates a Vision Transformer(ViT)encoder with a U-Net decoder,enhanced by a novel Flood-Aware Refinement Block(FARB).The FARB module improves boundary delineation and suppresses noise by combining residual smoothing with spatial-channel attention mechanisms.We evaluate our model on a UAV-acquired flood imagery dataset,demonstrating that the proposed ViTUNet+FARB architecture outperforms existing CNN and Transformer-based models in terms of accuracy and mean Intersection over Union(mIoU).Detailed ablation studies further validate the contribution of each component,confirming that the FARB design significantly enhances segmentation quality.To its better performance and computational efficiency,the proposed framework is well-suited for flood monitoring and disaster response applications,particularly in resource-constrained environments.
文摘Underwater pipeline inspection plays a vital role in the proactive maintenance and management of critical marine infrastructure and subaquatic systems.However,the inspection of underwater pipelines presents a challenge due to factors such as light scattering,absorption,restricted visibility,and ambient noise.The advancement of deep learning has introduced powerful techniques for processing large amounts of unstructured and imperfect data collected from underwater environments.This study evaluated the efficacy of the You Only Look Once(YOLO)algorithm,a real-time object detection and localization model based on convolutional neural networks,in identifying and classifying various types of pipeline defects in underwater settings.YOLOv8,the latest evolution in the YOLO family,integrates advanced capabilities,such as anchor-free detection,a cross-stage partial network backbone for efficient feature extraction,and a feature pyramid network+path aggregation network neck for robust multi-scale object detection,which make it particularly well-suited for complex underwater environments.Due to the lack of suitable open-access datasets for underwater pipeline defects,a custom dataset was captured using a remotely operated vehicle in a controlled environment.This application has the following assets available for use.Extensive experimentation demonstrated that YOLOv8 X-Large consistently outperformed other models in terms of pipe defect detection and classification and achieved a strong balance between precision and recall in identifying pipeline cracks,rust,corners,defective welds,flanges,tapes,and holes.This research establishes the baseline performance of YOLOv8 for underwater defect detection and showcases its potential to enhance the reliability and efficiency of pipeline inspection tasks in challenging underwater environments.
基金supported in part by the Tianjin Natural Science Foundation Project(24JCZDJC01000)the Fundamental Research Funds for the Central Universities of China(No.3122025091).
文摘As deep learning(DL)models are increasingly deployed in sensitive domains(e.g.,healthcare),concerns over privacy and security have intensified.Conventional penetration testing frameworks,such asOWASP and NIST,are effective for traditional networks and applications but lack the capabilities to address DL-specific threats,such asmodel inversion,membership inference,and adversarial attacks.This review provides a comprehensive analysis of penetration testing for the privacy of DL models,examining the shortfalls of existing frameworks,tools,and testing methodologies.Through systematic evaluation of existing literature and empirical analysis,we identify three major contributions:(i)a critical assessment of traditional penetration testing frameworks’inadequacies when applied to DL-specific privacy vulnerabilities,(ii)a comprehensive evaluation of state-of-the-art privacy-preserving methods and their integration with penetration testing workflows,and(iii)the development of a structured framework that combines reconnaissance,threat modeling,exploitation,and post-exploitation phases specifically tailored for DL privacy assessment.Moreover,this review evaluates popular solutions such as IBMAdversarial Robustness Toolbox and TensorFlowPrivacy,alongside privacy-preserving techniques(e.g.,Differential Privacy,Homomorphic Encryption,and Federated Learning),which we systematically analyze through comparative studies of their effectiveness,computational overhead,and practical deployment constraints.While these techniques offer promising safeguards,their adoption is hindered by accuracy loss,performance overheads,and the rapid evolution of attack strategies.Our findings reveal that no single existing solution provides comprehensive protection,which leads us to propose a hybrid approach that strategically combines multiple privacy-preserving mechanisms.The findings of this survey underscore an urgent need for automated,regulationcompliant penetration testing frameworks specifically tailored to DL systems.We argue for hybrid privacy solutions that combinemultiple protectivemechanisms to ensure bothmodel accuracy and privacy.Building on our analysis,we present actionable recommendations for developing adaptive penetration testing strategies that incorporate automated vulnerability assessment,continuous monitoring,and regulatory compliance verification.
文摘As urbanization continues to accelerate,the challenges associated with managing transportation in metropolitan areas become increasingly complex.The surge in population density contributes to traffic congestion,impacting travel experiences and posing safety risks.Smart urban transportation management emerges as a strategic solution,conceptualized here as a multidimensional big data problem.The success of this strategy hinges on the effective collection of information from diverse,extensive,and heterogeneous data sources,necessitating the implementation of full⁃stack Information and Communication Technology(ICT)solutions.The main idea of the work is to investigate the current technologies of Intelligent Transportation Systems(ITS)and enhance the safety of urban transportation systems.Machine learning models,trained on historical data,can predict traffic congestion,allowing for the implementation of preventive measures.Deep learning architectures,with their ability to handle complex data representations,further refine traffic predictions,contributing to more accurate and dynamic transportation management.The background of this research underscores the challenges posed by traffic congestion in metropolitan areas and emphasizes the need for advanced technological solutions.By integrating GPS and GIS technologies with machine learning algorithms,this work aims to pay attention to the development of intelligent transportation systems that not only address current challenges but also pave the way for future advancements in urban transportation management.
文摘With the proliferation of Internet of Things(IoT)devices,securing these interconnected systems against cyberattacks has become a critical challenge.Traditional security paradigms often fail to cope with the scale and diversity of IoT network traffic.This paper presents a comparative benchmark of classic machine learning(ML)and state-of-the-art deep learning(DL)algorithms for IoT intrusion detection.Our methodology employs a twophased approach:a preliminary pilot study using a custom-generated dataset to establish baselines,followed by a comprehensive evaluation on the large-scale CICIoTDataset2023.We benchmarked algorithms including Random Forest,XGBoost,CNN,and StackedLSTM.The results indicate that while top-performingmodels frombothcategories achieve over 99%classification accuracy,this metric masks a crucial performance trade-off.We demonstrate that treebased ML ensembles exhibit superior precision(91%)in identifying benign traffic,making them effective at reducing false positives.Conversely,DL models demonstrate superior recall(96%),making them better suited for minimizing the interruption of legitimate traffic.We conclude that the selection of an optimal model is not merely a matter of maximizing accuracy but is a strategic choice dependent on the specific security priority either minimizing false alarms or ensuring service availability.Thiswork provides a practical framework for deploying context-aware security solutions in diverse IoT environments.
基金funded by the Strategic Priority Research Program of the Chinese Academy of Sciences(XDB0560000)the National Natural Science Foundation of China(12473054)+1 种基金the Basic Research on Fund Projects in Yunnan Province(2019FA001)the Yunnan Province Science Foundation Project(202105AC160085).
文摘Image registration within a solar photosphere sequence is crucial for observational solar physics studies requiring high spatial and temporal resolutions.Previously,we identified residual large-scale nonrigid distortions in high-resolution solar photosphere images from ground-based telescopes after high-resolution reconstruction.Because these distortions are not eliminated by conventional sequence correlation alignment,they can affect the analysis of small-scale activity in the solar photosphere.Here,we implemented an image registration model using deep learning(HCAM-Net)to solve the problem.Within an encoder-decoder framework,we introduced a hybrid attention mechanism to improve context information capture and extract accurate deformation fields.Analyzing solar photosphere images acquired by the New Vacuum Solar Telescope,we demonstrated that the proposed model effectively achieved highly accurate nonrigid image registration.Evaluation metrics and visualization results indicated that our model outperformed current state-of-the-art models,such as VoxelMorph and TransMorph,for nonrigid registration of solar photosphere images,with a structural similarity index measure of 0.965 and a coefficient of determination of 0.976.
基金supported by National Key R&D Program of China(No.2022YFC2404604)Chongqing Research Institution Performance Incentive Guidance Special Project(No.CSTB2023JXJL-YFX0080)Chongqing Medical Scientific Research Project(Joint project of Chongqing Health Commission and Science and Technology Bureau)(No.2022DBXM005)。
文摘This study aimed to integrate Monte Carlo(MC)simulation with deep learning(DL)-based denoising techniques to achieve fast and accurate prediction of high-quality electronic portal imaging device(EPID)transmission dose(TD)for patientspecific quality assurance(PSQA).A total of 100 lung cases were used to obtain the noisy EPID TD by the ARCHER MC code under four kinds of particle numbers(1×10^(6),1×10^(7),1×10^(8)and 1×10^(9)),and the original EPID TD was denoised by the SUNet neural network.The denoised EPID TD was assessed both qualitatively and quantitatively using the structural similarity(SSIM),peak signal-to-noise ratio(PSNR),and gamma passing rate(GPR)with respect to 1×10^(9)as a reference.The computation times for both the MC simulation and DL-based denoising were recorded.As the number of particles increased,both the quality of the noisy EPID TD and computation time increased significantly(1×10^(6):1.12 s,1×10^(7):1.72 s,1×10^(8):8.62 s,and 1×10^(9):73.89 s).In contrast,the DL-based denoising time remained at 0.13-0.16 s.The denoised EPID TD shows a smoother visual appearance and profile curves,but differences between 1×10^(6)and 1×10^(9)still remain.SSIM improves from 0.61 to 0.95 for 1×10^(6),0.70 to 0.96 for 1×10^(7),and 0.90 to 0.97 for 1×10^(8).PSNR increases by>20%for 1×10^(6)and 1×10^(7),and>10%for 1×10^(8).GPR improves from 48.47%to 89.10%for 1×10^(6),61.04%to 94.35%for 1×10^(7),and 91.88%to 99.55%for 1×10^(8).The method that combines MC simulation with DL-based denoising for EPID TD generation can accelerate TD prediction and maintain high accuracy,offering a promising solution for efficient PSQA.
基金supported by the Academic Research Projects of Beijing Union University(ZK20202204)the National Natural Science Foundation of China(12250005,12073040,12273059,11973056,12003051,11573037,12073041,11427901,11572005,11611530679 and 12473052)+1 种基金the Strategic Priority Research Program of the China Academy of Sciences(XDB0560000,XDA15052200,XDB09040200,XDA15010700,XDB0560301,and XDA15320102)the Chinese Meridian Project(CMP).
文摘The solar cycle(SC),a phenomenon caused by the quasi-periodic regular activities in the Sun,occurs approximately every 11 years.Intense solar activity can disrupt the Earth’s ionosphere,affecting communication and navigation systems.Consequently,accurately predicting the intensity of the SC holds great significance,but predicting the SC involves a long-term time series,and many existing time series forecasting methods have fallen short in terms of accuracy and efficiency.The Time-series Dense Encoder model is a deep learning solution tailored for long time series prediction.Based on a multi-layer perceptron structure,it outperforms the best previously existing models in accuracy,while being efficiently trainable on general datasets.We propose a method based on this model for SC forecasting.Using a trained model,we predict the test set from SC 19 to SC 25 with an average mean absolute percentage error of 32.02,root mean square error of 30.3,mean absolute error of 23.32,and R^(2)(coefficient of determination)of 0.76,outperforming other deep learning models in terms of accuracy and training efficiency on sunspot number datasets.Subsequently,we use it to predict the peaks of SC 25 and SC 26.For SC 25,the peak time has ended,but a stronger peak is predicted for SC 26,of 199.3,within a range of 170.8-221.9,projected to occur during April 2034.
基金supported by the Basic Science Research Program(2023R1A2C3004336,RS-202300243807)&Regional Leading Research Center(RS-202400405278)through the National Research Foundation of Korea(NRF)grant funded by the Korea Government(MSIT)。
文摘Wearable sensors integrated with deep learning techniques have the potential to revolutionize seamless human-machine interfaces for real-time health monitoring,clinical diagnosis,and robotic applications.Nevertheless,it remains a critical challenge to simultaneously achieve desirable mechanical and electrical performance along with biocompatibility,adhesion,self-healing,and environmental robustness with excellent sensing metrics.Herein,we report a multifunctional,anti-freezing,selfadhesive,and self-healable organogel pressure sensor composed of cobalt nanoparticle encapsulated nitrogen-doped carbon nanotubes(CoN CNT)embedded in a polyvinyl alcohol-gelatin(PVA/GLE)matrix.Fabricated using a binary solvent system of water and ethylene glycol(EG),the CoN CNT/PVA/GLE organogel exhibits excellent flexibility,biocompatibility,and temperature tolerance with remarkable environmental stability.Electrochemical impedance spectroscopy confirms near-stable performance across a broad humidity range(40%-95%RH).Freeze-tolerant conductivity under sub-zero conditions(-20℃)is attributed to the synergistic role of CoN CNT and EG,preserving mobility and network integrity.The Co N CNT/PVA/GLE organogel sensor exhibits high sensitivity of 5.75 k Pa^(-1)in the detection range from 0 to 20 k Pa,ideal for subtle biomechanical motion detection.A smart human-machine interface for English letter recognition using deep learning achieved 98%accuracy.The organogel sensor utility was extended to detect human gestures like finger bending,wrist motion,and throat vibration during speech.
文摘Test case prioritization and ranking play a crucial role in software testing by improving fault detection efficiency and ensuring software reliability.While prioritization selects the most relevant test cases for optimal coverage,ranking further refines their execution order to detect critical faults earlier.This study investigates machine learning techniques to enhance both prioritization and ranking,contributing to more effective and efficient testing processes.We first employ advanced feature engineering alongside ensemble models,including Gradient Boosted,Support Vector Machines,Random Forests,and Naive Bayes classifiers to optimize test case prioritization,achieving an accuracy score of 0.98847 and significantly improving the Average Percentage of Fault Detection(APFD).Subsequently,we introduce a deep Q-learning framework combined with a Genetic Algorithm(GA)to refine test case ranking within priority levels.This approach achieves a rank accuracy of 0.9172,demonstrating robust performance despite the increasing computational demands of specialized variation operators.Our findings highlight the effectiveness of stacked ensemble learning and reinforcement learning in optimizing test case prioritization and ranking.This integrated approach improves testing efficiency,reduces late-stage defects,and improves overall software stability.The study provides valuable information for AI-driven testing frameworks,paving the way for more intelligent and adaptive software quality assurance methodologies.
文摘Accurate streamflow prediction is essential for flood warning,reservoir operation,irrigation scheduling,hydropower planning,and sustainable water management,yet remains challenging due to the complexity of hydrological processes.Although data-driven models often outperform conventional physics-based hydrological modelling approaches,their real-world deployment is limited by cost,infrastructure demands,and the interdisciplinary expertise required.To bridge this gap,this study developed QPred,a regional,lightweight,cost-effective,web-delivered application for daily streamflow forecasting.The study executed an end-to-end workflow,from field data acquisition to accessible web-based deployment for on-demand forecasting.High-resolution rainfall data were recorded with tippingbucket gauges and loggers,while river water depth in the Aglar and Paligaad watersheds was converted to discharge using site-specific rating curves,resulting in a daily dataset of precipitation,river water level and discharge.Four DL architectures were trained,including vanilla Long Short-Term Memory(LSTM),stacked LSTM,bidirectional LSTM,and Gated Recurrent Unit(GRU),and evaluated using Nash-Sutcliffe Efficiency(NSE),Coefficient of Determination(R2),Root-Mean-Square-Error-Standard-Deviation Ratio(RSR),and Percentage Bias(PBIAS)metrics.Performance was watershed-specific,as the vanilla LSTM demonstrated the best generalisation for the Aglar watershed(R2=0.88,NSE=0.82,RMSE=0.12 during validation),while the GRU achieved the highest validation accuracy in Paligaad(R2=0.88,NSE=0.88,RMSE=0.49).All models achieved satisfactory to excellent performance during calibration(R2>0.91,NSE>0.91 for both watersheds),demonstrating strong capability to capture streamflow dynamics.The highest performing models were selected and embedded into the QPred application.QPred was developed as a lightweight web pipeline,utilising Google Colab as the primary execution environment,Flask as the backend inference framework,Google Drive for artefact storage,andNgrok for secureHTTPS tunnelling.Auser-friendly front end utilises range sliders(bounded by observed minima and maxima)to gather inputs and provides discharge data along with metadata,thereby enhancing transparency.This work demonstrates that accurate,context-aware deep learningmodels can be delivered through low-cost,web-based platforms,providing a reproducible and scalable pipeline for hydrological applications in other watersheds and for practitioners.
基金the Australian Research Council Discovery Projects funding scheme(DP190102181,DP210101465).
文摘Accurate detection of driver fatigue is essential for improving road safety.This study investigates the effectiveness of using multimodal physiological signals for fatigue detection while incorporating uncertainty quantification to enhance the reliability of predictions.Physiological signals,including Electrocardiogram(ECG),Galvanic Skin Response(GSR),and Electroencephalogram(EEG),were transformed into image representations and analyzed using pretrained deep neu-ral networks.The extracted features were classified through a feedforward neural network,and prediction reliability was assessed using uncertainty quantification techniques such as Monte Carlo Dropout(MCD),model ensembles,and combined approaches.Evaluation metrics included standard measures(sensitivity,specificity,precision,and accuracy)along with uncertainty-aware metrics such as uncertainty sensitivity and uncertainty precision.Across all evaluations,ECG-based models consistently demonstrated strong performance.The findings indicate that combining multimodal physi-ological signals,Transfer Learning(TL),and uncertainty quantification can significantly improve both the accuracy and trustworthiness of fatigue detection systems.This approach supports the development of more reliable driver assistance technologies aimed at preventing fatigue-related accidents.
基金supported by the National Natural Science Foundation of China(Grant Nos.52378392,52408356)the Foal Eagle Program Youth Top-notch Talent Project of Fujian Province,China(Grant No.00387088).
文摘Although machine learning models have achieved high enough accuracy in predicting shield position deviations,their“black box”nature makes the prediction mechanisms and decision-making processes opaque,leading to weaker explanations and practicability.This study introduces a novel explainable deep learning framework comprising the Informer model with enhanced attention mechanisms(EAMInfor)and deep learning important features(DeepLIFT),aimed at improving the prediction accuracy of shield position deviations and providing interpretability for predictive results.The EAMInfor model attempts to integrate channel attention,spatial attention,and simple attention modules to improve the Informer model's performance.The framework is tested with the four different geological conditions datasets generated from the Xiamen metro line 3,China.Results show that the EAMInfor model outperforms the traditional Informer and comparison models.The analysis with the DeepLIFT method indicates that the push thrust of push cylinder and the earth chamber pressure are the most significant features,while the stroke length of the push cylinder demonstrated lower importance.Furthermore,the variation trends in the significance of data points within input sequences exhibit substantial differences between single and composite strata.This framework not only improves predictive accuracy but also strengthens the credibility and reliability of the results.
文摘At present,energy consumption is one of the main bottlenecks in autonomous mobile robot development.To address the challenge of high energy consumption in path planning for autonomous mobile robots navigating unknown and complex environments,this paper proposes an Attention-Enhanced Dueling Deep Q-Network(ADDueling DQN),which integrates a multi-head attention mechanism and a prioritized experience replay strategy into a Dueling-DQN reinforcement learning framework.A multi-objective reward function,centered on energy efficiency,is designed to comprehensively consider path length,terrain slope,motion smoothness,and obstacle avoidance,enabling optimal low-energy trajectory generation in 3D space from the source.The incorporation of a multihead attention mechanism allows the model to dynamically focus on energy-critical state features—such as slope gradients and obstacle density—thereby significantly improving its ability to recognize and avoid energy-intensive paths.Additionally,the prioritized experience replay mechanism accelerates learning from key decision-making experiences,suppressing inefficient exploration and guiding the policy toward low-energy solutions more rapidly.The effectiveness of the proposed path planning algorithm is validated through simulation experiments conducted in multiple off-road scenarios.Results demonstrate that AD-Dueling DQN consistently achieves the lowest average energy consumption across all tested environments.Moreover,the proposed method exhibits faster convergence and greater training stability compared to baseline algorithms,highlighting its global optimization capability under energy-aware objectives in complex terrains.This study offers an efficient and scalable intelligent control strategy for the development of energy-conscious autonomous navigation systems.
基金supported in part by the National Natural Science Foundation of China under Grant 52432012in part by the Shanghai Science and Technology Project with 25ZR1402508。
文摘The stator of the maglev track plays a crucial role in the operation of the maglev system.Currently,the efficiency of maglev track inspection is limited by several factors,including the large span of elevated structures,manual visual inspection,short inspection window times,and limited GPS positioning accuracy.To address these issues,this paper proposes a deep learning-based method for detecting and locating stator surface damage.This study establishes a maglev track stator surface image dataset,trains different object detection models,and compares their performance.Ultimately,YOLO and ByteTrack object tracking algorithms were chosen as the basic framework and enhanced to achieve automatic identification of high-speed maglev track stator surface damage images and track and count stator surface localization feature images.By matching the identified damaged images with their corresponding stator segment and beam segment sequence numbers,the location of the damage is pinpointed to the corresponding stator segment,enabling rapid and accurate identification and localization of complex damage to the maglev track stator surface.
基金financially supported by theNatural Research Project of College in Anhui Province under grant 2024AH051365,2025AHGXZK30826Research Platform of New Energy and Energy-Saving Technology Research Center under grant KYJG002.
文摘Intelligent inspection of transmission lines enables efficient automated fault detection by integrating artificial intelligence,robotics,and other related technologies.It plays a key role in ensuring power grid safety,reducing operation and maintenance costs,driving the digital transformation of the power industry,and facilitating the achievement of the dual-carbon goals.This review focuses on vision-based power line inspection,with deep learning as the core perspective to systematically analyze the latest research advancements in this field.Firstly,at the technical foundation level,it elaborates on deep learning algorithms for intelligent transmission line inspection based on image perception,covering object detection algorithms,semantic segmentation algorithms,and other relevant methodologies.Secondly,in application practice,it summarizes deep learning-based intelligent inspection applications across six dimensions—including detection of power insulators and their defects,transmission tower detection,power line feature extraction,metal fitting and defect detection,thermal fault diagnosis of power components,and safety hazard detection in power scenarios,and further lists relevant public datasets.Finally,in response to current challenges,it identifies five key future research directions,such as the deep integration of multiple learning paradigms,multi-modal data fusion,collaborative application of large and small models,cloud-edge-end collaborative integration,and multi-agent cluster control.This paper reviews and analyzes numerous deep learning-based intelligent detectionmethods for aerial images,comprehensively explores the application of deep learning in Unmanned Aerial Vehicle(UAV)inspection scenarios,and thus provides valuable theoretical and practical references for scholars engaged in smart grid automated inspection research.
基金funded by the National Key Research and Development Program of China(Grant No.2019YFD1001900)the HZAU-AGIS Cooperation Fund(Grant No.SZYJY2022006).
文摘Nondestructive measurement technology of phenotype can provide substantial phenotypic data support for applications such as seedling breeding,management,and quality testing.The current method of measuring seedling phenotypes mainly relies on manual measurement which is inefficient,subjective and destroys samples.Therefore,the paper proposes a nondestructive measurement method for the canopy phenotype of the watermelon plug seedlings based on deep learning.The Azure Kinect was used to shoot canopy color images,depth images,and RGB-D images of the watermelon plug seedlings.The Mask-RCNN network was used to classify,segment,and count the canopy leaves of the watermelon plug seedlings.To reduce the error of leaf area measurement caused by mutual occlusion of leaves,the leaves were repaired by CycleGAN,and the depth images were restored by image processing.Then,the Delaunay triangulation was adopted to measure the leaf area in the leaf point cloud.The YOLOX target detection network was used to identify the growing point position of each seedling on the plug tray.Then the depth differences between the growing point and the upper surface of the plug tray were calculated to obtain plant height.The experiment results show that the nondestructive measurement algorithm proposed in this paper achieves good measurement performance for the watermelon plug seedlings from the 1 true-leaf to 3 true-leaf stages.The average relative error of measurement is 2.33%for the number of true leaves,4.59%for the number of cotyledons,8.37%for the leaf area,and 3.27%for the plant height.The experiment results demonstrate that the proposed algorithm in this paper provides an effective solution for the nondestructive measurement of the canopy phenotype of the plug seedlings.
文摘Deep learning attentionmechanisms have achieved remarkable progress in computer vision,but still face limitations when handling images with ambiguous boundaries and uncertain feature representations.Conventional attention modules such as SE-Net,CBAM,ECA-Net,and CA adopt a deterministic paradigm,assigning fixed scalar weights to features without modeling ambiguity or confidence.To overcome these limitations,this paper proposes the Fuzzy Attention Network Layer(FANL),which integrates intuitionistic fuzzy set theory with convolutional neural networks to explicitly represent feature uncertainty through membership(μ),non-membership(ν),and hesitation(π)degrees.FANLconsists of four coremodules:(1)feature dimensionality reduction via global pooling,(2)fuzzymodeling using learnable clustering centers,(3)adaptive attention generation through weighted fusion of fuzzy components,and(4)feature refinement through residual connections.A cross-layer guidance mechanism is further introduced to enhance hierarchical feature propagation,allowing high-level semantic features to incorporate fine-grained texture information from shallow layers.Comprehensive experiments on three benchmark datasets—PathMNIST-30000,full PathMNIST,and Blood MNIST—demonstrate the effectiveness and generalizability of FANL.The model achieves 84.41±0.56%accuracy and a 1.69%improvement over the baseline CNN while maintaining lightweight computational complexity.Ablation studies show that removing any component causes a 1.7%–2.0%performance drop,validating the synergistic contribution of each module.Furthermore,FANL provides superior uncertainty calibration(ECE=0.0452)and interpretable selective prediction under uncertainty.Overall,FANL presents an efficient and uncertaintyaware attention framework that improves both accuracy and reliability,offering a promising direction for robust visual recognition under ambiguous or noisy conditions.
基金sponsored by the National Key Scientific Instrument and Equipment Development Projects of China(Grant No.62027823)the National Natural Science Foun-dation of China(Grant No.61775048).
文摘In recent years,the use of deep learning to replace traditional numerical methods for electromagnetic propagation has shown tremendous potential in the rapid design of photonic devices.However,most research on deep learning has focused on single-layer grating couplers,and the accuracy of multi-layer grating couplers has not yet reached a high level.This paper proposes and demonstrates a novel deep learning network-assisted strategy for inverse design.The network model is based on a multi-layer perceptron(MLP)and incorporates convolutional neural networks(CNNs)and transformers.Through the stacking of multiple layers,it achieves a high-precision design for both multi-layer and single-layer raster couplers with various functionalities.The deep learning network exhibits exceptionally high predictive accuracy,with an average absolute error across the full wavelength range of 1300–1700 nm being only 0.17%,and an even lower predictive absolute error below 0.09%at the specific wavelength of 1550 nm.By combining the deep learning network with the genetic algorithm,we can efficiently design grating couplers that perform different functions.Simulation results indicate that the designed single-wavelength grating couplers achieve coupling efficiencies exceeding 80%at central wavelengths of 1550 nm and 1310 nm.The performance of designed dual-wavelength and broadband grating couplers also reaches high industry standards.Furthermore,the network structure and inverse design method are highly scalable and can be applied not only to multi-layer grating couplers but also directly to the prediction and design of single-layer grating couplers,providing a new perspective for the innovative development of photonic devices.