Anomaly detection is an important method for intrusion detection.In recent years,unsupervised methods have been widely researched because they do not require labeling.For example,a nonlinear autoencoder can use recons...Anomaly detection is an important method for intrusion detection.In recent years,unsupervised methods have been widely researched because they do not require labeling.For example,a nonlinear autoencoder can use reconstruction errors to attain the discrimination threshold.This method is not effective when the model complexity is high or the data contains noise.The method for detecting the density of compressed features in a hidden layer can be used to reduce the influence of noise on the selection of the threshold because the density of abnormal data in hidden layers is smaller than normal data.However,compressed features may lose some of the high-dimensional distribution information of the original data.In this paper,we present an efficient anomaly detection framework for unsupervised anomaly detection,which includes network data capturing,processing,feature extraction,and anomaly detection.We employ a deep autoencoder to obtain compressed features and multi-layer reconstruction errors,and feeds them the same to the Gaussian mixture model to estimate the density.The proposed approach is trained and tested on multiple current intrusion detection datasets and real network scenes,and performance indicators,namely accuracy,recall,and F1-score,are better than other autoencoder models.展开更多
The rapid growth of biomedical data,particularly multi-omics data including genomes,transcriptomics,proteomics,metabolomics,and epigenomics,medical research and clinical decision-making confront both new opportunities...The rapid growth of biomedical data,particularly multi-omics data including genomes,transcriptomics,proteomics,metabolomics,and epigenomics,medical research and clinical decision-making confront both new opportunities and obstacles.The huge and diversified nature of these datasets cannot always be managed using traditional data analysis methods.As a consequence,deep learning has emerged as a strong tool for analysing numerous omics data due to its ability to handle complex and non-linear relationships.This paper explores the fundamental concepts of deep learning and how they are used in multi-omics medical data mining.We demonstrate how autoencoders,variational autoencoders,multimodal models,attention mechanisms,transformers,and graph neural networks enable pattern analysis and recognition across all omics data.Deep learning has been found to be effective in illness classification,biomarker identification,gene network learning,and therapeutic efficacy prediction.We also consider critical problems like as data quality,model explainability,whether findings can be repeated,and computational power requirements.We now consider future elements of combining omics with clinical and imaging data,explainable AI,federated learning,and real-time diagnostics.Overall,this study emphasises the need of collaborating across disciplines to advance deep learning-based multi-omics research for precision medicine and comprehending complicated disorders.展开更多
Credit risk assessment is a crucial task in bank risk management.By making lending decisions based on credit risk assessment results,banks can reduce the probability of non-performing loans.However,class imbalance in ...Credit risk assessment is a crucial task in bank risk management.By making lending decisions based on credit risk assessment results,banks can reduce the probability of non-performing loans.However,class imbalance in bank credit default datasets limits the predictive performance of traditional machine learning and deep learning models.To address this issue,this study employs the conditional variational autoencoder-Wasserstein generative adversarial network with gradient penalty(CVAE-WGAN-gp)model for oversampling,generating samples similar to the original default customer data to enhance model prediction performance.To evaluate the quality of the data generated by the CVAE-WGAN-gp model,we selected several bank loan datasets for experimentation.The experimental results demonstrate that using the CVAE-WGAN-gp model for oversampling can significantly improve the predictive performance in credit risk assessment problems.展开更多
Electronic nose and thermal images are effective ways to diagnose the presence of gases in real-time realtime.Multimodal fusion of these modalities can result in the development of highly accurate diagnostic systems.T...Electronic nose and thermal images are effective ways to diagnose the presence of gases in real-time realtime.Multimodal fusion of these modalities can result in the development of highly accurate diagnostic systems.The low-cost thermal imaging software produces low-resolution thermal images in grayscale format,hence necessitating methods for improving the resolution and colorizing the images.The objective of this paper is to develop and train a super-resolution generative adversarial network for improving the resolution of the thermal images,followed by a sparse autoencoder for colorization of thermal images and amultimodal convolutional neural network for gas detection using electronic nose and thermal images.The dataset used comprises 6400 thermal images and electronic nose measurements for four classes.A multimodal Convolutional Neural Network(CNN)comprising an EfficientNetB2 pre-trainedmodel was developed using both early and late feature fusion.The Super Resolution Generative Adversarial Network(SRGAN)model was developed and trained on low and high-resolution thermal images.Asparse autoencoder was trained on the grayscale and colorized thermal images.The SRGAN was trained on lowand high-resolution thermal images,achieving a Structural Similarity Index(SSIM)of 90.28,a Peak Signal-to-Noise Ratio(PSNR)of 68.74,and a Mean Absolute Error(MAE)of 0.066.The autoencoder model produced an MAE of 0.035,a Mean Squared Error(MSE)of 0.006,and a Root Mean Squared Error(RMSE)of 0.0705.The multimodal CNN,trained on these images and electronic nose measurements using both early and late fusion techniques,achieved accuracies of 97.89% and 98.55%,respectively.Hence,the proposed framework can be of great aid for the integration with low-cost software to generate high quality thermal camera images and highly accurate detection of gases in real-time.展开更多
High-speed railway bridges are essential components of any railway transportation system that should keep adequate levels of serviceability and safety.In this context,drive-by methodologies have emerged as a feasible ...High-speed railway bridges are essential components of any railway transportation system that should keep adequate levels of serviceability and safety.In this context,drive-by methodologies have emerged as a feasible and cost-effective monitor-ing solution for detecting damage on railway bridges while minimizing train operation interruptions.Moreover,integrating advanced sensor technologies and machine learning algorithms has significantly enhanced structural health monitoring(SHM)for bridges.Despite being increasingly used in traditional SHM applications,studies using autoencoders within drive-by methodologies are rare,especially in the railway field.This study presents a novel approach for drive-by damage detection in HSR bridges.The methodology relies on acceleration records collected from multiple bridge crossings by an operational train equipped with onboard sensors.Log-Mel spectrogram features derived from the acceleration records are used together with sparse autoencoders for computing statistical distribution-based damage indexes.Numerical simulations were performed on a 3D vehicle-track-bridge interaction system model implemented in Matlab to evaluate the robustness and effectiveness of the proposed approach,considering several damage scenarios,vehicle speeds,and environmental and operational variations,such as multiple track irregularities and varying measurement noise.The results show that the pro-posed approach can successfully detect damages,as well as characterize their severity,especially for very early-stage dam-ages.This demonstrates the high potential of applying Mel-frequency damage-sensitive features associated with machine learning algorithms in the drive-by condition assessment of high-speed railway bridges.展开更多
Traditional inverse neural network(INN)approaches for inverse design typically require auxiliary feedforward networks,leading to increased computational complexity and architectural dependencies.This study introduces ...Traditional inverse neural network(INN)approaches for inverse design typically require auxiliary feedforward networks,leading to increased computational complexity and architectural dependencies.This study introduces a standalone INN methodology that eliminates the need for feedforward networks while maintaining high reconstruction accuracy.The approach integrates Principal Component Analysis(PCA)and Partial Least Squares(PLS)for optimized feature space learning,enabling the standalone INN to effectively capture bidirectionalmappings between geometric parameters and mechanical properties.Validation using established numerical datasets demonstrates that the standalone INN architecture achieves reconstruction accuracy equal or better than traditional tandem approaches while completely eliminating the workload and training time required for Feedforward Neural Networks(FNN).These findings contribute to AI methodology development by proving that standalone invertible architectures can achieve comparable performance to complex hybrid systems with significantly improved computational efficiency.展开更多
Time series anomaly detection is crucial in finance,healthcare,and industrial monitoring.However,traditional methods often face challenges when handling time series data,such as limited feature extraction capability,p...Time series anomaly detection is crucial in finance,healthcare,and industrial monitoring.However,traditional methods often face challenges when handling time series data,such as limited feature extraction capability,poor temporal dependency handling,and suboptimal real-time performance,sometimes even neglecting the temporal relationships between data.To address these issues and improve anomaly detection performance by better capturing temporal dependencies,we propose an unsupervised time series anomaly detection method,VLT-Anomaly.First,we enhance the Variational Autoencoder(VAE)module by redesigning its network structure to better suit anomaly detection through data reconstruction.We introduce hyperparameters to control the weight of the Kullback-Leibler(KL)divergence term in the Evidence Lower Bound(ELBO),thereby improving the encoder module’s decoupling and expressive power in the latent space,which yields more effective latent representations of the data.Next,we incorporate transformer and Long Short-Term Memory(LSTM)modules to estimate the long-term dependencies of the latent representations,capturing both forward and backward temporal relationships and performing time series forecasting.Finally,we compute the reconstruction error by averaging the predicted results and decoder reconstruction and detect anomalies through grid search for optimal threshold values.Experimental results demonstrate that the proposed method performs superior anomaly detection on multiple public time series datasets,effectively extracting complex time-related features and enabling efficient computation and real-time anomaly detection.It improves detection accuracy and robustness while reducing false positives and false negatives.展开更多
Osteoporosis is a major cause of bone fracture and can be characterised by both mass loss and microstructure deterioration of the bone.The modern way of osteoporosis assessment is through the measurement of bone miner...Osteoporosis is a major cause of bone fracture and can be characterised by both mass loss and microstructure deterioration of the bone.The modern way of osteoporosis assessment is through the measurement of bone mineral density,which is not able to unveil the pathological condition from the mesoscale aspect.To obtain mesoscale information from computed tomography(CT),the super-resolution(SR)approach for volumetric imaging data is required.A deep learning model AESR3D is proposed to recover high-resolution(HR)Micro-CT from low-resolution Micro-CT and implement an unsupervised segmentation for better trabecular observation and measurement.A new regularisation overcomplete autoencoder framework for the SR task is proposed and theoretically analysed.The best performance is achieved on structural similarity measure of trabecular CT SR task compared with the state-of-the-art models in both natural and medical image SR tasks.The HR and SR images show a high correlation(r=0.996,intraclass correlation coefficients=0.917)on trabecular bone morphological indicators.The results also prove the effectiveness of our regularisation framework when training a large capacity model.展开更多
The emergence of Generative Adversarial Network(GAN)techniques has garnered significant attention from the research community for the development of Intrusion Detection Systems(IDS).However,conventional GAN-based IDS ...The emergence of Generative Adversarial Network(GAN)techniques has garnered significant attention from the research community for the development of Intrusion Detection Systems(IDS).However,conventional GAN-based IDS models face several challenges,including training instability,high computational costs,and system failures.To address these limitations,we propose a Hybrid Wasserstein GAN and Autoencoder Model(WGAN-AE)for intrusion detection.The proposed framework leverages the stability of WGAN and the feature extraction capabilities of the Autoencoder Model.The model was trained and evaluated using two recent benchmark datasets,5GNIDD and IDSIoT2024.When trained on the 5GNIDD dataset,the model achieved an average area under the precisionrecall curve is 99.8%using five-fold cross-validation and demonstrated a high detection accuracy of 97.35%when tested on independent test data.Additionally,the model is well-suited for deployment on resource-limited Internetof-Things(IoT)devices due to its ability to detect attacks within microseconds and its small memory footprint of 60.24 kB.Similarly,when trained on the IDSIoT2024 dataset,the model achieved an average PR-AUC of 94.09%and an attack detection accuracy of 97.35%on independent test data,with a memory requirement of 61.84 kB.Extensive simulation results demonstrate that the proposed hybrid model effectively addresses the shortcomings of traditional GAN-based IDS approaches in terms of detection accuracy,computational efficiency,and applicability to real-world IoT environments.展开更多
It is crucial to predict future mechanical behaviors for the prevention of structural disasters.Especially for underground construction,the structural mechanical behaviors are affected by multiple internal and externa...It is crucial to predict future mechanical behaviors for the prevention of structural disasters.Especially for underground construction,the structural mechanical behaviors are affected by multiple internal and external factors due to the complex conditions.Given that the existing models fail to take into account all the factors and accurate prediction of the multiple time series simultaneously is difficult using these models,this study proposed an improved prediction model through the autoencoder fused long-and short-term time-series network driven by the mass number of monitoring data.Then,the proposed model was formalized on multiple time series of strain monitoring data.Also,the discussion analysis with a classical baseline and an ablation experiment was conducted to verify the effectiveness of the prediction model.As the results indicate,the proposed model shows obvious superiority in predicting the future mechanical behaviors of structures.As a case study,the presented model was applied to the Nanjing Dinghuaimen tunnel to predict the stain variation on a different time scale in the future.展开更多
The aerial deployment method enables Unmanned Aerial Vehicles(UAVs)to be directly positioned at the required altitude for their mission.This method typically employs folding technology to improve loading efficiency,wi...The aerial deployment method enables Unmanned Aerial Vehicles(UAVs)to be directly positioned at the required altitude for their mission.This method typically employs folding technology to improve loading efficiency,with applications such as the gravity-only aerial deployment of high-aspect-ratio solar-powered UAVs,and aerial takeoff of fixed-wing drones in Mars research.However,the significant morphological changes during deployment are accompanied by strong nonlinear dynamic aerodynamic forces,which result in multiple degrees of freedom and an unstable character.This hinders the description and analysis of unknown dynamic behaviors,further leading to difficulties in the design of deployment strategies and flight control.To address this issue,this paper proposes an analysis method for dynamic behaviors during aerial deployment based on the Variational Autoencoder(VAE).Focusing on the gravity-only deployment problem of highaspect-ratio foldable-wing UAVs,the method encodes the multi-degree-of-freedom unstable motion signals into a low-dimensional feature space through a data-driven approach.By clustering in the feature space,this paper identifies and studies several dynamic behaviors during aerial deployment.The research presented in this paper offers a new method and perspective for feature extraction and analysis of complex and difficult-to-describe extreme flight dynamics,guiding the research on aerial deployment drones design and control strategies.展开更多
Deep Learning has been widely used to model soft sensors in modern industrial processes with nonlinear variables and uncertainty.Due to the outstanding ability for high-level feature extraction,stacked autoencoder(SAE...Deep Learning has been widely used to model soft sensors in modern industrial processes with nonlinear variables and uncertainty.Due to the outstanding ability for high-level feature extraction,stacked autoencoder(SAE)has been widely used to improve the model accuracy of soft sensors.However,with the increase of network layers,SAE may encounter serious information loss issues,which affect the modeling performance of soft sensors.Besides,there are typically very few labeled samples in the data set,which brings challenges to traditional neural networks to solve.In this paper,a multi-scale feature fused stacked autoencoder(MFF-SAE)is suggested for feature representation related to hierarchical output,where stacked autoencoder,mutual information(MI)and multi-scale feature fusion(MFF)strategies are integrated.Based on correlation analysis between output and input variables,critical hidden variables are extracted from the original variables in each autoencoder's input layer,which are correspondingly given varying weights.Besides,an integration strategy based on multi-scale feature fusion is adopted to mitigate the impact of information loss with the deepening of the network layers.Then,the MFF-SAE method is designed and stacked to form deep networks.Two practical industrial processes are utilized to evaluate the performance of MFF-SAE.Results from simulations indicate that in comparison to other cutting-edge techniques,the proposed method may considerably enhance the accuracy of soft sensor modeling,where the suggested method reduces the root mean square error(RMSE)by 71.8%,17.1%and 64.7%,15.1%,respectively.展开更多
Direct coal liquefaction products offer a considerable quantity of cycloalkanes, which are the valuable candidates for making the high energy density fuels. The creation of such fuels depends on designing molecular st...Direct coal liquefaction products offer a considerable quantity of cycloalkanes, which are the valuable candidates for making the high energy density fuels. The creation of such fuels depends on designing molecular structures and calculating their properties, which can be expedited with computer-aided techniques. In this study, a dataset containing 367 fuel molecules was constructed based on the analysis of direct coal liquefied oil. Three convolutional neural network property prediction models have been created based on molecular structure-physical and chemical property data from the library. All the models have good fitting ability with R2 values above 0.97. Then, a variational autoencoder generation model has been established using the molecular structures from the library, focusing on the structure of saturated cycloalkanes. The structure-property prediction model was then applied to the newly generated molecules, assessing their density, volumetric calorific value, and melting point. As a result, 70000 novel molecular structures were generated, and 25 molecular structures meeting the criteria for high energy density fuels were identified. The established variational autoencoder model in this study effectively assimilates the structural information from the sample set and autonomously generates novel high energy density fuels, which is difficult to achieve in traditional experimental methods.展开更多
To predict the lithium-ion(Li-ion)battery degradation trajectory in the early phase,arranging the maintenance of battery energy storage systems is of great importance.However,under different operation conditions,Li-io...To predict the lithium-ion(Li-ion)battery degradation trajectory in the early phase,arranging the maintenance of battery energy storage systems is of great importance.However,under different operation conditions,Li-ion batteries present distinct degradation patterns,and it is challenging to capture negligible capacity fade in early cycles.Despite the data-driven method showing promising performance,insufficient data is still a big issue since the ageing experiments on the batteries are too slow and expensive.In this study,we proposed twin autoencoders integrated into a two-stage method to predict the early cycles'degradation trajectories.The two-stage method can properly predict the degradation from course to fine.The twin autoencoders serve as a feature extractor and a synthetic data generator,respectively.Ultimately,a learning procedure based on the long-short term memory(LSTM)network is designed to hybridize the learning process between the real and synthetic data.The performance of the proposed method is verified on three datasets,and the experimental results show that the proposed method can achieve accurate predictions compared to its competitors.展开更多
Geochemical survey data are essential across Earth Science disciplines but are often affected by noise,which can obscure important geological signals and compromise subsequent prediction and interpretation.Quantifying...Geochemical survey data are essential across Earth Science disciplines but are often affected by noise,which can obscure important geological signals and compromise subsequent prediction and interpretation.Quantifying prediction uncertainty is hence crucial for robust geoscientific decision-making.This study proposes a novel deep learning framework,the Spatially Constrained Variational Autoencoder(SC-VAE),for denoising geochemical survey data with integrated uncertainty quantification.The SC-VAE incorporates spatial regularization,which enforces spatial coherence by modeling inter-sample relationships directly within the latent space.The performance of the SC-VAE was systematically evaluated against a standard Variational Autoencoder(VAE)using geochemical data from the gold polymetallic district in the northwestern part of Sichuan Province,China.Both models were optimized using Bayesian optimization,with objective functions specifically designed to maintain essential geostatistical characteristics.Evaluation metrics include variogram analysis,quantitative measures of spatial interpolation accuracy,visual assessment of denoised maps,and statistical analysis of data distributions,as well as decomposition of uncertainties.Results show that the SC-VAE achieves superior noise suppression and better preservation of spatial structure compared to the standard VAE,as demonstrated by a significant reduction in the variogram nugget effect and an increased partial sill.The SC-VAE produces denoised maps with clearer anomaly delineation and more regularized data distributions,effectively mitigating outliers and reducing kurtosis.Additionally,it delivers improved interpolation accuracy and spatially explicit uncertainty estimates,facilitating more reliable and interpretable assessments of prediction confidence.The SC-VAE framework thus provides a robust,geostatistically informed solution for enhancing the quality and interpretability of geochemical data,with broad applicability in mineral exploration,environmental geochemistry,and other Earth Science domains.展开更多
High-resolution spatiotemporal simulations effectively capture the complexities of atmospheric plume sion disper-in complex terrain.However,their high computational cost makes them impractical for applications requiri...High-resolution spatiotemporal simulations effectively capture the complexities of atmospheric plume sion disper-in complex terrain.However,their high computational cost makes them impractical for applications requiring rapid responses or iterative processes,such as optimization,uncertainty quantification,or inverse modeling.To address this challenge,this work introduces the Dual-Stage Temporal Three-dimensional UNet Super-resolution(DST3D-UNet-SR)model,a highly efficient deep learning model for plume dispersion predictions.DST3D-UNet-SR is composed of two sequential modules:the temporal module(TM),which predicts the transient evolution of a plume in complex terrain from low-resolution temporal data,and the spatial refinement module(SRM),which subsequently enhances the spatial resolution of the TM predictions.We train DST3D-UNet-SR using a comprehensive dataset derived from high-resolution large eddy simulations(LES)of plume transport.We propose the DST3D-UNet-SR model to significantly accelerate LES of three-dimensional(3D)plume dispersion by three orders of magnitude.Additionally,the model demonstrates the ability to dynamically adapt to evolving conditions through the incorporation of new observational data,substantially improving prediction accuracy in high-concentration regions near the source.展开更多
As oil and gas exploration continues to progress into deeper and unconventional reservoirs,the likelihood of kick risk increases,making kick warning a critical factor in ensuring drilling safety and efficiency.Due to ...As oil and gas exploration continues to progress into deeper and unconventional reservoirs,the likelihood of kick risk increases,making kick warning a critical factor in ensuring drilling safety and efficiency.Due to the scarcity of kick samples,traditional supervised models perform poorly,and significant fluctuations in field data lead to high false alarm rates.This study proposes an unsupervised graph autoencoder(GAE)-based kick warning method,which effectively reduces false alarms by eliminating the influence of field engineer operations and incorporating real-time model updates.The method utilizes the GAE model to process time-series data during drilling,accurately identifying kick risk while overcoming challenges related to small sample sizes and missing features.To further reduce false alarms,the weighted dynamic time warping(WDTW)algorithm is introduced to identify fluctuations in logging data caused by field engineer operations during drilling,with real-time updates applied to prevent normal conditions from being misclassified as kick risk.Experimental results show that the GAE-based kick warning method achieves an accuracy of 92.7%and significantly reduces the false alarm rate.The GAE model continues to operate effectively even under conditions of missing features and issues kick warnings 4 min earlier than field engineers,demonstrating its high sensitivity and robustness.After integrating the WDTW algorithm and real-time updates,the false alarm rate is reduced from 17.3%to 5.6%,further improving the accuracy of kick warnings.The proposed method provides an efficient and reliable approach for kick warning in drilling operations,offering strong practical value and technical support for the intelligent management of future drilling operations.展开更多
3D medical image reconstruction has significantly enhanced diagnostic accuracy,yet the reliance on densely sampled projection data remains a major limitation in clinical practice.Sparse-angle X-ray imaging,though safe...3D medical image reconstruction has significantly enhanced diagnostic accuracy,yet the reliance on densely sampled projection data remains a major limitation in clinical practice.Sparse-angle X-ray imaging,though safer and faster,poses challenges for accurate volumetric reconstruction due to limited spatial information.This study proposes a 3D reconstruction neural network based on adaptive weight fusion(AdapFusionNet)to achieve high-quality 3D medical image reconstruction from sparse-angle X-ray images.To address the issue of spatial inconsistency in multi-angle image reconstruction,an innovative adaptive fusion module was designed to score initial reconstruction results during the inference stage and perform weighted fusion,thereby improving the final reconstruction quality.The reconstruction network is built on an autoencoder(AE)framework and uses orthogonal-angle X-ray images(frontal and lateral projections)as inputs.The encoder extracts 2D features,which the decoder maps into 3D space.This study utilizes a lung CT dataset to obtain complete three-dimensional volumetric data,from which digitally reconstructed radiographs(DRR)are generated at various angles to simulate X-ray images.Since real-world clinical X-ray images rarely come with perfectly corresponding 3D“ground truth,”using CT scans as the three-dimensional reference effectively supports the training and evaluation of deep networks for sparse-angle X-ray 3D reconstruction.Experiments conducted on the LIDC-IDRI dataset with simulated X-ray images(DRR images)as training data demonstrate the superior performance of AdapFusionNet compared to other fusion methods.Quantitative results show that AdapFusionNet achieves SSIM,PSNR,and MAE values of 0.332,13.404,and 0.163,respectively,outperforming other methods(SingleViewNet:0.289,12.363,0.182;AvgFusionNet:0.306,13.384,0.159).Qualitative analysis further confirms that AdapFusionNet significantly enhances the reconstruction of lung and chest contours while effectively reducing noise during the reconstruction process.The findings demonstrate that AdapFusionNet offers significant advantages in 3D reconstruction of sparse-angle X-ray images.展开更多
During its growth stage,the plant is exposed to various diseases.Detection and early detection of crop diseases is amajor challenge in the horticulture industry.Crop infections can harmtotal crop yield and reduce farm...During its growth stage,the plant is exposed to various diseases.Detection and early detection of crop diseases is amajor challenge in the horticulture industry.Crop infections can harmtotal crop yield and reduce farmers’income if not identified early.Today’s approved method involves a professional plant pathologist to diagnose the disease by visual inspection of the afflicted plant leaves.This is an excellent use case for Community Assessment and Treatment Services(CATS)due to the lengthy manual disease diagnosis process and the accuracy of identification is directly proportional to the skills of pathologists.An alternative to conventional Machine Learning(ML)methods,which require manual identification of parameters for exact results,is to develop a prototype that can be classified without pre-processing.To automatically diagnose tomato leaf disease,this research proposes a hybrid model using the Convolutional Auto-Encoders(CAE)network and the CNN-based deep learning architecture of DenseNet.To date,none of the modern systems described in this paper have a combined model based on DenseNet,CAE,and ConvolutionalNeuralNetwork(CNN)todiagnose the ailments of tomato leaves automatically.Themodelswere trained on a dataset obtained from the Plant Village repository.The dataset consisted of 9920 tomato leaves,and the model-tomodel accuracy ratio was 98.35%.Unlike other approaches discussed in this paper,this hybrid strategy requires fewer training components.Therefore,the training time to classify plant diseases with the trained algorithm,as well as the training time to automatically detect the ailments of tomato leaves,is significantly reduced.展开更多
基金This work is supported by the Introducing Program of Dongguan for Leading Talents in Innovation and Entrepreneur(Dongren Han[2018],No.738).
文摘Anomaly detection is an important method for intrusion detection.In recent years,unsupervised methods have been widely researched because they do not require labeling.For example,a nonlinear autoencoder can use reconstruction errors to attain the discrimination threshold.This method is not effective when the model complexity is high or the data contains noise.The method for detecting the density of compressed features in a hidden layer can be used to reduce the influence of noise on the selection of the threshold because the density of abnormal data in hidden layers is smaller than normal data.However,compressed features may lose some of the high-dimensional distribution information of the original data.In this paper,we present an efficient anomaly detection framework for unsupervised anomaly detection,which includes network data capturing,processing,feature extraction,and anomaly detection.We employ a deep autoencoder to obtain compressed features and multi-layer reconstruction errors,and feeds them the same to the Gaussian mixture model to estimate the density.The proposed approach is trained and tested on multiple current intrusion detection datasets and real network scenes,and performance indicators,namely accuracy,recall,and F1-score,are better than other autoencoder models.
文摘The rapid growth of biomedical data,particularly multi-omics data including genomes,transcriptomics,proteomics,metabolomics,and epigenomics,medical research and clinical decision-making confront both new opportunities and obstacles.The huge and diversified nature of these datasets cannot always be managed using traditional data analysis methods.As a consequence,deep learning has emerged as a strong tool for analysing numerous omics data due to its ability to handle complex and non-linear relationships.This paper explores the fundamental concepts of deep learning and how they are used in multi-omics medical data mining.We demonstrate how autoencoders,variational autoencoders,multimodal models,attention mechanisms,transformers,and graph neural networks enable pattern analysis and recognition across all omics data.Deep learning has been found to be effective in illness classification,biomarker identification,gene network learning,and therapeutic efficacy prediction.We also consider critical problems like as data quality,model explainability,whether findings can be repeated,and computational power requirements.We now consider future elements of combining omics with clinical and imaging data,explainable AI,federated learning,and real-time diagnostics.Overall,this study emphasises the need of collaborating across disciplines to advance deep learning-based multi-omics research for precision medicine and comprehending complicated disorders.
基金supported by National Key R&D Program of China(2022YFA1008000)the National Natural Science Foundation of China(12571297,12101585)+1 种基金the CAS Talent Introduction Program(Category B)the Young Elite Scientist Sponsorship Program by CAST(YESS20220125).
文摘Credit risk assessment is a crucial task in bank risk management.By making lending decisions based on credit risk assessment results,banks can reduce the probability of non-performing loans.However,class imbalance in bank credit default datasets limits the predictive performance of traditional machine learning and deep learning models.To address this issue,this study employs the conditional variational autoencoder-Wasserstein generative adversarial network with gradient penalty(CVAE-WGAN-gp)model for oversampling,generating samples similar to the original default customer data to enhance model prediction performance.To evaluate the quality of the data generated by the CVAE-WGAN-gp model,we selected several bank loan datasets for experimentation.The experimental results demonstrate that using the CVAE-WGAN-gp model for oversampling can significantly improve the predictive performance in credit risk assessment problems.
基金funded by the Centre for Advanced Modelling and Geospatial Information Systems(CAMGIS),Faculty of Engineering and IT,University of Technology Sydneysupported by the Researchers Supporting Project,King Saud University,Riyadh,Saudi Arabia,under Project RSP2025 R14.
文摘Electronic nose and thermal images are effective ways to diagnose the presence of gases in real-time realtime.Multimodal fusion of these modalities can result in the development of highly accurate diagnostic systems.The low-cost thermal imaging software produces low-resolution thermal images in grayscale format,hence necessitating methods for improving the resolution and colorizing the images.The objective of this paper is to develop and train a super-resolution generative adversarial network for improving the resolution of the thermal images,followed by a sparse autoencoder for colorization of thermal images and amultimodal convolutional neural network for gas detection using electronic nose and thermal images.The dataset used comprises 6400 thermal images and electronic nose measurements for four classes.A multimodal Convolutional Neural Network(CNN)comprising an EfficientNetB2 pre-trainedmodel was developed using both early and late feature fusion.The Super Resolution Generative Adversarial Network(SRGAN)model was developed and trained on low and high-resolution thermal images.Asparse autoencoder was trained on the grayscale and colorized thermal images.The SRGAN was trained on lowand high-resolution thermal images,achieving a Structural Similarity Index(SSIM)of 90.28,a Peak Signal-to-Noise Ratio(PSNR)of 68.74,and a Mean Absolute Error(MAE)of 0.066.The autoencoder model produced an MAE of 0.035,a Mean Squared Error(MSE)of 0.006,and a Root Mean Squared Error(RMSE)of 0.0705.The multimodal CNN,trained on these images and electronic nose measurements using both early and late fusion techniques,achieved accuracies of 97.89% and 98.55%,respectively.Hence,the proposed framework can be of great aid for the integration with low-cost software to generate high quality thermal camera images and highly accurate detection of gases in real-time.
基金support of CNPq(Brazilian Ministry of Science and Technology Agency),of CAPES(Higher Education Improvement Agency),of FAPESP(São Paulo Research Foundation)under grant#2022/13045-1,of VALE Catedra Under Rail and of Base Funding-UIDB/04708/2020Programmatic Funding-UIDP/04708/2020 of the CONSTRUCT-“Instituto de I&D em Estruturas e Construções”.
文摘High-speed railway bridges are essential components of any railway transportation system that should keep adequate levels of serviceability and safety.In this context,drive-by methodologies have emerged as a feasible and cost-effective monitor-ing solution for detecting damage on railway bridges while minimizing train operation interruptions.Moreover,integrating advanced sensor technologies and machine learning algorithms has significantly enhanced structural health monitoring(SHM)for bridges.Despite being increasingly used in traditional SHM applications,studies using autoencoders within drive-by methodologies are rare,especially in the railway field.This study presents a novel approach for drive-by damage detection in HSR bridges.The methodology relies on acceleration records collected from multiple bridge crossings by an operational train equipped with onboard sensors.Log-Mel spectrogram features derived from the acceleration records are used together with sparse autoencoders for computing statistical distribution-based damage indexes.Numerical simulations were performed on a 3D vehicle-track-bridge interaction system model implemented in Matlab to evaluate the robustness and effectiveness of the proposed approach,considering several damage scenarios,vehicle speeds,and environmental and operational variations,such as multiple track irregularities and varying measurement noise.The results show that the pro-posed approach can successfully detect damages,as well as characterize their severity,especially for very early-stage dam-ages.This demonstrates the high potential of applying Mel-frequency damage-sensitive features associated with machine learning algorithms in the drive-by condition assessment of high-speed railway bridges.
基金funding by the Deutsche Forschungsgemeinschaft under Germany’s Excellence Strategy within the Cluster of Excellence PhoenixD(EXC 2122,Project ID 390833453).
文摘Traditional inverse neural network(INN)approaches for inverse design typically require auxiliary feedforward networks,leading to increased computational complexity and architectural dependencies.This study introduces a standalone INN methodology that eliminates the need for feedforward networks while maintaining high reconstruction accuracy.The approach integrates Principal Component Analysis(PCA)and Partial Least Squares(PLS)for optimized feature space learning,enabling the standalone INN to effectively capture bidirectionalmappings between geometric parameters and mechanical properties.Validation using established numerical datasets demonstrates that the standalone INN architecture achieves reconstruction accuracy equal or better than traditional tandem approaches while completely eliminating the workload and training time required for Feedforward Neural Networks(FNN).These findings contribute to AI methodology development by proving that standalone invertible architectures can achieve comparable performance to complex hybrid systems with significantly improved computational efficiency.
基金support from the Fundamental Research Funds for Central Public Welfare Research Institutes(SK202324)the Central Guidance on Local Science and Technology Development Fund of Hebei Province(236Z0104G)+1 种基金the National Natural Science Foundation of China(62476078)the Geological Survey Project of China Geological Survey(G202304-2).
文摘Time series anomaly detection is crucial in finance,healthcare,and industrial monitoring.However,traditional methods often face challenges when handling time series data,such as limited feature extraction capability,poor temporal dependency handling,and suboptimal real-time performance,sometimes even neglecting the temporal relationships between data.To address these issues and improve anomaly detection performance by better capturing temporal dependencies,we propose an unsupervised time series anomaly detection method,VLT-Anomaly.First,we enhance the Variational Autoencoder(VAE)module by redesigning its network structure to better suit anomaly detection through data reconstruction.We introduce hyperparameters to control the weight of the Kullback-Leibler(KL)divergence term in the Evidence Lower Bound(ELBO),thereby improving the encoder module’s decoupling and expressive power in the latent space,which yields more effective latent representations of the data.Next,we incorporate transformer and Long Short-Term Memory(LSTM)modules to estimate the long-term dependencies of the latent representations,capturing both forward and backward temporal relationships and performing time series forecasting.Finally,we compute the reconstruction error by averaging the predicted results and decoder reconstruction and detect anomalies through grid search for optimal threshold values.Experimental results demonstrate that the proposed method performs superior anomaly detection on multiple public time series datasets,effectively extracting complex time-related features and enabling efficient computation and real-time anomaly detection.It improves detection accuracy and robustness while reducing false positives and false negatives.
基金Beijing Natural Science Foundation-Haidian original Innovation Joint Foundation,Grant/Award Number:L192016Joint Funds of the National Natural Science Foundation of China,Grant/Award Number:U21A20489+3 种基金National Natural Science Foundation of China,Grant/Award Number:62003330Shenzhen Fundamental Research Funds,Grant/Award Numbers:JCYJ20220818101608019,JCYJ20190807170407391,JCYJ20180507182415428Natural Science Foundation of Guangdong Province,Grant/Award Number:2019A1515011699Guangdong-Hong Kong-Macao Joint Laboratory of Human-Machine Intelligence-Synergy Systems,Shenzhen Institute of Advanced Technology。
文摘Osteoporosis is a major cause of bone fracture and can be characterised by both mass loss and microstructure deterioration of the bone.The modern way of osteoporosis assessment is through the measurement of bone mineral density,which is not able to unveil the pathological condition from the mesoscale aspect.To obtain mesoscale information from computed tomography(CT),the super-resolution(SR)approach for volumetric imaging data is required.A deep learning model AESR3D is proposed to recover high-resolution(HR)Micro-CT from low-resolution Micro-CT and implement an unsupervised segmentation for better trabecular observation and measurement.A new regularisation overcomplete autoencoder framework for the SR task is proposed and theoretically analysed.The best performance is achieved on structural similarity measure of trabecular CT SR task compared with the state-of-the-art models in both natural and medical image SR tasks.The HR and SR images show a high correlation(r=0.996,intraclass correlation coefficients=0.917)on trabecular bone morphological indicators.The results also prove the effectiveness of our regularisation framework when training a large capacity model.
基金the Deanship of Research and Graduate Studies at King Khalid University for funding this work through Large Group Project under grant number(RGP.2/245/46)funded by Princess Nourah bint Abdulrahman University Researchers Supporting Project number(PNURSP2025R760)Princess Nourah bint Abdulrahman University,Riyadh,Saudi Arabia.The research team thanks the Deanship of Graduate Studies and Scientific Research at Najran University for supporting the research project through the Nama’a program,with the project code NU/GP/SERC/13/352-1。
文摘The emergence of Generative Adversarial Network(GAN)techniques has garnered significant attention from the research community for the development of Intrusion Detection Systems(IDS).However,conventional GAN-based IDS models face several challenges,including training instability,high computational costs,and system failures.To address these limitations,we propose a Hybrid Wasserstein GAN and Autoencoder Model(WGAN-AE)for intrusion detection.The proposed framework leverages the stability of WGAN and the feature extraction capabilities of the Autoencoder Model.The model was trained and evaluated using two recent benchmark datasets,5GNIDD and IDSIoT2024.When trained on the 5GNIDD dataset,the model achieved an average area under the precisionrecall curve is 99.8%using five-fold cross-validation and demonstrated a high detection accuracy of 97.35%when tested on independent test data.Additionally,the model is well-suited for deployment on resource-limited Internetof-Things(IoT)devices due to its ability to detect attacks within microseconds and its small memory footprint of 60.24 kB.Similarly,when trained on the IDSIoT2024 dataset,the model achieved an average PR-AUC of 94.09%and an attack detection accuracy of 97.35%on independent test data,with a memory requirement of 61.84 kB.Extensive simulation results demonstrate that the proposed hybrid model effectively addresses the shortcomings of traditional GAN-based IDS approaches in terms of detection accuracy,computational efficiency,and applicability to real-world IoT environments.
基金National Key Research and Development Program of China,Grant/Award Number:2018YFB2101003National Natural Science Foundation of China,Grant/Award Numbers:51991395,U1806226,51778033,51822802,71901011,U1811463,51991391Science and Technology Major Project of Beijing,Grant/Award Number:Z191100002519012。
文摘It is crucial to predict future mechanical behaviors for the prevention of structural disasters.Especially for underground construction,the structural mechanical behaviors are affected by multiple internal and external factors due to the complex conditions.Given that the existing models fail to take into account all the factors and accurate prediction of the multiple time series simultaneously is difficult using these models,this study proposed an improved prediction model through the autoencoder fused long-and short-term time-series network driven by the mass number of monitoring data.Then,the proposed model was formalized on multiple time series of strain monitoring data.Also,the discussion analysis with a classical baseline and an ablation experiment was conducted to verify the effectiveness of the prediction model.As the results indicate,the proposed model shows obvious superiority in predicting the future mechanical behaviors of structures.As a case study,the presented model was applied to the Nanjing Dinghuaimen tunnel to predict the stain variation on a different time scale in the future.
基金co-supported by the Natural Science Basic Research Program of Shaanxi,China(No.2023-JC-QN-0043)the ND Basic Research Funds,China(No.G2022WD).
文摘The aerial deployment method enables Unmanned Aerial Vehicles(UAVs)to be directly positioned at the required altitude for their mission.This method typically employs folding technology to improve loading efficiency,with applications such as the gravity-only aerial deployment of high-aspect-ratio solar-powered UAVs,and aerial takeoff of fixed-wing drones in Mars research.However,the significant morphological changes during deployment are accompanied by strong nonlinear dynamic aerodynamic forces,which result in multiple degrees of freedom and an unstable character.This hinders the description and analysis of unknown dynamic behaviors,further leading to difficulties in the design of deployment strategies and flight control.To address this issue,this paper proposes an analysis method for dynamic behaviors during aerial deployment based on the Variational Autoencoder(VAE).Focusing on the gravity-only deployment problem of highaspect-ratio foldable-wing UAVs,the method encodes the multi-degree-of-freedom unstable motion signals into a low-dimensional feature space through a data-driven approach.By clustering in the feature space,this paper identifies and studies several dynamic behaviors during aerial deployment.The research presented in this paper offers a new method and perspective for feature extraction and analysis of complex and difficult-to-describe extreme flight dynamics,guiding the research on aerial deployment drones design and control strategies.
基金supported by the National Key Research and Development Program of China(2023YFB3307800)National Natural Science Foundation of China(62394343,62373155)+2 种基金Major Science and Technology Project of Xinjiang(No.2022A01006-4)State Key Laboratory of Industrial Control Technology,China(Grant No.ICT2024A26)Fundamental Research Funds for the Central Universities.
文摘Deep Learning has been widely used to model soft sensors in modern industrial processes with nonlinear variables and uncertainty.Due to the outstanding ability for high-level feature extraction,stacked autoencoder(SAE)has been widely used to improve the model accuracy of soft sensors.However,with the increase of network layers,SAE may encounter serious information loss issues,which affect the modeling performance of soft sensors.Besides,there are typically very few labeled samples in the data set,which brings challenges to traditional neural networks to solve.In this paper,a multi-scale feature fused stacked autoencoder(MFF-SAE)is suggested for feature representation related to hierarchical output,where stacked autoencoder,mutual information(MI)and multi-scale feature fusion(MFF)strategies are integrated.Based on correlation analysis between output and input variables,critical hidden variables are extracted from the original variables in each autoencoder's input layer,which are correspondingly given varying weights.Besides,an integration strategy based on multi-scale feature fusion is adopted to mitigate the impact of information loss with the deepening of the network layers.Then,the MFF-SAE method is designed and stacked to form deep networks.Two practical industrial processes are utilized to evaluate the performance of MFF-SAE.Results from simulations indicate that in comparison to other cutting-edge techniques,the proposed method may considerably enhance the accuracy of soft sensor modeling,where the suggested method reduces the root mean square error(RMSE)by 71.8%,17.1%and 64.7%,15.1%,respectively.
基金the National Natural Science Foundation of China(22178243 and 22038008).
文摘Direct coal liquefaction products offer a considerable quantity of cycloalkanes, which are the valuable candidates for making the high energy density fuels. The creation of such fuels depends on designing molecular structures and calculating their properties, which can be expedited with computer-aided techniques. In this study, a dataset containing 367 fuel molecules was constructed based on the analysis of direct coal liquefied oil. Three convolutional neural network property prediction models have been created based on molecular structure-physical and chemical property data from the library. All the models have good fitting ability with R2 values above 0.97. Then, a variational autoencoder generation model has been established using the molecular structures from the library, focusing on the structure of saturated cycloalkanes. The structure-property prediction model was then applied to the newly generated molecules, assessing their density, volumetric calorific value, and melting point. As a result, 70000 novel molecular structures were generated, and 25 molecular structures meeting the criteria for high energy density fuels were identified. The established variational autoencoder model in this study effectively assimilates the structural information from the sample set and autonomously generates novel high energy density fuels, which is difficult to achieve in traditional experimental methods.
基金financially supported by the National Natural Science Foundation of China under Grant 62372369,52107229,62272383the Key Research and Development Program of Shaanxi Province(2024GX-YBXM-442)Natural Science Basic Research Program of Shaanxi Province(2024JC-YBMS-477)。
文摘To predict the lithium-ion(Li-ion)battery degradation trajectory in the early phase,arranging the maintenance of battery energy storage systems is of great importance.However,under different operation conditions,Li-ion batteries present distinct degradation patterns,and it is challenging to capture negligible capacity fade in early cycles.Despite the data-driven method showing promising performance,insufficient data is still a big issue since the ageing experiments on the batteries are too slow and expensive.In this study,we proposed twin autoencoders integrated into a two-stage method to predict the early cycles'degradation trajectories.The two-stage method can properly predict the degradation from course to fine.The twin autoencoders serve as a feature extractor and a synthetic data generator,respectively.Ultimately,a learning procedure based on the long-short term memory(LSTM)network is designed to hybridize the learning process between the real and synthetic data.The performance of the proposed method is verified on three datasets,and the experimental results show that the proposed method can achieve accurate predictions compared to its competitors.
基金supported by the National Natural Science Foundation of China(Nos.42530801,42425208)the Natural Science Foundation of Hubei Province(China)(No.2023AFA001)+1 种基金the MOST Special Fund from State Key Laboratory of Geological Processes and Mineral Resources,China University of Geosciences(No.MSFGPMR2025-401)the China Scholarship Council(No.202306410181)。
文摘Geochemical survey data are essential across Earth Science disciplines but are often affected by noise,which can obscure important geological signals and compromise subsequent prediction and interpretation.Quantifying prediction uncertainty is hence crucial for robust geoscientific decision-making.This study proposes a novel deep learning framework,the Spatially Constrained Variational Autoencoder(SC-VAE),for denoising geochemical survey data with integrated uncertainty quantification.The SC-VAE incorporates spatial regularization,which enforces spatial coherence by modeling inter-sample relationships directly within the latent space.The performance of the SC-VAE was systematically evaluated against a standard Variational Autoencoder(VAE)using geochemical data from the gold polymetallic district in the northwestern part of Sichuan Province,China.Both models were optimized using Bayesian optimization,with objective functions specifically designed to maintain essential geostatistical characteristics.Evaluation metrics include variogram analysis,quantitative measures of spatial interpolation accuracy,visual assessment of denoised maps,and statistical analysis of data distributions,as well as decomposition of uncertainties.Results show that the SC-VAE achieves superior noise suppression and better preservation of spatial structure compared to the standard VAE,as demonstrated by a significant reduction in the variogram nugget effect and an increased partial sill.The SC-VAE produces denoised maps with clearer anomaly delineation and more regularized data distributions,effectively mitigating outliers and reducing kurtosis.Additionally,it delivers improved interpolation accuracy and spatially explicit uncertainty estimates,facilitating more reliable and interpretable assessments of prediction confidence.The SC-VAE framework thus provides a robust,geostatistically informed solution for enhancing the quality and interpretability of geochemical data,with broad applicability in mineral exploration,environmental geochemistry,and other Earth Science domains.
文摘High-resolution spatiotemporal simulations effectively capture the complexities of atmospheric plume sion disper-in complex terrain.However,their high computational cost makes them impractical for applications requiring rapid responses or iterative processes,such as optimization,uncertainty quantification,or inverse modeling.To address this challenge,this work introduces the Dual-Stage Temporal Three-dimensional UNet Super-resolution(DST3D-UNet-SR)model,a highly efficient deep learning model for plume dispersion predictions.DST3D-UNet-SR is composed of two sequential modules:the temporal module(TM),which predicts the transient evolution of a plume in complex terrain from low-resolution temporal data,and the spatial refinement module(SRM),which subsequently enhances the spatial resolution of the TM predictions.We train DST3D-UNet-SR using a comprehensive dataset derived from high-resolution large eddy simulations(LES)of plume transport.We propose the DST3D-UNet-SR model to significantly accelerate LES of three-dimensional(3D)plume dispersion by three orders of magnitude.Additionally,the model demonstrates the ability to dynamically adapt to evolving conditions through the incorporation of new observational data,substantially improving prediction accuracy in high-concentration regions near the source.
基金Youth Foundation of National Natural Science Foundation of China (No. 52204020)Distinguished Young Foundation of National Natural Science Foundation of China (No. 52125401).
文摘As oil and gas exploration continues to progress into deeper and unconventional reservoirs,the likelihood of kick risk increases,making kick warning a critical factor in ensuring drilling safety and efficiency.Due to the scarcity of kick samples,traditional supervised models perform poorly,and significant fluctuations in field data lead to high false alarm rates.This study proposes an unsupervised graph autoencoder(GAE)-based kick warning method,which effectively reduces false alarms by eliminating the influence of field engineer operations and incorporating real-time model updates.The method utilizes the GAE model to process time-series data during drilling,accurately identifying kick risk while overcoming challenges related to small sample sizes and missing features.To further reduce false alarms,the weighted dynamic time warping(WDTW)algorithm is introduced to identify fluctuations in logging data caused by field engineer operations during drilling,with real-time updates applied to prevent normal conditions from being misclassified as kick risk.Experimental results show that the GAE-based kick warning method achieves an accuracy of 92.7%and significantly reduces the false alarm rate.The GAE model continues to operate effectively even under conditions of missing features and issues kick warnings 4 min earlier than field engineers,demonstrating its high sensitivity and robustness.After integrating the WDTW algorithm and real-time updates,the false alarm rate is reduced from 17.3%to 5.6%,further improving the accuracy of kick warnings.The proposed method provides an efficient and reliable approach for kick warning in drilling operations,offering strong practical value and technical support for the intelligent management of future drilling operations.
基金Supported by Sichuan Science and Technology Program(2023YFSY0026,2023YFH0004).
文摘3D medical image reconstruction has significantly enhanced diagnostic accuracy,yet the reliance on densely sampled projection data remains a major limitation in clinical practice.Sparse-angle X-ray imaging,though safer and faster,poses challenges for accurate volumetric reconstruction due to limited spatial information.This study proposes a 3D reconstruction neural network based on adaptive weight fusion(AdapFusionNet)to achieve high-quality 3D medical image reconstruction from sparse-angle X-ray images.To address the issue of spatial inconsistency in multi-angle image reconstruction,an innovative adaptive fusion module was designed to score initial reconstruction results during the inference stage and perform weighted fusion,thereby improving the final reconstruction quality.The reconstruction network is built on an autoencoder(AE)framework and uses orthogonal-angle X-ray images(frontal and lateral projections)as inputs.The encoder extracts 2D features,which the decoder maps into 3D space.This study utilizes a lung CT dataset to obtain complete three-dimensional volumetric data,from which digitally reconstructed radiographs(DRR)are generated at various angles to simulate X-ray images.Since real-world clinical X-ray images rarely come with perfectly corresponding 3D“ground truth,”using CT scans as the three-dimensional reference effectively supports the training and evaluation of deep networks for sparse-angle X-ray 3D reconstruction.Experiments conducted on the LIDC-IDRI dataset with simulated X-ray images(DRR images)as training data demonstrate the superior performance of AdapFusionNet compared to other fusion methods.Quantitative results show that AdapFusionNet achieves SSIM,PSNR,and MAE values of 0.332,13.404,and 0.163,respectively,outperforming other methods(SingleViewNet:0.289,12.363,0.182;AvgFusionNet:0.306,13.384,0.159).Qualitative analysis further confirms that AdapFusionNet significantly enhances the reconstruction of lung and chest contours while effectively reducing noise during the reconstruction process.The findings demonstrate that AdapFusionNet offers significant advantages in 3D reconstruction of sparse-angle X-ray images.
基金funded by UKRI EPSRC Grant EP/W020408/1 Project SPRITE+2:The Security,Privacy,Identity,and Trust Engagement Network plus(phase 2)for this studyfunded by PhD project RS718 on Explainable AI through the UKRI EPSRC Grant-funded Doctoral Training Centre at Swansea University.
文摘During its growth stage,the plant is exposed to various diseases.Detection and early detection of crop diseases is amajor challenge in the horticulture industry.Crop infections can harmtotal crop yield and reduce farmers’income if not identified early.Today’s approved method involves a professional plant pathologist to diagnose the disease by visual inspection of the afflicted plant leaves.This is an excellent use case for Community Assessment and Treatment Services(CATS)due to the lengthy manual disease diagnosis process and the accuracy of identification is directly proportional to the skills of pathologists.An alternative to conventional Machine Learning(ML)methods,which require manual identification of parameters for exact results,is to develop a prototype that can be classified without pre-processing.To automatically diagnose tomato leaf disease,this research proposes a hybrid model using the Convolutional Auto-Encoders(CAE)network and the CNN-based deep learning architecture of DenseNet.To date,none of the modern systems described in this paper have a combined model based on DenseNet,CAE,and ConvolutionalNeuralNetwork(CNN)todiagnose the ailments of tomato leaves automatically.Themodelswere trained on a dataset obtained from the Plant Village repository.The dataset consisted of 9920 tomato leaves,and the model-tomodel accuracy ratio was 98.35%.Unlike other approaches discussed in this paper,this hybrid strategy requires fewer training components.Therefore,the training time to classify plant diseases with the trained algorithm,as well as the training time to automatically detect the ailments of tomato leaves,is significantly reduced.