The Internet of Things (IoT) is integral to modern infrastructure, enabling connectivity among a wide range of devices from home automation to industrial control systems. With the exponential increase in data generated by these interconnected devices, robust anomaly detection mechanisms are essential. Anomaly detection in this dynamic environment necessitates methods that can accurately distinguish between normal and anomalous behavior by learning intricate patterns. This paper presents a novel approach utilizing generative adversarial networks (GANs) for anomaly detection in IoT systems. However, optimizing GANs involves tuning hyper-parameters such as learning rate, batch size, and optimization algorithms, which can be challenging due to the non-convex nature of GAN loss functions. To address this, we propose a five-dimensional gray wolf optimizer (5DGWO) to optimize GAN hyper-parameters. The 5DGWO introduces two new types of wolves: gamma (γ) for improved exploitation and convergence, and theta (θ) for enhanced exploration and escaping local minima. The proposed system framework comprises four key stages: 1) preprocessing, 2) generative model training, 3) autoencoder (AE) training, and 4) predictive model training. The generative models are utilized to assist the AE training, and the final predictive models (including convolutional neural network (CNN), deep belief network (DBN), recurrent neural network (RNN), random forest (RF), and extreme gradient boosting (XGBoost)) are trained using the generated data and AE-encoded features. We evaluated the system on three benchmark datasets: NSL-KDD, UNSW-NB15, and IoT-23. Experiments conducted on diverse IoT datasets show that our method outperforms existing anomaly detection strategies and significantly reduces false positives. The 5DGWO-GAN-CNNAE exhibits superior performance in various metrics, including accuracy, recall, precision, root mean square error (RMSE), and convergence trend. The proposed 5DGWO-GAN-CNNAE achieved the lowest RMSE values across the NSL-KDD, UNSW-NB15, and IoT-23 datasets, with values of 0.24, 1.10, and 0.09, respectively. Additionally, it attained the highest accuracy, ranging from 94% to 100%. These results suggest a promising direction for future IoT security frameworks, offering a scalable and efficient solution to safeguard against evolving cyber threats.
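To make the hyper-parameter search idea in the abstract above concrete, the following is a minimal sketch of the standard gray wolf optimizer with its three usual leaders (alpha, beta, delta). It is not the paper's 5DGWO: the gamma and theta wolves are not reproduced here because the abstract does not specify their update rules. The function name `grey_wolf_optimize`, the surrogate `toy_validation_loss`, and all default values are illustrative assumptions.

```python
import random


def grey_wolf_optimize(objective, bounds, n_wolves=20, n_iters=200, seed=0):
    """Minimize `objective` over box `bounds` with the standard GWO update.

    Assumption: this is the textbook three-leader GWO, not the 5DGWO variant.
    """
    rng = random.Random(seed)
    wolves = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_wolves)]
    best_pos, best_val = None, float("inf")

    for it in range(n_iters):
        # Rank the pack; the three best wolves lead the hunt.
        ranked = sorted(wolves, key=objective)
        alpha, beta, delta = (list(w) for w in ranked[:3])
        alpha_val = objective(alpha)
        if alpha_val < best_val:
            best_pos, best_val = alpha, alpha_val

        # `a` decays linearly from 2 to 0: exploration first, exploitation later.
        a = 2.0 * (1.0 - it / n_iters)
        new_wolves = []
        for wolf in wolves:
            new_pos = []
            for d, (lo, hi) in enumerate(bounds):
                guided = 0.0
                for leader in (alpha, beta, delta):
                    A = 2.0 * a * rng.random() - a
                    C = 2.0 * rng.random()
                    guided += leader[d] - A * abs(C * leader[d] - wolf[d])
                # Average the three leader-guided moves and clip to the box.
                new_pos.append(min(hi, max(lo, guided / 3.0)))
            new_wolves.append(new_pos)
        wolves = new_wolves

    # Account for the final generation as well.
    final = min(wolves, key=objective)
    final_val = objective(final)
    if final_val < best_val:
        best_pos, best_val = list(final), final_val
    return best_pos, best_val


def toy_validation_loss(params):
    """Hypothetical stand-in for a GAN validation loss (not from the paper).

    params[0] is log10(learning rate), params[1] is log2(batch size).
    """
    log_lr, log_batch = params
    return (log_lr + 3.0) ** 2 + (log_batch - 6.0) ** 2


if __name__ == "__main__":
    position, value = grey_wolf_optimize(toy_validation_loss, [(-5.0, -1.0), (3.0, 9.0)])
    print(position, value)
```

In a real setting the objective would train a GAN briefly and return a validation metric, which makes each evaluation expensive; that cost is the usual reason for keeping the pack size and iteration count small.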
Existing imaging techniques cannot simultaneously achieve high resolution and a wide field of view, and manual multi-mineral segmentation in shale lacks precision. To address these limitations, we propose a comprehensive framework based on a generative adversarial network (GAN) for characterizing pore structure properties of shale, which incorporates image augmentation, super-resolution reconstruction, and multi-mineral auto-segmentation. Using real 2D and 3D shale images, the framework was assessed through correlation function, entropy, porosity, pore size distribution, and permeability. The application results show that this framework enables the enhancement of 3D low-resolution digital cores by a scale factor of 8, without paired shale images, effectively reconstructing the unresolved fine-scale pores under a low resolution, rather than merely denoising, deblurring, and edge clarification. The trained GAN-based segmentation model effectively improves manual multi-mineral segmentation results, resulting in a strong resemblance to real samples in terms of pore size distribution and permeability. This framework significantly improves the characterization of complex shale microstructures and can be expanded to other heterogeneous porous media, such as carbonate, coal, and tight sandstone reservoirs.
The development of generative architectures has resulted in numerous novel deep-learning models that generate images using text inputs. However, humans naturally use speech for visualization prompts. Therefore, this paper proposes an architecture that integrates speech prompts as input to an image-generation Generative Adversarial Network (GAN) model, leveraging Speech-to-Text translation along with the CLIP+VQGAN model. The proposed method involves translating speech prompts into text, which is then used by the Contrastive Language-Image Pretraining (CLIP) + Vector Quantized Generative Adversarial Network (VQGAN) model to generate images. This paper outlines the steps required to implement such a model and describes in detail the methods used for evaluating the model. The GAN model successfully generates artwork from descriptions using speech and text prompts. Experimental outcomes of synthesized images demonstrate that the proposed methodology can produce beautiful abstract visuals containing elements from the input prompts. The model achieved a Fréchet Inception Distance (FID) score of 28.75, showcasing its capability to produce high-quality and diverse images. The proposed model can find numerous applications in educational, artistic, and design spaces due to its ability to generate images using speech and the distinct abstract artistry of the output images. This capability is demonstrated by giving the model out-of-the-box prompts to generate never-before-seen images with plausible realistic qualities.
To address the widespread data shortage problem in battery research, this paper proposes a generative adversarial network model that combines deep convolutional networks, the Wasserstein distance, and the gradient penalty to achieve data augmentation. To lower the threshold for implementing the proposed method, transfer learning is further introduced. The W-DC-GAN-GP-TL framework is thereby formed. This framework is evaluated on three different publicly available datasets to judge the quality of generated data. Through visual comparisons and the examination of two visualization methods (probability density function (PDF) and principal component analysis (PCA)), it is demonstrated that the generated data is hard to distinguish from the real data. The application of generated data for training a battery state model using transfer learning is further evaluated. Specifically, Bi-GRU-based and Transformer-based methods are implemented on two separate datasets for estimating state of health (SOH) and state of charge (SOC), respectively. The results indicate that the proposed framework demonstrates satisfactory performance in different scenarios: for the data replacement scenario, where real data are removed and replaced with generated data, the state estimator accuracy decreases only slightly; for the data enhancement scenario, the estimator accuracy is further improved. The root mean square error (RMSE) of the SOH and SOC estimates is as low as 0.69% and 0.58%, respectively, after applying the proposed framework. This framework provides a reliable method for enriching battery measurement data. It is a generalized framework capable of generating a variety of time series data.
The issues of seepage in calcareous sand foundations and backfills have a potentially detrimental effect on the stability and safety of superstructures. Simplifying calcareous sand grains as spheres or ellipsoids in numerical simulations may lead to significant inaccuracies. In this paper, we present a novel intelligence framework based on a deep convolutional generative adversarial network (DCGAN). A DCGAN model was trained using a training dataset comprising 11,625 real particles for the random generation of three-dimensional calcareous sand particles. Subsequently, 3800 realistic calcareous sand particles with intra-particle voids were generated. Generative fidelity and validity of the DCGAN model were well verified by the consistency of the statistical values of nine morphological parameters of both the training dataset and the generated dataset. Digital calcareous sand columns were obtained through gravitational deposition simulation of the generated particles. Directional seepage simulations were conducted, and the vertical permeability values of the sand columns were found to be in accordance with the objective law. The results demonstrate the potential of the proposed framework for stochastic modeling and multi-scale simulation of the seepage behaviors in calcareous sand foundations and backfills.
Tropical cyclones (TCs) are complex and powerful weather systems, and accurately forecasting their path, structure, and intensity remains a critical focus and challenge in meteorological research. In this paper, we propose an Attention Spatio-Temporal predictive Generative Adversarial Network (AST-GAN) model for predicting the temporal and spatial distribution of TCs. The model forecasts the spatial distribution of TC wind speeds for the next 15 hours at 3-hour intervals, emphasizing the cyclone's center, high wind-speed areas, and its asymmetric structure. To effectively capture spatiotemporal feature transfer at different time steps, we employ a channel attention mechanism for feature selection, enhancing model performance and reducing parameter redundancy. We utilized High-Resolution Weather Research and Forecasting (HWRF) data to train our model, allowing it to assimilate a wide range of TC motion patterns. The model is versatile and can be applied to various complex scenarios, such as multiple TCs moving simultaneously or TCs approaching landfall. Our proposed model demonstrates superior forecasting performance, achieving a root-mean-square error (RMSE) of 0.71 m s^(-1) for overall wind speed and 2.74 m s^(-1) for maximum wind speed when benchmarked against ground truth data from HWRF. Furthermore, the model underwent optimization and independent testing using ERA5 reanalysis data, showcasing its stability and scalability. After fine-tuning on the ERA5 dataset, the model achieved an RMSE of 1.33 m s^(-1) for wind speed and 1.75 m s^(-1) for maximum wind speed. The AST-GAN model outperforms other state-of-the-art models in RMSE on both the HWRF and ERA5 datasets, maintaining its superior performance and demonstrating its effectiveness for spatiotemporal prediction of TCs.
The generation of high-quality, realistic faces has emerged as a key field of research in computer vision. This paper proposes a robust approach that combines a Super-Resolution Generative Adversarial Network (SRGAN) with a Pyramid Attention Module (PAM) to enhance the quality of deep face generation. The SRGAN framework is designed to improve the resolution of generated images, addressing common challenges such as blurriness and a lack of intricate details. The Pyramid Attention Module further complements the process by focusing on multi-scale feature extraction, enabling the network to capture finer details and complex facial features more effectively. The proposed method was trained and evaluated over 100 epochs on the CelebA dataset, demonstrating consistent improvements in image quality and a marked decrease in generator and discriminator losses, reflecting the model's capacity to learn and synthesize high-quality images effectively, given adequate computational resources. Experimental results show that the SRGAN model with the PAM module yields an aggregate discriminator loss of 0.055 for real images and 0.043 for fake images, and a generator loss of 10.58 after training for 100 epochs. The model also achieves a structural similarity index measure of 0.923, outperforming the other models considered in this study.
This paper addresses the performance degradation issue in a fast radio burst search pipeline based on deep learning. This issue is caused by the class imbalance of the radio frequency interference samples in the training dataset, and one solution is applied to improve the distribution of the training data by augmenting minority class samples using a deep convolutional generative adversarial network. Experimental results demonstrate that retraining the deep learning model with the newly generated dataset leads to a new fast radio burst classifier, which effectively reduces false positives caused by periodic wide-band impulsive radio frequency interference, thereby enhancing the performance of the search pipeline.
The increasing demand for radio-authorized applications in the 6G era necessitates enhanced monitoring and management of radio resources, particularly for precise control over the electromagnetic environment. The radio map serves as a crucial tool for describing signal strength distribution within the current electromagnetic environment. However, most existing algorithms rely on sparse measurements of radio strength, disregarding the impact of building information. In this paper, we propose a spectrum cartography (SC) algorithm that eliminates the need for relying on sparse ground-based radio strength measurements by utilizing a satellite network to collect data on buildings and transmitters. Our algorithm leverages a Pix2Pix Generative Adversarial Network (GAN) to construct accurate radio maps using transmitter information within real geographical environments. Finally, simulation results demonstrate that our algorithm exhibits superior accuracy compared to previously proposed methods.
An intelligent diagnosis method based on self-adaptive Wasserstein dual generative adversarial networks and feature fusion is proposed to address problems such as insufficient sample size and incomplete fault feature extraction, which are commonly faced by rolling bearings and lead to low diagnostic accuracy. Initially, dual models of the Wasserstein deep convolutional generative adversarial network incorporating gradient penalty (1D-2DWDCGAN) are constructed to augment the original dataset. A self-adaptive loss threshold control training strategy is introduced, establishing a self-adaptive balancing mechanism for stable model training. Subsequently, a diagnostic model based on multidimensional feature fusion is designed, wherein complex features from various dimensions are extracted, merging the original signal waveform features, structured features, and time-frequency features into a deep composite feature representation that encompasses multiple dimensions and scales; thus, efficient and accurate small sample fault diagnosis is facilitated. Finally, experiments on the bearing fault dataset of Case Western Reserve University and the fault simulation experimental platform dataset of this research group show that this method effectively supplements the dataset and remarkably improves the diagnostic accuracy. The diagnostic accuracy after data augmentation reached 99.94% and 99.87% in the two experimental environments, respectively. In addition, robustness analysis is conducted on the diagnostic accuracy of the proposed method under different noise backgrounds, verifying its good generalization performance.
Full waveform inversion (FWI) has shown great potential in the detection of musculoskeletal disease. However, FWI is an ill-posed inverse problem and places high requirements on the initial model during the imaging process. An inaccurate initial model may lead to local minima in the inversion and unexpected imaging results caused by the cycle-skipping phenomenon. Deep learning methods have been applied in musculoskeletal imaging, but need a large amount of data for training. Inspired by work on generative adversarial networks with physics-informed constraints, we propose a method named bone ultrasound imaging with physics-informed generative adversarial network (BUIPIGAN) to achieve unsupervised multi-parameter imaging for musculoskeletal tissues, focusing on speed of sound (SOS) and density. In the in-silico experiments using a ring array transducer, conventional FWI methods and BUIPIGAN were employed for multi-parameter imaging of two musculoskeletal tissue models. The results were evaluated based on visual appearance, structural similarity index measure (SSIM), signal-to-noise ratio (SNR), and relative error (RE). For SOS imaging of the tibia–fibula model, the proposed BUIPIGAN achieved accurate SOS imaging with the best performance. The specific quantitative metrics for SOS imaging were SSIM 0.9573, SNR 28.70 dB, and RE 5.78%. For the multi-parameter imaging of the tibia–fibula and human forearm, the BUIPIGAN successfully reconstructed SOS and density distributions with SSIM above 94%, SNR above 21 dB, and RE below 10%. The BUIPIGAN also showed robustness across various noise levels (i.e., 30 dB, 10 dB). The results demonstrated that the proposed BUIPIGAN can achieve high-accuracy SOS and density imaging, proving its potential for applications in musculoskeletal ultrasound imaging.
Multifocal metalenses are of great concern in optical communications, optical imaging and micro-optics systems, but their design is extremely challenging. In recent years, deep learning methods have provided novel solutions to the design of optical planar devices. Here, an approach is proposed to explore the use of generative adversarial networks (GANs) to realize the design of metalenses with different focusing positions at dual wavelengths. This approach includes a forward network and an inverse network, where the former predicts the optical response of meta-atoms and the latter generates structures that meet specific requirements. Compared to the traditional search method, the inverse network demonstrates higher precision and efficiency in designing a dual-wavelength bifocal metalens. The results will provide insights and methodologies for the design of tunable wavelength metalenses, while also highlighting the potential of deep learning in optical device design.
This study addresses the pressing challenge of generating realistic strong ground motion data for simulating earthquakes, a crucial component in pre-earthquake risk assessments and post-earthquake disaster evaluations, particularly suited for regions with limited seismic data. Herein, we report a generative adversarial network (GAN) framework capable of simulating strong ground motions under various environmental conditions using only a small set of real earthquake records. The constructed GAN model generates ground motions based on continuous physical variables such as source distance, site conditions, and magnitude, effectively capturing the complexity and diversity of ground motions under different scenarios. This capability allows the proposed model to approximate real seismic data, making it applicable to a wide range of engineering purposes. Using the Shandong Pingyuan earthquake as an example, a specialized dataset was constructed based on regional real ground motion records. The response spectrum at target locations was obtained through inverse distance-weighted interpolation of actual response spectra, followed by continuous wavelet transform to derive the ground motion time histories at these locations. Through iterative parameter adjustments, the constructed GAN model learned the probability distribution of strong-motion data for this event. The trained model generated three-component ground-motion time histories with clear P-wave and S-wave characteristics, accurately reflecting the non-stationary nature of seismic records. Statistical comparisons between synthetic and real response spectra, waveform envelopes, and peak ground acceleration show a high degree of similarity, underscoring the effectiveness of the model in replicating both the statistical and physical characteristics of real ground motions. These findings validate the feasibility of GANs for generating realistic earthquake data in data-scarce regions, providing a reliable approach for enriching regional ground motion databases. Additionally, the results suggest that GAN-based networks are a powerful tool for building predictive models in seismic hazard analysis.
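The inverse distance-weighted interpolation step mentioned in the abstract above can be sketched as follows. This is a generic illustration, not the authors' implementation: the function name `idw_interpolate`, the planar station coordinates, and the distance exponent of 2 are all assumptions. For a response spectrum, the same function would be applied once per spectral period.

```python
import math


def idw_interpolate(stations, target, power=2.0):
    """Estimate a value at `target` from nearby station values.

    stations: list of ((x, y), value) pairs in planar coordinates (assumed).
    power:    distance exponent; 2.0 is a common default, not a value from the paper.
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for (x, y), value in stations:
        dist = math.hypot(x - target[0], y - target[1])
        if dist == 0.0:
            # The target coincides with a station: return its value directly.
            return value
        weight = 1.0 / dist ** power
        weighted_sum += weight * value
        weight_total += weight
    return weighted_sum / weight_total
```

Closer stations dominate the estimate, and a larger exponent sharpens that effect.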
Background: Super-resolution generative adversarial networks (SR-GANs) show promise for enhancing the quality of retinal, MRI, and echocardiography images in medical imaging. Using these networks, it is possible to produce high-quality images even from low-quality medical scans. Methods: SR-GANs use progressive growth from low to high resolution in two 2× stages, filters of multiple sizes, and powerful loss functions. The medical super-resolution network and the denoising SR-GAN target problems such as image noise and artifacts to improve image stability, feature extraction, and visual appearance. Results: Quantitative assessment found that SR-GAN-based approaches lead to marked improvements, including increases in PSNR of up to 4.85 dB and improvements in SSIM of between 0.04 and 0.05. These gains exceed those of traditional super-resolution methods and help doctors obtain clear images of the mitral valve in cardiac ultrasonography. Conclusion: It is anticipated that applying SR-GANs in clinical tasks will increase the accuracy of diagnoses, reduce the burden on patients, and widen the application of super-resolution methods in various medical procedures. The results show that SR-GANs improve the picture quality of echocardiograms used for diagnosing mitral valve problems. Now that the model has been validated in research settings, future studies should apply it to real-world clinical cases, test it across a range of imaging devices, and refine the system so that it is efficient enough for use in medical settings.
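For readers unfamiliar with the PSNR figures quoted above, the metric can be computed as in this short sketch. It assumes 8-bit grayscale images stored as nested lists of pixel values; the function name `psnr` and the `max_value` default are illustrative choices, not details from the paper.

```python
import math


def psnr(reference, test, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized images.

    Images are nested lists of pixel intensities (an assumption for this sketch).
    """
    squared_error = 0.0
    count = 0
    for ref_row, test_row in zip(reference, test):
        for ref_px, test_px in zip(ref_row, test_row):
            squared_error += (ref_px - test_px) ** 2
            count += 1
    mse = squared_error / count
    if mse == 0.0:
        # Identical images: PSNR is unbounded.
        return math.inf
    return 10.0 * math.log10(max_value ** 2 / mse)
```

Because the scale is logarithmic, a gain of about 3 dB corresponds to roughly halving the mean squared error.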
Symmetric encryption algorithms learned by previously proposed end-to-end adversarial network encryption communication systems are deterministic. With the same key and the same plaintext, a deterministic algorithm always produces the same ciphertext. This means that a key in the deterministic encryption algorithm can only be used once, so the encryption is not practical. To solve this problem, a nondeterministic symmetric encryption end-to-end communication system based on generative adversarial networks is proposed. We design a nonce-based adversarial neural network model, where a "nonce" (standing for "number used only once") is passed to the communication participants and does not need to be secret. Moreover, we optimize the network structure by adding Batch Normalization (BN) to the Convolutional Neural Networks (CNNs), selecting appropriate activation functions, and setting appropriate CNN parameters. Results of experiments and analysis show that our system can achieve nondeterministic symmetric encryption, where Alice encrypting the same plaintext with the same key twice generates different ciphertexts, and Bob can decrypt all these different ciphertexts of the same plaintext to the correct plaintext. Our proposed system also maintains fast convergence and correct decryption when the plaintext length is 256 or even longer.
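The nonce property described above can be illustrated with a conventional, non-neural toy cipher. This sketch is not the paper's learned model and is not secure for real use: it only shows how a public nonce lets the same key and plaintext yield different ciphertexts that still decrypt correctly. The SHA-256 counter keystream and the names `keystream`, `encrypt`, and `decrypt` are assumptions made for illustration.

```python
import hashlib
import os


def keystream(key, nonce, length):
    """Derive `length` pseudo-random bytes from key, nonce, and a block counter."""
    out = bytearray()
    counter = 0
    while len(out) < length:
        out += hashlib.sha256(key + nonce + counter.to_bytes(8, "big")).digest()
        counter += 1
    return bytes(out[:length])


def encrypt(key, plaintext, nonce=None):
    """Return (nonce, ciphertext); a fresh random nonce is drawn unless one is given."""
    if nonce is None:
        nonce = os.urandom(16)
    stream = keystream(key, nonce, len(plaintext))
    return nonce, bytes(p ^ s for p, s in zip(plaintext, stream))


def decrypt(key, nonce, ciphertext):
    """Recover the plaintext; the nonce travels in the clear with the ciphertext."""
    stream = keystream(key, nonce, len(ciphertext))
    return bytes(c ^ s for c, s in zip(ciphertext, stream))
```

Only the key is secret here; reusing a nonce with the same key would reproduce the deterministic behavior the abstract identifies as the problem.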
The generative adversarial network (GAN) has been one of the most promising methods for unsupervised learning in recent years. GAN works via the concept of adversarial training and has shown excellent performance in image synthesis, image super-resolution, video generation, image translation, etc. Compared with classical algorithms, quantum algorithms have unique advantages in dealing with complex tasks, and with the rapid development of quantum technology, quantum machine learning (QML) has become one of the most promising quantum algorithms. Specifically, the quantum generative adversarial network (QGAN) has shown potential exponential quantum speedups in terms of performance. Meanwhile, QGAN also exhibits some problems, such as barren plateaus, unstable gradients, model collapse, and the absence of a complete scientific evaluation system. How to improve the theory of QGAN and apply it has attracted the attention of researchers. In this paper, we comprehensively and deeply review recently proposed GAN and QGAN models and their applications, and we discuss the existing problems and future research trends of QGAN.
In this paper, we propose a hybrid model aiming to map the input noise vector to the label of the image generated by the generative adversarial network (GAN). This model mainly consists of a pre-trained deep convolutional generative adversarial network (DCGAN) and a classifier. By using the model, we visualize the distribution of two-dimensional input noise that leads to a specific type of generated image after each training epoch of the GAN. The visualization reveals the distribution feature of the input noise vector and the performance of the generator. With this feature, we try to build a guided generator (GG) with the ability to produce a fake image we need. Two methods are proposed to build GG. One is the most significant noise (MSN) method, and the other utilizes labeled noise. The MSN method can generate images precisely but with fewer variations. In contrast, the labeled noise method has more variations but is slightly less stable. Finally, we propose a criterion to measure the performance of the generator, which can be used as a loss function to effectively train the network.
Recently, generative adversarial networks (GANs) have become a research focus of artificial intelligence. Inspired by the two-player zero-sum game, GANs comprise a generator and a discriminator, both trained under the adversarial learning idea. The goal of GANs is to estimate the potential distribution of real data samples and generate new samples from that distribution. Since their initiation, GANs have been widely studied due to their enormous prospect for applications, including image and vision computing, speech and language processing, etc. In this review paper, we summarize the state of the art of GANs and look into the future. Firstly, we survey GANs' proposal background, theoretic and implementation models, and application fields. Then, we discuss GANs' advantages and disadvantages, and their development trends. In particular, we investigate the relation between GANs and parallel intelligence, with the conclusion that GANs have a great potential in parallel systems research in terms of virtual-real interaction and integration. Clearly, GANs can provide substantial algorithmic support for parallel intelligence.
Deep Learning (DL) is a powerful tool that has achieved tremendous success in areas such as Computer Vision, Speech Recognition, and Natural Language Processing. Since Automated Modulation Classification (AMC) is an important part of Cognitive Radio Networks, we explore the potential of DL in solving the signal modulation recognition problem. It cannot be overlooked that DL models are complex, which makes them prone to over-fitting. DL models require a large amount of training data to combat over-fitting, but manually adding high-quality labels to training data is not always cheap or accessible, especially in real-time systems, which may encounter unprecedented data. Semi-supervised learning is a way to exploit unlabeled data effectively to reduce over-fitting in DL. In this paper, we extend Generative Adversarial Networks (GANs) to semi-supervised learning and show that this method can be used to create a more data-efficient classifier.
In recent years, landslide susceptibility mapping has substantially improved with advances in machine learning. However, challenges remain in landslide mapping due to the limited availability of inventory data. In this paper, a novel method that improves the performance of machine learning techniques is presented. The proposed method creates synthetic inventory data using Generative Adversarial Networks (GANs) for improving the prediction of landslides. In this research, landslide inventory data of 156 landslide locations were identified in Cameron Highlands, Malaysia, taken from previous projects the authors worked on. Elevation, slope, aspect, plan curvature, profile curvature, total curvature, lithology, land use and land cover (LULC), distance to the road, distance to the river, stream power index (SPI), sediment transport index (STI), terrain roughness index (TRI), topographic wetness index (TWI), and vegetation density are the geo-environmental factors considered in this study, based on suggestions from previous works on Cameron Highlands. To show the capability of GANs in improving landslide prediction models, this study tests the proposed GAN model with benchmark models, namely Artificial Neural Network (ANN), Support Vector Machine (SVM), Decision Trees (DT), Random Forest (RF), and Bagging ensemble models with ANN and SVM models. These models were validated using the area under the receiver operating characteristic curve (AUROC). The DT, RF, SVM, ANN, and Bagging ensemble achieved AUROC values of 0.90, 0.94, 0.86, 0.69, and 0.82 for training and 0.76, 0.81, 0.85, 0.72, and 0.75 for testing, respectively. When using additional samples, the same models achieved AUROC values of 0.92, 0.94, 0.88, 0.75, and 0.84 for training and 0.78, 0.82, 0.82, 0.78, and 0.80 for testing, respectively. Using the additional samples improved the test accuracy of all the models except SVM. As a result, this research showed that in data-scarce environments, utilizing GANs to generate supplementary samples is promising because it can improve the predictive capability of common landslide prediction models.
Funding: The work described in this paper has been developed within the project PRESECREL (PID2021-124502OB-C43).
Abstract: The Internet of Things (IoT) is integral to modern infrastructure, enabling connectivity among a wide range of devices from home automation to industrial control systems. With the exponential increase in data generated by these interconnected devices, robust anomaly detection mechanisms are essential. Anomaly detection in this dynamic environment necessitates methods that can accurately distinguish between normal and anomalous behavior by learning intricate patterns. This paper presents a novel approach utilizing generative adversarial networks (GANs) for anomaly detection in IoT systems. However, optimizing GANs involves tuning hyper-parameters such as learning rate, batch size, and optimization algorithms, which can be challenging due to the non-convex nature of GAN loss functions. To address this, we propose a five-dimensional gray wolf optimizer (5DGWO) to optimize GAN hyper-parameters. The 5DGWO introduces two new types of wolves: gamma (γ) for improved exploitation and convergence, and theta (θ) for enhanced exploration and escaping local minima. The proposed system framework comprises four key stages: 1) preprocessing, 2) generative model training, 3) autoencoder (AE) training, and 4) predictive model training. The generative models are utilized to assist the AE training, and the final predictive models (including convolutional neural network (CNN), deep belief network (DBN), recurrent neural network (RNN), random forest (RF), and extreme gradient boosting (XGBoost)) are trained using the generated data and AE-encoded features. We evaluated the system on three benchmark datasets: NSL-KDD, UNSW-NB15, and IoT-23. Experiments conducted on these diverse IoT datasets show that our method outperforms existing anomaly detection strategies and significantly reduces false positives. The 5DGWO-GAN-CNNAE exhibits superior performance in various metrics, including accuracy, recall, precision, root mean square error (RMSE), and convergence trend. The proposed 5DGWO-GAN-CNNAE achieved the lowest RMSE values across the NSL-KDD, UNSW-NB15, and IoT-23 datasets, with values of 0.24, 1.10, and 0.09, respectively. Additionally, it attained the highest accuracy, ranging from 94% to 100%. These results suggest a promising direction for future IoT security frameworks, offering a scalable and efficient solution to safeguard against evolving cyber threats.
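To make the optimizer idea in the preceding abstract concrete, here is a minimal sketch of the standard gray wolf optimizer with its three usual leaders (alpha, beta, delta). It is not the 5DGWO of the paper: the gamma and theta wolves are not specified in the abstract and are deliberately left out. The `sphere` objective is a hypothetical stand-in for a GAN validation loss, and all function names, bounds, and default settings are assumptions chosen for illustration.

```python
import random


def sphere(x):
    """Hypothetical stand-in objective; a real run would score GAN hyper-parameters."""
    return sum(v * v for v in x)


def gray_wolf_optimize(objective, dim, bounds, n_wolves=20, n_iters=100, seed=0):
    """Standard GWO: every wolf moves toward the average of three leader-guided positions."""
    rng = random.Random(seed)
    lo, hi = bounds
    wolves = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_wolves)]
    initial_best = min(objective(w) for w in wolves)
    best_x, best_f = None, float("inf")

    for t in range(n_iters):
        # Rank the pack; the three best wolves act as alpha, beta, and delta.
        leaders = [list(w) for w in sorted(wolves, key=objective)[:3]]
        leader_f = objective(leaders[0])
        if leader_f < best_f:
            best_x, best_f = list(leaders[0]), leader_f

        a = 2.0 * (1.0 - t / n_iters)  # decays from 2 toward 0: exploration to exploitation
        new_wolves = []
        for w in wolves:
            position = []
            for d in range(dim):
                guided = []
                for leader in leaders:
                    A = 2.0 * a * rng.random() - a
                    C = 2.0 * rng.random()
                    D = abs(C * leader[d] - w[d])
                    guided.append(leader[d] - A * D)
                # Average the three guided positions and clip to the search bounds.
                position.append(min(hi, max(lo, sum(guided) / 3.0)))
            new_wolves.append(position)
        wolves = new_wolves

    # Account for the pack produced by the last update.
    final = min(wolves, key=objective)
    if objective(final) < best_f:
        best_x, best_f = list(final), objective(final)
    return best_x, best_f, initial_best


best_x, best_f, initial_best = gray_wolf_optimize(sphere, dim=5, bounds=(-5.0, 5.0))
print(best_f, "<=", initial_best)
```

In a hyper-parameter setting, each coordinate would encode one tunable quantity (for example a learning rate or a batch size mapped to a continuous range), and the objective would train and score a model, which is far more expensive than the toy function used here.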
Funding: Supported by the National Natural Science Foundation of China (U23A20595, 52034010, 52288101); the National Key Research and Development Program of China (2022YFE0203400); the Shandong Provincial Natural Science Foundation (ZR2024ZD17); and the Fundamental Research Funds for the Central Universities (23CX10004A).
Abstract: Existing imaging techniques cannot simultaneously achieve high resolution and a wide field of view, and manual multi-mineral segmentation in shale lacks precision. To address these limitations, we propose a comprehensive framework based on a generative adversarial network (GAN) for characterizing pore structure properties of shale, which incorporates image augmentation, super-resolution reconstruction, and multi-mineral auto-segmentation. Using real 2D and 3D shale images, the framework was assessed through correlation function, entropy, porosity, pore size distribution, and permeability. The application results show that this framework enables the enhancement of 3D low-resolution digital cores by a scale factor of 8 without paired shale images, effectively reconstructing the fine-scale pores that are unresolved at low resolution, rather than merely denoising, deblurring, and clarifying edges. The trained GAN-based segmentation model effectively improves manual multi-mineral segmentation results, yielding a strong resemblance to real samples in terms of pore size distribution and permeability. This framework significantly improves the characterization of complex shale microstructures and can be extended to other heterogeneous porous media, such as carbonate, coal, and tight sandstone reservoirs.
Funding: Funded by the Centre for Advanced Modelling and Geospatial Information Systems (CAMGIS), Faculty of Engineering and IT, University of Technology Sydney. Also supported by the Researchers Supporting Project, King Saud University, Riyadh, Saudi Arabia, under Ongoing Research Funding (ORF-2025-14).
Abstract: The development of generative architectures has resulted in numerous novel deep-learning models that generate images from text inputs. However, humans naturally use speech for visualization prompts. Therefore, this paper proposes an architecture that integrates speech prompts as input to an image-generation Generative Adversarial Network (GAN) model, leveraging Speech-to-Text translation along with the CLIP+VQGAN model. The proposed method translates speech prompts into text, which is then used by the Contrastive Language-Image Pretraining (CLIP) + Vector Quantized Generative Adversarial Network (VQGAN) model to generate images. This paper outlines the steps required to implement such a model and describes in detail the methods used for evaluating it. The GAN model successfully generates artwork from descriptions using speech and text prompts. Experimental outcomes of synthesized images demonstrate that the proposed methodology can produce beautiful abstract visuals containing elements from the input prompts. The model achieved a Fréchet Inception Distance (FID) score of 28.75, showcasing its capability to produce high-quality and diverse images. The proposed model can find numerous applications in educational, artistic, and design spaces due to its ability to generate images from speech and the distinct abstract artistry of the output images. This capability is demonstrated by giving the model out-of-the-box prompts to generate never-before-seen images with plausible realistic qualities.
Funding: Funded by the Bavarian State Ministry of Science, Research and Art (Grant number: H.2-F1116.WE/52/2).
Abstract: To address the widespread data shortage problem in battery research, this paper proposes a model that combines a generative adversarial network with deep convolutional networks, the Wasserstein distance, and the gradient penalty to achieve data augmentation. To lower the threshold for implementing the proposed method, transfer learning is further introduced, forming the W-DC-GAN-GP-TL framework. This framework is evaluated on 3 different publicly available datasets to judge the quality of the generated data. Through visual comparisons and the examination of two visualization methods (probability density function (PDF) and principal component analysis (PCA)), it is demonstrated that the generated data is hard to distinguish from the real data. The application of generated data for training a battery state model using transfer learning is further evaluated. Specifically, Bi-GRU-based and Transformer-based methods are implemented on 2 separate datasets for estimating state of health (SOH) and state of charge (SOC), respectively. The results indicate that the proposed framework performs satisfactorily in different scenarios: in the data replacement scenario, where real data are removed and replaced with generated data, the state estimator accuracy decreases only slightly; in the data enhancement scenario, the estimator accuracy is further improved. The estimation error for SOH and SOC is as low as 0.69% and 0.58% root mean square error (RMSE), respectively, after applying the proposed framework. This framework provides a reliable method for enriching battery measurement data. It is a generalized framework capable of generating a variety of time series data.
Funding: Financially supported by the National Natural Science Foundation of China (Grant No. 42077232), the National Natural Science Foundation for Excellent Young Scholars of China (Grant No. 52222110), and the Fundamental Research Funds for the Central Universities (Grant No. 14380229).
Abstract: Seepage in calcareous sand foundations and backfills has a potentially detrimental effect on the stability and safety of superstructures. Simplifying calcareous sand grains as spheres or ellipsoids in numerical simulations may lead to significant inaccuracies. In this paper, we present a novel intelligence framework based on a deep convolutional generative adversarial network (DCGAN). A DCGAN model was trained using a training dataset comprising 11,625 real particles for the random generation of three-dimensional calcareous sand particles. Subsequently, 3800 realistic calcareous sand particles with intra-particle voids were generated. The generative fidelity and validity of the DCGAN model were verified by the consistency of the statistical values of nine morphological parameters between the training dataset and the generated dataset. Digital calcareous sand columns were obtained through gravitational deposition simulation of the generated particles. Directional seepage simulations were conducted, and the vertical permeability values of the sand columns were found to be in accordance with the objective law. The results demonstrate the potential of the proposed framework for stochastic modeling and multi-scale simulation of seepage behavior in calcareous sand foundations and backfills.
Funding: Supported by the Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai) (No. SML2021SP201); the National Natural Science Foundation of China (Grant Nos. 42306200 and 42306216); the National Key Research and Development Program of China (Grant No. 2023YFC3008100); the Innovation Group Project of the Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai) (Grant No. 311021004); and the Oceanic Interdisciplinary Program of Shanghai Jiao Tong University (Project No. SL2021ZD203).
Abstract: Tropical cyclones (TCs) are complex and powerful weather systems, and accurately forecasting their path, structure, and intensity remains a critical focus and challenge in meteorological research. In this paper, we propose an Attention Spatio-Temporal predictive Generative Adversarial Network (AST-GAN) model for predicting the temporal and spatial distribution of TCs. The model forecasts the spatial distribution of TC wind speeds for the next 15 hours at 3-hour intervals, emphasizing the cyclone's center, high wind-speed areas, and its asymmetric structure. To effectively capture spatiotemporal feature transfer at different time steps, we employ a channel attention mechanism for feature selection, enhancing model performance and reducing parameter redundancy. We utilized High-Resolution Weather Research and Forecasting (HWRF) data to train our model, allowing it to assimilate a wide range of TC motion patterns. The model is versatile and can be applied to various complex scenarios, such as multiple TCs moving simultaneously or TCs approaching landfall. Our proposed model demonstrates superior forecasting performance, achieving a root-mean-square error (RMSE) of 0.71 m s⁻¹ for overall wind speed and 2.74 m s⁻¹ for maximum wind speed when benchmarked against ground truth data from HWRF. Furthermore, the model underwent optimization and independent testing using ERA5 reanalysis data, showcasing its stability and scalability. After fine-tuning on the ERA5 dataset, the model achieved an RMSE of 1.33 m s⁻¹ for wind speed and 1.75 m s⁻¹ for maximum wind speed. The AST-GAN model outperforms other state-of-the-art models in RMSE on both the HWRF and ERA5 datasets, demonstrating its effectiveness for spatiotemporal prediction of TCs.
Funding: Supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2018R1A5A7059549).
Abstract: High-quality, realistic face generation has emerged as a key field of research in computer vision. This paper proposes a robust approach that combines a Super-Resolution Generative Adversarial Network (SRGAN) with a Pyramid Attention Module (PAM) to enhance the quality of deep face generation. The SRGAN framework is designed to improve the resolution of generated images, addressing common challenges such as blurriness and a lack of intricate details. The Pyramid Attention Module complements the process by focusing on multi-scale feature extraction, enabling the network to capture finer details and complex facial features more effectively. The proposed method was trained and evaluated over 100 epochs on the CelebA dataset, demonstrating consistent improvements in image quality and a marked decrease in generator and discriminator losses, reflecting the model's capacity to learn and synthesize high-quality images effectively, given adequate computational resources. Experimental results show that the SRGAN model with the PAM module yields an aggregate discriminator loss of 0.055 for real images and 0.043 for fake images, and a generator loss of 10.58, after training for 100 epochs. The model achieved a structural similarity index measure of 0.923, outperforming the other models considered in the current study.
Funding: Supported by the Chinese Academy of Sciences "Light of West China" Program (2022-XBQNXZ-015); the National Natural Science Foundation of China (11903071); and the Operation, Maintenance and Upgrading Fund for Astronomical Telescopes and Facility Instruments, budgeted from the Ministry of Finance of China and administered by the Chinese Academy of Sciences.
Abstract: This paper addresses the performance degradation issue in a fast radio burst search pipeline based on deep learning. This issue is caused by the class imbalance of the radio frequency interference samples in the training dataset. The solution applied here improves the distribution of the training data by augmenting minority class samples using a deep convolutional generative adversarial network. Experimental results demonstrate that retraining the deep learning model with the newly generated dataset leads to a new fast radio burst classifier, which effectively reduces false positives caused by periodic wide-band impulsive radio frequency interference, thereby enhancing the performance of the search pipeline.
Abstract: The increasing demand for radio-authorized applications in the 6G era necessitates enhanced monitoring and management of radio resources, particularly for precise control over the electromagnetic environment. The radio map serves as a crucial tool for describing signal strength distribution within the current electromagnetic environment. However, most existing algorithms rely on sparse measurements of radio strength, disregarding the impact of building information. In this paper, we propose a spectrum cartography (SC) algorithm that eliminates the need for sparse ground-based radio strength measurements by utilizing a satellite network to collect data on buildings and transmitters. Our algorithm leverages a Pix2Pix Generative Adversarial Network (GAN) to construct accurate radio maps using transmitter information within real geographical environments. Finally, simulation results demonstrate that our algorithm exhibits superior accuracy compared to previously proposed methods.
Funding: Supported by the National Natural Science Foundation of China (Grant Nos. 12272259 and 52005148).
Abstract: An intelligent diagnosis method based on self-adaptive Wasserstein dual generative adversarial networks and feature fusion is proposed to address problems such as insufficient sample size and incomplete fault feature extraction, which are commonly faced by rolling bearings and lead to low diagnostic accuracy. Initially, dual models of the Wasserstein deep convolutional generative adversarial network incorporating gradient penalty (1D-2DWDCGAN) are constructed to augment the original dataset. A self-adaptive loss threshold control training strategy is introduced, establishing a self-adaptive balancing mechanism for stable model training. Subsequently, a diagnostic model based on multidimensional feature fusion is designed, wherein complex features from various dimensions are extracted, merging the original signal waveform features, structured features, and time-frequency features into a deep composite feature representation that encompasses multiple dimensions and scales; thus, efficient and accurate small-sample fault diagnosis is facilitated. Finally, experiments on the bearing fault dataset of Case Western Reserve University and the fault simulation experimental platform dataset of this research group show that this method effectively supplements the dataset and remarkably improves the diagnostic accuracy. The diagnostic accuracy after data augmentation reached 99.94% and 99.87% in the two experimental environments, respectively. In addition, robustness analysis is conducted on the diagnostic accuracy of the proposed method under different noise backgrounds, verifying its good generalization performance.
Funding: Project supported by the National Natural Science Foundation of China (Grant Nos. 12122403 and 12327807).
Abstract: Full waveform inversion (FWI) has shown great potential in the detection of musculoskeletal disease. However, FWI is an ill-posed inverse problem and places high demands on the initial model during the imaging process. An inaccurate initial model may lead to local minima in the inversion and unexpected imaging results caused by the cycle-skipping phenomenon. Deep learning methods have been applied in musculoskeletal imaging, but they need a large amount of data for training. Inspired by work on generative adversarial networks with physics-informed constraints, we propose a method named bone ultrasound imaging with physics-informed generative adversarial network (BUIPIGAN) to achieve unsupervised multi-parameter imaging of musculoskeletal tissues, focusing on speed of sound (SOS) and density. In in-silico experiments using a ring array transducer, conventional FWI methods and BUIPIGAN were employed for multi-parameter imaging of two musculoskeletal tissue models. The results were evaluated based on visual appearance, structural similarity index measure (SSIM), signal-to-noise ratio (SNR), and relative error (RE). For SOS imaging of the tibia–fibula model, the proposed BUIPIGAN achieved accurate SOS imaging with the best performance. The specific quantitative metrics for SOS imaging were SSIM 0.9573, SNR 28.70 dB, and RE 5.78%. For the multi-parameter imaging of the tibia–fibula and human forearm models, BUIPIGAN successfully reconstructed SOS and density distributions with SSIM above 94%, SNR above 21 dB, and RE below 10%. BUIPIGAN also showed robustness across various noise levels (i.e., 30 dB, 10 dB). The results demonstrate that the proposed BUIPIGAN can achieve high-accuracy SOS and density imaging, indicating its potential for applications in musculoskeletal ultrasound imaging.
Funding: National Natural Science Foundation of China (No. 61975029).
Abstract: Multifocal metalenses are of great interest in optical communications, optical imaging, and micro-optics systems, but their design is extremely challenging. In recent years, deep learning methods have provided novel solutions to the design of optical planar devices. Here, an approach is proposed that uses generative adversarial networks (GANs) to design metalenses with different focusing positions at dual wavelengths. This approach includes a forward network and an inverse network, where the former predicts the optical response of meta-atoms and the latter generates structures that meet specific requirements. Compared to the traditional search method, the inverse network demonstrates higher precision and efficiency in designing a dual-wavelength bifocal metalens. The results provide insights and methodologies for the design of tunable-wavelength metalenses, while also highlighting the potential of deep learning in optical device design.
Funding: Funded by the National Key Research and Development Program (2022YFC3003502).
Abstract: This study addresses the pressing challenge of generating realistic strong ground motion data for simulating earthquakes, a crucial component in pre-earthquake risk assessments and post-earthquake disaster evaluations, particularly in regions with limited seismic data. Herein, we report a generative adversarial network (GAN) framework capable of simulating strong ground motions under various environmental conditions using only a small set of real earthquake records. The constructed GAN model generates ground motions based on continuous physical variables such as source distance, site conditions, and magnitude, effectively capturing the complexity and diversity of ground motions under different scenarios. This capability allows the proposed model to approximate real seismic data, making it applicable to a wide range of engineering purposes. Using the Shandong Pingyuan earthquake as an example, a specialized dataset was constructed based on regional real ground motion records. The response spectrum at target locations was obtained through inverse distance-weighted interpolation of actual response spectra, followed by continuous wavelet transform to derive the ground motion time histories at these locations. Through iterative parameter adjustments, the constructed GAN model learned the probability distribution of strong-motion data for this event. The trained model generated three-component ground-motion time histories with clear P-wave and S-wave characteristics, accurately reflecting the non-stationary nature of seismic records. Statistical comparisons between synthetic and real response spectra, waveform envelopes, and peak ground acceleration show a high degree of similarity, underscoring the effectiveness of the model in replicating both the statistical and physical characteristics of real ground motions. These findings support the feasibility of GANs for generating realistic earthquake data in data-scarce regions, providing a reliable approach for enriching regional ground motion databases. Additionally, the results suggest that GAN-based networks are a powerful tool for building predictive models in seismic hazard analysis.
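The inverse distance-weighted interpolation step mentioned in the preceding abstract can be sketched as follows. This is a generic illustration, not the authors' implementation: the station layout, the spectral values, the planar distance measure, and the exponent `power=2.0` are all assumptions, since the abstract does not state them.

```python
import math


def idw_interpolate(stations, target, power=2.0):
    """Estimate a spectrum at `target` from (location, spectrum) pairs.

    `stations` is a list of ((x, y), [values per period]) tuples; nearer
    stations receive larger weights, proportional to 1 / distance**power.
    """
    weighted = []
    for (x, y), values in stations:
        distance = math.hypot(x - target[0], y - target[1])
        if distance == 0.0:
            # The target coincides with a station: return its record directly.
            return list(values)
        weighted.append((1.0 / distance ** power, values))

    total_weight = sum(w for w, _ in weighted)
    n_periods = len(weighted[0][1])
    return [
        sum(w * values[i] for w, values in weighted) / total_weight
        for i in range(n_periods)
    ]


# Hypothetical stations with two spectral ordinates each.
stations = [((0.0, 0.0), [1.0, 2.0]), ((2.0, 0.0), [3.0, 4.0])]
print(idw_interpolate(stations, (1.0, 0.0)))  # midpoint: equal weights
```

A target midway between two stations receives the plain average of their spectra, while a target that coincides with a station reproduces that station's record exactly.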
Abstract: Background: Super-resolution generative adversarial networks (SR-GANs) show promise for enhancing the quality of retinal, MRI, and echocardiography images in medical imaging. With these networks, it is possible to produce high-quality images even from low-quality medical scans. Methods: SR-GANs upscale from low to high resolution in two 2× stages, using filters of multiple sizes and robust loss functions. The medical super-resolution network and the denoising SR-GAN address problems such as image noise and artifacts to improve image stability, feature extraction, and visual quality. Results: Quantitative assessment shows that SR-GAN-based approaches lead to marked improvements, including PSNR increases of up to 4.85 dB and SSIM improvements of 0.04 to 0.05. These gains exceed those of traditional super-resolution methods and help doctors obtain clear images of the mitral valve in cardiac ultrasonography. Conclusion: Applying SR-GANs in clinical tasks is expected to increase diagnostic accuracy, ease the burden on patients, and widen the application of super-resolution methods across medical procedures. The results show that SR-GANs improve the picture quality of echocardiograms used for diagnosing mitral valve problems. With the model validated in research settings, future studies should apply it to real-world clinical cases, test it across a range of imaging devices, and refine the system so that it is efficient enough for use in medical settings.
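To put the reported PSNR gain in context, the sketch below computes PSNR from its standard definition, 10·log10(MAX²/MSE), and converts a gain in decibels into the equivalent reduction in mean squared error. The pixel values and the 8-bit peak value of 255 are assumptions for illustration and are not taken from the study.

```python
import math


def psnr(reference, estimate, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel sequences."""
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)


def mse_reduction_factor(gain_db):
    """Factor by which MSE shrinks when PSNR rises by `gain_db` at a fixed peak value."""
    return 10.0 ** (gain_db / 10.0)


# Hypothetical pixel rows, used only to exercise the function.
print(psnr([100, 100], [110, 90]))
print(mse_reduction_factor(4.85))
```

Under this definition, a 4.85 dB increase corresponds to cutting the mean squared error by a factor of roughly 3, which gives a more tangible sense of the improvement than the decibel figure alone.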
Funding: Supported by the National Defense Innovation Project (No. ZZKY20222411) and the Natural Science Basic Research Plan in Shaanxi Province of China (No. 2024JC-YBMS-546).
Abstract: Symmetric encryption algorithms learned by previously proposed end-to-end adversarial network encryption communication systems are deterministic. With the same key and the same plaintext, a deterministic algorithm produces the same ciphertext. This means that a key in a deterministic encryption algorithm can only be used once, so the encryption is not practical. To solve this problem, a nondeterministic symmetric encryption end-to-end communication system based on generative adversarial networks is proposed. We design a nonce-based adversarial neural network model, where a "nonce", standing for "number used only once", is passed to the communication participants and does not need to be secret. Moreover, we optimize the network structure by adding Batch Normalization (BN) to the convolutional neural networks (CNNs), selecting appropriate activation functions, and setting appropriate CNN parameters. Experimental results and analysis show that our system achieves nondeterministic symmetric encryption: when Alice encrypts the same plaintext with the same key twice, she generates different ciphertexts, and Bob can decrypt all of these different ciphertexts to the correct plaintext. The proposed system also converges quickly and decrypts correctly when the plaintext length is 256 or even longer.
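The role of the nonce described in the preceding abstract can be illustrated with a conventional, non-neural toy cipher. This is an analogy only: the paper's system learns its encryption with adversarial networks, whereas the sketch below derives a keystream from SHA-256 over key, nonce, and a counter. The function names and the construction are assumptions made for illustration, it presumes a fixed-length key, and it is not meant for real-world use.

```python
import hashlib
import os


def _keystream(key: bytes, nonce: bytes, length: int) -> bytes:
    """Derive `length` pseudo-random bytes from key, nonce, and a block counter."""
    stream = bytearray()
    counter = 0
    while len(stream) < length:
        block = hashlib.sha256(key + nonce + counter.to_bytes(8, "big")).digest()
        stream.extend(block)
        counter += 1
    return bytes(stream[:length])


def encrypt(key: bytes, plaintext: bytes, nonce: bytes = None):
    """Return (nonce, ciphertext); the nonce is public but must not repeat per key."""
    if nonce is None:
        nonce = os.urandom(16)
    stream = _keystream(key, nonce, len(plaintext))
    return nonce, bytes(p ^ s for p, s in zip(plaintext, stream))


def decrypt(key: bytes, nonce: bytes, ciphertext: bytes) -> bytes:
    """XOR with the same keystream recovers the plaintext."""
    stream = _keystream(key, nonce, len(ciphertext))
    return bytes(c ^ s for c, s in zip(ciphertext, stream))


key = b"k" * 32
message = b"the same plaintext, sent twice!!"
nonce_a, cipher_a = encrypt(key, message)
nonce_b, cipher_b = encrypt(key, message)
print(cipher_a != cipher_b)                        # fresh nonces give different ciphertexts
print(decrypt(key, nonce_a, cipher_a) == message)  # the receiver still recovers the message
```

The property on display is the one the abstract attributes to its learned system: the same key and plaintext yield different ciphertexts because the nonce changes, while the receiver, who knows the key and sees the nonce, still decrypts correctly.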
Funding: This work is supported by the National Natural Science Foundation of China (No. 61572086, No. 61402058); the Key Research and Development Project of Sichuan Province (Nos. 20ZDYF2324, 2019ZYD027 and 2018TJPT0012); the Innovation Team of Quantum Security Communication of Sichuan Province (No. 17TD0009); the Academic and Technical Leaders Training Funding Support Projects of Sichuan Province (No. 2016120080102643); the Application Foundation Project of Sichuan Province (No. 2017JY0168); and the Science and Technology Support Project of Sichuan Province (Nos. 2018GZ0204 and 2016FZ0112).
Abstract: The generative adversarial network (GAN) is one of the most promising methods for unsupervised learning in recent years. GANs work via adversarial training and have shown excellent performance in the fields of image synthesis, image super-resolution, video generation, image translation, etc. Compared with classical algorithms, quantum algorithms have unique advantages in dealing with complex tasks, and with the rapid development of quantum technology, quantum machine learning (QML) has become one of the most promising classes of quantum algorithms. Specifically, the quantum generative adversarial network (QGAN) has shown potential for exponential quantum speedups. Meanwhile, QGAN also exhibits some problems, such as barren plateaus, unstable gradients, mode collapse, and the absence of a complete scientific evaluation system. How to improve the theory of QGAN and how to apply it have attracted the attention of researchers. In this paper, we comprehensively review recently proposed GAN and QGAN models and their applications, and we discuss the existing problems and future research trends of QGAN.
Funding: Supported by the Shenzhen Science and Technology Innovation Committee under Grants No. JCYJ20170306170559215 and No. JCYJ20180302153918689.
Abstract: In this paper, we propose a hybrid model that maps the input noise vector to the label of the image generated by a generative adversarial network (GAN). This model mainly consists of a pre-trained deep convolutional generative adversarial network (DCGAN) and a classifier. Using the model, we visualize the distribution of the two-dimensional input noise that leads to a specific type of generated image after each training epoch of the GAN. The visualization reveals the distribution feature of the input noise vector and the performance of the generator. With this feature, we try to build a guided generator (GG) with the ability to produce the fake image we need. Two methods are proposed to build the GG. One is the most significant noise (MSN) method, and the other utilizes labeled noise. The MSN method can generate images precisely but with less variation. In contrast, the labeled noise method produces more variation but is slightly less stable. Finally, we propose a criterion to measure the performance of the generator, which can be used as a loss function to effectively train the network.
Funding: Supported by the National Natural Science Foundation of China (61533019, 71232006, 91520301).
Abstract: Recently, generative adversarial networks (GANs) have become a research focus of artificial intelligence. Inspired by the two-player zero-sum game, GANs comprise a generator and a discriminator, both trained under the adversarial learning idea. The goal of GANs is to estimate the potential distribution of real data samples and generate new samples from that distribution. Since their inception, GANs have been widely studied due to their enormous application prospects, including image and vision computing, speech and language processing, etc. In this review paper, we summarize the state of the art of GANs and look into the future. First, we survey the background of GANs, their theoretical and implementation models, and their application fields. Then, we discuss the advantages and disadvantages of GANs and their development trends. In particular, we investigate the relation between GANs and parallel intelligence, concluding that GANs have great potential in parallel systems research in terms of virtual-real interaction and integration. Clearly, GANs can provide substantial algorithmic support for parallel intelligence.
Funding: This work is supported by the National Natural Science Foundation of China (Nos. 61771154, 61603239, 61772454, 6171101570).
Abstract: Deep Learning (DL) is a powerful tool that has achieved tremendous success in areas such as Computer Vision, Speech Recognition, and Natural Language Processing. Since Automated Modulation Classification (AMC) is an important part of Cognitive Radio Networks, we explore the potential of DL in solving the signal modulation recognition problem. It cannot be overlooked that DL models are complex, which makes them prone to over-fitting. DL models require large amounts of training data to combat over-fitting, but manually adding high-quality labels to training data is not always cheap or feasible, especially in real-time systems, which may encounter data not present in the dataset. Semi-supervised learning is a way to exploit unlabeled data effectively to reduce over-fitting in DL. In this paper, we extend Generative Adversarial Networks (GANs) to semi-supervised learning and show that this method can be used to create a more data-efficient classifier.
Funding: This research is funded by the Centre for Advanced Modeling and Geospatial Information Systems (CAMGIS), Faculty of Engineering and Information Technology, the University of Technology Sydney, Australia.
Abstract: In recent years, landslide susceptibility mapping has substantially improved with advances in machine learning. However, challenges remain in landslide mapping due to the limited availability of inventory data. In this paper, a novel method that improves the performance of machine learning techniques is presented. The proposed method creates synthetic inventory data using Generative Adversarial Networks (GANs) to improve the prediction of landslides. In this research, inventory data for 156 landslide locations in Cameron Highlands, Malaysia, were taken from previous projects the authors worked on. Elevation, slope, aspect, plan curvature, profile curvature, total curvature, lithology, land use and land cover (LULC), distance to the road, distance to the river, stream power index (SPI), sediment transport index (STI), terrain roughness index (TRI), topographic wetness index (TWI), and vegetation density are the geo-environmental factors considered in this study, based on suggestions from previous works on Cameron Highlands. To show the capability of GANs in improving landslide prediction models, this study tests the proposed GAN model with benchmark models, namely Artificial Neural Network (ANN), Support Vector Machine (SVM), Decision Trees (DT), Random Forest (RF), and Bagging ensemble models with ANN and SVM models. These models were validated using the area under the receiver operating characteristic curve (AUROC). The DT, RF, SVM, ANN, and Bagging ensemble achieved AUROC values of 0.90, 0.94, 0.86, 0.69, and 0.82 for training and 0.76, 0.81, 0.85, 0.72, and 0.75 for testing, respectively. When using additional samples, the same models achieved AUROC values of 0.92, 0.94, 0.88, 0.75, and 0.84 for training and 0.78, 0.82, 0.82, 0.78, and 0.80 for testing, respectively. Using the additional samples improved the test accuracy of all the models except SVM. As a result, this research showed that, in data-scarce environments, utilizing GANs to generate supplementary samples is promising because it can improve the predictive capability of common landslide prediction models.
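For readers unfamiliar with the validation metric used in the preceding abstract, AUROC can be computed directly as the probability that a randomly chosen positive sample is scored higher than a randomly chosen negative one. The sketch below uses that pairwise definition; the labels and scores are hypothetical and unrelated to the landslide data.

```python
def auroc(labels, scores):
    """Area under the ROC curve via pairwise comparison (ties count as half)."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUROC needs at least one positive and one negative sample")

    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical susceptibility scores: 1 = landslide location, 0 = non-landslide.
labels = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
print(auroc(labels, scores))  # 3 of the 4 positive/negative pairs are ordered correctly
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect separation, which is the scale on which the figures quoted in the abstract should be read. The pairwise loop is quadratic in the sample count, so a rank-based implementation is preferable for large datasets.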