The increasing demand for radioauthorized applications in the 6G era necessitates enhanced monitoring and management of radio resources,particularly for precise control over the electromagnetic environment.The radio m...The increasing demand for radioauthorized applications in the 6G era necessitates enhanced monitoring and management of radio resources,particularly for precise control over the electromagnetic environment.The radio map serves as a crucial tool for describing signal strength distribution within the current electromagnetic environment.However,most existing algorithms rely on sparse measurements of radio strength,disregarding the impact of building information.In this paper,we propose a spectrum cartography(SC)algorithm that eliminates the need for relying on sparse ground-based radio strength measurements by utilizing a satellite network to collect data on buildings and transmitters.Our algorithm leverages Pix2Pix Generative Adversarial Network(GAN)to construct accurate radio maps using transmitter information within real geographical environments.Finally,simulation results demonstrate that our algorithm exhibits superior accuracy compared to previously proposed methods.展开更多
In this study,cylindrical sandstone samples were imaged by CT scanning technique,and the pore structure images of sandstone samples were analyzed and generated by combining with StyleGAN2-ADA generative adversarial ne...In this study,cylindrical sandstone samples were imaged by CT scanning technique,and the pore structure images of sandstone samples were analyzed and generated by combining with StyleGAN2-ADA generative adversarial network(GAN)model.Firstly,nine small column samples with a diameter of 4 mm were drilled from sandstone samples with a diameter of 2.5 cm,and their CT scanning results were preprocessed.Because the change between adjacent slices was little,using all slices directly may lead to the problem of pattern collapse in the process of model generation.In order to solve this problem,one slice was selected as training data every 30 slices,and the diversity of slices was verified by calculating the LPIPS values of these slices.The results showed that the strategy of selecting one slice every 30 slices could effectively improve the diversity of images generated by the model and avoid the phenomenon of pattern collapse.Through this process,a total of 295 discontinuous two-dimensional slices were generated for the generation and segmentation analysis of sandstone pore structures.This study can provide effective data support for accurate segmentation of porous medium structures,and simultaneously improves the stability and diversity of generative adversarial network under the condition of small samples.展开更多
The generative adversarial network(GAN)is first proposed in 2014,and this kind of network model is machine learning systems that can learn to measure a given distribution of data,one of the most important applications...The generative adversarial network(GAN)is first proposed in 2014,and this kind of network model is machine learning systems that can learn to measure a given distribution of data,one of the most important applications is style transfer.Style transfer is a class of vision and graphics problems where the goal is to learn the mapping between an input image and an output image.CYCLE-GAN is a classic GAN model,which has a wide range of scenarios in style transfer.Considering its unsupervised learning characteristics,the mapping is easy to be learned between an input image and an output image.However,it is difficult for CYCLE-GAN to converge and generate high-quality images.In order to solve this problem,spectral normalization is introduced into each convolutional kernel of the discriminator.Every convolutional kernel reaches Lipschitz stability constraint with adding spectral normalization and the value of the convolutional kernel is limited to[0,1],which promotes the training process of the proposed model.Besides,we use pretrained model(VGG16)to control the loss of image content in the position of l1 regularization.To avoid overfitting,l1 regularization term and l2 regularization term are both used in the object loss function.In terms of Frechet Inception Distance(FID)score evaluation,our proposed model achieves outstanding performance and preserves more discriminative features.Experimental results show that the proposed model converges faster and achieves better FID scores than the state of the art.展开更多
The polyp dataset involves the confidentiality of medical records, so it might be difficult to obtain datasets with accurate annotations. This problem can be effectively solved by expanding the polyp data set with alg...The polyp dataset involves the confidentiality of medical records, so it might be difficult to obtain datasets with accurate annotations. This problem can be effectively solved by expanding the polyp data set with algorithms. The traditional polyp dataset expansion scheme usually requires the use of two models or traditional visual methods. These methods are both tedious and difficult to provide new polyp features for training data. Therefore, our research aims to efficiently generate high-quality polyp samples, so as to effectively expand the polyp dataset. In this study, we first added the attention mechanism to the generation model and improved the loss function to reduce the interference caused by reflection in the image generation process. Meanwhile, we used the improved generation model to remove polyps from the original image. In addition, we used masks of different shapes generated by random combinations to generate polyps with more characteristic information. The same generation model was used for the removal and generation of polyps. The generated polyp image has its own annotation, which is conducive to us directly using the expanded data set for training. Finally, we verified the effectiveness of the improved model and the dataset expansion scheme through a series of comparative experiments on the public dataset. The results showed that using the dataset we generate for training can significantly optimize the main performance indicators.展开更多
Subsurface rocks,as complex porous media,exhibit multiscale pore structures and intricate physical properties.Digital rock physics technology has become increasingly influential in the study of subsurface rock propert...Subsurface rocks,as complex porous media,exhibit multiscale pore structures and intricate physical properties.Digital rock physics technology has become increasingly influential in the study of subsurface rock properties.Given the multiscale characteristics of rock pore structures,direct three-dimensional imaging at sub-micrometer and nanometer scales is typically infeasible.This study introduces a method for reconstructing porous media using multidimensional data,which combines one-dimensional pore structure parameters with two-dimensional images to reconstruct three-dimensional models.The pore network model(PNM)is stochastically reconstructed using one-dimensional parameters,and a generative adversarial network(GAN)is utilized to equip the PNM with pore morphologies derived from two-dimensional images.The digital rocks generated by this method possess excellent controllability.Using Berea sandstone and Grosmont carbonate samples,we performed digital rock reconstructions based on PNM extracted by the maximum ball algorithm and compared them with stochastically reconstructed PNM.Pore structure parameters,permeability,and formation factors were calculated.The results show that the generated samples exhibit good consistency with real samples in terms of pore morphology,pore structure,and physical properties.Furthermore,our method effectively supplements the micropores not captured in CT images,demonstrating its potential in multiscale carbonate samples.Thus,the proposed reconstruction method is promising for advancing porous media property research.展开更多
In recent years,Pix2Pix,a model within the domain of GANs,has found widespread application in the field of image-to-image translation.However,traditional Pix2Pix models suffer from significant drawbacks in image gener...In recent years,Pix2Pix,a model within the domain of GANs,has found widespread application in the field of image-to-image translation.However,traditional Pix2Pix models suffer from significant drawbacks in image generation,such as the loss of important information features during the encoding and decoding processes,as well as a lack of constraints during the training process.To address these issues and improve the quality of Pix2Pixgenerated images,this paper introduces two key enhancements.Firstly,to reduce information loss during encoding and decoding,we utilize the U-Net++network as the generator for the Pix2Pix model,incorporating denser skip-connection to minimize information loss.Secondly,to enhance constraints during image generation,we introduce a specialized discriminator designed to distinguish differential images,further enhancing the quality of the generated images.We conducted experiments on the facades dataset and the sketch portrait dataset from the Chinese University of Hong Kong to validate our proposed model.The experimental results demonstrate that our improved Pix2Pix model significantly enhances image quality and outperforms other models in the selected metrics.Notably,the Pix2Pix model incorporating the differential image discriminator exhibits the most substantial improvements across all metrics.An analysis of the experimental results reveals that the use of the U-Net++generator effectively reduces information feature loss,while the Pix2Pix model incorporating the differential image discriminator enhances the supervision of the generator during training.Both of these enhancements collectively improve the quality of Pix2Pix-generated images.展开更多
Concrete subjected to fire loads is susceptible to explosive spalling, which can lead to the exposure of reinforcingsteel bars to the fire, substantially jeopardizing the structural safety and stability. The spalling ...Concrete subjected to fire loads is susceptible to explosive spalling, which can lead to the exposure of reinforcingsteel bars to the fire, substantially jeopardizing the structural safety and stability. The spalling of fire-loaded concreteis closely related to the evolution of pore pressure and temperature. Conventional analytical methods involve theresolution of complex, strongly coupled multifield equations, necessitating significant computational efforts. Torapidly and accurately obtain the distributions of pore-pressure and temperature, the Pix2Pix model is adoptedin this work, which is celebrated for its capabilities in image generation. The open-source dataset used hereinfeatures RGB images we generated using a sophisticated coupled model, while the grayscale images encapsulate the15 principal variables influencing spalling. After conducting a series of tests with different layers configurations,activation functions and loss functions, the Pix2Pix model suitable for assessing the spalling risk of fire-loadedconcrete has been meticulously designed and trained. The applicability and reliability of the Pix2Pix model inconcrete parameter prediction are verified by comparing its outcomes with those derived fromthe strong couplingTHC model. Notably, for the practical engineering applications, our findings indicate that utilizing monochromeimages as the initial target for analysis yields more dependable results. This work not only offers valuable insightsfor civil engineers specializing in concrete structures but also establishes a robust methodological approach forresearchers seeking to create similar predictive models.展开更多
2D patterned hollow structures have emerged as advanced materials with exceptional mechanical properties and lightweight characteristics,making them ideal for high-performance applications in aerospace and automotive ...2D patterned hollow structures have emerged as advanced materials with exceptional mechanical properties and lightweight characteristics,making them ideal for high-performance applications in aerospace and automotive industries.However,optimizing their structural design to achieve uniform stress distribution and minimize stress concentration remains a significant challenge due to the complex interplay between geometric patterns and mechanical performance.In this study,we develop an integrated framework combining conditional generative adversarial networks(cGANs)and deep Q-networks(DQNs)to predict and optimize the stress fields of 2D-PHS.We generated a comprehensive dataet comprising 1000 samples across five distinct density classes using a custom grid pattern generation algorithm,ensuring a wide range of structural variations.The cGAN accurately predicts stress distributions,achieving a high correlation with finite element analysis(FEA)results while reducing computational time from approximately 40 s(FEA)to just 1-2 s per prediction.Concurrently,the DQN optimizes design parameters through scaling and rotation operations,enhancing structural performance based on predicted stress metrics.Our approach resulted in a 4.3%improvement in average stress uniformity and a 23.1%reduction in maximum stress concentration.These improvements were validated through FEA simulations and experimental tensile tests on 3D-printed thermoplastic polyurethane samples.The tensile strength of the optimized samples increased from an initial average of 5.9-6.6 MPa under 100%strain,demonstrating enhanced mechanical resilience.This study demonstrates the efficacy of combining advanced AI techniques for rapid and precise material design optimization,providing a scalable and cost-effective solution for developing superior lightweight materials with tailored mechanical properties for critical engineering applications.展开更多
Alzheimer’s Disease(AD)is a progressive neurodegenerative disorder that significantly affects cognitive function,making early and accurate diagnosis essential.Traditional Deep Learning(DL)-based approaches often stru...Alzheimer’s Disease(AD)is a progressive neurodegenerative disorder that significantly affects cognitive function,making early and accurate diagnosis essential.Traditional Deep Learning(DL)-based approaches often struggle with low-contrast MRI images,class imbalance,and suboptimal feature extraction.This paper develops a Hybrid DL system that unites MobileNetV2 with adaptive classification methods to boost Alzheimer’s diagnosis by processing MRI scans.Image enhancement is done using Contrast-Limited Adaptive Histogram Equalization(CLAHE)and Enhanced Super-Resolution Generative Adversarial Networks(ESRGAN).A classification robustness enhancement system integrates class weighting techniques and a Matthews Correlation Coefficient(MCC)-based evaluation method into the design.The trained and validated model gives a 98.88%accuracy rate and 0.9614 MCC score.We also performed a 10-fold cross-validation experiment with an average accuracy of 96.52%(±1.51),a loss of 0.1671,and an MCC score of 0.9429 across folds.The proposed framework outperforms the state-of-the-art models with a 98%weighted F1-score while decreasing misdiagnosis results for every AD stage.The model demonstrates apparent separation abilities between AD progression stages according to the results of the confusion matrix analysis.These results validate the effectiveness of hybrid DL models with adaptive preprocessing for early and reliable Alzheimer’s diagnosis,contributing to improved computer-aided diagnosis(CAD)systems in clinical practice.展开更多
Recently,Generative Adversarial Networks(GANs)have become the mainstream text-to-image(T2I)framework.However,a standard normal distribution noise of inputs cannot provide sufficient information to synthesize an image ...Recently,Generative Adversarial Networks(GANs)have become the mainstream text-to-image(T2I)framework.However,a standard normal distribution noise of inputs cannot provide sufficient information to synthesize an image that approaches the ground-truth image distribution.Moreover,the multistage generation strategy results in complex T2I applications.Therefore,this study proposes a novel feature-grounded single-stage T2I model,which considers the“real”distribution learned from training images as one input and introduces a worst-case-optimized similarity measure into the loss function to enhance the model's generation capacity.Experimental results on two benchmark datasets demonstrate the competitive performance of the proposed model in terms of the Frechet inception distance and inception score compared to those of some classical and state-of-the-art models,showing the improved similarities among the generated image,text,and ground truth.展开更多
文摘The increasing demand for radioauthorized applications in the 6G era necessitates enhanced monitoring and management of radio resources,particularly for precise control over the electromagnetic environment.The radio map serves as a crucial tool for describing signal strength distribution within the current electromagnetic environment.However,most existing algorithms rely on sparse measurements of radio strength,disregarding the impact of building information.In this paper,we propose a spectrum cartography(SC)algorithm that eliminates the need for relying on sparse ground-based radio strength measurements by utilizing a satellite network to collect data on buildings and transmitters.Our algorithm leverages Pix2Pix Generative Adversarial Network(GAN)to construct accurate radio maps using transmitter information within real geographical environments.Finally,simulation results demonstrate that our algorithm exhibits superior accuracy compared to previously proposed methods.
文摘In this study,cylindrical sandstone samples were imaged by CT scanning technique,and the pore structure images of sandstone samples were analyzed and generated by combining with StyleGAN2-ADA generative adversarial network(GAN)model.Firstly,nine small column samples with a diameter of 4 mm were drilled from sandstone samples with a diameter of 2.5 cm,and their CT scanning results were preprocessed.Because the change between adjacent slices was little,using all slices directly may lead to the problem of pattern collapse in the process of model generation.In order to solve this problem,one slice was selected as training data every 30 slices,and the diversity of slices was verified by calculating the LPIPS values of these slices.The results showed that the strategy of selecting one slice every 30 slices could effectively improve the diversity of images generated by the model and avoid the phenomenon of pattern collapse.Through this process,a total of 295 discontinuous two-dimensional slices were generated for the generation and segmentation analysis of sandstone pore structures.This study can provide effective data support for accurate segmentation of porous medium structures,and simultaneously improves the stability and diversity of generative adversarial network under the condition of small samples.
基金This work is supported by the National Natural Science Foundation of China(No.61702226)the 111 Project(B12018)+1 种基金the Natural Science Foundation of Jiangsu Province(No.BK20170200)the Fundamental Research Funds for the Central Universities(No.JUSRP11854).
文摘The generative adversarial network(GAN)is first proposed in 2014,and this kind of network model is machine learning systems that can learn to measure a given distribution of data,one of the most important applications is style transfer.Style transfer is a class of vision and graphics problems where the goal is to learn the mapping between an input image and an output image.CYCLE-GAN is a classic GAN model,which has a wide range of scenarios in style transfer.Considering its unsupervised learning characteristics,the mapping is easy to be learned between an input image and an output image.However,it is difficult for CYCLE-GAN to converge and generate high-quality images.In order to solve this problem,spectral normalization is introduced into each convolutional kernel of the discriminator.Every convolutional kernel reaches Lipschitz stability constraint with adding spectral normalization and the value of the convolutional kernel is limited to[0,1],which promotes the training process of the proposed model.Besides,we use pretrained model(VGG16)to control the loss of image content in the position of l1 regularization.To avoid overfitting,l1 regularization term and l2 regularization term are both used in the object loss function.In terms of Frechet Inception Distance(FID)score evaluation,our proposed model achieves outstanding performance and preserves more discriminative features.Experimental results show that the proposed model converges faster and achieves better FID scores than the state of the art.
基金supported by the Natural Science Foundation Project of Fujian Province,China(Grant Nos.2023J011439 and 2019J01859).
文摘The polyp dataset involves the confidentiality of medical records, so it might be difficult to obtain datasets with accurate annotations. This problem can be effectively solved by expanding the polyp data set with algorithms. The traditional polyp dataset expansion scheme usually requires the use of two models or traditional visual methods. These methods are both tedious and difficult to provide new polyp features for training data. Therefore, our research aims to efficiently generate high-quality polyp samples, so as to effectively expand the polyp dataset. In this study, we first added the attention mechanism to the generation model and improved the loss function to reduce the interference caused by reflection in the image generation process. Meanwhile, we used the improved generation model to remove polyps from the original image. In addition, we used masks of different shapes generated by random combinations to generate polyps with more characteristic information. The same generation model was used for the removal and generation of polyps. The generated polyp image has its own annotation, which is conducive to us directly using the expanded data set for training. Finally, we verified the effectiveness of the improved model and the dataset expansion scheme through a series of comparative experiments on the public dataset. The results showed that using the dataset we generate for training can significantly optimize the main performance indicators.
基金supported by the Shandong Provincial Natural Science Foundation(ZR2024MD116)National Natural Science Foundation of China(Grant Nos.42174143,42004098)Technology Innovation Leading Program of Shaanxi(No.2024 ZC-YYDP-27).
文摘Subsurface rocks,as complex porous media,exhibit multiscale pore structures and intricate physical properties.Digital rock physics technology has become increasingly influential in the study of subsurface rock properties.Given the multiscale characteristics of rock pore structures,direct three-dimensional imaging at sub-micrometer and nanometer scales is typically infeasible.This study introduces a method for reconstructing porous media using multidimensional data,which combines one-dimensional pore structure parameters with two-dimensional images to reconstruct three-dimensional models.The pore network model(PNM)is stochastically reconstructed using one-dimensional parameters,and a generative adversarial network(GAN)is utilized to equip the PNM with pore morphologies derived from two-dimensional images.The digital rocks generated by this method possess excellent controllability.Using Berea sandstone and Grosmont carbonate samples,we performed digital rock reconstructions based on PNM extracted by the maximum ball algorithm and compared them with stochastically reconstructed PNM.Pore structure parameters,permeability,and formation factors were calculated.The results show that the generated samples exhibit good consistency with real samples in terms of pore morphology,pore structure,and physical properties.Furthermore,our method effectively supplements the micropores not captured in CT images,demonstrating its potential in multiscale carbonate samples.Thus,the proposed reconstruction method is promising for advancing porous media property research.
基金supported in part by the Xinjiang Natural Science Foundation of China(2021D01C078).
文摘In recent years,Pix2Pix,a model within the domain of GANs,has found widespread application in the field of image-to-image translation.However,traditional Pix2Pix models suffer from significant drawbacks in image generation,such as the loss of important information features during the encoding and decoding processes,as well as a lack of constraints during the training process.To address these issues and improve the quality of Pix2Pixgenerated images,this paper introduces two key enhancements.Firstly,to reduce information loss during encoding and decoding,we utilize the U-Net++network as the generator for the Pix2Pix model,incorporating denser skip-connection to minimize information loss.Secondly,to enhance constraints during image generation,we introduce a specialized discriminator designed to distinguish differential images,further enhancing the quality of the generated images.We conducted experiments on the facades dataset and the sketch portrait dataset from the Chinese University of Hong Kong to validate our proposed model.The experimental results demonstrate that our improved Pix2Pix model significantly enhances image quality and outperforms other models in the selected metrics.Notably,the Pix2Pix model incorporating the differential image discriminator exhibits the most substantial improvements across all metrics.An analysis of the experimental results reveals that the use of the U-Net++generator effectively reduces information feature loss,while the Pix2Pix model incorporating the differential image discriminator enhances the supervision of the generator during training.Both of these enhancements collectively improve the quality of Pix2Pix-generated images.
基金the National Natural Science Foundation of China(NSFC)(52178324).
文摘Concrete subjected to fire loads is susceptible to explosive spalling, which can lead to the exposure of reinforcingsteel bars to the fire, substantially jeopardizing the structural safety and stability. The spalling of fire-loaded concreteis closely related to the evolution of pore pressure and temperature. Conventional analytical methods involve theresolution of complex, strongly coupled multifield equations, necessitating significant computational efforts. Torapidly and accurately obtain the distributions of pore-pressure and temperature, the Pix2Pix model is adoptedin this work, which is celebrated for its capabilities in image generation. The open-source dataset used hereinfeatures RGB images we generated using a sophisticated coupled model, while the grayscale images encapsulate the15 principal variables influencing spalling. After conducting a series of tests with different layers configurations,activation functions and loss functions, the Pix2Pix model suitable for assessing the spalling risk of fire-loadedconcrete has been meticulously designed and trained. The applicability and reliability of the Pix2Pix model inconcrete parameter prediction are verified by comparing its outcomes with those derived fromthe strong couplingTHC model. Notably, for the practical engineering applications, our findings indicate that utilizing monochromeimages as the initial target for analysis yields more dependable results. This work not only offers valuable insightsfor civil engineers specializing in concrete structures but also establishes a robust methodological approach forresearchers seeking to create similar predictive models.
基金supported by the National Natural Science Foundation of China(Grant Nos.52322305 and 52473098)the starting Grant of ShanghaiTech University,the Double First-Class Initiative Fund of ShanghaiTech University and the Shanghai Clinical Research and Trial Center.Materials were tested at the Analytical Instrumentation Center(Grant No.SPST-AIC10112914)the Center for High-resolution Electron Microscopy(C-hEM),SPST,ShanghaiTech University.
文摘2D patterned hollow structures have emerged as advanced materials with exceptional mechanical properties and lightweight characteristics,making them ideal for high-performance applications in aerospace and automotive industries.However,optimizing their structural design to achieve uniform stress distribution and minimize stress concentration remains a significant challenge due to the complex interplay between geometric patterns and mechanical performance.In this study,we develop an integrated framework combining conditional generative adversarial networks(cGANs)and deep Q-networks(DQNs)to predict and optimize the stress fields of 2D-PHS.We generated a comprehensive dataet comprising 1000 samples across five distinct density classes using a custom grid pattern generation algorithm,ensuring a wide range of structural variations.The cGAN accurately predicts stress distributions,achieving a high correlation with finite element analysis(FEA)results while reducing computational time from approximately 40 s(FEA)to just 1-2 s per prediction.Concurrently,the DQN optimizes design parameters through scaling and rotation operations,enhancing structural performance based on predicted stress metrics.Our approach resulted in a 4.3%improvement in average stress uniformity and a 23.1%reduction in maximum stress concentration.These improvements were validated through FEA simulations and experimental tensile tests on 3D-printed thermoplastic polyurethane samples.The tensile strength of the optimized samples increased from an initial average of 5.9-6.6 MPa under 100%strain,demonstrating enhanced mechanical resilience.This study demonstrates the efficacy of combining advanced AI techniques for rapid and precise material design optimization,providing a scalable and cost-effective solution for developing superior lightweight materials with tailored mechanical properties for critical engineering applications.
基金funded by the Deanship of Graduate Studies and Scientific Research at Jouf University under grant No.(DGSSR-2025-02-01295).
文摘Alzheimer’s Disease(AD)is a progressive neurodegenerative disorder that significantly affects cognitive function,making early and accurate diagnosis essential.Traditional Deep Learning(DL)-based approaches often struggle with low-contrast MRI images,class imbalance,and suboptimal feature extraction.This paper develops a Hybrid DL system that unites MobileNetV2 with adaptive classification methods to boost Alzheimer’s diagnosis by processing MRI scans.Image enhancement is done using Contrast-Limited Adaptive Histogram Equalization(CLAHE)and Enhanced Super-Resolution Generative Adversarial Networks(ESRGAN).A classification robustness enhancement system integrates class weighting techniques and a Matthews Correlation Coefficient(MCC)-based evaluation method into the design.The trained and validated model gives a 98.88%accuracy rate and 0.9614 MCC score.We also performed a 10-fold cross-validation experiment with an average accuracy of 96.52%(±1.51),a loss of 0.1671,and an MCC score of 0.9429 across folds.The proposed framework outperforms the state-of-the-art models with a 98%weighted F1-score while decreasing misdiagnosis results for every AD stage.The model demonstrates apparent separation abilities between AD progression stages according to the results of the confusion matrix analysis.These results validate the effectiveness of hybrid DL models with adaptive preprocessing for early and reliable Alzheimer’s diagnosis,contributing to improved computer-aided diagnosis(CAD)systems in clinical practice.
基金supported by the National Natural Science Foundation of China(No.61872187).
文摘Recently,Generative Adversarial Networks(GANs)have become the mainstream text-to-image(T2I)framework.However,a standard normal distribution noise of inputs cannot provide sufficient information to synthesize an image that approaches the ground-truth image distribution.Moreover,the multistage generation strategy results in complex T2I applications.Therefore,this study proposes a novel feature-grounded single-stage T2I model,which considers the“real”distribution learned from training images as one input and introduces a worst-case-optimized similarity measure into the loss function to enhance the model's generation capacity.Experimental results on two benchmark datasets demonstrate the competitive performance of the proposed model in terms of the Frechet inception distance and inception score compared to those of some classical and state-of-the-art models,showing the improved similarities among the generated image,text,and ground truth.