The growing demand for wireless connectivity has made massive multiple-input multiple-output(MIMO)a cornerstone of modern communication systems.To optimize network performance and resource allocation,an efficient and ...The growing demand for wireless connectivity has made massive multiple-input multiple-output(MIMO)a cornerstone of modern communication systems.To optimize network performance and resource allocation,an efficient and robust approach is joint device activity detection and channel estimation.In this paper,we present an approach utilizing score-based generative models to address the underdetermined nature of channel estimation,which is data-driven and well-suited for the complex and dynamic environment of massive MIMO systems.Our experimental results,based on a comprehensive dataset generated through Monte-Carlo sampling,demonstrate the high precision of our channel estimation approach,with errors reduced to as low as-45 d B,and exceptional accuracy in detecting active devices.展开更多
AIM:To build a functional generalized estimating equation(GEE)model to detect glaucomatous visual field progression and compare the performance of the proposed method with that of commonly employed algorithms.METHODS:...AIM:To build a functional generalized estimating equation(GEE)model to detect glaucomatous visual field progression and compare the performance of the proposed method with that of commonly employed algorithms.METHODS:Totally 716 eyes of 716 patients with primary open angle glaucoma(POAG)with at least 5 reliable 24-2 test results and 2y of follow-up were selected.The functional GEE model was used to detect perimetric progression in the training dataset(501 eyes).In the testing dataset(215 eyes),progression was evaluated the functional GEE model,mean deviation(MD)and visual field index(VFI)rates of change,Advanced Glaucoma Intervention Study(AGIS)and Collaborative Initial Glaucoma Treatment Study(CIGTS)scores,and pointwise linear regression(PLR).RESULTS:The proposed method showed the highest proportion of eyes detected as progression(54.4%),followed by the VFI rate(34.4%),PLR(23.3%),and MD rate(21.4%).The CIGTS and AGIS scores had a lower proportion of eyes detected as progression(7.9%and 5.1%,respectively).The time to detection of progression was significantly shorter for the proposed method than that of other algorithms(adjusted P≤0.019).The VFI rate displayed moderate pairwise agreement with the proposed method(k=0.47).CONCLUSION:The functional GEE model shows the highest proportion of eyes detected as perimetric progression and the shortest time to detect perimetric progression in patients with POAG.展开更多
With the miniaturization of devices and the development of modern heating technologies,the generalization of heat conduction and thermoelastic coupling has become crucial,effectively emulating the thermodynamic behavi...With the miniaturization of devices and the development of modern heating technologies,the generalization of heat conduction and thermoelastic coupling has become crucial,effectively emulating the thermodynamic behavior of materials in ultrashort time scales.Theoretically,generalized heat conductive models are considered in this work.By analogy with mechanical viscoelastic models,this paper further enriches the heat conduction models and gives their one-dimensional physical expression.Numerically,the transient thermoelastic response of the slim strip material under thermal shock is investigated by applying the proposed models.First,the analytical solution in the Laplace domain is obtained by the Laplace transform.Then,the numerical results of the transient responses are obtained by the numerical inverse Laplace transform.Finally,the transient responses of different models are analyzed and compared,and the effects of material parameters are discussed.This work not only opens up new research perspectives on generalized heat conductive and thermoelastic coupling theories,but also is expected to be beneficial for the deeper understanding of the heat wave theory.展开更多
We analyze the suitability of existing pre-trained transformer-based language models(PLMs)for abstractive text summarization on German technical healthcare texts.The study focuses on the multilingual capabilities of t...We analyze the suitability of existing pre-trained transformer-based language models(PLMs)for abstractive text summarization on German technical healthcare texts.The study focuses on the multilingual capabilities of these models and their ability to perform the task of abstractive text summarization in the healthcare field.The research hypothesis was that large language models could perform high-quality abstractive text summarization on German technical healthcare texts,even if the model is not specifically trained in that language.Through experiments,the research questions explore the performance of transformer language models in dealing with complex syntax constructs,the difference in performance between models trained in English and German,and the impact of translating the source text to English before conducting the summarization.We conducted an evaluation of four PLMs(GPT-3,a translation-based approach also utilizing GPT-3,a German language Model,and a domain-specific bio-medical model approach).The evaluation considered the informativeness using 3 types of metrics based on Recall-Oriented Understudy for Gisting Evaluation(ROUGE)and the quality of results which is manually evaluated considering 5 aspects.The results show that text summarization models could be used in the German healthcare domain and that domain-independent language models achieved the best results.The study proves that text summarization models can simplify the search for pre-existing German knowledge in various domains.展开更多
This study explores a novel educational model of generative AI-empowered interdisciplinary project-based learning(PBL).By analyzing the current applications of generative AI technology in information technology curric...This study explores a novel educational model of generative AI-empowered interdisciplinary project-based learning(PBL).By analyzing the current applications of generative AI technology in information technology curricula,it elucidates its advantages and operational mechanisms in interdisciplinary PBL.Combining case studies and empirical research,the investigation proposes implementation pathways and strategies for the generative AI-enhanced interdisciplinary PBL model,detailing specific applications across three phases:project preparation,implementation,and evaluation.The research demonstrates that generative AI-enabled interdisciplinary project-based learning can effectively enhance students’learning motivation,interdisciplinary thinking capabilities,and innovative competencies,providing new conceptual frameworks and practical approaches for educational model innovation.展开更多
Over the past century,advancements in chemistry have significantly propelled human innovation,enhancing both industrial and consumer products.However,this rapid progression has resulted in chemical pollution increasin...Over the past century,advancements in chemistry have significantly propelled human innovation,enhancing both industrial and consumer products.However,this rapid progression has resulted in chemical pollution increasingly surpassing planetary boundaries,as production and release rates have outpaced our monitoring capabilities.To catalyze more impactful efforts,this study transitions from traditional chemical assessment to inverse chemical design,introducing a generative graph latent diffusion model aimed at discovering safer alternatives.In a case study on the design of green solvents for cyclohexane/benzene extraction distillation,we constructed a design database encompassing functional,environmental hazards,and process constraints.Virtual screening of previous design dataset revealed distinct trade-off trends between these design requirements.Based on the screening outcomes,an unconstrained generative model was developed,which covered a broader chemical space and demonstrated superior capabilities for structural interpolation and extrapolation.To further optimize molecular generation towards desired properties,a multi-objective latent diffusion method was applied,yielding 19 candidate molecules.Of these,7 were identified in PubChem as the most viable green solvent candidates,while the remaining 12 as potential novel candidates.Overall,this study effectively designed green solvent candidates for safer and more sustainable industrial production,setting a promising precedent for the development of environmentally friendly alternatives in other areas of chemical research.展开更多
This study focuses on the construction and application of intelligent financial decision-making models driven by generative artificial intelligence(AI).It analyzes the mechanisms by which generative AI empowers financ...This study focuses on the construction and application of intelligent financial decision-making models driven by generative artificial intelligence(AI).It analyzes the mechanisms by which generative AI empowers financial decision-making within a dual framework of dynamic knowledge evolution and risk control.The research reveals that generative AI,with its superior data processing,pattern recognition,and autonomous learning capabilities,can transcend the limitations of traditional decision-making models,facilitating a significant shift from causal inference to probabilistic creation in decision-making paradigms.By systematically constructing an intelligent financial decision-making model that includes data governance,core engine,and decision output layers,the study clarifies the functional roles and collaborative mechanisms of each layer.Additionally,it addresses key challenges in technology application,institutional adaptation,and organizational transformation by proposing systematic strategies for technical risk management,institutional innovation,and organizational capability enhancement,aiming to provide robust theoretical support and practical guidance for the intelligent transformation of corporate financial decision-making.展开更多
We employed random distributions and gradient descent methods for the Generator Coordinate Method(GCM)to identify effective basis wave functions,taking halo nuclei ^(6)He and ^(6)Li as examples.By comparing the ground...We employed random distributions and gradient descent methods for the Generator Coordinate Method(GCM)to identify effective basis wave functions,taking halo nuclei ^(6)He and ^(6)Li as examples.By comparing the ground state(0^(+))energy of ^(6)He and the excited state(0^(+))energy of 6 Li calculated with various random distributions and manually selected generation coordinates,we found that the heavy tail characteristic of the logistic distribution better describes the features of the halo nuclei.Subsequently,the Adam algorithm from machine learning was applied to optimize the basis wave functions,indicating that a limited number of basis wave functions can approximate the converged values.These results offer some empirical insights for selecting basis wave functions and contribute to the broader application of machine learning methods in predicting effective basis wave functions.展开更多
Inverse design has long been an efficient and powerful design tool in the aircraft industry.In this paper,a novel inverse design method for supercritical airfoils is proposed based on generative models in deep learnin...Inverse design has long been an efficient and powerful design tool in the aircraft industry.In this paper,a novel inverse design method for supercritical airfoils is proposed based on generative models in deep learning.A Conditional Variational Auto Encoder(CVAE)and an integrated generative network CVAE-GAN that combines the CVAE with the Wasserstein Generative Adversarial Networks(WGAN),are conducted as generative models.They are used to generate target wall Mach distributions for the inverse design that matches specified features,such as locations of suction peak,shock and aft loading.Qualitative and quantitative results show that both adopted generative models can generate diverse and realistic wall Mach number distributions satisfying the given features.The CVAE-GAN model outperforms the CVAE model and achieves better reconstruction accuracies for all the samples in the dataset.Furthermore,a deep neural network for nonlinear mapping is adopted to obtain the airfoil shape corresponding to the target wall Mach number distribution.The performances of the designed deep neural network are fully demonstrated and a smoothness measurement is proposed to quantify small oscillations in the airfoil surface,proving the authenticity and accuracy of the generated airfoil shapes.展开更多
The growth of Sakhalin fir(Abies sachalinen-sis)seedlings,an important forest tree species in northern Hokkaido,Japan,is significantly affected by competition from surrounding vegetation,especially evergreen dwarf bam...The growth of Sakhalin fir(Abies sachalinen-sis)seedlings,an important forest tree species in northern Hokkaido,Japan,is significantly affected by competition from surrounding vegetation,especially evergreen dwarf bamboo.In this study,we investigated the height and root collar diameter(RCD)growth of Sakhalin fir seedlings under various degrees of cover by deciduous vegetation and evergreen dwarf bamboo.Generalized additive models were used to quantify the effects of canopy cover and forest floor cover on the relative growth rates of these two parameters.The canopy cover of Sakhalin fir seedlings had a nonlin-ear negative effect on both the height growth of seedlings in the subsequent year and the RCD growth in the current year,given the general growth pattern in this species,where height growth ceases in early summer and RCD growth con-tinues until autumn.Height growth declined sharply after the canopy cover rate exceeded 50%,while RCD growth declined rapidly between 0 and 50%canopy cover rate.The forest floor cover had a greater negative impact on RCD growth than on height growth.These results suggested that Sakhalin fir seedlings respond to vegetative competition by prioritizing height growth for light acquisition at the expense of diameter growth and possibly root growth for below-ground competition.The cover of evergreen dwarf bamboo reduced the height growth of fir seedlings significantly more than the cover of deciduous vegetation.This difference is likely due to the timing of light availability.When competing with deciduous vegetation,Sakhalin fir seedlings exposed to light during the post-snow melt and early spring before the development of the deciduous vegetation canopy can photosynthesize more effectively,leading to greater height growth.The results of this study highlighted the importance of vegetation control considering the type of vegetation for successful Sakhalin fir reforestation.Adjusting the intensity and timing of weeding based on the presence and abundance of dwarf bamboo and other competing vegetation could potentially reduce weeding costs and increase biodiversity in reforested areas.展开更多
Panoramic images, offering a 360-degree view, are essential in virtual reality(VR) and augmented reality(AR), enhancing realism with high-quality textures. However, acquiring complete and high-quality panoramic textur...Panoramic images, offering a 360-degree view, are essential in virtual reality(VR) and augmented reality(AR), enhancing realism with high-quality textures. However, acquiring complete and high-quality panoramic textures is challenging. This paper introduces a method using generative adversarial networks(GANs) and the contrastive language-image pretraining(CLIP) model to restore and control texture in panoramic images. The GAN model captures complex structures and maintains consistency, while CLIP enables fine-grained texture control via semantic text-image associations. GAN inversion optimizes latent codes for precise texture details. The resulting low dynamic range(LDR) images are converted to high dynamic range(HDR) using the Blender engine for seamless texture blending. Experimental results demonstrate the effectiveness and flexibility of this method in panoramic texture restoration and generation.展开更多
Brain encoding and decoding via functional magnetic resonance imaging(fMRI)are two important aspects of visual perception neuroscience.Although previous researchers have made significant advances in brain encoding and...Brain encoding and decoding via functional magnetic resonance imaging(fMRI)are two important aspects of visual perception neuroscience.Although previous researchers have made significant advances in brain encoding and decoding models,existing methods still require improvement using advanced machine learning techniques.For example,traditional methods usually build the encoding and decoding models separately,and are prone to overfitting on a small dataset.In fact,effectively unifying the encoding and decoding procedures may allow for more accurate predictions.In this paper,we first review the existing encoding and decoding methods and discuss the potential advantages of a“bidirectional”modeling strategy.Next,we show that there are correspondences between deep neural networks and human visual streams in terms of the architecture and computational rules.Furthermore,deep generative models(e.g.,variational autoencoders(VAEs)and generative adversarial networks(GANs))have produced promising results in studies on brain encoding and decoding.Finally,we propose that the dual learning method,which was originally designed for machine translation tasks,could help to improve the performance of encoding and decoding models by leveraging large-scale unpaired data.展开更多
Natural products(NPs) have long been recognized as a valuable resource for drug discovery, and bringing NP-related features to virtual libraries is believed to be an effective way to increase the coverage of druggab...Natural products(NPs) have long been recognized as a valuable resource for drug discovery, and bringing NP-related features to virtual libraries is believed to be an effective way to increase the coverage of druggable chemical space. Here, deep learning-based molecule generative model, which is a recent technique in de novo molecule design, was applied to generate virtual libraries with NP-like properties. Results demonstrated that the model was effective in generating molecules that highly resemble NPs. Moreover, the model was also found to be capable of generating NP-like molecules that were also easy to synthesize, significantly increasing the practical value of the compound library.展开更多
Deep learning(DL)has proven to be important for computed tomography(CT)image denoising.However,such models are usually trained under supervision,requiring paired data that may be difficult to obtain in practice.Diffus...Deep learning(DL)has proven to be important for computed tomography(CT)image denoising.However,such models are usually trained under supervision,requiring paired data that may be difficult to obtain in practice.Diffusion models offer unsupervised means of solving a wide range of inverse problems via posterior sampling.In particular,using the estimated unconditional score function of the prior distribution,obtained via unsupervised learning,one can sample from the desired posterior via hijacking and regularization.However,due to the iterative solvers used,the number of function evaluations(NFE)required may be orders of magnitudes larger than for single-step samplers.In this paper,we present a novel image denoising technique for photon-counting CT by extending the unsupervised approach to inverse problem solving to the case of Poisson flow generative models(PFGM)++.By hijacking and regularizing the sampling process we obtain a single-step sampler,that is NFE=1.Our proposed method incorporates posterior sampling using diffusion models as a special case.We demonstrate that the added robustness afforded by the PFGM++framework yields significant performance gains.Our results indicate competitive performance compared to popular supervised,including state-of-the-art diffusion-style models with NFE=1(consistency models),unsupervised,and non-DL-based image denoising techniques,on clinical low-dose CT data and clinical images from a prototype photon-counting CT system developed by GE HealthCare.展开更多
In order to detect whether the data conforms to the given model, it is necessary to diagnose the data in the statistical way. The diagnostic problem in generalized nonlinear models based on the maximum Lq-likelihood e...In order to detect whether the data conforms to the given model, it is necessary to diagnose the data in the statistical way. The diagnostic problem in generalized nonlinear models based on the maximum Lq-likelihood estimation is considered. Three diagnostic statistics are used to detect whether the outliers exist in the data set. Simulation results show that when the sample size is small, the values of diagnostic statistics based on the maximum Lq-likelihood estimation are greater than the values based on the maximum likelihood estimation. As the sample size increases, the difference between the values of the diagnostic statistics based on two estimation methods diminishes gradually. It means that the outliers can be distinguished easier through the maximum Lq-likelihood method than those through the maximum likelihood estimation method.展开更多
Large language models(LLMs)have undergone significant expansion and have been increasingly integrated across various domains.Notably,in the realm of robot task planning,LLMs harness their advanced reasoning and langua...Large language models(LLMs)have undergone significant expansion and have been increasingly integrated across various domains.Notably,in the realm of robot task planning,LLMs harness their advanced reasoning and language comprehension capabilities to formulate precise and efficient action plans based on natural language instructions.However,for embodied tasks,where robots interact with complex environments,textonly LLMs often face challenges due to a lack of compatibility with robotic visual perception.This study provides a comprehensive overview of the emerging integration of LLMs and multimodal LLMs into various robotic tasks.Additionally,we propose a framework that utilizes multimodal GPT-4V to enhance embodied task planning through the combination of natural language instructions and robot visual perceptions.Our results,based on diverse datasets,indicate that GPT-4V effectively enhances robot performance in embodied tasks.This extensive survey and evaluation of LLMs and multimodal LLMs across a variety of robotic tasks enriches the understanding of LLM-centric embodied intelligence and provides forward-looking insights towards bridging the gap in Human-Robot-Environment interaction.展开更多
The Coronavirus Disease 2019(COVID-19)is wreaking havoc around the world,bring out that the enormous pressure on national health and medical staff systems.One of the most effective and critical steps in the fight agai...The Coronavirus Disease 2019(COVID-19)is wreaking havoc around the world,bring out that the enormous pressure on national health and medical staff systems.One of the most effective and critical steps in the fight against COVID-19,is to examine the patient’s lungs based on the Chest X-ray and CT generated by radiation imaging.In this paper,five keras-related deep learning models:ResNet50,InceptionResNetV2,Xception,transfer learning and pre-trained VGGNet16 is applied to formulate an classification-detection approaches of COVID-19.Two benchmark methods SVM(Support Vector Machine),CNN(Conventional Neural Networks)are provided to compare with the classification-detection approaches based on the performance indicators,i.e.,precision,recall,F1 scores,confusion matrix,classification accuracy and three types of AUC(Area Under Curve).The highest classification accuracy derived by classification-detection based on 5857 Chest X-rays and 767 Chest CTs are respectively 84%and 75%,which shows that the keras-related deep learning approaches facilitate accurate and effective COVID-19-assisted detection.展开更多
Recently,diffusion models have emerged as a promising paradigm for molecular design and optimization.However,most diffusion-based molecular generative models focus on modeling 2D graphs or 3D geom-etries,with limited ...Recently,diffusion models have emerged as a promising paradigm for molecular design and optimization.However,most diffusion-based molecular generative models focus on modeling 2D graphs or 3D geom-etries,with limited research on molecular sequence diffusion models.The International Union of Pure and Applied Chemistry(IUPAC)names are more akin to chemical natural language than the simplified molecular input line entry system(SMILES)for organic compounds.In this work,we apply an IUPAC-guided conditional diffusion model to facilitate molecular editing from chemical natural language to chemical language(SMILES)and explore whether the pre-trained generative performance of diffusion models can be transferred to chemical natural language.We propose DiffIUPAC,a controllable molecular editing diffusion model that converts IUPAC names to SMILES strings.Evaluation results demonstrate that our model out-performs existing methods and successfully captures the semantic rules of both chemical languages.Chemical space and scaffold analysis show that the model can generate similar compounds with diverse scaffolds within the specified constraints.Additionally,to illustrate the model’s applicability in drug design,we conducted case studies in functional group editing,analogue design and linker design.展开更多
Integration of digital twin(DT)and wireless channel provides new solution of channel modeling and simulation,and can assist to design,optimize and evaluate intelligent wireless communication system and networks.With D...Integration of digital twin(DT)and wireless channel provides new solution of channel modeling and simulation,and can assist to design,optimize and evaluate intelligent wireless communication system and networks.With DT channel modeling,the generated channel data can be closer to realistic channel measurements without requiring a prior channel model,and amount of channel data can be significantly increased.Artificial intelligence(AI)based modeling approach shows outstanding performance to solve such problems.In this work,a channel modeling method based on generative adversarial networks is proposed for DT channel,which can generate identical statistical distribution with measured channel.Model validation is conducted by comparing DT channel characteristics with measurements,and results show that DT channel leads to fairly good agreement with measured channel.Finally,a link-layer simulation is implemented based on DT channel.It is found that the proposed DT channel model can be well used to conduct link-layer simulation and its performance is comparable to using measurement data.The observations and results can facilitate the development of DT channel modeling and provide new thoughts for DT channel applications,as well as improving the performance and reliability of intelligent communication networking.展开更多
For reservoirs with complex non-Gaussian geological characteristics,such as carbonate reservoirs or reservoirs with sedimentary facies distribution,it is difficult to implement history matching directly,especially for...For reservoirs with complex non-Gaussian geological characteristics,such as carbonate reservoirs or reservoirs with sedimentary facies distribution,it is difficult to implement history matching directly,especially for the ensemble-based data assimilation methods.In this paper,we propose a multi-source information fused generative adversarial network(MSIGAN)model,which is used for parameterization of the complex geologies.In MSIGAN,various information such as facies distribution,microseismic,and inter-well connectivity,can be integrated to learn the geological features.And two major generative models in deep learning,variational autoencoder(VAE)and generative adversarial network(GAN)are combined in our model.Then the proposed MSIGAN model is integrated into the ensemble smoother with multiple data assimilation(ESMDA)method to conduct history matching.We tested the proposed method on two reservoir models with fluvial facies.The experimental results show that the proposed MSIGAN model can effectively learn the complex geological features,which can promote the accuracy of history matching.展开更多
文摘The growing demand for wireless connectivity has made massive multiple-input multiple-output(MIMO)a cornerstone of modern communication systems.To optimize network performance and resource allocation,an efficient and robust approach is joint device activity detection and channel estimation.In this paper,we present an approach utilizing score-based generative models to address the underdetermined nature of channel estimation,which is data-driven and well-suited for the complex and dynamic environment of massive MIMO systems.Our experimental results,based on a comprehensive dataset generated through Monte-Carlo sampling,demonstrate the high precision of our channel estimation approach,with errors reduced to as low as-45 d B,and exceptional accuracy in detecting active devices.
基金Supported by the Korea Health Technology R&D Project through the Korea Health Industry Development Institute(KHIDI),funded by the Ministry of Health&Welfare,Republic of Korea(No.HR20C0026)the National Research Foundation of Korea(NRF)(No.RS-2023-00247504)the Patient-Centered Clinical Research Coordinating Center,funded by the Ministry of Health&Welfare,Republic of Korea(No.HC19C0276).
文摘AIM:To build a functional generalized estimating equation(GEE)model to detect glaucomatous visual field progression and compare the performance of the proposed method with that of commonly employed algorithms.METHODS:Totally 716 eyes of 716 patients with primary open angle glaucoma(POAG)with at least 5 reliable 24-2 test results and 2y of follow-up were selected.The functional GEE model was used to detect perimetric progression in the training dataset(501 eyes).In the testing dataset(215 eyes),progression was evaluated the functional GEE model,mean deviation(MD)and visual field index(VFI)rates of change,Advanced Glaucoma Intervention Study(AGIS)and Collaborative Initial Glaucoma Treatment Study(CIGTS)scores,and pointwise linear regression(PLR).RESULTS:The proposed method showed the highest proportion of eyes detected as progression(54.4%),followed by the VFI rate(34.4%),PLR(23.3%),and MD rate(21.4%).The CIGTS and AGIS scores had a lower proportion of eyes detected as progression(7.9%and 5.1%,respectively).The time to detection of progression was significantly shorter for the proposed method than that of other algorithms(adjusted P≤0.019).The VFI rate displayed moderate pairwise agreement with the proposed method(k=0.47).CONCLUSION:The functional GEE model shows the highest proportion of eyes detected as perimetric progression and the shortest time to detect perimetric progression in patients with POAG.
基金Project supported by the Guangdong Basic and Applied Basic Research Foundation of China(No.2023A1515012809)the Natural Science Foundation of Shaanxi Province of China(No.2023-JC-YB-073)the Fundamental Research Funds for the Central Universities of China(No.D5000230066)。
文摘With the miniaturization of devices and the development of modern heating technologies,the generalization of heat conduction and thermoelastic coupling has become crucial,effectively emulating the thermodynamic behavior of materials in ultrashort time scales.Theoretically,generalized heat conductive models are considered in this work.By analogy with mechanical viscoelastic models,this paper further enriches the heat conduction models and gives their one-dimensional physical expression.Numerically,the transient thermoelastic response of the slim strip material under thermal shock is investigated by applying the proposed models.First,the analytical solution in the Laplace domain is obtained by the Laplace transform.Then,the numerical results of the transient responses are obtained by the numerical inverse Laplace transform.Finally,the transient responses of different models are analyzed and compared,and the effects of material parameters are discussed.This work not only opens up new research perspectives on generalized heat conductive and thermoelastic coupling theories,but also is expected to be beneficial for the deeper understanding of the heat wave theory.
文摘We analyze the suitability of existing pre-trained transformer-based language models(PLMs)for abstractive text summarization on German technical healthcare texts.The study focuses on the multilingual capabilities of these models and their ability to perform the task of abstractive text summarization in the healthcare field.The research hypothesis was that large language models could perform high-quality abstractive text summarization on German technical healthcare texts,even if the model is not specifically trained in that language.Through experiments,the research questions explore the performance of transformer language models in dealing with complex syntax constructs,the difference in performance between models trained in English and German,and the impact of translating the source text to English before conducting the summarization.We conducted an evaluation of four PLMs(GPT-3,a translation-based approach also utilizing GPT-3,a German language Model,and a domain-specific bio-medical model approach).The evaluation considered the informativeness using 3 types of metrics based on Recall-Oriented Understudy for Gisting Evaluation(ROUGE)and the quality of results which is manually evaluated considering 5 aspects.The results show that text summarization models could be used in the German healthcare domain and that domain-independent language models achieved the best results.The study proves that text summarization models can simplify the search for pre-existing German knowledge in various domains.
文摘This study explores a novel educational model of generative AI-empowered interdisciplinary project-based learning(PBL).By analyzing the current applications of generative AI technology in information technology curricula,it elucidates its advantages and operational mechanisms in interdisciplinary PBL.Combining case studies and empirical research,the investigation proposes implementation pathways and strategies for the generative AI-enhanced interdisciplinary PBL model,detailing specific applications across three phases:project preparation,implementation,and evaluation.The research demonstrates that generative AI-enabled interdisciplinary project-based learning can effectively enhance students’learning motivation,interdisciplinary thinking capabilities,and innovative competencies,providing new conceptual frameworks and practical approaches for educational model innovation.
基金supported by Shanghai Science and Technology Commission Project(No.21DZ1201502)Shanghai Municipal Bureau of Ecology and Environment(Shanghai Environ-mental Science[2023]No.40)+1 种基金the Interdisciplinary Joint Research Project of Tongji University(No.2022-4-YB-12)Shanghai Science and Technology Commission Project(No.22DZ2200200).
文摘Over the past century,advancements in chemistry have significantly propelled human innovation,enhancing both industrial and consumer products.However,this rapid progression has resulted in chemical pollution increasingly surpassing planetary boundaries,as production and release rates have outpaced our monitoring capabilities.To catalyze more impactful efforts,this study transitions from traditional chemical assessment to inverse chemical design,introducing a generative graph latent diffusion model aimed at discovering safer alternatives.In a case study on the design of green solvents for cyclohexane/benzene extraction distillation,we constructed a design database encompassing functional,environmental hazards,and process constraints.Virtual screening of previous design dataset revealed distinct trade-off trends between these design requirements.Based on the screening outcomes,an unconstrained generative model was developed,which covered a broader chemical space and demonstrated superior capabilities for structural interpolation and extrapolation.To further optimize molecular generation towards desired properties,a multi-objective latent diffusion method was applied,yielding 19 candidate molecules.Of these,7 were identified in PubChem as the most viable green solvent candidates,while the remaining 12 as potential novel candidates.Overall,this study effectively designed green solvent candidates for safer and more sustainable industrial production,setting a promising precedent for the development of environmentally friendly alternatives in other areas of chemical research.
文摘This study focuses on the construction and application of intelligent financial decision-making models driven by generative artificial intelligence(AI).It analyzes the mechanisms by which generative AI empowers financial decision-making within a dual framework of dynamic knowledge evolution and risk control.The research reveals that generative AI,with its superior data processing,pattern recognition,and autonomous learning capabilities,can transcend the limitations of traditional decision-making models,facilitating a significant shift from causal inference to probabilistic creation in decision-making paradigms.By systematically constructing an intelligent financial decision-making model that includes data governance,core engine,and decision output layers,the study clarifies the functional roles and collaborative mechanisms of each layer.Additionally,it addresses key challenges in technology application,institutional adaptation,and organizational transformation by proposing systematic strategies for technical risk management,institutional innovation,and organizational capability enhancement,aiming to provide robust theoretical support and practical guidance for the intelligent transformation of corporate financial decision-making.
基金supported by the National Key R&D Program of China(No.2023YFA1606701)the National Natural Science Foundation of China(Nos.12175042,11890710,11890714,12047514,12147101,and 12347106)+1 种基金Guangdong Major Project of Basic and Applied Basic Research(No.2020B0301030008)China National Key R&D Program(No.2022YFA1602402).
文摘We employed random distributions and gradient descent methods for the Generator Coordinate Method(GCM)to identify effective basis wave functions,taking halo nuclei ^(6)He and ^(6)Li as examples.By comparing the ground state(0^(+))energy of ^(6)He and the excited state(0^(+))energy of 6 Li calculated with various random distributions and manually selected generation coordinates,we found that the heavy tail characteristic of the logistic distribution better describes the features of the halo nuclei.Subsequently,the Adam algorithm from machine learning was applied to optimize the basis wave functions,indicating that a limited number of basis wave functions can approximate the converged values.These results offer some empirical insights for selecting basis wave functions and contribute to the broader application of machine learning methods in predicting effective basis wave functions.
基金co-supported by the National Key Project of China(No.GJXM92579)the National Natural Science Foundation of China(Nos.92052203,61903178 and61906081)。
文摘Inverse design has long been an efficient and powerful design tool in the aircraft industry.In this paper,a novel inverse design method for supercritical airfoils is proposed based on generative models in deep learning.A Conditional Variational Auto Encoder(CVAE)and an integrated generative network CVAE-GAN that combines the CVAE with the Wasserstein Generative Adversarial Networks(WGAN),are conducted as generative models.They are used to generate target wall Mach distributions for the inverse design that matches specified features,such as locations of suction peak,shock and aft loading.Qualitative and quantitative results show that both adopted generative models can generate diverse and realistic wall Mach number distributions satisfying the given features.The CVAE-GAN model outperforms the CVAE model and achieves better reconstruction accuracies for all the samples in the dataset.Furthermore,a deep neural network for nonlinear mapping is adopted to obtain the airfoil shape corresponding to the target wall Mach number distribution.The performances of the designed deep neural network are fully demonstrated and a smoothness measurement is proposed to quantify small oscillations in the airfoil surface,proving the authenticity and accuracy of the generated airfoil shapes.
基金supported by the Ministry of Agriculture,Forestry,and Fisheries of Japan (25093 C)JSPS KAKENHI (JP23H02262)
文摘The growth of Sakhalin fir(Abies sachalinen-sis)seedlings,an important forest tree species in northern Hokkaido,Japan,is significantly affected by competition from surrounding vegetation,especially evergreen dwarf bamboo.In this study,we investigated the height and root collar diameter(RCD)growth of Sakhalin fir seedlings under various degrees of cover by deciduous vegetation and evergreen dwarf bamboo.Generalized additive models were used to quantify the effects of canopy cover and forest floor cover on the relative growth rates of these two parameters.The canopy cover of Sakhalin fir seedlings had a nonlin-ear negative effect on both the height growth of seedlings in the subsequent year and the RCD growth in the current year,given the general growth pattern in this species,where height growth ceases in early summer and RCD growth con-tinues until autumn.Height growth declined sharply after the canopy cover rate exceeded 50%,while RCD growth declined rapidly between 0 and 50%canopy cover rate.The forest floor cover had a greater negative impact on RCD growth than on height growth.These results suggested that Sakhalin fir seedlings respond to vegetative competition by prioritizing height growth for light acquisition at the expense of diameter growth and possibly root growth for below-ground competition.The cover of evergreen dwarf bamboo reduced the height growth of fir seedlings significantly more than the cover of deciduous vegetation.This difference is likely due to the timing of light availability.When competing with deciduous vegetation,Sakhalin fir seedlings exposed to light during the post-snow melt and early spring before the development of the deciduous vegetation canopy can photosynthesize more effectively,leading to greater height growth.The results of this study highlighted the importance of vegetation control considering the type of vegetation for successful Sakhalin fir reforestation.Adjusting the intensity and timing of weeding based on the presence and abundance of dwarf bamboo and other competing vegetation could potentially reduce weeding costs and increase biodiversity in reforested areas.
文摘Panoramic images, offering a 360-degree view, are essential in virtual reality(VR) and augmented reality(AR), enhancing realism with high-quality textures. However, acquiring complete and high-quality panoramic textures is challenging. This paper introduces a method using generative adversarial networks(GANs) and the contrastive language-image pretraining(CLIP) model to restore and control texture in panoramic images. The GAN model captures complex structures and maintains consistency, while CLIP enables fine-grained texture control via semantic text-image associations. GAN inversion optimizes latent codes for precise texture details. The resulting low dynamic range(LDR) images are converted to high dynamic range(HDR) using the Blender engine for seamless texture blending. Experimental results demonstrate the effectiveness and flexibility of this method in panoramic texture restoration and generation.
基金This work was supported by the National Key Research and Development Program of China(2018YFC2001302)National Natural Science Foundation of China(91520202)+2 种基金Chinese Academy of Sciences Scientific Equipment Development Project(YJKYYQ20170050)Beijing Municipal Science and Technology Commission(Z181100008918010)Youth Innovation Promotion Association of Chinese Academy of Sciences,and Strategic Priority Research Program of Chinese Academy of Sciences(XDB32040200).
文摘Brain encoding and decoding via functional magnetic resonance imaging(fMRI)are two important aspects of visual perception neuroscience.Although previous researchers have made significant advances in brain encoding and decoding models,existing methods still require improvement using advanced machine learning techniques.For example,traditional methods usually build the encoding and decoding models separately,and are prone to overfitting on a small dataset.In fact,effectively unifying the encoding and decoding procedures may allow for more accurate predictions.In this paper,we first review the existing encoding and decoding methods and discuss the potential advantages of a“bidirectional”modeling strategy.Next,we show that there are correspondences between deep neural networks and human visual streams in terms of the architecture and computational rules.Furthermore,deep generative models(e.g.,variational autoencoders(VAEs)and generative adversarial networks(GANs))have produced promising results in studies on brain encoding and decoding.Finally,we propose that the dual learning method,which was originally designed for machine translation tasks,could help to improve the performance of encoding and decoding models by leveraging large-scale unpaired data.
基金The National Natural Science Foundation of China(Grant No.81573273,81673279,21572010 and 21772005)National Major Scientific and Technological Special Project for"Significant New Drugs Development"(Grant No.2018ZX09735001-003)
文摘Natural products(NPs) have long been recognized as a valuable resource for drug discovery, and bringing NP-related features to virtual libraries is believed to be an effective way to increase the coverage of druggable chemical space. Here, deep learning-based molecule generative model, which is a recent technique in de novo molecule design, was applied to generate virtual libraries with NP-like properties. Results demonstrated that the model was effective in generating molecules that highly resemble NPs. Moreover, the model was also found to be capable of generating NP-like molecules that were also easy to synthesize, significantly increasing the practical value of the compound library.
基金supported by MedTechLabs,GE HealthCare,the Swedish Research council,No.2021-05103the Göran Gustafsson foundation,No.2114.
文摘Deep learning(DL)has proven to be important for computed tomography(CT)image denoising.However,such models are usually trained under supervision,requiring paired data that may be difficult to obtain in practice.Diffusion models offer unsupervised means of solving a wide range of inverse problems via posterior sampling.In particular,using the estimated unconditional score function of the prior distribution,obtained via unsupervised learning,one can sample from the desired posterior via hijacking and regularization.However,due to the iterative solvers used,the number of function evaluations(NFE)required may be orders of magnitudes larger than for single-step samplers.In this paper,we present a novel image denoising technique for photon-counting CT by extending the unsupervised approach to inverse problem solving to the case of Poisson flow generative models(PFGM)++.By hijacking and regularizing the sampling process we obtain a single-step sampler,that is NFE=1.Our proposed method incorporates posterior sampling using diffusion models as a special case.We demonstrate that the added robustness afforded by the PFGM++framework yields significant performance gains.Our results indicate competitive performance compared to popular supervised,including state-of-the-art diffusion-style models with NFE=1(consistency models),unsupervised,and non-DL-based image denoising techniques,on clinical low-dose CT data and clinical images from a prototype photon-counting CT system developed by GE HealthCare.
基金The National Natural Science Foundation of China(No.11171065)the Natural Science Foundation of Jiangsu Province(No.BK2011058)
文摘In order to detect whether the data conforms to the given model, it is necessary to diagnose the data in the statistical way. The diagnostic problem in generalized nonlinear models based on the maximum Lq-likelihood estimation is considered. Three diagnostic statistics are used to detect whether the outliers exist in the data set. Simulation results show that when the sample size is small, the values of diagnostic statistics based on the maximum Lq-likelihood estimation are greater than the values based on the maximum likelihood estimation. As the sample size increases, the difference between the values of the diagnostic statistics based on two estimation methods diminishes gradually. It means that the outliers can be distinguished easier through the maximum Lq-likelihood method than those through the maximum likelihood estimation method.
基金supported by National Natural Science Foundation of China(62376219 and 62006194)Foundational Research Project in Specialized Discipline(Grant No.G2024WD0146)Faculty Construction Project(Grant No.24GH0201148).
文摘Large language models(LLMs)have undergone significant expansion and have been increasingly integrated across various domains.Notably,in the realm of robot task planning,LLMs harness their advanced reasoning and language comprehension capabilities to formulate precise and efficient action plans based on natural language instructions.However,for embodied tasks,where robots interact with complex environments,textonly LLMs often face challenges due to a lack of compatibility with robotic visual perception.This study provides a comprehensive overview of the emerging integration of LLMs and multimodal LLMs into various robotic tasks.Additionally,we propose a framework that utilizes multimodal GPT-4V to enhance embodied task planning through the combination of natural language instructions and robot visual perceptions.Our results,based on diverse datasets,indicate that GPT-4V effectively enhances robot performance in embodied tasks.This extensive survey and evaluation of LLMs and multimodal LLMs across a variety of robotic tasks enriches the understanding of LLM-centric embodied intelligence and provides forward-looking insights towards bridging the gap in Human-Robot-Environment interaction.
基金This project is supported by National Natural Science Foundation of China(NSFC)(Nos.61902158,61806087)Graduate student innovation program for academic degrees in general university in Jiangsu Province(No.KYZZ16-0337).
文摘The Coronavirus Disease 2019(COVID-19)is wreaking havoc around the world,bring out that the enormous pressure on national health and medical staff systems.One of the most effective and critical steps in the fight against COVID-19,is to examine the patient’s lungs based on the Chest X-ray and CT generated by radiation imaging.In this paper,five keras-related deep learning models:ResNet50,InceptionResNetV2,Xception,transfer learning and pre-trained VGGNet16 is applied to formulate an classification-detection approaches of COVID-19.Two benchmark methods SVM(Support Vector Machine),CNN(Conventional Neural Networks)are provided to compare with the classification-detection approaches based on the performance indicators,i.e.,precision,recall,F1 scores,confusion matrix,classification accuracy and three types of AUC(Area Under Curve).The highest classification accuracy derived by classification-detection based on 5857 Chest X-rays and 767 Chest CTs are respectively 84%and 75%,which shows that the keras-related deep learning approaches facilitate accurate and effective COVID-19-assisted detection.
基金supported by the Yonsei University graduate school Department of Integrative Biotechnology.
文摘Recently,diffusion models have emerged as a promising paradigm for molecular design and optimization.However,most diffusion-based molecular generative models focus on modeling 2D graphs or 3D geom-etries,with limited research on molecular sequence diffusion models.The International Union of Pure and Applied Chemistry(IUPAC)names are more akin to chemical natural language than the simplified molecular input line entry system(SMILES)for organic compounds.In this work,we apply an IUPAC-guided conditional diffusion model to facilitate molecular editing from chemical natural language to chemical language(SMILES)and explore whether the pre-trained generative performance of diffusion models can be transferred to chemical natural language.We propose DiffIUPAC,a controllable molecular editing diffusion model that converts IUPAC names to SMILES strings.Evaluation results demonstrate that our model out-performs existing methods and successfully captures the semantic rules of both chemical languages.Chemical space and scaffold analysis show that the model can generate similar compounds with diverse scaffolds within the specified constraints.Additionally,to illustrate the model’s applicability in drug design,we conducted case studies in functional group editing,analogue design and linker design.
基金supported by National Key R&D Program of China under Grant 2021YFB3901302 and 2021YFB2900301the National Natural Science Foundation of China under Grant 62271037,62001519,62221001,and U21A20445+1 种基金the State Key Laboratory of Advanced Rail Autonomous Operation under Grant RCS2022ZZ004the Fundamental Research Funds for the Central Universities under Grant 2022JBQY004.
文摘Integration of digital twin(DT)and wireless channel provides new solution of channel modeling and simulation,and can assist to design,optimize and evaluate intelligent wireless communication system and networks.With DT channel modeling,the generated channel data can be closer to realistic channel measurements without requiring a prior channel model,and amount of channel data can be significantly increased.Artificial intelligence(AI)based modeling approach shows outstanding performance to solve such problems.In this work,a channel modeling method based on generative adversarial networks is proposed for DT channel,which can generate identical statistical distribution with measured channel.Model validation is conducted by comparing DT channel characteristics with measurements,and results show that DT channel leads to fairly good agreement with measured channel.Finally,a link-layer simulation is implemented based on DT channel.It is found that the proposed DT channel model can be well used to conduct link-layer simulation and its performance is comparable to using measurement data.The observations and results can facilitate the development of DT channel modeling and provide new thoughts for DT channel applications,as well as improving the performance and reliability of intelligent communication networking.
基金supported by the National Natural Science Foundation of China under Grant 51722406,52074340,and 51874335the Shandong Provincial Natural Science Foundation under Grant JQ201808+5 种基金The Fundamental Research Funds for the Central Universities under Grant 18CX02097Athe Major Scientific and Technological Projects of CNPC under Grant ZD2019-183-008the Science and Technology Support Plan for Youth Innovation of University in Shandong Province under Grant 2019KJH002the National Research Council of Science and Technology Major Project of China under Grant 2016ZX05025001-006111 Project under Grant B08028Sinopec Science and Technology Project under Grant P20050-1
文摘For reservoirs with complex non-Gaussian geological characteristics,such as carbonate reservoirs or reservoirs with sedimentary facies distribution,it is difficult to implement history matching directly,especially for the ensemble-based data assimilation methods.In this paper,we propose a multi-source information fused generative adversarial network(MSIGAN)model,which is used for parameterization of the complex geologies.In MSIGAN,various information such as facies distribution,microseismic,and inter-well connectivity,can be integrated to learn the geological features.And two major generative models in deep learning,variational autoencoder(VAE)and generative adversarial network(GAN)are combined in our model.Then the proposed MSIGAN model is integrated into the ensemble smoother with multiple data assimilation(ESMDA)method to conduct history matching.We tested the proposed method on two reservoir models with fluvial facies.The experimental results show that the proposed MSIGAN model can effectively learn the complex geological features,which can promote the accuracy of history matching.