Discrete Bayesian Dose-response Analysis under Dose Uncertainty.Eduard Hofer1(1.3 Constance Road,Claremont,Cape Town 7708,South Africa.)Abstract:Establishing a relationship between disease and dose requires each indiv...Discrete Bayesian Dose-response Analysis under Dose Uncertainty.Eduard Hofer1(1.3 Constance Road,Claremont,Cape Town 7708,South Africa.)Abstract:Establishing a relationship between disease and dose requires each individual in the population under investigation to be known by disease status and by the value of the dose received.展开更多
Accurate retrieval of atmospheric vertical profiles is critical for improving weather prediction and climate monitoring.However,the complexity of atmospheric processes in cloudy regions poses challenges compared to th...Accurate retrieval of atmospheric vertical profiles is critical for improving weather prediction and climate monitoring.However,the complexity of atmospheric processes in cloudy regions poses challenges compared to those of clear sky scenarios.This study presents a novel framework that integrates Bayesian optimization and machine learning approaches to retrieve atmospheric vertical profiles—including temperature,humidity,ozone concentration,cloud fraction,ice water content(IWC),and liquid water content(LWC)—from hyperspectral infrared observations.Specifically,a Bayesian method was used to refine ERA5 reanalysis data by minimizing brightness temperature(BT)discrepancies against FY-4B Geostationary Interferometric Infrared Sounder(GIIRS)observations,generating a high-quality profile database(~2.8 million profiles)across diverse weather systems.The optimized profiles improve radiative consistency,reducing BT biases from>40 K to<10 K in cloudy regions.To further overcome the limitations of the Bayesian method,we developed a Transformer-Resnet hybrid model(TERNet),which achieved superior performance with RMSE values of 1.61 K(temperature),5.77%(humidity),and 2.25×10^(–6)/6.09×10^(–6)kg kg^(–1)(IWC/LWC)across the entire vertical levels in all-sky conditions.The TERNet outperforms both ERA5 in cloud parameter retrieval and the GIIRS L2 product in thermodynamic profiling.Independent verification with radiosonde and Cloud-Aerosol Lidar and Infrared Pathfinder Satellite Observations(CALIPSO)datasets confirms the framework's reliability across various meteorological regimes.This work demonstrates the capability of combining physics-informed Bayesian methods with data-driven machine learning to fully exploit hyperspectral IR data.展开更多
The integrated nested Laplace approximation(INLA)algorithm provides a computationally efficient approach for approximate Bayesian inference,overcoming the limitations of traditional Markov chain Monte Carlo(MCMC)metho...The integrated nested Laplace approximation(INLA)algorithm provides a computationally efficient approach for approximate Bayesian inference,overcoming the limitations of traditional Markov chain Monte Carlo(MCMC)methods.This paper reviews INLA algorithm and provides a systematic review of six key books that explore the theoretical foundations,practical implementations,and diverse applications of INLA.These six books cover spatial and spatio-temporal modelling,general Bayesian inference,SPDE-based spatial analysis,geospatial health data,regression modelling,and dynamic time series.In addition,these books highlight the versatility of INLA method in handling complex models while maintaining high computational efficiency.This paper begins with an introduction to the INLA method and algorithm,followed by a systematic review of six key publications in the field.展开更多
Wetting deformation in earth-rockfill dams is a critical factor influencingdam safety.Although numerous mathematical models have been developed to describe this phenomenon,most of them rely on empirical formulations a...Wetting deformation in earth-rockfill dams is a critical factor influencingdam safety.Although numerous mathematical models have been developed to describe this phenomenon,most of them rely on empirical formulations and lack prior knowledge of model parameters,which is essential for Bayesian parameter inversion to enhance accuracy and reduce uncertainty.This study introduces a datadriven approach to establishing prior knowledge of earth-rockfill dams.Driving factors are utilized to determine the potential range of model parameters,and settlement changes within this range are calculated.The results are iteratively compared with actual monitoring data until the calculated range encompasses the observed data,thereby providing prior knowledge of the model parameters.The proposed method is applied to the right-bank earth-rockfilldam of Danjiangkou.Employing a Gibbs sample size of 30,000,the proposed method effectively calibrates the prior knowledge of the wetting model parameters,achieving a root mean square error(RMSE)of 5.18 mm for the settlement predictions.By comparison,the use of non-informative priors with sample sizes of 30,000 and 50,000 results in significantly larger RMSE values of 11.97 mm and 16.07 mm,respectively.Furthermore,the computational efficiencyof the proposed method is demonstrated by an inversion computation time of 902 s for 30,000 samples,which is notably shorter than the 1026 s and 1558 s required for noninformative priors with 30,000 and 50,000 samples,respectively.These findingsunderscore the superior performance of the proposed approach in terms of both prediction accuracy and computational efficiency.These results demonstrate that the proposed method not only improves the predictive accuracy but also enhances the computational efficiency,enabling optimal parameter identificationwith reduced computational effort.This approach provides a robust and efficientframework for advancing dam safety assessments.展开更多
Xylogenesis,the process through which wood cells are formed,results in the long-term storage of carbon in woody biomass,making it a key component of the global carbon cycle.Understanding how environmental drivers infl...Xylogenesis,the process through which wood cells are formed,results in the long-term storage of carbon in woody biomass,making it a key component of the global carbon cycle.Understanding how environmental drivers influence xylogenesis during the growing season is therefore of great interest.However,studying shortterm drivers of wood production using xylogenetic data is complicated by the usual sampling scheme and the influence of eccentric growth,i.e.,heterogeneous growth around the stem.In this study,we improve xylogenesis research by introducing a statistical approach that explicitly considers seasonal phenology,short-term growth rates,and growth eccentricity.To this end,we developed Bayesian models of xylogenesis and compared them with a conventional method based on the use of Gompertz functions.Our results show that eccentricity generated high temporal autocorrelation between successive samples,and that explicitly taking it into account improved both the representativeness of phenology and intra-ring variability.We observed consistent short-term patterns in the model residuals,suggesting the influence of an unaccounted-for environmental variable on cell production.The proposed models offer several advantages over traditional methods,including robust confidence intervals around predictions,consistency with phenology,and reduced sensitivity to extreme observations at the end of the growing season,often linked to eccentric growth.These models also provide a benchmark for mechanistic testing of short-term drivers of wood formation.展开更多
This paper investigates the reliability of internal marine combustion engines using an integrated approach that combines Fault Tree Analysis(FTA)and Bayesian Networks(BN).FTA provides a structured,top-down method for ...This paper investigates the reliability of internal marine combustion engines using an integrated approach that combines Fault Tree Analysis(FTA)and Bayesian Networks(BN).FTA provides a structured,top-down method for identifying critical failure modes and their root causes,while BN introduces flexibility in probabilistic reasoning,enabling dynamic updates based on new evidence.This dual methodology overcomes the limitations of static FTA models,offering a comprehensive framework for system reliability analysis.Critical failures,including External Leakage(ELU),Failure to Start(FTS),and Overheating(OHE),were identified as key risks.By incorporating redundancy into high-risk components such as pumps and batteries,the likelihood of these failures was significantly reduced.For instance,redundant pumps reduced the probability of ELU by 31.88%,while additional batteries decreased the occurrence of FTS by 36.45%.The results underscore the practical benefits of combining FTA and BN for enhancing system reliability,particularly in maritime applications where operational safety and efficiency are critical.This research provides valuable insights for maintenance planning and highlights the importance of redundancy in critical systems,especially as the industry transitions toward more autonomous vessels.展开更多
To address the zero-sample challenge in preparation parameter design for newly developed alloys,a novel machine learning strategy that integrates basic dataset construction with Bayesian optimization,was proposed.The ...To address the zero-sample challenge in preparation parameter design for newly developed alloys,a novel machine learning strategy that integrates basic dataset construction with Bayesian optimization,was proposed.The impact of basic sample dataset construction methods,optimization benchmarks and multi-objective utility functions on Bayesian optimization was investigated.It was found that the combination of orthogonal design,linear benchmark,and shifted multiplicative utility function exhibits the best optimization performance.The strategy was then applied to a new Cu-Ni-Co-Si alloy with ultra-low Co content(0.7 wt.%Co),previously designed by our research team.Rapid optimization of six preparation parameters in the two-stage deformation and aging process of the zero-sample alloy was achieved through only 23 experiments.The measured ultimate tensile strength and electrical conductivity of the new alloy were 878 MPa and 44.0%(IACS),respectively,reaching the comprehensive performance level of the Cu-Ni-Co-Si alloy(C70350 alloy)containing 1.0-2.0 wt.%Co.展开更多
The reliable operation of power grid secondary equipment is an important guarantee for the safety and stability of the power system.However,various defects could be produced in the secondary equipment during longtermo...The reliable operation of power grid secondary equipment is an important guarantee for the safety and stability of the power system.However,various defects could be produced in the secondary equipment during longtermoperation.The complex relationship between the defect phenomenon andmulti-layer causes and the probabilistic influence of secondary equipment cannot be described through knowledge extraction and fusion technology by existing methods,which limits the real-time and accuracy of defect identification.Therefore,a defect recognition method based on the Bayesian network and knowledge graph fusion is proposed.The defect data of secondary equipment is transformed into the structured knowledge graph through knowledge extraction and fusion technology.The knowledge graph of power grid secondary equipment is mapped to the Bayesian network framework,combined with historical defect data,and introduced Noisy-OR nodes.The prior and conditional probabilities of the Bayesian network are then reasonably assigned to build a model that reflects the probability dependence between defect phenomena and potential causes in power grid secondary equipment.Defect identification of power grid secondary equipment is achieved by defect subgraph search based on the knowledge graph,and defect inference based on the Bayesian network.Practical application cases prove this method’s effectiveness in identifying secondary equipment defect causes,improving identification accuracy and efficiency.展开更多
This paper introduces a framework for modeling random fields,with a particular emphasis on analyzing anisotropic spatial variability.It establishes a clear connection between the correlation function and the Kriging v...This paper introduces a framework for modeling random fields,with a particular emphasis on analyzing anisotropic spatial variability.It establishes a clear connection between the correlation function and the Kriging variogram across various anisotropic modes,providing mathematical models to enhance our understanding of random fields.A new anisotropy index,called LSAI,is introduced to quantify anisotropy based on the autocorrelation length and the orientation of the principal axes within the variogram.An LSAI value closer to one indicates a lower degree of anisotropy.The present study examines how the degree of anisotropy varies with different autocorrelation lengths and angles between the principal axes,providing valuable insights into these relationships.To improve the accuracy of parameter probability distribution estimations,this study integrates limited field test data using a Bayesian inference approach.Additionally,the Markov chain Monte Carlo simulation method is employed to develop a conditional random field(CRF)for the deformation modulus.By incorporating data from field bearing plate tests,the posterior variance data for the deformation modulus are derived.This process facilitates the construction of a detailed and reliable CRF for the deformation modulus.展开更多
With the deep integration of smart manufacturing and IoT technologies,higher demands are placed on the intelligence and real-time performance of industrial equipment fault detection.For industrial fans,base bolt loose...With the deep integration of smart manufacturing and IoT technologies,higher demands are placed on the intelligence and real-time performance of industrial equipment fault detection.For industrial fans,base bolt loosening faults are difficult to identify through conventional spectrum analysis,and the extreme scarcity of fault data leads to limited training datasets,making traditional deep learning methods inaccurate in fault identification and incapable of detecting loosening severity.This paper employs Bayesian Learning by training on a small fault dataset collected from the actual operation of axial-flow fans in a factory to obtain posterior distribution.This method proposes specific data processing approaches and a configuration of Bayesian Convolutional Neural Network(BCNN).It can effectively improve the model’s generalization ability.Experimental results demonstrate high detection accuracy and alignment with real-world applications,offering practical significance and reference value for industrial fan bolt loosening detection under data-limited conditions.展开更多
Mountain communities in Nepal are increasingly exposed to climate-induced shifts in water availability,driven by glacial retreat,altered precipitation/snowmelt regimes,and declining groundwater sources.This study pres...Mountain communities in Nepal are increasingly exposed to climate-induced shifts in water availability,driven by glacial retreat,altered precipitation/snowmelt regimes,and declining groundwater sources.This study presents an integrated framework combining hydrological source analysis with socio-demographic survey data to evaluate seasonal water contributions and communitylevel water use patterns in the Upper Marsyangdi catchment,Manang District,Nepal.Isotopic(δ^(18)O)and geochemical(silica)tracers were used in a Bayesian mixing model to quantify the seasonal contributions of glacial melt,snow,rain,and groundwater to river flow.Findings indicate that groundwater dominates pre-monsoon flow(60%-70%)while post-monsoon discharge reflects more balanced inputs from all sources.In parallel,120 household surveys were analysed using Latent Class Analysis to characterise water use across domestic,agricultural,energy,and tourism sectors.Results reveal spatial and demographic gradients in water source dependency,including gender and occupation as important predictors of water use.Respondents reported perceived increases in spring flow,alongside reductions in the availability of snow for household and tourism use and deteriorating river water quality and quantity,particularly affecting hydropower operations.Adaptation strategies include increased reliance on water storage infrastructure and source switching.The study highlights the value of applying probabilistic methods to hydrological and sociocultural data to identify vulnerable populations and inform targeted,context-sensitive adaptation strategies.The proposed framework is transferable to other high-altitude regions,offering a robust approach for assessing climate resilience through the synthesis of scientific and local knowledge systems.展开更多
The detection of gravitational waves by the LIGO-Virgo-KAGRA collaboration has ushered in a new era of observational astronomy,emphasizing the need for rapid and detailed parameter estimation and population-level anal...The detection of gravitational waves by the LIGO-Virgo-KAGRA collaboration has ushered in a new era of observational astronomy,emphasizing the need for rapid and detailed parameter estimation and population-level analyses.Traditional Bayesian inference methods,particularly Markov chain Monte Carlo,face significant computational challenges when dealing with the high-dimensional parameter spaces and complex noise characteristics inherent in gravitational wave data.This review examines the emerging role of simulation-based inference methods in gravitational wave astronomy,with a focus on approaches that leverage machine-learning techniques such as normalizing flows and neural posterior estimation.We provide a comprehensive overview of the theoretical foundations underlying various simulation-based inference methods,including neural posterior estimation,neural ratio estimation,neural likelihood estimation,flow matching,and consistency models.We explore the applications of these methods across diverse gravitational wave data processing scenarios,from single-source parameter estimation and overlapping signal analysis to testing general relativity and conducting population studies.Although these techniques demonstrate speed improvements over traditional methods in controlled studies,their model-dependent nature and sensitivity to prior assumptions are barriers to their widespread adoption.Their accuracy,which is similar to that of conventional methods,requires further validation across broader parameter spaces and noise conditions.展开更多
Inverse design of advanced materials represents a pivotal challenge in materials science.Leveraging the latent space of Variational Autoencoders(VAEs)for material optimization has emerged as a significant advancement ...Inverse design of advanced materials represents a pivotal challenge in materials science.Leveraging the latent space of Variational Autoencoders(VAEs)for material optimization has emerged as a significant advancement in the field of material inverse design.However,VAEs are inherently prone to generating blurred images,posing challenges for precise inverse design and microstructure manufacturing.While increasing the dimensionality of the VAE latent space can mitigate reconstruction blurriness to some extent,it simultaneously imposes a substantial burden on target optimization due to an excessively high search space.To address these limitations,this study adopts a Variational Autoencoder guided Conditional Diffusion Generative Model(VAE-CDGM)framework integrated with Bayesian optimization to achieve the inverse design of composite materials with targeted mechanical properties.The VAE-CDGM model synergizes the strengths of VAEs and Denoising Diffusion Probabilistic Models(DDPM),enabling the generation of high-quality,sharp images while preserving a manipulable latent space.To accommodate varying dimensional requirements of the latent space,two optimization strategies are proposed.When the latent space dimensionality is excessively high,SHapley Additive exPlanations(SHAP)sensitivity analysis is employed to identify critical latent features for optimization within a reduced subspace.Conversely,direct optimization is performed in the low-dimensional latent space of VAE-CDGM when dimensionality is modest.The results demonstrate that both strategies accurately achieve the targeted design of composite materials while circumventing the blurred reconstruction flaws of VAEs,which offers a novel pathway for the precise design of advanced materials.展开更多
Rainfall input errors are a major source of uncertainty in flood forecasting,and merging multi-source precipitation data is essential for improving accuracy.Traditional merging methods often prioritize precipitation m...Rainfall input errors are a major source of uncertainty in flood forecasting,and merging multi-source precipitation data is essential for improving accuracy.Traditional merging methods often prioritize precipitation magnitude enhancements while overlooking event detection and false alarms.To address these limitations,this study developed a precipitation integration framework that combines machine learning classification-plus-regression models with Bayesian model averaging(BMA).Three machine learning algorithms-categorical boosting(CatBoost),light gradient boosting machine(LightGBM),and random forest(RF)-were used to improve precipitation event detection.The framework includes spatial unification of raw satellite products using bilinear interpolation,bias correction through classification-plus-regression models,and final merging via a seasonal-scale BMA model.The method integrated GSMaP,IMERG,and PERSIANN satellite precipitation products,with ground observations used for model training(2001-2014)and independent validation(2015-2020)in the Upper Ganjiang River Basin,China.Results showed that the framework significantly enhanced precipitation estimation accuracy and detection capability.LightGBM-based integration exhibited superior detection performance(FAR=0.08,CSI=0.86),while RF-based integration achieved the highest overall accuracy(RMSE=4.67,CC=0.92).Seasonal variations in BMA weights underscored the need to account for seasonal characteristics of precipitation products.Additionally,accuracy improvements were observed across all rainfall categories,especially for heavy rainstorms.The seasonal-scale BMA fusion has combined the strengths of individual corrections and further enhanced precipitation estimation.This research offers a robust method for generating accurate rainfall inputs,providing valuable support for hydrological modeling and flood forecasting applications.展开更多
Leveraging high-precision lattice QCD data on the equation of state and baryon number susceptibility at a vanishing chemical potential,we constructed a Bayesian holographic QCD model and systematically analyzed the th...Leveraging high-precision lattice QCD data on the equation of state and baryon number susceptibility at a vanishing chemical potential,we constructed a Bayesian holographic QCD model and systematically analyzed the thermodynamic properties of heavy quarkonium in QCD matter under varying temperatures and chemical potentials.We computed the quark-antiquark interquark distance,potential energy,entropy,binding energy,and internal energy.We present detailed posterior distribution results of the thermodynamic quantities of heavy quarkonium,including maximum a posteriori(MAP)value estimates and 95%confidence levels(CL).Through numerical simulations and theoretical analysis,we find that an increase in the temperature and chemical potential reduces the quark distance,thereby facilitating the dissociation of heavy quarkonium and leading to a suppressed potential energy.The increase in temperature and chemical potential also raises the entropy and entropy force,further accelerating the dissociation of heavy quarkonium.The calculated results of binding energy indicate that a higher temperature and chemical potential enhance the tendency of heavy quarkonium to dissociate into free quarks.The internal energy also increases with rising temperature and chemical potential.These findings provide significant theoretical insights into the properties of strongly interacting matter under extreme conditions and lay a solid foundation for the interpretation and validation of future experimental data.Finally,we also present the results for the free energy,entropy,and internal energy of a single quark.展开更多
Recommendation systems have become indispensable for providing tailored suggestions and capturing evolving user preferences based on interaction histories.The collaborative filtering(CF)model,which depends exclusively...Recommendation systems have become indispensable for providing tailored suggestions and capturing evolving user preferences based on interaction histories.The collaborative filtering(CF)model,which depends exclusively on user-item interactions,commonly encounters challenges,including the cold-start problem and an inability to effectively capture the sequential and temporal characteristics of user behavior.This paper introduces a personalized recommendation system that combines deep learning techniques with Bayesian Personalized Ranking(BPR)optimization to address these limitations.With the strong support of Long Short-Term Memory(LSTM)networks,we apply it to identify sequential dependencies of user behavior and then incorporate an attention mechanism to improve the prioritization of relevant items,thereby enhancing recommendations based on the hybrid feedback of the user and its interaction patterns.The proposed system is empirically evaluated using publicly available datasets from movie and music,and we evaluate the performance against standard recommendation models,including Popularity,BPR,ItemKNN,FPMC,LightGCN,GRU4Rec,NARM,SASRec,and BERT4Rec.The results demonstrate that our proposed framework consistently achieves high outcomes in terms of HitRate,NDCG,MRR,and Precision at K=100,with scores of(0.6763,0.1892,0.0796,0.0068)on MovieLens-100K,(0.6826,0.1920,0.0813,0.0068)on MovieLens-1M,and(0.7937,0.3701,0.2756,0.0078)on Last.fm.The results show an average improvement of around 15%across all metrics compared to existing sequence models,proving that our framework ranks and recommends items more accurately.展开更多
Background Multibreed genomic prediction(MBGP)is crucial for improving prediction accuracy for breeds with small populations,for which limited data are often available.Recent studies have demonstrated that partitionin...Background Multibreed genomic prediction(MBGP)is crucial for improving prediction accuracy for breeds with small populations,for which limited data are often available.Recent studies have demonstrated that partitioning the genome into nonoverlapping blocks to model heterogeneous genetic(co)variance in multitrait models can achieve higher joint prediction accuracy.However,the block partitioning method,a key factor influencing model performance,has not been extensively explored.Results We introduce mbBayesABLD,a novel Bayesian MBGP model that partitions each chromosome into nonoverlapping blocks on the basis of linkage disequilibrium(LD)patterns.In this model,marker effects within each block are assumed to follow normal distributions with block-specific parameters.We employ simulated data as well as empirical datasets from pigs and beans to assess genomic prediction accuracy across different models using cross-validation.The results demonstrate that mbBayesABLD significantly outperforms conventional MBGP models,such as GBLUP and BayesR.For the meat marbling score trait in pigs,compared with GBLUP,which does not account for heterogeneous genetic(co)variance,mbBayesABLD improves the prediction accuracy for the small-population breed Landrace by 15.6%.Furthermore,our findings indicate that a moderate level of similarity in LD patterns between breeds(with an average correlation of 0.6)is sufficient to improve the prediction accuracy of the target breed.Conclusions This study presents a novel LD block-based approach for multibreed genomic prediction.Our work provides a practical tool for livestock breeding programs and offers new insights into leveraging genetic diversity across breeds for improved genomic prediction.展开更多
A performance improvement model of research and development(R&D)institutions based on evolutionary game and Bayesian network is proposed.First,the nature and performance factors of new R&D institutions are sys...A performance improvement model of research and development(R&D)institutions based on evolutionary game and Bayesian network is proposed.First,the nature and performance factors of new R&D institutions are systematically analyzed,the appropriate factor model is found,and the sharing of performance benefits between institutions and employees,the change in distribution proportion,and the risk of institutional improvement and employee cooperation are considered.Second,based on the mechanism improvement and employee cooperation,the payment matrix is given and evolutionary game analysis is carried out to obtain a stable and balanced institutional improvement probability and employee cooperation probability.These two probability values are substituted into the Bayesian network model of performance improvement of new R&D institutions,and the posterior probability of performance improvement is predicted by Bayesian network reasoning and diagnosis to find effective improvement measures.Finally,practical case analysis is given to verify the effectiveness and practicability of the proposed method.展开更多
Research on neutron-induced fission product yields of^(232)Th is crucial for understanding the competition between symmetric and asymmetric fission in actinide nuclei.However,obtaining complete isotopic yield distribu...Research on neutron-induced fission product yields of^(232)Th is crucial for understanding the competition between symmetric and asymmetric fission in actinide nuclei.However,obtaining complete isotopic yield distributions over a wide range of neutron energies remains a challenge.In this study,a Bayesian neural network model was developed to predict the independent(IND)and cumulative fission yields of^(232)Th under neutron irradiation at various incident energies.To address the limited availability of experimental data for the analysis of IND mass distributions,we substituted mass-number-based yields with the yields of specific isotopes.Furthermore,physical phenomena or quantities,such as the odd-even effect and isospin,were introduced as constraints to enhance the physical consistency of the predictions.The impact of these constraints was evaluated using mass-chain yield distributions and their dependence on energy.Incorporating physical constraints significantly improves the prediction accuracy,yielding more reliable and physically meaningful fission yield data for nuclear physics and reactor design applications.展开更多
文摘Discrete Bayesian Dose-response Analysis under Dose Uncertainty.Eduard Hofer1(1.3 Constance Road,Claremont,Cape Town 7708,South Africa.)Abstract:Establishing a relationship between disease and dose requires each individual in the population under investigation to be known by disease status and by the value of the dose received.
基金supported by the National Natural Science Foundation of China under Grant U2442219Fengyun Satellite Application Pioneer Program(2023)Special Initiative on Numerical Weather Prediction(NWP)Applications,the Civil Aerospace Technology Pre-Research Project(D040405)the Joint Funds of the Zhejiang Provincial Natural Science Foundation of China under Grant No.LZJMZ23D050003。
文摘Accurate retrieval of atmospheric vertical profiles is critical for improving weather prediction and climate monitoring.However,the complexity of atmospheric processes in cloudy regions poses challenges compared to those of clear sky scenarios.This study presents a novel framework that integrates Bayesian optimization and machine learning approaches to retrieve atmospheric vertical profiles—including temperature,humidity,ozone concentration,cloud fraction,ice water content(IWC),and liquid water content(LWC)—from hyperspectral infrared observations.Specifically,a Bayesian method was used to refine ERA5 reanalysis data by minimizing brightness temperature(BT)discrepancies against FY-4B Geostationary Interferometric Infrared Sounder(GIIRS)observations,generating a high-quality profile database(~2.8 million profiles)across diverse weather systems.The optimized profiles improve radiative consistency,reducing BT biases from>40 K to<10 K in cloudy regions.To further overcome the limitations of the Bayesian method,we developed a Transformer-Resnet hybrid model(TERNet),which achieved superior performance with RMSE values of 1.61 K(temperature),5.77%(humidity),and 2.25×10^(–6)/6.09×10^(–6)kg kg^(–1)(IWC/LWC)across the entire vertical levels in all-sky conditions.The TERNet outperforms both ERA5 in cloud parameter retrieval and the GIIRS L2 product in thermodynamic profiling.Independent verification with radiosonde and Cloud-Aerosol Lidar and Infrared Pathfinder Satellite Observations(CALIPSO)datasets confirms the framework's reliability across various meteorological regimes.This work demonstrates the capability of combining physics-informed Bayesian methods with data-driven machine learning to fully exploit hyperspectral IR data.
基金supported by the National Natural Science Foundation of China[grant number 12001266]the Humanities and Social Science Projects ofMinistry of Education of China[grant number 19YJCZH166]supported by the National Natural Science Foundation of China[grant numbers 12271168 and 12531013].
文摘The integrated nested Laplace approximation(INLA)algorithm provides a computationally efficient approach for approximate Bayesian inference,overcoming the limitations of traditional Markov chain Monte Carlo(MCMC)methods.This paper reviews INLA algorithm and provides a systematic review of six key books that explore the theoretical foundations,practical implementations,and diverse applications of INLA.These six books cover spatial and spatio-temporal modelling,general Bayesian inference,SPDE-based spatial analysis,geospatial health data,regression modelling,and dynamic time series.In addition,these books highlight the versatility of INLA method in handling complex models while maintaining high computational efficiency.This paper begins with an introduction to the INLA method and algorithm,followed by a systematic review of six key publications in the field.
基金supported by the National Key R&D Program of China(Grant No.2023YFC3209504)Natural Science Foundation of Wuhan(Grant No.2024040801020271)the Fundamental Research Funds for Central Public Welfare Research Institutes(Grant No.CKSF2025718/YT).
文摘Wetting deformation in earth-rockfill dams is a critical factor influencingdam safety.Although numerous mathematical models have been developed to describe this phenomenon,most of them rely on empirical formulations and lack prior knowledge of model parameters,which is essential for Bayesian parameter inversion to enhance accuracy and reduce uncertainty.This study introduces a datadriven approach to establishing prior knowledge of earth-rockfill dams.Driving factors are utilized to determine the potential range of model parameters,and settlement changes within this range are calculated.The results are iteratively compared with actual monitoring data until the calculated range encompasses the observed data,thereby providing prior knowledge of the model parameters.The proposed method is applied to the right-bank earth-rockfilldam of Danjiangkou.Employing a Gibbs sample size of 30,000,the proposed method effectively calibrates the prior knowledge of the wetting model parameters,achieving a root mean square error(RMSE)of 5.18 mm for the settlement predictions.By comparison,the use of non-informative priors with sample sizes of 30,000 and 50,000 results in significantly larger RMSE values of 11.97 mm and 16.07 mm,respectively.Furthermore,the computational efficiencyof the proposed method is demonstrated by an inversion computation time of 902 s for 30,000 samples,which is notably shorter than the 1026 s and 1558 s required for noninformative priors with 30,000 and 50,000 samples,respectively.These findingsunderscore the superior performance of the proposed approach in terms of both prediction accuracy and computational efficiency.These results demonstrate that the proposed method not only improves the predictive accuracy but also enhances the computational efficiency,enabling optimal parameter identificationwith reduced computational effort.This approach provides a robust and efficientframework for advancing dam safety assessments.
基金supported by the Discovery Grants program of the Natural Sciences and Engineering Research Council of Canada(No.RGPIN-2021-03553)the Canadian Research Chair in dendroecology and dendroclimatology(CRC-2021-00368)+3 种基金the Ministère des Ressources Naturelles et des Forèts(MRNF,Contract no.142332177-D)the Natural Sciences and Engineering Research Council of Canada(Alliance Grant No.ALLRP 557148-20,obtained in partnership with the MRNF and Resolute Forest Products)the Fonds de recherche du Qu ebec–Nature et technologies(Partnership Research Program on the Contribution of the Forestry Sector to Climate Change MitigationGrant No.2022-0FC-309064)。
文摘Xylogenesis,the process through which wood cells are formed,results in the long-term storage of carbon in woody biomass,making it a key component of the global carbon cycle.Understanding how environmental drivers influence xylogenesis during the growing season is therefore of great interest.However,studying shortterm drivers of wood production using xylogenetic data is complicated by the usual sampling scheme and the influence of eccentric growth,i.e.,heterogeneous growth around the stem.In this study,we improve xylogenesis research by introducing a statistical approach that explicitly considers seasonal phenology,short-term growth rates,and growth eccentricity.To this end,we developed Bayesian models of xylogenesis and compared them with a conventional method based on the use of Gompertz functions.Our results show that eccentricity generated high temporal autocorrelation between successive samples,and that explicitly taking it into account improved both the representativeness of phenology and intra-ring variability.We observed consistent short-term patterns in the model residuals,suggesting the influence of an unaccounted-for environmental variable on cell production.The proposed models offer several advantages over traditional methods,including robust confidence intervals around predictions,consistency with phenology,and reduced sensitivity to extreme observations at the end of the growing season,often linked to eccentric growth.These models also provide a benchmark for mechanistic testing of short-term drivers of wood formation.
基金supported by Istanbul Technical University(Project No.45698)supported through the“Young Researchers’Career Development Project-training of doctoral students”of the Croatian Science Foundation.
文摘This paper investigates the reliability of internal marine combustion engines using an integrated approach that combines Fault Tree Analysis(FTA)and Bayesian Networks(BN).FTA provides a structured,top-down method for identifying critical failure modes and their root causes,while BN introduces flexibility in probabilistic reasoning,enabling dynamic updates based on new evidence.This dual methodology overcomes the limitations of static FTA models,offering a comprehensive framework for system reliability analysis.Critical failures,including External Leakage(ELU),Failure to Start(FTS),and Overheating(OHE),were identified as key risks.By incorporating redundancy into high-risk components such as pumps and batteries,the likelihood of these failures was significantly reduced.For instance,redundant pumps reduced the probability of ELU by 31.88%,while additional batteries decreased the occurrence of FTS by 36.45%.The results underscore the practical benefits of combining FTA and BN for enhancing system reliability,particularly in maritime applications where operational safety and efficiency are critical.This research provides valuable insights for maintenance planning and highlights the importance of redundancy in critical systems,especially as the industry transitions toward more autonomous vessels.
基金supported by the National Natural Science Foundation of China(Nos.52404387,52090041,52374379,52425409)Xiaomi Young Scholars Program China,the National Postdoctoral Program for Innovative Talents,China(No.BX20230042)China Postdoctoral Science Foundation(No.2024M750174)。
文摘To address the zero-sample challenge in preparation parameter design for newly developed alloys,a novel machine learning strategy that integrates basic dataset construction with Bayesian optimization,was proposed.The impact of basic sample dataset construction methods,optimization benchmarks and multi-objective utility functions on Bayesian optimization was investigated.It was found that the combination of orthogonal design,linear benchmark,and shifted multiplicative utility function exhibits the best optimization performance.The strategy was then applied to a new Cu-Ni-Co-Si alloy with ultra-low Co content(0.7 wt.%Co),previously designed by our research team.Rapid optimization of six preparation parameters in the two-stage deformation and aging process of the zero-sample alloy was achieved through only 23 experiments.The measured ultimate tensile strength and electrical conductivity of the new alloy were 878 MPa and 44.0%(IACS),respectively,reaching the comprehensive performance level of the Cu-Ni-Co-Si alloy(C70350 alloy)containing 1.0-2.0 wt.%Co.
基金supported by the State Grid Southwest Branch Project“Research on Defect Diagnosis and Early Warning Technology of Relay Protection and Safety Automation Devices Based on Multi-Source Heterogeneous Defect Data”.
文摘The reliable operation of power grid secondary equipment is an important guarantee for the safety and stability of the power system.However,various defects could be produced in the secondary equipment during longtermoperation.The complex relationship between the defect phenomenon andmulti-layer causes and the probabilistic influence of secondary equipment cannot be described through knowledge extraction and fusion technology by existing methods,which limits the real-time and accuracy of defect identification.Therefore,a defect recognition method based on the Bayesian network and knowledge graph fusion is proposed.The defect data of secondary equipment is transformed into the structured knowledge graph through knowledge extraction and fusion technology.The knowledge graph of power grid secondary equipment is mapped to the Bayesian network framework,combined with historical defect data,and introduced Noisy-OR nodes.The prior and conditional probabilities of the Bayesian network are then reasonably assigned to build a model that reflects the probability dependence between defect phenomena and potential causes in power grid secondary equipment.Defect identification of power grid secondary equipment is achieved by defect subgraph search based on the knowledge graph,and defect inference based on the Bayesian network.Practical application cases prove this method’s effectiveness in identifying secondary equipment defect causes,improving identification accuracy and efficiency.
基金supported by the Doctoral Research Funds for Nanchang HangKong University,China(Grant No.EA202411211)support is gratefully acknowledged.
文摘This paper introduces a framework for modeling random fields,with a particular emphasis on analyzing anisotropic spatial variability.It establishes a clear connection between the correlation function and the Kriging variogram across various anisotropic modes,providing mathematical models to enhance our understanding of random fields.A new anisotropy index,called LSAI,is introduced to quantify anisotropy based on the autocorrelation length and the orientation of the principal axes within the variogram.An LSAI value closer to one indicates a lower degree of anisotropy.The present study examines how the degree of anisotropy varies with different autocorrelation lengths and angles between the principal axes,providing valuable insights into these relationships.To improve the accuracy of parameter probability distribution estimations,this study integrates limited field test data using a Bayesian inference approach.Additionally,the Markov chain Monte Carlo simulation method is employed to develop a conditional random field(CRF)for the deformation modulus.By incorporating data from field bearing plate tests,the posterior variance data for the deformation modulus are derived.This process facilitates the construction of a detailed and reliable CRF for the deformation modulus.
基金funded by the Zhejiang Provincial Key Science and Technology“LingYan”Project Foundation,grant number 2023C01145Zhejiang Gongshang University Higher Education Research Projects,grant number Xgy22028.
文摘With the deep integration of smart manufacturing and IoT technologies,higher demands are placed on the intelligence and real-time performance of industrial equipment fault detection.For industrial fans,base bolt loosening faults are difficult to identify through conventional spectrum analysis,and the extreme scarcity of fault data leads to limited training datasets,making traditional deep learning methods inaccurate in fault identification and incapable of detecting loosening severity.This paper employs Bayesian Learning by training on a small fault dataset collected from the actual operation of axial-flow fans in a factory to obtain posterior distribution.This method proposes specific data processing approaches and a configuration of Bayesian Convolutional Neural Network(BCNN).It can effectively improve the model’s generalization ability.Experimental results demonstrate high detection accuracy and alignment with real-world applications,offering practical significance and reference value for industrial fan bolt loosening detection under data-limited conditions.
基金funded by the Natural Environment Research Council’s Global Challenges Research Fund(NE/P016146/1)。
文摘Mountain communities in Nepal are increasingly exposed to climate-induced shifts in water availability,driven by glacial retreat,altered precipitation/snowmelt regimes,and declining groundwater sources.This study presents an integrated framework combining hydrological source analysis with socio-demographic survey data to evaluate seasonal water contributions and communitylevel water use patterns in the Upper Marsyangdi catchment,Manang District,Nepal.Isotopic(δ^(18)O)and geochemical(silica)tracers were used in a Bayesian mixing model to quantify the seasonal contributions of glacial melt,snow,rain,and groundwater to river flow.Findings indicate that groundwater dominates pre-monsoon flow(60%-70%)while post-monsoon discharge reflects more balanced inputs from all sources.In parallel,120 household surveys were analysed using Latent Class Analysis to characterise water use across domestic,agricultural,energy,and tourism sectors.Results reveal spatial and demographic gradients in water source dependency,including gender and occupation as important predictors of water use.Respondents reported perceived increases in spring flow,alongside reductions in the availability of snow for household and tourism use and deteriorating river water quality and quantity,particularly affecting hydropower operations.Adaptation strategies include increased reliance on water storage infrastructure and source switching.The study highlights the value of applying probabilistic methods to hydrological and sociocultural data to identify vulnerable populations and inform targeted,context-sensitive adaptation strategies.The proposed framework is transferable to other high-altitude regions,offering a robust approach for assessing climate resilience through the synthesis of scientific and local knowledge systems.
基金supported by the National Key Research and Development Program of China(2021YFC2203004)the National Natural Science Foundation of China(NSFC)(12405076,12247187,and 12147103)+1 种基金the National Astronomical Data Center(NADC2023YDS-01)the Fundamental Research Funds for the Central Universities.
文摘The detection of gravitational waves by the LIGO-Virgo-KAGRA collaboration has ushered in a new era of observational astronomy,emphasizing the need for rapid and detailed parameter estimation and population-level analyses.Traditional Bayesian inference methods,particularly Markov chain Monte Carlo,face significant computational challenges when dealing with the high-dimensional parameter spaces and complex noise characteristics inherent in gravitational wave data.This review examines the emerging role of simulation-based inference methods in gravitational wave astronomy,with a focus on approaches that leverage machine-learning techniques such as normalizing flows and neural posterior estimation.We provide a comprehensive overview of the theoretical foundations underlying various simulation-based inference methods,including neural posterior estimation,neural ratio estimation,neural likelihood estimation,flow matching,and consistency models.We explore the applications of these methods across diverse gravitational wave data processing scenarios,from single-source parameter estimation and overlapping signal analysis to testing general relativity and conducting population studies.Although these techniques demonstrate speed improvements over traditional methods in controlled studies,their model-dependent nature and sensitivity to prior assumptions are barriers to their widespread adoption.Their accuracy,which is similar to that of conventional methods,requires further validation across broader parameter spaces and noise conditions.
文摘Inverse design of advanced materials represents a pivotal challenge in materials science.Leveraging the latent space of Variational Autoencoders(VAEs)for material optimization has emerged as a significant advancement in the field of material inverse design.However,VAEs are inherently prone to generating blurred images,posing challenges for precise inverse design and microstructure manufacturing.While increasing the dimensionality of the VAE latent space can mitigate reconstruction blurriness to some extent,it simultaneously imposes a substantial burden on target optimization due to an excessively high search space.To address these limitations,this study adopts a Variational Autoencoder guided Conditional Diffusion Generative Model(VAE-CDGM)framework integrated with Bayesian optimization to achieve the inverse design of composite materials with targeted mechanical properties.The VAE-CDGM model synergizes the strengths of VAEs and Denoising Diffusion Probabilistic Models(DDPM),enabling the generation of high-quality,sharp images while preserving a manipulable latent space.To accommodate varying dimensional requirements of the latent space,two optimization strategies are proposed.When the latent space dimensionality is excessively high,SHapley Additive exPlanations(SHAP)sensitivity analysis is employed to identify critical latent features for optimization within a reduced subspace.Conversely,direct optimization is performed in the low-dimensional latent space of VAE-CDGM when dimensionality is modest.The results demonstrate that both strategies accurately achieve the targeted design of composite materials while circumventing the blurred reconstruction flaws of VAEs,which offers a novel pathway for the precise design of advanced materials.
基金supported by the National Natural Science Foundation of China(42471049).
文摘Rainfall input errors are a major source of uncertainty in flood forecasting,and merging multi-source precipitation data is essential for improving accuracy.Traditional merging methods often prioritize precipitation magnitude enhancements while overlooking event detection and false alarms.To address these limitations,this study developed a precipitation integration framework that combines machine learning classification-plus-regression models with Bayesian model averaging(BMA).Three machine learning algorithms-categorical boosting(CatBoost),light gradient boosting machine(LightGBM),and random forest(RF)-were used to improve precipitation event detection.The framework includes spatial unification of raw satellite products using bilinear interpolation,bias correction through classification-plus-regression models,and final merging via a seasonal-scale BMA model.The method integrated GSMaP,IMERG,and PERSIANN satellite precipitation products,with ground observations used for model training(2001-2014)and independent validation(2015-2020)in the Upper Ganjiang River Basin,China.Results showed that the framework significantly enhanced precipitation estimation accuracy and detection capability.LightGBM-based integration exhibited superior detection performance(FAR=0.08,CSI=0.86),while RF-based integration achieved the highest overall accuracy(RMSE=4.67,CC=0.92).Seasonal variations in BMA weights underscored the need to account for seasonal characteristics of precipitation products.Additionally,accuracy improvements were observed across all rainfall categories,especially for heavy rainstorms.The seasonal-scale BMA fusion has combined the strengths of individual corrections and further enhanced precipitation estimation.This research offers a robust method for generating accurate rainfall inputs,providing valuable support for hydrological modeling and flood forecasting applications.
基金supported in part by the National Key Research and Development Program of China(No.2022YFA1604900)the National Natural Science Foundation of China(NSFC)(Nos.12405154,12235016,12221005,12435009,12275104,92570117)+7 种基金the Strategic Priority Research Program of Chinese Academy of Sciences(No.XDB34030000)the Fundamental Research Funds for the Central UniversitiesOpen fund for Key Laboratories of the Ministry of Education(No.QLPL2024P01)CUHK-Shenzhen University Development Fund(Nos.UDF01003041 and UDF03003041)Shenzhen Peacock Fund(No.2023TC0007)Ministry of Science and Technology of China(No.2024YFA1611004)the European Union–Next Generation EU through the research(No.P2022Z4P4B)“SOPHYA-Sustainable Optimized PHYsics Algorithms:fundamental physics to build an advanced society”under the program PRIN 2022 PNRR of the Italian Ministero dell’Universitàe Ricerca(MUR)。
文摘Leveraging high-precision lattice QCD data on the equation of state and baryon number susceptibility at a vanishing chemical potential,we constructed a Bayesian holographic QCD model and systematically analyzed the thermodynamic properties of heavy quarkonium in QCD matter under varying temperatures and chemical potentials.We computed the quark-antiquark interquark distance,potential energy,entropy,binding energy,and internal energy.We present detailed posterior distribution results of the thermodynamic quantities of heavy quarkonium,including maximum a posteriori(MAP)value estimates and 95%confidence levels(CL).Through numerical simulations and theoretical analysis,we find that an increase in the temperature and chemical potential reduces the quark distance,thereby facilitating the dissociation of heavy quarkonium and leading to a suppressed potential energy.The increase in temperature and chemical potential also raises the entropy and entropy force,further accelerating the dissociation of heavy quarkonium.The calculated results of binding energy indicate that a higher temperature and chemical potential enhance the tendency of heavy quarkonium to dissociate into free quarks.The internal energy also increases with rising temperature and chemical potential.These findings provide significant theoretical insights into the properties of strongly interacting matter under extreme conditions and lay a solid foundation for the interpretation and validation of future experimental data.Finally,we also present the results for the free energy,entropy,and internal energy of a single quark.
基金funded by Soonchunhyang University,Grant Number 20250029。
文摘Recommendation systems have become indispensable for providing tailored suggestions and capturing evolving user preferences based on interaction histories.The collaborative filtering(CF)model,which depends exclusively on user-item interactions,commonly encounters challenges,including the cold-start problem and an inability to effectively capture the sequential and temporal characteristics of user behavior.This paper introduces a personalized recommendation system that combines deep learning techniques with Bayesian Personalized Ranking(BPR)optimization to address these limitations.With the strong support of Long Short-Term Memory(LSTM)networks,we apply it to identify sequential dependencies of user behavior and then incorporate an attention mechanism to improve the prioritization of relevant items,thereby enhancing recommendations based on the hybrid feedback of the user and its interaction patterns.The proposed system is empirically evaluated using publicly available datasets from movie and music,and we evaluate the performance against standard recommendation models,including Popularity,BPR,ItemKNN,FPMC,LightGCN,GRU4Rec,NARM,SASRec,and BERT4Rec.The results demonstrate that our proposed framework consistently achieves high outcomes in terms of HitRate,NDCG,MRR,and Precision at K=100,with scores of(0.6763,0.1892,0.0796,0.0068)on MovieLens-100K,(0.6826,0.1920,0.0813,0.0068)on MovieLens-1M,and(0.7937,0.3701,0.2756,0.0078)on Last.fm.The results show an average improvement of around 15%across all metrics compared to existing sequence models,proving that our framework ranks and recommends items more accurately.
基金supported by the Biological Breeding-Major Projects in National Science and Technology(No.2023ZD0404405)the Earmarked Fund for China Agriculture Research System(No.CARS-pig-35)+2 种基金the National Natural Science Foundation of China(No.3227284,32302708)the 2115 Talent Development Program of China Agricultural University,the Chinese Universities Scientific Fund(No.2023TC196)the Seed Industry Revitalization Action Project of Guangdong Province(No.2024-XPY-06-001)。
文摘Background Multibreed genomic prediction(MBGP)is crucial for improving prediction accuracy for breeds with small populations,for which limited data are often available.Recent studies have demonstrated that partitioning the genome into nonoverlapping blocks to model heterogeneous genetic(co)variance in multitrait models can achieve higher joint prediction accuracy.However,the block partitioning method,a key factor influencing model performance,has not been extensively explored.Results We introduce mbBayesABLD,a novel Bayesian MBGP model that partitions each chromosome into nonoverlapping blocks on the basis of linkage disequilibrium(LD)patterns.In this model,marker effects within each block are assumed to follow normal distributions with block-specific parameters.We employ simulated data as well as empirical datasets from pigs and beans to assess genomic prediction accuracy across different models using cross-validation.The results demonstrate that mbBayesABLD significantly outperforms conventional MBGP models,such as GBLUP and BayesR.For the meat marbling score trait in pigs,compared with GBLUP,which does not account for heterogeneous genetic(co)variance,mbBayesABLD improves the prediction accuracy for the small-population breed Landrace by 15.6%.Furthermore,our findings indicate that a moderate level of similarity in LD patterns between breeds(with an average correlation of 0.6)is sufficient to improve the prediction accuracy of the target breed.Conclusions This study presents a novel LD block-based approach for multibreed genomic prediction.Our work provides a practical tool for livestock breeding programs and offers new insights into leveraging genetic diversity across breeds for improved genomic prediction.
基金supported by the National Natural Science Foundation of China(72071106)Jiangsu Provincial Social Science Fund(23EYA001)+1 种基金Jiangsu Provincial Education Science Planning Fund(Ba/2024/08)Jiangsu Higher Education Association Fund(24FYHLX090)。
文摘A performance improvement model of research and development(R&D)institutions based on evolutionary game and Bayesian network is proposed.First,the nature and performance factors of new R&D institutions are systematically analyzed,the appropriate factor model is found,and the sharing of performance benefits between institutions and employees,the change in distribution proportion,and the risk of institutional improvement and employee cooperation are considered.Second,based on the mechanism improvement and employee cooperation,the payment matrix is given and evolutionary game analysis is carried out to obtain a stable and balanced institutional improvement probability and employee cooperation probability.These two probability values are substituted into the Bayesian network model of performance improvement of new R&D institutions,and the posterior probability of performance improvement is predicted by Bayesian network reasoning and diagnosis to find effective improvement measures.Finally,practical case analysis is given to verify the effectiveness and practicability of the proposed method.
基金supported by the National Natural Science Foundation of China(Nos.12247126 and 12375123)Henan Postdoctoral Foundation(No.HN2024013)the Natural Science Foundation of Henan Province(No.242300421048)。
文摘Research on neutron-induced fission product yields of^(232)Th is crucial for understanding the competition between symmetric and asymmetric fission in actinide nuclei.However,obtaining complete isotopic yield distributions over a wide range of neutron energies remains a challenge.In this study,a Bayesian neural network model was developed to predict the independent(IND)and cumulative fission yields of^(232)Th under neutron irradiation at various incident energies.To address the limited availability of experimental data for the analysis of IND mass distributions,we substituted mass-number-based yields with the yields of specific isotopes.Furthermore,physical phenomena or quantities,such as the odd-even effect and isospin,were introduced as constraints to enhance the physical consistency of the predictions.The impact of these constraints was evaluated using mass-chain yield distributions and their dependence on energy.Incorporating physical constraints significantly improves the prediction accuracy,yielding more reliable and physically meaningful fission yield data for nuclear physics and reactor design applications.