The subset threshold auto regressive (SSTAR) model, which is capable of reproducing the limit cycle behavior of nonlinear time series, is introduced. The algorithm for fitting the sampled data with SSTAR model is pr...The subset threshold auto regressive (SSTAR) model, which is capable of reproducing the limit cycle behavior of nonlinear time series, is introduced. The algorithm for fitting the sampled data with SSTAR model is proposed and applied to model and forecast power load. Numerical example verifies that desirable accuracy of short term load forecasting can be achieved by using the SSTAR model.展开更多
Firstly,based on the data of air quality and the meteorological data in Baoding City from 2017 to 2021,the correlations of meteorological elements and pollutants with O_(3)concentration were explored to determine the ...Firstly,based on the data of air quality and the meteorological data in Baoding City from 2017 to 2021,the correlations of meteorological elements and pollutants with O_(3)concentration were explored to determine the forecast factors of forecast models.Secondly,the O_(3)-8h concentration in Baoding City in 2021 was predicted based on the constructed models of multiple linear regression(MLR),backward propagation neural network(BPNN),and auto regressive integrated moving average(ARIMA),and the predicted values were compared with the observed values to test their prediction effects.The results show that overall,the MLR,BPNN and ARIMA models were able to forecast the changing trend of O_(3)-8h concentration in Baoding in 2021,but the BPNN model gave better forecast results than the ARIMA and MLR models,especially for the prediction of the high values of O_(3)-8h concentration,and the correlation coefficients between the predicted values and the observed values were all higher than 0.9 during June-September.The mean error(ME),mean absolute error(MAE),and root mean square error(RMSE)of the predicted values and the observed values of daily O_(3)-8h concentration based on the BPNN model were 0.45,19.11 and 24.41μg/m 3,respectively,which were significantly better than those of the MLR and ARIMA models.The prediction effects of the MLR,BPNN and ARIMA models were the best at the pollution level,followed by the excellent level,and it was the worst at the good level.In comparison,the prediction effect of BPNN model was better than that of the MLR and ARIMA models as a whole,especially for the pollution and excellent levels.The TS scores of the BPNN model were all above 66%,and the PC values were above 86%.The BPNN model can forecast the changing trend of O_(3)concentration more accurately,and has a good practical application value,but at the same time,the predicted high values of O_(3)concentration should be appropriately increased according to error characteristics of the model.展开更多
The results of mass appraisal in many countries are used as a basis for calculating the amount of real estate tax,therefore,regardless of the methods used to calculate it,the resulting value should be as close as poss...The results of mass appraisal in many countries are used as a basis for calculating the amount of real estate tax,therefore,regardless of the methods used to calculate it,the resulting value should be as close as possible to the market value of the real estate to maintain a balance of interests between the state and the rights holders.In practice,this condition is not always met,since,firstly,the quality of market data is often very low,and secondly,some markets are characterized by low activity,which is expressed in a deficit of information on asking prices.The aim of the work is ecological valuation of land use:how regression-based mass appraisal can inform ecological conservation,land degradation,and sustainable land management.Four multiple regression models were constructed for AI generated map of land plots for recreational use in St.Petersburg(Russia)with different volumes of market information(32,30,20 and 15 units of market information with four price-forming factors).During the analysis of the quality of the models,it was revealed that the best result is shown by the model built on the maximum sample size,then the model based on 15 analogs,which proves that a larger number of analog objects does not always allow us to achieve better results,since the more analog objects there are.展开更多
The work proposes a distributed Kalman filtering(KF)algorithm to track a time-varying unknown signal process for a stochastic regression model over network systems in a cooperative way.We provide the stability analysi...The work proposes a distributed Kalman filtering(KF)algorithm to track a time-varying unknown signal process for a stochastic regression model over network systems in a cooperative way.We provide the stability analysis of the proposed distributed KF algorithm without independent and stationary signal assumptions,which implies that the theoretical results are able to be applied to stochastic feedback systems.Note that the main difficulty of stability analysis lies in analyzing the properties of the product of non-independent and non-stationary random matrices involved in the error equation.We employ analysis techniques such as stochastic Lyapunov function,stability theory of stochastic systems,and algebraic graph theory to deal with the above issue.The stochastic spatio-temporal cooperative information condition shows the cooperative property of multiple sensors that even though any local sensor cannot track the time-varying unknown signal,the distributed KF algorithm can be utilized to finish the filtering task in a cooperative way.At last,we illustrate the property of the proposed distributed KF algorithm by a simulation example.展开更多
Moistube irrigation is a new micro-irrigation technology.Accurately estimating its wetting pattern dimensions presents a challenge.Therefore,it is necessary to develop models for efficient assessment of the wetting tr...Moistube irrigation is a new micro-irrigation technology.Accurately estimating its wetting pattern dimensions presents a challenge.Therefore,it is necessary to develop models for efficient assessment of the wetting transport pattern in order to design a cost-effective moistube irrigation system.To achieve this goal,this study developed a multivariate nonlinear regression model and compared it with a dimensional model.HYDRUS-2D was used to perform numerical simulations of 56 irrigation scenarios with different factors.The experiments showed that the shape of the wetting soil body approximated a cylinder and was mainly affected by soil texture,pressure head,and matric potential.A multivariate nonlinear model using a power function relationship between wetting size and irrigation time was developed,with a determination coefficient greater than 0.99.The model was validated for cases with six soil texture types,with mean average absolute errors of 0.43-0.90 cm,root mean square errors of 0.51-0.95 cm,and mean deviation percentage values of 3.23%-6.27%.The multivariate nonlinear regression model outperformed the dimensional model.It can therefore provide a scientific foundation for the development of moistube irrigation systems.展开更多
BACKGROUND Congenital heart disease is most commonly seen in neonates and it is a major cause of pediatric illness and childhood morbidity and mortality.AIM To identify and build the best predictive model for predicti...BACKGROUND Congenital heart disease is most commonly seen in neonates and it is a major cause of pediatric illness and childhood morbidity and mortality.AIM To identify and build the best predictive model for predicting cyanotic and acyanotic congenital heart disease in children during pregnancy and identify their potential risk factors.METHODS The data were collected from the Pediatric Cardiology Department at Chaudhry Pervaiz Elahi Institute of Cardiology Multan,Pakistan from December 2017 to October 2019.A sample of 3900 mothers whose children were diagnosed with identify the potential outliers.Different machine learning models were compared,and the best-fitted model was selected using the area under the curve,sensitivity,and specificity of the models.RESULTS Out of 3900 patients included,about 69.5%had acyanotic and 30.5%had cyanotic congenital heart disease.Males had more cases of acyanotic(53.6%)and cyanotic(54.5%)congenital heart disease as compared to females.The odds of having cyanotic was 1.28 times higher for children whose mothers used more fast food frequently during pregnancy.The artificial neural network model was selected as the best predictive model with an area under the curve of 0.9012,sensitivity of 65.76%,and specificity of 97.23%.CONCLUSION Children having a positive family history are at very high risk of having cyanotic and acyanotic congenital heart disease.Males are more at risk and their mothers need more care,good food,and physical activity during pregnancy.The best-fitted model for predicting cyanotic and acyanotic congenital heart disease is the artificial neural network.The results obtained and the best model identified will be useful for medical practitioners and public health scientists for an informed decision-making process about the earlier diagnosis and improve the health condition of children in Pakistan.展开更多
To accurately model flows with shock waves using staggered-grid Lagrangian hydrodynamics, the artificial viscosity has to be introduced to convert kinetic energy into internal energy, thereby increasing the entropy ac...To accurately model flows with shock waves using staggered-grid Lagrangian hydrodynamics, the artificial viscosity has to be introduced to convert kinetic energy into internal energy, thereby increasing the entropy across shocks. Determining the appropriate strength of the artificial viscosity is an art and strongly depends on the particular problem and experience of the researcher. The objective of this study is to pose the problem of finding the appropriate strength of the artificial viscosity as an optimization problem and solve this problem using machine learning (ML) tools, specifically using surrogate models based on Gaussian Process regression (GPR) and Bayesian analysis. We describe the optimization method and discuss various practical details of its implementation. The shock-containing problems for which we apply this method all have been implemented in the LANL code FLAG (Burton in Connectivity structures and differencing techniques for staggered-grid free-Lagrange hydrodynamics, Tech. Rep. UCRL-JC-110555, Lawrence Livermore National Laboratory, Livermore, CA, 1992, 1992, in Consistent finite-volume discretization of hydrodynamic conservation laws for unstructured grids, Tech. Rep. CRL-JC-118788, Lawrence Livermore National Laboratory, Livermore, CA, 1992, 1994, Multidimensional discretization of conservation laws for unstructured polyhedral grids, Tech. Rep. UCRL-JC-118306, Lawrence Livermore National Laboratory, Livermore, CA, 1992, 1994, in FLAG, a multi-dimensional, multiple mesh, adaptive free-Lagrange, hydrodynamics code. In: NECDC, 1992). First, we apply ML to find optimal values to isolated shock problems of different strengths. Second, we apply ML to optimize the viscosity for a one-dimensional (1D) propagating detonation problem based on Zel’dovich-von Neumann-Doring (ZND) (Fickett and Davis in Detonation: theory and experiment. Dover books on physics. Dover Publications, Mineola, 2000) detonation theory using a reactive burn model. We compare results for default (currently used values in FLAG) and optimized values of the artificial viscosity for these problems demonstrating the potential for significant improvement in the accuracy of computations.展开更多
As maritime activities increase globally,there is a greater dependency on technology in monitoring,control,and surveillance of vessel activity.One of the most prominent systems for monitoring vessel activity is the Au...As maritime activities increase globally,there is a greater dependency on technology in monitoring,control,and surveillance of vessel activity.One of the most prominent systems for monitoring vessel activity is the Automatic Identification System(AIS).An increase in both vessels fitted with AIS transponders and satellite and terrestrial AIS receivers has resulted in a significant increase in AIS messages received globally.This resultant rich spatial and temporal data source related to vessel activity provides analysts with the ability to perform enhanced vessel movement analytics,of which a pertinent example is the improvement of vessel location predictions.In this paper,we propose a novel strategy for predicting future locations of vessels making use of historic AIS data.The proposed method uses a Linear Regression Model(LRM)and utilizes historic AIS movement data in the form of a-priori generated spatial maps of the course over ground(LRMAC).The LRMAC is an accurate low complexity first-order method that is easy to implement operationally and shows promising results in areas where there is a consistency in the directionality of historic vessel movement.In areas where the historic directionality of vessel movement is diverse,such as areas close to harbors and ports,the LRMAC defaults to the LRM.The proposed LRMAC method is compared to the Single-Point Neighbor Search(SPNS),which is also a first-order method and has a similar level of computational complexity,and for the use case of predicting tanker and cargo vessel trajectories up to 8 hours into the future,the LRMAC showed improved results both in terms of prediction accuracy and execution time.展开更多
Carbon emissions have become a critical concern in the global effort to combat climate change,with each country or region contributing differently based on its economic structures,energy sources,and industrial activit...Carbon emissions have become a critical concern in the global effort to combat climate change,with each country or region contributing differently based on its economic structures,energy sources,and industrial activities.The factors influencing carbon emissions vary across countries and sectors.This study examined the factors influencing CO_(2)emissions in the 7 South American countries including Argentina,Brazil,Chile,Colombia,Ecuador,Peru,and Venezuela.We used the Seemingly Unrelated Regression(SUR)model to analyse the relationship of CO_(2)emissions with gross domestic product(GDP),renewable energy use,urbanization,industrialization,international tourism,agricultural productivity,and forest area based on data from 2000 to 2022.According to the SUR model,we found that GDP and industrialization had a moderate positive effect on CO_(2)emissions,whereas renewable energy use had a moderate negative effect on CO_(2)emissions.International tourism generally had a positive impact on CO_(2)emissions,while forest area tended to decrease CO_(2)emissions.Different variables had different effects on CO_(2)emissions in the 7 South American countries.In Argentina and Venezuela,GDP,international tourism,and agricultural productivity significantly affected CO_(2)emissions.In Colombia,GDP and international tourism had a negative impact on CO_(2)emissions.In Brazil,CO_(2)emissions were primarily driven by GDP,while in Chile,Ecuador,and Peru,international tourism had a negative effect on CO_(2)emissions.Overall,this study highlights the importance of country-specific strategies for reducing CO_(2)emissions and emphasizes the varying roles of these driving factors in shaping environmental quality in the 7 South American countries.展开更多
This study aims to analyze and predict the relationship between the average price per box in the cigarette market of City A and government procurement,providing a scientific basis and support for decision-making.By re...This study aims to analyze and predict the relationship between the average price per box in the cigarette market of City A and government procurement,providing a scientific basis and support for decision-making.By reviewing relevant theories and literature,qualitative prediction methods,regression prediction models,and other related theories were explored.Through the analysis of annual cigarette sales data and government procurement data in City A,a comprehensive understanding of the development of the tobacco industry and the economic trends of tobacco companies in the county was obtained.By predicting and analyzing the average price per box of cigarette sales across different years,corresponding prediction results were derived and compared with actual sales data.The prediction results indicate that the correlation coefficient between the average price per box of cigarette sales and government procurement is 0.982,implying that government procurement accounts for 96.4%of the changes in the average price per box of cigarettes.These findings offer an in-depth exploration of the relationship between the average price per box of cigarettes in City A and government procurement,providing a scientific foundation for corporate decision-making and market operations.展开更多
Municipal solid waste generation is strongly linked to rising human population and expanding urban areas, with significant implications on urban metabolism as well as space and place values redefinition. Effective man...Municipal solid waste generation is strongly linked to rising human population and expanding urban areas, with significant implications on urban metabolism as well as space and place values redefinition. Effective management performance of municipal solid waste management underscores the interdisciplinarity strategies. Such knowledge and skills are paramount to uncover the sources of waste generation as well as means of waste storage, collection, recycling, transportation, handling/treatment, disposal, and monitoring. This study was conducted in Dar es Salaam city. Driven by the curiosity model of the solid waste minimization performance at source, study data was collected using focus group discussion techniques to ward-level local government officers, which was triangulated with literature and documentary review. The main themes of the FGD were situational factors (SFA) and local government by-laws (LGBY). In the FGD session, sub-themes of SFA tricked to understand how MSW minimization is related to the presence and effect of services such as land use planning, availability of landfills, solid waste transfer stations, material recovery facilities, incinerators, solid waste collection bins, solid waste trucks, solid waste management budget and solid waste collection agents. Similarly, FGD on LGBY was extended by sub-themes such as contents of the by-law, community awareness of the by-law, and by-law enforcement mechanisms. While data preparation applied an analytical hierarchy process, data analysis applied an ordinary least square (OLS) regression model for sub-criteria that explain SFA and LGBY;and OLS standard residues as variables into geographically weighted regression with a resolution of 241 × 241 meter in ArcMap v10.5. Results showed that situational factors and local government by-laws have a strong relationship with the rate of minimizing solid waste dumping in water bodies (local R square = 0.94).展开更多
Compositional data, such as relative information, is a crucial aspect of machine learning and other related fields. It is typically recorded as closed data or sums to a constant, like 100%. The statistical linear mode...Compositional data, such as relative information, is a crucial aspect of machine learning and other related fields. It is typically recorded as closed data or sums to a constant, like 100%. The statistical linear model is the most used technique for identifying hidden relationships between underlying random variables of interest. However, data quality is a significant challenge in machine learning, especially when missing data is present. The linear regression model is a commonly used statistical modeling technique used in various applications to find relationships between variables of interest. When estimating linear regression parameters which are useful for things like future prediction and partial effects analysis of independent variables, maximum likelihood estimation (MLE) is the method of choice. However, many datasets contain missing observations, which can lead to costly and time-consuming data recovery. To address this issue, the expectation-maximization (EM) algorithm has been suggested as a solution for situations including missing data. The EM algorithm repeatedly finds the best estimates of parameters in statistical models that depend on variables or data that have not been observed. This is called maximum likelihood or maximum a posteriori (MAP). Using the present estimate as input, the expectation (E) step constructs a log-likelihood function. Finding the parameters that maximize the anticipated log-likelihood, as determined in the E step, is the job of the maximization (M) phase. This study looked at how well the EM algorithm worked on a made-up compositional dataset with missing observations. It used both the robust least square version and ordinary least square regression techniques. The efficacy of the EM algorithm was compared with two alternative imputation techniques, k-Nearest Neighbor (k-NN) and mean imputation (), in terms of Aitchison distances and covariance.展开更多
Two-dimensional endoscopic images are susceptible to interferences such as specular reflections and monotonous texture illumination,hindering accurate three-dimensional lesion reconstruction by surgical robots.This st...Two-dimensional endoscopic images are susceptible to interferences such as specular reflections and monotonous texture illumination,hindering accurate three-dimensional lesion reconstruction by surgical robots.This study proposes a novel end-to-end disparity estimation model to address these challenges.Our approach combines a Pseudo-Siamese neural network architecture with pyramid dilated convolutions,integrating multi-scale image information to enhance robustness against lighting interferences.This study introduces a Pseudo-Siamese structure-based disparity regression model that simplifies left-right image comparison,improving accuracy and efficiency.The model was evaluated using a dataset of stereo endoscopic videos captured by the Da Vinci surgical robot,comprising simulated silicone heart sequences and real heart video data.Experimental results demonstrate significant improvement in the network’s resistance to lighting interference without substantially increasing parameters.Moreover,the model exhibited faster convergence during training,contributing to overall performance enhancement.This study advances endoscopic image processing accuracy and has potential implications for surgical robot applications in complex environments.展开更多
Under-fitting problems usually occur in regression models for dam safety monitoring.To overcome the local convergence of the regression, a genetic algorithm (GA) was proposed using a real parameter coding, a ranking s...Under-fitting problems usually occur in regression models for dam safety monitoring.To overcome the local convergence of the regression, a genetic algorithm (GA) was proposed using a real parameter coding, a ranking selection operator, an arithmetical crossover operator and a uniform mutation operator, and calculated the least-square error of the observed and computed values as its fitness function. The elitist strategy was used to improve the speed of the convergence. After that, the modified genetic algorithm was applied to reassess the coefficients of the regression model and a genetic regression model was set up. As an example, a slotted gravity dam in the Northeast of China was introduced. The computational results show that the genetic regression model can solve the under-fitting problems perfectly.展开更多
A fuzzy observations-based radial basis function neural network (FORBFNN) is presented for modeling nonlinear systems in which the observations of response are imprecise but can be represented as fuzzy membership fu...A fuzzy observations-based radial basis function neural network (FORBFNN) is presented for modeling nonlinear systems in which the observations of response are imprecise but can be represented as fuzzy membership functions. In the FORBFNN model, the weight coefficients of nodes in the hidden layer are identified by using the fuzzy expectation-maximization ( EM ) algorithm, whereas the optimal number of these nodes as well as the centers and widths of radial basis functions are automatically constructed by using a data-driven method. Namely, the method starts with an initial node, and then a new node is added in a hidden layer according to some rules. This procedure is not terminated until the model meets the preset requirements. The method considers both the accuracy and complexity of the model. Numerical simulation results show that the modeling method is effective, and the established model has high prediction accuracy.展开更多
Based on modeling principle of GM(1,1)model and linear regression model,a combined prediction model is established to predict equipment fault by the fitting of two models.The new prediction model takes full advantag...Based on modeling principle of GM(1,1)model and linear regression model,a combined prediction model is established to predict equipment fault by the fitting of two models.The new prediction model takes full advantage of prediction information provided by the two models and improves the prediction precision.Finally,this model is introduced to predict the system fault time according to the output voltages of a certain type of radar transmitter.展开更多
Pyrrolizidine alkaloids(PAs)and their N-oxides(PANOs)are phytotoxins produced by various plant species and have been emerged as environmental pollutants.The sorption/desorption behaviors of PAs/PANOs in soil are cruci...Pyrrolizidine alkaloids(PAs)and their N-oxides(PANOs)are phytotoxins produced by various plant species and have been emerged as environmental pollutants.The sorption/desorption behaviors of PAs/PANOs in soil are crucial due to the horizontal transfer of these natural products from PA-producing plants to soil and subsequently absorbed by plant roots.This study firstly investigated the sorption/desorption behaviors of PAs/PANOs in tea plantation soils with distinct characteristics.Sorption amounts for seneciphylline(Sp)and seneciphylline-N-oxide(SpNO)in three acidic soils ranged from 2.9 to 5.9μg/g and 1.7 to 2.8μg/g,respectively.Desorption percentages for Sp and SpNO were from 22.2%to 30.5%and 36.1%to 43.9%.In the mixed PAs/PANOs systems,stronger sorption of PAs over PANOs was occurred in tested soils.Additionally,the Freundlich models more precisely described the sorption/desorption isotherms.Cation exchange capacity,sand content and total nitrogen were identified as major influencing factors by linear regression models.Overall,the soils exhibiting higher sorption capacities for compounds with greater hydrophobicity.PANOs were more likely to migrate within soils and be absorbed by tea plants.It contributes to the understanding of environmental fate of PAs/PANOs in tea plantations and provides basic data and clues for the development of PAs/PANOs reduction technology.展开更多
High-entropy alloys(HEAs)have emerged as promising catalysts for the hydrogen evolution reaction(HER)due to their compositional diversity and synergistic effects.In this study,machine learning-accelerated density func...High-entropy alloys(HEAs)have emerged as promising catalysts for the hydrogen evolution reaction(HER)due to their compositional diversity and synergistic effects.In this study,machine learning-accelerated density functional theory(DFT)calculations were employed to assess the catalytic performance of PtPd-based HEAs with the formula PtPdXYZ(X,Y,Z=Fe,Co,Ni,Cu,Ru,Rh,Ag,Au;X≠Y≠Z).Among 56 screened HEA(111)surfaces,PtPdRuCoNi(111)was identified as the most promising,with adsorption energies(E_(ads))between−0.50 and−0.60 eV and high d-band center of−1.85 eV,indicating enhanced activity.This surface showed the hydrogen adsorption free energy(ΔG_(H^(*)))of−0.03 eV for hydrogen adsorption,outperforming Pt(111)by achieving a better balance between adsorption and desorption.Machine learning models,particularly extreme gradient boosting regression(XGBR),significantly reduced computational costs while maintaining high accuracy(root-mean-square error,RMSE=0.128 eV).These results demonstrate the potential of HEAs for efficient and sustainable hydrogen production.展开更多
The aim of this study was to assay the polyphenols,flavonoid,polyphenol oxidase and phenylalnine ammonialyase which were relative to the anthocyanins synthesis of purple corn. The optimization of multiple linear regre...The aim of this study was to assay the polyphenols,flavonoid,polyphenol oxidase and phenylalnine ammonialyase which were relative to the anthocyanins synthesis of purple corn. The optimization of multiple linear regression model of anthocyanins synthesis was y=4.383 86-0.205 45x1+5.479 638x2+0.195 575x4. According to standard partial regression coefficient testing,the result indicated that polyphenols content was negatively correlated with anthocyanins and the relative influence to anthocyanins synthesis was-42.7%; flavonoid content and activity of polyphenol oxidase were positively correlated with anthocyanins of purple corn and the relative influence to anthocyanins synthesis were 71.45% and 73.32% respectively. There was no positive correlation between the activity of phenylalnine ammonialyase and anthocyanins of purple corn. The establishment of multiple linear regression model of anthocyanins synthesis was to provide theory foundation of producing anthocyanins in laboratory.展开更多
Amyotrophic lateral sclerosis is a rare neurodegenerative disease characterized by the involvement of both upper and lower motor neurons.Early bilateral limb involvement significantly affects patients'daily lives ...Amyotrophic lateral sclerosis is a rare neurodegenerative disease characterized by the involvement of both upper and lower motor neurons.Early bilateral limb involvement significantly affects patients'daily lives and may lead them to be confined to bed.However,the effect of upper and lower motor neuron impairment and other risk factors on bilateral limb involvement is unclear.To address this issue,we retrospectively collected data from 586 amyotrophic lateral sclerosis patients with limb onset diagnosed at Peking University Third Hospital between January 2020 and May 2022.A univariate analysis revealed no significant differences in the time intervals of spread in different directions between individuals with upper motor neuron-dominant amyotrophic lateral sclerosis and those with classic amyotrophic lateral sclerosis.We used causal directed acyclic graphs for risk factor determination and Cox proportional hazards models to investigate the association between the duration of bilateral limb involvement and clinical baseline characteristics in amyotrophic lateral sclerosis patients.Multiple factor analyses revealed that higher upper motor neuron scores(hazard ratio[HR]=1.05,95%confidence interval[CI]=1.01–1.09,P=0.018),onset in the left limb(HR=0.72,95%CI=0.58–0.89,P=0.002),and a horizontal pattern of progression(HR=0.46,95%CI=0.37–0.58,P<0.001)were risk factors for a shorter interval until bilateral limb involvement.The results demonstrated that a greater degree of upper motor neuron involvement might cause contralateral limb involvement to progress more quickly in limb-onset amyotrophic lateral sclerosis patients.These findings may improve the management of amyotrophic lateral sclerosis patients with limb onset and the prediction of patient prognosis.展开更多
文摘The subset threshold auto regressive (SSTAR) model, which is capable of reproducing the limit cycle behavior of nonlinear time series, is introduced. The algorithm for fitting the sampled data with SSTAR model is proposed and applied to model and forecast power load. Numerical example verifies that desirable accuracy of short term load forecasting can be achieved by using the SSTAR model.
基金the Project of the Key Open Laboratory of Atmospheric Detection,China Meteorological Administration(2023KLAS02M)the Second Batch of Science and Technology Project of China Meteorological Administration("Jiebangguashuai"):the Research and Development of Short-term and Near-term Warning Products for Severe Convective Weather in Beijing-Tianjin-Hebei Region(CMAJBGS202307).
文摘Firstly,based on the data of air quality and the meteorological data in Baoding City from 2017 to 2021,the correlations of meteorological elements and pollutants with O_(3)concentration were explored to determine the forecast factors of forecast models.Secondly,the O_(3)-8h concentration in Baoding City in 2021 was predicted based on the constructed models of multiple linear regression(MLR),backward propagation neural network(BPNN),and auto regressive integrated moving average(ARIMA),and the predicted values were compared with the observed values to test their prediction effects.The results show that overall,the MLR,BPNN and ARIMA models were able to forecast the changing trend of O_(3)-8h concentration in Baoding in 2021,but the BPNN model gave better forecast results than the ARIMA and MLR models,especially for the prediction of the high values of O_(3)-8h concentration,and the correlation coefficients between the predicted values and the observed values were all higher than 0.9 during June-September.The mean error(ME),mean absolute error(MAE),and root mean square error(RMSE)of the predicted values and the observed values of daily O_(3)-8h concentration based on the BPNN model were 0.45,19.11 and 24.41μg/m 3,respectively,which were significantly better than those of the MLR and ARIMA models.The prediction effects of the MLR,BPNN and ARIMA models were the best at the pollution level,followed by the excellent level,and it was the worst at the good level.In comparison,the prediction effect of BPNN model was better than that of the MLR and ARIMA models as a whole,especially for the pollution and excellent levels.The TS scores of the BPNN model were all above 66%,and the PC values were above 86%.The BPNN model can forecast the changing trend of O_(3)concentration more accurately,and has a good practical application value,but at the same time,the predicted high values of O_(3)concentration should be appropriately increased according to error characteristics of the model.
基金financed as part of the project“Development of a methodology for instrumental base formation for analysis and modeling of the spatial socio-economic development of systems based on internal reserves in the context of digitalization”(FSEG-2023-0008)funded by the Russian Science Foundation(Agreement 23-41-10001,https://doi.org/https://rscf.ru/project/23-41-10001/).
文摘The results of mass appraisal in many countries are used as a basis for calculating the amount of real estate tax,therefore,regardless of the methods used to calculate it,the resulting value should be as close as possible to the market value of the real estate to maintain a balance of interests between the state and the rights holders.In practice,this condition is not always met,since,firstly,the quality of market data is often very low,and secondly,some markets are characterized by low activity,which is expressed in a deficit of information on asking prices.The aim of the work is ecological valuation of land use:how regression-based mass appraisal can inform ecological conservation,land degradation,and sustainable land management.Four multiple regression models were constructed for AI generated map of land plots for recreational use in St.Petersburg(Russia)with different volumes of market information(32,30,20 and 15 units of market information with four price-forming factors).During the analysis of the quality of the models,it was revealed that the best result is shown by the model built on the maximum sample size,then the model based on 15 analogs,which proves that a larger number of analog objects does not always allow us to achieve better results,since the more analog objects there are.
基金supported in part by Sichuan Science and Technology Program under Grant No.2025ZNSFSC151in part by the Strategic Priority Research Program of Chinese Academy of Sciences under Grant No.XDA27030201+1 种基金the Natural Science Foundation of China under Grant No.U21B6001in part by the Natural Science Foundation of Tianjin under Grant No.24JCQNJC01930.
文摘The work proposes a distributed Kalman filtering(KF)algorithm to track a time-varying unknown signal process for a stochastic regression model over network systems in a cooperative way.We provide the stability analysis of the proposed distributed KF algorithm without independent and stationary signal assumptions,which implies that the theoretical results are able to be applied to stochastic feedback systems.Note that the main difficulty of stability analysis lies in analyzing the properties of the product of non-independent and non-stationary random matrices involved in the error equation.We employ analysis techniques such as stochastic Lyapunov function,stability theory of stochastic systems,and algebraic graph theory to deal with the above issue.The stochastic spatio-temporal cooperative information condition shows the cooperative property of multiple sensors that even though any local sensor cannot track the time-varying unknown signal,the distributed KF algorithm can be utilized to finish the filtering task in a cooperative way.At last,we illustrate the property of the proposed distributed KF algorithm by a simulation example.
基金supported by the National Natural Science Foundation of China(Grant No.51969013)the Natural Science Foundation of Gansu Province(Grant No.21JR7RA225).
文摘Moistube irrigation is a new micro-irrigation technology.Accurately estimating its wetting pattern dimensions presents a challenge.Therefore,it is necessary to develop models for efficient assessment of the wetting transport pattern in order to design a cost-effective moistube irrigation system.To achieve this goal,this study developed a multivariate nonlinear regression model and compared it with a dimensional model.HYDRUS-2D was used to perform numerical simulations of 56 irrigation scenarios with different factors.The experiments showed that the shape of the wetting soil body approximated a cylinder and was mainly affected by soil texture,pressure head,and matric potential.A multivariate nonlinear model using a power function relationship between wetting size and irrigation time was developed,with a determination coefficient greater than 0.99.The model was validated for cases with six soil texture types,with mean average absolute errors of 0.43-0.90 cm,root mean square errors of 0.51-0.95 cm,and mean deviation percentage values of 3.23%-6.27%.The multivariate nonlinear regression model outperformed the dimensional model.It can therefore provide a scientific foundation for the development of moistube irrigation systems.
文摘BACKGROUND Congenital heart disease is most commonly seen in neonates and it is a major cause of pediatric illness and childhood morbidity and mortality.AIM To identify and build the best predictive model for predicting cyanotic and acyanotic congenital heart disease in children during pregnancy and identify their potential risk factors.METHODS The data were collected from the Pediatric Cardiology Department at Chaudhry Pervaiz Elahi Institute of Cardiology Multan,Pakistan from December 2017 to October 2019.A sample of 3900 mothers whose children were diagnosed with identify the potential outliers.Different machine learning models were compared,and the best-fitted model was selected using the area under the curve,sensitivity,and specificity of the models.RESULTS Out of 3900 patients included,about 69.5%had acyanotic and 30.5%had cyanotic congenital heart disease.Males had more cases of acyanotic(53.6%)and cyanotic(54.5%)congenital heart disease as compared to females.The odds of having cyanotic was 1.28 times higher for children whose mothers used more fast food frequently during pregnancy.The artificial neural network model was selected as the best predictive model with an area under the curve of 0.9012,sensitivity of 65.76%,and specificity of 97.23%.CONCLUSION Children having a positive family history are at very high risk of having cyanotic and acyanotic congenital heart disease.Males are more at risk and their mothers need more care,good food,and physical activity during pregnancy.The best-fitted model for predicting cyanotic and acyanotic congenital heart disease is the artificial neural network.The results obtained and the best model identified will be useful for medical practitioners and public health scientists for an informed decision-making process about the earlier diagnosis and improve the health condition of children in Pakistan.
基金This work was performed under the auspices of the National Nuclear Security Administration of the US Department of Energy at Los Alamos National Laboratory under Contract No.89233218CNA000001The Authors gratefully acknowledge the support of the US Department of Energy National Nuclear Security Administration Advanced Simulation and Computing Program.LA-UR-22-33159.
文摘To accurately model flows with shock waves using staggered-grid Lagrangian hydrodynamics, the artificial viscosity has to be introduced to convert kinetic energy into internal energy, thereby increasing the entropy across shocks. Determining the appropriate strength of the artificial viscosity is an art and strongly depends on the particular problem and experience of the researcher. The objective of this study is to pose the problem of finding the appropriate strength of the artificial viscosity as an optimization problem and solve this problem using machine learning (ML) tools, specifically using surrogate models based on Gaussian Process regression (GPR) and Bayesian analysis. We describe the optimization method and discuss various practical details of its implementation. The shock-containing problems for which we apply this method all have been implemented in the LANL code FLAG (Burton in Connectivity structures and differencing techniques for staggered-grid free-Lagrange hydrodynamics, Tech. Rep. UCRL-JC-110555, Lawrence Livermore National Laboratory, Livermore, CA, 1992, 1992, in Consistent finite-volume discretization of hydrodynamic conservation laws for unstructured grids, Tech. Rep. CRL-JC-118788, Lawrence Livermore National Laboratory, Livermore, CA, 1992, 1994, Multidimensional discretization of conservation laws for unstructured polyhedral grids, Tech. Rep. UCRL-JC-118306, Lawrence Livermore National Laboratory, Livermore, CA, 1992, 1994, in FLAG, a multi-dimensional, multiple mesh, adaptive free-Lagrange, hydrodynamics code. In: NECDC, 1992). First, we apply ML to find optimal values to isolated shock problems of different strengths. Second, we apply ML to optimize the viscosity for a one-dimensional (1D) propagating detonation problem based on Zel’dovich-von Neumann-Doring (ZND) (Fickett and Davis in Detonation: theory and experiment. Dover books on physics. Dover Publications, Mineola, 2000) detonation theory using a reactive burn model. We compare results for default (currently used values in FLAG) and optimized values of the artificial viscosity for these problems demonstrating the potential for significant improvement in the accuracy of computations.
文摘As maritime activities increase globally,there is a greater dependency on technology in monitoring,control,and surveillance of vessel activity.One of the most prominent systems for monitoring vessel activity is the Automatic Identification System(AIS).An increase in both vessels fitted with AIS transponders and satellite and terrestrial AIS receivers has resulted in a significant increase in AIS messages received globally.This resultant rich spatial and temporal data source related to vessel activity provides analysts with the ability to perform enhanced vessel movement analytics,of which a pertinent example is the improvement of vessel location predictions.In this paper,we propose a novel strategy for predicting future locations of vessels making use of historic AIS data.The proposed method uses a Linear Regression Model(LRM)and utilizes historic AIS movement data in the form of a-priori generated spatial maps of the course over ground(LRMAC).The LRMAC is an accurate low complexity first-order method that is easy to implement operationally and shows promising results in areas where there is a consistency in the directionality of historic vessel movement.In areas where the historic directionality of vessel movement is diverse,such as areas close to harbors and ports,the LRMAC defaults to the LRM.The proposed LRMAC method is compared to the Single-Point Neighbor Search(SPNS),which is also a first-order method and has a similar level of computational complexity,and for the use case of predicting tanker and cargo vessel trajectories up to 8 hours into the future,the LRMAC showed improved results both in terms of prediction accuracy and execution time.
文摘Carbon emissions have become a critical concern in the global effort to combat climate change,with each country or region contributing differently based on its economic structures,energy sources,and industrial activities.The factors influencing carbon emissions vary across countries and sectors.This study examined the factors influencing CO_(2)emissions in the 7 South American countries including Argentina,Brazil,Chile,Colombia,Ecuador,Peru,and Venezuela.We used the Seemingly Unrelated Regression(SUR)model to analyse the relationship of CO_(2)emissions with gross domestic product(GDP),renewable energy use,urbanization,industrialization,international tourism,agricultural productivity,and forest area based on data from 2000 to 2022.According to the SUR model,we found that GDP and industrialization had a moderate positive effect on CO_(2)emissions,whereas renewable energy use had a moderate negative effect on CO_(2)emissions.International tourism generally had a positive impact on CO_(2)emissions,while forest area tended to decrease CO_(2)emissions.Different variables had different effects on CO_(2)emissions in the 7 South American countries.In Argentina and Venezuela,GDP,international tourism,and agricultural productivity significantly affected CO_(2)emissions.In Colombia,GDP and international tourism had a negative impact on CO_(2)emissions.In Brazil,CO_(2)emissions were primarily driven by GDP,while in Chile,Ecuador,and Peru,international tourism had a negative effect on CO_(2)emissions.Overall,this study highlights the importance of country-specific strategies for reducing CO_(2)emissions and emphasizes the varying roles of these driving factors in shaping environmental quality in the 7 South American countries.
基金National Social Science Fund Project“Research on the Operational Risks and Prevention of Government Procurement of Community Services Project System”(Project No.21CSH018)Research and Application of SDM Cigarette Supply Strategy Based on Consumer Data Analysis(Project No.2023ASXM07)。
文摘This study aims to analyze and predict the relationship between the average price per box in the cigarette market of City A and government procurement,providing a scientific basis and support for decision-making.By reviewing relevant theories and literature,qualitative prediction methods,regression prediction models,and other related theories were explored.Through the analysis of annual cigarette sales data and government procurement data in City A,a comprehensive understanding of the development of the tobacco industry and the economic trends of tobacco companies in the county was obtained.By predicting and analyzing the average price per box of cigarette sales across different years,corresponding prediction results were derived and compared with actual sales data.The prediction results indicate that the correlation coefficient between the average price per box of cigarette sales and government procurement is 0.982,implying that government procurement accounts for 96.4%of the changes in the average price per box of cigarettes.These findings offer an in-depth exploration of the relationship between the average price per box of cigarettes in City A and government procurement,providing a scientific foundation for corporate decision-making and market operations.
文摘Municipal solid waste generation is strongly linked to rising human population and expanding urban areas, with significant implications on urban metabolism as well as space and place values redefinition. Effective management performance of municipal solid waste management underscores the interdisciplinarity strategies. Such knowledge and skills are paramount to uncover the sources of waste generation as well as means of waste storage, collection, recycling, transportation, handling/treatment, disposal, and monitoring. This study was conducted in Dar es Salaam city. Driven by the curiosity model of the solid waste minimization performance at source, study data was collected using focus group discussion techniques to ward-level local government officers, which was triangulated with literature and documentary review. The main themes of the FGD were situational factors (SFA) and local government by-laws (LGBY). In the FGD session, sub-themes of SFA tricked to understand how MSW minimization is related to the presence and effect of services such as land use planning, availability of landfills, solid waste transfer stations, material recovery facilities, incinerators, solid waste collection bins, solid waste trucks, solid waste management budget and solid waste collection agents. Similarly, FGD on LGBY was extended by sub-themes such as contents of the by-law, community awareness of the by-law, and by-law enforcement mechanisms. While data preparation applied an analytical hierarchy process, data analysis applied an ordinary least square (OLS) regression model for sub-criteria that explain SFA and LGBY;and OLS standard residues as variables into geographically weighted regression with a resolution of 241 × 241 meter in ArcMap v10.5. Results showed that situational factors and local government by-laws have a strong relationship with the rate of minimizing solid waste dumping in water bodies (local R square = 0.94).
文摘Compositional data, such as relative information, is a crucial aspect of machine learning and other related fields. It is typically recorded as closed data or sums to a constant, like 100%. The statistical linear model is the most used technique for identifying hidden relationships between underlying random variables of interest. However, data quality is a significant challenge in machine learning, especially when missing data is present. The linear regression model is a commonly used statistical modeling technique used in various applications to find relationships between variables of interest. When estimating linear regression parameters which are useful for things like future prediction and partial effects analysis of independent variables, maximum likelihood estimation (MLE) is the method of choice. However, many datasets contain missing observations, which can lead to costly and time-consuming data recovery. To address this issue, the expectation-maximization (EM) algorithm has been suggested as a solution for situations including missing data. The EM algorithm repeatedly finds the best estimates of parameters in statistical models that depend on variables or data that have not been observed. This is called maximum likelihood or maximum a posteriori (MAP). Using the present estimate as input, the expectation (E) step constructs a log-likelihood function. Finding the parameters that maximize the anticipated log-likelihood, as determined in the E step, is the job of the maximization (M) phase. This study looked at how well the EM algorithm worked on a made-up compositional dataset with missing observations. It used both the robust least square version and ordinary least square regression techniques. The efficacy of the EM algorithm was compared with two alternative imputation techniques, k-Nearest Neighbor (k-NN) and mean imputation (), in terms of Aitchison distances and covariance.
基金Supported by Sichuan Science and Technology Program(2023YFSY0026,2023YFH0004)Supported by the Institute of Information&Communications Technology Planning&Evaluation(IITP)grant funded by the Korean government(MSIT)(No.RS-2022-00155885,Artificial Intelligence Convergence Innovation Human Resources Development(Hanyang University ERICA)).
文摘Two-dimensional endoscopic images are susceptible to interferences such as specular reflections and monotonous texture illumination,hindering accurate three-dimensional lesion reconstruction by surgical robots.This study proposes a novel end-to-end disparity estimation model to address these challenges.Our approach combines a Pseudo-Siamese neural network architecture with pyramid dilated convolutions,integrating multi-scale image information to enhance robustness against lighting interferences.This study introduces a Pseudo-Siamese structure-based disparity regression model that simplifies left-right image comparison,improving accuracy and efficiency.The model was evaluated using a dataset of stereo endoscopic videos captured by the Da Vinci surgical robot,comprising simulated silicone heart sequences and real heart video data.Experimental results demonstrate significant improvement in the network’s resistance to lighting interference without substantially increasing parameters.Moreover,the model exhibited faster convergence during training,contributing to overall performance enhancement.This study advances endoscopic image processing accuracy and has potential implications for surgical robot applications in complex environments.
文摘Under-fitting problems usually occur in regression models for dam safety monitoring.To overcome the local convergence of the regression, a genetic algorithm (GA) was proposed using a real parameter coding, a ranking selection operator, an arithmetical crossover operator and a uniform mutation operator, and calculated the least-square error of the observed and computed values as its fitness function. The elitist strategy was used to improve the speed of the convergence. After that, the modified genetic algorithm was applied to reassess the coefficients of the regression model and a genetic regression model was set up. As an example, a slotted gravity dam in the Northeast of China was introduced. The computational results show that the genetic regression model can solve the under-fitting problems perfectly.
基金The National Natural Science Foundation of China(No.51106025,51106027,51036002)Specialized Research Fund for the Doctoral Program of Higher Education(No.20130092110061)the Youth Foundation of Nanjing Institute of Technology(No.QKJA201303)
文摘A fuzzy observations-based radial basis function neural network (FORBFNN) is presented for modeling nonlinear systems in which the observations of response are imprecise but can be represented as fuzzy membership functions. In the FORBFNN model, the weight coefficients of nodes in the hidden layer are identified by using the fuzzy expectation-maximization ( EM ) algorithm, whereas the optimal number of these nodes as well as the centers and widths of radial basis functions are automatically constructed by using a data-driven method. Namely, the method starts with an initial node, and then a new node is added in a hidden layer according to some rules. This procedure is not terminated until the model meets the preset requirements. The method considers both the accuracy and complexity of the model. Numerical simulation results show that the modeling method is effective, and the established model has high prediction accuracy.
基金National Natural Science Foundation of China(No.51175480)
文摘Based on modeling principle of GM(1,1)model and linear regression model,a combined prediction model is established to predict equipment fault by the fitting of two models.The new prediction model takes full advantage of prediction information provided by the two models and improves the prediction precision.Finally,this model is introduced to predict the system fault time according to the output voltages of a certain type of radar transmitter.
基金supported by the earmarked fund for the Modern Agro-Industry Technology Research System (No.CARS-19)the Innovative Research Team in Chinese Academy of Agricultural Sciences (No.CAAS ASTIP-2014-TRICAAS).
文摘Pyrrolizidine alkaloids(PAs)and their N-oxides(PANOs)are phytotoxins produced by various plant species and have been emerged as environmental pollutants.The sorption/desorption behaviors of PAs/PANOs in soil are crucial due to the horizontal transfer of these natural products from PA-producing plants to soil and subsequently absorbed by plant roots.This study firstly investigated the sorption/desorption behaviors of PAs/PANOs in tea plantation soils with distinct characteristics.Sorption amounts for seneciphylline(Sp)and seneciphylline-N-oxide(SpNO)in three acidic soils ranged from 2.9 to 5.9μg/g and 1.7 to 2.8μg/g,respectively.Desorption percentages for Sp and SpNO were from 22.2%to 30.5%and 36.1%to 43.9%.In the mixed PAs/PANOs systems,stronger sorption of PAs over PANOs was occurred in tested soils.Additionally,the Freundlich models more precisely described the sorption/desorption isotherms.Cation exchange capacity,sand content and total nitrogen were identified as major influencing factors by linear regression models.Overall,the soils exhibiting higher sorption capacities for compounds with greater hydrophobicity.PANOs were more likely to migrate within soils and be absorbed by tea plants.It contributes to the understanding of environmental fate of PAs/PANOs in tea plantations and provides basic data and clues for the development of PAs/PANOs reduction technology.
基金the Second Century Fund(C2F),Chulalongkorn UniversityThailand Science Research and Innovation Fund Chulalongkorn University(No.IND_FF_68_054_2100_009)National Science and Technology Development Agency,Thailand,Hub of Knowledge funding,and the Mid-Career Research Grant 2024,National Research Council of Thailand(No.N42A670295).
文摘High-entropy alloys(HEAs)have emerged as promising catalysts for the hydrogen evolution reaction(HER)due to their compositional diversity and synergistic effects.In this study,machine learning-accelerated density functional theory(DFT)calculations were employed to assess the catalytic performance of PtPd-based HEAs with the formula PtPdXYZ(X,Y,Z=Fe,Co,Ni,Cu,Ru,Rh,Ag,Au;X≠Y≠Z).Among 56 screened HEA(111)surfaces,PtPdRuCoNi(111)was identified as the most promising,with adsorption energies(E_(ads))between−0.50 and−0.60 eV and high d-band center of−1.85 eV,indicating enhanced activity.This surface showed the hydrogen adsorption free energy(ΔG_(H^(*)))of−0.03 eV for hydrogen adsorption,outperforming Pt(111)by achieving a better balance between adsorption and desorption.Machine learning models,particularly extreme gradient boosting regression(XGBR),significantly reduced computational costs while maintaining high accuracy(root-mean-square error,RMSE=0.128 eV).These results demonstrate the potential of HEAs for efficient and sustainable hydrogen production.
文摘The aim of this study was to assay the polyphenols,flavonoid,polyphenol oxidase and phenylalnine ammonialyase which were relative to the anthocyanins synthesis of purple corn. The optimization of multiple linear regression model of anthocyanins synthesis was y=4.383 86-0.205 45x1+5.479 638x2+0.195 575x4. According to standard partial regression coefficient testing,the result indicated that polyphenols content was negatively correlated with anthocyanins and the relative influence to anthocyanins synthesis was-42.7%; flavonoid content and activity of polyphenol oxidase were positively correlated with anthocyanins of purple corn and the relative influence to anthocyanins synthesis were 71.45% and 73.32% respectively. There was no positive correlation between the activity of phenylalnine ammonialyase and anthocyanins of purple corn. The establishment of multiple linear regression model of anthocyanins synthesis was to provide theory foundation of producing anthocyanins in laboratory.
基金supported by the National Natural Science Foundation of China,Nos.82071426,81873784Clinical Cohort Construction Program of Peking University Third Hospital,No.BYSYDL2019002(all to DF)。
文摘Amyotrophic lateral sclerosis is a rare neurodegenerative disease characterized by the involvement of both upper and lower motor neurons.Early bilateral limb involvement significantly affects patients'daily lives and may lead them to be confined to bed.However,the effect of upper and lower motor neuron impairment and other risk factors on bilateral limb involvement is unclear.To address this issue,we retrospectively collected data from 586 amyotrophic lateral sclerosis patients with limb onset diagnosed at Peking University Third Hospital between January 2020 and May 2022.A univariate analysis revealed no significant differences in the time intervals of spread in different directions between individuals with upper motor neuron-dominant amyotrophic lateral sclerosis and those with classic amyotrophic lateral sclerosis.We used causal directed acyclic graphs for risk factor determination and Cox proportional hazards models to investigate the association between the duration of bilateral limb involvement and clinical baseline characteristics in amyotrophic lateral sclerosis patients.Multiple factor analyses revealed that higher upper motor neuron scores(hazard ratio[HR]=1.05,95%confidence interval[CI]=1.01–1.09,P=0.018),onset in the left limb(HR=0.72,95%CI=0.58–0.89,P=0.002),and a horizontal pattern of progression(HR=0.46,95%CI=0.37–0.58,P<0.001)were risk factors for a shorter interval until bilateral limb involvement.The results demonstrated that a greater degree of upper motor neuron involvement might cause contralateral limb involvement to progress more quickly in limb-onset amyotrophic lateral sclerosis patients.These findings may improve the management of amyotrophic lateral sclerosis patients with limb onset and the prediction of patient prognosis.