In this investigation,the Gradient Boosting(GB),Linear Regression(LR),Decision Tree(DT),and Voting algo-rithms were applied to predict the distribution pattern of Au geochemical data.Trace and indicator elements,inclu...In this investigation,the Gradient Boosting(GB),Linear Regression(LR),Decision Tree(DT),and Voting algo-rithms were applied to predict the distribution pattern of Au geochemical data.Trace and indicator elements,including Mo,Cu,Pb,Zn,Ag,Ni,Co,Mn,Fe,and As,were used with these machine learning algorithms(MLAs)to predict Au concentration values in the Doostbigloo porphyry Cu-Au-Mo mineralization area.The performance of the models was evaluated using the Mean Absolute Percentage Error(MAPE)and Root Mean Square Error(RMSE)metrics.The proposed ensemble Voting algorithm outperformed the other models,yielding more ac-curate predictions according to both metrics.The predicted data from the GB,LR,DT,and Voting MLAs were modeled using the Concentration-Area fractal method,and Au geochemical anomalies were mapped.To compare and validate the results,factors such as the location of the mineral deposits,their surface extent,and mineralization trend were considered.The results indicate that integrating hybrid MLAs with fractal modeling signifi-cantly improves geochemical prospectivity mapping.Among the four models,three(DT,GB,Voting)accurately identified both mineral deposits.The LR model,however,only identified Deposit I(central),and its mineralization trend diverged from the field data.The GB and Voting models produced similar results,with their final maps derived from fractal modeling showing the same anomalous areas.The anomaly boundaries identified by these two models are consistent with the two known reserves in the region.The results and plots related to prediction indicators and error rates for these two models also show high similarity,with lower error rates than the other models.Notably,the Voting model demonstrated superior performance in accurately delineating mineral deposit locations and identifying realistic mineralization trends while minimizing false anomalies.展开更多
The work proposes a distributed Kalman filtering(KF)algorithm to track a time-varying unknown signal process for a stochastic regression model over network systems in a cooperative way.We provide the stability analysi...The work proposes a distributed Kalman filtering(KF)algorithm to track a time-varying unknown signal process for a stochastic regression model over network systems in a cooperative way.We provide the stability analysis of the proposed distributed KF algorithm without independent and stationary signal assumptions,which implies that the theoretical results are able to be applied to stochastic feedback systems.Note that the main difficulty of stability analysis lies in analyzing the properties of the product of non-independent and non-stationary random matrices involved in the error equation.We employ analysis techniques such as stochastic Lyapunov function,stability theory of stochastic systems,and algebraic graph theory to deal with the above issue.The stochastic spatio-temporal cooperative information condition shows the cooperative property of multiple sensors that even though any local sensor cannot track the time-varying unknown signal,the distributed KF algorithm can be utilized to finish the filtering task in a cooperative way.At last,we illustrate the property of the proposed distributed KF algorithm by a simulation example.展开更多
Choosing optimal parameters for support vector regression (SVR) is an important step in SVR. design, which strongly affects the pefformance of SVR. In this paper, based on the analysis of influence of SVR parameters...Choosing optimal parameters for support vector regression (SVR) is an important step in SVR. design, which strongly affects the pefformance of SVR. In this paper, based on the analysis of influence of SVR parameters on generalization error, a new approach with two steps is proposed for selecting SVR parameters, First the kernel function and SVM parameters are optimized roughly through genetic algorithm, then the kernel parameter is finely adjusted by local linear search, This approach has been successfully applied to the prediction model of the sulfur content in hot metal. The experiment results show that the proposed approach can yield better generalization performance of SVR than other methods,展开更多
This paper describes a robust support vector regression (SVR) methodology, which can offer superior performance for important process engineering problems. The method incorporates hybrid support vector regression an...This paper describes a robust support vector regression (SVR) methodology, which can offer superior performance for important process engineering problems. The method incorporates hybrid support vector regression and genetic algorithm technique (SVR-GA) for efficient tuning of SVR meta-parameters. The algorithm has been applied for prediction of pressure drop of solid liquid slurry flow. A comparison with selected correlations in the lit- erature showed that the developed SVR correlation noticeably improved the prediction of pressure drop over a wide range of operating conditions, physical properties, and pipe diameters.展开更多
The main purpose of blasting operation is to produce desired and optimum mean size rock fragments.Smaller or fine fragments cause the loss of ore during loading and transportation,whereas large or coarser fragments ne...The main purpose of blasting operation is to produce desired and optimum mean size rock fragments.Smaller or fine fragments cause the loss of ore during loading and transportation,whereas large or coarser fragments need to be further processed,which enhances production cost.Therefore,accurate prediction of rock fragmentation is crucial in blasting operations.Mean fragment size(MFS) is a crucial index that measures the goodness of blasting designs.Over the past decades,various models have been proposed to evaluate and predict blasting fragmentation.Among these models,artificial intelligence(AI)-based models are becoming more popular due to their outstanding prediction results for multiinfluential factors.In this study,support vector regression(SVR) techniques are adopted as the basic prediction tools,and five types of optimization algorithms,i.e.grid search(GS),grey wolf optimization(GWO),particle swarm optimization(PSO),genetic algorithm(GA) and salp swarm algorithm(SSA),are implemented to improve the prediction performance and optimize the hyper-parameters.The prediction model involves 19 influential factors that constitute a comprehensive blasting MFS evaluation system based on AI techniques.Among all the models,the GWO-v-SVR-based model shows the best comprehensive performance in predicting MFS in blasting operation.Three types of mathematical indices,i.e.mean square error(MSE),coefficient of determination(R^(2)) and variance accounted for(VAF),are utilized for evaluating the performance of different prediction models.The R^(2),MSE and VAF values for the training set are 0.8355,0.00138 and 80.98,respectively,whereas 0.8353,0.00348 and 82.41,respectively for the testing set.Finally,sensitivity analysis is performed to understand the influence of input parameters on MFS.It shows that the most sensitive factor in blasting MFS is the uniaxial compressive strength.展开更多
The safety factor is a crucial quantitative index for evaluating slope stability.However,the traditional calculation methods suffer from unreasonable assumptions,complex soil composition,and inadequate consideration o...The safety factor is a crucial quantitative index for evaluating slope stability.However,the traditional calculation methods suffer from unreasonable assumptions,complex soil composition,and inadequate consideration of the influencing factors,leading to large errors in their calculations.Therefore,a stacking ensemble learning model(stacking-SSAOP)based on multi-layer regression algorithm fusion and optimized by the sparrow search algorithm is proposed for predicting the slope safety factor.In this method,the density,cohesion,friction angle,slope angle,slope height,and pore pressure ratio are selected as characteristic parameters from the 210 sets of established slope sample data.Random Forest,Extra Trees,AdaBoost,Bagging,and Support Vector regression are used as the base model(inner loop)to construct the first-level regression algorithm layer,and XGBoost is used as the meta-model(outer loop)to construct the second-level regression algorithm layer and complete the construction of the stacked learning model for improving the model prediction accuracy.The sparrow search algorithm is used to optimize the hyperparameters of the above six regression models and correct the over-and underfitting problems of the single regression model to further improve the prediction accuracy.The mean square error(MSE)of the predicted and true values and the fitting of the data are compared and analyzed.The MSE of the stacking-SSAOP model was found to be smaller than that of the single regression model(MSE=0.03917).Therefore,the former has a higher prediction accuracy and better data fitting.This study innovatively applies the sparrow search algorithm to predict the slope safety factor,showcasing its advantages over traditional methods.Additionally,our proposed stacking-SSAOP model integrates multiple regression algorithms to enhance prediction accuracy.This model not only refines the prediction accuracy of the slope safety factor but also offers a fresh approach to handling the intricate soil composition and other influencing factors,making it a precise and reliable method for slope stability evaluation.This research holds importance for the modernization and digitalization of slope safety assessments.展开更多
Gaussian process(GP)has fewer parameters,simple model and output of probabilistic sense,when compared with the methods such as support vector machines.Selection of the hyper-parameters is critical to the performance o...Gaussian process(GP)has fewer parameters,simple model and output of probabilistic sense,when compared with the methods such as support vector machines.Selection of the hyper-parameters is critical to the performance of Gaussian process model.However,the common-used algorithm has the disadvantages of difficult determination of iteration steps,over-dependence of optimization effect on initial values,and easily falling into local optimum.To solve this problem,a method combining the Gaussian process with memetic algorithm was proposed.Based on this method,memetic algorithm was used to search the optimal hyper parameters of Gaussian process regression(GPR)model in the training process and form MA-GPR algorithms,and then the model was used to predict and test the results.When used in the marine long-range precision strike system(LPSS)battle effectiveness evaluation,the proposed MA-GPR model significantly improved the prediction accuracy,compared with the conjugate gradient method and the genetic algorithm optimization process.展开更多
Linear Least Square (LLS) is an approach for modeling regression analysis, applied for prediction and quantification of the strength of relationship between dependent and independent variables. There are a number of m...Linear Least Square (LLS) is an approach for modeling regression analysis, applied for prediction and quantification of the strength of relationship between dependent and independent variables. There are a number of methods for solving the LLS problem but as soon as the data size increases and system becomes ill conditioned, the classical methods become complex at time and space with decreasing level of accuracy. Proposed work is based on prediction and quantification of the strength of relationship between sugar fasting and Post-Prandial (PP) sugar with 73 factors that affect diabetes. Due to the large number of independent variables, presented problem of diabetes prediction also presented similar complexities. ABS method is an approach proven better than other classical approaches for LLS problems. ABS algorithm has been applied for solving LLS problem. Hence, separate regression equations were obtained for sugar fasting and PP severity.展开更多
Budgeting planning plays an important role in coordinating activities in organizations. An accurate sales volume forecasting is the key to the entire budgeting process. All of the other parts of the master budget are ...Budgeting planning plays an important role in coordinating activities in organizations. An accurate sales volume forecasting is the key to the entire budgeting process. All of the other parts of the master budget are dependent on the sales volume forecasting in some way. If the sales volume forecasting is sloppily done, then the rest of the budgeting process is largely a waste of time. Therefore, the sales volume forecasting process is a critical one for most businesses, and also a difficult area of management. Most of researches and companies use the statistical methods, regression analysis, or sophisticated computer simulations to analyze the sales volume forecasting. Recently, various prediction Artificial Intelligent (AI) techniques have been proposed in forecasting. Support Vector Regression (SVR) has been applied successfully to solve problems in numerous fields and proved to be a better prediction model. However, the select of appropriate SVR parameters is difficult. Therefore, to improve the accuracy of SVR, a hybrid intelligent support system based on evolutionary computation to solve the difficulties involved with the parameters selection is presented in this research. Genetic Algorithms (GAs) are used to optimize free parameters of SVR. The experimental results indicate that GA-SVR can achieve better forecasting accuracy and performance than traditional SVR and artificial neural network (ANN) prediction models in sales volume forecasting.展开更多
Rainfall forecasting is becoming more and more significant and precipitation anomalies would lead to droughts and floods disasters.However,because of the complexity and non-stationary of rainfall data,it is difficult ...Rainfall forecasting is becoming more and more significant and precipitation anomalies would lead to droughts and floods disasters.However,because of the complexity and non-stationary of rainfall data,it is difficult to forecast.In this paper,a novel hybrid model to forecast rainfall is developed by incorporating singular spectrum analysis (SSA) and dragonfly algorithm (DA) into support vector regression (SVR) method.Firstly,SSA is used for extracting the trend components of the hydrological data.Then,SVR is utilized to deal with the volatility and irregularity of the precipitation series.Finally,the parameter of SVR is optimized by DA.The proposed SSA-DA-SVR method is used to forecast the monthly precipitation for Songbai,Panshui,Lanma and Jiulongchi stations.To validate the efficiency of the method,four compared models,DA-SVR,SSA-GWO-SVR,SSA-PSO-SVR and SSA-CS-SVR are established.The result shows that the proposed method has the best performance among all five models,and its prediction has high precision and accuracy.展开更多
The stochastic simulation algorithm (SSA) accurately depicts spatially homogeneous wellstirred chemically reacting systems with small populations of chemical species and properly represents noise, but it is often ab...The stochastic simulation algorithm (SSA) accurately depicts spatially homogeneous wellstirred chemically reacting systems with small populations of chemical species and properly represents noise, but it is often abandoned when modeling larger systems because of its computational complexity. In this work, a twin support vector regression based stochastic simulations algorithm (TS^3A) is proposed by combining the twin support vector regression and SSA, the former is a well-known robust regression method in machine learning. Numerical results indicate that this proposed algorithm can be applied to a wide range of chemically reacting systems and obtain significant improvements on efficiency and accuracy with fewer simulating runs over the existing methods.展开更多
Due to the geological body uncertainty,the identification of the surrounding rock parameters in the tunnel construction process is of great significance to the calculation of tunnel stability.The ubiquitous-joint mode...Due to the geological body uncertainty,the identification of the surrounding rock parameters in the tunnel construction process is of great significance to the calculation of tunnel stability.The ubiquitous-joint model and three-dimensional numerical simulation have advantages in the parameter identification of surrounding rock with weak planes,but conventional methods have certain problems,such as a large number of parameters and large time consumption.To solve the problems,this study combines the orthogonal design,Gaussian process(GP)regression,and difference evolution(DE)optimization,and it constructs the parameters identification method of the jointed surrounding rock.The calculation process of parameters identification of a tunnel jointed surrounding rock based on the GP optimized by the DE includes the following steps.First,a three-dimensional numerical simulation based on the ubiquitous-joint model is conducted according to the orthogonal and uniform design parameters combing schemes,where the model input consists of jointed rock parameters and model output is the information on the surrounding rock displacement and stress.Then,the GP regress model optimized by DE is trained by the data samples.Finally,the GP model is integrated into the DE algorithm,and the absolute differences in the displacement and stress between calculated and monitored values are used as the objective function,while the parameters of the jointed surrounding rock are used as variables and identified.The proposed method is verified by the experiments with a joint rock surface in the Dadongshan tunnel,which is located in Dalian,China.The obtained calculation and analysis results are as follows:CR=0.9,F=0.6,NP=100,and the difference strategy DE/Best/1 is recommended.The results of the back analysis are compared with the field monitored values,and the relative error is 4.58%,which is satisfactory.The algorithm influencing factors are also discussed,and it is found that the local correlation coefficientσf and noise standard deviationσn affected the prediction accuracy of the GP model.The results show that the proposed method is feasible and can achieve high identification precision.The study provides an effective reference for parameter identification of jointed surrounding rock in a tunnel.展开更多
With the development of UAV technology,UAV aerial magnetic survey plays an important role in the airborne geophysical prospecting.In the aeromagnetic survey,the magnetic field interferences generated by the magnetic c...With the development of UAV technology,UAV aerial magnetic survey plays an important role in the airborne geophysical prospecting.In the aeromagnetic survey,the magnetic field interferences generated by the magnetic components on the aircraft greatly affect the accuracy of the survey results.Therefore,it is necessary to use aeromagnetic compensation technology to eliminate the interfering magnetic field.So far,the aeromagnetic compensation methods used are mainly linear regression compensation methods based on the T-L equation.The least square is one of the most commonly used methods to solve multiple linear regressions.However,considering that the correlation between data may lead to instability of the algorithm,we use the ridge regression algorithm to solve the multicollinearity problem in the T-L equation.Subsequently this method is applied to the aeromagnetic survey data,and the standard deviation is selected as the index to evaluate the compensation effect to verify the effectiveness of the method.展开更多
The study on scientific analysis and prediction of China’s future carbon emissions is conducive to balancing the relationship between economic development and carbon emissions in the new era,and actively responding t...The study on scientific analysis and prediction of China’s future carbon emissions is conducive to balancing the relationship between economic development and carbon emissions in the new era,and actively responding to climate change policy.Through the analysis of the application of the generalized regression neural network(GRNN)in prediction,this paper improved the prediction method of GRNN.Genetic algorithm(GA)was adopted to search the optimal smooth factor as the only factor of GRNN,which was then used for prediction in GRNN.During the prediction of carbon dioxide emissions using the improved method,the increments of data were taken into account.The target values were obtained after the calculation of the predicted results.Finally,compared with the results of GRNN,the improved method realized higher prediction accuracy.It thus offers a new way of predicting total carbon dioxide emissions,and the prediction results can provide macroscopic guidance and decision-making reference for China’s environmental protection and trading of carbon emissions.展开更多
BACKGROUND Difficulty of colonoscopy insertion(DCI)significantly affects colonoscopy effectiveness and serves as a key quality indicator.Predicting and evaluating DCI risk preoperatively is crucial for optimizing intr...BACKGROUND Difficulty of colonoscopy insertion(DCI)significantly affects colonoscopy effectiveness and serves as a key quality indicator.Predicting and evaluating DCI risk preoperatively is crucial for optimizing intraoperative strategies.AIM To evaluate the predictive performance of machine learning(ML)algorithms for DCI by comparing three modeling approaches,identify factors influencing DCI,and develop a preoperative prediction model using ML algorithms to enhance colonoscopy quality and efficiency.METHODS This cross-sectional study enrolled 712 patients who underwent colonoscopy at a tertiary hospital between June 2020 and May 2021.Demographic data,past medical history,medication use,and psychological status were collected.The endoscopist assessed DCI using the visual analogue scale.After univariate screening,predictive models were developed using multivariable logistic regression,least absolute shrinkage and selection operator(LASSO)regression,and random forest(RF)algorithms.Model performance was evaluated based on discrimination,calibration,and decision curve analysis(DCA),and results were visualized using nomograms.RESULTS A total of 712 patients(53.8%male;mean age 54.5 years±12.9 years)were included.Logistic regression analysis identified constipation[odds ratio(OR)=2.254,95%confidence interval(CI):1.289-3.931],abdominal circumference(AC)(77.5–91.9 cm,OR=1.895,95%CI:1.065-3.350;AC≥92 cm,OR=1.271,95%CI:0.730-2.188),and anxiety(OR=1.071,95%CI:1.044-1.100)as predictive factors for DCI,validated by LASSO and RF methods.Model performance revealed training/validation sensitivities of 0.826/0.925,0.924/0.868,and 1.000/0.981;specificities of 0.602/0.511,0.510/0.562,and 0.977/0.526;and corresponding area under the receiver operating characteristic curves(AUCs)of 0.780(0.737-0.823)/0.726(0.654-0.799),0.754(0.710-0.798)/0.723(0.656-0.791),and 1.000(1.000-1.000)/0.754(0.688-0.820),respectively.DCA indicated optimal net benefit within probability thresholds of 0-0.9 and 0.05-0.37.The RF model demonstrated superior diagnostic accuracy,reflected by perfect training sensitivity(1.000)and highest validation AUC(0.754),outperforming other methods in clinical applicability.CONCLUSION The RF-based model exhibited superior predictive accuracy for DCI compared to multivariable logistic and LASSO regression models.This approach supports individualized preoperative optimization,enhancing colonoscopy quality through targeted risk stratification.展开更多
A general regression neural network model,combined with an interative algorithm(GRNNI)using sparsely distributed samples and auxiliary environmental variables was proposed to predict both spatial distribution and vari...A general regression neural network model,combined with an interative algorithm(GRNNI)using sparsely distributed samples and auxiliary environmental variables was proposed to predict both spatial distribution and variability of soil organic matter(SOM)in a bamboo forest.The auxiliary environmental variables were:elevation,slope,mean annual temperature,mean annual precipitation,and normalized difference vegetation index.The prediction accuracy of this model was assessed via three accuracy indices,mean error(ME),mean absolute error(MAE),and root mean squared error(RMSE)for validation in sampling sites.Both the prediction accuracy and reliability of this model were compared to those of regression kriging(RK)and ordinary kriging(OK).The results show that the prediction accuracy of the GRNNI model was higher than that of both RK and OK.The three accuracy indices(ME,MAE,and RMSE)of the GRNNI model were lower than those of RK and OK.Relative improvements of RMSE of the GRNNI model compared with RK and OK were 13.6%and 17.5%,respectively.In addition,a more realistic spatial pattern of SOM was produced by the model because the GRNNI model was more suitable than multiple linear regression to capture the nonlinear relationship between SOM and the auxiliary environmental variables.Therefore,the GRNNI model can improve both prediction accuracy and reliability for determining spatial distribution and variability of SOM.展开更多
Regression testing is a widely used approach to confirm the correct functionality of the software in incremental development.The use of test cases makes it easier to test the ripple effect of changed requirements.Rigo...Regression testing is a widely used approach to confirm the correct functionality of the software in incremental development.The use of test cases makes it easier to test the ripple effect of changed requirements.Rigorous testingmay help in meeting the quality criteria that is based on the conformance to the requirements as given by the intended stakeholders.However,a minimized and prioritized set of test cases may reduce the efforts and time required for testingwhile focusing on the timely delivery of the software application.In this research,a technique named Test Reduce has been presented to get a minimal set of test cases based on high priority to ensure that the web applicationmeets the required quality criteria.A new technique TestReduce is proposed with a blend of genetic algorithm to find an optimized and minimal set of test cases.The ultimate objective associated with this study is to provide a technique that may solve the minimization problem of regression test cases in the case of linked requirements.In this research,the 100-Dollar prioritization approach is used to define the priority of the new requirements.展开更多
Autism spectrum disorder(ASD),classified as a developmental disability,is now more common in children than ever.A drastic increase in the rate of autism spectrum disorder in children worldwide demands early detection ...Autism spectrum disorder(ASD),classified as a developmental disability,is now more common in children than ever.A drastic increase in the rate of autism spectrum disorder in children worldwide demands early detection of autism in children.Parents can seek professional help for a better prognosis of the child’s therapy when ASD is diagnosed under five years.This research study aims to develop an automated tool for diagnosing autism in children.The computer-aided diagnosis tool for ASD detection is designed and developed by a novel methodology that includes data acquisition,feature selection,and classification phases.The most deterministic features are selected from the self-acquired dataset by novel feature selection methods before classification.The Imperialistic competitive algorithm(ICA)based on empires conquering colonies performs feature selection in this study.The performance of Logistic Regression(LR),Decision tree,K-Nearest Neighbor(KNN),and Random Forest(RF)classifiers are experimentally studied in this research work.The experimental results prove that the Logistic regression classifier exhibits the highest accuracy for the self-acquired dataset.The ASD detection is evaluated experimentally with the Least Absolute Shrinkage and Selection Operator(LASSO)feature selection method and different classifiers.The Exploratory Data Analysis(EDA)phase has uncovered crucial facts about the data,like the correlation of the features in the dataset with the class variable.展开更多
Regression testing(RT)is an essential but an expensive activity in software development.RT confirms that new faults/errors will not have occurred in the modified program.RT efficiency can be improved through an effect...Regression testing(RT)is an essential but an expensive activity in software development.RT confirms that new faults/errors will not have occurred in the modified program.RT efficiency can be improved through an effective technique of selected only modified test cases that appropriate to the modifications within the given time frame.Earlier,several test case selection approaches have been introduced,but either these techniques were not sufficient according to the requirements of software tester experts or they are ineffective and cannot be used for available test suite specifications and architecture.To address these limitations,we recommend an improved and efficient test case selection(TCS)algorithm for RT.Our proposed technique decreases the execution time and redundancy of the duplicate test cases(TC)and detects onlymodified changes that appropriate to themodifications in test cases.To reduce execution time for TCS,evaluation results of our proposed approach are established on fault detection,redundancy and already executed test case.Results indicate that proposed technique decreases the inclusive testing time of TCS to execute modified test cases by,on average related to a method of Hybrid Whale Algorithm(HWOA),which is a progressive TCS approach in regression testing for a single product.展开更多
This paper concerns computational problems of the concave penalized linear regression model.We propose a fixed point iterative algorithm to solve the computational problem based on the fact that the penalized estimato...This paper concerns computational problems of the concave penalized linear regression model.We propose a fixed point iterative algorithm to solve the computational problem based on the fact that the penalized estimator satisfies a fixed point equation.The convergence property of the proposed algorithm is established.Numerical studies are conducted to evaluate the finite sample performance of the proposed algorithm.展开更多
文摘In this investigation,the Gradient Boosting(GB),Linear Regression(LR),Decision Tree(DT),and Voting algo-rithms were applied to predict the distribution pattern of Au geochemical data.Trace and indicator elements,including Mo,Cu,Pb,Zn,Ag,Ni,Co,Mn,Fe,and As,were used with these machine learning algorithms(MLAs)to predict Au concentration values in the Doostbigloo porphyry Cu-Au-Mo mineralization area.The performance of the models was evaluated using the Mean Absolute Percentage Error(MAPE)and Root Mean Square Error(RMSE)metrics.The proposed ensemble Voting algorithm outperformed the other models,yielding more ac-curate predictions according to both metrics.The predicted data from the GB,LR,DT,and Voting MLAs were modeled using the Concentration-Area fractal method,and Au geochemical anomalies were mapped.To compare and validate the results,factors such as the location of the mineral deposits,their surface extent,and mineralization trend were considered.The results indicate that integrating hybrid MLAs with fractal modeling signifi-cantly improves geochemical prospectivity mapping.Among the four models,three(DT,GB,Voting)accurately identified both mineral deposits.The LR model,however,only identified Deposit I(central),and its mineralization trend diverged from the field data.The GB and Voting models produced similar results,with their final maps derived from fractal modeling showing the same anomalous areas.The anomaly boundaries identified by these two models are consistent with the two known reserves in the region.The results and plots related to prediction indicators and error rates for these two models also show high similarity,with lower error rates than the other models.Notably,the Voting model demonstrated superior performance in accurately delineating mineral deposit locations and identifying realistic mineralization trends while minimizing false anomalies.
基金supported in part by Sichuan Science and Technology Program under Grant No.2025ZNSFSC151in part by the Strategic Priority Research Program of Chinese Academy of Sciences under Grant No.XDA27030201+1 种基金the Natural Science Foundation of China under Grant No.U21B6001in part by the Natural Science Foundation of Tianjin under Grant No.24JCQNJC01930.
文摘The work proposes a distributed Kalman filtering(KF)algorithm to track a time-varying unknown signal process for a stochastic regression model over network systems in a cooperative way.We provide the stability analysis of the proposed distributed KF algorithm without independent and stationary signal assumptions,which implies that the theoretical results are able to be applied to stochastic feedback systems.Note that the main difficulty of stability analysis lies in analyzing the properties of the product of non-independent and non-stationary random matrices involved in the error equation.We employ analysis techniques such as stochastic Lyapunov function,stability theory of stochastic systems,and algebraic graph theory to deal with the above issue.The stochastic spatio-temporal cooperative information condition shows the cooperative property of multiple sensors that even though any local sensor cannot track the time-varying unknown signal,the distributed KF algorithm can be utilized to finish the filtering task in a cooperative way.At last,we illustrate the property of the proposed distributed KF algorithm by a simulation example.
文摘Choosing optimal parameters for support vector regression (SVR) is an important step in SVR. design, which strongly affects the pefformance of SVR. In this paper, based on the analysis of influence of SVR parameters on generalization error, a new approach with two steps is proposed for selecting SVR parameters, First the kernel function and SVM parameters are optimized roughly through genetic algorithm, then the kernel parameter is finely adjusted by local linear search, This approach has been successfully applied to the prediction model of the sulfur content in hot metal. The experiment results show that the proposed approach can yield better generalization performance of SVR than other methods,
文摘This paper describes a robust support vector regression (SVR) methodology, which can offer superior performance for important process engineering problems. The method incorporates hybrid support vector regression and genetic algorithm technique (SVR-GA) for efficient tuning of SVR meta-parameters. The algorithm has been applied for prediction of pressure drop of solid liquid slurry flow. A comparison with selected correlations in the lit- erature showed that the developed SVR correlation noticeably improved the prediction of pressure drop over a wide range of operating conditions, physical properties, and pipe diameters.
基金funded by the National Natural Science Foundation of China(Grant No.42177164)the Innovation-Driven Project of Central South University(Grant No.2020CX040)supported by China Scholarship Council(Grant No.202006370006)。
文摘The main purpose of blasting operation is to produce desired and optimum mean size rock fragments.Smaller or fine fragments cause the loss of ore during loading and transportation,whereas large or coarser fragments need to be further processed,which enhances production cost.Therefore,accurate prediction of rock fragmentation is crucial in blasting operations.Mean fragment size(MFS) is a crucial index that measures the goodness of blasting designs.Over the past decades,various models have been proposed to evaluate and predict blasting fragmentation.Among these models,artificial intelligence(AI)-based models are becoming more popular due to their outstanding prediction results for multiinfluential factors.In this study,support vector regression(SVR) techniques are adopted as the basic prediction tools,and five types of optimization algorithms,i.e.grid search(GS),grey wolf optimization(GWO),particle swarm optimization(PSO),genetic algorithm(GA) and salp swarm algorithm(SSA),are implemented to improve the prediction performance and optimize the hyper-parameters.The prediction model involves 19 influential factors that constitute a comprehensive blasting MFS evaluation system based on AI techniques.Among all the models,the GWO-v-SVR-based model shows the best comprehensive performance in predicting MFS in blasting operation.Three types of mathematical indices,i.e.mean square error(MSE),coefficient of determination(R^(2)) and variance accounted for(VAF),are utilized for evaluating the performance of different prediction models.The R^(2),MSE and VAF values for the training set are 0.8355,0.00138 and 80.98,respectively,whereas 0.8353,0.00348 and 82.41,respectively for the testing set.Finally,sensitivity analysis is performed to understand the influence of input parameters on MFS.It shows that the most sensitive factor in blasting MFS is the uniaxial compressive strength.
基金supported by the Basic Research Special Plan of Yunnan Provincial Department of Science and Technology-General Project(Grant No.202101AT070094)。
文摘The safety factor is a crucial quantitative index for evaluating slope stability.However,the traditional calculation methods suffer from unreasonable assumptions,complex soil composition,and inadequate consideration of the influencing factors,leading to large errors in their calculations.Therefore,a stacking ensemble learning model(stacking-SSAOP)based on multi-layer regression algorithm fusion and optimized by the sparrow search algorithm is proposed for predicting the slope safety factor.In this method,the density,cohesion,friction angle,slope angle,slope height,and pore pressure ratio are selected as characteristic parameters from the 210 sets of established slope sample data.Random Forest,Extra Trees,AdaBoost,Bagging,and Support Vector regression are used as the base model(inner loop)to construct the first-level regression algorithm layer,and XGBoost is used as the meta-model(outer loop)to construct the second-level regression algorithm layer and complete the construction of the stacked learning model for improving the model prediction accuracy.The sparrow search algorithm is used to optimize the hyperparameters of the above six regression models and correct the over-and underfitting problems of the single regression model to further improve the prediction accuracy.The mean square error(MSE)of the predicted and true values and the fitting of the data are compared and analyzed.The MSE of the stacking-SSAOP model was found to be smaller than that of the single regression model(MSE=0.03917).Therefore,the former has a higher prediction accuracy and better data fitting.This study innovatively applies the sparrow search algorithm to predict the slope safety factor,showcasing its advantages over traditional methods.Additionally,our proposed stacking-SSAOP model integrates multiple regression algorithms to enhance prediction accuracy.This model not only refines the prediction accuracy of the slope safety factor but also offers a fresh approach to handling the intricate soil composition and other influencing factors,making it a precise and reliable method for slope stability evaluation.This research holds importance for the modernization and digitalization of slope safety assessments.
基金Project(513300303)supported by the General Armament Department,China
文摘Gaussian process(GP)has fewer parameters,simple model and output of probabilistic sense,when compared with the methods such as support vector machines.Selection of the hyper-parameters is critical to the performance of Gaussian process model.However,the common-used algorithm has the disadvantages of difficult determination of iteration steps,over-dependence of optimization effect on initial values,and easily falling into local optimum.To solve this problem,a method combining the Gaussian process with memetic algorithm was proposed.Based on this method,memetic algorithm was used to search the optimal hyper parameters of Gaussian process regression(GPR)model in the training process and form MA-GPR algorithms,and then the model was used to predict and test the results.When used in the marine long-range precision strike system(LPSS)battle effectiveness evaluation,the proposed MA-GPR model significantly improved the prediction accuracy,compared with the conjugate gradient method and the genetic algorithm optimization process.
文摘Linear Least Square (LLS) is an approach for modeling regression analysis, applied for prediction and quantification of the strength of relationship between dependent and independent variables. There are a number of methods for solving the LLS problem but as soon as the data size increases and system becomes ill conditioned, the classical methods become complex at time and space with decreasing level of accuracy. Proposed work is based on prediction and quantification of the strength of relationship between sugar fasting and Post-Prandial (PP) sugar with 73 factors that affect diabetes. Due to the large number of independent variables, presented problem of diabetes prediction also presented similar complexities. ABS method is an approach proven better than other classical approaches for LLS problems. ABS algorithm has been applied for solving LLS problem. Hence, separate regression equations were obtained for sugar fasting and PP severity.
文摘Budgeting planning plays an important role in coordinating activities in organizations. An accurate sales volume forecasting is the key to the entire budgeting process. All of the other parts of the master budget are dependent on the sales volume forecasting in some way. If the sales volume forecasting is sloppily done, then the rest of the budgeting process is largely a waste of time. Therefore, the sales volume forecasting process is a critical one for most businesses, and also a difficult area of management. Most of researches and companies use the statistical methods, regression analysis, or sophisticated computer simulations to analyze the sales volume forecasting. Recently, various prediction Artificial Intelligent (AI) techniques have been proposed in forecasting. Support Vector Regression (SVR) has been applied successfully to solve problems in numerous fields and proved to be a better prediction model. However, the select of appropriate SVR parameters is difficult. Therefore, to improve the accuracy of SVR, a hybrid intelligent support system based on evolutionary computation to solve the difficulties involved with the parameters selection is presented in this research. Genetic Algorithms (GAs) are used to optimize free parameters of SVR. The experimental results indicate that GA-SVR can achieve better forecasting accuracy and performance than traditional SVR and artificial neural network (ANN) prediction models in sales volume forecasting.
文摘Rainfall forecasting is becoming more and more significant and precipitation anomalies would lead to droughts and floods disasters.However,because of the complexity and non-stationary of rainfall data,it is difficult to forecast.In this paper,a novel hybrid model to forecast rainfall is developed by incorporating singular spectrum analysis (SSA) and dragonfly algorithm (DA) into support vector regression (SVR) method.Firstly,SSA is used for extracting the trend components of the hydrological data.Then,SVR is utilized to deal with the volatility and irregularity of the precipitation series.Finally,the parameter of SVR is optimized by DA.The proposed SSA-DA-SVR method is used to forecast the monthly precipitation for Songbai,Panshui,Lanma and Jiulongchi stations.To validate the efficiency of the method,four compared models,DA-SVR,SSA-GWO-SVR,SSA-PSO-SVR and SSA-CS-SVR are established.The result shows that the proposed method has the best performance among all five models,and its prediction has high precision and accuracy.
基金This work was supported by the National Natural Science Foundation of China (No.30871341), the National High-Tech Research and Development Program of China (No.2006AA02-Z190), the Shanghai Leading Academic Discipline Project (No.S30405), and the Natural Science Foundation of Shanghai Normal University (No.SK200937).
文摘The stochastic simulation algorithm (SSA) accurately depicts spatially homogeneous wellstirred chemically reacting systems with small populations of chemical species and properly represents noise, but it is often abandoned when modeling larger systems because of its computational complexity. In this work, a twin support vector regression based stochastic simulations algorithm (TS^3A) is proposed by combining the twin support vector regression and SSA, the former is a well-known robust regression method in machine learning. Numerical results indicate that this proposed algorithm can be applied to a wide range of chemically reacting systems and obtain significant improvements on efficiency and accuracy with fewer simulating runs over the existing methods.
基金This work was supported by the National Natural Science Foundation of China(Nos.51678101,52078093)Liaoning Revitalization Talents Program(No.XLYC1905015).
文摘Due to the geological body uncertainty,the identification of the surrounding rock parameters in the tunnel construction process is of great significance to the calculation of tunnel stability.The ubiquitous-joint model and three-dimensional numerical simulation have advantages in the parameter identification of surrounding rock with weak planes,but conventional methods have certain problems,such as a large number of parameters and large time consumption.To solve the problems,this study combines the orthogonal design,Gaussian process(GP)regression,and difference evolution(DE)optimization,and it constructs the parameters identification method of the jointed surrounding rock.The calculation process of parameters identification of a tunnel jointed surrounding rock based on the GP optimized by the DE includes the following steps.First,a three-dimensional numerical simulation based on the ubiquitous-joint model is conducted according to the orthogonal and uniform design parameters combing schemes,where the model input consists of jointed rock parameters and model output is the information on the surrounding rock displacement and stress.Then,the GP regress model optimized by DE is trained by the data samples.Finally,the GP model is integrated into the DE algorithm,and the absolute differences in the displacement and stress between calculated and monitored values are used as the objective function,while the parameters of the jointed surrounding rock are used as variables and identified.The proposed method is verified by the experiments with a joint rock surface in the Dadongshan tunnel,which is located in Dalian,China.The obtained calculation and analysis results are as follows:CR=0.9,F=0.6,NP=100,and the difference strategy DE/Best/1 is recommended.The results of the back analysis are compared with the field monitored values,and the relative error is 4.58%,which is satisfactory.The algorithm influencing factors are also discussed,and it is found that the local correlation coefficientσf and noise standard deviationσn affected the prediction accuracy of the GP model.The results show that the proposed method is feasible and can achieve high identification precision.The study provides an effective reference for parameter identification of jointed surrounding rock in a tunnel.
文摘With the development of UAV technology,UAV aerial magnetic survey plays an important role in the airborne geophysical prospecting.In the aeromagnetic survey,the magnetic field interferences generated by the magnetic components on the aircraft greatly affect the accuracy of the survey results.Therefore,it is necessary to use aeromagnetic compensation technology to eliminate the interfering magnetic field.So far,the aeromagnetic compensation methods used are mainly linear regression compensation methods based on the T-L equation.The least square is one of the most commonly used methods to solve multiple linear regressions.However,considering that the correlation between data may lead to instability of the algorithm,we use the ridge regression algorithm to solve the multicollinearity problem in the T-L equation.Subsequently this method is applied to the aeromagnetic survey data,and the standard deviation is selected as the index to evaluate the compensation effect to verify the effectiveness of the method.
文摘The study on scientific analysis and prediction of China’s future carbon emissions is conducive to balancing the relationship between economic development and carbon emissions in the new era,and actively responding to climate change policy.Through the analysis of the application of the generalized regression neural network(GRNN)in prediction,this paper improved the prediction method of GRNN.Genetic algorithm(GA)was adopted to search the optimal smooth factor as the only factor of GRNN,which was then used for prediction in GRNN.During the prediction of carbon dioxide emissions using the improved method,the increments of data were taken into account.The target values were obtained after the calculation of the predicted results.Finally,compared with the results of GRNN,the improved method realized higher prediction accuracy.It thus offers a new way of predicting total carbon dioxide emissions,and the prediction results can provide macroscopic guidance and decision-making reference for China’s environmental protection and trading of carbon emissions.
基金the Chinese Clinical Trial Registry(No.ChiCTR2000040109)approved by the Hospital Ethics Committee(No.20210130017).
文摘BACKGROUND Difficulty of colonoscopy insertion(DCI)significantly affects colonoscopy effectiveness and serves as a key quality indicator.Predicting and evaluating DCI risk preoperatively is crucial for optimizing intraoperative strategies.AIM To evaluate the predictive performance of machine learning(ML)algorithms for DCI by comparing three modeling approaches,identify factors influencing DCI,and develop a preoperative prediction model using ML algorithms to enhance colonoscopy quality and efficiency.METHODS This cross-sectional study enrolled 712 patients who underwent colonoscopy at a tertiary hospital between June 2020 and May 2021.Demographic data,past medical history,medication use,and psychological status were collected.The endoscopist assessed DCI using the visual analogue scale.After univariate screening,predictive models were developed using multivariable logistic regression,least absolute shrinkage and selection operator(LASSO)regression,and random forest(RF)algorithms.Model performance was evaluated based on discrimination,calibration,and decision curve analysis(DCA),and results were visualized using nomograms.RESULTS A total of 712 patients(53.8%male;mean age 54.5 years±12.9 years)were included.Logistic regression analysis identified constipation[odds ratio(OR)=2.254,95%confidence interval(CI):1.289-3.931],abdominal circumference(AC)(77.5–91.9 cm,OR=1.895,95%CI:1.065-3.350;AC≥92 cm,OR=1.271,95%CI:0.730-2.188),and anxiety(OR=1.071,95%CI:1.044-1.100)as predictive factors for DCI,validated by LASSO and RF methods.Model performance revealed training/validation sensitivities of 0.826/0.925,0.924/0.868,and 1.000/0.981;specificities of 0.602/0.511,0.510/0.562,and 0.977/0.526;and corresponding area under the receiver operating characteristic curves(AUCs)of 0.780(0.737-0.823)/0.726(0.654-0.799),0.754(0.710-0.798)/0.723(0.656-0.791),and 1.000(1.000-1.000)/0.754(0.688-0.820),respectively.DCA indicated optimal net benefit within probability thresholds of 0-0.9 and 0.05-0.37.The RF model demonstrated superior diagnostic accuracy,reflected by perfect training sensitivity(1.000)and highest validation AUC(0.754),outperforming other methods in clinical applicability.CONCLUSION The RF-based model exhibited superior predictive accuracy for DCI compared to multivariable logistic and LASSO regression models.This approach supports individualized preoperative optimization,enhancing colonoscopy quality through targeted risk stratification.
基金The article is supported by National Key Research and Development Projects of P.R.China(No.2018YFD0600100).
文摘A general regression neural network model,combined with an interative algorithm(GRNNI)using sparsely distributed samples and auxiliary environmental variables was proposed to predict both spatial distribution and variability of soil organic matter(SOM)in a bamboo forest.The auxiliary environmental variables were:elevation,slope,mean annual temperature,mean annual precipitation,and normalized difference vegetation index.The prediction accuracy of this model was assessed via three accuracy indices,mean error(ME),mean absolute error(MAE),and root mean squared error(RMSE)for validation in sampling sites.Both the prediction accuracy and reliability of this model were compared to those of regression kriging(RK)and ordinary kriging(OK).The results show that the prediction accuracy of the GRNNI model was higher than that of both RK and OK.The three accuracy indices(ME,MAE,and RMSE)of the GRNNI model were lower than those of RK and OK.Relative improvements of RMSE of the GRNNI model compared with RK and OK were 13.6%and 17.5%,respectively.In addition,a more realistic spatial pattern of SOM was produced by the model because the GRNNI model was more suitable than multiple linear regression to capture the nonlinear relationship between SOM and the auxiliary environmental variables.Therefore,the GRNNI model can improve both prediction accuracy and reliability for determining spatial distribution and variability of SOM.
基金The authors extend their appreciation to the Deanship of Scientific Research at King Khalid University for funding this work through Large Groups,Project under grant number RGP.2/49/43.
文摘Regression testing is a widely used approach to confirm the correct functionality of the software in incremental development.The use of test cases makes it easier to test the ripple effect of changed requirements.Rigorous testingmay help in meeting the quality criteria that is based on the conformance to the requirements as given by the intended stakeholders.However,a minimized and prioritized set of test cases may reduce the efforts and time required for testingwhile focusing on the timely delivery of the software application.In this research,a technique named Test Reduce has been presented to get a minimal set of test cases based on high priority to ensure that the web applicationmeets the required quality criteria.A new technique TestReduce is proposed with a blend of genetic algorithm to find an optimized and minimal set of test cases.The ultimate objective associated with this study is to provide a technique that may solve the minimization problem of regression test cases in the case of linked requirements.In this research,the 100-Dollar prioritization approach is used to define the priority of the new requirements.
基金The authors extend their appreciation to the Deputyship for Research&Innovation,Ministry of Education in Saudi Arabia for funding this research work through the Project Number(IF2-PSAU-2022/01/22043)。
文摘Autism spectrum disorder(ASD),classified as a developmental disability,is now more common in children than ever.A drastic increase in the rate of autism spectrum disorder in children worldwide demands early detection of autism in children.Parents can seek professional help for a better prognosis of the child’s therapy when ASD is diagnosed under five years.This research study aims to develop an automated tool for diagnosing autism in children.The computer-aided diagnosis tool for ASD detection is designed and developed by a novel methodology that includes data acquisition,feature selection,and classification phases.The most deterministic features are selected from the self-acquired dataset by novel feature selection methods before classification.The Imperialistic competitive algorithm(ICA)based on empires conquering colonies performs feature selection in this study.The performance of Logistic Regression(LR),Decision tree,K-Nearest Neighbor(KNN),and Random Forest(RF)classifiers are experimentally studied in this research work.The experimental results prove that the Logistic regression classifier exhibits the highest accuracy for the self-acquired dataset.The ASD detection is evaluated experimentally with the Least Absolute Shrinkage and Selection Operator(LASSO)feature selection method and different classifiers.The Exploratory Data Analysis(EDA)phase has uncovered crucial facts about the data,like the correlation of the features in the dataset with the class variable.
基金This work was supported in part by the Research Management Center(RMC),Universiti Teknologi Malaysia(UTM)and Ministry of Higher Education Malaysia(MOHE)through the UTM High Impact Research(UTMHR)grant scheme under(Vot Number Q.J130000.2451.08G55).
文摘Regression testing(RT)is an essential but an expensive activity in software development.RT confirms that new faults/errors will not have occurred in the modified program.RT efficiency can be improved through an effective technique of selected only modified test cases that appropriate to the modifications within the given time frame.Earlier,several test case selection approaches have been introduced,but either these techniques were not sufficient according to the requirements of software tester experts or they are ineffective and cannot be used for available test suite specifications and architecture.To address these limitations,we recommend an improved and efficient test case selection(TCS)algorithm for RT.Our proposed technique decreases the execution time and redundancy of the duplicate test cases(TC)and detects onlymodified changes that appropriate to themodifications in test cases.To reduce execution time for TCS,evaluation results of our proposed approach are established on fault detection,redundancy and already executed test case.Results indicate that proposed technique decreases the inclusive testing time of TCS to execute modified test cases by,on average related to a method of Hybrid Whale Algorithm(HWOA),which is a progressive TCS approach in regression testing for a single product.
基金Supported by the National Natural Science Foundation of China(11701571)
文摘This paper concerns computational problems of the concave penalized linear regression model.We propose a fixed point iterative algorithm to solve the computational problem based on the fact that the penalized estimator satisfies a fixed point equation.The convergence property of the proposed algorithm is established.Numerical studies are conducted to evaluate the finite sample performance of the proposed algorithm.