Correction to:J.Iron Steel Res.Int.https://doi.org/10.1007/s42243-025-01545-x The publication of this article unfortunately contained mistakes.Equation(14)was not correct.The corrected equation is given below.
Given the growing concern over global warming and the critical role of carbon dioxide(CO_(2))in this phenomenon,the study of CO_(2)-induced alterations in coal strength has garnered significant attention due to its im...Given the growing concern over global warming and the critical role of carbon dioxide(CO_(2))in this phenomenon,the study of CO_(2)-induced alterations in coal strength has garnered significant attention due to its implications for carbon sequestration.A large number of experiments have proved that CO_(2) interaction time(T),saturation pressure(P)and other parameters have significant effects on coal strength.However,accurate evaluation of CO_(2)-induced alterations in coal strength is still a difficult problem,so it is particularly important to establish accurate and efficient prediction models.This study explored the application of advancedmachine learning(ML)algorithms and Gene Expression Programming(GEP)techniques to predict CO_(2)-induced alterations in coal strength.Sixmodels were developed,including three metaheuristic-optimized XGBoost models(GWO-XGBoost,SSA-XGBoost,PO-XGBoost)and three GEP models(GEP-1,GEP-2,GEP-3).Comprehensive evaluations using multiple metrics revealed that all models demonstrated high predictive accuracy,with the SSA-XGBoost model achieving the best performance(R2—Coefficient of determination=0.99396,RMSE—Root Mean Square Error=0.62102,MAE—Mean Absolute Error=0.36164,MAPE—Mean Absolute Percentage Error=4.8101%,RPD—Residual Predictive Deviation=13.4741).Model interpretability analyses using SHAP(Shapley Additive exPlanations),ICE(Individual Conditional Expectation),and PDP(Partial Dependence Plot)techniques highlighted the dominant role of fixed carbon content(FC)and significant interactions between FC and CO_(2) saturation pressure(P).Theresults demonstrated that the proposedmodels effectively address the challenges of CO_(2)-induced strength prediction,providing valuable insights for geological storage safety and environmental applications.展开更多
In order to solve the black-box modeling problem and improve the prediction accuracy of model,two distinguished models for tensile strength(Ts)and yield strength(Ys)of hot-rolled strip steel are established based on t...In order to solve the black-box modeling problem and improve the prediction accuracy of model,two distinguished models for tensile strength(Ts)and yield strength(Ys)of hot-rolled strip steel are established based on the industrial hot-rolled data and the algorithm of gene expression programming(GEP).Firstly,the industrial data of hot-rolled strip steel are preprocessed using the Pauta criterion,so as to eliminate outliers.The key input variables that affect Ys and Ts are selected by using the method of the maximal information coefficient(MIC).Secondly,the explicit prediction models of Ys and Ts are established using GEP.Subsequently,the model results based on GEP are compared with those based on the support vector regression(SVR)and the back propagation neural network(BPNN).Finally,the mathematical expression models for Ys and Ts obtained by GEP are used to further analyse the specific relationships between the chemical composition and mechanical property.It is shown that the errors of Ys and Ts based on GEP are less than 4%,and the coefficient of determination(R^(2))of Ys and Ts based on GEP is above 0.9,which has strong prediction performance.The prediction accuracy of GEP can achieve the same level with SVR and BPNN.It is worth mentioning that the proposed model can not only show the explicit relationship between the chemical composition,production process,and mechanical property of strip steel,but also occupy high prediction accuracy,which can make reliable reference for strip steel product design and optimisation.展开更多
As a critical component of the in situ stress state,determination of the minimum horizontal principal stress plays a significant role in both geotechnical and petroleum engineering.To this end,a gene expression progra...As a critical component of the in situ stress state,determination of the minimum horizontal principal stress plays a significant role in both geotechnical and petroleum engineering.To this end,a gene expression programming(GEP)algorithm-based model,in which the data of borehole breakout size,vertical principal stress,and rock strength characteristics are used as the inputs,is proposed to predict the minimum horizontal principal stress.Seventy-nine(79)samples with seven features are collected to construct the minimum horizontal principal stress dataset used for training models.Twenty-four(24)GEP model hyperparameter sets were configured to explore the key parameter combinations among the inputs and their potential relationships with the minimum horizontal principal stresses.Model performance was evaluated using root mean squared error(RMSE),mean absolute error(MAE),mean absolute percentage error(MAPE),and coefficient of determination(R^(2)).By comparing predictive performance and parameter composition,two models were selected from 24 GEP models that demonstrated excellent predictive performance and simpler parameter composition.Compared with prevalent models,the results indicate that the two selected GEP models have better performance on the test set(R^(2)=0.9568 and 0.9621).Additionally,the results conducted by SHapley Additive exPlanations(SHAP)sensitivity analysis and Local Interpretable Model-agnostic Explanations(LIME)demonstrate that the vertical principal stress is the most influential parameter in both GEP models.The two GEP models have simple parameter compositions as well as stable and excellent prediction performance,which is a viable method for predicting the minimum horizontal principal stresses.展开更多
Assessing the stability of pillars in underground mines(especially in deep underground mines)is a critical concern during both the design and the operational phases of a project.This study mainly focuses on developing...Assessing the stability of pillars in underground mines(especially in deep underground mines)is a critical concern during both the design and the operational phases of a project.This study mainly focuses on developing two practical models to predict pillar stability status.For this purpose,two robust models were developed using a database including 236 case histories from seven underground hard rock mines,based on gene expression programming(GEP)and decision tree-support vector machine(DT-SVM)hybrid algorithms.The performance of the developed models was evaluated based on four common statistical criteria(sensitivity,specificity,Matthews correlation coefficient,and accuracy),receiver operating characteristic(ROC)curve,and testing data sets.The results showed that the GEP and DT-SVM models performed exceptionally well in assessing pillar stability,showing a high level of accuracy.The DT-SVM model,in particular,outperformed the GEP model(accuracy of 0.914,sensitivity of 0.842,specificity of 0.929,Matthews correlation coefficient of 0.767,and area under the ROC of 0.897 for the test data set).Furthermore,upon comparing the developed models with the previous ones,it was revealed that both models can effectively determine the condition of pillar stability with low uncertainty and acceptable accuracy.This suggests that these models could serve as dependable tools for project managers,aiding in the evaluation of pillar stability during the design and operational phases of mining projects,despite the inherent challenges in this domain.展开更多
It is a complicated problem for the bottom-to-top adaptive conceptual design of complicated products between structure and function. Reliable theories demand to be found in order to determine whether the structure acc...It is a complicated problem for the bottom-to-top adaptive conceptual design of complicated products between structure and function. Reliable theories demand to be found in order to determine whether the structure accords with the requirement of design. For the requirement generally is dynamic variety as time passes, new requirements will come, and some initial requirements can no longer be used. The number of product requirements, the gene length expressing requirements, the structure of the product, and the correlation matrix are varied with individuation of customer requirements of the product. By researching on the calculation mechanisms of dynamic variety, the approaches of gene expression and variable length gene expression are proposed. According to the diversity of structure selection in conceptual design and mutual relations between structure and function as well as structure and structure, the correlation matrixes between structure and function as well as structure and structure are defined. By the approach of making the sum of the elements of correlation matrix maximum, the mathematical models of multi-object optimization for structure design are provided based on variable requirements. An improved genetic algorithm called segment genetic algorithm is proposed based on optimization preservation simple genetic algorithm. The models of multi-object optimization are calculated by the segment genetic algorithm and hybrid genetic algorithm. An example for the conceptual design of a washing machine is given to show that the proposed method is able to realize the optimization structure design fitting for variable requirements. In addition, the proposed approach can provide good Pareto optimization solutions, and the individuation customer requirements for structures of products are able to be resolved effectively.展开更多
In this context,two different approaches of soil liquefaction evaluation using a soft computing technique based on the worldwide standard penetration test(SPT) databases have been studied.Gene expression programming(G...In this context,two different approaches of soil liquefaction evaluation using a soft computing technique based on the worldwide standard penetration test(SPT) databases have been studied.Gene expression programming(GEP) as a gray-box modeling approach is used to develop different deterministic models in order to evaluate the occurrence of soil liquefaction in terms of liquefaction field performance indicator(LI) and factor of safety(FS) in logistic regression and classification concepts.The comparative plots illustrate that the classification concept-based models show a better performance than those based on logistic regression.In the probabilistic approach,a calibrated mapping function is developed in the context of Bayes’ theorem in order to capture the failure probabilities(PL) in the absence of the knowledge of parameter uncertainty.Consistent results obtained from the proposed probabilistic models,compared to the most well-known models,indicate the robustness of the methodology used in this study.The probability models provide a simple,but also efficient decision-making tool in engineering design to quantitatively assess the liquefaction triggering thresholds.展开更多
Prediction of mode I fracture toughness(KIC) of rock is of significant importance in rock engineering analyses. In this study, linear multiple regression(LMR) and gene expression programming(GEP)methods were used to p...Prediction of mode I fracture toughness(KIC) of rock is of significant importance in rock engineering analyses. In this study, linear multiple regression(LMR) and gene expression programming(GEP)methods were used to provide a reliable relationship to determine mode I fracture toughness of rock. The presented model was developed based on 60 datasets taken from the previous literature. To predict fracture parameters, three mechanical parameters of rock mass including uniaxial compressive strength(UCS), Brazilian tensile strength(BTS), and elastic modulus(E) have been selected as the input parameters. A cluster of data was collected and divided into two random groups of training and testing datasets.Then, different statistical linear and artificial intelligence based nonlinear analyses were conducted on the training data to provide a reliable prediction model of KIC. These two predictive methods were then evaluated based on the testing data. To evaluate the efficiency of the proposed models for predicting the mode I fracture toughness of rock, various statistical indices including coefficient of determination(R2),root mean square error(RMSE), and mean absolute error(MAE) were utilized herein. In the case of testing datasets, the values of R2, RMSE, and MAE for the GEP model were 0.87, 0.188, and 0.156,respectively, while they were 0.74, 0.473, and 0.223, respectively, for the LMR model. The results indicated that the selected GEP model delivered superior performance with a higher R2value and lower errors.展开更多
In order to minimize the project duration of resourceconstrained project scheduling problem( RCPSP), a gene expression programming-based scheduling rule( GEP-SR) method is proposed to automatically discover and select...In order to minimize the project duration of resourceconstrained project scheduling problem( RCPSP), a gene expression programming-based scheduling rule( GEP-SR) method is proposed to automatically discover and select the effective scheduling rules( SRs) which are constructed using the project status and attributes of the activities. SRs are represented by the chromosomes of GEP, and an improved parallel schedule generation scheme( IPSGS) is used to transform the SRs into explicit schedules. The framework of GEP-SR for RCPSP is designed,and the effectiveness of the GEP-SR approach is demonstrated by comparing with other methods on the same instances.展开更多
The shear stress distribution in circular channels was modeled in this study using gene expression programming(GEP). 173 sets of reliable data were collected under four flow conditions for use in the training and test...The shear stress distribution in circular channels was modeled in this study using gene expression programming(GEP). 173 sets of reliable data were collected under four flow conditions for use in the training and testing stages. The effect of input variables on GEP modeling was studied and 15 different GEP models with individual, binary, ternary, and quaternary input combinations were investigated. The sensitivity analysis results demonstrate that dimensionless parameter y/P, where y is the transverse coordinate, and P is the wetted perimeter, is the most influential parameter with regard to the shear stress distribution in circular channels. GEP model 10, with the parameter y/P and Reynolds number(Re) as inputs, outperformed the other GEP models, with a coefficient of determination of 0.7814 for the testing data set. An equation was derived from the best GEP model and its results were compared with an artificial neural network(ANN) model and an equation based on the Shannon entropy proposed by other researchers. The GEP model, with an average RMSE of 0.0301, exhibits superior performance over the Shannon entropy-based equation, with an average RMSE of 0.1049, and the ANN model, with an average RMSE of 0.2815 for all flow depths.展开更多
Submerged vanes are installed on rivers and channel beds to protect the outer bank bends from scouring.Also,local scouring occurs around the submerged vanes over time,and identifying the effective factors on the scour...Submerged vanes are installed on rivers and channel beds to protect the outer bank bends from scouring.Also,local scouring occurs around the submerged vanes over time,and identifying the effective factors on the scouring phenomena around these submerged vanes is one of the important issues in river engineering.The most important aimof this study is investigation of scour pattern around submerged vanes located in 180°bend experimentally and numerically.Firstly,the effects of various parameters such as the Froude number(Fr),angle of submerged vanes to the flow(α),angle of submerged vane location in the bend(θ),distance between submerged vanes(d),height(H),and length(L)of the vanes on the dimensionless volume of the scour hole were experimentally studied.The submerged vanes were installed on a 180°bend whose central radius and channel width were 2.8 and 0.6 m,respectively.By reducing the Froude number,the scour hole volume decreased.For all Froude numbers,the biggest scour hole formed atθ=15°.In all models,by increasing the Froude number,the scour hole volume significantly increases.In addition,by increasing the submerged vanes’length and height,the scour hole dimensions also grow.Secondly,using gene expression programming(GEP),a relationship for determining the scour hole volume around the submerged vanes was provided.For this model,the determination coefficients(R2)for the training and test modes were computed as 0.91 and 0.9,respectively.In addition,this study performed partial derivative sensitivity analysis(PDSA).According to the results,the PDSA was calculated as positive for all input variables.展开更多
Rock strength is a crucial factor to consider when designing and constructing underground projects.This study utilizes a gene expression programming(GEP)algorithm-based model to predict the true triaxial strength of r...Rock strength is a crucial factor to consider when designing and constructing underground projects.This study utilizes a gene expression programming(GEP)algorithm-based model to predict the true triaxial strength of rocks,taking into account the influence of rock genesis on their mechanical behavior during the model building process.A true triaxial strength criterion based on the GEP model for igneous,metamorphic and magmatic rocks was obtained by training the model using collected data.Compared to the modified Weibols-Cook criterion,the modified Mohr-Coulomb criterion,and the modified Lade criterion,the strength criterion based on the GEP model exhibits superior prediction accuracy performance.The strength criterion based on the GEP model has better performance in R2,RMSE and MAPE for the data set used in this study.Furthermore,the strength criterion based on the GEP model shows greater stability in predicting the true triaxial strength of rocks across different types.Compared to the existing strength criterion based on the genetic programming(GP)model,the proposed criterion based on GEP model achieves more accurate predictions of the variation of true triaxial strength(s1)with intermediate principal stress(s2).Finally,based on the Sobol sensitivity analysis technique,the effects of the parameters of the three obtained strength criteria on the true triaxial strength of the rock are analysed.In general,the proposed strength criterion exhibits superior performance in terms of both accuracy and stability of prediction results.展开更多
Themulti-skill resource-constrained project scheduling problem(MS-RCPSP)is a significantmanagement science problem that extends from the resource-constrained project scheduling problem(RCPSP)and is integrated with a r...Themulti-skill resource-constrained project scheduling problem(MS-RCPSP)is a significantmanagement science problem that extends from the resource-constrained project scheduling problem(RCPSP)and is integrated with a real project and production environment.To solve MS-RCPSP,it is an efficient method to use dispatching rules combined with a parallel scheduling mechanism to generate a scheduling scheme.This paper proposes an improved gene expression programming(IGEP)approach to explore newly dispatching rules that can broadly solve MS-RCPSP.A new backward traversal decoding mechanism,and several neighborhood operators are applied in IGEP.The backward traversal decoding mechanism dramatically reduces the space complexity in the decoding process,and improves the algorithm’s performance.Several neighborhood operators improve the exploration of the potential search space.The experiment takes the intelligent multi-objective project scheduling environment(iMOPSE)benchmark dataset as the training set and testing set of IGEP.Ten newly dispatching rules are discovered and extracted by IGEP,and eight out of ten are superior to other typical dispatching rules.展开更多
Accurate gas viscosity determination is an important issue in the oil and gas industries.Experimental approaches for gas viscosity measurement are timeconsuming,expensive and hardly possible at high pressures and high...Accurate gas viscosity determination is an important issue in the oil and gas industries.Experimental approaches for gas viscosity measurement are timeconsuming,expensive and hardly possible at high pressures and high temperatures(HPHT).In this study,a number of correlations were developed to estimate gas viscosity by the use of group method of data handling(GMDH)type neural network and gene expression programming(GEP)techniques using a large data set containing more than 3000 experimental data points for methane,nitrogen,and hydrocarbon gas mixtures.It is worth mentioning that unlike many of viscosity correlations,the proposed ones in this study could compute gas viscosity at pressures ranging between 34 and 172 MPa and temperatures between 310 and 1300 K.Also,a comparison was performed between the results of these established models and the results of ten wellknown models reported in the literature.Average absolute relative errors of GMDH models were obtained 4.23%,0.64%,and 0.61%for hydrocarbon gas mixtures,methane,and nitrogen,respectively.In addition,graphical analyses indicate that the GMDH can predict gas viscosity with higher accuracy than GEP at HPHT conditions.Also,using leverage technique,valid,suspected and outlier data points were determined.Finally,trends of gas viscosity models at different conditions were evaluated.展开更多
This paper deals with the reflectance estimation model issue to improve the estimation accuracy. We propose a model containing two core procedures: dimensionality reduction and model mining. First, the dimensionality ...This paper deals with the reflectance estimation model issue to improve the estimation accuracy. We propose a model containing two core procedures: dimensionality reduction and model mining. First, the dimensionality reduction algorithm of hyperspectral data based on dependence degree(DRNDDD) is proposed to reduce the redundant hyperspectral band. DRND-DD solves the selection of suitable hyperspectral band via rough set theory. Furthermore, to improve the computation speed and accuracy of the model, based on DRND-DD, this paper proposes reflectance estimation model mining of leaf nitrogen concentration(LNC) for hyperspectral data by using hybrid gene expression programming(REMLNC-HGEP). Experimental results on three datasets demonstrate that the DRND-DD algorithm can obtain good results with a very short running time compared with principal component analysis(PCA), singular value decomposition(SVD), a dimensionality reduction algorithm based on the positive region(AR-PR) and a dimensionality reduction algorithm based on a discernable matrix(ARDM), and REMLNC-HGEP has low average time-consumption, high model mining success ratio and estimation accuracy. It was concluded that the REMLNC-HGEP performs better than the regression methods.展开更多
At present, transcription analysis of gene expression commonly uses housekeeping genes as control for normalization. In this study, the expression levels of three housekeeping genes including GAPDH, β-actin, and 18S ...At present, transcription analysis of gene expression commonly uses housekeeping genes as control for normalization. In this study, the expression levels of three housekeeping genes including GAPDH, β-actin, and 18S rRNA in six tissues and five developmental stages of the Mandarin fish Siniperca chuatsi were assayed with quantitative real-time PCR (qPCR). Differences in expression levels were analyzed using geNorm program. The results demonstrate that β-actin is the most stable gene at developmental stages and GAPDH is the most stable in different tissues. While 18S rRNA expression during development is differentially regulated, which indicates it is suitable as an internal control for gene expression normalization at the developmental level. Overall, the data suggest that the two most stable housekeeping genes are enough to accurately calibrate gene expression in S. chuatsi. The significance of this study provided convincing references and methodology for housekeeping gene selection and normalization in gene expression analysis with regular PCR or qPCR.展开更多
Genetic Programming (GP) is an important approach to deal with complex problem analysis and modeling, and has been applied in a wide range of areas. The development of GP involves various aspects, including design of ...Genetic Programming (GP) is an important approach to deal with complex problem analysis and modeling, and has been applied in a wide range of areas. The development of GP involves various aspects, including design of genetic operators, evolutionary controls and implementations of heuristic strategy, evaluations and other mechanisms. When designing genetic operators, it is necessary to consider the possible limitations of encoding methods of individuals. And when selecting evolutionary control strategies, it is also necessary to balance search efficiency and diversity based on representation characteristics as well as the problem itself. More importantly, all of these matters, among others, have to be implemented through tedious coding work. Therefore, GP development is both complex and time-consuming. To overcome some of these difficulties that hinder the enhancement of GP development efficiency, we explore the feasibility of mutual assistance among GP variants, and then propose a rapid GP prototyping development method based on πGrammatical Evolution (πGE). It is demonstrated through regression analysis experiments that not only is this method beneficial for the GP developers to get rid of some tedious implementations, but also enables them to concentrate on the essence of the referred problem, such as individual representation, decoding means and evaluation. Additionally, it provides new insights into the roles of individual delineations in phenotypes and semantic research of individuals.展开更多
Dear Editor,Epigenetic programming plays a critical role in response to environmental fluctuations in eukaryotes(Jaenisch and Bird,2003).From initial contact to successful colonization,pathogenic microorganisms adeptl...Dear Editor,Epigenetic programming plays a critical role in response to environmental fluctuations in eukaryotes(Jaenisch and Bird,2003).From initial contact to successful colonization,pathogenic microorganisms adeptly adapt to environmental changes by orchestrating a cascade of gene expression.However,how plant pathogens adapt to the host environment during early infection at the epigenetic level is largely elusive.展开更多
Maternal undernutrition or overnutrition during pregnancy alters organ structure, impairs prenatal and neonatal growth and development, and reduces feed efficiency for lean tissue gains in pigs. These adverse effects ...Maternal undernutrition or overnutrition during pregnancy alters organ structure, impairs prenatal and neonatal growth and development, and reduces feed efficiency for lean tissue gains in pigs. These adverse effects may be carried over to the next generation or beyond. This phenomenon of the transgenerational impacts is known as fetal programming, which is mediated by stable and heritable alterations of gene expression through covalent modifications of DNA and histones without changes in DNA sequences(namely, epigenetics). The mechanisms responsible for the epigenetic regulation of protein expression and functions include chromatin remodeling; DNA methylation(occurring at the 5′-position of cytosine residues within CpG dinucleotides); and histone modifications(acetylation, methylation, phosphorylation, and ubiquitination). Like maternal malnutrition, undernutrition during the neonatal period also reduces growth performance and feed efficiency(weight gain:feed intake; also known as weightgain efficiency) in postweaning pigs by 5–10%, thereby increasing the days necessary to reach the market bodyweight. Supplementing functional amino acids(e.g., arginine and glutamine) and vitamins(e.g., folate) play a key role in activating the mammalian target of rapamycin signaling and regulating the provision of methyl donors for DNA and protein methylation. Therefore, these nutrients are beneficial for the dietary treatment of metabolic disorders in offspring with intrauterine growth restriction or neonatal malnutrition. The mechanism-based strategies hold great promise for the improvement of the efficiency of pork production and the sustainability of the global swine industry.展开更多
文摘Correction to:J.Iron Steel Res.Int.https://doi.org/10.1007/s42243-025-01545-x The publication of this article unfortunately contained mistakes.Equation(14)was not correct.The corrected equation is given below.
基金partially supported by the National Natural Science Foundation of China(42177164,52474121)the Outstanding Youth Project of Hunan Provincial Department of Education(23B0008).
文摘Given the growing concern over global warming and the critical role of carbon dioxide(CO_(2))in this phenomenon,the study of CO_(2)-induced alterations in coal strength has garnered significant attention due to its implications for carbon sequestration.A large number of experiments have proved that CO_(2) interaction time(T),saturation pressure(P)and other parameters have significant effects on coal strength.However,accurate evaluation of CO_(2)-induced alterations in coal strength is still a difficult problem,so it is particularly important to establish accurate and efficient prediction models.This study explored the application of advancedmachine learning(ML)algorithms and Gene Expression Programming(GEP)techniques to predict CO_(2)-induced alterations in coal strength.Sixmodels were developed,including three metaheuristic-optimized XGBoost models(GWO-XGBoost,SSA-XGBoost,PO-XGBoost)and three GEP models(GEP-1,GEP-2,GEP-3).Comprehensive evaluations using multiple metrics revealed that all models demonstrated high predictive accuracy,with the SSA-XGBoost model achieving the best performance(R2—Coefficient of determination=0.99396,RMSE—Root Mean Square Error=0.62102,MAE—Mean Absolute Error=0.36164,MAPE—Mean Absolute Percentage Error=4.8101%,RPD—Residual Predictive Deviation=13.4741).Model interpretability analyses using SHAP(Shapley Additive exPlanations),ICE(Individual Conditional Expectation),and PDP(Partial Dependence Plot)techniques highlighted the dominant role of fixed carbon content(FC)and significant interactions between FC and CO_(2) saturation pressure(P).Theresults demonstrated that the proposedmodels effectively address the challenges of CO_(2)-induced strength prediction,providing valuable insights for geological storage safety and environmental applications.
基金supported by the National Natural Science Foundation of China(Grant Nos.52074187 and 52274388)Liaoning Province Artificial Intelligence Innovation and Development Plan Project(Major Science and Technology Project)(2023JH26-10100002)the National Key Research and Development Program of China(No.2022YFB3304800).
文摘In order to solve the black-box modeling problem and improve the prediction accuracy of model,two distinguished models for tensile strength(Ts)and yield strength(Ys)of hot-rolled strip steel are established based on the industrial hot-rolled data and the algorithm of gene expression programming(GEP).Firstly,the industrial data of hot-rolled strip steel are preprocessed using the Pauta criterion,so as to eliminate outliers.The key input variables that affect Ys and Ts are selected by using the method of the maximal information coefficient(MIC).Secondly,the explicit prediction models of Ys and Ts are established using GEP.Subsequently,the model results based on GEP are compared with those based on the support vector regression(SVR)and the back propagation neural network(BPNN).Finally,the mathematical expression models for Ys and Ts obtained by GEP are used to further analyse the specific relationships between the chemical composition and mechanical property.It is shown that the errors of Ys and Ts based on GEP are less than 4%,and the coefficient of determination(R^(2))of Ys and Ts based on GEP is above 0.9,which has strong prediction performance.The prediction accuracy of GEP can achieve the same level with SVR and BPNN.It is worth mentioning that the proposed model can not only show the explicit relationship between the chemical composition,production process,and mechanical property of strip steel,but also occupy high prediction accuracy,which can make reliable reference for strip steel product design and optimisation.
基金partially supported by the National Natural Science Foundation of China(Grant Nos.42177164 and 52474121)the Distinguished Youth Science Foundation of Hunan Province of China(Grant No.2022JJ10073).
文摘As a critical component of the in situ stress state,determination of the minimum horizontal principal stress plays a significant role in both geotechnical and petroleum engineering.To this end,a gene expression programming(GEP)algorithm-based model,in which the data of borehole breakout size,vertical principal stress,and rock strength characteristics are used as the inputs,is proposed to predict the minimum horizontal principal stress.Seventy-nine(79)samples with seven features are collected to construct the minimum horizontal principal stress dataset used for training models.Twenty-four(24)GEP model hyperparameter sets were configured to explore the key parameter combinations among the inputs and their potential relationships with the minimum horizontal principal stresses.Model performance was evaluated using root mean squared error(RMSE),mean absolute error(MAE),mean absolute percentage error(MAPE),and coefficient of determination(R^(2)).By comparing predictive performance and parameter composition,two models were selected from 24 GEP models that demonstrated excellent predictive performance and simpler parameter composition.Compared with prevalent models,the results indicate that the two selected GEP models have better performance on the test set(R^(2)=0.9568 and 0.9621).Additionally,the results conducted by SHapley Additive exPlanations(SHAP)sensitivity analysis and Local Interpretable Model-agnostic Explanations(LIME)demonstrate that the vertical principal stress is the most influential parameter in both GEP models.The two GEP models have simple parameter compositions as well as stable and excellent prediction performance,which is a viable method for predicting the minimum horizontal principal stresses.
文摘Assessing the stability of pillars in underground mines(especially in deep underground mines)is a critical concern during both the design and the operational phases of a project.This study mainly focuses on developing two practical models to predict pillar stability status.For this purpose,two robust models were developed using a database including 236 case histories from seven underground hard rock mines,based on gene expression programming(GEP)and decision tree-support vector machine(DT-SVM)hybrid algorithms.The performance of the developed models was evaluated based on four common statistical criteria(sensitivity,specificity,Matthews correlation coefficient,and accuracy),receiver operating characteristic(ROC)curve,and testing data sets.The results showed that the GEP and DT-SVM models performed exceptionally well in assessing pillar stability,showing a high level of accuracy.The DT-SVM model,in particular,outperformed the GEP model(accuracy of 0.914,sensitivity of 0.842,specificity of 0.929,Matthews correlation coefficient of 0.767,and area under the ROC of 0.897 for the test data set).Furthermore,upon comparing the developed models with the previous ones,it was revealed that both models can effectively determine the condition of pillar stability with low uncertainty and acceptable accuracy.This suggests that these models could serve as dependable tools for project managers,aiding in the evaluation of pillar stability during the design and operational phases of mining projects,despite the inherent challenges in this domain.
基金supported by National Natural Science Foundation of China(Grant No.50975033,Grant No.60875046)Program of Education Office of Liaoning Province,China(Grant No.LT2010074)
文摘It is a complicated problem for the bottom-to-top adaptive conceptual design of complicated products between structure and function. Reliable theories demand to be found in order to determine whether the structure accords with the requirement of design. For the requirement generally is dynamic variety as time passes, new requirements will come, and some initial requirements can no longer be used. The number of product requirements, the gene length expressing requirements, the structure of the product, and the correlation matrix are varied with individuation of customer requirements of the product. By researching on the calculation mechanisms of dynamic variety, the approaches of gene expression and variable length gene expression are proposed. According to the diversity of structure selection in conceptual design and mutual relations between structure and function as well as structure and structure, the correlation matrixes between structure and function as well as structure and structure are defined. By the approach of making the sum of the elements of correlation matrix maximum, the mathematical models of multi-object optimization for structure design are provided based on variable requirements. An improved genetic algorithm called segment genetic algorithm is proposed based on optimization preservation simple genetic algorithm. The models of multi-object optimization are calculated by the segment genetic algorithm and hybrid genetic algorithm. An example for the conceptual design of a washing machine is given to show that the proposed method is able to realize the optimization structure design fitting for variable requirements. In addition, the proposed approach can provide good Pareto optimization solutions, and the individuation customer requirements for structures of products are able to be resolved effectively.
文摘In this context,two different approaches of soil liquefaction evaluation using a soft computing technique based on the worldwide standard penetration test(SPT) databases have been studied.Gene expression programming(GEP) as a gray-box modeling approach is used to develop different deterministic models in order to evaluate the occurrence of soil liquefaction in terms of liquefaction field performance indicator(LI) and factor of safety(FS) in logistic regression and classification concepts.The comparative plots illustrate that the classification concept-based models show a better performance than those based on logistic regression.In the probabilistic approach,a calibrated mapping function is developed in the context of Bayes’ theorem in order to capture the failure probabilities(PL) in the absence of the knowledge of parameter uncertainty.Consistent results obtained from the proposed probabilistic models,compared to the most well-known models,indicate the robustness of the methodology used in this study.The probability models provide a simple,but also efficient decision-making tool in engineering design to quantitatively assess the liquefaction triggering thresholds.
文摘Prediction of mode I fracture toughness(KIC) of rock is of significant importance in rock engineering analyses. In this study, linear multiple regression(LMR) and gene expression programming(GEP)methods were used to provide a reliable relationship to determine mode I fracture toughness of rock. The presented model was developed based on 60 datasets taken from the previous literature. To predict fracture parameters, three mechanical parameters of rock mass including uniaxial compressive strength(UCS), Brazilian tensile strength(BTS), and elastic modulus(E) have been selected as the input parameters. A cluster of data was collected and divided into two random groups of training and testing datasets.Then, different statistical linear and artificial intelligence based nonlinear analyses were conducted on the training data to provide a reliable prediction model of KIC. These two predictive methods were then evaluated based on the testing data. To evaluate the efficiency of the proposed models for predicting the mode I fracture toughness of rock, various statistical indices including coefficient of determination(R2),root mean square error(RMSE), and mean absolute error(MAE) were utilized herein. In the case of testing datasets, the values of R2, RMSE, and MAE for the GEP model were 0.87, 0.188, and 0.156,respectively, while they were 0.74, 0.473, and 0.223, respectively, for the LMR model. The results indicated that the selected GEP model delivered superior performance with a higher R2value and lower errors.
基金The Spring Plan of Ministry of Education,China(No.Z2012017)
文摘In order to minimize the project duration of resourceconstrained project scheduling problem( RCPSP), a gene expression programming-based scheduling rule( GEP-SR) method is proposed to automatically discover and select the effective scheduling rules( SRs) which are constructed using the project status and attributes of the activities. SRs are represented by the chromosomes of GEP, and an improved parallel schedule generation scheme( IPSGS) is used to transform the SRs into explicit schedules. The framework of GEP-SR for RCPSP is designed,and the effectiveness of the GEP-SR approach is demonstrated by comparing with other methods on the same instances.
文摘The shear stress distribution in circular channels was modeled in this study using gene expression programming(GEP). 173 sets of reliable data were collected under four flow conditions for use in the training and testing stages. The effect of input variables on GEP modeling was studied and 15 different GEP models with individual, binary, ternary, and quaternary input combinations were investigated. The sensitivity analysis results demonstrate that dimensionless parameter y/P, where y is the transverse coordinate, and P is the wetted perimeter, is the most influential parameter with regard to the shear stress distribution in circular channels. GEP model 10, with the parameter y/P and Reynolds number(Re) as inputs, outperformed the other GEP models, with a coefficient of determination of 0.7814 for the testing data set. An equation was derived from the best GEP model and its results were compared with an artificial neural network(ANN) model and an equation based on the Shannon entropy proposed by other researchers. The GEP model, with an average RMSE of 0.0301, exhibits superior performance over the Shannon entropy-based equation, with an average RMSE of 0.1049, and the ANN model, with an average RMSE of 0.2815 for all flow depths.
文摘Submerged vanes are installed on rivers and channel beds to protect the outer bank bends from scouring.Also,local scouring occurs around the submerged vanes over time,and identifying the effective factors on the scouring phenomena around these submerged vanes is one of the important issues in river engineering.The most important aimof this study is investigation of scour pattern around submerged vanes located in 180°bend experimentally and numerically.Firstly,the effects of various parameters such as the Froude number(Fr),angle of submerged vanes to the flow(α),angle of submerged vane location in the bend(θ),distance between submerged vanes(d),height(H),and length(L)of the vanes on the dimensionless volume of the scour hole were experimentally studied.The submerged vanes were installed on a 180°bend whose central radius and channel width were 2.8 and 0.6 m,respectively.By reducing the Froude number,the scour hole volume decreased.For all Froude numbers,the biggest scour hole formed atθ=15°.In all models,by increasing the Froude number,the scour hole volume significantly increases.In addition,by increasing the submerged vanes’length and height,the scour hole dimensions also grow.Secondly,using gene expression programming(GEP),a relationship for determining the scour hole volume around the submerged vanes was provided.For this model,the determination coefficients(R2)for the training and test modes were computed as 0.91 and 0.9,respectively.In addition,this study performed partial derivative sensitivity analysis(PDSA).According to the results,the PDSA was calculated as positive for all input variables.
基金supported by the National Natural Science Foundation of China(Grant No.42177164)the Distinguished Youth Science Foundation of Hunan Province of China(Grant No.2022JJ10073)the Innovation-Driven Project of Central South University(Grant No.2020CX040).
文摘Rock strength is a crucial factor to consider when designing and constructing underground projects.This study utilizes a gene expression programming(GEP)algorithm-based model to predict the true triaxial strength of rocks,taking into account the influence of rock genesis on their mechanical behavior during the model building process.A true triaxial strength criterion based on the GEP model for igneous,metamorphic and magmatic rocks was obtained by training the model using collected data.Compared to the modified Weibols-Cook criterion,the modified Mohr-Coulomb criterion,and the modified Lade criterion,the strength criterion based on the GEP model exhibits superior prediction accuracy performance.The strength criterion based on the GEP model has better performance in R2,RMSE and MAPE for the data set used in this study.Furthermore,the strength criterion based on the GEP model shows greater stability in predicting the true triaxial strength of rocks across different types.Compared to the existing strength criterion based on the genetic programming(GP)model,the proposed criterion based on GEP model achieves more accurate predictions of the variation of true triaxial strength(s1)with intermediate principal stress(s2).Finally,based on the Sobol sensitivity analysis technique,the effects of the parameters of the three obtained strength criteria on the true triaxial strength of the rock are analysed.In general,the proposed strength criterion exhibits superior performance in terms of both accuracy and stability of prediction results.
基金funded by the National Natural Science Foundation of China(Nos.51875420,51875421,52275504).
文摘Themulti-skill resource-constrained project scheduling problem(MS-RCPSP)is a significantmanagement science problem that extends from the resource-constrained project scheduling problem(RCPSP)and is integrated with a real project and production environment.To solve MS-RCPSP,it is an efficient method to use dispatching rules combined with a parallel scheduling mechanism to generate a scheduling scheme.This paper proposes an improved gene expression programming(IGEP)approach to explore newly dispatching rules that can broadly solve MS-RCPSP.A new backward traversal decoding mechanism,and several neighborhood operators are applied in IGEP.The backward traversal decoding mechanism dramatically reduces the space complexity in the decoding process,and improves the algorithm’s performance.Several neighborhood operators improve the exploration of the potential search space.The experiment takes the intelligent multi-objective project scheduling environment(iMOPSE)benchmark dataset as the training set and testing set of IGEP.Ten newly dispatching rules are discovered and extracted by IGEP,and eight out of ten are superior to other typical dispatching rules.
文摘Accurate gas viscosity determination is an important issue in the oil and gas industries.Experimental approaches for gas viscosity measurement are timeconsuming,expensive and hardly possible at high pressures and high temperatures(HPHT).In this study,a number of correlations were developed to estimate gas viscosity by the use of group method of data handling(GMDH)type neural network and gene expression programming(GEP)techniques using a large data set containing more than 3000 experimental data points for methane,nitrogen,and hydrocarbon gas mixtures.It is worth mentioning that unlike many of viscosity correlations,the proposed ones in this study could compute gas viscosity at pressures ranging between 34 and 172 MPa and temperatures between 310 and 1300 K.Also,a comparison was performed between the results of these established models and the results of ten wellknown models reported in the literature.Average absolute relative errors of GMDH models were obtained 4.23%,0.64%,and 0.61%for hydrocarbon gas mixtures,methane,and nitrogen,respectively.In addition,graphical analyses indicate that the GMDH can predict gas viscosity with higher accuracy than GEP at HPHT conditions.Also,using leverage technique,valid,suspected and outlier data points were determined.Finally,trends of gas viscosity models at different conditions were evaluated.
基金support by National Natural Science Foundation of China(61202354,51507084)Nanjing University of Post and Telecommunications Science Foundation(NUPTSF)(NT214203)
基金supported in part by the National Natural Science Foundation of China (11&zd167, 51507084, 61572262)NSF of Jiangsu Province (BK20141427)+2 种基金NUPT (NY214097)Open research fund of Key Lab of Broadband Wireless Communication and Sensor Network Technology (NUPT), Ministry of Education (NYKL201507)Qinlan Project of Jiangsu Province and the General Project of National Natural Science Found of China under Grant 41471300
文摘This paper deals with the reflectance estimation model issue to improve the estimation accuracy. We propose a model containing two core procedures: dimensionality reduction and model mining. First, the dimensionality reduction algorithm of hyperspectral data based on dependence degree(DRNDDD) is proposed to reduce the redundant hyperspectral band. DRND-DD solves the selection of suitable hyperspectral band via rough set theory. Furthermore, to improve the computation speed and accuracy of the model, based on DRND-DD, this paper proposes reflectance estimation model mining of leaf nitrogen concentration(LNC) for hyperspectral data by using hybrid gene expression programming(REMLNC-HGEP). Experimental results on three datasets demonstrate that the DRND-DD algorithm can obtain good results with a very short running time compared with principal component analysis(PCA), singular value decomposition(SVD), a dimensionality reduction algorithm based on the positive region(AR-PR) and a dimensionality reduction algorithm based on a discernable matrix(ARDM), and REMLNC-HGEP has low average time-consumption, high model mining success ratio and estimation accuracy. It was concluded that the REMLNC-HGEP performs better than the regression methods.
基金国家自然科学基金(3077164430972263)Aid Program for Science and Technology Innovative Research Team in Higher Educational Instituions of Hunan Province
文摘At present, transcription analysis of gene expression commonly uses housekeeping genes as control for normalization. In this study, the expression levels of three housekeeping genes including GAPDH, β-actin, and 18S rRNA in six tissues and five developmental stages of the Mandarin fish Siniperca chuatsi were assayed with quantitative real-time PCR (qPCR). Differences in expression levels were analyzed using geNorm program. The results demonstrate that β-actin is the most stable gene at developmental stages and GAPDH is the most stable in different tissues. While 18S rRNA expression during development is differentially regulated, which indicates it is suitable as an internal control for gene expression normalization at the developmental level. Overall, the data suggest that the two most stable housekeeping genes are enough to accurately calibrate gene expression in S. chuatsi. The significance of this study provided convincing references and methodology for housekeeping gene selection and normalization in gene expression analysis with regular PCR or qPCR.
文摘Genetic Programming (GP) is an important approach to deal with complex problem analysis and modeling, and has been applied in a wide range of areas. The development of GP involves various aspects, including design of genetic operators, evolutionary controls and implementations of heuristic strategy, evaluations and other mechanisms. When designing genetic operators, it is necessary to consider the possible limitations of encoding methods of individuals. And when selecting evolutionary control strategies, it is also necessary to balance search efficiency and diversity based on representation characteristics as well as the problem itself. More importantly, all of these matters, among others, have to be implemented through tedious coding work. Therefore, GP development is both complex and time-consuming. To overcome some of these difficulties that hinder the enhancement of GP development efficiency, we explore the feasibility of mutual assistance among GP variants, and then propose a rapid GP prototyping development method based on πGrammatical Evolution (πGE). It is demonstrated through regression analysis experiments that not only is this method beneficial for the GP developers to get rid of some tedious implementations, but also enables them to concentrate on the essence of the referred problem, such as individual representation, decoding means and evaluation. Additionally, it provides new insights into the roles of individual delineations in phenotypes and semantic research of individuals.
基金supported by the National Natural Science Foundation of China(32100158 to H.C.)the National Key R&D Program of China(2022YFC2601000 and 2022YFD1100202 to S.D.)+2 种基金the Natural Science Foundation of Jiangsu Province(BK20200538 to H.C.)the China Agriculture Research System-potato(CARS-potato,CARS09-P20 to S.D.)the Postgraduate Research&Practice Innovation Program of Jiangsu Province(KYCX19_0535 to H.S.)。
文摘Dear Editor,Epigenetic programming plays a critical role in response to environmental fluctuations in eukaryotes(Jaenisch and Bird,2003).From initial contact to successful colonization,pathogenic microorganisms adeptly adapt to environmental changes by orchestrating a cascade of gene expression.However,how plant pathogens adapt to the host environment during early infection at the epigenetic level is largely elusive.
基金supported by the National Basic Research Program of China(2013CB127302)the National Natural Science Foundation of China(31272450 and 31572412)+2 种基金Competitive Grants from the Animal Reproduction Program(no.2014-67015-21770)Animal Growth & Nutrient Utilization Programs(no.2015-67015-23276)of the USDA National Institute of Food and AgricultureTexas A&M AgriL ife Research(H-8200)
文摘Maternal undernutrition or overnutrition during pregnancy alters organ structure, impairs prenatal and neonatal growth and development, and reduces feed efficiency for lean tissue gains in pigs. These adverse effects may be carried over to the next generation or beyond. This phenomenon of the transgenerational impacts is known as fetal programming, which is mediated by stable and heritable alterations of gene expression through covalent modifications of DNA and histones without changes in DNA sequences(namely, epigenetics). The mechanisms responsible for the epigenetic regulation of protein expression and functions include chromatin remodeling; DNA methylation(occurring at the 5′-position of cytosine residues within CpG dinucleotides); and histone modifications(acetylation, methylation, phosphorylation, and ubiquitination). Like maternal malnutrition, undernutrition during the neonatal period also reduces growth performance and feed efficiency(weight gain:feed intake; also known as weightgain efficiency) in postweaning pigs by 5–10%, thereby increasing the days necessary to reach the market bodyweight. Supplementing functional amino acids(e.g., arginine and glutamine) and vitamins(e.g., folate) play a key role in activating the mammalian target of rapamycin signaling and regulating the provision of methyl donors for DNA and protein methylation. Therefore, these nutrients are beneficial for the dietary treatment of metabolic disorders in offspring with intrauterine growth restriction or neonatal malnutrition. The mechanism-based strategies hold great promise for the improvement of the efficiency of pork production and the sustainability of the global swine industry.