Based on the stability and inequality of texture features between coal and rock,this study used the digital image analysis technique to propose a coal–rock interface detection method.By using gray level co-occurrence...Based on the stability and inequality of texture features between coal and rock,this study used the digital image analysis technique to propose a coal–rock interface detection method.By using gray level co-occurrence matrix,twenty-two texture features were extracted from the images of coal and rock.Data dimension of the feature space reduced to four by feature selection,which was according to a separability criterion based on inter-class mean difference and within-class scatter.The experimental results show that the optimized features were effective in improving the separability of the samples and reducing the time complexity of the algorithm.In the optimized low-dimensional feature space,the coal–rock classifer was set up using the fsher discriminant method.Using the 10-fold cross-validation technique,the performance of the classifer was evaluated,and an average recognition rate of 94.12%was obtained.The results of comparative experiments show that the identifcation performance of the proposed method was superior to the texture description method based on gray histogram and gradient histogram.展开更多
A total of 107 soil samples were taken from the city of Qingdao,Shandong Province,China.Soil water retention data at 2.5,6,10,33,100,300,and 1 500 kPa matric potentials were measured using a pressure membrane apparatu...A total of 107 soil samples were taken from the city of Qingdao,Shandong Province,China.Soil water retention data at 2.5,6,10,33,100,300,and 1 500 kPa matric potentials were measured using a pressure membrane apparatus.Multiple linear regression (MLR) was used to develop pedotransfer functions (PTFs) for single point estimation and van Genuchten parameter estimation based on readily measurable soil properties,i.e.,MLR-based point (MLRP) PTF and MLR-based parametric (MLRV) PTF.The double cross-validation method was used to evaluate the accuracy of PTF estimates and the stability of the PTFs developed in this study.The performance of MLRP and MLRV PTFs in estimating water contents at matric potentials of 10,33,and 1 500 kPa was compared with that of two existing PTFs,the Rawls PTF and the Vereecken PTF.In addition,geostatistical analyses were conducted to assess the capabilities of these PTFs in describing the spatial variability of soil water retention characteristics.Results showed that among all PTFs only the Vereecken PTF failed to accurately estimate water retention characteristics.Although the MLRP PTF can be used to predict retention characteristics through traditional statistical analyses,it failed to describe the spatial variability of soil water retention characteristics.Although the MLRV and Rawls PTFs failed to describe the spatial variability of water contents at a matric potential of 10 kPa,they can be used to quantify the spatial variability of water contents at matric potentials of 33 and 1 500 kPa.展开更多
The Efficient Global Optimization(EGO)algorithm has been widely used in the numerical design optimization of engineering systems.However,the need for an uncertainty estimator limits the selection of a surrogate model....The Efficient Global Optimization(EGO)algorithm has been widely used in the numerical design optimization of engineering systems.However,the need for an uncertainty estimator limits the selection of a surrogate model.In this paper,a Sequential Ensemble Optimization(SEO)algorithm based on the ensemble model is proposed.In the proposed algorithm,there is no limitation on the selection of an individual surrogate model.Specifically,the SEO is built based on the EGO by extending the EGO algorithm so that it can be used in combination with the ensemble model.Also,a new uncertainty estimator for any surrogate model named the General Uncertainty Estimator(GUE)is proposed.The performance of the proposed SEO algorithm is verified by the simulations using ten well-known mathematical functions with varying dimensions.The results show that the proposed SEO algorithm performs better than the traditional EGO algorithm in terms of both the final optimization results and the convergence rate.Further,the proposed algorithm is applied to the global optimization control for turbo-fan engine acceleration schedule design.展开更多
The reservoir volumetric approach represents a widely accepted, but flawed method of petroleum play resource calculation. In this paper, we propose a combination of techniques that can improve the applicability and qu...The reservoir volumetric approach represents a widely accepted, but flawed method of petroleum play resource calculation. In this paper, we propose a combination of techniques that can improve the applicability and quality of the resource estimation. These techniques include: 1) the use of the Multivariate Discovery Process model (MDP) to derive unbiased distribution parameters of reservoir volumetric variables and to reveal correlations among the variables; 2) the use of the Geo-anchored method to estimate simultaneously the number of oil and gas pools in the same play; and 3) the crossvalidation of assessment results from different methods. These techniques are illustrated by using an example of crude oil and natural gas resource assessment of the Sverdrup Basin, Canadian Archipelago. The example shows that when direct volumetric measurements of the untested prospects are not available, the MDP model can help derive unbiased estimates of the distribution parameters by using information from the discovered oil and gas accumulations. It also shows that an estimation of the number of oil and gas accumulations and associated size ranges from a discovery process model can provide an alternative and efficient approach when inadequate geological data hinder the estimation. Cross-examination of assessment results derived using different methods allows one to focus on and analyze the causes for the major differences, thus providing a more reliable assessment outcome.展开更多
Water vapor permeability of building materials is a crucial parameter for analysing and optimizing the hygrothermal performance of building envelopes and built environments.Its measurement is accurate but time-consumi...Water vapor permeability of building materials is a crucial parameter for analysing and optimizing the hygrothermal performance of building envelopes and built environments.Its measurement is accurate but time-consuming,while data mining methods have the potential to predict water vapor permeability efficiently.In this study,six data mining methods—support vector regression(SVR),decision tree regression(DT),random forest regression(RF),K-nearest neighbor(KNN),multi-layer perceptron(MLP),and adaptive boosting regression(AdaBoost)—were compared to predict the water vapor permeability of cement-based materials.A total of 143 datasets of material properties were collected to build prediction models,and five materials were experimentally determined for model validation.The results show that RF has excellent generalization,stability,and precision.AdaBoost has great generalization and precision,only slightly inferior to the former,and its stability is excellent.DT has good precision and acceptable generalization,but its stability is poor.SVR and KNN have superior stability,but their generalization and precision are inadequate.MLP lacks generalization,and its stability and precision are unacceptable.In short,RF has the best comprehensive performance,demonstrated by a limited prediction deviation of 26.3%from the experimental results,better than AdaBoost(38.0%)and DT(38.3%)and far better than other remaining methods.It is also found that data mining methods provide better predictions when cement-based materials’water vapor permeability is high.展开更多
In this paper,we mainly study how to estimate the error density in the ultrahigh dimensional sparse additive model,where the number of variables is larger than the sample size.First,a smoothing method based on B-splin...In this paper,we mainly study how to estimate the error density in the ultrahigh dimensional sparse additive model,where the number of variables is larger than the sample size.First,a smoothing method based on B-splines is applied to the estimation of regression functions.Second,an improved two-stage refitted crossvalidation(RCV)procedure by random splitting technique is used to obtain the residuals of the model,and then the residual-based kernel method is applied to estimate the error density function.Under suitable sparse conditions,the large sample properties of the estimator,including the weak and strong consistency,as well as normality and the law of the iterated logarithm,are obtained.Especially,the relationship between the sparsity and the convergence rate of the kernel density estimator is given.The methodology is illustrated by simulations and a real data example,which suggests that the proposed method performs well.展开更多
In recent years, the popular multifractal detrended fluctuation analysis (MF-DFA) is extended to two-dimensional (2D) version, which has been applied in some field of image processing. In this paper, based on the ...In recent years, the popular multifractal detrended fluctuation analysis (MF-DFA) is extended to two-dimensional (2D) version, which has been applied in some field of image processing. In this paper, based on the 2D MF-DFA, a novel multifractal estimation method for images, which we called the local multifractal detrended fluctuation analysis (LMF-DFA), is proposed to recognize and distinguish 20 types of tea breeds. A set of new multifractal descriptors, namely the local multifractal fluctuation exponents is defined to portray the local scaling properties of a surface. After collecting 10 tea leaves for each breed and photographing them to standard images, the LMF-DFA method is used to extract characteristic parameters for the images. Our analysis finds that there are significant differences among the different tea breeds' characteristic parameters by analysis of variance. Both the proposed LMF-DFA exponents and another classic parameter, namely the exponent based on capacity measure method have been used as features to distinguish the 20 tea breeds. The comparison results illustrate that the LMF-DFA estimation can differentiate the tea breeds more effectively and provide more satisfactory accuracy.展开更多
基金the National Natural Science Foundation of China(No.51134024/E0422)for the financial support
文摘Based on the stability and inequality of texture features between coal and rock,this study used the digital image analysis technique to propose a coal–rock interface detection method.By using gray level co-occurrence matrix,twenty-two texture features were extracted from the images of coal and rock.Data dimension of the feature space reduced to four by feature selection,which was according to a separability criterion based on inter-class mean difference and within-class scatter.The experimental results show that the optimized features were effective in improving the separability of the samples and reducing the time complexity of the algorithm.In the optimized low-dimensional feature space,the coal–rock classifer was set up using the fsher discriminant method.Using the 10-fold cross-validation technique,the performance of the classifer was evaluated,and an average recognition rate of 94.12%was obtained.The results of comparative experiments show that the identifcation performance of the proposed method was superior to the texture description method based on gray histogram and gradient histogram.
基金Supported by the National Natural Science Foundation of China (Nos. 40771095,40725010,and 41030746)the Water Conservancy Science & Technology Foundation of Qingdao City,China (No. 2006-003)
文摘A total of 107 soil samples were taken from the city of Qingdao,Shandong Province,China.Soil water retention data at 2.5,6,10,33,100,300,and 1 500 kPa matric potentials were measured using a pressure membrane apparatus.Multiple linear regression (MLR) was used to develop pedotransfer functions (PTFs) for single point estimation and van Genuchten parameter estimation based on readily measurable soil properties,i.e.,MLR-based point (MLRP) PTF and MLR-based parametric (MLRV) PTF.The double cross-validation method was used to evaluate the accuracy of PTF estimates and the stability of the PTFs developed in this study.The performance of MLRP and MLRV PTFs in estimating water contents at matric potentials of 10,33,and 1 500 kPa was compared with that of two existing PTFs,the Rawls PTF and the Vereecken PTF.In addition,geostatistical analyses were conducted to assess the capabilities of these PTFs in describing the spatial variability of soil water retention characteristics.Results showed that among all PTFs only the Vereecken PTF failed to accurately estimate water retention characteristics.Although the MLRP PTF can be used to predict retention characteristics through traditional statistical analyses,it failed to describe the spatial variability of soil water retention characteristics.Although the MLRV and Rawls PTFs failed to describe the spatial variability of water contents at a matric potential of 10 kPa,they can be used to quantify the spatial variability of water contents at matric potentials of 33 and 1 500 kPa.
基金the financial support of the National Natural Science Foundation of China(Nos.52076180,51876176 and 51906204)National Science and Technology Major Project,China(No.2017-I0001-0001)。
文摘The Efficient Global Optimization(EGO)algorithm has been widely used in the numerical design optimization of engineering systems.However,the need for an uncertainty estimator limits the selection of a surrogate model.In this paper,a Sequential Ensemble Optimization(SEO)algorithm based on the ensemble model is proposed.In the proposed algorithm,there is no limitation on the selection of an individual surrogate model.Specifically,the SEO is built based on the EGO by extending the EGO algorithm so that it can be used in combination with the ensemble model.Also,a new uncertainty estimator for any surrogate model named the General Uncertainty Estimator(GUE)is proposed.The performance of the proposed SEO algorithm is verified by the simulations using ten well-known mathematical functions with varying dimensions.The results show that the proposed SEO algorithm performs better than the traditional EGO algorithm in terms of both the final optimization results and the convergence rate.Further,the proposed algorithm is applied to the global optimization control for turbo-fan engine acceleration schedule design.
文摘The reservoir volumetric approach represents a widely accepted, but flawed method of petroleum play resource calculation. In this paper, we propose a combination of techniques that can improve the applicability and quality of the resource estimation. These techniques include: 1) the use of the Multivariate Discovery Process model (MDP) to derive unbiased distribution parameters of reservoir volumetric variables and to reveal correlations among the variables; 2) the use of the Geo-anchored method to estimate simultaneously the number of oil and gas pools in the same play; and 3) the crossvalidation of assessment results from different methods. These techniques are illustrated by using an example of crude oil and natural gas resource assessment of the Sverdrup Basin, Canadian Archipelago. The example shows that when direct volumetric measurements of the untested prospects are not available, the MDP model can help derive unbiased estimates of the distribution parameters by using information from the discovered oil and gas accumulations. It also shows that an estimation of the number of oil and gas accumulations and associated size ranges from a discovery process model can provide an alternative and efficient approach when inadequate geological data hinder the estimation. Cross-examination of assessment results derived using different methods allows one to focus on and analyze the causes for the major differences, thus providing a more reliable assessment outcome.
基金supported by the National Natural Science Foundation of China(No.52178065).
文摘Water vapor permeability of building materials is a crucial parameter for analysing and optimizing the hygrothermal performance of building envelopes and built environments.Its measurement is accurate but time-consuming,while data mining methods have the potential to predict water vapor permeability efficiently.In this study,six data mining methods—support vector regression(SVR),decision tree regression(DT),random forest regression(RF),K-nearest neighbor(KNN),multi-layer perceptron(MLP),and adaptive boosting regression(AdaBoost)—were compared to predict the water vapor permeability of cement-based materials.A total of 143 datasets of material properties were collected to build prediction models,and five materials were experimentally determined for model validation.The results show that RF has excellent generalization,stability,and precision.AdaBoost has great generalization and precision,only slightly inferior to the former,and its stability is excellent.DT has good precision and acceptable generalization,but its stability is poor.SVR and KNN have superior stability,but their generalization and precision are inadequate.MLP lacks generalization,and its stability and precision are unacceptable.In short,RF has the best comprehensive performance,demonstrated by a limited prediction deviation of 26.3%from the experimental results,better than AdaBoost(38.0%)and DT(38.3%)and far better than other remaining methods.It is also found that data mining methods provide better predictions when cement-based materials’water vapor permeability is high.
基金supported by National Natural Science Foundation of China (Grant Nos. 11971324 and 11471223)Interdisciplinary Construction of Bioinformatics and StatisticsAcademy for Multidisciplinary Studies, Capital Normal University
文摘In this paper,we mainly study how to estimate the error density in the ultrahigh dimensional sparse additive model,where the number of variables is larger than the sample size.First,a smoothing method based on B-splines is applied to the estimation of regression functions.Second,an improved two-stage refitted crossvalidation(RCV)procedure by random splitting technique is used to obtain the residuals of the model,and then the residual-based kernel method is applied to estimate the error density function.Under suitable sparse conditions,the large sample properties of the estimator,including the weak and strong consistency,as well as normality and the law of the iterated logarithm,are obtained.Especially,the relationship between the sparsity and the convergence rate of the kernel density estimator is given.The methodology is illustrated by simulations and a real data example,which suggests that the proposed method performs well.
文摘In recent years, the popular multifractal detrended fluctuation analysis (MF-DFA) is extended to two-dimensional (2D) version, which has been applied in some field of image processing. In this paper, based on the 2D MF-DFA, a novel multifractal estimation method for images, which we called the local multifractal detrended fluctuation analysis (LMF-DFA), is proposed to recognize and distinguish 20 types of tea breeds. A set of new multifractal descriptors, namely the local multifractal fluctuation exponents is defined to portray the local scaling properties of a surface. After collecting 10 tea leaves for each breed and photographing them to standard images, the LMF-DFA method is used to extract characteristic parameters for the images. Our analysis finds that there are significant differences among the different tea breeds' characteristic parameters by analysis of variance. Both the proposed LMF-DFA exponents and another classic parameter, namely the exponent based on capacity measure method have been used as features to distinguish the 20 tea breeds. The comparison results illustrate that the LMF-DFA estimation can differentiate the tea breeds more effectively and provide more satisfactory accuracy.