期刊文献+
共找到111篇文章
< 1 2 6 >
每页显示 20 50 100
基于Cross-Validation的小波自适应去噪方法 被引量:5
1
作者 黄文清 戴瑜兴 李加升 《湖南大学学报(自然科学版)》 EI CAS CSCD 北大核心 2008年第11期40-43,共4页
小波去噪算法中,阈值的选择非常关键.提出一种自适应阈值选择算法.该算法先通过Cross-Validation方法将噪声干扰信号分成两个子信号,一个用于阈值处理,一个用作参考信号;再采用最深梯度法来寻求一个最优去噪阈值.仿真和实验结果表明:在... 小波去噪算法中,阈值的选择非常关键.提出一种自适应阈值选择算法.该算法先通过Cross-Validation方法将噪声干扰信号分成两个子信号,一个用于阈值处理,一个用作参考信号;再采用最深梯度法来寻求一个最优去噪阈值.仿真和实验结果表明:在均方误差意义上,所提算法去噪效果优于Donoho等提出的VisuShrink和SureShrink两种去噪算法,且不需要带噪信号的任何'先验信息',适应于实际信号去噪处理. 展开更多
关键词 小波变换 cross-validation 自适应滤波 阈值
在线阅读 下载PDF
A Deep Learning Framework for Heart Disease Prediction with Explainable Artificial Intelligence
2
作者 Muhammad Adil Nadeem Javaid +2 位作者 Imran Ahmed Abrar Ahmed Nabil Alrajeh 《Computers, Materials & Continua》 2026年第1期1944-1963,共20页
Heart disease remains a leading cause of mortality worldwide,emphasizing the urgent need for reliable and interpretable predictive models to support early diagnosis and timely intervention.However,existing Deep Learni... Heart disease remains a leading cause of mortality worldwide,emphasizing the urgent need for reliable and interpretable predictive models to support early diagnosis and timely intervention.However,existing Deep Learning(DL)approaches often face several limitations,including inefficient feature extraction,class imbalance,suboptimal classification performance,and limited interpretability,which collectively hinder their deployment in clinical settings.To address these challenges,we propose a novel DL framework for heart disease prediction that integrates a comprehensive preprocessing pipeline with an advanced classification architecture.The preprocessing stage involves label encoding and feature scaling.To address the issue of class imbalance inherent in the personal key indicators of the heart disease dataset,the localized random affine shadowsampling technique is employed,which enhances minority class representation while minimizing overfitting.At the core of the framework lies the Deep Residual Network(DeepResNet),which employs hierarchical residual transformations to facilitate efficient feature extraction and capture complex,non-linear relationships in the data.Experimental results demonstrate that the proposed model significantly outperforms existing techniques,achieving improvements of 3.26%in accuracy,3.16%in area under the receiver operating characteristics,1.09%in recall,and 1.07%in F1-score.Furthermore,robustness is validated using 10-fold crossvalidation,confirming the model’s generalizability across diverse data distributions.Moreover,model interpretability is ensured through the integration of Shapley additive explanations and local interpretable model-agnostic explanations,offering valuable insights into the contribution of individual features to model predictions.Overall,the proposed DL framework presents a robust,interpretable,and clinically applicable solution for heart disease prediction. 展开更多
关键词 Heart disease deep learning localized random affine shadowsampling local interpretable modelagnostic explanations shapley additive explanations 10-fold cross-validation
在线阅读 下载PDF
Cross-Validation, Shrinkage and Variable Selection in Linear Regression Revisited 被引量:3
3
作者 Hans C. van Houwelingen Willi Sauerbrei 《Open Journal of Statistics》 2013年第2期79-102,共24页
In deriving a regression model analysts often have to use variable selection, despite of problems introduced by data- dependent model building. Resampling approaches are proposed to handle some of the critical issues.... In deriving a regression model analysts often have to use variable selection, despite of problems introduced by data- dependent model building. Resampling approaches are proposed to handle some of the critical issues. In order to assess and compare several strategies, we will conduct a simulation study with 15 predictors and a complex correlation structure in the linear regression model. Using sample sizes of 100 and 400 and estimates of the residual variance corresponding to R2 of 0.50 and 0.71, we consider 4 scenarios with varying amount of information. We also consider two examples with 24 and 13 predictors, respectively. We will discuss the value of cross-validation, shrinkage and backward elimination (BE) with varying significance level. We will assess whether 2-step approaches using global or parameterwise shrinkage (PWSF) can improve selected models and will compare results to models derived with the LASSO procedure. Beside of MSE we will use model sparsity and further criteria for model assessment. The amount of information in the data has an influence on the selected models and the comparison of the procedures. None of the approaches was best in all scenarios. The performance of backward elimination with a suitably chosen significance level was not worse compared to the LASSO and BE models selected were much sparser, an important advantage for interpretation and transportability. Compared to global shrinkage, PWSF had better performance. Provided that the amount of information is not too small, we conclude that BE followed by PWSF is a suitable approach when variable selection is a key part of data analysis. 展开更多
关键词 cross-validation LASSO SHRINKAGE SIMULATION STUDY VARIABLE SELECTION
暂未订购
Classification of aviation incident causes using LGBM with improved cross-validation 被引量:1
4
作者 NI Xiaomei WANG Huawei +1 位作者 CHEN Lingzi LIN Ruiguan 《Journal of Systems Engineering and Electronics》 SCIE CSCD 2024年第2期396-405,共10页
Aviation accidents are currently one of the leading causes of significant injuries and deaths worldwide. This entices researchers to investigate aircraft safety using data analysis approaches based on an advanced mach... Aviation accidents are currently one of the leading causes of significant injuries and deaths worldwide. This entices researchers to investigate aircraft safety using data analysis approaches based on an advanced machine learning algorithm.To assess aviation safety and identify the causes of incidents, a classification model with light gradient boosting machine (LGBM)based on the aviation safety reporting system (ASRS) has been developed. It is improved by k-fold cross-validation with hybrid sampling model (HSCV), which may boost classification performance and maintain data balance. The results show that employing the LGBM-HSCV model can significantly improve accuracy while alleviating data imbalance. Vertical comparison with other cross-validation (CV) methods and lateral comparison with different fold times comprise the comparative approach. Aside from the comparison, two further CV approaches based on the improved method in this study are discussed:one with a different sampling and folding order, and the other with more CV. According to the assessment indices with different methods, the LGBMHSCV model proposed here is effective at detecting incident causes. The improved model for imbalanced data categorization proposed may serve as a point of reference for similar data processing, and the model’s accurate identification of civil aviation incident causes can assist to improve civil aviation safety. 展开更多
关键词 aviation safety imbalance data light gradient boosting machine(LGBM) cross-validation(CV)
在线阅读 下载PDF
Using Multiple Risk Factors and Generalized Linear Mixed Models with 5-Fold Cross-Validation Strategy for Optimal Carotid Plaque Progression Prediction
5
作者 Qingyu Wang Dalin Tang +5 位作者 Liang Wang Gador Canton Zheyang Wu Thomas SHatsukami Kristen L Billiar Chun Yuan 《医用生物力学》 EI CAS CSCD 北大核心 2019年第A01期74-75,共2页
Background Cardiovascular diseases are closely linked to atherosclerotic plaque development and rupture.Plaque progression prediction is of fundamental significance to cardiovascular research and disease diagnosis,pre... Background Cardiovascular diseases are closely linked to atherosclerotic plaque development and rupture.Plaque progression prediction is of fundamental significance to cardiovascular research and disease diagnosis,prevention,and treatment.Generalized linear mixed models(GLMM)is an extension of linear model for categorical responses while considering the correlation among observations.Methods Magnetic resonance image(MRI)data of carotid atheroscleroticplaques were acquired from 20 patients with consent obtained and 3D thin-layer models were constructed to calculate plaque stress and strain for plaque progression prediction.Data for ten morphological and biomechanical risk factors included wall thickness(WT),lipid percent(LP),minimum cap thickness(MinCT),plaque area(PA),plaque burden(PB),lumen area(LA),maximum plaque wall stress(MPWS),maximum plaque wall strain(MPWSn),average plaque wall stress(APWS),and average plaque wall strain(APWSn)were extracted from all slices for analysis.Wall thickness increase(WTI),plaque burden increase(PBI)and plaque area increase(PAI) were chosen as three measures for plaque progression.Generalized linear mixed models(GLMM)with 5-fold cross-validation strategy were used to calculate prediction accuracy for each predictor and identify optimal predictor with the highest prediction accuracy defined as sum of sensitivity and specificity.All 201 MRI slices were randomly divided into 4 training subgroups and 1 verification subgroup.The training subgroups were used for model fitting,and the verification subgroup was used to estimate the model.All combinations(total1023)of 10 risk factors were feed to GLMM and the prediction accuracy of each predictor were selected from the point on the ROC(receiver operating characteristic)curve with the highest sum of specificity and sensitivity.Results LA was the best single predictor for PBI with the highest prediction accuracy(1.360 1),and the area under of the ROC curve(AUC)is0.654 0,followed by APWSn(1.336 3)with AUC=0.6342.The optimal predictor among all possible combinations for PBI was the combination of LA,PA,LP,WT,MPWS and MPWSn with prediction accuracy=1.414 6(AUC=0.715 8).LA was once again the best single predictor for PAI with the highest prediction accuracy(1.184 6)with AUC=0.606 4,followed by MPWSn(1. 183 2)with AUC=0.6084.The combination of PA,PB,WT,MPWS,MPWSn and APWSn gave the best prediction accuracy(1.302 5)for PAI,and the AUC value is 0.6657.PA was the best single predictor for WTI with highest prediction accuracy(1.288 7)with AUC=0.641 5,followed by WT(1.254 0),with AUC=0.6097.The combination of PA,PB,WT,LP,MinCT,MPWS and MPWS was the best predictor for WTI with prediction accuracy as 1.314 0,with AUC=0.6552.This indicated that PBI was a more predictable measure than WTI and PAI. The combinational predictors improved prediction accuracy by 9.95%,4.01%and 1.96%over the best single predictors for PAI,PBI and WTI(AUC values improved by9.78%,9.45%,and 2.14%),respectively.Conclusions The use of GLMM with 5-fold cross-validation strategy combining both morphological and biomechanical risk factors could potentially improve the accuracy of carotid plaque progression prediction.This study suggests that a linear combination of multiple predictors can provide potential improvement to existing plaque assessment schemes. 展开更多
关键词 Multiple Risk FACTORS GENERALIZED Linear 5-Fold cross-validation STRATEGY AUC
原文传递
ON THE CONSISTENCY OF CROSS-VALIDATIONIN NONLINEAR WAVELET REGRESSION ESTIMATION
6
作者 张双林 郑忠国 《Acta Mathematica Scientia》 SCIE CSCD 2000年第1期1-11,共11页
For the nonparametric regression model Y-ni = g(x(ni)) + epsilon(ni)i = 1, ..., n, with regularly spaced nonrandom design, the authors study the behavior of the nonlinear wavelet estimator of g(x). When the threshold ... For the nonparametric regression model Y-ni = g(x(ni)) + epsilon(ni)i = 1, ..., n, with regularly spaced nonrandom design, the authors study the behavior of the nonlinear wavelet estimator of g(x). When the threshold and truncation parameters are chosen by cross-validation on the everage squared error, strong consistency for the case of dyadic sample size and moment consistency for arbitrary sample size are established under some regular conditions. 展开更多
关键词 CONSISTENCY cross-validation nonparametric regression THRESHOLD TRUNCATION wavelet estimator
在线阅读 下载PDF
On-Street Parking Space Detection Using YOLO Models and Recommendations Based on KD-Tree Suitability Search
7
作者 Ibrahim Yahaya Garta William Eric Manongga +1 位作者 Su-Wen Huang Rung-Ching Chen 《Computers, Materials & Continua》 2025年第12期4457-4471,共15页
Unlike the detection of marked on-street parking spaces,detecting unmarked spaces poses significant challenges due to the absence of clear physical demarcation and uneven gaps caused by irregular parking.In urban citi... Unlike the detection of marked on-street parking spaces,detecting unmarked spaces poses significant challenges due to the absence of clear physical demarcation and uneven gaps caused by irregular parking.In urban cities with heavy traffic flow,these challenges can result in traffic disruptions,rear-end collisions,sideswipes,and congestion as drivers struggle to make decisions.We propose a real-time detection system for on-street parking spaces using YOLO models and recommend the most suitable space based on KD-tree search.Lightweight versions of YOLOv5,YOLOv7-tiny,and YOLOv8 with different architectures are trained.Among the models,YOLOv5s with SPPF at the backbone achieved an F1-score of 0.89,which was selected for validation using k-fold cross-validation on our dataset.The Low variance and standard deviation recorded across folds indicate the model’s generalizability,reliability,and stability.Inference with KD-tree using predictions from the YOLO models recorded FPS of 37.9 for YOLOv5,67.2 for YOLOv7-tiny,and 67.0 for YOLOv8.The models successfully detect both marked and unmarked empty parking spaces on test data with varying inference speeds and FPS.These models can be efficiently deployed for real-time applications due to their high FPS,inference speed,and lightweight nature.In comparison with other state-of-the-art models,our models outperform them,further demonstrating their effectiveness. 展开更多
关键词 On-street parking YOLO models K-dimensional tree K-fold cross-validation
在线阅读 下载PDF
Detection and analysis of Spartina alterniflora in Chongming East Beach using Sentinel-2 imagery and image texture features
8
作者 Xinyu Mei Zhongbiao Chen +1 位作者 Runxia Sun Yijun He 《Acta Oceanologica Sinica》 2025年第2期80-90,共11页
Spartina alterniflora is now listed among the world’s 100 most dangerous invasive species,severely affecting the ecological balance of coastal wetlands.Remote sensing technologies based on deep learning enable large-... Spartina alterniflora is now listed among the world’s 100 most dangerous invasive species,severely affecting the ecological balance of coastal wetlands.Remote sensing technologies based on deep learning enable large-scale monitoring of Spartina alterniflora,but they require large datasets and have poor interpretability.A new method is proposed to detect Spartina alterniflora from Sentinel-2 imagery.Firstly,to get the high canopy cover and dense community characteristics of Spartina alterniflora,multi-dimensional shallow features are extracted from the imagery.Secondly,to detect different objects from satellite imagery,index features are extracted,and the statistical features of the Gray-Level Co-occurrence Matrix(GLCM)are derived using principal component analysis.Then,ensemble learning methods,including random forest,extreme gradient boosting,and light gradient boosting machine models,are employed for image classification.Meanwhile,Recursive Feature Elimination with Cross-Validation(RFECV)is used to select the best feature subset.Finally,to enhance the interpretability of the models,the best features are utilized to classify multi-temporal images and SHapley Additive exPlanations(SHAP)is combined with these classifications to explain the model prediction process.The method is validated by using Sentinel-2 imageries and previous observations of Spartina alterniflora in Chongming Island,it is found that the model combining image texture features such as GLCM covariance can significantly improve the detection accuracy of Spartina alterniflora by about 8%compared with the model without image texture features.Through multiple model comparisons and feature selection via RFECV,the selected model and eight features demonstrated good classification accuracy when applied to data from different time periods,proving that feature reduction can effectively enhance model generalization.Additionally,visualizing model decisions using SHAP revealed that the image texture feature component_1_GLCMVariance is particularly important for identifying each land cover type. 展开更多
关键词 texture features Recursive Feature Elimination with cross-validation(RFECV) SHapley Additive exPlanations(SHAP) Sentinel-2 time-series imagery multi-model comparison
在线阅读 下载PDF
基于ANFIS和Elman网络的信用评价研究 被引量:8
9
作者 梁樑 吴德胜 +2 位作者 王志强 熊立 王国华 《管理工程学报》 CSSCI 2005年第1期69-73,共5页
BP神经网络用作信用等级分类可取得较好的效果,但在过分要求输出信用分值时效果不佳。针对该缺陷,本文采用自适应神经网络(ANFIS)和Elman网络研究公司信用评分。文中提出了一套甄选方法准则,用于建立适合我国企业的信用评分指标体系;然... BP神经网络用作信用等级分类可取得较好的效果,但在过分要求输出信用分值时效果不佳。针对该缺陷,本文采用自适应神经网络(ANFIS)和Elman网络研究公司信用评分。文中提出了一套甄选方法准则,用于建立适合我国企业的信用评分指标体系;然后依据该指标体系建立了基于Elman网络和ANFIS的信用评估模型;采用V foldCross validation技巧,利用样本公司实际指标数据对该模型的评分效果进行了实证研究。 展开更多
关键词 信用评分 自适应神经模糊推理 ELMAN网络 V-fold cross-validation技巧 主成分分析
在线阅读 下载PDF
直方图理论与最优直方图制作 被引量:27
10
作者 张建方 王秀祥 《应用概率统计》 CSCD 北大核心 2009年第2期201-214,共14页
直方图是一种最为常见的密度估计和数据分析工具.在直方图理论和制作过程中,组距的选择和边界点的确定尤为重要.然而,许多学者对这两个参数的选择仍然采用经验的方法,甚至现在大多数统计软件在确定直方图分组数时也是默认采用粗略的计... 直方图是一种最为常见的密度估计和数据分析工具.在直方图理论和制作过程中,组距的选择和边界点的确定尤为重要.然而,许多学者对这两个参数的选择仍然采用经验的方法,甚至现在大多数统计软件在确定直方图分组数时也是默认采用粗略的计算公式.本文主要介绍直方图理论和最优直方图制作的最新研究成果,强调面向样本的最优直方图制作方法. 展开更多
关键词 直方图 Sturges公式 Scott公式 cross-validation Histogram-Kernel ERROR 误差平方和
在线阅读 下载PDF
不同模型在信用评价中的比较研究 被引量:8
11
作者 吴德胜 梁樑 杨力 《预测》 CSSCI 2004年第2期73-76,69,共5页
比较了不同模型应用于企业信用评价问题的优劣,针对信用评分问题特点,采用Elman回归神经网络和BP网络建模。在建立了适合于我国企业的信用评分指标体系之后,运用以上两种方法进行实证研究并比较两种网络的诊断行为;为克服小样本建模的缺... 比较了不同模型应用于企业信用评价问题的优劣,针对信用评分问题特点,采用Elman回归神经网络和BP网络建模。在建立了适合于我国企业的信用评分指标体系之后,运用以上两种方法进行实证研究并比较两种网络的诊断行为;为克服小样本建模的缺点,引进V foldCross validation计算技巧。 展开更多
关键词 ELMAN神经网络 BP神经网络 V-fold cross-validation技巧 信用评分
在线阅读 下载PDF
基于Walsh平均的非参数回归模型的稳健估计 被引量:4
12
作者 彭佳 李长青 王晓燕 《数理统计与管理》 CSSCI 北大核心 2015年第4期636-646,共11页
由于非参数回归模型复杂灵活,被广泛应用。在众多估计方法中,最小二乘法最为常用,一般情况下具有良好的性质,但在处理厚尾分布及异常点时表现的不够稳健。本文针对此,提出了基于Walsh平均的稳健样条估计。我们理论地推导了估计结果的相... 由于非参数回归模型复杂灵活,被广泛应用。在众多估计方法中,最小二乘法最为常用,一般情况下具有良好的性质,但在处理厚尾分布及异常点时表现的不够稳健。本文针对此,提出了基于Walsh平均的稳健样条估计。我们理论地推导了估计结果的相合性和渐近正态性;并与多项式样条回归做比较。计算得Walsh平均的样条估计相对于多项式样条回归的渐近相对效率与Wilcoxon符号秩检验相对于t-检验的渐近相对效率是一样的。在正态情形下我们的方法与多项式样条回归差不多,在非正态情形下,我们的方法表现更为稳健,效率明显优于多项式样条回归。 展开更多
关键词 非参数回归 Walsh平均 B-样条 Wilcoxon符号秩检验 cross-validation
原文传递
基于支持向量机的机械故障特征选择方法研究 被引量:4
13
作者 王新峰 邱静 刘冠军 《机械科学与技术》 CSCD 北大核心 2005年第9期1122-1125,共4页
在机械故障诊断中,对机器状态信号进行处理可得到故障特征集。但是此特征集中通常含有冗余特征而影响诊断效果。特征选择可以去除原始特征中的冗余特征,提高诊断精度和诊断效率。本文提出采用支持向量机(SVM)作为决策分类器,研究了使用... 在机械故障诊断中,对机器状态信号进行处理可得到故障特征集。但是此特征集中通常含有冗余特征而影响诊断效果。特征选择可以去除原始特征中的冗余特征,提高诊断精度和诊断效率。本文提出采用支持向量机(SVM)作为决策分类器,研究了使用SVM的错误上界如半径-间距上界代替学习错误率作为特征性能评价,并且使用遗传算法对特征集进行寻优的特征选择方法。此方法由于只需要训练一次SVM,相比常用的分组轮换方法有较高的计算效率。数值仿真和减速器的轴承故障特征选择试验中,采用此方法对生成特征集进行选择,并与常用的分组轮换法进行了对比。结果显示此方法有较好的选择性能和选择效率。 展开更多
关键词 特征选择 分组轮换法(cross-validation) 支持向量机(SVM) 半径-间距上界 遗传算法
在线阅读 下载PDF
洗衣产品的物理化学属性与洗涤效果的模型研究 被引量:1
14
作者 王凡 李雪 《首都师范大学学报(自然科学版)》 2011年第6期9-14,共6页
优化洗衣产品的配方的主要目的是要在降低成本、减少对环境的污染等条件下,生产出洗涤效果更好的产品.本文利用洗衣产品溶于水后溶液的一些物理化学属性及洗涤功效的数据,研究溶液属性和产品功效之间的关系,建立它们之间的模型,找出对... 优化洗衣产品的配方的主要目的是要在降低成本、减少对环境的污染等条件下,生产出洗涤效果更好的产品.本文利用洗衣产品溶于水后溶液的一些物理化学属性及洗涤功效的数据,研究溶液属性和产品功效之间的关系,建立它们之间的模型,找出对洗涤功效起到显著作用的因素,使得可以根据建立的模型找到最优的洗衣产品配方.面对洗衣产品中多种可能起作用的物理化学属性以及种类繁多的污渍,需要运用处理高维数据的方法进行研究.我们用最小角回归(Lars),逐步回归(Stepwise)和交叉验证(Cross-Validation)方法对数据进行分析,建立的各个污渍与溶液属性间的数学模型,并给出几点对优化洗衣产品配方的建议. 展开更多
关键词 洗衣产品物理化学属性 洗涤效果 最小角回归(Lars) 逐步回归(Stepwise) 交叉验证(cross-validation)
在线阅读 下载PDF
非参数回归的CV NN中位数估计的相合性
15
作者 杨瑛 《西北师范大学学报(自然科学版)》 CAS 1992年第3期15-23,共9页
考虑非参数回归模型Y_i=g(x_i)+e_i,其中g(x)是待估的连续函数,x_i是非随机的,e_i是i.i.d.随机误差。笔者讨论最近邻中位数估计g_(n,h)(x_i)=m(Y_i(1),…,Y_i(h))=Y_i(1),…,Y_i(h)的中位数,其中h利用平均平方误差意义下的cross-validat... 考虑非参数回归模型Y_i=g(x_i)+e_i,其中g(x)是待估的连续函数,x_i是非随机的,e_i是i.i.d.随机误差。笔者讨论最近邻中位数估计g_(n,h)(x_i)=m(Y_i(1),…,Y_i(h))=Y_i(1),…,Y_i(h)的中位数,其中h利用平均平方误差意义下的cross-validation方法选择。在一定条件下,建立了cross-validation最近邻中位数估计的相合性。 展开更多
关键词 cross-validation相合性 最近邻中位数估计 非参数回归
在线阅读 下载PDF
An Insect Imaging System to Automate Rice Light-Trap Pest Identification 被引量:24
16
作者 YAO Qing LV Jun +4 位作者 LIU Qing-jie DIAO Guang-qiang YANG Bao-jun CHEN Hong-ming TANGJian 《Journal of Integrative Agriculture》 SCIE CAS CSCD 2012年第6期978-985,共8页
Identification and counting of rice light-trap pests are important to monitor rice pest population dynamics and make pest forecast. Identification and counting of rice light-trap pests manually is time-consuming, and ... Identification and counting of rice light-trap pests are important to monitor rice pest population dynamics and make pest forecast. Identification and counting of rice light-trap pests manually is time-consuming, and leads to fatigue and an increase in the error rate. A rice light-trap insect imaging system is developed to automate rice pest identification. This system can capture the top and bottom images of each insect by two cameras to obtain more image features. A method is proposed for removing the background by color difference of two images with pests and non-pests. 156 features including color, shape and texture features of each pest are extracted into an support vector machine (SVM) classifier with radial basis kernel function. The seven-fold cross-validation is used to improve the accurate rate of pest identification. Four species of Lepidoptera rice pests are tested and achieved 97.5% average accurate rate. 展开更多
关键词 automatic identification imaging system rice light-trap pests SVM cross-validate
在线阅读 下载PDF
Coal–rock interface detection on the basis of image texture features 被引量:22
17
作者 Sun Jiping Su Bo 《International Journal of Mining Science and Technology》 SCIE EI 2013年第5期681-687,共7页
Based on the stability and inequality of texture features between coal and rock,this study used the digital image analysis technique to propose a coal–rock interface detection method.By using gray level co-occurrence... Based on the stability and inequality of texture features between coal and rock,this study used the digital image analysis technique to propose a coal–rock interface detection method.By using gray level co-occurrence matrix,twenty-two texture features were extracted from the images of coal and rock.Data dimension of the feature space reduced to four by feature selection,which was according to a separability criterion based on inter-class mean difference and within-class scatter.The experimental results show that the optimized features were effective in improving the separability of the samples and reducing the time complexity of the algorithm.In the optimized low-dimensional feature space,the coal–rock classifer was set up using the fsher discriminant method.Using the 10-fold cross-validation technique,the performance of the classifer was evaluated,and an average recognition rate of 94.12%was obtained.The results of comparative experiments show that the identifcation performance of the proposed method was superior to the texture description method based on gray histogram and gradient histogram. 展开更多
关键词 Coal–rock interface detection TEXTURE Gray level co-occurrence matrix Feature selection Fisher discriminant method cross-validation
在线阅读 下载PDF
Comparison of machine learning methods for ground settlement prediction with different tunneling datasets 被引量:21
18
作者 Libin Tang SeonHong Na 《Journal of Rock Mechanics and Geotechnical Engineering》 SCIE CSCD 2021年第6期1274-1289,共16页
This study integrates different machine learning(ML) methods and 5-fold cross-validation(CV) method to estimate the ground maximal surface settlement(MSS) induced by tunneling.We further investigate the applicability ... This study integrates different machine learning(ML) methods and 5-fold cross-validation(CV) method to estimate the ground maximal surface settlement(MSS) induced by tunneling.We further investigate the applicability of artificial intelligent(AI) based prediction through a comparative study of two tunnelling datasets with different sizes and features.Four different ML approaches,including support vector machine(SVM),random forest(RF),back-propagation neural network(BPNN),and deep neural network(DNN),are utilized.Two techniques,i.e.particle swarm optimization(PSO) and grid search(GS)methods,are adopted for hyperparameter optimization.To assess the reliability and efficiency of the predictions,three performance evaluation indicators,including the mean absolute error(MAE),root mean square error(RMSE),and Pearson correlation coefficient(R),are calculated.Our results indicate that proposed models can accurately and efficiently predict the settlement,while the RF model outperforms the other three methods on both datasets.The difference in model performance on two datasets(Datasets A and B) reveals the importance of data quality and quantity.Sensitivity analysis indicates that Dataset A is more significantly affected by geological conditions,while geometric characteristics play a more dominant role on Dataset B. 展开更多
关键词 Surface settlement Tunnel construction Machine learning(ML) Hyperparameter optimization cross-validation(CV)
在线阅读 下载PDF
An Optimized Random Forest Model and Its Generalization Ability in Landslide Susceptibility Mapping:Application in Two Areas of Three Gorges Reservoir,China 被引量:15
19
作者 Deliang Sun Jiahui Xu +1 位作者 Haijia Wen Yue Wang 《Journal of Earth Science》 SCIE CAS CSCD 2020年第6期1068-1086,共19页
Numerous researches have been published on the application of landslide susceptibility assessment models;however,they were only applied in the same areas as the models were originated,the effect of applying the models... Numerous researches have been published on the application of landslide susceptibility assessment models;however,they were only applied in the same areas as the models were originated,the effect of applying the models to other areas than the origin of the models has not been explored.This study is purposed to develop an optimized random forest(RF)model with best ratios of positive-to-negative cells and 10-fold cross-validation for landslide susceptibility mapping(LSM),and then explore its generalization ability not only in the area where the model is originated but also in area other than the origin of the model.Two typical counties(Fengjie County and Wushan County)in the Three Gorges Reservoir area,China,which have the same terrain and geological conditions,were selected as an example.To begin with,landslide inventory was prepared based on field investigations,satellite images,and historical records,and 1522 landslides were then identified in Fengjie County.22 landslide-conditioning factors under the influence of topography,geology,environmental conditions,and human activities were prepared.Then,combined with 10-fold cross-validation,three typical ratios of positive-to-negative cells,i.e.,1:1,1:5,and 1:10,were adopted for comparative analyses.An optimized RF model(Fengjie-based model)with the best ratios of positive-to-negative cells and 10-fold cross-validation was constructed.Finally,the Fengjie-based model was applied to Fengjie County and Wushan County,and the confusion matrix and area under the receiver operating characteristic(ROC)curve value(AUC)were used to estimate the accuracy.The Fengjie-based model delivered high stability and predictive capability in Fengjie County,indicating a great generalization ability of the model to the area where the model is originated.The LSM in Wushan County generated by the Fengjie-based model had a reasonable reference value,indicating the Fengjiebased model had a great generalization ability in area other than the origin of the model.The Fengjiebased model in this study could be applied in other similar areas/countries with the same terrain and geological conditions,and a LSM may be generated without collecting landslide information for modeling,so as to reduce workload and improve efficiency in practice. 展开更多
关键词 landslide susceptibility mapping generalization ability random forest Three Gorges Reservoir area 10-fold cross-validation
原文传递
Prediction of geological characteristics from shield operational parameters by integrating grid search and K-fold cross validation into stacking classification algorithm 被引量:13
20
作者 Tao Yan Shui-Long Shen +1 位作者 Annan Zhou Xiangsheng Chen 《Journal of Rock Mechanics and Geotechnical Engineering》 SCIE CSCD 2022年第4期1292-1303,共12页
This study presents a framework for predicting geological characteristics based on integrating a stacking classification algorithm(SCA) with a grid search(GS) and K-fold cross validation(K-CV). The SCA includes two le... This study presents a framework for predicting geological characteristics based on integrating a stacking classification algorithm(SCA) with a grid search(GS) and K-fold cross validation(K-CV). The SCA includes two learner layers: a primary learner’s layer and meta-classifier layer. The accuracy of the SCA can be improved by using the GS and K-CV. The GS was developed to match the hyper-parameters and optimise complicated problems. The K-CV is commonly applied to changing the validation set in a training set. In general, a GS is usually combined with K-CV to produce a corresponding evaluation index and select the best hyper-parameters. The torque penetration index(TPI) and field penetration index(FPI) are proposed based on shield parameters to express the geological characteristics. The elbow method(EM) and silhouette coefficient(Si) are employed to determine the types of geological characteristics(K) in a Kmeans++ algorithm. A case study on mixed ground in Guangzhou is adopted to validate the applicability of the developed model. The results show that with the developed framework, the four selected parameters, i.e. thrust, advance rate, cutterhead rotation speed and cutterhead torque, can be used to effectively predict the corresponding geological characteristics. 展开更多
关键词 Geological characteristics Stacking classification algorithm(SCA) K-fold cross-validation(K-CV) K-means++
在线阅读 下载PDF
上一页 1 2 6 下一页 到第
使用帮助 返回顶部