期刊文献+
共找到1,883篇文章
< 1 2 95 >
每页显示 20 50 100
Random forest algorithm reveals novel sites in HA protein that shift receptor binding preference of the H9N2 avian influenza virus
1
作者 Yuncong Yin Wen Li +7 位作者 Rujian Chen Xiao Wang Yiting Chen Xinyuan Cui Xingbang Lu David M.Irwin Xuejuan Shen Yongyi Shen 《Virologica Sinica》 2025年第1期109-117,共9页
A switch from avian-typeα-2,3 to human-typeα-2,6 receptors is an essential element for the initiation of a pandemic from an avian influenza virus.Some H9N2 viruses exhibit a preference for binding to human-typeα-2,... A switch from avian-typeα-2,3 to human-typeα-2,6 receptors is an essential element for the initiation of a pandemic from an avian influenza virus.Some H9N2 viruses exhibit a preference for binding to human-typeα-2,6 receptors.This identifies their potential threat to public health.However,our understanding of the molecular basis for the switch of receptor preference is still limited.In this study,we employed the random forest algorithm to identify the potentially key amino acid sites within hemagglutinin(HA),which are associated with the receptor binding ability of H9N2 avian influenza virus(AIV).Subsequently,these sites were further verified by receptor binding assays.A total of 12 substitutions in the HA protein(N158D,N158S,A160 N,A160D,A160T,T163I,T163V,V190T,V190A,D193 N,D193G,and N231D)were predicted to prefer binding toα-2,6 receptors.Except for the V190T substitution,the other substitutions were demonstrated to display an affinity for preferential binding toα-2,6 receptors by receptor binding assays.Especially,the A160T substitution caused a significant upregulation of immune-response genes and an increased mortality rate in mice.Our findings provide novel insights into understanding the genetic basis of receptor preference of the H9N2 AIV. 展开更多
关键词 H9N2 Hemagglutinin(HA) Receptor binding preference random forest algorithm Host shift Interspecies transmission
原文传递
A rapid classification method of aluminum alloy based on laser-induced breakdown spectroscopy and random forest algorithm 被引量:8
2
作者 Liuyang ZHAN Xiaohong MA +4 位作者 Weiqi FANG Rui WANG Zesheng LIU Yang SONG Huafeng ZHAO 《Plasma Science and Technology》 SCIE EI CAS CSCD 2019年第3期148-154,共7页
As an important non-ferrous metal structural material most used in industry and production,aluminum(Al) alloy shows its great value in the national economy and industrial manufacturing.How to classify Al alloy rapidly... As an important non-ferrous metal structural material most used in industry and production,aluminum(Al) alloy shows its great value in the national economy and industrial manufacturing.How to classify Al alloy rapidly and accurately is a significant, popular and meaningful task.Classification methods based on laser-induced breakdown spectroscopy(LIBS) have been reported in recent years. Although LIBS is an advanced detection technology, it is necessary to combine it with some algorithm to reach the goal of rapid and accurate classification. As an important machine learning method, the random forest(RF) algorithm plays a great role in pattern recognition and material classification. This paper introduces a rapid classification method of Al alloy based on LIBS and the RF algorithm. The results show that the best accuracy that can be reached using this method to classify Al alloy samples is 98.59%, the average of which is 98.45%. It also reveals through the relationship laws that the accuracy varies with the number of trees in the RF and the size of the training sample set in the RF. According to the laws, researchers can find out the optimized parameters in the RF algorithm in order to achieve,as expected, a good result. These results prove that LIBS with the RF algorithm can exactly classify Al alloy effectively, precisely and rapidly with high accuracy, which obviously has significant practical value. 展开更多
关键词 LASER-INDUCED BREAKDOWN spectroscopy(LIBS) random forest(rf) aluminum(Al)alloy classification
在线阅读 下载PDF
Prostate cancer prediction forest algorithm that takes using the random into account transrectal ultrasound findings, age, and serum levels of prostate-specific antigen 被引量:5
3
作者 Li-Hong Xiao Pei-Ran Chen +4 位作者 Zhong-Ping Gou Yong-Zhong Li Mei Li Liang-Cheng Xiang Ping Feng 《Asian Journal of Andrology》 SCIE CAS CSCD 2017年第5期586-590,共5页
The aim of this study is to evaluate the ability of the random forest algorithm that combines data on transrectal ultrasound findings, age, and serum levels of prostate-specific antigen to predict prostate carcinoma. ... The aim of this study is to evaluate the ability of the random forest algorithm that combines data on transrectal ultrasound findings, age, and serum levels of prostate-specific antigen to predict prostate carcinoma. Clinico-demographic data were analyzed for 941 patients with prostate diseases treated at our hospital, including age, serum prostate-specific antigen levels, transrectal ultrasound findings, and pathology diagnosis based on ultrasound-guided needle biopsy of the prostate. These data were compared between patients with and without prostate cancer using the Chi-square test, and then entered into the random forest model to predict diagnosis. Patients with and without prostate cancer differed significantly in age and serum prostate-specific antigen levels (P 〈 0.001), as well as in all transrectal ultrasound characteristics (P 〈 0.05) except uneven echo (P = 0.609). The random forest model based on age, prostate-specific antigen and ultrasound predicted prostate cancer with an accuracy of 83.10%, sensitivity of 65.64%, and specificity of 93.83%. Positive predictive value was 86.72%, and negative predictive value was 81.64%. By integrating age, prostate-specific antigen levels and transrectal ultrasound findings, the random forest algorithm shows better diagnostic performance for prostate cancer than either diagnostic indicator on its own. This algorithm may help improve diagnosis of the disease by identifying patients at high risk for biopsy. 展开更多
关键词 diagnosis prostate cancer prostate-specific antigen random forest algorithm transrectal ultrasound characteristics
原文传递
A real-time intelligent lithology identification method based on a dynamic felling strategy weighted random forest algorithm 被引量:7
4
作者 Tie Yan Rui Xu +2 位作者 Shi-Hui Sun Zhao-Kai Hou Jin-Yu Feng 《Petroleum Science》 SCIE EI CAS CSCD 2024年第2期1135-1148,共14页
Real-time intelligent lithology identification while drilling is vital to realizing downhole closed-loop drilling. The complex and changeable geological environment in the drilling makes lithology identification face ... Real-time intelligent lithology identification while drilling is vital to realizing downhole closed-loop drilling. The complex and changeable geological environment in the drilling makes lithology identification face many challenges. This paper studies the problems of difficult feature information extraction,low precision of thin-layer identification and limited applicability of the model in intelligent lithologic identification. The author tries to improve the comprehensive performance of the lithology identification model from three aspects: data feature extraction, class balance, and model design. A new real-time intelligent lithology identification model of dynamic felling strategy weighted random forest algorithm(DFW-RF) is proposed. According to the feature selection results, gamma ray and 2 MHz phase resistivity are the logging while drilling(LWD) parameters that significantly influence lithology identification. The comprehensive performance of the DFW-RF lithology identification model has been verified in the application of 3 wells in different areas. By comparing the prediction results of five typical lithology identification algorithms, the DFW-RF model has a higher lithology identification accuracy rate and F1 score. This model improves the identification accuracy of thin-layer lithology and is effective and feasible in different geological environments. The DFW-RF model plays a truly efficient role in the realtime intelligent identification of lithologic information in closed-loop drilling and has greater applicability, which is worthy of being widely used in logging interpretation. 展开更多
关键词 Intelligent drilling Closed-loop drilling Lithology identification random forest algorithm Feature extraction
原文传递
Prediction of Permeability Using Random Forest and Genetic Algorithm Model 被引量:7
5
作者 JunhuiWang Wanzi Yan +3 位作者 Zhijun Wan Yi Wang Jiakun Lv Aiping Zhou 《Computer Modeling in Engineering & Sciences》 SCIE EI 2020年第12期1135-1157,共23页
Precise recovery of CoalbedMethane(CBM)based on transparent reconstruction of geological conditions is a branch of intelligent mining.The process of permeability reconstruction,ranging from data perception to real-tim... Precise recovery of CoalbedMethane(CBM)based on transparent reconstruction of geological conditions is a branch of intelligent mining.The process of permeability reconstruction,ranging from data perception to real-time data visualization,is applicable to disaster risk warning and intelligent decision-making on gas drainage.In this study,a machine learning method integrating the Random Forest(RF)and the Genetic Algorithm(GA)was established for permeability prediction in the Xishan Coalfield based on Uniaxial Compressive Strength(UCS),effective stress,temperature and gas pressure.A total of 50 sets of data collected by a self-developed apparatus were used to generate datasets for training and validating models.Statistical measures including the coefficient of determination(R2)and Root Mean Square Error(RMSE)were selected to validate and compare the predictive performances of the single RF model and the hybrid RF–GA model.Furthermore,sensitivity studies were conducted to evaluate the importance of input parameters.The results show that,the proposed RF–GA model is robust in predicting the permeability;UCS is directly correlated to permeability,while all other inputs are inversely related to permeability;the effective stress exerts the greatest impact on permeability based on importance score,followed by the temperature(or gas pressure)and UCS.The partial dependence plots,indicative of marginal utility of each feature in permeability prediction,are in line with experimental results.Thus,the proposed hybrid model(RF–GA)is capable of predicting permeability and thus beneficial to precise CBMrecovery. 展开更多
关键词 PERMEABILITY machine learning random forest genetic algorithm coalbed methane recovery
在线阅读 下载PDF
Basic Tenets of Classification Algorithms K-Nearest-Neighbor, Support Vector Machine, Random Forest and Neural Network: A Review 被引量:14
6
作者 Ernest Yeboah Boateng Joseph Otoo Daniel A. Abaye 《Journal of Data Analysis and Information Processing》 2020年第4期341-357,共17页
In this paper, sixty-eight research articles published between 2000 and 2017 as well as textbooks which employed four classification algorithms: K-Nearest-Neighbor (KNN), Support Vector Machines (SVM), Random Forest (... In this paper, sixty-eight research articles published between 2000 and 2017 as well as textbooks which employed four classification algorithms: K-Nearest-Neighbor (KNN), Support Vector Machines (SVM), Random Forest (RF) and Neural Network (NN) as the main statistical tools were reviewed. The aim was to examine and compare these nonparametric classification methods on the following attributes: robustness to training data, sensitivity to changes, data fitting, stability, ability to handle large data sizes, sensitivity to noise, time invested in parameter tuning, and accuracy. The performances, strengths and shortcomings of each of the algorithms were examined, and finally, a conclusion was arrived at on which one has higher performance. It was evident from the literature reviewed that RF is too sensitive to small changes in the training dataset and is occasionally unstable and tends to overfit in the model. KNN is easy to implement and understand but has a major drawback of becoming significantly slow as the size of the data in use grows, while the ideal value of K for the KNN classifier is difficult to set. SVM and RF are insensitive to noise or overtraining, which shows their ability in dealing with unbalanced data. Larger input datasets will lengthen classification times for NN and KNN more than for SVM and RF. Among these nonparametric classification methods, NN has the potential to become a more widely used classification algorithm, but because of their time-consuming parameter tuning procedure, high level of complexity in computational processing, the numerous types of NN architectures to choose from and the high number of algorithms used for training, most researchers recommend SVM and RF as easier and wieldy used methods which repeatedly achieve results with high accuracies and are often faster to implement. 展开更多
关键词 Classification algorithms NON-PARAMETRIC K-Nearest-Neighbor Neural Networks random forest Support Vector Machines
在线阅读 下载PDF
Anomaly Classification Using Genetic Algorithm-Based Random Forest Modelfor Network Attack Detection 被引量:7
7
作者 Adel Assiri 《Computers, Materials & Continua》 SCIE EI 2021年第1期767-778,共12页
Anomaly classification based on network traffic features is an important task to monitor and detect network intrusion attacks.Network-based intrusion detection systems(NIDSs)using machine learning(ML)methods are effec... Anomaly classification based on network traffic features is an important task to monitor and detect network intrusion attacks.Network-based intrusion detection systems(NIDSs)using machine learning(ML)methods are effective tools for protecting network infrastructures and services from unpredictable and unseen attacks.Among several ML methods,random forest(RF)is a robust method that can be used in ML-based network intrusion detection solutions.However,the minimum number of instances for each split and the number of trees in the forest are two key parameters of RF that can affect classification accuracy.Therefore,optimal parameter selection is a real problem in RF-based anomaly classification of intrusion detection systems.In this paper,we propose to use the genetic algorithm(GA)for selecting the appropriate values of these two parameters,optimizing the RF classifier and improving the classification accuracy of normal and abnormal network traffics.To validate the proposed GA-based RF model,a number of experiments is conducted on two public datasets and evaluated using a set of performance evaluation measures.In these experiments,the accuracy result is compared with the accuracies of baseline ML classifiers in the recent works.Experimental results reveal that the proposed model can avert the uncertainty in selection the values of RF’s parameters,improving the accuracy of anomaly classification in NIDSs without incurring excessive time. 展开更多
关键词 Network-based intrusion detection system(NIDS) random forest classifier genetic algorithm KDD99 UNSW-NB15
在线阅读 下载PDF
Object-based classification of hyperspectral data using Random Forest algorithm 被引量:3
8
作者 Saeid Amini Saeid Homayouni +1 位作者 Abdolreza Safari Ali A.Darvishsefat 《Geo-Spatial Information Science》 SCIE CSCD 2018年第2期127-138,共12页
This paper presents a new framework for object-based classification of high-resolution hyperspectral data.This multi-step framework is based on multi-resolution segmentation(MRS)and Random Forest classifier(RFC)algori... This paper presents a new framework for object-based classification of high-resolution hyperspectral data.This multi-step framework is based on multi-resolution segmentation(MRS)and Random Forest classifier(RFC)algorithms.The first step is to determine of weights of the input features while using the object-based approach with MRS to processing such images.Given the high number of input features,an automatic method is needed for estimation of this parameter.Moreover,we used the Variable Importance(VI),one of the outputs of the RFC,to determine the importance of each image band.Then,based on this parameter and other required parameters,the image is segmented into some homogenous regions.Finally,the RFC is carried out based on the characteristics of segments for converting them into meaningful objects.The proposed method,as well as,the conventional pixel-based RFC and Support Vector Machine(SVM)method was applied to three different hyperspectral data-sets with various spectral and spatial characteristics.These data were acquired by the HyMap,the Airborne Prism Experiment(APEX),and the Compact Airborne Spectrographic Imager(CASI)hyperspectral sensors.The experimental results show that the proposed method is more consistent for land cover mapping in various areas.The overall classification accuracy(OA),obtained by the proposed method was 95.48,86.57,and 84.29%for the HyMap,the APEX,and the CASI datasets,respectively.Moreover,this method showed better efficiency in comparison to the spectralbased classifications because the OAs of the proposed method was 5.67 and 3.75%higher than the conventional RFC and SVM classifiers,respectively. 展开更多
关键词 Object-based classification random forest algorithm multi-resolution segmentation(MRS) hyperspectral imagery
原文传递
Underground location algorithm based on random forest and environmental factor compensation 被引量:2
9
作者 Xin Qiao Fei Chang 《International Journal of Coal Science & Technology》 EI CAS CSCD 2021年第5期1108-1117,共10页
Aiming at the poor location accuracy caused by the harsh and complex underground environment,long strip roadway,limited wireless transmission and sparse anchor nodes,an underground location algorithm based on random f... Aiming at the poor location accuracy caused by the harsh and complex underground environment,long strip roadway,limited wireless transmission and sparse anchor nodes,an underground location algorithm based on random forest and compensation for environmental factors was proposed.Firstly,the underground wireless access point(AP)network model and tunnel environment were analyzed,and the fingerprint location algorithm was built.And then the Received Signal Strength(RSS)was analyzed by Kalman Filter algorithm in the offline sampling and real-time positioning stage.Meanwhile,the target speed constraint condition was introduced to reduce the error caused by environmental factors.The experimental results show that the proposed algorithm solves the problem of insufficient location accuracy and large fluctuation affected by environment when the anchor nodes are sparse.At the same time,the average location accuracy reaches three meters,which can satisfy the application of underground rescue,activity track playback,disaster monitoring and positioning.It has high application value in complex underground environment. 展开更多
关键词 Underground Coal Mine random forest Kalman Filter Compensation algorithm
在线阅读 下载PDF
Prediction of maximum upward displacement of shield tunnel linings during construction using particle swarm optimization-random forest algorithm 被引量:2
10
作者 Xiaowei YE Xiaolong ZHANG +2 位作者 Yanbo CHEN Yujun WEI Yang DING 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2024年第1期1-17,共17页
During construction,the shield linings of tunnels often face the problem of local or overall upward movement after leaving the shield tail in soft soil areas or during some large diameter shield projects.Differential ... During construction,the shield linings of tunnels often face the problem of local or overall upward movement after leaving the shield tail in soft soil areas or during some large diameter shield projects.Differential floating will increase the initial stress on the segments and bolts which is harmful to the service performance of the tunnel.In this study we used a random forest(RF)algorithm combined particle swarm optimization(PSO)and 5-fold cross-validation(5-fold CV)to predict the maximum upward displacement of tunnel linings induced by shield tunnel excavation.The mechanism and factors causing upward movement of the tunnel lining are comprehensively summarized.Twelve input variables were selected according to results from analysis of influencing factors.The prediction performance of two models,PSO-RF and RF(default)were compared.The Gini value was obtained to represent the relative importance of the influencing factors to the upward displacement of linings.The PSO-RF model successfully predicted the maximum upward displacement of the tunnel linings with a low error(mean absolute error(MAE)=4.04 mm,root mean square error(RMSE)=5.67 mm)and high correlation(R^(2)=0.915).The thrust and depth of the tunnel were the most important factors in the prediction model influencing the upward displacement of the tunnel linings. 展开更多
关键词 random forest(rf) Particle swarm optimization(PSO) Upward displacement of lining Machine learning prediction Shieldtunneling construction
原文传递
基于GA-RF的螺杆转子砂带磨削表面粗糙度预测
11
作者 李越 杨赫然 +2 位作者 孙兴伟 董祉序 刘寅 《制造技术与机床》 北大核心 2026年第1期201-207,共7页
为了系统分析砂带磨削工艺参数对螺杆转子表面质量的影响规律,从而为实际生产中的参数选择提供参考依据。为提高预测精度,文章构建基于遗传算法优化的随机森林预测模型,并设计了五因素五水平正交试验,试验装置为自主研发的多头螺杆磨削... 为了系统分析砂带磨削工艺参数对螺杆转子表面质量的影响规律,从而为实际生产中的参数选择提供参考依据。为提高预测精度,文章构建基于遗传算法优化的随机森林预测模型,并设计了五因素五水平正交试验,试验装置为自主研发的多头螺杆磨削装置,具体参数为工件轴向进给速度为100~300 mm/min、砂带线速度为4.4~13.3 m/s、砂带张紧压力为0.20~0.30 MPa、磨削压力为0.40~0.50 MPa、砂带粒度为60~180μm。试验结果表明,遗传-随机森林(genetic algorithm-random forest, GA-RF)模型的平均预测误差为9.06%,明显低于Lasso模型(25.96%)和SVR模型(30.68%);单因素分析显示,表面粗糙度随轴向进给速度增加而变大,随着砂带线速度升高而降低;当进给速度从100增至300 mm/min时,Ra值上升约27%;而线速度从4.4 m/s提高到13.3 m/s时,Ra值下降约35%。研究验证了遗传-随机森林(GA-RF)模型在砂带磨削质量预测中的有效性,同时揭示了关键工艺参数的影响规律。研究可为螺杆转子加工参数选择提供理论指导,对实际生产具有重要的参考价值。 展开更多
关键词 砂带磨削 接触轮式磨削 粗糙度预测 遗传算法 随机森林
在线阅读 下载PDF
Winter Wheat Yield Estimation Based on Sparrow Search Algorithm Combined with Random Forest:A Case Study in Henan Province,China 被引量:1
12
作者 SHI Xiaoliang CHEN Jiajun +2 位作者 DING Hao YANG Yuanqi ZHANG Yan 《Chinese Geographical Science》 SCIE CSCD 2024年第2期342-356,共15页
Precise and timely prediction of crop yields is crucial for food security and the development of agricultural policies.However,crop yield is influenced by multiple factors within complex growth environments.Previous r... Precise and timely prediction of crop yields is crucial for food security and the development of agricultural policies.However,crop yield is influenced by multiple factors within complex growth environments.Previous research has paid relatively little attention to the interference of environmental factors and drought on the growth of winter wheat.Therefore,there is an urgent need for more effective methods to explore the inherent relationship between these factors and crop yield,making precise yield prediction increasingly important.This study was based on four type of indicators including meteorological,crop growth status,environmental,and drought index,from October 2003 to June 2019 in Henan Province as the basic data for predicting winter wheat yield.Using the sparrow search al-gorithm combined with random forest(SSA-RF)under different input indicators,accuracy of winter wheat yield estimation was calcu-lated.The estimation accuracy of SSA-RF was compared with partial least squares regression(PLSR),extreme gradient boosting(XG-Boost),and random forest(RF)models.Finally,the determined optimal yield estimation method was used to predict winter wheat yield in three typical years.Following are the findings:1)the SSA-RF demonstrates superior performance in estimating winter wheat yield compared to other algorithms.The best yield estimation method is achieved by four types indicators’composition with SSA-RF)(R^(2)=0.805,RRMSE=9.9%.2)Crops growth status and environmental indicators play significant roles in wheat yield estimation,accounting for 46%and 22%of the yield importance among all indicators,respectively.3)Selecting indicators from October to April of the follow-ing year yielded the highest accuracy in winter wheat yield estimation,with an R^(2)of 0.826 and an RMSE of 9.0%.Yield estimates can be completed two months before the winter wheat harvest in June.4)The predicted performance will be slightly affected by severe drought.Compared with severe drought year(2011)(R^(2)=0.680)and normal year(2017)(R^(2)=0.790),the SSA-RF model has higher prediction accuracy for wet year(2018)(R^(2)=0.820).This study could provide an innovative approach for remote sensing estimation of winter wheat yield.yield. 展开更多
关键词 winter wheat yield estimation sparrow search algorithm combined with random forest(SSA-rf) machine learning multi-source indicator optimal lead time Henan Province China
在线阅读 下载PDF
Enhancing rock slope stability prediction using random forest machine learning:A case study
13
作者 Afiqah Ismail Ahmad Safuan A Rashid +10 位作者 Ali Dehghanbanadaki Rafiuddin Hakim Roslan Mohd Firdaus Md Dan@Azlan Abd Wahid Rasib Radzuan Saari Mushairry Mustaffar Azman Kassim Rini Asnida Abdullah Khairul Hazman Padil Norbazlan Mohd Yusof Norisam Abd Rahaman 《China Geology》 2025年第4期691-706,共16页
The prediction of slope stability is a complex nonlinear problem.This paper proposes a new method based on the random forest(RF)algorithm to study the rocky slopes stability.Taking the Bukit Merah,Perak and Twin Peak(... The prediction of slope stability is a complex nonlinear problem.This paper proposes a new method based on the random forest(RF)algorithm to study the rocky slopes stability.Taking the Bukit Merah,Perak and Twin Peak(Kuala Lumpur)as the study area,the slope characteristics of geometrical parameters are obtained from a multidisciplinary approach(consisting of geological,geotechnical,and remote sensing analyses).18 factors,including rock strength,rock quality designation(RQD),joint spacing,continuity,openness,roughness,filling,weathering,water seepage,temperature,vegetation index,water index,and orientation,are selected to construct model input variables while the factor of safety(FOS)functions as an output.The area under the curve(AUC)value of the receiver operating characteristic(ROC)curve is obtained with precision and accuracy and used to analyse the predictive model ability.With a large training set and predicted parameters,an area under the ROC curve(the AUC)of 0.95 is achieved.A precision score of 0.88 is obtained,indicating that the model has a low false positive rate and correctly identifies a substantial number of true positives.The findings emphasise the importance of using a variety of terrain characteristics and different approaches to characterise the rock slope. 展开更多
关键词 Slope stability prediction random forest algorithm Remote sensing in Geology Factor of Safety(FOS) Geometrical parameters Rock quality designation(RQD) Multilayer perceptron(MLP)
在线阅读 下载PDF
Random Forest Based Very Fast Decision Tree Algorithm for Data Stream
14
作者 DONG Zhenjiang LUO Shengmei +2 位作者 WEN Tao ZHANG Fayang LI Lingjuan 《ZTE Communications》 2017年第B12期52-57,共6页
The Very Fast Decision Tree(VFDT)algorithm is a classification algorithm for data streams.When processing large amounts of data,VFDT requires less time than traditional decision tree algorithms.However,when training s... The Very Fast Decision Tree(VFDT)algorithm is a classification algorithm for data streams.When processing large amounts of data,VFDT requires less time than traditional decision tree algorithms.However,when training samples become fewer,the label values of VFDT leaf nodes will have more errors,and the classification ability of single VFDT decision tree is limited.The Random Forest algorithm is a combinational classifier with high prediction accuracy and noise-tol-erant ability.It is constituted by multiple decision trees and can make up for the shortage of single decision tree.In this paper,in order to improve the classification accuracy on data streams,the Random Forest algorithm is integrated into the process of tree building of the VFDT algorithm,and a new Random Forest Based Very Fast Decision Tree algorithm named RFVFDT is designed.The RFVFDT algorithm adopts the decision tree building criterion of a Random Forest classifier,and improves Random Forest algorithm with sliding window to meet the unboundedness of data streams and avoid process delay and data loss.Experimental results of the classification of KDD CUP data sets show that the classification accuracy of RFVFDT algorithm is higher than that of VFDT.The less the samples are,the more obvious the advantage is.RFVFDT is fast when running in the multithread mode. 展开更多
关键词 DATA STREAM DATA classification random forest algorithm VFDT algorithm
在线阅读 下载PDF
Investigation of Nuclear Binding Energy and Charge Radius Based on Random Forest Algorithm
15
作者 CAI Boshuai YU Tianjun +3 位作者 LIN Xuan ZHANG Jilong WANG Zhixuan YUAN Cenxi 《原子能科学技术》 EI CAS CSCD 北大核心 2023年第4期704-712,共9页
The random forest algorithm was applied to study the nuclear binding energy and charge radius.The regularized root-mean-square of error(RMSE)was proposed to avoid overfitting during the training of random forest.RMSE ... The random forest algorithm was applied to study the nuclear binding energy and charge radius.The regularized root-mean-square of error(RMSE)was proposed to avoid overfitting during the training of random forest.RMSE for nuclides with Z,N>7 is reduced to 0.816 MeV and 0.0200 fm compared with the six-term liquid drop model and a three-term nuclear charge radius formula,respectively.Specific interest is in the possible(sub)shells among the superheavy region,which is important for searching for new elements and the island of stability.The significance of shell features estimated by the so-called shapely additive explanation method suggests(Z,N)=(92,142)and(98,156)as possible subshells indicated by the binding energy.Because the present observed data is far from the N=184 shell,which is suggested by mean-field investigations,its shell effect is not predicted based on present training.The significance analysis of the nuclear charge radius suggests Z=92 and N=136 as possible subshells.The effect is verified by the shell-corrected nuclear charge radius model. 展开更多
关键词 nuclear binding energy nuclear charge radius random forest algorithm
在线阅读 下载PDF
The Comparison between Random Forest and Support Vector Machine Algorithm for Predicting β-Hairpin Motifs in Proteins
16
作者 Shaochun Jia Xiuzhen Hu Lixia Sun 《Engineering(科研)》 2013年第10期391-395,共5页
Based on the research of predictingβ-hairpin motifs in proteins, we apply Random Forest and Support Vector Machine algorithm to predictβ-hairpin motifs in ArchDB40 dataset. The motifs with the loop length of 2 to 8 ... Based on the research of predictingβ-hairpin motifs in proteins, we apply Random Forest and Support Vector Machine algorithm to predictβ-hairpin motifs in ArchDB40 dataset. The motifs with the loop length of 2 to 8 amino acid residues are extracted as research object and thefixed-length pattern of 12 amino acids are selected. When using the same characteristic parameters and the same test method, Random Forest algorithm is more effective than Support Vector Machine. In addition, because of Random Forest algorithm doesn’t produce overfitting phenomenon while the dimension of characteristic parameters is higher, we use Random Forest based on higher dimension characteristic parameters to predictβ-hairpin motifs. The better prediction results are obtained;the overall accuracy and Matthew’s correlation coefficient of 5-fold cross-validation achieve 83.3% and 0.59, respectively. 展开更多
关键词 random forest algorithm Support Vector Machine algorithm β-Hairpin MOTIF INCREMENT of Diversity SCORING Function Predicted Secondary Structure Information
暂未订购
Random forest algorithm and regional applications of spectral inversion model for estimating canopy nitrogen concentration in rice 被引量:1
17
作者 LI Xuqing LIU Xiangnan LIU Meiling WU Ling 《遥感学报》 CSCD 北大核心 2014年第4期923-945,共23页
原文传递
Companies’ E-waste Estimation Based on General Equilibrium The­ory Context and Random Forest Regression Algorithm in Cameroon: Case Study of SMEs Implementing ISO 14001:2015
18
作者 Gilson Tekendo Djoukoue Idriss Djiofack Teledjieu Sijun Bai 《Journal of Management Science & Engineering Research》 2023年第2期60-81,共22页
Given the challenge of estimating or calculating quantities of waste electrical and electronic equipment(WEEE)in developing countries,this article focuses on predicting the WEEE generated by Cameroonian small and medi... Given the challenge of estimating or calculating quantities of waste electrical and electronic equipment(WEEE)in developing countries,this article focuses on predicting the WEEE generated by Cameroonian small and medium enterprises(SMEs)that are engaged in ISO 14001:2015 initiatives and consume electrical and electronic equipment(EEE)to enhance their performance and profitability.The methodology employed an exploratory approach involving the application of general equilibrium theory(GET)to contextualize the study and generate relevant parameters for deploying the random forest regression learning algorithm for predictions.Machine learning was applied to 80%of the samples for training,while simulation was conducted on the remaining 20%of samples based on quantities of EEE utilized over a specific period,utilization rates,repair rates,and average lifespans.The results demonstrate that the model’s predicted values are significantly close to the actual quantities of generated WEEE,and the model’s performance was evaluated using the mean squared error(MSE)and yielding satisfactory results.Based on this model,both companies and stakeholders can set realistic objectives for managing companies’WEEE,fostering sustainable socio-environmental practices. 展开更多
关键词 Electrical and electronic equipment(EEE) Waste from electrical and electronic equipment(WEEE) General equilibrium theory random forest regression algorithm DECISION-MAKING Cameroon
在线阅读 下载PDF
基于BPNN-EKF-GD-RF算法的锂离子电池组荷电状态估计方法
19
作者 来鑫 翁嘉辉 +4 位作者 杨一鹏 孙宇飞 周龙 郑岳久 韩雪冰 《机械工程学报》 北大核心 2025年第12期251-265,共15页
锂离子电池模组的荷电状态估计(State-of-charge, SOC)是影响电池性能的一个重要内部状态,是电池组进行其它状态估计的基础。然而它的估计准确性易受温度等外部因素影响,且电池间的不一致性也为电池组中各单体电池的SOC估计带来了困难... 锂离子电池模组的荷电状态估计(State-of-charge, SOC)是影响电池性能的一个重要内部状态,是电池组进行其它状态估计的基础。然而它的估计准确性易受温度等外部因素影响,且电池间的不一致性也为电池组中各单体电池的SOC估计带来了困难。提出一种将BP神经网络(Back propagation neural network, BPNN)与扩展卡尔曼滤波(Extended Kalman filter, EKF)算法相结合的电池组SOC估计方法。该方法首先基于先验SOC利用BPNN估计不同温度下“领导者”电池的端电压,将其与实测端电压对比后采用EKF算法完成SOC后验估计,同时基于电压差采用梯度下降(Gradient descent, GD)算法更新BPNN的输出层权重使算法更快收敛。在此基础上,设计修正策略利用随机森林(Random forest, RF)算法对“跟随者”电池的SOC进行调整估计。试验结果表明,所提的BPNN-EKF-GD-RF算法能实现电池组在不同温度下SOC的准确估计,常温下SOC估计误差保持在2.5%以内,在温度变化下电池组中单体电池SOC估计最大误差不超过3.2%,为复杂环境下锂离子电池组的SOC估计提供了一种高精度低复杂度方案。 展开更多
关键词 SOC估计 BP神经网络 扩展卡尔曼滤波 梯度下降算法 随机森林 锂离子电池组
原文传递
彬长矿区煤层采动导水裂隙带高度RF-BP模型预测对比研究
20
作者 姬亚东 刘譞 +5 位作者 朱开鹏 赵春虎 李凯 袁晨瀚 李盼盼 闫鹏珍 《煤矿安全》 北大核心 2025年第7期175-184,共10页
西部黄陇侏罗系煤田煤层赋存条件一般较厚,其中彬长矿区煤层厚度平均大于5 m,最厚可达14 m,且常采用综放开采工艺,造成煤层顶板导水裂隙带发育厚度大且发育规律不明,矿井涌水量居高不下,严重影响矿区安全生产。为研究彬长矿区煤矿工作... 西部黄陇侏罗系煤田煤层赋存条件一般较厚,其中彬长矿区煤层厚度平均大于5 m,最厚可达14 m,且常采用综放开采工艺,造成煤层顶板导水裂隙带发育厚度大且发育规律不明,矿井涌水量居高不下,严重影响矿区安全生产。为研究彬长矿区煤矿工作面开采扰动覆岩而导致的煤层顶板导水裂隙带发育高度,优选了煤层开采厚度、煤层埋深、顶板覆岩岩性、顶板构造特征、开采速度、工作面长度、采煤工艺等7个影响因素,通过AHP层次分析法分别计算出了上述各影响因素的权重,发现煤层开采厚度、工作面长度2个影响因素所占权重相对较大;通过Matlab对搜集的数据进行插值,使数据分布更为平滑;通过反向传播神经网络(BP)、遗传算法优化神经网络(GA-BP)、粒子群优化算法优化神经网络(PSO-BP)、随机森林(RF)算法对插值后数据进行回归拟合。研究发现,4种方法对原始数据的拟合效果都较好,其中随机森林RF相对其他模型对原始数据的拟合具有更高的准确度,训练集和测试集的均方根误差RMSE分别为0.037 41和0.055 16,决定系数R2分别为0.987 37和0.957 89。研究结果可为彬长矿区煤矿开采导水裂隙带发育高度的预测提供一定的参考。 展开更多
关键词 导水裂隙带 煤矿智能化 随机森林算法 BP神经网络 矿井涌水
在线阅读 下载PDF
上一页 1 2 95 下一页 到第
使用帮助 返回顶部