期刊文献+

基于训练集划分的随机森林算法 被引量:2

A Random Forest Algorithm Based on Training set Splitting
在线阅读 下载PDF
导出
摘要 本文提出了一种基于训练集划分的随机森林算法。该算法首先将多数类划分为多个不相交子集。然后将每个子集与少数类合并,进行决策树的训练。最后根据平均加权策略构建随机森林,并获取最终的分类规则。本文所提方法避免了原始样本信息的损失,而且保持了子分类器的样本平衡。在人工生成数据集上的仿真实验表明本文方法非常有效。 In this paper, a random forest algorithm based on the training set splitting is proposed. Firstly, the majority class is divided into multiple disjointed sunsets. Then combine each subset with the rare class to train a decision tree. Finally, construct a random forest based on the average weighted strategy, and obtain the final classification rules. The proposed method avoids the loss of the original sample information, and maizes the training set balanced for each decision tree. Experiments on the artificial imbalanced data show that this method is very effective.
作者 吴华芹
出处 《科技通报》 北大核心 2013年第10期124-126,共3页 Bulletin of Science and Technology
关键词 随机森林 不相交子集 决策树 平均加权 random forest disjointed sunsets a decision tree average weighted
  • 相关文献

参考文献7

  • 1Japkow Iczn, Stephen S. The class imbalance problem: asystematic study [J]. Intelligent Data Analysis Journal,2002, 6 (5): 429-450.
  • 2Gustavo E A, Batista P A, Ronaldo C, et al. A study ofthe behavior of several methods for balancing machinelearning training data [J]. SIGKDD Explorations, 2004,6(1): 20- 29.
  • 3Domngos P. METACOST: a general method for makingclassifiers cost sensitive [C]//. Proceedings of the 5th In-ternational Conference on Knowledge Discovery and DataMining. San Diego, CA:ACM Press, 1999: 155-164.
  • 4王文震.基于流形学习的视频中文文本检测算法[J].科技通报,2012,28(10):46-48. 被引量:11
  • 5Hawla N V, Bowyer K W, Hall L 0, et al. SMOTE: syn-thetic minority over-sampling technique[J]. Journal of Ar-tificial Intelligence Research, 2002, 16: 321-357.
  • 6Yang J,Yu X,Xie Z Q.A novel virtual sample generationmethod based on Gaussian distribution [JJ.Knowledge -Based Systems,2011,24 (6):740-748.
  • 7Kohavi R. A study of cross -validation and bootstrap foraccuracy estimation and model selection[C]//. In: WermterS,Riloff E, Scheler G, eds. Proc. 14th Joint Int. Conf. Ar-tificial Intelligence. San Mateo, CA: Morgan Kaufmann,1995. 1137-1145.

二级参考文献5

共引文献10

同被引文献36

引证文献2

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部