Data Mining Using MLC++ (利用MLC++实现数据挖掘)

Abstract: Data mining is the process of extracting hidden knowledge from large volumes of raw data. Most data mining tools rely on rule discovery and decision tree classification to find patterns and rules in data, and at their core is an inductive algorithm. Compared with traditional statistical methods, classification results obtained with machine learning techniques are easier to interpret. When mining a specific dataset without the relevant domain knowledge, however, a user or decision maker can hardly tell which inductive algorithm to choose, and so has to try several. With MLC++, a decision maker can easily compare how effective different classification algorithms are on a given dataset and pick a suitable one. System developers can also use MLC++ to design hybrid algorithms.
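The comparison workflow described in the abstract can be illustrated with a minimal sketch. It does not use MLC++ or reproduce its API. Every name here (`majority_learner`, `one_nn_learner`, `cross_val_accuracy`, `compare`, and the toy `DATA`) is hypothetical, and the two learners are simple stand-ins chosen for brevity. The idea is only that several inducers are run on the same dataset under the same cross-validation scheme and the one with the best estimated accuracy is selected.

```python
# Illustrative sketch only: not MLC++ code, all names are hypothetical.
from collections import Counter


def majority_learner(train):
    """Inducer that always predicts the most common training label."""
    label = Counter(y for _, y in train).most_common(1)[0][0]
    return lambda x: label


def one_nn_learner(train):
    """Inducer that predicts the label of the nearest training example."""
    def predict(x):
        nearest = min(
            train,
            key=lambda ex: sum((a - b) ** 2 for a, b in zip(ex[0], x)),
        )
        return nearest[1]
    return predict


def cross_val_accuracy(learner, data, k=5):
    """Mean accuracy over k folds; fold i holds out examples i, i+k, i+2k, ..."""
    scores = []
    for i in range(k):
        test = data[i::k]
        train = [ex for j, ex in enumerate(data) if j % k != i]
        model = learner(train)
        correct = sum(model(x) == y for x, y in test)
        scores.append(correct / len(test))
    return sum(scores) / k


def compare(learners, data, k=5):
    """Evaluate every learner on the same data with the same folds."""
    return {name: cross_val_accuracy(fn, data, k) for name, fn in learners.items()}


# Toy dataset (made up for this sketch): two well-separated clusters.
DATA = [((float(i), float(i % 3)), "low") for i in range(10)] + \
       [((float(i) + 100.0, float(i % 3)), "high") for i in range(10)]

RESULTS = compare({"majority": majority_learner, "1-NN": one_nn_learner}, DATA)
BEST = max(RESULTS, key=RESULTS.get)

for name, acc in RESULTS.items():
    print(f"{name}: {acc:.2f}")
print("selected:", BEST)
```

Holding the folds fixed across learners keeps the comparison fair, since each algorithm is scored on exactly the same held-out examples.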

Author: Liu Xiaoping (刘晓平)

Affiliation: Graduate School of the Chinese Academy of Sciences (中国科学院研究生院)

Source: Computer Simulation (《计算机仿真》), CSCD, 2006, No. 4, pp. 103-105, 113 (4 pages)

Keywords: data mining; machine learning; classification algorithms; decision trees; programming

Classification number: TP181 [Automation and Computer Technology: Control Theory and Control Engineering]
