期刊文献+

一种基于贝叶斯测度的有监督离散化方法 被引量:5

A New Supervised Discretization Method Based on Bayes Measure
在线阅读 下载PDF
导出
摘要 传统的朴素贝叶斯不能处理连续属性,文中基于贝叶斯测度提出一种有监督离散化方法。它能够在无先验知识的前提下,自动寻求最佳的离散子区间数目和区间划分。在此基础上根据MDL准则控制离散化子区间的数目,使学习方法的精确度和复杂度达到均衡。在UCI机器学习数据集上对该方法进行了验证,取得了良好的效果。 Standard naive Bayes can not handle continuous attributes. A new supervised discretization method is proposed, which is based on Bayes measure to automatically find the most appropriate boundaries for discretizationand the number of intervals. At the same time, it embodies tradeoff between the accuracy and the complexity of the learned discretization by applying MDL principle. Experimental results on UCI data sets indicate that the classification accuracy is substantially improved.
出处 《仪器仪表学报》 EI CAS CSCD 北大核心 2005年第8期786-789,共4页 Chinese Journal of Scientific Instrument
基金 国家自然科学基金项目(60275026)资助。
关键词 机器学习 朴素贝叶斯 有监督离散化 贝叶斯测度 Machine learning Naive Bayes Supervised discretization Bayes measure
  • 相关文献

参考文献10

  • 1N. Friedman, D. Geiger, M. Goldszmidt. Bayesian network classifiers. Machine Learning, 1997, 29:131~163.
  • 2A. McCallum, K. Nigam. A comparison of event models for naive Bayes text classification. Proc. of AA-AI-98 Workshop on Learning for Text Categorization, Madison, WI, 1998, 137~142.
  • 3R. O. Duda, P. E. Hart. Pattern classification and scene analysis. New York:Wiley, 1973.
  • 4J. Dougherty, R. Kohavi, M. Sahami. Supervised and unsupervised discretization of continuous features. Proc. of the 12th International Conference on Machine Learning, San Francisco, CA:Morgan Kaufmann Publishers, 1995, 194~202.
  • 5N. Friedman, M. Goldszmidt. Discretization of contin-uous attributes while learning Bayesian networks. Proc. of the 13th International Conference on Machine Lea-rning, Bari, Italy, 1996, 157~165.
  • 6李刚,童頫.基于混合概率模型的无监督离散化算法[J].计算机学报,2002,25(2):158-164. 被引量:16
  • 7边肇祺 张学工.模式识别[M].北京:清华大学出版社,2002.296-304.
  • 8B. W Silverman. Density estimation for statistics and data analysis. Monographs on Statistics and Applied Probability, 1986.
  • 9P. Smyth, A. Gray, U. Fayyad. Retrofitting decision tree classifiers using kernel density estimation. Proc. of the 12th International Conference on Machine Learning, Morgan Kaufmann Publisthers, 1995, 506~514.
  • 10.[EB/OL].http://ftp.ics.uci.edu/pub/machine-learning-databa-ses,.

二级参考文献14

  • 1[1]Catlett J. On changing continuous attributes into ordered discreteattributes. In: Proc European Working Session on Learning (EWSL91). LNAI-482, Porto,Portugal, 1991. 164-178
  • 2[2]Dougherty J, Kohavi R, Sahami M. Supervised and unsupervised discretizationof continuous features. In: Proc the 12th International Conference, Morgan KaufmannPublishers, 1995.194-202
  • 3[3]Quinlan J R. C4.5: Programs for Machine Learning. San Mateo: Morgan Kaufmann,1993
  • 4[4]Fayyad U, Irani K. Multi-interval discretizaton of continuous-valuedattributes for classification learning. In: Proc the 13th International JointConference on Artificial Intelligence, San Mateo, CA. Morgan Kaufmann Publishers,1993. 1022-1027
  • 5[5]Li G, Tong F. WILD: Weighted information-loss discretization algorithm forordinal attributes. In: Proc Conference on Intelligent Information Processing, the16th IFIP World Computer Congress 2000, Beijing, China, 2000.254-527
  • 6[6]Quinlan J R. Improved use of continuous attributes in C4.5. Journal ofArtificial Intelligence Research, 1996,4(1):77-90
  • 7[7]Wong A K C, Chiu D K Y. Synthesizing statistical knowledge from incompletemixed-mode data. IEEE Trans Pattern Analysis and Machine Intelligence, 1987,PAMI-9(6):796-805
  • 8[8]Banfield J D, Raftery A E. Model based Gaussian and non-Gaussian clustering.Biometrics, 1993,49(3):803-821
  • 9[9]Mackay D J C. Information Theory, Inference and Learning Algorithms.Cambridge: Cambridge University Press, 2000
  • 10[10]Dempster A P, Laird N M, Rubin D B. Maximum likelihood for incomplete data viathe EM algorithm. Journal of the Royal Statistical Society, Series B, 1977,39(1):1-38

共引文献41

同被引文献55

引证文献5

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部