期刊文献+

Half-global discretization algorithm based on rough set theory 被引量:2

Half-global discretization algorithm based on rough set theory
在线阅读 下载PDF
导出
摘要 It is being widely studied how to extract knowledge from a decision table based on rough set theory. The novel problem is how to discretize a decision table having continuous attribute. In order to obtain more reasonable discretization results, a discretization algorithm is proposed, which arranges half-global discretization based on the correlational coefficient of each continuous attribute while considering the uniqueness of rough set theory. When choosing heuristic information, stability is combined with rough entropy. In terms of stability, the possibility of classifying objects belonging to certain sub-interval of a given attribute into neighbor sub-intervals is minimized. By doing this, rational discrete intervals can be determined. Rough entropy is employed to decide the optimal cut-points while guaranteeing the consistency of the decision table after discretization. Thought of this algorithm is elaborated through Iris data and then some experiments by comparing outcomes of four discritized datasets are also given, which are calculated by the proposed algorithm and four other typical algorithras for discritization respectively. After that, classification rules are deduced and summarized through rough set based classifiers. Results show that the proposed discretization algorithm is able to generate optimal classification accuracy while minimizing the number of discrete intervals. It displays superiority especially when dealing with a decision table having a large attribute number. It is being widely studied how to extract knowledge from a decision table based on rough set theory. The novel problem is how to discretize a decision table having continuous attribute. In order to obtain more reasonable discretization results, a discretization algorithm is proposed, which arranges half-global discretization based on the correlational coefficient of each continuous attribute while considering the uniqueness of rough set theory. When choosing heuristic information, stability is combined with rough entropy. In terms of stability, the possibility of classifying objects belonging to certain sub-interval of a given attribute into neighbor sub-intervals is minimized. By doing this, rational discrete intervals can be determined. Rough entropy is employed to decide the optimal cut-points while guaranteeing the consistency of the decision table after discretization. Thought of this algorithm is elaborated through Iris data and then some experiments by comparing outcomes of four discritized datasets are also given, which are calculated by the proposed algorithm and four other typical algorithras for discritization respectively. After that, classification rules are deduced and summarized through rough set based classifiers. Results show that the proposed discretization algorithm is able to generate optimal classification accuracy while minimizing the number of discrete intervals. It displays superiority especially when dealing with a decision table having a large attribute number.
出处 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2009年第2期339-347,共9页 系统工程与电子技术(英文版)
关键词 half-global discretization continuous condition attributes correlation coefficient rough entropy STABILITY rough set theory half-global discretization, continuous condition attributes, correlation coefficient, rough entropy, stability, rough set theory
  • 相关文献

参考文献1

二级参考文献16

  • 1Nguyen S.H., Nguyen H.S.. Some efficient algorithms for rough set methods. In: Proceedings of the Conference of Information Processing and Management of Uncertainty in Knowledge-Based Systems, Granada, Spain, 1996, 1451~1456.
  • 2Susmaga R.. Analyzing discretizations of continuous attributes given a monotonic discrimination function. Intelligent Data Analysis, 1997, 1(4): 157~179.
  • 3Dai Jian-Hua, Li Yuan-Xiang. Study on discretization based on rough set theory. In: Proceedings of the first International Conference on Machine Learning and Cybernetics, Beijing, 2002, 1371~1373.
  • 4Chen Cai-Yun, Li Zhi-Guo, Qiao Sheng-Yong, Wen Shuo-Pin. Study on discretization in rough set based on genetic algorithm. In: Proceedings of the Second International Conference on Machine Learning and Cybernetics, Xi′an, 2003, 1430~1434.
  • 5Huang Jin-Jie, Li Shi-Yong. A GA-based approach to rough data model. In: Proceedings of the 5th World Congress on Intelligent Control and Automation, Hangzhou. 2004, 1880~1884.
  • 6Roy A., Pal S.K.. Fuzzy discretization of feature space for a rough set classier. Pattern Recognition Letters, 2003, 24(6): 895~902.
  • 7Wang Li-Hong, Zhang Shu-Cui, Fan Hui, Wu Geng-Feng. The information granulation in discretization. In: Proceedings of the Second International Conference on Machine Learning and Cybernedcs, Xi′an, 2003, 2620~2623.
  • 8Li Meng-Xin, Wu Cheng-Dong, Han Zhong-Hua, Yue Yong. A hierarchical clustering method for attribute discretization in rough set theory. In: Proceedings of the third International Conference on Machine Learning and Cybernetics, Shanghai, 2004, 3650~3654.
  • 9Shen L., Tay E.H.. A discretization method for rough sets theory. Intelligent Data Analysis, 2001, 5(5): 431~438.
  • 10Tay E.H., Shen L.. A modified Chi2 algorithm for discretization. IEEE Transactions on Knowledge and Data Engineering, 2002, 14(3): 666~670.

共引文献133

同被引文献45

引证文献2

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部