摘要
主要研究了粗糙集理论在决策表离散化中的应用,提出了一种新的基于粗糙集理论的决策表离散化算法.该算法是一种基于决策表属性重要性的算法,首先使用条件属性与决策属性的互信息来度量条件属性的重要性,并据此对条件属性按照重要性由小到大排序,然后按排序后的顺序,考察每个条件属性的所有断点,将冗余的断点去掉,从而将条件属性离散化.
The application of the rough set theory in the discretization of the decision table is studied, and a noval discretization algorithm based on the rough set theory is presented. The algorithm of this paper is based on the importance of condition attributes. Firstly, the inter-information between condition attributes and decision attributes is used to measure the importance of condition attributes, according to which the condition attributes are sorted in a descending order. Secondly, all break points of every condition attributes are examined and the redundant ones are eliminated. Finally, each value in the decision table is replaced by a number representing the break point, and then the decision table is discretized.
出处
《西安电子科技大学学报》
EI
CAS
CSCD
北大核心
2004年第3期469-472,共4页
Journal of Xidian University
关键词
粗糙集
决策表离散化
数据挖掘
rough set
decision table discretization
data mining