摘要
为减少冗余候选项集的产生,提出了一种基于向量矩阵的频繁项集挖掘算法FIS-Miner.在该算法中,将所有频繁1-项集按支持度升序进行排序并存储其对应的二进制位向量,将这些二进制位向量映射到向量矩阵进行分析找出所有的频繁项集,既实现了数据库的一次扫描又避免了大量候选项集的产生.探讨了该算法的实现步骤,并给出实例验证了该算法的有效性.
A new algorithm FIS-Miner (Frequent Item Sets Miner) is presented for discovering frequent item sets to decrease candidate generation based on vector matrix. All frequent 1-itemsets are put into an array in support degree ascending order, storing corresponding binary vector. All frequent item sets are found by analyzing these binary vectors and mapping vector matrix, therefore database needs to scan only one time while avoiding many candidates. This algorithm steps are explored and its validity is examined.
出处
《大连交通大学学报》
CAS
2008年第3期74-77,共4页
Journal of Dalian Jiaotong University
关键词
二进制位向量
向量矩阵
频繁项集
最大频繁项集
binary vector
vector matrix
frequent item sets
maximum frequent item sets