期刊文献+

基于方形对称邻域的局部离群点检测方法 被引量:5

Square symmetric neighborhood based local outlier detection algorithm
在线阅读 下载PDF
导出
摘要 针对NDOD(outlier detection algorithm based on neighborhood and density)算法在判断具有不同密度分布的聚类间过渡区域对象时存在的不足,以及为了降低算法时间复杂度,提出一种基于方形对称邻域的局部离群点检测方法。该算法改用方形邻域,吸收基于网格的思想,通过扩张方形邻域快速排除聚类点及避免"维灾";通过引入记忆思想,使得邻域查询次数及范围成倍地减小;同时新定义的离群度度量方法有利于提高检测精度。实验测试表明,该算法检测离群点的速度及精度均优于NDOD等算法。 NDOD may result in wrong estimation when objects are in the location where the density distributions in multiple clusters are significantly different.To void this problem and reduce the computational complexity,this paper proposed a new density based algorithm named SSNOD(square symmetric neighborhood based local outlier detection algorithm).By utilizing the grid-based idea,the algorithm partitioned dataset with square neighborhood and expaned neighborhood rapidly,it could get rid of non-outliers quickly and overcome "dimension curse".By absorbing memory idea,the times of neighborhood query and range were significantly decreased.Besides,computation accuracy could be improved within the novel metrics.Experimental result shows SSNOD is not only efficient in the computation but also more effective than NDOD in detection accuracy.
出处 《计算机应用研究》 CSCD 北大核心 2012年第2期472-474,共3页 Application Research of Computers
基金 国家自然科学基金资助项目(61073058)
关键词 数据挖掘 离群检测 方形对称邻域 局部离群度 data mining outlier detection square symmetric neighborhood local outlier degree
  • 相关文献

参考文献10

  • 1张卫旭,尉宇.基于密度的局部离群点检测算法[J].计算机与数字工程,2010,38(10):11-14. 被引量:12
  • 2胡彩平,秦小麟.一种基于密度的局部离群点检测算法DLOF[J].计算机研究与发展,2010,47(12):2110-2116. 被引量:54
  • 3HAN Jia-wei, KAMBER M. Data mining: concepts and techniques [ M]. 2nd ed. San Francisco: Morgan Kaufmann, 2006.
  • 4MARATEB H R, ROJAS-MARTINEZ M, MANSOURIAN M, et al. Outlier detection in high-density surface electromyographic signals [C ]//Proc of the 32nd Annual International Conference. 2010: 4850-4853.
  • 5BREUNIG M M, KRIEGEL H, NG R T, et al. LOF: identifying density-based local outliers[ C ]//Proc of ACM SIGMOD International Conference on Management of Data. New York: ACM Press, 2000: 93-104.
  • 6ESTER M, KRIEGEL H P, SANDER J, et al. A density-based algo- rithm for discovering clusters in large spatial databases with noise [C]//Proc of the 2nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York: ACM Press, 1996 : 226-231.
  • 7ZHOU Shui-geng, ZI-IAO Yue, GUAN Ji-hong, et al. A neighbor- hood-based clustering algorithm [ C ]//Proc of the 9th Pacific-Asia Conference on Knowledge Discovery and Data Mining. Berlin: Sprin- ger, 2005: 361-371.
  • 8JIN Wen, TUNG A K H,HAN Jia-wei, et al. Ranking outliers using symmetric neighborhood relationship [ C ]//Proc of the 10th Pacific- Asia Conference on Knowledge Discovery and Data Mining. Berlin: Springer, 2006: 93-104.
  • 9黄添强,秦小麟,叶飞跃.基于方形邻域的离群点查找新方法[J].控制与决策,2006,21(5):541-545. 被引量:16
  • 10陶运信,皮德常.基于邻域和密度的异常点检测算法[J].吉林大学学报(信息科学版),2008,26(4):398-403. 被引量:12

二级参考文献46

共引文献86

同被引文献52

  • 1邓波,张玉超,金松昌,林旺群.基于MapReduce并行架构的大数据社会网络社团挖掘方法[J].计算机研究与发展,2013,50(S2):187-195. 被引量:10
  • 2倪巍伟,陆介平,陈耿,孙志挥.基于k均值分区的数据流离群点检测算法[J].计算机研究与发展,2006,43(9):1639-1643. 被引量:20
  • 3常建龙,曹锋,周傲英+.基于滑动窗口的进化数据流聚类[J].软件学报,2007,18(4):905-918. 被引量:61
  • 4Ng R T, Han Jiawei. Efficient and effective clustering meth- ods for spatial data mining[C]//Proceedings of the 20th In- ternational Conference on Very Large Data Bases. 1994:144- 155.
  • 5He Zengyou, Xu Xiaofei, Huang Zhexue, et al. FP-outli- er : Frequent pattern based outlier detection [ J ]. Computer Science and Information Systems, 2005,2 ( 1 ) : 103-118.
  • 6Zhang Tian, Ramakrishnan R, Livny M. BIRCH: An effi- cient data clustering method for very large databases [ C ]// Proceedings of the 1996 ACM SIGMOD International Con- ference on Management of Data. 1996 : 103-114.
  • 7Han Jiawei, Kamber M. Data Mining: Concepts and Tech- niques [ M ]. 2nd Edition. San Francisco : Morgan Kauf- mann, 2006.
  • 8Marateb H R, Rojas-Martinez M, Mananas Villanueva M A, et al. Robust outlier detection in high-density surface electromyographic signals [ C]// Proceedings of the 2010 Annual International Conference of the IEEE Engineering in Medicine and Bioloav. 2010:4850-4853.
  • 9Guha S, Mishra N, Motwani R, et al. Clustering data streams[ C ]// Proceedings of the d lst Annual Symposium on Foundations of Computer Science. 2000:359-366.
  • 10O' Callaghan L, Mishra N, Meyerson A, et al. Streaming- data algorithms for high-quality clustering [ C ]// Proceed- ings of the 18th IEEE International Conference on Data En- gineering. 2002:685-694.

引证文献5

二级引证文献35

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部