
Outlier Detection Algorithms Based on Density Biased Sampling (一种基于密度偏差抽样的孤立点检测算法)

Cited by: 3
Abstract: Outlier detection is a valuable and important knowledge discovery task. When detecting outliers in large data sets, the technique used to select the sample data set is critical. This paper proposes a new density-based biased sampling technique as a means of data reduction, intended to speed up outlier detection in large data sets, and presents an outlier detection algorithm based on density biased sampling. The algorithm can identify outliers that lie in the low-density (sparse) regions of the sample data set. The method is analyzed and evaluated both theoretically and experimentally, and the analysis and experiments indicate that the algorithm is effective.
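The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a grid-based density estimate and a keep probability of `alpha / n**e` for a point whose grid cell holds `n` points; the function names and the parameters `cell_size`, `target_size`, `e`, and `max_cell_count` are all hypothetical choices made for illustration.

```python
import random
from collections import defaultdict


def grid_cell(point, cell_size):
    """Map a point to the integer coordinates of its grid cell."""
    return tuple(int(x // cell_size) for x in point)


def density_biased_sample(points, cell_size, target_size, e=1.0, seed=0):
    """Keep each point with probability alpha / n_cell**e, capped at 1.

    n_cell is the number of points sharing the point's grid cell, so points
    in sparse cells are kept more readily than points in dense cells.
    Returns (point, n_cell) pairs. The formula is an assumption of this sketch.
    """
    rng = random.Random(seed)
    cells = defaultdict(list)
    for p in points:
        cells[grid_cell(p, cell_size)].append(p)
    # Pick alpha so the expected sample size is about target_size
    # (exact only when no probability reaches the cap of 1).
    alpha = target_size / sum(len(m) ** (1.0 - e) for m in cells.values())
    sample = []
    for members in cells.values():
        keep_prob = min(1.0, alpha / len(members) ** e)
        for p in members:
            if rng.random() < keep_prob:
                sample.append((p, len(members)))
    return sample


def low_density_candidates(sample, max_cell_count):
    """Flag sampled points whose original grid cell held few points."""
    return [p for p, n in sample if n <= max_cell_count]


# One dense cluster of 900 points inside a single cell, plus two isolated points.
dense = [(i * 0.01, j * 0.01) for i in range(30) for j in range(30)]
isolated = [(10.0, 10.0), (-9.0, 8.0)]
sample = density_biased_sample(dense + isolated, cell_size=1.0, target_size=30)
candidates = low_density_candidates(sample, max_cell_count=1)
print(len(sample), candidates)
```

In this toy data the two isolated points each occupy their own cell, so their keep probability is capped at 1 and they always survive the sampling step, while the dense cluster is thinned heavily. A real implementation would need a principled choice of cell size and of the low-density threshold.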
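Source: Computer Science (《计算机科学》), CSCD, Peking University Core Journal, 2004, No. 10, pp. 206-208 (3 pages)
Funding: Project funded by the Chongqing Municipal Education Commission (030201)
Keywords: outlier; detection algorithm; data reduction; large-scale data; knowledge discovery; point inspection (点检); identification; sample data; region; sampling; large data set; biased sampling; outlier detection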

References (8)

  • [1] Han J, Kamber M. Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, 2001
  • [2] Palmer C R, Faloutsos C. Density biased sampling: An improved method for data mining and clustering. In: Proc. of the ACM SIGMOD 2000, 2000
  • [3] Guha S, Rastogi R, Shim K. CURE: An Efficient Clustering Algorithm for Large Databases. In: Proc. ACM SIGMOD, June 1998. 73-84
  • [4] Knorr E, Ng R. Algorithms for Mining Distance Based Outliers in Large Databases. In: Proc. Very Large Data Bases Conf., Aug. 1998. 392-403
  • [5] Barnett V, Lewis T. Outliers in Statistical Data. John Wiley & Sons, 1994
  • [6] Scott D. Multivariate Density Estimation: Theory, Practice and Visualization. Wiley and Sons, 1992
  • [7] Wand M P, Jones M C. Kernel Smoothing. Monographs on Statistics and Applied Probability, Chapman and Hall, 1995
  • [8] Knorr E, Ng R. Algorithms for Mining Distance-based Outliers in Large Datasets. In: Proc. 1998 Int. Conf. Very Large Data Bases (VLDB98), New York, 1998(8): 392-403

Co-cited literature (30)

Citing literature (3)

Second-level citing literature (6)
