期刊文献+

数据清洗技术研究 被引量:7

Research of the Data Cleaning Technique
在线阅读 下载PDF
导出
摘要 概括介绍了各种文献中对数据清洗技术的描述和定义,并简要介绍了几种能自动识别数据集中潜在错误的异常检测的方法,给出了在现实数据集中进行实验的结果,讨论了数据清洗问题未来的研究方向。 This paper gives an overview of the descriptions and definitions about data cleaning technique in existing literatures. And briefly introduced several error detection methods to automatically identify potential errors in data sets. Some brief experimental results supporting the use of such methods are given. Finally the future research directions necessary to address the data cleaning problems are discussed.
出处 《山东科技大学学报(自然科学版)》 CAS 2004年第2期55-57,共3页 Journal of Shandong University of Science and Technology(Natural Science)
关键词 数据清洗 异常检测 模式 聚类 关联规则 data cleaning error detection pattern clustering association rules
  • 相关文献

参考文献8

  • 1Galhardas H, Florescu D.An Extensible Framework for Data Cleaning[R].Institute National de Recherche en Informatique et en Automatique, Technical Report, 1999.
  • 2Hernandez MA and Stolfo JS.Real-world Data is Dirty:Data Cleansing and The Merge/Purge Problem[J].Journal of Data Mining and Knowledge Discovery,1998,(2).
  • 3Kimball R.Dealing with dirty Data[J].DBMS, 1996,9(10) :55.
  • 4Guyon I,Matic N and Vapnik V.Discovering Information Patterns and Data Cleaning.In Advances in Knowledge Discovery in Data Mining[M].MIT Press/AAAI Press,1996.
  • 5Simoudis E,Livezey B and Kerber R.Using Recon for Data Cleaning[A].Proceedings of KDD[C].1995.282-287.
  • 6Levitin A and Redman T.A Model of the data(life) cycles with application to quality[J].Information and Software Technology,1995,35(4): 217-223.
  • 7Jonathan I,Maletic Andrian Marcus.Data Cleansing: Beyond Integrity Analysis[J].Division of Computer Science 2000,(2).
  • 8Andrian Marcus, Jonathan I.Maletic.Utilizing Association Rules for the Identification of Errors in Data[R].Technical Report CS-00-04.

同被引文献58

引证文献7

二级引证文献56

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部