期刊文献+

可交互数据清洗系统研究

在线阅读 下载PDF
导出
摘要 在数据仓库构建和数据集成中,面临着大量的数据清洗任务。要把数据清洗过程做得灵活并不容易,已有的工具过于依赖特定的应用。该文分析了数据质量中存在的问题,数据清洗技术的现状、发展趋势,同时提出了一个可交互的数据清洗框架。
出处 《工程地质计算机应用》 2004年第2期18-21,29,共5页 Engineering Geology Computer Application
  • 相关文献

参考文献2

二级参考文献31

  • 1Bitton D,DeWitt D J.Duplicate record elimination in large data files.ACM Transactions on Database Systems,1983,8(2): 255~265
  • 2Monge A E,Elkan C P.An efficient domain-independent algorithm for detecting approximately duplicate database records.1997
  • 3Hernandez M,Stolfo S.The merge/purge problem for large databases.In:Proc.of the ACM SIGMOD International Conference on Management of Data,May 1995.127~138
  • 4Monge A E,Elkan C P.The field matching problem: Algorithms and applications.In: Proc.of the 2nd Int.Conf.on Knowledge Discovery and Data Mining,1996.267~270
  • 5Smith T F,Waterman M S.Identification of common molecular subsequences.Journal of Molecular Bilogy,1981,147:195~197
  • 6Lowrance R,Wagner R A.An extension of the string-to-string correction problem.J.ACM,1975,22(2): 177~183
  • 7Tarjian R E.Effiency of a good but not linear set union algorithm.Journal of the ACM,1975,22(2):215~225
  • 8Aebi, D., Perrochon, L. Towards improving data quality. In: Sarda, N.L., ed. Proceedings of the International Conference on Information Systems and Management of Data. Delhi, 1993. 273~281.
  • 9Wang, R.Y., Kon, H.B., Madnick, S.E. Data quality requirements analysis and modeling. In: Proceedings of the 9th International Conference on Data Engineering. Vienna: IEEE Computer Society, 1993. 670~677.
  • 10Rahm, E., Do, H.H. Data cleaning: problems and current approaches. IEEE Data Engineering Bulletin, 2000,23(4):3~13.

共引文献298

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部