
Interactive Data Migration with a Data Cleaning Function and Its Application (cited by: 11)
Abstract: Most published work on data migration describes migration methods only in general terms and does not address data cleaning during the migration process. This paper proposes an interactive data migration technique with a data cleaning function. The technique integrates data migration tightly with data cleaning and provides an open rule library and an open algorithm library. By defining rules in the rule library and selecting suitable cleaning algorithms from the algorithm library, it migrates data from the original system to the new system flexibly and accurately while ensuring the quality of the data in the new system. The technique was applied in the re-engineering project of a medical insurance information system with good results, which indicates that it is feasible in practice.
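The abstract does not give the format of the rule library or the algorithm library, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that a rule maps one source field to one target field and names a cleaning algorithm, and that the algorithm library is a lookup table of functions. All names here (`ALGORITHMS`, `Rule`, `RULES`, `migrate`, and the field names) are hypothetical.

```python
from typing import Callable, Dict, List, NamedTuple

# Hypothetical algorithm library: algorithm name -> cleaning function.
ALGORITHMS: Dict[str, Callable[[str], str]] = {
    "strip": lambda s: s.strip(),
    "upper": lambda s: s.strip().upper(),
    "digits_only": lambda s: "".join(ch for ch in s if ch.isdigit()),
}


class Rule(NamedTuple):
    """One hypothetical migration rule: source field, target field, cleaner."""
    source_field: str
    target_field: str
    algorithm: str  # key into ALGORITHMS


# Hypothetical rule library, which a user would edit interactively.
RULES: List[Rule] = [
    Rule("name", "patient_name", "strip"),
    Rule("sex", "gender", "upper"),
    Rule("id_no", "insurance_id", "digits_only"),
]


def migrate(record: Dict[str, str], rules: List[Rule] = RULES) -> Dict[str, str]:
    """Apply each rule: read the source field, clean it, write the target field."""
    return {
        rule.target_field: ALGORITHMS[rule.algorithm](record.get(rule.source_field, ""))
        for rule in rules
    }


legacy = {"name": "  Zhang San ", "sex": "m", "id_no": "32-0102 1234"}
print(migrate(legacy))
```

Keeping rules as data and algorithms as a separate table is one way to make both libraries "open" in the sense the abstract describes: a new cleaning step is added by registering a function and referencing it from a rule, without touching the migration loop.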
Authors: CHEN Wei (陈伟), DING Qiulin (丁秋林)
Source: Journal of Jilin University (Information Science Edition), CAS, 2004, No. 2, pp. 148-153 (6 pages)
Keywords: system conversion; data cleaning; data migration; data quality

