
A Domain Knowledge Based Data Cleaning Framework
Abstract: Preparing data for applications such as data mining raises a range of data cleaning problems, and successful data cleaning often depends on domain knowledge. This paper presents a domain knowledge based data cleaning framework. With the support of domain experts, it derives cleaning rules from sampled data; an expert system engine then applies this knowledge to clean the entire data set. The framework is self-learning and continually optimizes its cleaning rules during the cleaning process. Its knowledge base is easy to extend, which makes the framework fairly general.
Source: Information Technology and Informatization (《信息技术与信息化》), 2005, No. 5, pp. 100-103 (4 pages)
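
The workflow outlined in the abstract (rules derived from a sample, then applied to the whole data set, then refined during cleaning) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and not the paper's implementation: the `Rule` class, the `clean` and `reprioritize` functions, the two example rules, and the records are all hypothetical. The rule matching is a plain linear scan rather than a real expert system engine, and the "self-learning" step is reduced to reordering rules by how often they fired.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Record = Dict[str, str]


@dataclass
class Rule:
    """A cleaning rule: if `condition` holds for a record, `action` repairs it."""
    name: str
    condition: Callable[[Record], bool]
    action: Callable[[Record], Record]
    hits: int = 0  # how often the rule fired; feeds the toy refinement step


def clean(records: List[Record], rules: List[Rule]) -> List[Record]:
    """Apply each matching rule, in order, to a copy of every record."""
    cleaned = []
    for record in records:
        current = dict(record)  # copy so the input records stay untouched
        for rule in rules:
            if rule.condition(current):
                current = rule.action(current)
                rule.hits += 1
        cleaned.append(current)
    return cleaned


def reprioritize(rules: List[Rule]) -> List[Rule]:
    """Toy stand-in for rule optimization: most frequently fired rules first."""
    return sorted(rules, key=lambda r: r.hits, reverse=True)


# Hypothetical rules that a domain expert might derive from a data sample.
rules = [
    Rule(
        name="strip_whitespace",
        condition=lambda r: any(v != v.strip() for v in r.values()),
        action=lambda r: {k: v.strip() for k, v in r.items()},
    ),
    Rule(
        name="normalize_gender",
        condition=lambda r: r.get("gender") in {"m", "male", "Male"},
        action=lambda r: {**r, "gender": "M"},
    ),
]

# Hypothetical "whole data set" to be cleaned with the sampled rules.
records = [
    {"name": " Li Wei ", "gender": "male"},
    {"name": "Zhang San", "gender": "M"},
    {"name": "Wang Fang", "gender": "m"},
]

cleaned = clean(records, rules)
ordered = reprioritize(rules)
```

Keeping rules as data (a condition paired with an action) means that extending the knowledge base amounts to appending another `Rule`, which loosely mirrors the extensibility the abstract claims for the framework.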

