摘要
数据质量的定义、数据质量问题的来源、数据质量提高途径等基本问题,是数据质量控制研究的基础。分析了现有数据质量定义的局限性和片面性,依据国际标准化组织对质量的定义,重新对其进行了定义。将数据质量问题来源分为四种情况:数据录入错误、测量错误、简化错误和数据集成错误。归纳了数据质量提高的具体手段,指出数据质量控制需综合应用管理和技术手段。校正了对以上基本问题的认识偏差,为更深入的数据质量研究提供了依据。
Some basic problems of data quality, such as definition, error source, improving approach, etc.are foundation for data quality control.The limitations and one-sidedness of existing data quality definitions are analyzed.Data quality is redefined according as quality definition coming from ISO.Data error sources are divided into four instances:data entry errors, measurement errors, distillation errors and data integration errors.Improving approaches of data quality are summarized, and both management and technology are needed in data quality control.Some errors in above basic problems are advised, and some bases for further research are given.
出处
《微计算机信息》
2010年第9期12-14,共3页
Control & Automation