摘要
作为数据仓库系统的关键部件,ETL完成数据抽取、清洗、转换和装载的工作,它是构建数据仓库的重要环节,同时也是构建数据仓库过程中出现问题最多的环节,所以针对这点,该文给出了一个可靠的同时易于扩展的ETL策略和架构。文章首先简单地介绍了数据仓库技术和ETL技术,包括ETL的相关概念、ETL在数据仓库中的功能和重要地位;然后重点介绍了这种ETL的具体策略和架构设计。
As the key component in the data warehouse system,ETL supports the processing about data extracting,cleaning,transforming and loading.It is one of the most important steps in building the data warehouse,at the same time,there are a lot of bugs about ETL in building the data warehouse.To avoid those potential bugs,this paper puts forward a reliable and easily distensible strategy and architecture of ETL.This paper briefly introduces the technology of data warehouse and ETL,including the concepts related with data warehouse and ETL,ETL's functions and the important location in data warehouse system,and then it emphasizes the details about this strategy and design of architecture of ETL.
出处
《计算机工程与应用》
CSCD
北大核心
2005年第10期172-174,229,共4页
Computer Engineering and Applications