Abstract
Hadoop is an open-source software platform for large-scale distributed computing and is widely used in cloud computing. Starting from the overall architecture of the Hadoop distributed file system, this paper describes four core aspects: distributed data storage, distributed task assignment, distributed parallel computing, and distributed databases. It also discusses the working principle of HDFS and its file operation workflow, as well as the working principle and computation process of Map/Reduce. The aim is to give developers an in-depth understanding of how the Hadoop architecture works and how it is implemented, and to provide a useful reference for application development in a cloud computing setting.
Source
《计算机与网络》
2012, Issue 2, pp. 65-67 (3 pages in total)
Computer & Network