摘要
数据副本管理是云存储系统的一个重要组成部分,对提高系统的可靠性和性能具有重要意义。一般而言,云计算环境中数据副本数目越少,其传输成本则愈大;而副本过多,存储成本又随之增加,可能导致总成本上升。从降低数据管理成本的角度,在权衡存储成本与传输成本的基础上研究面向最小成本的数据副本管理策略,主要包括:数据管理成本模型、创建副本必要性测试以及近似最小成本的副本布局策略等。以Amazon云平台数据管理成本模型为例进行实验,结果表明:面向最小成本的副本管理策略在满足用户响应时间等需求的同时,可以有效地降低数据中心的管理成本,推动企业(用户)积极运用云计算平台管理企业数据,促进云计算环境的和谐发展。
Data replica management is an important component in cloud storage system, which is important for improving the system reliability and high performance. In general, if the number of replicas increase, the transfer cost will be declined because of the data can transfer more effectively; but the storage cost is becoming large because of the existence of additional replicas. Aimed to reduce the cost of data management, this paper proposed a minimum-cost based data replication strategy in balancing storage cost and transfer cost, including the data management cost model, the necessity of adding data replica and an approximate algorithm that can automatically decide the number and their store places. Both the theoretical analysis and simulations conducted on general (random) data sets as well as specific real world applications with Amazon's cost model show that the minimum-cost replica strategy is close to or even the same as the minimum cost benchmark and the efficiency is very high for practical runtime utilization in the cloud. On the other side, this research can promote the enterprise (user) actively using cloud computing platform and the harmonious development of cloud computing environment.
出处
《计算机科学》
CSCD
北大核心
2014年第10期154-159,190,共7页
Computer Science
基金
山东省高等学校科技计划项目(J12LN33)
山东大学高新领域高校院所自主创新计划(201303015)
山东财经大学博士基金项目(2010034)资助
关键词
云计算
副本管理
最小成本
Cloud computing, Replication management, Minimum-cost