A Resource Scheduling Mechanism of Hadoop YARN (cited by: 3)
Abstract: YARN is the resource management system widely used in Hadoop. It supports multiple computing frameworks, including MapReduce, Spark, and Storm, and has become a core component of the big data ecosystem. However, the existing Hadoop YARN resource schedulers guarantee resources through a mechanism based on resource reservation, which produces resource fragmentation and wastes resources. To improve cluster resource utilization and throughput, this paper proposes a resource allocation mechanism based on reservation and backfill. The mechanism decides whether to reserve resources according to job priority, and introduces a backfill strategy that lets other jobs use the reserved resources without affecting the execution of the job that holds the reservation. Experiments show that the reservation-and-backfill scheduling mechanism effectively improves the resource utilization and throughput of a Hadoop YARN cluster.
Source: Computer and Modernization (《计算机与现代化》), 2017, No. 11, pp. 29-34 (6 pages)
Keywords: Hadoop YARN; big data; resource scheduling; reservation backfill
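
The abstract describes the mechanism only at a high level, so the following is a minimal sketch of the general reservation-and-backfill idea, not the paper's algorithm. It assumes a single resource dimension, known runtime estimates, one reservation at a time, and a priority threshold for reserving. The names `Job`, `schedule`, and `reserve_priority` are hypothetical and do not correspond to any YARN API.

```python
from dataclasses import dataclass


@dataclass
class Job:
    name: str
    demand: int    # resource units requested (one dimension, for simplicity)
    runtime: int   # estimated runtime in ticks (assumed to be known)
    priority: int  # larger value = higher priority


def schedule(capacity, running, queue, now=0, reserve_priority=5):
    """Run one scheduling pass (illustrative only).

    running: list of (finish_time, demand) for jobs already executing.
    Returns (started_jobs, reservation), where reservation is None or
    a (start_time, job) pair.
    """
    running = list(running)
    free = capacity - sum(demand for _, demand in running)
    started = []
    reservation = None
    spare = 0  # units left over at the reserved start time

    for job in sorted(queue, key=lambda j: -j.priority):
        if job.demand > free:
            # Assumption: only a job at or above the priority threshold may
            # reserve, and this sketch holds at most one reservation.
            if reservation is None and job.priority >= reserve_priority:
                avail = free
                for finish, demand in sorted(running):
                    avail += demand  # resources released by this finish time
                    if avail >= job.demand:
                        reservation = (finish, job)
                        spare = avail - job.demand
                        break
            continue

        if reservation is not None:
            # Backfill rule: the job must either finish before the reserved
            # start time or fit into what the reserved job leaves unused.
            ends_in_time = now + job.runtime <= reservation[0]
            if not ends_in_time:
                if job.demand > spare:
                    continue  # would delay the reserved job
                spare -= job.demand

        started.append(job)
        free -= job.demand
        running.append((now + job.runtime, job.demand))

    return started, reservation
```

For example, with 10 units of capacity and one running job holding 6 units until tick 5, a priority-9 job that needs 8 units gets a reservation at tick 5. A short 3-unit job that finishes by tick 4 is backfilled, while a long 3-unit job is held back because it would still be running when the reservation starts.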
