A Spark Scheduling Strategy for Heterogeneous Cluster (Cited by: 1)

Abstract: As a mainstream distributed computing system, Spark is used to solve increasingly complex tasks. However, Spark's native scheduling strategy assumes a homogeneous cluster and is less effective on a heterogeneous one. This study aims to find a more effective task scheduling strategy and add it to the Spark source code. After investigating Spark's scheduling principles and mechanisms, we propose a stratifying algorithm and a node scheduling algorithm to optimize the native scheduling strategy. The new strategy calculates a static level for each node and also considers dynamic factors, such as the length of running tasks and the CPU usage of worker nodes. In a series of comparative experiments on a heterogeneous cluster, the new strategy achieved shorter running time and lower CPU usage than the original Spark strategy, which indicates that the new scheduling strategy is more effective.
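The abstract does not give the actual formulas, so the following is only a minimal sketch of how a static node level might be combined with dynamic factors when picking a worker. The `Node` fields, the `node_score` weighting, and the sample values are all assumptions made for illustration and are not the authors' stratifying or node scheduling algorithm.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Node:
    name: str
    static_level: float  # hypothetical: higher means stronger hardware
    running_tasks: int   # hypothetical stand-in for "length of running tasks"
    cpu_usage: float     # fraction of CPU in use, in [0, 1]


def node_score(node: Node) -> float:
    """Illustrative score: hardware level scaled by idle CPU and queue length.

    This weighting is an assumption, not the formula from the paper.
    """
    idle_cpu = 1.0 - node.cpu_usage
    return node.static_level * idle_cpu / (1 + node.running_tasks)


def pick_node(nodes: List[Node]) -> Node:
    """Return the node with the highest illustrative score."""
    return max(nodes, key=node_score)


# Made-up sample cluster, purely for demonstration.
nodes = [
    Node("fast-busy", static_level=4.0, running_tasks=3, cpu_usage=0.9),
    Node("slow-idle", static_level=1.0, running_tasks=0, cpu_usage=0.1),
    Node("mid", static_level=2.0, running_tasks=1, cpu_usage=0.5),
]
chosen = pick_node(nodes)
print(chosen.name)
```

With these made-up values the lightly loaded node is chosen even though its static level is the lowest, which is the kind of trade-off a strategy that weighs both static and dynamic factors has to make.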

Authors: Xuewen Zhang, Zhonghao Li, Gongshen Liu, Jiajun Xu, Tiankai Xie, Jan Pan Nees

Affiliations: School of Electronic Information and Electrical Engineering; Eberly College of Science

Source: Computers, Materials & Continua (SCIE, EI), 2018, No. 6, pp. 405-417 (13 pages)

Funding: This work is supported by the National Natural Science Foundation of China (Grant Nos. 61472248, 61772337) and the SJTU-Shanghai Songheng Content Analysis Joint Lab.

Keywords: Spark; optimize scheduling; stratifying algorithm; performance optimization

Classification: TP3 [Automation and Computer Technology: Computer Science and Technology]


