Multi-Version Container Image Loading Method Based on Chunk Reuse (基于分片复用的多版本容器镜像加载方法) Cited by: 6

Container Image Deduplication Method Based on Chunking Reuse of Multi-versions
Abstract A container packages an application together with its supporting software and library files as an image, and the application is upgraded by publishing a new image version. As a result, successive versions share a large amount of identical data. Image loading is time-consuming and can stretch container startup from milliseconds to seconds or even minutes. Reusing the data shared between versions helps reduce loading time. Container images currently rely on inheritance and layered loading, which effectively reuse supporting software and library files, but there is no reliable mechanism yet for reusing data inside the application itself. This study proposes a multi-version container image loading method based on chunk reuse, which improves loading efficiency by reusing identical data across image versions. The core idea is to split the container image into fine-grained data chunks with a boundary-matching chunking method, use each chunk's hash value as its unique fingerprint, and search a B-tree for duplicate fingerprints to identify duplicate chunks and reduce data transfer. Experimental results show that the method speeds up container image loading by more than 5.8X.
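The chunk-and-fingerprint idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the boundary-matching rule, the hash function, or the index layout, so the rolling byte sum, the SHA-256 fingerprints, the plain dict standing in for the B-tree, and all names and constants (`split_chunks`, `plan_load`, `WINDOW`, `MASK`, `MIN_CHUNK`, `MAX_CHUNK`) are assumptions chosen only for illustration.

```python
import hashlib

# All constants are illustrative assumptions, not values from the paper.
WINDOW = 16           # bytes covered by the rolling sum
MASK = (1 << 6) - 1   # cut where the low 6 bits of the rolling sum are zero
MIN_CHUNK = 32        # never cut before this many bytes
MAX_CHUNK = 1024      # always cut at this many bytes


def split_chunks(data: bytes) -> list:
    """Split data at content-defined boundaries using a simple rolling sum."""
    chunks = []
    start = 0
    rolling = 0
    for i, byte in enumerate(data):
        rolling += byte
        if i - start >= WINDOW:
            # Drop the byte that just slid out of the window.
            rolling -= data[i - WINDOW]
        size = i - start + 1
        at_boundary = size >= MIN_CHUNK and (rolling & MASK) == 0
        if at_boundary or size >= MAX_CHUNK:
            chunks.append(data[start:i + 1])
            start = i + 1
            rolling = 0
    if start < len(data):
        chunks.append(data[start:])  # trailing partial chunk
    return chunks


def fingerprint(piece: bytes) -> str:
    """Use the chunk's SHA-256 digest as its fingerprint (assumed hash)."""
    return hashlib.sha256(piece).hexdigest()


def plan_load(image: bytes, local_store: dict):
    """Return the image recipe and the chunks that are not yet stored locally.

    local_store maps fingerprint -> chunk bytes; a dict stands in here for
    the B-tree index mentioned in the abstract.
    """
    recipe = []
    missing = {}
    for piece in split_chunks(image):
        fp = fingerprint(piece)
        recipe.append(fp)
        if fp not in local_store and fp not in missing:
            missing[fp] = piece  # only these chunks need to be transferred
    return recipe, missing
```

Under these assumptions, loading a new version amounts to calling `plan_load` with the store built from earlier versions, transferring only the chunks in `missing`, and reassembling the image by joining the stored chunks in `recipe` order. Because each boundary depends only on the bytes seen so far within a small window, data shared with an older version tends to produce the same chunks and fingerprints.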
Authors LU Zhi-Gang (陆志刚); XU Ji-Wei (徐继伟); HUANG Tao (黄涛) (Technology Center of Software Engineering, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China; State Key Laboratory of Computer Science (Institute of Software, Chinese Academy of Sciences), Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100190, China)
Source Journal of Software (《软件学报》), indexed in EI, CSCD, and the Peking University Core Journals list; 2020, No. 6, pp. 1875-1888 (14 pages)
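Funding National Key Research and Development Program of China (2017YFC0804407); National Natural Science Foundation of China (61602454, 61872344); Beijing Natural Science Foundation (4182070).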
Keywords container; Docker; image; deduplication
