The processor and the main memory in the traditional computing system cannot satisfy the requirements of the emerging large-scale applications in terms of computing power and memory capacity.Tackling with such challen...The processor and the main memory in the traditional computing system cannot satisfy the requirements of the emerging large-scale applications in terms of computing power and memory capacity.Tackling with such challenges,heterogeneous computing systems,which consist of specialized accelerators and high-performance storage devices,have demonstrated their superiority in various computing domains.However,data migration between the accelerators and the storage devices becomes the major performance bottleneck in such computing systems owing to the long data path imposed by the stale Von Neumann architecture.There already exist plenty of discussions on the software and hardware techniques to resolve the data migration issues in the heterogeneous computing system.In this paper,we present a survey of these techniques with respect to the system designs,architectural innovations,and application-level optimizations.We expect our survey would aid the development of the research community and inspire the researchers,who are interested in the relevant areas.展开更多
The emerging memory technologies, such as phase change memory (PCM), provide chances for high- performance storage of I/O-intensive applications. However, traditional software stack and hardware architecture need to...The emerging memory technologies, such as phase change memory (PCM), provide chances for high- performance storage of I/O-intensive applications. However, traditional software stack and hardware architecture need to be optimized to enhance I/O efficiency. In addition, narrowing the distance between computation and storage reduces the number of I/O requests and has become a popular research direction. This paper presents a novel PCM- based storage system. It consists of the in-storage processing enabled file system (ISPFS) and the configurable parallel computation fabric in storage, which is called an in-storage processing (ISP) engine. On one hand, ISPFS takes full advantage of non-volatile memory (NVM)'s characteristics, and reduces software overhead and data copies to provide low-latency high-performance random access. On the other hand, ISPFS passes ISP instructions through a command file and invokes the ISP engine to deal with I/O-intensive tasks. Extensive experiments are performed on the prototype system. The results indicate that ISPFS achieves 2 to 10 times throughput compared to EXT4. Our ISP solution also reduces the number of I/O requests by 97% and is 19 times more efficient than software implementation for I/O-intensive applications.展开更多
基金supported by Peking University start-up package(7100603645).
文摘The processor and the main memory in the traditional computing system cannot satisfy the requirements of the emerging large-scale applications in terms of computing power and memory capacity.Tackling with such challenges,heterogeneous computing systems,which consist of specialized accelerators and high-performance storage devices,have demonstrated their superiority in various computing domains.However,data migration between the accelerators and the storage devices becomes the major performance bottleneck in such computing systems owing to the long data path imposed by the stale Von Neumann architecture.There already exist plenty of discussions on the software and hardware techniques to resolve the data migration issues in the heterogeneous computing system.In this paper,we present a survey of these techniques with respect to the system designs,architectural innovations,and application-level optimizations.We expect our survey would aid the development of the research community and inspire the researchers,who are interested in the relevant areas.
基金Project supported by the National Basic Research Program of China(No.2017YFA0206101)the National Defense Innovation Fund of Chinese Academy of Sciences(No.CXJJ-16M106)+1 种基金the "Strategic Priority Research Program" of the Chinese Academy of Sciences(No.XDA09020402)the Science and Technology Council of Shanghai,China(Nos.14DZ2294900,13ZR1447200,and 14ZR1447500)
文摘The emerging memory technologies, such as phase change memory (PCM), provide chances for high- performance storage of I/O-intensive applications. However, traditional software stack and hardware architecture need to be optimized to enhance I/O efficiency. In addition, narrowing the distance between computation and storage reduces the number of I/O requests and has become a popular research direction. This paper presents a novel PCM- based storage system. It consists of the in-storage processing enabled file system (ISPFS) and the configurable parallel computation fabric in storage, which is called an in-storage processing (ISP) engine. On one hand, ISPFS takes full advantage of non-volatile memory (NVM)'s characteristics, and reduces software overhead and data copies to provide low-latency high-performance random access. On the other hand, ISPFS passes ISP instructions through a command file and invokes the ISP engine to deal with I/O-intensive tasks. Extensive experiments are performed on the prototype system. The results indicate that ISPFS achieves 2 to 10 times throughput compared to EXT4. Our ISP solution also reduces the number of I/O requests by 97% and is 19 times more efficient than software implementation for I/O-intensive applications.