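Abstract: A SiP microsystem is a highly integrated system that may contain one or more DSPs, NOR Flash and DDR memory, AI accelerator chips, and, in some complex microsystems, FPGA chips. Because multiple micro-components are integrated and the chips are interconnected, traditional methods for testing a single micro-component are not suitable for testing a microsystem. A test method for DSP micro-components is proposed; the test system consists of a dedicated test board, a debuggable PC test environment, and JTAG communication. Compared with testing a single bare DSP die, it enables fast and stable performance testing of DSP micro-components and meets the needs of high-volume production testing.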
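Abstract: Tensor transposition is a basic tensor operation primitive that is widely used in signal processing, scientific computing, and deep learning, and it plays an important role in tensor-data-intensive applications and high-performance computing. As energy efficiency becomes increasingly important in high-performance computing systems, accelerators based on digital signal processors (DSPs) have been integrated into general-purpose computing systems. However, conventional tensor transposition libraries designed for multi-core CPUs and GPUs cannot fully adapt to DSP architectures because of architectural differences. On the one hand, the vectorization potential of DSP architectures has not been fully exploited; on the other hand, their complex on-chip memory system and multi-level shared memory structure pose significant challenges for parallel tensor program design. Targeting the architectural characteristics of domestic (Chinese) multi-core DSPs, this work proposes the ftmTT algorithm and designs and implements a general tensor transposition library for multi-core DSP architectures. ftmTT uses efficient memory access patterns adapted to the DSP architecture to fully exploit its parallelization and vectorization potential. Its core innovations are: 1) a blocking strategy that converts high-dimensional tensor transposition into the matrix transposition kernel operations provided by the multi-core DSP platform; 2) a memory-access coalescing scheme for tensor data blocks, based on DMA point-to-point transfers, that reduces data movement overhead; 3) a double-buffering design that asynchronously overlaps transposition computation with DMA transfers so that communication is hidden behind computation, ultimately achieving high-performance parallel tensor transposition on multi-core DSPs. Experiments on the domestic multi-core DSP platform FT-M7032 show that ftmTT achieves up to 75.96% of the theoretical bandwidth and 99.23% of the STREAM bandwidth of the FT-M7032 platform.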
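The blocking idea in innovation 1) can be illustrated with a minimal Python sketch. This is not the ftmTT implementation: the function names (`transpose_tile`, `blocked_transpose_2d`, `permute_021`), the tile size, and the choice of the (d0, d1, d2) -> (d0, d2, d1) permutation are assumptions made for illustration, and the sketch models neither DMA transfers, double buffering, nor vectorization. It only shows how a higher-dimensional permutation can be reduced to repeated calls to a small matrix-transpose kernel on tiles.

```python
from typing import List, Sequence, Tuple


def transpose_tile(src: Sequence[float], dst: List[float],
                   rows: int, cols: int,
                   r0: int, c0: int, tr: int, tc: int) -> None:
    """Stand-in for a platform matrix-transpose kernel (hypothetical).

    Copies the tr x tc tile of the row-major rows x cols matrix `src`
    starting at (r0, c0) into the row-major cols x rows matrix `dst`,
    transposed.
    """
    for r in range(r0, r0 + tr):
        for c in range(c0, c0 + tc):
            dst[c * rows + r] = src[r * cols + c]


def blocked_transpose_2d(src: Sequence[float], rows: int, cols: int,
                         tile: int = 4) -> List[float]:
    """Transpose a row-major matrix tile by tile."""
    dst = [0] * (rows * cols)
    for r0 in range(0, rows, tile):
        tr = min(tile, rows - r0)          # edge tiles may be smaller
        for c0 in range(0, cols, tile):
            tc = min(tile, cols - c0)
            transpose_tile(src, dst, rows, cols, r0, c0, tr, tc)
    return dst


def permute_021(src: Sequence[float], shape: Tuple[int, int, int],
                tile: int = 4) -> List[float]:
    """Permute a row-major 3-D tensor from (d0, d1, d2) to (d0, d2, d1).

    Each of the d0 slabs is an independent d1 x d2 matrix, so the tensor
    permutation becomes d0 blocked matrix transposes.
    """
    d0, d1, d2 = shape
    out: List[float] = []
    for b in range(d0):
        slab = src[b * d1 * d2:(b + 1) * d1 * d2]
        out.extend(blocked_transpose_2d(slab, d1, d2, tile))
    return out
```

In a real multi-core setting the slabs and tiles would be distributed across cores and each tile would be staged through on-chip memory; here everything runs sequentially on flat Python lists.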
Funding: Supported by the National High-Tech Research Project (GJSCB-HFGDY-2024-004), the National Natural Science Foundation of China (12402305), the Postdoctoral Fellowship Program of CPSF (GZC20232200), the China Postdoctoral Science Foundation (2024M762703), and the Sichuan Science and Technology Program (2025ZNSFSC1352).
Abstract: The Carter model is used to characterize the dynamic behaviors of fracture growth and fracturing fluid leakoff. A thermo-fluid coupling temperature response forward model is built considering the fluid flow and heat transfer in the wellbore, fracture and reservoir. The influences of fracturing parameters and fracture parameters on the responses of distributed temperature sensing (DTS) are analyzed, and a diagnosis method for fracture parameters is presented based on the simulated annealing algorithm. A field case study is introduced to verify the model's reliability. Typical V-shaped characteristics can be observed in the DTS responses during the multi-cluster fracturing process, with locations corresponding to the hydraulic fractures. The V-shape depth is shallower for a higher injection rate and longer fracturing and shut-in time. Also, the V-shape is wider for a higher fracture-surface leakoff coefficient, longer fracturing time and smaller fracture width. Additionally, the cooling effect near the wellbore continues to spread into the reservoir during the shut-in period, causing the DTS temperature to decrease instead of rise. Real-time monitoring and interpretation of DTS temperature data can help in understanding fracture propagation during the fracturing operation, so that immediate measures can be taken to improve fracturing performance.
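For readers unfamiliar with the Carter model, the sketch below evaluates it in its standard textbook form: the leakoff velocity at a point on the fracture face is u_L = C_L / sqrt(t - tau), where tau is the time at which that point was first exposed to fluid, and the cumulative leakoff volume per unit area is 2 C_L sqrt(t - tau) plus an optional spurt loss. The abstract does not state the exact formulation used in the paper, so the function names, the SI units, and the inclusion of a spurt-loss term are assumptions.

```python
import math


def carter_leakoff_velocity(c_l: float, t: float, tau: float) -> float:
    """Carter leakoff velocity u_L = C_L / sqrt(t - tau).

    c_l : leakoff coefficient, m/s^0.5 (assumed units)
    t   : current time, s
    tau : time at which the fracture face was first exposed, s
    """
    if t <= tau:
        raise ValueError("leakoff velocity is defined only for t > tau")
    return c_l / math.sqrt(t - tau)


def carter_cumulative_leakoff(c_l: float, t: float, tau: float,
                              spurt: float = 0.0) -> float:
    """Cumulative leakoff volume per unit area, m (assumed units).

    Integrating u_L from tau to t gives 2 * C_L * sqrt(t - tau);
    `spurt` is an optional instantaneous spurt-loss term.
    """
    if t < tau:
        return 0.0  # the face has not been exposed yet
    return 2.0 * c_l * math.sqrt(t - tau) + spurt
```

Because the velocity decays with the square root of exposure time, fracture faces opened earlier leak more slowly at any given moment, which is the history dependence the model contributes to the coupled temperature simulation.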
Funding: Financial support from the Intergovernmental International Science and Technology Innovation Cooperation Key Project (2022YFE0128400), the National Natural Science Foundation of China (42307209), the Shanghai Pujiang Program (2022PJD076), the State Energy Center for Shale Oil Research and Development (33550000-22-ZC0613-0365), and the Natural Science Foundation of Qinghai Province (No. 2024-ZJ-717).
Abstract: Shale oil reservoirs are generally characterized by well-developed bedding planes, and multi-cluster fracturing is the most effective technique to achieve stable shale oil production. In this paper, a multi-cluster fracturing model for a horizontal well in shale with high-density bedding planes is established. The fracture morphology, fracture geometry, fracturing area and multiple-fracture propagation mechanism are analyzed under simultaneous fracturing, sequential fracturing, and alternative fracturing. Results show that in the case of small cluster spacing and three clusters, the growth of the middle fracture is inhibited and it develops along the bedding planes under both simultaneous fracturing and alternative fracturing. For sequential fracturing, a longer interval between successive fracturing operations promotes the deflection of the later fracture toward the pre-existing fractures through the bedding planes. The reactivation of the bedding planes can promote the extension of the fracturing area. Increasing the injection rate and the number of clusters promotes the activation of bedding planes. However, it is preferable to reduce the number of clusters to obtain more main fractures. Compared with modified alternating fracturing and cyclic alternating fracturing, alternating shut-in fracturing creates more main fractures in the direction of the maximum in-situ stress. The fracturing efficiency for high-density layered shale is ranked as simultaneous fracturing > alternative fracturing > sequential fracturing.
Funding: Supported by the National Natural Science Foundation of China, No. 32270768, No. 82273970, No. 32070726, and No. 82370715; the National Key R&D Program of China, No. 2023YFC2507904; and the Innovation Group Project of Hubei Province, No. 2023AFA026.
Abstract: Hepatitis B virus remains a major cause of cirrhosis and hepatocellular carcinoma, with genetic polymorphisms and mutations influencing immune responses and disease progression. Nguyen et al. present novel findings on specific human leukocyte antigen (HLA) alleles, including rs2856718 of HLA-DQ and rs3077 and rs9277535 of HLA-DP, which may predispose individuals to cirrhosis and liver cancer, based on multi-clustering analysis. Here, we discuss the feasibility of this approach and identify key areas for further investigation, aiming to offer insights for advancing clinical practice and research in liver disease and related cancers.
Funding: Supported by the National Natural Science Foundation of China (Grant No. 62571079), the Technological Innovation Projects in the Field of Artificial Intelligence in Liaoning Province (Grant No. 2023JH26/10300011), the Basic Scientific Research Projects in the Department of Education of Liaoning Province (Grant No. LJ212410152049), and the Liaoning Provincial Science and Technology Plan Joint Project (Grant No. 2025-BSLH-041).
Abstract: The human brain is a complex intelligent system composed of tens of billions of neurons interconnected through synapses, and its intricate network structure has consistently attracted numerous scientists to explore the mysteries of brain functions. However, most existing studies have only verified the biological mimicry characteristics of memristors at the single neuron-synapse level, and there is still a lack of research on memristors simulating synaptic coupling between neurons in multi-neuron networks. Based on this, this paper uses discrete memristors to couple dual discrete Rulkov neurons, and adds synaptic crosstalk between the two discrete memristors to form a neuronal network. A memristor-coupled dual-neuron map, called the Rulkov-memristor-Rulkov (R-M-R) map, is constructed to simulate synaptic connections between neurons in biological tissues. Then, the equilibrium points of the R-M-R map are studied. Subsequently, the effect of parameter variations on the dynamic performance of the R-M-R map is comprehensively analyzed using bifurcation diagrams, phase diagrams, the Lyapunov exponent spectrum (LEs), firing diagrams, and spectral entropy (SE) complexity algorithms. In the R-M-R map, diverse categories of periodic, chaotic, and hyperchaotic attractors, as well as different states of firing patterns, can be observed. Additionally, different types of state transitions and coexisting attractors are discovered. Finally, the feasibility of the model in digital circuits is verified using a DSP hardware platform. In this study, the coupling principle of biological neurons is simulated, the chaotic dynamic behavior of the R-M-R map is analyzed, and a foundation is laid for deciphering the complex working mechanisms of the brain.
Funding: Supported in part by the Science and Technology Development Fund (FDCT) of Macao (106/2016/A3), the National Natural Science Foundation of China (U1401240), Delta Electronics Inc., and the National Research Foundation (NRF) Singapore under the Corp Lab@University Scheme.
Abstract: A treelike hybrid multi-cluster tool is composed of both single-arm and dual-arm cluster tools with a treelike topology. Scheduling such a tool is challenging. For a hybrid treelike multi-cluster tool whose bottleneck individual tool is process-bound, this work aims at finding its optimal one-wafer cyclic schedule. It is modeled with Petri nets such that a one-wafer cyclic schedule is parameterized by its robots' waiting time. Based on the model, this work proves the existence of a one-wafer cyclic schedule that is easy to implement in industry. Then, computationally efficient algorithms are proposed to find the minimal cycle time and the optimal one-wafer cyclic schedule. Multi-cluster tool examples are given to illustrate the proposed approach. The use of the found schedules enables industrial multi-cluster tools to operate at their highest productivity.