期刊文献+
共找到10,311篇文章
< 1 2 250 >
每页显示 20 50 100
Towards optimized tensor code generation for deep learning on sunway many-core processor
1
作者 Mingzhen LI Changxi LIU +8 位作者 Jianjin LIAO Xuegui ZHENG Hailong YANG Rujun SUN Jun XU Lin GAN Guangwen YANG Zhongzhi LUAN Depei QIAN 《Frontiers of Computer Science》 SCIE EI CSCD 2024年第2期1-15,共15页
The flourish of deep learning frameworks and hardware platforms has been demanding an efficient compiler that can shield the diversity in both software and hardware in order to provide application portability.Among th... The flourish of deep learning frameworks and hardware platforms has been demanding an efficient compiler that can shield the diversity in both software and hardware in order to provide application portability.Among the existing deep learning compilers,TVM is well known for its efficiency in code generation and optimization across diverse hardware devices.In the meanwhile,the Sunway many-core processor renders itself as a competitive candidate for its attractive computational power in both scientific computing and deep learning workloads.This paper combines the trends in these two directions.Specifically,we propose swTVM that extends the original TVM to support ahead-of-time compilation for architecture requiring cross-compilation such as Sunway.In addition,we leverage the architecture features during the compilation such as core group for massive parallelism,DMA for high bandwidth memory transfer and local device memory for data locality,in order to generate efficient codes for deep learning workloads on Sunway.The experiment results show that the codes generated by swTVM achieve 1.79x improvement of inference latency on average compared to the state-of-the-art deep learning framework on Sunway,across eight representative benchmarks.This work is the first attempt from the compiler perspective to bridge the gap of deep learning and Sunway processor particularly with productivity and efficiency in mind.We believe this work will encourage more people to embrace the power of deep learning and Sunwaymany-coreprocessor. 展开更多
关键词 sunway processor deep learning compiler code generation performance optimization
原文传递
Cooperative Computing Techniques for a Deeply Fused and Heterogeneous Many-Core Processor Architecture 被引量:13
2
作者 郑方 李宏亮 +3 位作者 吕晖 过锋 许晓红 谢向辉 《Journal of Computer Science & Technology》 SCIE EI CSCD 2015年第1期145-162,共18页
Due to advances in semiconductor techniques, many-core processors have been widely used in high performance computing. However, many applications still cannot be carried out efficiently due to the memory wall, which h... Due to advances in semiconductor techniques, many-core processors have been widely used in high performance computing. However, many applications still cannot be carried out efficiently due to the memory wall, which has become a bottleneck in many-core processors. In this paper, we present a novel heterogeneous many-core processor architecture named deeply fused many-core (DFMC) for high performance computing systems. DFMC integrates management processing ele- ments (MPEs) and computing processing elements (CPEs), which are heterogeneous processor cores for different application features with a unified ISA (instruction set architecture), a unified execution model, and share-memory that supports cache coherence. The DFMC processor can alleviate the memory wall problem by combining a series of cooperative computing techniques of CPEs, such as multi-pattern data stream transfer, efficient register-level communication mechanism, and fast hardware synchronization technique. These techniques are able to improve on-chip data reuse and optimize memory access performance. This paper illustrates an implementation of a full system prototype based on FPGA with four MPEs and 256 CPEs. Our experimental results show that the effect of the cooperative computing techniques of CPEs is significant, with DGEMM (double-precision matrix multiplication) achieving an efficiency of 94%, FFT (fast Fourier transform) obtaining a performance of 207 GFLOPS and FDTD (finite-difference time-domain) obtaining a performance of 27 GFLOPS. 展开更多
关键词 heterogeneous many-core processor data stream transfer register-level communication mechanism hardwaresynchronization technique processor prototype
原文传递
Fault Tolerance Mechanism in Chip Many-Core Processors 被引量:1
3
作者 张磊 韩银和 +1 位作者 李华伟 李晓维 《Tsinghua Science and Technology》 SCIE EI CAS 2007年第S1期169-174,共6页
As semiconductor technology advances, there will be billions of transistors on a single chip. Chip many-core processors are emerging to take advantage of these greater transistor densities to deliver greater performan... As semiconductor technology advances, there will be billions of transistors on a single chip. Chip many-core processors are emerging to take advantage of these greater transistor densities to deliver greater performance. Effective fault tolerance techniques are essential to improve the yield of such complex chips. In this paper, a core-level redundancy scheme called N+M is proposed to improve N-core processors’ yield by providing M spare cores. In such architecture, topology is an important factor because it greatly affects the processors’ performance. The concept of logical topology and a topology reconfiguration problem are introduced, which is able to transparently provide target topology with lowest performance degradation as the presence of faulty cores on-chip. A row rippling and column stealing (RRCS) algorithm is also proposed. Results show that PRCS can give solutions with average 13.8% degradation with negligible computing time. 展开更多
关键词 chip many-core processors YIELD fault tolerance RECONFIGURATION NETWORK-ON-CHIP
原文传递
Parallelization and sustainability of distributed genetic algorithms on many-core processors
4
作者 Yuji Sato Mikiko Sato 《International Journal of Intelligent Computing and Cybernetics》 EI 2014年第1期2-23,共22页
Purpose–The purpose of this paper is to propose a fault-tolerant technology for increasing the durability of application programs when evolutionary computation is performed by fast parallel processing on many-core pr... Purpose–The purpose of this paper is to propose a fault-tolerant technology for increasing the durability of application programs when evolutionary computation is performed by fast parallel processing on many-core processors such as graphics processing units(GPUs)and multi-core processors(MCPs).Design/methodology/approach–For distributed genetic algorithm(GA)models,the paper proposes a method where an island’s ID number is added to the header of data transferred by this island for use in fault detection.Findings–The paper has shown that the processing time of the proposed idea is practically negligible in applications and also shown that an optimal solution can be obtained even with a single stuck-at fault or a transient fault,and that increasing the number of parallel threads makes the system less susceptible to faults.Originality/value–The study described in this paper is a new approach to increase the sustainability of application program using distributed GA on GPUs and MCPs. 展开更多
关键词 Evolutionary computation Genetic algorithms Fault identification many-core processors PARALLELIZATION
在线阅读 下载PDF
Enabling Highly Efficient k-Means Computations on the SW26010 Many-Core Processor of Sunway TaihuLight 被引量:1
5
作者 Min Li Chao Yang +3 位作者 Qiao Sun Wen-Jing Ma Wen-Long Cao Yu-Long Ao 《Journal of Computer Science & Technology》 SCIE EI CSCD 2019年第1期77-93,共17页
With the advent of the big data era,the amounts of sampling data and the dimensions of data features are rapidly growing.It is highly desired to enable fast and efficient clustering of unlabeled samples based on featu... With the advent of the big data era,the amounts of sampling data and the dimensions of data features are rapidly growing.It is highly desired to enable fast and efficient clustering of unlabeled samples based on feature similarities. As a fundamental primitive for data clustering,the k-means operation is receiving increasingly more attentions today.To achieve high performance k-means computations on modern multi-core/many-core systems,we propose a matrix-based fused framework that can achieve high performance by conducting computations on a distance matrix and at the same time can improve the memory reuse through the fusion of the distance-matrix computation and the nearest centroids reduction.We implement and optimize the parallel k-means algorithm on the SW26010 many-core processor,which is the major horsepower of Sunway TaihuLight.In particular,we design a task mapping strategy for load-balanced task distribution,a data sharing scheme to reduce the memory footprint and a register blocking strategy to increase the data locality.Optimization techniques such as instruction reordering and double buffering are further applied to improve the sustained performance.Discussions on block-size tuning and performance modeling are also presented.We show by experiments on both randomly generated and real-world datasets that our parallel implementation of k-means on SW26010 can sustain a double-precision performance of over 348.1 Gflops,which is 46.9% of the peak performance and 84%of the theoretical performance upper bound on a single core group,and can achieve a nearly ideal scalability to the whole SW26010 processor of four core groups.Performance comparisons with the previous state-of-the-art on both CPU and GPU are also provided to show the superiority of our optimized k-means kernel. 展开更多
关键词 PARALLEL K-MEANS performance optimization SW26010 processor Sunway TaihuLight
原文传递
Zuchongzhi-3 Sets New Benchmark with 105-Qubit Superconducting Quantum Processor
6
作者 LIU Danxu GE Shuyun WU Yuyang 《Bulletin of the Chinese Academy of Sciences》 2025年第1期55-56,共2页
A team of researchers from the University of Science and Technology of China(USTC)of the Chinese Academy of Sciences(CAS)and its partners have made significant advancements in random quantum circuit sampling with Zuch... A team of researchers from the University of Science and Technology of China(USTC)of the Chinese Academy of Sciences(CAS)and its partners have made significant advancements in random quantum circuit sampling with Zuchongzhi-3,a superconducting quantum computing prototype featuring 105 qubits and 182 couplers. 展开更多
关键词 quantum circuit sampling superconducting quantum computing prototype zuchongzhi superconducting quantum processor QUBITS COUPLERS
在线阅读 下载PDF
基于PowerMILL PostProcessor的海德汉iTNC530系统PLANE指令后置处理研究
7
作者 康晓崇 《机械研究与应用》 2025年第5期102-107,共6页
后置处理在计算机辅助制造(CAM)与数控加工之间起到关键的桥梁作用,其性能直接影响加工精度和效率。该文基于PowerMILL后处理编辑器开发了一个针对海德汉iTNC530系统的后处理器,旨在实现PLANE指令的自动生成,以适应复杂的多轴加工任务... 后置处理在计算机辅助制造(CAM)与数控加工之间起到关键的桥梁作用,其性能直接影响加工精度和效率。该文基于PowerMILL后处理编辑器开发了一个针对海德汉iTNC530系统的后处理器,旨在实现PLANE指令的自动生成,以适应复杂的多轴加工任务。文章详细描述了开发流程,包括刀具方向向量的提取、旋转角度的计算以及PLANE指令的生成,并结合具体案例展示了如何应用数学模型与旋转矩阵进行刀具路径的优化控制。仿真验证结果表明,所开发的后置处理器能够生成高精度的数控程序,提高了加工的自动化程度和稳定性,可以为多轴加工中的后置处理开发提供实践指导和技术参考。 展开更多
关键词 后置处理开发 海德汉iTNC530 PLANE指令 数学模型
在线阅读 下载PDF
A Scalable Interconnection Scheme in Many-Core Systems
8
作者 Allam Abumwais Mujahed Eleyat 《Computers, Materials & Continua》 SCIE EI 2023年第10期615-632,共18页
Recent architectures of multi-core systems may have a relatively large number of cores that typically ranges from tens to hundreds;therefore called many-core systems.Such systems require an efficient interconnection n... Recent architectures of multi-core systems may have a relatively large number of cores that typically ranges from tens to hundreds;therefore called many-core systems.Such systems require an efficient interconnection network that tries to address two major problems.First,the overhead of power and area cost and its effect on scalability.Second,high access latency is caused by multiple cores’simultaneous accesses of the same shared module.This paper presents an interconnection scheme called N-conjugate Shuffle Clusters(NCSC)based on multi-core multicluster architecture to reduce the overhead of the just mentioned problems.NCSC eliminated the need for router devices and their complexity and hence reduced the power and area costs.It also resigned and distributed the shared caches across the interconnection network to increase the ability for simultaneous access and hence reduce the access latency.For intra-cluster communication,Multi-port Content Addressable Memory(MPCAM)is used.The experimental results using four clusters and four cores each indicated that the average access latency for a write process is 1.14785±0.04532 ns which is nearly equal to the latency of a write operation in MPCAM.Moreover,it was demonstrated that the average read latency within a cluster is 1.26226±0.090591 ns and around 1.92738±0.139588 ns for read access between cores from different clusters. 展开更多
关键词 many-core MULTI-CORE N-conjugate shuffle multi-port content addressable memory interconnection network
在线阅读 下载PDF
Typhoon Case Comparison Analysis Between Heterogeneous Many-Core and Homogenous Multicore Supercomputing Platforms
9
作者 LIU Xin YU Xiaolin +5 位作者 ZHAO Haoran HAN Qiqi ZHANG Jie WANG Chengzhi MA Weiwei XU Da 《Journal of Ocean University of China》 SCIE CAS CSCD 2023年第2期324-334,共11页
In this paper,a typical experiment is carried out based on a high-resolution air-sea coupled model,namely,the coupled ocean-atmosphere-wave-sediment transport(COAWST)model,on both heterogeneous many-core(SW)and homoge... In this paper,a typical experiment is carried out based on a high-resolution air-sea coupled model,namely,the coupled ocean-atmosphere-wave-sediment transport(COAWST)model,on both heterogeneous many-core(SW)and homogenous multicore(Intel)supercomputing platforms.We construct a hindcast of Typhoon Lekima on both the SW and Intel platforms,compare the simulation results between these two platforms and compare the key elements of the atmospheric and ocean modules to reanalysis data.The comparative experiment in this typhoon case indicates that the domestic many-core computing platform and general cluster yield almost no differences in the simulated typhoon path and intensity,and the differences in surface pressure(PSFC)in the WRF model and sea surface temperature(SST)in the short-range forecast are very small,whereas a major difference can be identified at high latitudes after the first 10 days.Further heat budget analysis verifies that the differences in SST after 10 days are mainly caused by shortwave radiation variations,as influenced by subsequently generated typhoons in the system.These typhoons generated in the hindcast after the first 10 days attain obviously different trajectories between the two platforms. 展开更多
关键词 heterogeneous many-core supercomputing platform homogenous multicore supercomputing platform comparison analysis typhoon case
在线阅读 下载PDF
Deep Packet Inspection Based on Many-Core Platform
10
作者 Ya-Ru Zhan Zhao-Shun Wang 《Journal of Computer and Communications》 2015年第5期1-6,共6页
With the development of computer technology, network bandwidth and network traffic continue to increase. Considering the large data flow, it is imperative to perform inspection effectively on network packets. In order... With the development of computer technology, network bandwidth and network traffic continue to increase. Considering the large data flow, it is imperative to perform inspection effectively on network packets. In order to find a solution of deep packet inspection which can appropriate to the current network environment, this paper built a deep packet inspection system based on many-core platform, and in this way, verified the feasibility to implement a deep packet inspection system under many-core platform with both high performance and low consumption. After testing and analysis of the system performance, it has been found that the deep packet inspection based on many-core platform TILE_Gx36 [1] [2] can process network traffic of which the bandwidth reaches up to 4 Gbps. To a certain extent, the performance has improved compared to most deep packet inspection system based on X86 platform at present. 展开更多
关键词 many-core PLATFORM Deep PACKET INSPECTION Application Layer PROTOCOL TILE_Gx36
在线阅读 下载PDF
Multiple Levels of Abstraction in the Simulation of Microthreaded Many-Core Architectures
11
作者 Irfan Uddin 《Open Journal of Modelling and Simulation》 2015年第4期159-190,共32页
Simulators are generally used during the design of computer architectures. Typically, different simulators with different levels of complexity, speed and accuracy are used. However, for early design space exploration,... Simulators are generally used during the design of computer architectures. Typically, different simulators with different levels of complexity, speed and accuracy are used. However, for early design space exploration, simulators with less complexity, high simulation speed and reasonable accuracy are desired. It is also required that these simulators have a short development time and that changes in the design require less effort in the implementation in order to perform experiments and see the effects of changes in the design. These simulators are termed high-level simulators in the context of computer architecture. In this paper, we present multiple levels of abstractions in a high-level simulation of a general-purpose many-core system, where the objective of every level is to improve the accuracy in simulation without significantly affecting the complexity and simulation speed. 展开更多
关键词 HIGH-LEVEL Simulations MULTIPLE LEVELS of ABSTRACTION Design Space Exploration many-core Systems
在线阅读 下载PDF
Proposal for sequential Stern-Gerlach experiment with programmable quantum processors
12
作者 胡孟军 缪海兴 张永生 《Chinese Physics B》 SCIE EI CAS CSCD 2024年第2期131-136,共6页
The historical significance of the Stern–Gerlach(SG)experiment lies in its provision of the initial evidence for space quantization.Over time,its sequential form has evolved into an elegant paradigm that effectively ... The historical significance of the Stern–Gerlach(SG)experiment lies in its provision of the initial evidence for space quantization.Over time,its sequential form has evolved into an elegant paradigm that effectively illustrates the fundamental principles of quantum theory.To date,the practical implementation of the sequential SG experiment has not been fully achieved.In this study,we demonstrate the capability of programmable quantum processors to simulate the sequential SG experiment.The specific parametric shallow quantum circuits,which are suitable for the limitations of current noisy quantum hardware,are given to replicate the functionality of SG devices with the ability to perform measurements in different directions.Surprisingly,it has been demonstrated that Wigner’s SG interferometer can be readily implemented in our sequential quantum circuit.With the utilization of the identical circuits,it is also feasible to implement Wheeler’s delayed-choice experiment.We propose the utilization of cross-shaped programmable quantum processors to showcase sequential experiments,and the simulation results demonstrate a strong alignment with theoretical predictions.With the rapid advancement of cloud-based quantum computing,such as BAQIS Quafu,it is our belief that the proposed solution is well-suited for deployment on the cloud,allowing for public accessibility.Our findings not only expand the potential applications of quantum computers,but also contribute to a deeper comprehension of the fundamental principles underlying quantum theory. 展开更多
关键词 sequential Stern-Gerlach quantum circuit quantum processor
原文传递
Efficient cache replacement framework based on access hotness for spacecraft processors
13
作者 GAO Xin NIAN Jiawei +1 位作者 LIU Hongjin YANG Mengfei 《中国空间科学技术(中英文)》 CSCD 北大核心 2024年第2期74-88,共15页
A notable portion of cachelines in real-world workloads exhibits inner non-uniform access behaviors.However,modern cache management rarely considers this fine-grained feature,which impacts the effective cache capacity... A notable portion of cachelines in real-world workloads exhibits inner non-uniform access behaviors.However,modern cache management rarely considers this fine-grained feature,which impacts the effective cache capacity of contemporary high-performance spacecraft processors.To harness these non-uniform access behaviors,an efficient cache replacement framework featuring an auxiliary cache specifically designed to retain evicted hot data was proposed.This framework reconstructs the cache replacement policy,facilitating data migration between the main cache and the auxiliary cache.Unlike traditional cacheline-granularity policies,the approach excels at identifying and evicting infrequently used data,thereby optimizing cache utilization.The evaluation shows impressive performance improvement,especially on workloads with irregular access patterns.Benefiting from fine granularity,the proposal achieves superior storage efficiency compared with commonly used cache management schemes,providing a potential optimization opportunity for modern resource-constrained processors,such as spacecraft processors.Furthermore,the framework complements existing modern cache replacement policies and can be seamlessly integrated with minimal modifications,enhancing their overall efficacy. 展开更多
关键词 spacecraft processors cache management replacement policy storage efficiency memory hierarchy MICROARCHITECTURE
在线阅读 下载PDF
面向含噪中规模量子处理器的量子机器学习 被引量:1
14
作者 石金晶 肖子萌 +2 位作者 王雯萱 张师超 李学龙 《计算机学报》 北大核心 2025年第3期602-631,共30页
量子计算与人工智能结合,在增强模型表达能力、加速和优化机器学习等方面可能产生颠覆性影响,有望突破人工智能领域所面临的可解释性差、最优解难等问题,量子人工智能已成为国内外重点关注的学科前沿。量子机器学习是量子人工智能领域... 量子计算与人工智能结合,在增强模型表达能力、加速和优化机器学习等方面可能产生颠覆性影响,有望突破人工智能领域所面临的可解释性差、最优解难等问题,量子人工智能已成为国内外重点关注的学科前沿。量子机器学习是量子人工智能领域的重要研究内容,它将量子计算基础理论与机器学习原理相结合,以实现具有量子加速的机器学习任务。随着量子计算软硬件的快速发展,含噪中规模量子(NISQ)处理器的学习优势被证明,国内外学者相继提出一系列量子机器学习方法,以挖掘量子计算助力人工智能技术发展的创新应用。然而,当前的量子机器学习仍局限于对算法的优化,缺乏系统层面的理论架构,仍有许多科学问题亟待解决。本文首先从量子机器学习系统表征角度出发,建立量子机器学习系统的层次模型,概括和总结了面向各类任务的量子机器学习方案,分析了量子机器学习在提高经典算法速度等方面可能体现的“量子优势”。接着根据量子机器学习系统的层次结构,从原理层、计算层、应用层这三个方面对现有量子机器学习方法进行了总结与梳理,系统性地分析和讨论了其中的关键问题与解决方案。最后,结合当前阶段量子人工智能的发展特点,重点分析了量子机器学习领域面临的科学问题与挑战,并对未来该领域的发展趋势进行了深入分析与展望。 展开更多
关键词 量子计算 量子人工智能 量子机器学习 量子算法 含噪中规模量子处理器
在线阅读 下载PDF
考虑机器数量增加的多处理机工件调度优化 被引量:1
15
作者 孙涛 王军强 黄永兴 《计算机集成制造系统》 北大核心 2025年第3期924-938,共15页
多处理机工件是在同一时刻由多台处理机并行加工的工件。面向以最小化最大完工时间为目标的多处理机工件调度,分析了机器数量增加对最大完工时间的影响,证明了最优调度方案和所提近似调度方案的最好情形影响比,揭示了最大完工时间随着... 多处理机工件是在同一时刻由多台处理机并行加工的工件。面向以最小化最大完工时间为目标的多处理机工件调度,分析了机器数量增加对最大完工时间的影响,证明了最优调度方案和所提近似调度方案的最好情形影响比,揭示了最大完工时间随着机器数量增加而减少并趋于稳定的规律。分析了机器数量增加的影响,一方面改善了调度目标,另一方面增加了机器投入成本。权衡最大完工时间减少和机器成本增加两方面影响,以最小化最大完工时间与机器成本加权和为目标决策机器数量。基于降序首次适应算法设计了近似算法,给出了调度优化方案,并证明了所提算法的最差性能比不超过2。通过仿真实验,验证了所提算法的最好情形影响比及算法的有效性。 展开更多
关键词 多处理机工件调度 资源扩充 最好情形影响比 近似算法 最差性能比
在线阅读 下载PDF
我国数据知识产权登记制度试点改革路径研究 被引量:1
16
作者 孟奇勋 程伟佳 戴运 《科技进步与对策》 北大核心 2025年第12期151-160,共10页
厘清数据、数据集合与数据产品,数据产权登记、数据产品登记与数据知识产权登记的关系,是推进数据知识产权登记制度试点改革的逻辑前提。通过对9项政策文本的比较,得出以下结论:①数据知识产权登记试点为我国数据知识产权保护规则构建... 厘清数据、数据集合与数据产品,数据产权登记、数据产品登记与数据知识产权登记的关系,是推进数据知识产权登记制度试点改革的逻辑前提。通过对9项政策文本的比较,得出以下结论:①数据知识产权登记试点为我国数据知识产权保护规则构建提供了有益经验,但仍然存在登记规则不统一、权益配置不清晰、应用场景有待拓展等现实挑战;②从法律关系来看,数据知识产权登记以特定的数据集合为对象,以数据处理者为主体,以有限排他权为权利内容;③在推进数据要素市场化配置改革和全国统一大市场建设背景下,亟待明确数据知识产权登记效力与审查标准、强化相关部门职能协同与监督管理,探索区块链赋能数据交易流通效率提升路径。 展开更多
关键词 数据产权 数据产品 数据集合 数据处理者 数据知识产权登记
在线阅读 下载PDF
基于ARM+FPGA的机载信息管理处理机设计
17
作者 王健 郭霖佯 +4 位作者 何自豪 周立辉 陈家福 李欣琦 周浩 《火力与指挥控制》 北大核心 2025年第4期85-92,共8页
为实现飞机在执行战术任务时对格式化链路消息的接收处理、态势信息综合处理、载机平台信息采集、指令应答与信息回传、雷达目标定位等功能,设计一种基于ARM+FPGA架构机载信息管理处理机。介绍机载信息管理处理机具体功能和应用,从硬件... 为实现飞机在执行战术任务时对格式化链路消息的接收处理、态势信息综合处理、载机平台信息采集、指令应答与信息回传、雷达目标定位等功能,设计一种基于ARM+FPGA架构机载信息管理处理机。介绍机载信息管理处理机具体功能和应用,从硬件和软件设计两个方面对系统结构进行详细阐述,完成机载信息综合处理模块、RS422和ARINC429相关接口的软硬件设计工作,使得机载信息管理处理机能够实时处理机载电台、显控机、雷达、导航系统等相关设备的信息,并根据系统通信协议的要求完成各类型系统数据的实时接收、解析、检索和发送任务。通过对系统的联调联试和测试工作,发现达到系统各项指标要求并验证了其可行性和稳定性。 展开更多
关键词 机载处理机 RS422 ARINC429 FPGA ARM
在线阅读 下载PDF
处理器数据预取器安全研究综述
18
作者 刘畅 黄祺霖 +4 位作者 刘煜川 林世鸿 秦中元 陈立全 吕勇强 《电子与信息学报》 北大核心 2025年第9期3038-3056,共19页
数据预取器是现代处理器用于提高性能的重要微架构组件。然而,由于在设计之初缺乏系统性的安全评估与考量,主流商用处理器中的预取器近年来被揭示出存在严重安全隐患,已被用于针对浏览器、操作系统和可信执行环境的侧信道攻击。面对这... 数据预取器是现代处理器用于提高性能的重要微架构组件。然而,由于在设计之初缺乏系统性的安全评估与考量,主流商用处理器中的预取器近年来被揭示出存在严重安全隐患,已被用于针对浏览器、操作系统和可信执行环境的侧信道攻击。面对这类新型微架构攻击,处理器安全研究亟需解决以下关键问题:如何系统性地分析攻击方法,全面认识预取器潜在风险,量化评估预取器安全程度,从而设计更加安全的数据预取器。为解决这些问题,该文系统调研了商用处理器中已知预取器设计及相关侧信道攻击,通过提取内存访问模式,为7种预取器建立行为模型,并基于此为20种侧信道攻击建立攻击模型,系统整理了各类攻击的触发条件和泄露信息,并分析可能存在的其他攻击方法。在此基础上,该文提出1套包含3个维度和24个指标的安全性评估体系,为数据预取器的安全性提供全面量化评估。最后,该文深入探讨了防御策略、安全预取器设计思路及未来研究方向。作为首篇聚焦于商用处理器数据预取器安全问题的综述性文章,该文有助于深入了解数据预取器面临的安全挑战,推动预取器的安全性量化评估体系构建,从而为设计更加安全的数据预取器提供指导。 展开更多
关键词 计算机体系结构 处理器 数据预取器 微架构安全 侧信道攻击
在线阅读 下载PDF
基于特征值均衡算法的车内主动噪声控制系统
19
作者 卢炽华 周曾志 +3 位作者 陈弯 王子嘉 刘志恩 孙孟雷 《噪声与振动控制》 北大核心 2025年第2期171-178,共8页
在车内主动噪声控制领域,滤波最小均方(Filtered-x Least-mean-square,FxLMS)算法在面对发动机的低频窄带噪声时,因为其步长因子优化在不同频率上的不一致性,影响整体噪声控制效果。本文提出一种基于特征值均衡的主动噪声控制算法,均衡F... 在车内主动噪声控制领域,滤波最小均方(Filtered-x Least-mean-square,FxLMS)算法在面对发动机的低频窄带噪声时,因为其步长因子优化在不同频率上的不一致性,影响整体噪声控制效果。本文提出一种基于特征值均衡的主动噪声控制算法,均衡FxLMS算法在面对多种频率成分噪声时的收敛特性,并使其主要频率成分的收敛性能更加均匀。最后,建立基于国产数字音频处理器(Digital Audio Processor,DSP)的车内双通道主动噪声控制系统,车辆定置发动机稳态与行驶过程非稳态加速两种工况的实车实验。实验结果表明,在稳态工况下,主要阶次的车内噪声降噪量最大值为39.35 dB(A);在非稳态工况下,车内总声压级最大降噪量为4.77 dB(A),可为车内主动噪声控制系统国产化应用提供参考。 展开更多
关键词 声学 主动噪声控制 次级通路 汽车发动机 特征值均衡 数字音频处理器
在线阅读 下载PDF
论个人信息处理者隐微权利结构与企业数据确权
20
作者 张建文 张锐 《大连理工大学学报(社会科学版)》 北大核心 2025年第3期63-70,共8页
《个人信息保护法》以“保护个人信息权益”和“促进个人信息合理利用”为立法目的,规定个人在个人信息处理活动中的权利的同时,隐微地规定了个人信息处理者的权利结构。这种权利结构并非单纯的“数据控制的事实状态”,而是处理者在符... 《个人信息保护法》以“保护个人信息权益”和“促进个人信息合理利用”为立法目的,规定个人在个人信息处理活动中的权利的同时,隐微地规定了个人信息处理者的权利结构。这种权利结构并非单纯的“数据控制的事实状态”,而是处理者在符合法律规定的处理条件下获得合法利益的方式。在揭示并确认处理者的隐微权利结构的前提下,数据确权特别是企业数据确权问题的重要性可能被极大降低甚至消解,试图在《个人信息保护法》规定的处理者隐微权利结构之外,构建一般数据权利体系或立法的尝试,并非就是最佳的方向。缓行数据确权之操切建议,也能够为司法实践拣选最佳的数据权益保护方案提供必要的时间和空间。 展开更多
关键词 个人信息 个人信息处理者 自主决定 合理利用 企业数据
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部