期刊文献+
共找到826篇文章
< 1 2 42 >
每页显示 20 50 100
Architecture-level performance/power tradeoff in network processor design
1
作者 陈红松 季振洲 胡铭曾 《Journal of Harbin Institute of Technology(New Series)》 EI CAS 2007年第1期45-48,共4页
Network processors are used in the core node of network to flexibly process packet streams. With the increase of performance, the power of network processor increases fast, and power and cooling become a bottleneck. A... Network processors are used in the core node of network to flexibly process packet streams. With the increase of performance, the power of network processor increases fast, and power and cooling become a bottleneck. Architecture-level power conscious design must go beyond low-level circuit design. Architectural power and performance tradeoff should be considered at the same time. Simulation is an efficient method to design modem network processor before making chip. In order to achieve the tradeoff between performance and power, the processor simulator is used to design the architecture of network processor. Using Netbeneh, Commubench benchmark and processor simulator-SimpleScalar, the performance and power of network processor are quantitatively evaluated. New performance tradeoff evaluation metric is proposed to analyze the architecture of network processor. Based on the high performance lnteI IXP 2800 Network processor eonfignration, optimized instruction fetch width and speed ,instruction issue width, instruction window size are analyzed and selected. Simulation resuits show that the tradeoff design method makes the usage of network processor more effectively. The optimal key parameters of network processor are important in architecture-level design. It is meaningful for the next generation network processor design. 展开更多
关键词 network processor design performance/power simulation tradeoff evaluation optimization
在线阅读 下载PDF
Reconfigurable Communication Processor: A New Approach for Network Processor
2
作者 孙华 陈青山 张文渊 《Journal of Shanghai Jiaotong university(Science)》 EI 2003年第1期43-47,共5页
As the traditional RISC+ASIC/ASSP approach for network processor design can not meet the today’s requirements, this paper described an alternate approach, Reconfigurable Processing Architecture, to boost the performa... As the traditional RISC+ASIC/ASSP approach for network processor design can not meet the today’s requirements, this paper described an alternate approach, Reconfigurable Processing Architecture, to boost the performance to ASIC level while reserve the programmability of the traditional RISC based system. This paper covers both the hardware architecture and the software development environment architecture. 展开更多
关键词 network processor reconfigurable processor run time reconfiguration field programmable gate array (FPGA) raduced instruction set circuit (RISC) application specific integrated circuit(ASIC)
在线阅读 下载PDF
SDN-Based Switch Implementation on Network Processors 被引量:1
3
作者 Yunchun Li Guodong Wang 《Communications and Network》 2013年第3期434-437,共4页
Virtualization is the key technology of cloud computing. Network virtualization plays an important role in this field. Its performance is very relevant to network virtualizing. Nowadays its implementations are mainly ... Virtualization is the key technology of cloud computing. Network virtualization plays an important role in this field. Its performance is very relevant to network virtualizing. Nowadays its implementations are mainly based on the idea of Software Define Network (SDN). Open vSwitch is a sort of software virtual switch, which conforms to the OpenFlow protocol standard. It is basically deployed in the Linux kernel hypervisor. This leads to its performance relatively poor because of the limited system resource. In turn, the packet process throughput is very low.In this paper, we present a Cavium-based Open vSwitch implementation. The Cavium platform features with multi cores and couples of hard ac-celerators. It supports zero-copy of packets and handles packet more quickly. We also carry some experiments on the platform. It indicates that we can use it in the enterprise network or campus network as convergence layer and core layer device. 展开更多
关键词 SDN OPEN vSwitch network processorS OpenFlow
在线阅读 下载PDF
An Improved Cache Mechanism for a Cache-Based Network Processor
4
作者 Hayato Yamaki Hiroaki Nishi 《通讯和计算机(中英文版)》 2013年第3期277-286,共10页
关键词 高速缓存机制 网络处理器 网络流量 上下文 网络内容 IP电话 仿真结果 数据包
在线阅读 下载PDF
High-Level Portable Programming Language for Optimized Memory Use of Network Processors
5
作者 Yasusi Kanada 《Communications and Network》 2015年第1期55-69,共15页
Network processors (NPs) are widely used for programmable and high-performance networks;however, the programs for NPs are less portable, the number of NP program developers is small, and the development cost is high. ... Network processors (NPs) are widely used for programmable and high-performance networks;however, the programs for NPs are less portable, the number of NP program developers is small, and the development cost is high. To solve these problems, this paper proposes an open, high-level, and portable programming language called “Phonepl”, which is independent from vendor-specific proprietary hardware and software but can be translated into an NP program with high performance especially in the memory use. A common NP hardware feature is that a whole packet is stored in DRAM, but the header is cached in SRAM. Phonepl has a hardware-independent abstraction of this feature so that it allows programmers mostly unconscious of this hardware feature. To implement the abstraction, four representations of packet data type that cover all the packet operations (including substring, concatenation, input, and output) are introduced. Phonepl have been implemented on Octeon NPs used in plug-ins for a network-virtualization environment called the VNode Infrastructure, and several packet-handling programs were evaluated. As for the evaluation result, the conversion throughput is close to the wire rate, i.e., 10 Gbps, and no packet loss (by cache miss) occurs when the packet size is 256 bytes or larger. 展开更多
关键词 network processors PORTABILITY HIGH-LEVEL Language Hardware INDEPENDENCE MEMORY Usage DRAM SRAM network Virtualization
在线阅读 下载PDF
Efficiency of Cache Mechanism for Network Processors 被引量:2
6
作者 徐波 常剑 +2 位作者 黄诗萌 薛一波 李军 《Tsinghua Science and Technology》 SCIE EI CAS 2009年第5期575-585,共11页
With the explosion of network bandwidth and the ever-changing requirements for diverse network-based applications,the traditional processing architectures,i.e.,general purpose processor(GPP) and application specific... With the explosion of network bandwidth and the ever-changing requirements for diverse network-based applications,the traditional processing architectures,i.e.,general purpose processor(GPP) and application specific integrated circuits(ASIC) cannot provide sufficient flexibility and high performance at the same time.Thus,the network processor(NP) has emerged as an alternative to meet these dual demands for today's network processing.The NP combines embedded multi-threaded cores with a rich memory hierarchy that can adapt to different networking circumstances when customized by the application developers.In today's NP architectures,multithreading prevails over cache mechanism,which has achieved great success in GPP to hide memory access latencies.This paper focuses on the efficiency of the cache mechanism in an NP.Theoretical timing models of packet processing are established for evaluating cache efficiency and experiments are performed based on real-life network backbone traces.Testing results show that an improvement of nearly 70% can be gained in throughput with assistance from the cache mechanism.Accordingly,the cache mechanism is still efficient and irreplaceable in network processing,despite the existing of multithreading. 展开更多
关键词 CACHE network processor efficiency evaluation
原文传递
Building RNC in All-IP Wireless Networks using Network Processors
7
作者 CHENGSheng NIXian-le ZHUXin-ning DINGWei 《The Journal of China Universities of Posts and Telecommunications》 EI CSCD 2004年第2期86-91,共6页
This paper describes a solution to build network-processor-based Radio Network Controller (RNC) in all-IP wireless networks, it includes the structure of the 3rd Generation (3G) wireless networks and the role of netw... This paper describes a solution to build network-processor-based Radio Network Controller (RNC) in all-IP wireless networks, it includes the structure of the 3rd Generation (3G) wireless networks and the role of network nodes, such as Base Station (BS), RNC, and Packet-Switched Core Networks (PSCN). The architecture of IXP2800 network processor; the detailed implementation of the solution on IXP2800-based RNC are also covered. This solution can provide scalable IP forward features and it will be widely used in 3G RNCs. 展开更多
关键词 G 3GPP All-IP wireless networks RNC IP network processor IXP2800
原文传递
Architecture-Aware Session Lookup Design for Inline Deep Inspection on Network Processors
8
作者 徐波 何飞 +1 位作者 薛一波 李军 《Tsinghua Science and Technology》 SCIE EI CAS 2009年第1期19-28,共10页
Today's firewalls and security gateways are required to not only block unauthorized accesses by authenticating packet headers, but also inspect flow payloads against malicious intrusions. Deep inspection emerges as a... Today's firewalls and security gateways are required to not only block unauthorized accesses by authenticating packet headers, but also inspect flow payloads against malicious intrusions. Deep inspection emerges as a seamless integration of packet classification for access control and pattern matching for intrusion prevention. The two function blocks are linked together via well-designed session lookup schemes. This paper presents an architecture-aware session lookup scheme for deep inspection on network processors (NPs). Test results show that the proposed session data structure and integration approach can achieve the OC-48 line rate (2.5 Gbps) with inline stateful content inspection on the Intel IXP2850 NP. This work provides an insight into application design and implementation on NPs and principles for performance tuning of NP-based programming such as data allocation, task partitioning, latency hiding, and thread synchronization. 展开更多
关键词 session lookup deep inspection network processor performance optimization
原文传递
Hardwired Logic and Multithread Design in Network Processors
9
作者 李旭东 徐扬 +1 位作者 刘斌 王小军 《Tsinghua Science and Technology》 SCIE EI CAS 2004年第2期207-212,共6页
High-performance network processors are expected to play an important role in future high-speed routers. This paper focuses on two representative techniques needed for high-performance network processors: hardwired lo... High-performance network processors are expected to play an important role in future high-speed routers. This paper focuses on two representative techniques needed for high-performance network processors: hardwired logic design and multithread design. Using hardwired logic, this paper compares a single-thread design with a multithread design, and proposes general models and principles to analyze the clock frequency and the resource cost for these environments. Then, two IP header processing schemes, one in single-thread mode and the other in double-thread mode, are developed using these principles and the implementation results verified the theoretical calculation. 展开更多
关键词 network processor (NP) hardwired logic multithread IP header processing
原文传递
Speeding up the MATLAB complex networks package using graphic processors 被引量:1
10
作者 张百达 唐玉华 +1 位作者 吴俊杰 李鑫 《Chinese Physics B》 SCIE EI CAS CSCD 2011年第9期460-467,共8页
The availability of computers and communication networks allows us to gather and analyse data on a far larger scale than previously. At present, it is believed that statistics is a suitable method to analyse networks ... The availability of computers and communication networks allows us to gather and analyse data on a far larger scale than previously. At present, it is believed that statistics is a suitable method to analyse networks with millions, or more, of vertices. The MATLAB language, with its mass of statistical functions, is a good choice to rapidly realize an algorithm prototype of complex networks. The performance of the MATLAB codes can be further improved by using graphic processor units (GPU). This paper presents the strategies and performance of the GPU implementation of a complex networks package, and the Jacket toolbox of MATLAB is used. Compared with some commercially available CPU implementations, GPU can achieve a speedup of, on average, 11.3x. The experimental result proves that the GPU platform combined with the MATLAB language is a good combination for complex network research. 展开更多
关键词 complex networks graphic processors unit MATLAB Jacket Toolbox
原文传递
Optimized Processor for Sensor Networks Applications
11
作者 Ali Elkateeb 《通讯和计算机(中英文版)》 2012年第3期311-316,共6页
关键词 嵌入式处理器 传感器节点 网络应用 优化 节点设计 软核处理器 可重构系统 核心处理器
在线阅读 下载PDF
Neptune:一种通用网络处理器微结构模拟和性能仿真框架
12
作者 林涵越 吴婧雅 +2 位作者 卢文岩 钟浪辉 鄢贵海 《计算机研究与发展》 北大核心 2025年第5期1091-1107,共17页
网络包处理是网络设备的基本功能,涉及报文修改、校验和与哈希计算、数据包镜像或过滤、统计限速等多项任务.作为网络包处理的重要部件,网络处理器(network processor,NP)基于处理器结构,为网络设备提供线速的性能和充分的可编程能力,... 网络包处理是网络设备的基本功能,涉及报文修改、校验和与哈希计算、数据包镜像或过滤、统计限速等多项任务.作为网络包处理的重要部件,网络处理器(network processor,NP)基于处理器结构,为网络设备提供线速的性能和充分的可编程能力,但其架构多样,可分为单段式架构和多段式架构,现有模拟方法无法同时对二者性能进行模拟仿真.因此,提出一种通用网络处理器的结构模拟和性能仿真框架Neptune,采用多段式架构作为硬件抽象,使用事件链表、核间队列结构为数据通路和多段式架构模拟提供保障,同时满足单段式架构模拟需求.另外,借助同步图计算模式进行准确的并行模拟,并采用混合事件与时间驱动方法保障模拟高效性.实际测试中,Neptune以95%以上准确率支持2种架构的模拟,并以3.31MIPS的性能对网络处理器进行模拟,相较PFPSim取得1个数量级的性能提升.最后,展示了3个运用该框架进行网络处理器优化分析的应用案例. 展开更多
关键词 网络包处理 网络处理器 可编程数据面 专用处理器 模拟器
在线阅读 下载PDF
将SNN部署到类脑处理器的映射优化算法研究
13
作者 陈奥新 陈亮 +2 位作者 李千鹏 王智超 徐东君 《计算机工程与应用》 北大核心 2025年第11期156-165,共10页
近年来,具有生物合理性和能效优势的脉冲神经网络(SNN)受到广泛关注。然而,目前在类脑处理器上部署SNN的映射方案存在通信延迟高、拥塞严重、能耗高和节点连接性不足等问题,从而削弱了其实用性和执行效率。为解决这些问题,提出了基于KL(... 近年来,具有生物合理性和能效优势的脉冲神经网络(SNN)受到广泛关注。然而,目前在类脑处理器上部署SNN的映射方案存在通信延迟高、拥塞严重、能耗高和节点连接性不足等问题,从而削弱了其实用性和执行效率。为解决这些问题,提出了基于KL(Kernighan-Lin)和波尔兹曼退火差分进化(Boltzmann anneal differential evolution,BADE)的改进部署算法,用于将SNN映射到资源受限的类脑处理器上。该算法包括两个步骤:分区和映射。在分区阶段,通过在递归KL算法中引入全局优化策略(GRBKL)来最小化集群之间的通信延迟;在映射阶段,提出利用吸引子导向的BADE算法(BAFDE)寻找最小化通信延迟和最大拥塞的分配方式。用五个SNN实例对该算法进行了评估,结果表明,与SNEAP和SpiNeMap等方法相比,所提出的算法显著降低了通信延迟(分别降低了55.41%和94.73%)和最大拥塞(分别降低了81.27%和97.79%)。 展开更多
关键词 脉冲神经网络(SNN) 类脑处理器 启发式算法 片上网络(NOC)
在线阅读 下载PDF
面向智能物联网异构嵌入式芯片的自适应算子并行分割方法 被引量:1
14
作者 林政 刘思聪 +2 位作者 郭斌 丁亚三 於志文 《计算机科学》 北大核心 2025年第2期299-309,共11页
随着人民生活质量的持续提升与科技发展的日新月异,智能手机等移动设备在全球范围内得到了广泛普及。在这一背景下,深度神经网络在移动端的部署与应用成为了研究的热点。深度神经网络不仅推动了移动应用领域的显著进步,同时也对使用电... 随着人民生活质量的持续提升与科技发展的日新月异,智能手机等移动设备在全球范围内得到了广泛普及。在这一背景下,深度神经网络在移动端的部署与应用成为了研究的热点。深度神经网络不仅推动了移动应用领域的显著进步,同时也对使用电池供电的移动设备的能效管理提出了更高要求。当今移动设备中异构处理器的兴起给优化能效带来了新的挑战,在不同处理器间分配计算任务以实现深度神经网络并行处理和加速,并不一定能够优化能耗,甚至可能会增加能耗。针对这一问题,提出了一种能效优化的深度神经网络自适应并行计算调度系统。该系统包括一个运行时能耗分析器与在线算子划分执行器,能够根据动态设备条件动态调整算子分配,在保持高响应性的同时,优化了移动设备异构处理器上的计算能效。实验结果证明,相比基准方法,能效优化的深度神经网络自适应并行计算调度系统在移动设备深度神经网络上的平均能耗和平均时延减少了5.19%和9.0%,最大能耗和最大时延减少了18.35%和21.6%。 展开更多
关键词 深度神经网络 移动设备 能效优化 异构处理器 能耗预测
在线阅读 下载PDF
高性能可重构网络协议解析器的设计与实现
15
作者 张丽果 吴凯 +4 位作者 王文哲 张毅 王睿 曹亚莉 肖杉 《西安邮电大学学报》 2025年第3期58-67,共10页
针对当前网络中数据平面在处理新型协议时面临资源占用过多和性能较低的问题,设计并实现了一种可重构解析器。根据解析图生成解析指令实现解析器的重构,通过采用全等比较器组替代传统的三态内容寻址存储器(Ternary Content Addressable ... 针对当前网络中数据平面在处理新型协议时面临资源占用过多和性能较低的问题,设计并实现了一种可重构解析器。根据解析图生成解析指令实现解析器的重构,通过采用全等比较器组替代传统的三态内容寻址存储器(Ternary Content Addressable Memory,TCAM),降低资源占用。增加预处理解析器对常规协议帧进行预处理,采用多个解析器并行处理链路上的连续多层协议帧,提升数据包头解析速率。以100 Gbps智能网卡中网络协议处理为例,配置可重构解析器,在VCU118开发板上进行实现。实验结果表明,所提设计仅使用63844个查找表(Look-Up Table,LUT)和36346个触发器(Flip-Flop,FF),在解析结果完全正确的前提下,整个系统带宽最高可达58.3 Gbps。对比同类解决方案,提出的设计方法在提高性能的同时具有更低的资源占用。 展开更多
关键词 网络处理器 协议解析器 可重构报文处理 并行处理 现场可编程门阵列
在线阅读 下载PDF
基三众核架构中基于同步哈密顿环的无死锁策略
16
作者 李春峰 Karim Soliman +1 位作者 计卫星 石峰 《计算机研究与发展》 北大核心 2025年第4期930-949,共20页
确保片上网络(network-on-chip,NoC)中的数据传输无死锁,是NoC为多处理器片上系统(multi-processor system-on-chip,MPSoC)提供可靠通信服务的前提,决定了NoC甚至MPSoC的可用性.现有的通用防死锁策略难以发挥出特定拓扑结构的自身特点... 确保片上网络(network-on-chip,NoC)中的数据传输无死锁,是NoC为多处理器片上系统(multi-processor system-on-chip,MPSoC)提供可靠通信服务的前提,决定了NoC甚至MPSoC的可用性.现有的通用防死锁策略难以发挥出特定拓扑结构的自身特点和优势,甚至可能会增加网络延迟、功耗以及硬件复杂性.另外,由于路由级和协议级死锁存在显著差异,现有无死锁方案较难同时解决这2类死锁问题,影响了MPSoC的可靠性.利用基三众核架构(triplet-based many-core architecture,TriBA)中拓扑结构自身具有的哈密顿特性提出了基于同步哈密顿环的无死锁策略,该策略依据拓扑结构自身的对称轴和哈密顿边对数据传输进行分类,预防了协议级死锁并提高了数据传输速度;同时使用循环链表技术判断同一缓冲区内数据同步传输方向,消除了路由级死锁并降低了数据传输延迟.在优化前瞻路由算法基础上,设计了基于同步哈密顿环的无死锁路由机制HamSPR(Hamiltonian shortest path routing).GEM5仿真结果表明,与TriBA现有方法相比,HamSPR在合成流量下的平均数据包延迟和功耗分别降低了8.78%~65.40%和6.94%~34.15%,吞吐量提高了8.00%~59.17%;在PARSEC测试集下的应用运行时间和平均数据包延迟分别最高实现了16.51%和42.75%的降低.与2D-Mesh架构相比,TriBA在PARSEC测试集下的应用性能实现了1%~10%的提升. 展开更多
关键词 众核处理器 片上网络 基三众核架构 哈密顿特性 路由算法 死锁预防
在线阅读 下载PDF
人的认知与仿真建模及其在智能家居语音交互设计中的应用 被引量:1
17
作者 章薇 吴昌旭 《包装工程》 北大核心 2025年第4期226-236,共11页
目的通过自上而下的认知建模方法进行人机语音交互建模,实现人的绩效和满意度预测,为智能语音交互系统的设计方案的评估提供新思路和新方法。方法基于人的信息处理排队网络模型(QN-MHP),结合人机语音交互过程的理论研究,建立人机语音交... 目的通过自上而下的认知建模方法进行人机语音交互建模,实现人的绩效和满意度预测,为智能语音交互系统的设计方案的评估提供新思路和新方法。方法基于人的信息处理排队网络模型(QN-MHP),结合人机语音交互过程的理论研究,建立人机语音交互过程中人的绩效和满意度预测模型,以预测不同语音交互系统的识别情况(识别自然语言、识别限制语言)、语音交互系统的激活方法(唤醒激活、按一次/点击激活、按住说、直接说)、语音交互系统的平均识别准确率(连续变量)和语音交互系统的平均识别延迟(连续变量)下人使用语音交互系统完成任务的时间和用户满意度。结论本文基于人的信息处理排队网络模型(QN-MHP)建立了不同语音交互系统的设计参数下的人的绩效和满意度预测模型,将本模型应用于家居场景中,能用于评估不同的家居服务机器人的语音交互系统的设计方案的有效性。本研究建立的语音交互过程中人的绩效和满意度预测模型能为设计师和工程师提供便捷有效的智能语音交互系统的评估工具,从而为不同需求下的智能语音系统的设计、优化和应用提供参考。 展开更多
关键词 人机语音交互 人的信息排队网络模型 人的认知与仿真建模 任务完成时间 用户满意度
在线阅读 下载PDF
Trends of Communication Processors
18
作者 LIU Dake CAI Zhaoyun WANG Wei 《China Communications》 SCIE CSCD 2016年第1期1-16,共16页
Processors have been playing important roles in both communication infrastructure systems and terminals.In this paper,both application specific and general purpose processors for communications are discussed including... Processors have been playing important roles in both communication infrastructure systems and terminals.In this paper,both application specific and general purpose processors for communications are discussed including the roles,the history,the current situations,and the trends.One trend is that ASIPs(Application Specific Instruction-set Processors) are taking over ASICs(Application Specific Integrated Circuits) because of the increasing needs both on performance and compatibility of multi-modes.The trend opened opportunities for researchers crossing the boundary between communications and computer architecture.Another trend is the serverlization,i.e.,more infrastructure equipments are replaced by servers.The trend opened opportunities for researchers working towards high performance computing for communication,such as research on communication algorithm kernels and real time programming methods on servers. 展开更多
关键词 ASIP baseband processor network processor application processor server processor
在线阅读 下载PDF
存算一体架构下的神经网络处理器低功耗设计研究
19
作者 田德源 《计算机应用文摘》 2025年第19期163-165,共3页
针对传统冯·诺依曼架构下神经网络处理器所面临的“存储墙”与“功耗墙”问题,文章提出一种基于存算一体架构的低功耗神经网络处理器设计方法。该方法利用通过忆阻器阵列实现存储与计算的深度融合,并结合动态电压频率调整、近阈值... 针对传统冯·诺依曼架构下神经网络处理器所面临的“存储墙”与“功耗墙”问题,文章提出一种基于存算一体架构的低功耗神经网络处理器设计方法。该方法利用通过忆阻器阵列实现存储与计算的深度融合,并结合动态电压频率调整、近阈值计算及混合精度训练等关键技术,在28 nm FD-SOI工艺下实现了5.3 TOPS/W的能效表现。实验结果表明,该架构在处理ResNet-50网络时平均功耗仅为1.2 W,相较于传统架构功耗降低78%,并支持毫秒级动态功耗管理,能够为边缘计算及物联网设备提供高能效、低功耗解决方案。 展开更多
关键词 存算一体 神经网络处理器 低功耗设计 忆阻器 近阈值计算
在线阅读 下载PDF
一种板级异构核间多模通信的软硬件设计方法 被引量:1
20
作者 李锐 杜彬 王远波 《汽车电器》 2025年第6期100-102,共3页
随着车联网技术的高速发展和车载电控单元复杂性的提升,传统的单处理器难以满足数据交互与处理日益复杂和多样化的需求。文章提出一种板级异构核间多模通信机制,设计集成高实时性MCU和高性能SOC的硬件平台,并对异构多模通信的硬件结构... 随着车联网技术的高速发展和车载电控单元复杂性的提升,传统的单处理器难以满足数据交互与处理日益复杂和多样化的需求。文章提出一种板级异构核间多模通信机制,设计集成高实时性MCU和高性能SOC的硬件平台,并对异构多模通信的硬件结构进行阐述。在此基础上,提出分层、低耦合、高内聚的轻量级组件化软件设计方案,阐明驱动层、接口层、网络层、协议层、传输层和应用层的通信机制。该机制在提升异构多核环境运算效率的同时,实现处理器性能的优化,提高通信传输数据的品质。 展开更多
关键词 车联网 核间通信 MCU SOC 异构处理器
在线阅读 下载PDF
上一页 1 2 42 下一页 到第
使用帮助 返回顶部