期刊文献+

一种高性能子字并行乘法器的设计与实现 被引量:3

Design and implementation of high performance subword parallel multiplier
在线阅读 下载PDF
导出
摘要 提出了一种支持子字并行的乘法器体系结构,并完成了其VLSI设计与实现。该乘法器在16bit阵列子字并行结构的基础上,扩展了有符号与无符号之间的混合操作,采用多周期合并技术,实现了32bit宽度的子字并行,并支持子字模式的乘累加,同时采用流水线设计技术,能够在单周期内完成4个8×8、2个16×16或1个32×16的有符号/无符号乘法操作。0.18μm的标准单元库的实现表明该乘法器既能减小面积又能提高主频,是硬件消耗和运算性能的较好折衷,非常适用于多媒体微处理器的设计。 This paper proposes a multiplier architecture capable of supporting subword parallelism,and has completed its VLSI design and implementation using 0.18 μm standard cell process.Based on the 16 bit array sub-word parallel architecture,this multiplier extends signed and unsigned hybrid operations,and uses multi-cycle combination technique to realize 32 bit sub-word parallel operations.In addition,it also supports sub-word multiply accumulate operations.With pipelined operation style,it can per- form one 32×16,two 16×16,and four 8×8 bit signed/unsigned multiplications in single cycle respectively.The implementation resuits show that the proposed sub-word parallel multiplier can reduce the area and improve the frequency ,which is a good com- promise of hardware consumption and performance.h is well suitable for multimedia microprocessor design.
出处 《计算机工程与应用》 CSCD 北大核心 2007年第20期104-106,131,共4页 Computer Engineering and Applications
基金 国家自然科学基金(the National Natural Science Foundation of China under Grant No.90407022)
关键词 子字并行 乘法器 多媒体 subword parallelism multiplier multimedia
  • 相关文献

参考文献8

  • 1Corbal J.DLP+TLP processors for the next generation of media workloads[C]//Proc 7th Intl Symp on HPCA,2001.
  • 2Lee R B.Accelerating multimedia with enhanced microprocessors[J].IEEE Micro,1995,15:22-32.
  • 3Peleg A,Weiser U.MMX technology extension to the Intel architecture[J].IEEE Micro,1996,16:42-50.
  • 4Schmookler M S.A Low-power,high-speed implementation of a power PCTM microprocessor vector extension[C]//Comp Arith,Proc 14th IEEE Symp,1999.
  • 5Krithivasan S,Schulte M J.Multiplier architectures for media processing[C]//Proc 37th Asilomar Conf Signals,Systems,Computers,2003:2193-2197.
  • 6Tan D,Danysh A,Liebelt M.Multiple-precision fixed-point vector multiply-accumulator using shared segmentation[C]//Comp Arith,Proc 16th IEEE Symp,2003:12-19.
  • 7Farooqui A A,Oklobdzija V G.General data-path organization of a MAC unit for VLSI implementation of DSP processors[C]//Proc IEEE Int'l Symp Circuits and Systems,1998:260-263.
  • 8Suzuki K.A 2000-MOPS embedded RISC processor with a Rambus DRAM controller[J].IEEE J Solid-State Circuits,1999,34:1010-1021.

同被引文献12

引证文献3

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部