一种大数模乘运算的线性脉动阵列新结构被引量：2

Novel systolic implementation of modular multiplication for large operands

导出

摘要提出了一种新型的线性脉动阵列结构用来实现基于Ｍｏｎｔｇｏｍｅｒｙ算法的并行模乘运算，对于ｎ位模乘运算，需要２ｎ＋１１个时钟周期完成，为了减少每一周期内的运算量，在处理单元内部实现了三级流水线结构，使得每一周期的串行运算量仅为一级全加器，同时，由于处理单元间只有局部互连，连线延迟很小，于是这种新结构脉动阵列模乘器能在很高的频率下工作。另一个方面，每个处理单元结构简单，仅由４个全加器和１４个触发器构成，对于ｎ位模乘运算，总的规模约为４６ｎ＋１８４个门。所以，它在速度和面积上都是优化的，适于ＶＬＳＩ的实现。作为核心运算部件，能有效地用于如ＲＳＡ等许多公钥密码体制的加解密运算。对于０．８μｍＣＭＯＳ工艺，２００ＭＨｚ时钟是完全可行的，在仅使用一个模乘器条件下，５１２位模幂乘加解密运算速度能达到１２９ｋｂｉｔ／ｓ。 A novel systolic linear array modular multiplier is presented which ideally performs the parallel modular multiplication based on the algorithm of Montgomery. The total execution time for an n bit modular multiplication is 2n+11 clock cycles. To further increase the throughput the three stage pipeline architecture is adopted inside the processing element, so that every one bit result outputs at one clock cycle when the pipeline is filled. Each pipeline stage only contains the operation of an one bit full adder. Moreover, with the purely nearest neighbor communication, the interconnect delay is also very short. Therefore it can work at a high clock frequency. On the other hand, every processing element is simple, mainly consisting of four full adders and fourteen flip flops. For n bit modular multiplication, the cost of the hardware is 46 n +184 gates. So this novel linear systolic array for modular multiplication is a speed and area optimized system, suitable for the VLSI implementation. It can be used for modular exponentiation which is a kernel operation in many public key cryptosystems such as RSA. With clock frequency of 200MHz by using 0.8μm CMOS technology, the throughput can reach 129kb/s with a single modular multiplier chip.

作者陈弘毅盖伟新

机构地区清华大学微电子学研究所

出处《清华大学学报（自然科学版）》 EI CAS CSCD 北大核心 1998年第3期11-15,共5页 Journal of Tsinghua University(Science and Technology)

基金国家自然科学重点基金

关键词脉动阵列模乘运算模幂乘运算公钥密码体制 systolic array modular multiplication modular exponentiation pipeline architecture public key cryptosystem 

分类号 TN47 [电子电信—微电子学与固体电子学] TN918.4 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献1

1盖伟新，The 2nd International Conference on ASIC Proceedings （ASICON’96）,，1996年，171页

同被引文献8

1陈弘毅，清华大学学报，1998年，38卷，3期，11页
2P L Montgomery. Modular multiplication without trial division[J].Mathematics of Computation,1985;44(170):519～521
3R L Rivest,A Shamir,L Adleman. A method for obtaining digital signatures and public-key cryptosystems[J].Communications of the ACM, 1978 ;21 (2): 120～126
4S-Y Kung. VLSI Array Processors[M].Englewood Cliffs NJ:Prentice-Hall, 1988
5J-H Hong,P-Y Tsai,C-W Wu. Interleaving schemes for a systolic RSA public-key cryptosystem based on an improved Montgomery's algorithm[C].In:Proceedings of the 11th VLSI Design/CAD Symposium, 2000:163～ 166
6C D Walter. Systolic modular multiplication[J].IEEE Transactions on Computers, 1993 ;42(3) :376～378
7J-H Hong,C-W Wu. Cellular-array modular multiplier for fast RSA public-key cryptosystem based on modified booth's algorithm[J].IEEE Transactions on Very Large Scale Integration(VLSI)Systems,2003;11(3) :474～484
8C D Walter. Montgomery exponentiation needs no final subtractions[J].Electronics Letters, 1999;35(21 ): 1831～1832

引证文献2

1刘强,佟冬,程旭.蒙哥马利算法到脉动阵列的规范映射方法[J].计算机工程与应用,2004,40(34):1-2. 被引量：1
2陈弘毅,盖伟新.大数模幂乘运算的VLSI实现[J].电子学报,1999,27(2):8-17. 被引量：5

二级引证文献6

1黄谆,白国强,陈弘毅.大数模乘脉动阵列的FPGA细粒度映射实现[J].微电子学与计算机,2005,22(7):31-35. 被引量：2
2王超杰.一种新的模乘幂密码并行算法研究[J].廊坊师范学院学报（自然科学版）,2008,8(4):18-20.
3张淑芬,郝福珍.RSA算法在FPGA上的实现[J].计算机工程与设计,2010,31(13):2962-2965. 被引量：1
4方宁,曹卫兵,倪冬鹤,狄冠东.基于Android平台并行运算机制的密码运算加速方案[J].网络与信息安全学报,2019,5(1):50-55.
5刘贤锋,王喜成.线性串行模乘器的设计[J].桂林电子工业学院学报,2002,22(5):9-13. 被引量：2
6张怡浩,田则,于敦山,盛世敏.一种Montgomery模乘算法硬件实现的改进电路[J].北京大学学报（自然科学版）,2004,40(1):80-84. 被引量：1

1李树国,周润德.智能卡公钥密码体制的模乘器[J].清华大学学报（自然科学版）,2002,42(10):1419-1422. 被引量：1
2陈弘毅,盖伟新.大数模幂乘运算的VLSI实现[J].电子学报,1999,27(2):8-17. 被引量：5
3谢琪.一种高效群签名方案的密码学分析[J].电子与信息学报,2007,29(6):1511-1513. 被引量：1
4丁宏,陈勤.大数模幂乘动态匹配快速算法及其应用[J].小型微型计算机系统,2002,23(11):1398-1400. 被引量：6
5王慧,王云.一种改进的RSA模幂乘算法[J].网络安全技术与应用,2008(6):85-86. 被引量：1
6王刚.RSA密码系统有效算法在移动通信中的应用[J].南方职业教育学刊,2012,2(4):6-8.
7刘悦,李桂丽,田莹.大数模幂乘算法的快速实现[J].信息技术,2003,27(5):25-27. 被引量：2
8张德富,陈燕平,沈平.一种并行图象标记算法[J].电子学报,1994,22(5):20-24.
9鲁瑞兵,魏少军.深亚微米集成电路高层次设计方法进展[J].电子商务,1999(4):7-11. 被引量：2
10冯凤萍.低成本串行通讯网络LIN总线及其应用[J].应用能源技术,2004(3):45-46.

清华大学学报（自然科学版）

1998年第3期

浏览历史

内容加载中请稍等...

一种大数模乘运算的线性脉动阵列新结构被引量：2

参考文献1

同被引文献8

引证文献2

二级引证文献6

相关作者

相关机构

相关主题

浏览历史

一种大数模乘运算的线性脉动阵列新结构 被引量：2

参考文献1

同被引文献8

引证文献2

二级引证文献6

相关作者

相关机构

相关主题

浏览历史

一种大数模乘运算的线性脉动阵列新结构被引量：2