期刊文献+
共找到8篇文章
< 1 >
每页显示 20 50 100
Parametric Study of Two-Body Floating-Point Wave Absorber 被引量:6
1
作者 Atena Amiri Roozbeh Panahi Soheil Radfar 《Journal of Marine Science and Application》 CSCD 2016年第1期41-49,共9页
In this paper, we present a comprehensive numerical simulation of a point wave absorber in deep water. Analyses are performed in both the frequency and time domains. The converter is a two-body floating-point absorber... In this paper, we present a comprehensive numerical simulation of a point wave absorber in deep water. Analyses are performed in both the frequency and time domains. The converter is a two-body floating-point absorber (FPA) with one degree of freedom in the heave direction. Its two parts are connected by a linear mass-spring-damper system. The commercial ANSYS-AQWA software used in this study performs well in considering validations. The velocity potential is obtained by assuming incompressible and irrotational flow. As such, we investigated the effects of wave characteristics on energy conversion and device efficiency, including wave height and wave period, as well as the device diameter, draft, geometry, and damping coefficient. To validate the model, we compared our numerical results with those from similar experiments. Our study results can clearly help to maximize the converter's efficiency when considering specific conditions. 展开更多
关键词 floating-point absorber wave energy energy absorption Wave Energy Converter (WEC) Power Take Off (PTO) numerical simulation
在线阅读 下载PDF
ASIC Design of Floating-Point FFT Processor 被引量:2
2
作者 陈禾 赵忠武 《Journal of Beijing Institute of Technology》 EI CAS 2004年第4期389-393,共5页
An application specific integrated circuit (ASIC) design of a 1024 points floating-point fast Fourier transform(FFT) processor is presented. It can satisfy the requirement of high accuracy FFT result in related fields... An application specific integrated circuit (ASIC) design of a 1024 points floating-point fast Fourier transform(FFT) processor is presented. It can satisfy the requirement of high accuracy FFT result in related fields. Several novel design techniques for floating-point adder and multiplier are introduced in detail to enhance the speed of the system. At the same time, the power consumption is decreased. The hardware area is effectively reduced as an improved butterfly processor is developed. There is a substantial increase in the performance of the design since a pipelined architecture is adopted, and very large scale integrated (VLSI) is easy to realize due to the regularity. A result of validation using field programmable gate array (FPGA) is shown at the end. When the system clock is set to 50 MHz, 204.8 μs is needed to complete the operation of FFT computation. 展开更多
关键词 application specific integrated circuit(ASIC) fast Fourier transform(FFT) floating-point PIPELINE very large scale integrated(VLSI)
在线阅读 下载PDF
Design of area and power efficient Radix-4 DIT FFT butterfly unit using floating point fused arithmetic 被引量:2
3
作者 Prabhu E Mangalam H Karthick S 《Journal of Central South University》 SCIE EI CAS CSCD 2016年第7期1669-1681,共13页
In this work, power efficient butterfly unit based FFT architecture is presented. The butterfly unit is designed using floating-point fused arithmetic units. The fused arithmetic units include two-term dot product uni... In this work, power efficient butterfly unit based FFT architecture is presented. The butterfly unit is designed using floating-point fused arithmetic units. The fused arithmetic units include two-term dot product unit and add-subtract unit. In these arithmetic units, operations are performed over complex data values. A modified fused floating-point two-term dot product and an enhanced model for the Radix-4 FFT butterfly unit are proposed. The modified fused two-term dot product is designed using Radix-16 booth multiplier. Radix-16 booth multiplier will reduce the switching activities compared to Radix-8 booth multiplier in existing system and also will reduce the area required. The proposed architecture is implemented efficiently for Radix-4 decimation in time(DIT) FFT butterfly with the two floating-point fused arithmetic units. The proposed enhanced architecture is synthesized, implemented, placed and routed on a FPGA device using Xilinx ISE tool. It is observed that the Radix-4 DIT fused floating-point FFT butterfly requires 50.17% less space and 12.16% reduced power compared to the existing methods and the proposed enhanced model requires 49.82% less space on the FPGA device compared to the proposed design. Also, reduced power consumption is addressed by utilizing the reusability technique, which results in 11.42% of power reduction of the enhanced model compared to the proposed design. 展开更多
关键词 floating-point arithmetic floating-point fused dot product Radix-16 booth multiplier Radix-4 FFT butterfly fast fouriertransform decimation in time
在线阅读 下载PDF
An improved boundary element method for modelling a self-reacting point absorber wave energy converter 被引量:1
4
作者 Qian-Long Xu Ye Li Zhi-Liang Lin 《Acta Mechanica Sinica》 SCIE EI CAS CSCD 2018年第6期1015-1034,共20页
A numerical model based on a boundary element method (BEM) is developed to predict the performance of two-body selfreacting floating-point absorber (SRFPA) wave energy systems that operate predominantly in heave.The k... A numerical model based on a boundary element method (BEM) is developed to predict the performance of two-body selfreacting floating-point absorber (SRFPA) wave energy systems that operate predominantly in heave.The key numerical issues in applying the BEM are systematically discussed.In particular,some improvements and simplifications in the numerical scheme are developed to evaluate the free surface Green's function,which is a main element of difficulty in the BEM.For a locked SRFPA system,the present method is compared with the existing experiment and the Reynolds-averaged NavierStokes (RANS)-based method,where it is shown that the inviscid assumption leads to substantial over-prediction of the heave response.For the unlocked SRFPA model we study in this paper,the additional viscous damping primarily induced by flow separation and vortex shedding,is modelled as a quadratic drag force,which is proportional to the square of body velocity.The inclusion of viscous drag in present method significantly improves the prediction of the heave responses and the power absorption performance of the SRFPA system,obtaining results excellent agreement with experimental data and the RANS simulation results over a broad range of incident wave periods,except near resonance in larger wave height scenarios.It is found that the wave overtopping and the re-entering impact of out-of-water floating body are observed more frequently in larger waves,where these non-linear effects are the dominant damping sources and could significantly reduce the power output and the motion responses of the SRFPA system. 展开更多
关键词 BOUNDARY element method floating-point ABSORBER HEAVE Green's function VISCOUS drag
在线阅读 下载PDF
Adaptive point cloud compression based on precision‑aware floating‑point encoding
5
作者 Yanpeng Han Yizhuo Wang +2 位作者 Fawang Liu Jianhua Gao Weixing Ji 《CCF Transactions on High Performance Computing》 2025年第4期349-364,共16页
In distributed autonomous driving simulation systems,the autonomous driving algorithm and the simulator are usually deployed on different nodes.The simulator sends real-time sensor data,including 3D point clouds,to th... In distributed autonomous driving simulation systems,the autonomous driving algorithm and the simulator are usually deployed on different nodes.The simulator sends real-time sensor data,including 3D point clouds,to the algorithm.3D point clouds captured by LiDAR(Light Detection and Ranging)are large and require high transmission performance.Insuf-ficient bandwidth can significantly increase latency in point cloud transmission.This paper proposes a precision-aware floating-point encoding method to reduce the data size of the point cloud with an acceptable level of error while maintain-ing brilliant performance.Point cloud precision and spatial distribution exhibit direct dependencies on LiDAR configura-tions,while network transmission demonstrates dynamic bandwidth variations.This paper proposes a precision-adaptive floating-point compression framework that enables real-time adaptation of point cloud representations through coordinated analysis of LiDAR parameters and network conditions.Experimental evaluation demonstrates substantial latency reduction(up to 56.2%)under constrained bandwidth scenarios,and improved system resilience against network fluctuations through dynamic bitrate adaptation. 展开更多
关键词 Autonomous driving simulation floating-point compression Point cloud Network-adaptive transmission
在线阅读 下载PDF
An empirical study of error‑free transformations for enhancing mathematical function precision
6
作者 Dongting Chen Jie Shen +1 位作者 Chun Huang Xin Yi 《CCF Transactions on High Performance Computing》 2025年第3期194-210,共17页
In numerical computation,the inherent rounding errors of floating-point operations often affect the precision of mathematical functions.The use of high-precision achieved through software-dependent simulation for prec... In numerical computation,the inherent rounding errors of floating-point operations often affect the precision of mathematical functions.The use of high-precision achieved through software-dependent simulation for precision compensation may result in significant performance overhead.Error-free transformations(EFT)technology,based on hardware-supported precision to approximate high-precision implementation,can effectively balance accuracy and performance.However,enhancing the precision of mathematical functions is a very complex and challenging issue.There is a lack of relevant research on when EFT technology can be used to improve the precision of mathematical functions,what effects can be achieved,and what impact it may have on program performance.In this work,we present an empirical study on the applicability and effectiveness of using error-free transformations(EFT)in floating-point computation to assess their potential and limitations in improving precision over mathematical functions.We select 42 mathematical functions from the GNU Scientific Library(GSL),known for significant rounding errors.We evaluate the EFT techniques from three aspects:the applicability of EFT for different mathematical functions(especially at the maximum error point and its vicinity),the precision improvement of EFT in input domains near the error-triggering input,and the performance of EFT compared with the high-precision versions.Experimental results show that EFT has advantages in reducing floating-point errors across 27 functions.Furthermore,while improving the accuracy of mathematical functions within specific input ranges near the maximum error input,EFT achieves a 10.92×speedup compared to long double precision and a 2426.3×speedup compared to mpmath.These findings suggest that EFT achieves computational accuracy to the real results with much lower overhead than conventional high-precision calculations,which makes EFT a promising technology for balancing accuracy and performance in high performance computing. 展开更多
关键词 Mathematical functions floating-point error Error-free transformations
在线阅读 下载PDF
Model adjointization and its cost 被引量:2
7
作者 CHENGQiang ZHANGLinbo WANGBin 《Science in China(Series F)》 2004年第5期587-611,共25页
In this article, the least program behavior decomposition method (LPBD) is put forward from a program structure point of view. This method can be extensively used both in algorithms of automatic differentiation (AD) a... In this article, the least program behavior decomposition method (LPBD) is put forward from a program structure point of view. This method can be extensively used both in algorithms of automatic differentiation (AD) and in tools design, and does not require programs to be evenly separable but the cost in terms of operations count and memory is similar to methods using checkpointing. This article starts by summarizing the rules of adjointization and then presents the implementation of LPBD. Next, the definition of the separable program space, based on the fundamental assumptions (FA) of automatic differentiation, is given and the differentiation cost functions are derived. Also, two constants of fundamental importance in AD, s and m, are derived under FA. Under the assumption of even separability, the adjoint cost of simple and deep decomposition is subsequently discussed quantitatively using checkpointing. Finally, the adjoint costs in terms of operations count and memory through the LPBD method are shown to be uniformly dependent on the depth of structure or decomposition. 展开更多
关键词 automatic differentiation model adjointization least program behavior floating-point operations.
原文传递
Parallel Error Detection for Leading Zero Anticipation 被引量:1
8
作者 张戈 胡伟武 齐子初 《Journal of Computer Science & Technology》 SCIE EI CSCD 2006年第6期901-906,共6页
The algorithm and its implementation of the leading zero anticipation (LZA) are very vital for the performance of a high-speed floating-point adder in today's state of art microprocessor design. Unfortunately, in p... The algorithm and its implementation of the leading zero anticipation (LZA) are very vital for the performance of a high-speed floating-point adder in today's state of art microprocessor design. Unfortunately, in predicting "shift amount" by a conventional LZA design, the result could be off by one position. This paper presents a novel parallel error detection algorithm for a general-case LZA. The proposed approach enables parallel execution of conventional LZA and its error detection, so that the error-indicatlon signal can be generated earlier in the stage of normalization, thus reducing the critical path and improving overall performance. The circuit implementation of this algorithm also shows its advantages of area and power compared with other previous work. 展开更多
关键词 computer arithmetic floating-point addition leading zero anticipation
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部