In this paper we present a motion compensation (MC) design for the newest Audio Video coding Standard (AVS) of China. Because of compression-efficient techniques of variable block size (VBS) and sub-pixel interpolatio...In this paper we present a motion compensation (MC) design for the newest Audio Video coding Standard (AVS) of China. Because of compression-efficient techniques of variable block size (VBS) and sub-pixel interpolation, intensive pixel calculation and huge memory access are required. We propose a parallel serial filtering mixed luma interpolation data flow and a three-stage multiplication free chroma interpolation scheme. Compared to the conventional designs, the integrated architecture supports about 2.7 times filtering throughput. The proposed MC design utilizes Vertical Z processing order for reference data re-use and saves up to 30% memory bandwidth. The whole design requires 44.3k gates when synthesized at 108 MHz clock frequency using 0.18-μm CMOS technology and can support up to 1920×1088@30 fps AVS HDTV video decoding.展开更多
Recently,human motion prediction has gained significant attention and achieved notable success.However,current methods primarily rely on training and testing with ideal datasets,overlooking the impact of variations in...Recently,human motion prediction has gained significant attention and achieved notable success.However,current methods primarily rely on training and testing with ideal datasets,overlooking the impact of variations in the viewing distance and viewing angle,which are commonly encountered in practical scenarios.In this study,we address the issue of model invariance by ensuring robust performance despite variations in view distances and angles.To achieve this,we employed Riemannian geometry methods to constrain the learning process of neural networks,enabling the prediction of invariances using a simple network.Furthermore,this enhances the application of motion prediction in various scenarios.Our framework uses Riemannian geometry to encode motion into a novel motion space to achieve prediction with an invariant viewing distance and angle using a simple network.Specifically,the specified path transport square-root velocity function is proposed to aid in removing the view-angle equivalence class and encode motion sequences into a flattened space.Motion coding by the geometry method linearizes the optimization problem in a non-flattened space and effectively extracts motion information,allowing the proposed method to achieve competitive performance using a simple network.Experimental results on Human 3.6M and CMU MoCap demonstrate that the proposed framework has competitive performance and invariance to the viewing distance and viewing angle.展开更多
A series solution for surface motion amplification due to underground group cavities for incident plane P waves is derived by Fourier-Bessel series expansion method. It is shown that underground group cavities signifi...A series solution for surface motion amplification due to underground group cavities for incident plane P waves is derived by Fourier-Bessel series expansion method. It is shown that underground group cavities significantly am-plify the surface ground motion nearby. It is suggested that the effect of subways on ground motion should be con-sidered when the subways are planned and designed.展开更多
Aiming at the higher bit-rate occupation of motion vector encoding and more time load of full-searching strategies, a multi-resolution motion estimation and compensation algorithm based on adjacent prediction of frame...Aiming at the higher bit-rate occupation of motion vector encoding and more time load of full-searching strategies, a multi-resolution motion estimation and compensation algorithm based on adjacent prediction of frame difference was proposed.Differential motion detection was employed to image sequences and proper threshold was adopted to identify the connected region.Then the motion region was extracted to carry out motion estimation and motion compensation on it.The experiment results show that the encoding efficiency of motion vector is promoted, the complexity of motion estimation is reduced and the quality of the reconstruction image at the same bit-rate as Multi-Resolution Motion Estimation(MRME) is improved.展开更多
Fine scalability can provide not only precise rate control for constant bitrate (CBR) traffic, but also accurate quality control for variable bitrate (VBR) traffic. Motion JPEG2000 is a codec that can provide fine sca...Fine scalability can provide not only precise rate control for constant bitrate (CBR) traffic, but also accurate quality control for variable bitrate (VBR) traffic. Motion JPEG2000 is a codec that can provide fine scalability with bitstreams. An efficient rate control approach utilizing a single buffer and two kinds of threshold for Motion JPEG2000 under resource constraint was proposed, which can offer good result in the constant quality video.展开更多
This paper proposes a motion-based region growing segmentation scheme for the object-based video coding, which segments an image into homogeneous regions characterized by a coherent motion. It adopts a block matching ...This paper proposes a motion-based region growing segmentation scheme for the object-based video coding, which segments an image into homogeneous regions characterized by a coherent motion. It adopts a block matching algorithm to estimate motion vectors and uses morphological tools such as open-close by reconstruction and the region-growing version of the watershed algorithm for spatial segmentation to improve the temporal segmentation. In order to determine the reliable motion vectors, this paper also proposes a change detection algorithm and a multi-candidate pro- screening motion estimation method. Preliminary simulation results demonstrate that the proposed scheme is feasible. The main advantage of the scheme is its low computational load.展开更多
基金(No. Y106574) supported by the Natural Science Foundationof Zhejiang Province, China
文摘In this paper we present a motion compensation (MC) design for the newest Audio Video coding Standard (AVS) of China. Because of compression-efficient techniques of variable block size (VBS) and sub-pixel interpolation, intensive pixel calculation and huge memory access are required. We propose a parallel serial filtering mixed luma interpolation data flow and a three-stage multiplication free chroma interpolation scheme. Compared to the conventional designs, the integrated architecture supports about 2.7 times filtering throughput. The proposed MC design utilizes Vertical Z processing order for reference data re-use and saves up to 30% memory bandwidth. The whole design requires 44.3k gates when synthesized at 108 MHz clock frequency using 0.18-μm CMOS technology and can support up to 1920×1088@30 fps AVS HDTV video decoding.
基金supported by the Beijing Municipal Science and Technology Commission and Zhongguancun Science Park Management Committee,No.Z221100002722020National Nature Science Foundation of China,No.62072045Innovation Transfer Fund of Peking University Third Hospital,No.BYSYZHKC2021110。
文摘Recently,human motion prediction has gained significant attention and achieved notable success.However,current methods primarily rely on training and testing with ideal datasets,overlooking the impact of variations in the viewing distance and viewing angle,which are commonly encountered in practical scenarios.In this study,we address the issue of model invariance by ensuring robust performance despite variations in view distances and angles.To achieve this,we employed Riemannian geometry methods to constrain the learning process of neural networks,enabling the prediction of invariances using a simple network.Furthermore,this enhances the application of motion prediction in various scenarios.Our framework uses Riemannian geometry to encode motion into a novel motion space to achieve prediction with an invariant viewing distance and angle using a simple network.Specifically,the specified path transport square-root velocity function is proposed to aid in removing the view-angle equivalence class and encode motion sequences into a flattened space.Motion coding by the geometry method linearizes the optimization problem in a non-flattened space and effectively extracts motion information,allowing the proposed method to achieve competitive performance using a simple network.Experimental results on Human 3.6M and CMU MoCap demonstrate that the proposed framework has competitive performance and invariance to the viewing distance and viewing angle.
基金Supported by National Natural Science Foundation of China (50378063), Excellent Young Teachers Program of MOE and SRF for ROCS, MOE.
文摘A series solution for surface motion amplification due to underground group cavities for incident plane P waves is derived by Fourier-Bessel series expansion method. It is shown that underground group cavities significantly am-plify the surface ground motion nearby. It is suggested that the effect of subways on ground motion should be con-sidered when the subways are planned and designed.
基金Supported by the National Natural Science Foundation of China (No. 60803036)the Scientific Research Fund of Heilongjiang Provincial Education Department (No.11531013)
文摘Aiming at the higher bit-rate occupation of motion vector encoding and more time load of full-searching strategies, a multi-resolution motion estimation and compensation algorithm based on adjacent prediction of frame difference was proposed.Differential motion detection was employed to image sequences and proper threshold was adopted to identify the connected region.Then the motion region was extracted to carry out motion estimation and motion compensation on it.The experiment results show that the encoding efficiency of motion vector is promoted, the complexity of motion estimation is reduced and the quality of the reconstruction image at the same bit-rate as Multi-Resolution Motion Estimation(MRME) is improved.
文摘Fine scalability can provide not only precise rate control for constant bitrate (CBR) traffic, but also accurate quality control for variable bitrate (VBR) traffic. Motion JPEG2000 is a codec that can provide fine scalability with bitstreams. An efficient rate control approach utilizing a single buffer and two kinds of threshold for Motion JPEG2000 under resource constraint was proposed, which can offer good result in the constant quality video.
文摘This paper proposes a motion-based region growing segmentation scheme for the object-based video coding, which segments an image into homogeneous regions characterized by a coherent motion. It adopts a block matching algorithm to estimate motion vectors and uses morphological tools such as open-close by reconstruction and the region-growing version of the watershed algorithm for spatial segmentation to improve the temporal segmentation. In order to determine the reliable motion vectors, this paper also proposes a change detection algorithm and a multi-candidate pro- screening motion estimation method. Preliminary simulation results demonstrate that the proposed scheme is feasible. The main advantage of the scheme is its low computational load.