In order to further improve the efficiency of video compression, we introduce a perceptual characteristics of Human Visual System (HVS) to video coding, and propose a novel video coding rate control algorithm based on...In order to further improve the efficiency of video compression, we introduce a perceptual characteristics of Human Visual System (HVS) to video coding, and propose a novel video coding rate control algorithm based on human visual saliency model in H.264/AVC. Firstly, we modifie Itti's saliency model. Secondly, target bits of each frame are allocated through the correlation of saliency region between the current and previous frame, and the complexity of each MB is modified through the saliency value and its Mean Absolute Difference (MAD) value. Lastly, the algorithm was implemented in JVT JM12.2. Simulation results show that, comparing with traditional rate control algorithm, the proposed one can reduce the coding bit rate and improve the reconstructed video subjective quality, especially for visual saliency region. It is very suitable for wireless video transmission.展开更多
Fine scalability can provide not only precise rate control for constant bitrate (CBR) traffic, but also accurate quality control for variable bitrate (VBR) traffic. Motion JPEG2000 is a codec that can provide fine sca...Fine scalability can provide not only precise rate control for constant bitrate (CBR) traffic, but also accurate quality control for variable bitrate (VBR) traffic. Motion JPEG2000 is a codec that can provide fine scalability with bitstreams. An efficient rate control approach utilizing a single buffer and two kinds of threshold for Motion JPEG2000 under resource constraint was proposed, which can offer good result in the constant quality video.展开更多
The JPEG2000 image compression standard is the powerful encoder which can provide phenomenal rate-control performance. The post-compression rate-distortion(PCRD) algorithm in JPEG2000 is not efficient. It requires enc...The JPEG2000 image compression standard is the powerful encoder which can provide phenomenal rate-control performance. The post-compression rate-distortion(PCRD) algorithm in JPEG2000 is not efficient. It requires encoding all coding passes even though a large contribution of them will not be contained in the final code-stream. Tier-1 encoding in the JPEG2000 standard takes a significant amount of memory and coding time. In this work, a low-complexity rate distortion method for JPEG2000 is proposed. It is relied on a reverse order for the resolution levels and the coding passes. The proposed algorithm encodes only the coding passes contained in the final code-stream and it does not need any post compression rate control part. The computational complexity of proposed algorithm is negligible, making it suitable to compression and attaining a significant performance. Simulations results show that the proposed algorithm obtained the PSNR values are comparable with the optimal PCRD.展开更多
This paper presents a new video coding system based on wavelet transform and its rate control scheme over ATM networks. First, three dimensional wavelet transform is performed for the original image sequence, and an e...This paper presents a new video coding system based on wavelet transform and its rate control scheme over ATM networks. First, three dimensional wavelet transform is performed for the original image sequence, and an extension of set partitioning in hierarchical trees algorithm is employed to quantize the wavelet coefficients. Then, the output rate of the coder is controlled at group of frame scale, ensuring that it conforms to the parameters of a leaky bucket controller. Several leaky buckets with different sizes are discussed too. Simulation shows the efficiency of this codec and the effectiveness of the proposed rate control scheme.展开更多
Rate control plays a critical role in achieving perceivable video quality under a variable bit rate,limited buffer sizes and low delay applications.Since a rate control system exhibits non-linear and unpredictable cha...Rate control plays a critical role in achieving perceivable video quality under a variable bit rate,limited buffer sizes and low delay applications.Since a rate control system exhibits non-linear and unpredictable characteristics,it is difficult to establish a very accurate rate-distortion(R-D)model and acquire effective rate control performance.Considering the excellent control ability and low computing complexity of the fuzzy logic in non-linear systems,this paper proposes a bitrate control algorithm based on a fuzzy controller,named the Fuzzy Rate Control Algorithm(FRCA),for All-Intra(AI)and low-delay(LD)video source coding.Contributions of the proposed FRCA mainly consist of four aspects.First,fuzzy logic is adopted to minimize the deviation between the actual and the target buffer size in the hypothetical reference decoder(HRD).Second,a fast lookup table is employed in fuzzy rate control,which reduces computing cost of the control process.Third,an input domain determination scheme is proposed to improve the precision of the fuzzy controller.Fourth,a novel scene change detection is introduced and integrated in the FRCA to adaptively adjust the Group-of-Pictures(GOP)length when the source content fluctuates.The FRCA can be transplanted and implemented in various industry coders.Extensive experiments show that the FRCA has accurate variable bit-rate control ability and maintains a steady buffer size during the encoding processes.Compared with the default configuration encoding under AI and LD,the proposed FRCA can achieve the target bit rates more accurately in various classical encoders.展开更多
For rate control (RC) of hierarchical structure coding, an independent rate-quantization (R-Q) model was proposed based on mean absolute differences (MADs) in different temporal levels (TLs). In the proposed R-Q model...For rate control (RC) of hierarchical structure coding, an independent rate-quantization (R-Q) model was proposed based on mean absolute differences (MADs) in different temporal levels (TLs). In the proposed R-Q model, a novel MAD model was developed according to the hierarchical structure. The experimental results demonstrate that the proposed algorithm provides better performance, in terms of average peak signal-to-noise ratio (PSNR) and quality smoothness, than the H.264 reference model, JM14.2, under various sequences.展开更多
传统编码器虽然具有可控的码率,但却无法有效控制编码视频的质量,存在随图像内容变化而产生抖动的缺陷。针对当前互联网带宽特性,如自适应码率(Adaptive Bitrate,ABR)网络带宽控制技术中带宽固定的限制条件比传统广播网松弛,在以失真为...传统编码器虽然具有可控的码率,但却无法有效控制编码视频的质量,存在随图像内容变化而产生抖动的缺陷。针对当前互联网带宽特性,如自适应码率(Adaptive Bitrate,ABR)网络带宽控制技术中带宽固定的限制条件比传统广播网松弛,在以失真为约束的条件下,提出了一种新的率失真优化的失真分配方案,根据每个编码单元的拉格朗日乘子与图像组(Group of Pictures,GOP)级别的乘子之间的相互关系模型,设计了以帧级为单元的失真分配策略。基于高效率视频编码(High Efficiency Video Coding,HEVC)模型随机编码结构的默认配置下,对通用测试条件中规定的标准测试序列,实验结果显示质量一致性限制的编码器率失真性能Bj ntegaard Delta-Peak Signal to Noise Rate(BD-PSNR)提升了0.057 dB,编码后的图像组失真的方差减小了50%,能有效地减少编码视频的质量抖动,具有更加平稳的编码质量。展开更多
为了使立体视频中的比特分配更加符合人眼视觉感知特性,提出了一种非对称质量的立体视频编码码率控制算法。首先,建立了左右帧的码率分配比例与量化参数差值之间的立体指数RRQ(Rate-ratio Quantization)模型。然后,将码率控制算法分为SG...为了使立体视频中的比特分配更加符合人眼视觉感知特性,提出了一种非对称质量的立体视频编码码率控制算法。首先,建立了左右帧的码率分配比例与量化参数差值之间的立体指数RRQ(Rate-ratio Quantization)模型。然后,将码率控制算法分为SGOP(Stereoscopic Group of Pictures)层、立体图像对层和帧层等3个码率控制层。在SGOP层计算每个SGOP的目标码率和关键帧的量化参数;在立体图像对层根据剩余比特数和缓冲区饱和度计算每个立体图像对的目标比特;在帧层则通过分析双目视觉掩蔽效应,用一种适合于立体视频的率失真优化方法合理分配左右帧的目标码率。实验结果表明,本文算法的码率控制偏差平均值为0.21%;立体视频客观质量比对称质量算法和Wang的算法分别提高了0.23dB和0.06dB,且质量波动较为稳定。因此,该算法基本满足网络带宽传输要求。由于充分利用了人眼双目视觉特性,可满足人们对立体视频的视觉需求。展开更多
基金supported by National Natural Science Foundation of China under Grant No.610700800973 Sub-Program Projects under Grant No.2009CB320906+3 种基金National Science and Technology of Major Special Projects under Grant No.2010ZX03004-003S&T Planning Project of Hubei Provincial Department of Education under Grant No. Q20112805H&SPlanning Project of Hubei Provincial Department of Education under Grant No.2011jyte142Science Foundation of HubeiProvincial under Grant No.2010CDB05103
文摘In order to further improve the efficiency of video compression, we introduce a perceptual characteristics of Human Visual System (HVS) to video coding, and propose a novel video coding rate control algorithm based on human visual saliency model in H.264/AVC. Firstly, we modifie Itti's saliency model. Secondly, target bits of each frame are allocated through the correlation of saliency region between the current and previous frame, and the complexity of each MB is modified through the saliency value and its Mean Absolute Difference (MAD) value. Lastly, the algorithm was implemented in JVT JM12.2. Simulation results show that, comparing with traditional rate control algorithm, the proposed one can reduce the coding bit rate and improve the reconstructed video subjective quality, especially for visual saliency region. It is very suitable for wireless video transmission.
文摘Fine scalability can provide not only precise rate control for constant bitrate (CBR) traffic, but also accurate quality control for variable bitrate (VBR) traffic. Motion JPEG2000 is a codec that can provide fine scalability with bitstreams. An efficient rate control approach utilizing a single buffer and two kinds of threshold for Motion JPEG2000 under resource constraint was proposed, which can offer good result in the constant quality video.
文摘The JPEG2000 image compression standard is the powerful encoder which can provide phenomenal rate-control performance. The post-compression rate-distortion(PCRD) algorithm in JPEG2000 is not efficient. It requires encoding all coding passes even though a large contribution of them will not be contained in the final code-stream. Tier-1 encoding in the JPEG2000 standard takes a significant amount of memory and coding time. In this work, a low-complexity rate distortion method for JPEG2000 is proposed. It is relied on a reverse order for the resolution levels and the coding passes. The proposed algorithm encodes only the coding passes contained in the final code-stream and it does not need any post compression rate control part. The computational complexity of proposed algorithm is negligible, making it suitable to compression and attaining a significant performance. Simulations results show that the proposed algorithm obtained the PSNR values are comparable with the optimal PCRD.
文摘This paper presents a new video coding system based on wavelet transform and its rate control scheme over ATM networks. First, three dimensional wavelet transform is performed for the original image sequence, and an extension of set partitioning in hierarchical trees algorithm is employed to quantize the wavelet coefficients. Then, the output rate of the coder is controlled at group of frame scale, ensuring that it conforms to the parameters of a leaky bucket controller. Several leaky buckets with different sizes are discussed too. Simulation shows the efficiency of this codec and the effectiveness of the proposed rate control scheme.
基金supported by ZTE Industry-Academia-Research Cooperation Funds under Grant No.CON1503180004the Postdoctoral Science Foundation of China under Gant No.2014M552342the Foundation of Science and Technology Department of Sichuan Province,China under Grant No.2014GZ0005
文摘Rate control plays a critical role in achieving perceivable video quality under a variable bit rate,limited buffer sizes and low delay applications.Since a rate control system exhibits non-linear and unpredictable characteristics,it is difficult to establish a very accurate rate-distortion(R-D)model and acquire effective rate control performance.Considering the excellent control ability and low computing complexity of the fuzzy logic in non-linear systems,this paper proposes a bitrate control algorithm based on a fuzzy controller,named the Fuzzy Rate Control Algorithm(FRCA),for All-Intra(AI)and low-delay(LD)video source coding.Contributions of the proposed FRCA mainly consist of four aspects.First,fuzzy logic is adopted to minimize the deviation between the actual and the target buffer size in the hypothetical reference decoder(HRD).Second,a fast lookup table is employed in fuzzy rate control,which reduces computing cost of the control process.Third,an input domain determination scheme is proposed to improve the precision of the fuzzy controller.Fourth,a novel scene change detection is introduced and integrated in the FRCA to adaptively adjust the Group-of-Pictures(GOP)length when the source content fluctuates.The FRCA can be transplanted and implemented in various industry coders.Extensive experiments show that the FRCA has accurate variable bit-rate control ability and maintains a steady buffer size during the encoding processes.Compared with the default configuration encoding under AI and LD,the proposed FRCA can achieve the target bit rates more accurately in various classical encoders.
基金National Natural Science Foundations of China (No. 60972035,No. 61074009)Natural Science Foundation Program of Shanghai,China ( No. 10ZR1432800)
文摘For rate control (RC) of hierarchical structure coding, an independent rate-quantization (R-Q) model was proposed based on mean absolute differences (MADs) in different temporal levels (TLs). In the proposed R-Q model, a novel MAD model was developed according to the hierarchical structure. The experimental results demonstrate that the proposed algorithm provides better performance, in terms of average peak signal-to-noise ratio (PSNR) and quality smoothness, than the H.264 reference model, JM14.2, under various sequences.
文摘传统编码器虽然具有可控的码率,但却无法有效控制编码视频的质量,存在随图像内容变化而产生抖动的缺陷。针对当前互联网带宽特性,如自适应码率(Adaptive Bitrate,ABR)网络带宽控制技术中带宽固定的限制条件比传统广播网松弛,在以失真为约束的条件下,提出了一种新的率失真优化的失真分配方案,根据每个编码单元的拉格朗日乘子与图像组(Group of Pictures,GOP)级别的乘子之间的相互关系模型,设计了以帧级为单元的失真分配策略。基于高效率视频编码(High Efficiency Video Coding,HEVC)模型随机编码结构的默认配置下,对通用测试条件中规定的标准测试序列,实验结果显示质量一致性限制的编码器率失真性能Bj ntegaard Delta-Peak Signal to Noise Rate(BD-PSNR)提升了0.057 dB,编码后的图像组失真的方差减小了50%,能有效地减少编码视频的质量抖动,具有更加平稳的编码质量。
文摘为了使立体视频中的比特分配更加符合人眼视觉感知特性,提出了一种非对称质量的立体视频编码码率控制算法。首先,建立了左右帧的码率分配比例与量化参数差值之间的立体指数RRQ(Rate-ratio Quantization)模型。然后,将码率控制算法分为SGOP(Stereoscopic Group of Pictures)层、立体图像对层和帧层等3个码率控制层。在SGOP层计算每个SGOP的目标码率和关键帧的量化参数;在立体图像对层根据剩余比特数和缓冲区饱和度计算每个立体图像对的目标比特;在帧层则通过分析双目视觉掩蔽效应,用一种适合于立体视频的率失真优化方法合理分配左右帧的目标码率。实验结果表明,本文算法的码率控制偏差平均值为0.21%;立体视频客观质量比对称质量算法和Wang的算法分别提高了0.23dB和0.06dB,且质量波动较为稳定。因此,该算法基本满足网络带宽传输要求。由于充分利用了人眼双目视觉特性,可满足人们对立体视频的视觉需求。