In order to further improve the efficiency of video compression, we introduce a perceptual characteristics of Human Visual System (HVS) to video coding, and propose a novel video coding rate control algorithm based on...In order to further improve the efficiency of video compression, we introduce a perceptual characteristics of Human Visual System (HVS) to video coding, and propose a novel video coding rate control algorithm based on human visual saliency model in H.264/AVC. Firstly, we modifie Itti's saliency model. Secondly, target bits of each frame are allocated through the correlation of saliency region between the current and previous frame, and the complexity of each MB is modified through the saliency value and its Mean Absolute Difference (MAD) value. Lastly, the algorithm was implemented in JVT JM12.2. Simulation results show that, comparing with traditional rate control algorithm, the proposed one can reduce the coding bit rate and improve the reconstructed video subjective quality, especially for visual saliency region. It is very suitable for wireless video transmission.展开更多
Intelligent video coding(IVC),which dates back to the late 1980s with the concept of encoding videos with knowledge and semantics,includes visual content compact representation models and methods enabling structural,d...Intelligent video coding(IVC),which dates back to the late 1980s with the concept of encoding videos with knowledge and semantics,includes visual content compact representation models and methods enabling structural,detailed descriptions of visual information at different granularity levels(i.e.,block,mesh,region,and object)and in different areas.It aims to support and facilitate a wide range of applications,such as visual media coding,content broadcasting,and ubiquitous multimedia computing.We present a high-level overview of the IVC technology from model-based coding(MBC)to learning-based coding(LBC).MBC mainly adopts a manually designed coding scheme to explicitly decompose videos to be coded into blocks or semantic components.Thanks to emerging deep learning technologies such as neural networks and generative models,LBC has become a rising topic in the coding area.In this paper,wefirst review the classical MBC approaches,followed by the LBC approaches for image and video data.We also discuss and overview our recent attempts at neural coding approaches,which are inspiring for both academic research and industrial implementation.Some critical yet less studied issues are discussed at the end of this paper.展开更多
提出一种基于人类视觉灵敏度与空间加权离散余弦系数差异度的显著性检测模型.该模型将图像块的离散余弦低频系数作为其特征向量,以取代颜色和亮度等基本特征.每个图像块的显著性不仅计算与其余所有图像块的空间加权特征差异度之和,还用...提出一种基于人类视觉灵敏度与空间加权离散余弦系数差异度的显著性检测模型.该模型将图像块的离散余弦低频系数作为其特征向量,以取代颜色和亮度等基本特征.每个图像块的显著性不仅计算与其余所有图像块的空间加权特征差异度之和,还用人类视觉灵敏度加权.通过与6种典型的显著性检测模型在3个眼动跟踪数据集上进行对比实验,结果表明,该模型显著性检测性能优于所有对比算法.此外,将该显著性检测模型用于新一代高效率视频编码(high efficiency video coding,HEVC)中也获得了很好的效果.展开更多
基金supported by National Natural Science Foundation of China under Grant No.610700800973 Sub-Program Projects under Grant No.2009CB320906+3 种基金National Science and Technology of Major Special Projects under Grant No.2010ZX03004-003S&T Planning Project of Hubei Provincial Department of Education under Grant No. Q20112805H&SPlanning Project of Hubei Provincial Department of Education under Grant No.2011jyte142Science Foundation of HubeiProvincial under Grant No.2010CDB05103
文摘In order to further improve the efficiency of video compression, we introduce a perceptual characteristics of Human Visual System (HVS) to video coding, and propose a novel video coding rate control algorithm based on human visual saliency model in H.264/AVC. Firstly, we modifie Itti's saliency model. Secondly, target bits of each frame are allocated through the correlation of saliency region between the current and previous frame, and the complexity of each MB is modified through the saliency value and its Mean Absolute Difference (MAD) value. Lastly, the algorithm was implemented in JVT JM12.2. Simulation results show that, comparing with traditional rate control algorithm, the proposed one can reduce the coding bit rate and improve the reconstructed video subjective quality, especially for visual saliency region. It is very suitable for wireless video transmission.
基金supported by the National Natural Science Foundation of China(Grant No.62025101,62088102,62101007 and 61931014)the Young Elite Scientist Sponsorship Program by the Beijing Association of Science and Technology(Grant No.BYSS2022019).
文摘Intelligent video coding(IVC),which dates back to the late 1980s with the concept of encoding videos with knowledge and semantics,includes visual content compact representation models and methods enabling structural,detailed descriptions of visual information at different granularity levels(i.e.,block,mesh,region,and object)and in different areas.It aims to support and facilitate a wide range of applications,such as visual media coding,content broadcasting,and ubiquitous multimedia computing.We present a high-level overview of the IVC technology from model-based coding(MBC)to learning-based coding(LBC).MBC mainly adopts a manually designed coding scheme to explicitly decompose videos to be coded into blocks or semantic components.Thanks to emerging deep learning technologies such as neural networks and generative models,LBC has become a rising topic in the coding area.In this paper,wefirst review the classical MBC approaches,followed by the LBC approaches for image and video data.We also discuss and overview our recent attempts at neural coding approaches,which are inspiring for both academic research and industrial implementation.Some critical yet less studied issues are discussed at the end of this paper.
文摘提出一种基于人类视觉灵敏度与空间加权离散余弦系数差异度的显著性检测模型.该模型将图像块的离散余弦低频系数作为其特征向量,以取代颜色和亮度等基本特征.每个图像块的显著性不仅计算与其余所有图像块的空间加权特征差异度之和,还用人类视觉灵敏度加权.通过与6种典型的显著性检测模型在3个眼动跟踪数据集上进行对比实验,结果表明,该模型显著性检测性能优于所有对比算法.此外,将该显著性检测模型用于新一代高效率视频编码(high efficiency video coding,HEVC)中也获得了很好的效果.
文摘针对高动态范围(HDR)视频较之于传统低动态范围(LDR)视频所需存储资源和传输带宽急剧增加的问题,本文提出了一种基于视觉感知特性的HDR视频编码的动态率失真优化算法,以提高高效视频编码(HEVC)Main 10编码HDR视频的性能。本文通过引入视觉选择性关注信息,对不同区域采取非均等的失真权重分配策略,优化常规的失真计算方法;同时,为了进一步去除视频中的感知冗余,融合视频内容的纹理特性自适应调节拉格朗日乘子,并应用于编码量化器动态调节量化参数,实现编码比特和失真感知权衡。实验结果表明:与HEVC Main 10相比,在相同HDR-VDP和PSNR DE质量指标下,所提算法平均节省7.46%和6.53%码率,最大分别节省18.52%和11.49%,所提算法在保持视觉质量的前提下能够有效降低码率。