Hot Research Topics in Video Coding and Systems - Advanced Video Coding Systems

Game Development Reference

In-Depth Information

thresholds are derived from the spatial or transform domain of the original frame.

The residuals that are lower than the JND thresholds can be ignored, as the perceptual

quality would not be influenced. By contrast, when the residuals are larger than the

JND threshold, the JND threshold is subtracted from the prediction residual, which

can efficiently reduce the energy of transform coefficients.

In this way, the coding efficiency can be improved from the perceptual quality

point of view. Moreover, the JND-based residual filtering has another advantage

of reducing the computational complexity of the encoder, as fewer coefficients are

coded compared to the conventional coding scheme.

12.2.2.2 Perceptual Quantization

The main source of distortion in video coding is the quantization process, which

directly determines the reconstructed video quality with the quantization parameter.

Therefore, how to select the optimal quantization parameter for each coding frame,

unit, and frequency has been a hot research topic in perceptual video coding.

As the visual sensitivity can be reflected by the JND threshold, the local quanti-

zation parameter adjustment scheme with JND is proposed. For example, in Chen

and Guillemot ( 2010 ), researchers applied the FJND model in the quantization para-

meter adjustment process. For the MB with lower JND value, smaller quantization

parameter should be used, as it is more sensitive to the HVS.

Though JND can efficiently characterize the near-threshold distortions, in video

coding, the distortion is usually visible. Therefore, it is more accurate to formu-

late the visual sensitivity from the suprathreshold computational models. In Tang

et al. ( 2006 ), the spatial-temporal sensitivity models are applied in rate control. In

Chen et al. ( 2012 ), adaptive quantizationmatrix selection and updatingmethod is pro-

posed, in which the macroblock is classified into three types: no frequency weighting,

detailed preserving, and undetailed. In Wang et al. ( 2013 ), the divisive normalization

idea is applied in video coding, where the frame residuals are transformed into a

perceptually uniform space by adjusting the quantization parameters for each MB.

The proposed framework is shown in Fig. 12.3 , and it has also been implemented

into AVS2 coding standard. Experimental results show that 5-7% bitrate reduction

can be achieved in terms of SSIM.

The nonuniform distribution of the cone receptors and ganglion cells in the retina

makes the attention point sampledwith the highest resolution, and it rapidly decreases

Fig. 12.3 Framework of the divisive normalization-based perceptual video coding framework

Search WWH ::

Custom Search

Home