Block Structures and Parallelism Features in HEVC - High Efficiency Video Coding (HEVC)

Graphics Reference

In-Depth Information

high-definition (HD) video content, with typical picture resolutions of 1280 720 or

1920 1080 luma samples, they have been primarily designed for video resolutions

ranging from QCIF (176 144 luma samples) to standard definition (720 480 or

720 576 luma samples). Due to the popularity of HD video and the growing interest

in Ultra HD (UHD) formats [ 13 ] with resolutions of, for example, 3840 2160 or

even 7680 4320 luma samples, HEVC [ 18 ] has been designed with a focus on

high resolution video. However, for such large picture resolutions, restricting the

largest block size that can be used for signaling prediction parameters to 16 16

luma samples as in prior video coding standards is inefficient in rate-distortion

sense [ 4 , 26 , 37 ]. For typical HD or UHD video content, many picture areas that

can be described by the same motion parameters are much larger than blocks of

16 16 luma samples. Signaling a coding mode for each 16 16 macroblock

would already require a substantial amount of the target bit rate. Furthermore, due

to the increased spatial correlation between neighboring samples in high-resolution

video, using transform sizes larger than 16 16 for coding the residual signal can

also be advantageous for many image parts. The support of larger block sizes for

intra-picture prediction, motion-compensated prediction and transform coding was

one of the key aspects in many proposals for HEVC, for example [ 25 , 27 , 44 ].

Even though the coding of HD and UHD video was one important aspect in

the HEVC development, the standard has been designed to provide an improved

coding efficiency relative to its predecessor H.264 j MPEG-4 AVC for all existing

video coding applications. While increasing the size of the largest supported block

size is advantageous for high-resolution video, it may have a negative impact on

coding efficiency for low-resolution video, in particular if low-complexity encoder

implementations are used that are not capable of evaluating all supported sub-

partitioning modes. For this reason, HEVC includes a flexible mechanism for

partitioning video pictures into basic processing units of variable sizes.

As already mentioned, in HEVC, each picture is partitioned into square-shaped

coding tree blocks (CTBs) such that the resulting number of CTBs is identical

for both the luma and chroma picture components (assuming a non-monochrome

video format). 2 Consequently, each CTB of luma samples together with its two

corresponding CTBs of chroma samples and the syntax associated with these sample

blocks is subsumed under a so-called coding tree unit (CTU). A CTU represents

the basic processing unit in HEVC and is in that regard similar to the concept

of a macroblock in prior video coding standards. The luma CTB covers a square

picture area of 2 N 2 N luma samples. In the 4:2:0 chroma sampling format, each

of the two chroma CTBs covers the corresponding area of 2 N 1 2 N 1 chroma

samples of one of the two chroma components. The parameter N is transmitted in

the sequence parameter set and can be chosen by the encoder among the values

N D 4, 5, and 6, corresponding to CTU sizes of 16 16, 32 32,and64 64

2 The profiles defined in version 1 of the HEVC standard [ 18 ] only support video in the 4:2:0

chroma sampling format; monochrome video is not supported in these profiles.

Search WWH ::

Custom Search

Home