Inter-Picture Prediction in HEVC - High Efficiency Video Coding (HEVC)

Graphics Reference

In-Depth Information

abrupt changes in the field of motion parameters, which is common in natural video

content. One reason is that the quad-tree structure systematically does not allow for

joint description of child blocks that belong to different parent blocks. Also, the fact

that a block can only be divided into exactly four child blocks will eventually lead

to ineffective borders.

To remedy these inherent drawbacks of the quad-tree structure, HEVC uses block

merging which allows us to code motion parameters very cheaply (in terms of bit

rate) in these ineffective border situations [ 9 , 12 , 23 ]. The algorithm was inspired

by the work of [ 10 ], in which the authors show that rate-distortion optimized

tree pruning for quadtree-based motion models can be substantially improved by

introducing a subsequent leaf merging step. In [ 22 ], the authors study the benefits of

leaf merging on a broader theoretical basis. Here we leave it at the intuitive example

given above and concentrate on the integration of block merging into HEVC.

Following from the observation of ineffective borders, block merging introduces

a terse syntax allowing for a sub-block to explicitly reuse the exact same motion

parameters contained in neighboring blocks. Like AMVP, it compiles a list of

candidate motion parameter tuples by picking from neighboring blocks. Then, an

index is signaled which identifies the candidate to be used. Block merging also

allows for temporal prediction by including into the list a candidate obtained from

previously coded pictures. A more detailed description is given in the following.

5.2.2.2

Merge Candidate List Construction

Although they appear similar, there is one main difference between the AMVP

and the merge candidate list. The AMVP list only contains motion vectors for

one reference list while a merge candidate contains all motion data including

the information whether one or two reference picture lists are used as well as a

reference index and a motion vector for each list. This significantly reduces motion

data signaling overhead. Section 5.2.2.3 describes the signaling in detail as well as

discusses how parsing robustness is achieved. Overall, the merge candidate list is

constructed based on the following candidates:

up to four spatial merge candidates that are derived from five spatial neighboring

blocks

one temporal merge candidate derived from two temporal, co-located blocks

additional merge candidates including combined bi-predictive candidates and

zero motion vector candidates

Spatial Candidates

The first candidates in the merge candidate list are the spatial neighbors. Here, the

same neighboring blocks as for the spatial AMVP candidates are considered which

are described in Sect. 5.2.1.1 and illustrated in Fig. 5.4 b. In order to derive a list

of motion vector predictors for AMVP, one MVP is derived from A0 and A1 and

Search WWH ::

Custom Search

Home