Scene Video Coding - Advanced Video Coding Systems - page 148

Game Development Reference

In-Depth Information

Table 8.7 Memory cost (byte) for each pixel in background modeling

Item

RA

GMM-1

GMM-5

MS

SWRA

Buffered pixel

1(char)

1(char)

1(char)

M(char)

1(char)

Mean values

1(float)

2(double)

2(double)

1

2(float)

Weight

0

1(double)

1(double)

0

2(char)

Threshold

0

1(double)

1(double)

0

1(float)

Match points

0

1(char)

1(char)

0

0

Tot a l

5

34

345 = 170

M = 120

14

8.4 Surveillance Coding in AVS

In AVS and AVS2 standard, the above segmentwise-weighted average background

model has been adopted (encoder) for background model-based surveillance video

coding methods. The related tools include background modeling, background pic-

ture (G-picture) based prediction, i.e., background-predictive picture (S-picture),

and background difference prediction (Zhang et al. 2014a ). Moreover, AVS2 also

supports the signalization of the regions of interest (ROI) in picture header. While

combining with the G/GB picture (MBG mechanism), surveillance video's ROI can

be easily detected from background subtraction. And then, following the signaliza-

tion method of ROI in AVS2, the decoder can obtain the ROI regions directly from

video stream without any complex operations. Therefore, ROI signalization com-

bined with MBG is one practical tool for fast ROI extraction directly from video

streams.

In this section, we firstly introduce the background model-based video codec

architecture. Secondly, we will introduce the novel tools for better utilizing theMBG,

i.e., background difference prediction and MVP related to MBG.

8.4.1 Video Codec Architecture

In AVS and AVS2, G-picture and S-picture are defined to further exploit the temporal

redundancy and suppress background noise. The G-picture is a special I-picture and

its reconstructed picture is stored in a separate background memory. The S-picture

is a special P-picture which can be only predicted from reconstructed G-picture. For

surveillance video coding, the G-picture can be a nondisplay MBGwhich is modeled

from input pictures and encoded into stream to guarantee the decoding match, and

then each P-picture can also utilize G-picture as prediction reference or be encoded

utilizing background difference prediction.

Next Page

Advanced Video Coding Systems

Search WWH ::

Custom Search

Home