Intelligent Video System - Advanced Video Coding Systems

Game Development Reference

In-Depth Information

Fig. 10.2 The effects of video compression quality on feature analysis, object detection, and face

recognition. a SIFT features are extracted on video frames with different definitions and different

QPs, and then the average similarity is calculated between SIFT features of the original frames and

those of the corresponding transformed frames. b In the object detection experiment, the well-known

deformable part model (DPM) trained by PASCAL VOC 2009 dataset (Felzenszwalb et al. 2010 )is

used as the detector for pedestrians and vehicles. c The face recognition algorithm (Su et al. 2009 )

is used

same time, however, such a video surveillance systemwould produce a larger amount

of video data, leading to much more expensive costs for the network bandwidth and

the storage system. To solve this problem, most of current video surveillance sys-

tems often compress a high-definition video of 1.5Gbps to 10Mbps by using H.264

high profile (H.264 HP for short) (Ayers and Shah 2001 ). Some systems even set

the compression rate to 300:1 or higher so as to further reduce the network band-

width and the storage capacity. Nevertheless, high compression ratio will inevitably

bring difficulties in visual feature extraction, consequently reducing the recognition

accuracy of video analysis and recognition more or less.

To evaluate the effects of video compression on typical analysis tasks, we con-

ducted several experiments on one 1080p uncompressed video selected from the

PKU-SVD-Adataset, whichwas then resized to different resolutions and compressed

with different quantization parameters (QPs). Generally, QP regulates howmuch spa-

tial detail is saved in the compression. When QP is very small, almost all that details

retain; As QP increases, some of that details are aggregated so that the bit rate drops,

at the price of increase in distortion and loss of quality. As shown in Fig. 10.2 ,we

can see that for all the three tasks, the performance is gradually going down with the

increase of QPs. In this case, the advantages of introducing high-definition cameras

would be fully offset by the used video coding technology.

In order to overcome the limitations of centralized intelligent video coding system,

the distributed intelligent video coding system should at least have the following two

features:

1. Recognition-friendliness should be its first and most important objective. In such

a smart system, the majority of coding bits should be used to represent the objects

or regions-of-interest (ROIs). It even can identify the clear facial images, extract

the features of the objects and track their trajectories in real time in the coding

process. These pieces of information are highly useful for further analysis and

recognition, yet cannot be easily obtained in the traditional surveillance video

systems.

Search WWH ::

Custom Search

Home