Mapping GOPS in an Improved DVC to H.264 Video Transcoder - Advanced Concepts for Intelligent Vision Systems

Information Technology Reference

In-Depth Information

2 Related Work

The main task of a transcoder is to convert from a source format to another one; this

task must be executed as efficiently as possible though. Consequently, transcoding

techniques focus on improving the second stage by using information gathered during

the first one. The key issues to manage are the time reduction and the quality - bitrate

penalty. Transcoding algorithms between traditional video coding standards are more

suitable to be used because both formats keep many features whereas DVC is more

different.

Firstly, in 2005 [6] WZ coding was proposed as a candidate in a transcoding scene,

however it only introduced the idea and the benefits of this new transcoding paradigm to

support low cost video communications but it did not offer a practical implementation.

The first WZ transcoder architecture was presented in 2008 by Peixoto et al. in [3]. In

this approach the authors designed a WZ to H.263 transcoder to support mobile

communications. This transcoder makes a mapping between WZ and H.263 GOPs,

including some Motion Estimation (ME) refinement for P and B slices.

In our previous work, we proposed the first transcoding architecture between WZ

and H.264 [4] available in the literature. This work improves the H.264 ME using the

Motion Vectors (MV) calculated in the Side Information (SI) to reduce the H.264

searching area with negligible RD impact. Nevertheless, in the previous approach we

employed a WZ GOP size of 2 to be transcoded to a H.264 GOP size of 2 which

means transcoding from K - WZ - K DVC pattern to I - P - I H.264 pattern. This

solution is not very useful in a real implementation due to the high bitrate generated

because one of every two frames is an I-frame. Moreover the refinement technique is

improved for P frames and generalized for longer H.264 patterns. Other

improvements of this approach with respect to the previous one is related to the DVC

implementation, as, in present work, it is based on VISNET-II project [7] which is

more realistic than the architecture used in [4] because it implements lossy key frame

coding, on-line correlation noisy modeling and do not use the ideal procedure call at

decoder for the stopping criterion. In other words, this work extends the approach

presented in [4] to a more realistic GOP size and format implementation. Moreover,

this work is a generalization to support whatever GOP size or format incoming from

DVC stage to be transcoded to I11P GOP pattern using a low complexity algorithm.

3 Proposed Video Transcoder

In a real scenario, video transcoding should be able to convert efficiently different

patterns. For this reason, the main aim of the proposal is to provide an architecture

which supports practical GOP patterns making efficiently the transcoding process

through exploiting the information that the WZ decoding algorithm provide in order

to reduce the H.264 encoding time on the ME process.

Mobile-to-mobile communications need to execute low complexity algorithms at

both ends. In the proposed architecture the source employs the DVC encoders and the

destination employs the H.264 decoder, so terminal devices support the lower

complexity parts in both paradigms (as Figure 1 shows). On the other hand, the

Search WWH ::

Custom Search

Home