Full Parallax 3D Video Content Compression - Novel 3D Media Technologies

Image Processing Reference

In-Depth Information

combined with a Principal Component Analysis (PCA, also called Karhunen-

Loeve Transform or Hotelling Transform). DWT is applied to viewpoint images,

and then PCA is applied to the resulting coefficients. Several kinds of DWT filters

are proposed (e.g., Dauchechies wavelets). In [ 27 ], the SPIHT (Set Partitioning in

Hierarchical Trees) method allows to display/transmit progressively the integral

image as a quality scalable bitstream. Two algorithms (2D and 3D) are proposed.

The first one is a 2D-DWT applied to the integral image and followed by the

2D-SPIHT. The second is based on the creation of a volume of viewpoint images on

which a 3D-DWT is applied and followed by 3D-SPIHT.

Elemental Images-Based Methods Another approach consists in encoding the

viewpoint images or the MIs of a still integral image as if they were a video

sequence (called pseudo video sequence) and then exploiting the temporal com-

pression tools of traditional video coders [ 28 , 29 ]. This approach shares some ideas

with the next two: multi-view and self-similarity. The method proposed in [ 30 ]

exploits the inter-MIs redundancies (using the optical characteristic that MIs have

overlapping zones). In [ 31 ], a KLT is applied to the viewpoint images.

Multi-view We consider here the integral image as a group of viewpoint images

that is encoded as a multi-view sequence (using inter-view prediction). In [ 32 ] and

[ 33 ], the viewpoint images are encoded using MVC encoder [ 34 ]. The exploitation

of temporal correlation and inter-view correlations induces an increase in complex-

ity. The evolutionary strategy (ES) proposed in [ 35 ] is based on the evolution theory

and allows an optimization of the coding scheme. In [ 36 ], ES is also applied and

combined to a half-pixel precision for

the motion/disparity prediction and

compensation.

Self-Similarity The method described in this part exploits the non-local spatial

correlation between MIs. The algorithm is mainly the same as for the inter predic-

tion modes (of H.264/AVC [ 37 ] and HEVC [ 38 ]) but within one frame. A block

matching algorithm is used to find a block similar to the current block in the causal

zone in the current frame (which corresponds to the blocks that have already been

coded and reconstructed). This similar block is then used as a predictor in the same

manner as for a temporal prediction and compensation. In [ 39 ] and [ 40 ], the

implementation in H.264/AVC of the INTRA_SS (for INTRA Self Similarity)

modes is described. These publications show the BD-rate gain brought by the SS

mode and also the interest of a precise partitioning in macro-blocks. In [ 41 ], the SS

mode is implemented in HEVC and the interest of the CU partitioning is shown.

Conti et al. [ 42 ] shows the BD-rate gain brought by this method for video sequences

(i.e., with a competition with inter-prediction modes). In [ 43 ] a scalable coding

scheme is described as follows: the layer 0 corresponds to an extracted view, the

layer 1 corresponds to a set of extracted views, and the layer 2 is the integral image.

Layers 0 and 1 are encoded, respectively, with reference HEVC and MV-HEVC

encoders. For layer 2 the self-similarity method is used. An inter-layer prediction is

Search WWH ::

Custom Search

Home