Features and Matching - Computer Vision for Visual Effects

Graphics Reference

In-Depth Information

where the integers m , n

0

specify the degree of themoment for each color channel.Mindruet al. showed various

ratios of sums and products of the color moments to be invariant to both geometric

(e.g., affine) and photometric transformations of an image, which can be concate-

nated to form a descriptor. Montesinos et al. [ 333 ] extended first-order grayscale

differential invariants to form a color descriptor.

The popularity of SIFT led to various attempts to “colorize” the SIFT descriptor.

An obvious approach is simply to concatenate the SIFT descriptors for a feature point

location computed in three color channels (e.g., RGB or HSV) to create a 3

≥

0 specify the order of themoment, and the integers r , g , b

≥

128 =

384-dimensional descriptor, possibly reducing the dimensionality with PCA. van de

Weijer and Schmid [ 509 ] augmented the standard 128-dimensional SIFT descrip-

tor computed on luminance values with an additional 222 dimensions representing

color measurements, including histograms of weighted hue values and photometric

invariants. More straightforwardly, Abdel-Hakimand Farag [ 1 ] proposed to apply the

usual SIFT detector/descriptor to a color invariant image obtained as the ratio of two

linear functions of RGB. Both Burghouts and Geusebroek [ 77 ] and van de Sande et

al. [ 508 ] recently presented surveys and evaluations of color descriptors, generally

concluding that augmenting the SIFT descriptor with color information improved

repeatability results over using luminance alone.

Even if a grayscale feature detector/descriptor scheme is dictated by the appli-

cation, it may still be possible to improve performance by modifying the input to

the detector. For example, Gooch et al. [ 174 ] proposed an algorithm for processing

a color image into a one-channel image that differs from the traditional luminance

image. Instead, adjacent pixel differences in the new image are optimized to match a

function of color differences in the original image as well as possible. It may also be

possible to apply an algorithm like that of Collins et al. [ 101 ] to adaptively choose the

most discriminating one-dimensional space over the course of object tracking. For

example, the green channel may bemost discriminating for tracking a certain feature

on an actor in a natural environment, but a combination of red and blue channels

may be better as the actor crosses into a green-screen background.

×

4.5

ARTIFICIAL MARKERS

In a controlled environment such as a movie set, it's common to introduce artificial

markers for tracking — that is, patterns designed to be robustly, unambiguously

detected. Then, instead of using a generic feature detector, a customized detector

can be designed for rapidly locating and identifying such features. Themost common

use of artificial markers today is for augmented reality as opposed to visual effects.

Figure 4.22 illustrates several types of artificial markers, discussed inmore detail next.

Two-dimensional bar codes used to encode information are now commonplace,

for example in the context of tracking shipped packages. More recently, QR codes 12

(Figure 4.22 a) have become popular for encoding information into a pattern of black

and white squares, which can be detected and decoded quickly by mobile devices

like cell phones. However, these patterns must fill a large percentage of an image to

12 QR Code is a registered trademark of Denso Wave Incorporated, Japan.

Computer Vision for Visual Effects

Search WWH ::

Custom Search

Home