Color - Computer Graphics: Principles and Practice

Graphics Reference

In-Depth Information

the first black-and-white television signal!), you might argue that if a human can

distinguish, say, 100 levels of intensity, then we should use 100 different numbers

to represent these. It would be silly to use 200 different numbers, because we'd

have different numbers representing different, but indistinguishable, intensities. If

we were representing values in binary, we'd be wasting a bit by going from 100

values to 200 values. Similarly, if we encoded only 50 different intensity levels,

we'd get nonsmooth intensity gradients in our display.

If you simply take all possible intensity values and divide them equally (i.e.,

you quantize the intensity signal), you'd find that to capture perceptual differ-

ences that were significant at the low-intensity levels you'd need to use very small

buckets. But those same buckets would be redundant at high-intensity levels. In

fact, you would be far better off encoding the number L ∗ , because each quantized

range of L ∗ values would correspond to the same amount of perceptual varia-

tion. By choosing the bucket size correctly, you could most efficiently encode the

brightness.

To recover the intensity at the receiving end of the channel, you would invert

the formula for L ∗ (roughly, you'd take the third power of L ∗ , and multiply by

the constant Y n ) and arrange for your television screen to emit the corresponding

intensity.

As it happens, the cathode ray tubes (CRTs) that were used in early televisions

have an interesting characteristic: The intensity emitted is proportional to the 2

power of an applied voltage. Since 2 is fairly close to three, this meant that you

could take the L ∗ value and use it as a voltage to determine the color of each pixel,

approximately.

To be clear: The visual system's response to intensity is nonlinear and looks

approximately like I 1 / 3 ; the CRT's output intensity in response to applied voltage

is also nonlinear and looks like I = kV 5 / 2 . Combining these two results in a nearly

linear overall effect (a 6 power law).

In fact, video engineers defined a “signal representative of luminance” (which

has later, in some video literature, been incorrectly called “luminance”); this signal

approximately encodes the 0.42 power of luminance. Why use 0.42 instead of

0.33? One answer is that if you used 0.4 instead, then the 2 power law of the CRT

would cancel it exactly: This allows you to simplify the electronics in a consumer

television, and at the cost of only a minor inefficiency in the encoding of the

signal. The use of 0.42 instead of 0.4 has been explained by the observation that

the viewing circumstances for television (much less bright than outdoors) are not

the same as the circumstances under which the signal was captured (often bright

lights or outdoors in daylight); the slight adjustment is meant to help compensate

for this.

You can experience the distinction between high-light and low-light percep-

tion of intensity by considering a garden at midday on a slightly overcast day (so

that the lighting is reasonably diffuse), and the same garden just after sunset on

that day. Only the light levels change. Because our perception of “lightness” is

supposed to be approximately logarithmic, the difference in lightness between the

leaves of a plant and its flower should be the same at midday and at twilight. In

practice, they are not, appearing to be lower-contrast at twilight, and we must do

some adjusting to compensate.

Figure 28.22: Surrounding con-

text can vary our perception of

tones.

To experience this effect directly, we can use the area surrounding some gray

values as a proxy for the ambient illumination. Figure 28.22 show three gray

squares surrounded by white and black borders. The gray squares in each column

(Figure

concept

from

Poynton [Poyb].)

Computer Graphics: Principles and Practice

Search WWH ::

Custom Search

Home