Java Reference
In-Depth Information
assume that a byte (which is a machine-specific collection of bits, usually eight bits on mod-
ern computers) is exactly the same thing as a “character”—a letter, digit, or other linguistic
element. However, Java is designed to be used internationally, and eight bits is simply not
enough to handle the many different character sets used around the world. Script-based lan-
guages like Arabic and Indian languages, and pictographic languages like Chinese and
Japanese, each have many more than 256 characters, the maximum that can be represented in
an eight-bit byte. The unification of these many character code sets is called, not surprisingly,
Unicode. Both Java and XML use Unicode as their character sets, allowing you to read and
write text in any of these human languages. But you should use Reader s and Writer s, not
Stream s, for textual data.
Search WWH ::




Custom Search