Java Reference
In-Depth Information
Also,ISO/IEC8859-2(alsoknownasLatin-2)definesasimilarcharacterset/encod-
ing covering Central and Eastern European countries.
DespiteISO's/IEC'sbestefforts,aplethoraofcharacter sets/encodingsisstillinad-
equate. For example, most character sets/encodings only allow you to create docu-
mentsinacombinationofEnglishandoneotherlanguage(orasmallnumberofother
languages).Youcannot,forexample,useanISO/IECcharacterset/encodingtocre-
ateadocumentusingacombinationofEnglish,French,Turkish,Russian,andGreek
characters.
This andotherproblems arebeingaddressed byaninternational effortthat hascre-
atedandiscontinuingtodevelop Unicode ,asingleuniversalcharacterset.Because
UnicodecharactersaretwiceasbigasISO/IECcharacters,Unicodeusesoneofsev-
eral variable-length encoding schemes known as Unicode Transformation Format
(UTF) to encode Unicode characters for efficiency. For example, UTF-8 encodes
every character in the Unicode character set in one to four bytes (and is backward
compatible with ASCII).
Theterms character set and character encoding areoftenusedinterchangeably.They
meanthesamethinginthecontextofISO/IECcharactersets,whereacodepointis
the encoding. However, these terms are different in the context of Unicode, where
UnicodeisthecharactersetandUTF-8isoneofseveralpossiblecharacterencodings
for Unicode characters.
Writer and Reader Classes Overview
The java.io packageprovidesseveralwriterandreaderclassesthataredescendents
oftheabstract Writer and Reader classes. Figure8-7 revealsthehierarchyofwriter
classes.
Search WWH ::




Custom Search