What Character Encodings are available in DeltaWalker?

DeltaWalker supports the same wide-range of character encodings supported by the Java 2 Platform, Standard Edition, v. 1.4.2 or newer, as listed in the tables below.

The first column in both tables contains the names of the encodings, a.k.a. canonical names, that DeltaWalker uses. When an encoding has an alternative name, it is listed in the second column of the tables. Although the alternative names are being gradually phased out, they are listed here as many people may be more familiar with them than with the newer, canonical ones. The third column gives a brief description of each encoding.

Basic Encodings

Name Alternative Name Description
US-ASCII ASCII American Standard Code for Information Interchange
windows-1250 Cp1250 Windows Eastern European
windows-1251 Cp1251 Windows Cyrillic
windows-1252 Cp1252 Windows Latin-1
windows-1253 Cp1253 Windows Greek
windows-1254 Cp1254 Windows Turkish
windows-1257 Cp1257 Windows Baltic
ISO-8859-1 ISO8859_1 ISO 8859-1, Latin Alphabet No. 1
ISO-8859-2 ISO8859_2 Latin Alphabet No. 2
ISO-8859-4 ISO8859_4 Latin Alphabet No. 4
ISO-8859-5 ISO8859_5 Latin/Cyrillic Alphabet
ISO-8859-7 ISO8859_7 Latin/Greek Alphabet
ISO-8859-9 ISO8859_9 Latin Alphabet No. 5
ISO-8859-13 ISO8859_13 Latin Alphabet No. 7
ISO-8859-15 ISO8859_15 Latin Alphabet No. 9
KOI8-R KOI8_R KOI8-R, Russian
UTF-8 UTF8 Eight-bit UCS Transformation Format
UTF-16   Sixteen-bit UCS Transformation Format, byte order identified by an optional byte-order mark
UTF-16BE UnicodeBigUnmarked Sixteen-bit Unicode Transformation Format, big-endian byte order
UTF-16LE UnicodeLittleUnmarked Sixteen-bit Unicode Transformation Format, little-endian byte order

 

Extended Encodings

Name

Alternative Name Description
windows-1255 Cp1255 Windows Hebrew
windows-1256 Cp1256 Windows Arabic
windows-1258 Cp1258 Windows Vietnamese
ISO-8859-3 ISO8859_3 Latin Alphabet No. 3
ISO-8859-6 ISO8859_6 Latin/Arabic Alphabet
ISO-8859-8 ISO8859_8 Latin/Hebrew Alphabet
windows-31j MS932 Windows Japanese
EUC-JP EUC_JP JISX 0201, 0208 and 0212, EUC encoding Japanese
x-EUC-JP-LINUX EUC_JP_LINUX JISX 0201, 0208 , EUC encoding Japanese
Shift_JIS SJIS Shift-JIS, Japanese
ISO-2022-JP ISO2022JP JIS X 0201, 0208, in ISO 2022 form, Japanese
x-mswin-936 MS936 Windows Simplified Chinese
GB18030   Simplified Chinese, PRC standard
x-EUC-CN EUC_CN GB2312, EUC encoding, Simplified Chinese
GBK   GBK, Simplified Chinese
ISCII91 ISCII91 ISCII91 encoding of Indic scripts
x-windows-949 MS949 Windows Korean
EUC-KR EUC_KR KS C 5601, EUC encoding, Korean
ISO-2022-KR ISO2022KR ISO 2022 KR, Korean
x-windows-950 MS950 Windows Traditional Chinese
x-MS950-HKSCS MS950_HKSCS Windows Traditional Chinese with Hong Kong extensions
x-EUC-TW EUC_TW CNS11643 (Plane 1-3), EUC encoding, Traditional Chinese
Big5   Big5, Traditional Chinese
Big5-HKSCS Big5_HKSCS Big5 with Hong Kong extensions, Traditional Chinese
TIS-620 TIS620 TIS620, Thai

 

Related topics