Unicode encodings

  • UCS-2: direct encoding of Unicode, chars from BMP are directly represented as their ordinal numbers

  • UCS-4: dtto, but for whole Unicode at 4 bytes -- not efficient, 4 bytes even for US-ASCII, EU-langs...

UTF encodings are the most important for XML, particularly UTF-8 (but parsers must know both).