Unicode/ISO-10646 Alphabets
Groups marked with * have not yet been finalized.
- 0020 - 007E
- Basic Latin (graphic part of US-ASCII)
- 00A0 - 00FF
- Latin-1 Supplement (right half of ISO 8859-1)
- 0100 - 017F
- Latin Extended-A
- 0180 - 024F
- Latin Extended-B (extended in 1.1)
- 0250 - 02AF
- IPA Extensions
- 02B0 - 02FF
- Spacing Modifier Letters
- 0300 - 036F
- Combining Diacritical marks
- 0370 - 03CF
- Basic Greek (based on ISO 8859-7)
- 03D0 - 03FF
- Greek Symbols and Coptic
- 0400 - 04FF
- Cyrillic (based on ISO 8859-5, extended in 1.1)
- 0500 - 052F
- unassigned
- 0530 - 058F
- Armenian
- 0590 - 05CF
- Hebrew Extended-A
- 05D0 - 05EA
- Basic Hebrew (based on ISO 8859-8)
- 05EB - 05FF
- Hebrew Extended-B
- 0600 - 0652
- Basic Arabic (based on ISO 8859-6)
- 0653 - 06FF
- Arabic Extended (extended in 1.1)
- 0700 - 08FF
- * Ethiopic (?)
- 0900 - 097F
- Devanagari (based on ISCII 1988)
- 0980 - 09FF
- Bengali (based on ISCII 1988)
- 0A00 - 0A7F
- Gurmukhi (based on ISCII 1988)
- 0A80 - 0AFF
- Gujarati (based on ISCII 1988)
- 0B00 - 0B7F
- Oriya (based on ISCII 1988)
- 0B80 - 0BFF
- Tamil (based on ISCII 1988)
- 0C00 - 0C7F
- Telugu (based on ISCII 1988)
- 0C80 - 0CFF
- Kannada (based on ISCII 1988)
- 0D00 - 0D7F
- Malayalam (based on ISCII 1988)
- 0D80 - 0DFF
- * Sinhala
- 0E00 - 0E7F
- Thai (based on TIS 620-2533:1990)
- 0E80 - 0EFF
- Lao (based on TIS 620-2533:1990)
- 0F00 - 0F7F
- * Burmese
- 0F80 - 0FDF
- * Khmer
- 1000 - 105F
- * Tibetan
- 1060 - 109F
- * Mongolian
- 10A0 - 10CF
- Georgian Extended
- 10D0 - 10FF
- Basic Georgian
- 1100 - 11FF
- Hangul Jamo (added in 1.1)
- 1200 - 125F
- * Ethiopian (?)
- 1E00 - 1EFF
- Latin Extended Additional (added in 1.1)
- 1F00 - 1FFF
- Greek Extended (added in 1.1)
Part of Notes on CJK Character Codes and Encodings.