Bit |
Description |
0 |
Basic Latin |
1 |
Latin-1 Supplement |
2 |
Latin Extended-A |
3 |
Latin Extended-B |
4 |
IPA Extensions |
5 |
Spacing Modifier Letters |
6 |
Combining Diacritical Marks |
7 |
Basic Greek |
8 |
Greek Symbols and Coptic |
9 |
Cyrillic |
10 |
Armenian |
11 |
Basic Hebrew |
12 |
Hebrew Extended |
13 |
Basic Arabic |
14 |
Arabic Extended |
15 |
Devanagari |
16 |
Bengali |
17 |
Gurmukhi |
18 |
Gujarati |
19 |
Oriya |
20 |
Tamil |
21 |
Telugu |
22 |
Kannada |
23 |
Malayalam |
24 |
Thai |
25 |
Lao |
26 |
Basic Georgian |
27 |
Georgian Extended |
28 |
Hangul Jamo |
29 |
Latin Extended Additional |
30 |
Greek Extended |
31 |
General Punctuation |
32 |
Subscripts and Superscripts |
33 |
Currency Symbols |
34 |
Combining Diacritical Marks for Symbols |
35 |
Letter-like Symbols |
36 |
Number Forms |
37 |
Arrows |
38 |
Mathematical Operators |
39 |
Miscellaneous Technical |
40 |
Control Pictures |
41 |
Optical Character Recognition |
42 |
Enclosed Alphanumerics |
43 |
Box Drawing |
44 |
Block Elements |
45 |
Geometric Shapes |
46 |
Miscellaneous Symbols |
47 |
Dingbats |
48 |
Chinese, Japanese, and Korean (CJK) Symbols and Punctuation |
49 |
Hiragana |
50 |
Katakana |
51 |
Bopomofo |
52 |
Hangul Compatibility Jamo |
53 |
CJK Miscellaneous |
54 |
Enclosed CJK |
55 |
CJK Compatibility |
56 |
Hangul |
57 |
Reserved for Unicode Subranges |
58 |
Reserved for Unicode Subranges |
59 |
CJK Unified Ideographs |
60 |
Private Use Area |
61 |
CJK Compatibility Ideographs |
62 |
Alphabetic Presentation Forms |
63 |
Arabic Presentation Forms-A |
64 |
Combining Half Marks |
65 |
CJK Compatibility Forms |
66 |
Small Form Variants |
67 |
Arabic Presentation Forms-B |
68 |
Halfwidth and Fullwidth Forms |
69 |
Specials |
70-127 |
Reserved for Unicode Subranges |