I searched for the Unicode range of Chinese characters on the Internet and generally gave "U + 4e00 .. u + 9fa5 ". But in fact, this range is incomplete, and even the basic fullwidth (Chinese) punctuation is not included. Based on the latest Unicode 5.0 version, sort it out as follows:
Note: In most application scenarios, we can use only (1), (2), (3), (4), (5) sets as the basis for CJK judgment.
1) Standard CJK text
Http://www.unicode.org/Public/UNIDATA/Unihan.html
Code Point Range |
Block name |
Release |
U + 3400 .. u + 4db5 |
CJK uniied ideographs extension |
3.0 |
U + 4e00 .. u + 9fa5 |
CJK uniied ideographs |
1.1 |
U + 9fa6 .. u + 9fbb |
CJK uniied ideographs |
4.1 |
U + f900.. U + fa2d |
CJK compatibility ideographs |
1.1 |
U + fa30.. U + fa6a |
CJK compatibility ideographs |
3.2 |
U + fa70.. U + fad9 |
CJK compatibility ideographs |
4.1 |
U + 20000 .. u + 2a6d6 |
CJK uniied ideographs Extension B |
3.1 |
U + 2f800 .. u + 2fa1d |
CJK compatibility supplement |
3.1 |
2) ASCII, all Chinese Punctuation, half width Katakana, half width hirakana, half width Korean letters: FF00-FFEF
Http://www.unicode.org/charts/PDF/UFF00.pdf
3) CJK radical supplement: 2e80-2eff
Http://www.unicode.org/charts/PDF/U2E80.pdf
4) CJK punctuation: 3000-303f
Http://www.unicode.org/charts/PDF/U3000.pdf
5) CJK strokes: 31c0-31ef
Http://www.unicode.org/charts/PDF/U31C0.pdf
6) Kangxi: 2f00-2fdf
Http://www.unicode.org/charts/PDF/U2F00.pdf
7) Chinese character structure description character: 2ff0-2fff
Http://www.unicode.org/charts/PDF/U2FF0.pdf
8) phonetic symbol: 3100-312f
Http://www.unicode.org/charts/PDF/U3100.pdf
9) phonetic symbols (extensions of South Fujian and Hakka): 31a0-31bf
Http://www.unicode.org/charts/PDF/U31A0.pdf
10) Japanese hirakana: 3040-309f
Http://www.unicode.org/charts/PDF/U3040.pdf
11) Japanese Katakana: 30a0-30ff
Http://www.unicode.org/charts/PDF/U30A0.pdf
12) Japanese Katakana pinyin Extension: 31f0-31ff
Http://www.unicode.org/charts/PDF/U31F0.pdf
13) Korean pinyin: AC00-D7AF
Http://www.unicode.org/charts/PDF/UAC00.pdf
14) Korean letters: 1100-11ff
Http://www.unicode.org/charts/PDF/U1100.pdf
15) compatible letters in Korean: 3130-318f
Http://www.unicode.org/charts/PDF/U3130.pdf
16) taixuan Sutra Symbol: 1d300-1d35f
Http://www.unicode.org/charts/PDF/U1D300.pdf
17) Yijing sixty-14 pictures: 4dc0-4dff
Http://www.unicode.org/charts/PDF/U4DC0.pdf
18) Yi-wen syllable: A000-A48F
Http://www.unicode.org/charts/PDF/UA000.pdf
19) Yi document first: A490-A4CF
Http://www.unicode.org/charts/PDF/UA490.pdf
20) Braille: 2800-28ff
Http://www.unicode.org/charts/PDF/U2800.pdf
21) CJK letter and month: 3200-32ff
Http://www.unicode.org/charts/PDF/U3200.pdf
22) CJK special symbols (date merging): 3300-33ff
Http://www.unicode.org/charts/PDF/U3300.pdf
23) decorative symbols (not for CJK purposes): 2700-27bf
Http://www.unicode.org/charts/PDF/U2700.pdf
24) Miscellaneous symbols (not for CJK purposes): 2600-26ff
Http://www.unicode.org/charts/PDF/U2600.pdf
English vertical punctuation: FE10-FE1F
Http://www.unicode.org/charts/PDF/UFE10.pdf
26) CJK compatible symbols (vertical variants, underscores, comma): FE30-FE4F
Http://www.unicode.org/charts/PDF/UFE30.pdf