Java to determine whether it is a Chinese character)

Last Update:2018-12-06 Source: Internet

Author: User

Developer on Alibaba Coud: Build your first app with APIs, SDKs, and tutorials on the Alibaba Cloud. Read more ＞

Document directory

Java code

Public Boolean VD (string Str ){
Char [] chars = Str. tochararray ();
Boolean isgb2312 = false;
For (INT I = 0; I <chars. length; I ++ ){
Byte [] bytes = ("" + chars [I]). getbytes ();
If (bytes. Length = 2 ){
Int [] ints = new int [2];
Ints [0] = bytes [0] & 0xff;
Ints [1] = bytes [1] & 0xff;
If (ints [0]> = 0x81 & ints [0] <= 0xfe & ints [1]> = 0x40 & ints [1] <= 0xfe) {
Isgb2312 = true;
Break;
}
}
}
Return isgb2312;
}

First, import java. util. RegEx. Pattern and Java. util. RegEx. matcher.
The two packages are followed by the code

Java code

Public Boolean isnumeric (string Str)
{
Pattern pattern = pattern. Compile ("[0-9] *");
Matcher isnum = pattern. matcher (STR );
If (! Isnum. Matches ()){
Return false;
}
Return true;
}
Java. Lang. character. isdigit (CH [0])

----------------- Another type ----------------- Java code

Public static void main (string [] ARGs ){
Int COUNT = 0;
String RegEx = "[\ u4e00-\ u9fa5]";
// System. Out. println (RegEx );
String STR = "Chinese fdas ";
// System. Out. println (STR );
Pattern P = pattern. Compile (RegEx );
Matcher M = P. matcher (STR );
While (M. Find ()){
For (INT I = 0; I <= M. groupcount (); I ++ ){
Count = count + 1;
}
}
System. Out. println ("Total" + Count + "count ");
}

-------------------------------------------------------------------

Method for Determining whether a Java string contains Chinese Characters

Java uses Unicode-encoded char variables in the range of 0-65535 unsigned values, which can represent 65536 characters. Basically, all characters on the earth can be included, in reality, we want to determine whether a character is a Chinese character or whether a character in a string contains a Chinese character to meet business needs, the string class has such a method to get its character length (). For example, the Java code

String S1 = "I am a Chinese ";
String S2 = "imchinese ";
String S3 = "Im Chinese ";
System. Out. println (S1 + ":" + new string (S1). Length ());
System. Out. println (s2 + ":" + new string (S2). Length ());
System. Out. println (S3 + ":" + new string (S3). Length ());

Output:
I am a Chinese: 5
Imchinese: 9
Im Chinese: 5
As you can see, if the string contains double-byte characters, Java will encode each character in double-byte format. If it is a single-byte character, it will be encoded in single-byte format.
So according to the above rules, combined with a QQ nickname? G tea? I Zhuhai elder brother's prompt is resolved by judging whether the string length is the same as the character byte length to determine whether there is a double byte character Java code

System. Out. println (s1.getbytes (). Length = s1.length ())? "S1 has no Chinese characters": "S1 has Chinese characters ");
System. Out. println (s2.getbytes (). Length = s2.length ())? "S2 has no Chinese characters": "S2 has Chinese characters ");
System. Out. println (s3.getbytes (). Length = s3.length ())? "S3 has no Chinese characters": "S3 has Chinese characters ");

Output:
S1 has Chinese Characters
S2 has no Chinese Characters
S3 has Chinese characters //
This way, we can determine whether a string contains double-byte characters. However, it is a bit difficult to accurately determine whether a string contains Chinese characters, we know that many characters in other countries are double-byte in Unicode.
Therefore, we need to further determine how to determine the encoding range of Chinese characters. I used this method, that is, the notepad now outputs the characters between 0 and, we can see that the first Chinese character is '1' and the last one is '?? '(I don't know it now). It's much easier to judge Chinese characters. For example, we can compare the encoding range of characters, finally, I will give you some results. The Chinese characters are basically concentrated in the range of [20901,], with a total of Chinese characters (if it's a little less, it's just how much you know)

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More

Java to determine whether it is a Chinese character)

Contact Us

What's Trending

Top 10 Tags

Top 10 Keywords

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support

Java to determine whether it is a Chinese character)

Contact Us

What's Trending

Top 10 Tags

Top 10 Keywords

Trending Topic

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support