Handling Chinese in C++ and the problem of garbled Chinese characters (wchar_t)

Last Update:2014-11-08 Source: Internet

Author: User

Tags locale

Handling Chinese characters in C++

As programming has developed in China, programming languages and development environments have begun to support Chinese, but that support is generally flawed and varies from one compilation environment to another. As a result, handling Chinese in C++ today raises many problems, and many compiler versions support it only partially. Take Dev-C++ and VS2005, for example: they differ considerably in how well they support the code described in MSDN and in online articles.

What I am going to talk about here is how to use Chinese in C++.

First, Chinese text is outside the range of an ordinary char, so a single char cannot store a Chinese character; instead we usually use the wide character type wchar_t. In the compilation environments I have used, wchar_t is commonly defined with the same size as unsigned short, through the internal definition typedef unsigned short wchar_t; in other words, it is only an alias for an existing unsigned integer type.

Many facilities are unavailable in Dev-C++: VS2005 lets us use many methods and library functions that Dev-C++ does not provide. The input and output objects mentioned in MSDN and in many online materials, such as wcin and wcout, are reported as undefined in Dev-C++, which means that Dev-C++ does not support them. Simple wide-character input and output looks like this:

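#include <iostream>

using namespace std;

int main()
{
    wchar_t a[3];          // room for only two wide characters plus the terminator
    wcin >> a;             // reads one whitespace-delimited word into the buffer
    wcout << a << endl;
    return 0;
}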

But this can accept only a single Chinese character; entering two or more causes an overflow error. Moreover, although we used wchar_t, it does not serve our purpose: one Chinese character still occupies two wchar_t units, and we have no way to manipulate the individual characters inside the string, so this approach is not feasible. However, this is the C way of using it; in C++, wchar_t has been changed so that Chinese is better supported.

In C++, wchar_t is a built-in data type of the language, and its length is determined by the implementation. Now we can properly begin to discuss how Chinese is supported and used in C++.

C++ is a good language. To accommodate development for different locales, it provides a header file called locale, which defines names for different languages and regions. This is an important part of how we use wchar_t to handle Chinese, and it has a significant impact on our input and output.

The first example of our application is this:

#include <iostream>
#include <locale>

using namespace std;

int main()
{
    locale loc("chinese-simplified");
    wcin.imbue(loc);
    wcout.imbue(loc);
    // The three lines above have the same effect as setlocale(LC_ALL, "chs").

    wchar_t c[4];          // room for three wide characters plus the terminator
    wcin >> c;
    wcout << c << endl;
    return 0;
}

Here I use #include <locale> to include the header file and then create a locale object whose argument simply names Chinese. We then call imbue on our input and output streams to change the locale they use, so that input and output are stored in the Chinese character units we need. In the example above, c[4] can hold 3 Chinese characters, with the last element being the terminator '\0', so this initially achieves the desired effect. We can also access each Chinese character by its position and process it just as we would char data.

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only.
