Sometimes a process runs under a locale that supports only ASCII (such as LANG=C). In that case Python sets the encoding of standard output and standard error to ASCII, so printing Chinese text fails with a UnicodeEncodeError.
One solution is to set a locale that supports UTF-8, but it must be done before the Python process starts: the stream objects are initialized at startup, and setting the locale afterwards does not reinitialize them.
Another approach is to write bytes directly to sys.stdout.buffer. In principle that works fine, but writing a whole program that way is very tedious...
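To see why this gets tedious, here is a small sketch of the bytes-writing approach; the helper name `write_utf8` and its `stream` parameter are my own invention for illustration:

```python
import sys

def write_utf8(text, stream=None):
    """Encode text by hand and write raw bytes to a binary stream
    (sys.stdout.buffer by default), bypassing the text layer."""
    buf = sys.stdout.buffer if stream is None else stream
    buf.write(text.encode('utf-8') + b'\n')
    buf.flush()
```

Calling `write_utf8('你好')` then prints correctly regardless of the locale, but every single print in the program has to go through such a wrapper.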
So I went looking for a graceful way to get a new sys.stdout. Python 3 I/O no longer goes through the C standard library's stdio functions; it uses the interfaces provided by the OS directly. This is packaged in the io module, layered into raw (binary), buffered, and text I/O.
After studying the documentation: sys.stdout is an io.TextIOWrapper, with a buffer attribute holding an io.BufferedWriter. We can use that to build a new io.TextIOWrapper, specifying the encoding as UTF-8:
import sys
import io

def setup_io():
    sys.stdout = sys.__stdout__ = io.TextIOWrapper(
        sys.stdout.detach(), encoding='utf-8', line_buffering=True)
    sys.stderr = sys.__stderr__ = io.TextIOWrapper(
        sys.stderr.detach(), encoding='utf-8', line_buffering=True)
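A minimal sketch of the same rewrapping, using an in-memory io.BytesIO as a stand-in for the real underlying stdout buffer, so the effect can be observed without touching the process's actual streams:

```python
import io

# Stand-in for the binary stream underneath sys.stdout.
raw = io.BytesIO()

# Wrap it in a text layer with an explicit encoding, just as
# setup_io() rewraps the detached stdout buffer.
wrapped = io.TextIOWrapper(raw, encoding='utf-8', line_buffering=True)

# line_buffering=True flushes to the binary layer on each newline.
wrapped.write('汉字\n')
assert raw.getvalue() == '汉字\n'.encode('utf-8')
```

Note that detach() is what makes the real version safe: it separates the buffer from the old TextIOWrapper so the old wrapper cannot later flush stale state into the stream.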
Besides the encoding, you can also set the error handler and the buffering here. So this technique can likewise be used to tolerate encoding errors, or to change the buffering of standard output (no need to pass -u at startup).
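For instance, the errors parameter of io.TextIOWrapper can make the stream degrade gracefully instead of raising; a sketch with an in-memory buffer standing in for an ASCII-only terminal:

```python
import io

raw = io.BytesIO()

# errors='replace' substitutes unencodable characters with '?'
# instead of raising UnicodeEncodeError.
tolerant = io.TextIOWrapper(raw, encoding='ascii', errors='replace',
                            line_buffering=True)
tolerant.write('汉\n')
assert raw.getvalue() == b'?\n'
```

The same constructor call inside setup_io() would make the real stdout tolerant of any character the terminal encoding cannot represent.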
In fact, even this is not thorough enough. Python uses the default encoding in many other places. For example, when subprocess is given universal_newlines=True, Python automatically decodes the child's standard input, output, and error, but before Python 3.6 the encoding used there could not be specified manually. The same goes for the command-line arguments: their encoding cannot be specified either (though you can pass bytes instead).
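Since Python 3.6, subprocess accepts an explicit encoding parameter, so the decoding no longer depends on the locale default; a sketch (the child here writes raw UTF-8 bytes so its own locale doesn't matter):

```python
import subprocess
import sys

# encoding='utf-8' (added in 3.6) tells subprocess how to decode the
# child's output, instead of falling back to the locale encoding.
result = subprocess.run(
    [sys.executable, '-c',
     "import sys; sys.stdout.buffer.write('你好'.encode('utf-8'))"],
    stdout=subprocess.PIPE, encoding='utf-8')
assert result.stdout == '你好'
```

On earlier versions, the workaround is to skip universal_newlines, receive bytes, and call .decode('utf-8') yourself.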