Coded thinking Map:
That is, the file is not encoded, in fact, in the binary format is saved in the hard disk. However, it is necessary to use the appropriate encoding to process the write read so that the operating system fits the relevant software/font and is drawn to the screen for viewing. So the key question is to know what encoding was used to compile the string data originally! However, before Unicode is used in the form of similar comparison tables to organize the encoding, can not be unified from the serial data stream itself to solve different text
Words come.
Unicode is the encoding method, the universal code, the detailed realization way: Utf-8 utf-16
For I/O output be sure to do a similar operation like encode (' Utf-8 ')
Just Unicode in Python is an intrinsic function.
ust r i ng = Uni code (ST R eam, ' GBK ') This means First convert to Unicode, then convert to the encoding you want
Python Practical coding problem: Chinese! The pain of Eternity