UTF-8, Unicode Standardized forms, bom_ application tips

Source: Internet
Author: User
Tags dreamweaver
The confusion mentioned in "testing the Dreamweaver to make UTF-8 coded Web pages"
http://www.cnbruce.com/blog/showlog.asp?cat_id=27&log_id=999

"Ah Han" friend of the words to dispel doubts: that is, check "include Unicode signature (BOM)"



For more information, the following is part of the Help document:

to set the document encoding, use the default encoding pop-up menu.

Default encoding specifies the encoding to use when creating a new page, and the encoding to use when opening a document without specifying any encoding. For more information, see Understanding Document encoding.

If you select Unicode (UTF-8) as the document encoding, no entity encoding is required because UTF-8 can safely represent all characters. If you choose another document encoding, you might want to represent some characters in entity encoding. For more information about character entities, see http://www.w3.org/TR/REC-html40/sgml/entities.html.

If you select Unicode (UTF-8) as your default encoding, select a Unicode standardized form.

There are four types of Unicode standardized forms. The most important thing is to standardize form C because it is the most common form used for the character model of the World Wide Web. Macromedia provides three other Unicode standardized forms as supplements.

In Unicode, some characters look similar, but can be stored in a document in different ways. For example, " "(e-tonal) can be expressed as a single character" E-diacritics ", or two characters" normal Latin e "+" combination of notes ". A Unicode combination character is a character that is used in conjunction with the previous character, so the diacritics appear above "Latin e". Both of these forms are displayed as the same printing style, but each form stored in the file is different.

Standardization is the process of ensuring that all characters that can be saved in different forms are saved in the same form. That is, everything in the document "Characters are saved as a single" e-note "or" E "+" combination of notes, "instead of being saved in one document in both of these forms.

For more information about Unicode standardization and specific forms that you can use, see the Unicode Web site, which is http://www.unicode.org/reports/tr15.

If you select Unicode (UTF-8) as the default encoding, you can select the include Unicode signature (BOM) option to include a byte order mark (BOM) in the document.

A BOM is 2 to 4 bytes at the beginning of a text file, identifies the file as Unicode, and also identifies the byte order of the following bytes. Because UTF-8 does not have byte order, you can choose to add UTF-8 BOM. This is required for UTF-16 and UTF-32.

I paid special attention to the two documents with the same code content, and the document bytes using the BOM were 3 bytes more than the unused document bytes

On the Dreamweaver encountered in this problem, said the big, small also not small, ordinary negligence ah ...
Copy Code code as follows:
TXT file default Open tool is Nodepad, save as Utf-8 after the Notepad tool will be forced to add files to the head utf-8 BOM. Will destroy the original contents of the file, if your file content is all ASCII, you do not have to save as Utf-8, because the Utf-8 encoding includes ASCII. Even if you want to save into a utf-8 also want to save into a no BOM format, you can do with UE 11.0

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.