What is a sitemap?
A sitemap, in the sense introduced by Google, is a map of a website. It is different from the site map page we usually mean by that name: it is written in XML and must strictly follow Google's standard, the Sitemaps protocol.
What are the benefits of submitting a sitemap?
Submitting a sitemap helps search engines find your pages, especially dynamic web pages. When Google visits a website, it first checks whether a sitemap exists. If one exists, Google reads it first. If not, Googlebot crawls the website's pages one by one. A sitemap is simply a list of the pages on your website.
XML Sitemaps format
The Sitemaps protocol format is made up of XML tags. All data values in a Sitemap must be entity-escaped, and the file itself must be UTF-8 encoded.
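As a rough illustration of the entity-escaping requirement, the sketch below escapes a URL with Python's standard `xml.sax.saxutils.escape`. The helper name `escape_sitemap_value` and the example.com URL are assumptions made up for this example, not part of the protocol.

```python
from xml.sax.saxutils import escape

# escape() replaces &, < and > by default; the two quote characters
# are added explicitly so all five XML special characters are covered.
EXTRA_ENTITIES = {"'": "&apos;", '"': "&quot;"}


def escape_sitemap_value(value: str) -> str:
    """Return value with &, <, >, ' and " replaced by XML entities."""
    return escape(value, EXTRA_ENTITIES)


# Hypothetical URL with a query string, used only for illustration.
raw_url = "http://www.example.com/view.php?id=71&lang=en"
escaped_url = escape_sitemap_value(raw_url)
print(escaped_url)  # http://www.example.com/view.php?id=71&amp;lang=en
```

If you build the file with an XML library instead of string concatenation, the library normally does this escaping for you.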
The following Sitemap example contains two <url> entries and uses all the optional tags.
<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url>
    <loc>http://www.neesoo.cn/archives/71.html</loc>
    <lastmod>2009-03-07</lastmod>
    <changefreq>yearly</changefreq>
    <priority>0.2</priority>
  </url>
  <url>
    <loc>http://www.neesoo.cn/archives/71.html</loc>
    <lastmod>2009-03-06</lastmod>
    <changefreq>yearly</changefreq>
    <priority>0.2</priority>
  </url>
</urlset>
Note: the XML declaration and the enclosing <urlset> tags stay the same in every Sitemap. Each <url> block in the middle is a module, that is, one entry to be submitted to the search engine.
Each included page is an independent module, so there are as many modules as there are URLs.
A Sitemap must start with an opening <urlset> tag and end with a closing </urlset> tag.
Each URL gets its own <url> entry as the parent tag.
Each <url> parent tag must contain a <loc> child tag.
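The three structural rules above can be sketched with Python's standard `xml.etree.ElementTree`. This is only a minimal illustration: the function name `build_sitemap` is invented for this example, and the namespace used is the one from the sitemaps.org 0.9 schema.

```python
import xml.etree.ElementTree as ET

# Namespace of the sitemaps.org 0.9 schema.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Build a <urlset> element with one <url>/<loc> pair per URL."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for address in urls:
        url = ET.SubElement(urlset, "url")  # parent tag for one entry
        loc = ET.SubElement(url, "loc")     # required child tag
        loc.text = address                  # escaped on serialization
    return urlset


urlset = build_sitemap(["http://www.neesoo.cn/archives/71.html"])
xml_text = ET.tostring(urlset, encoding="unicode")
print(xml_text)
```

Adding more URLs to the list produces one more <url> module per URL, and the optional tags can be appended to each <url> element in the same way as <loc>.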
Next I will explain the meaning of each tag in turn.
<loc>
Required. The address of the page goes between the opening and closing tags, for example http://www.neesoo.cn/archives/71.html. The URL must begin with the protocol (for example, http) and end with a trailing slash if your web server requires one. The value must be less than 2,048 characters long.
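A quick way to catch bad <loc> values before submitting is a small check like the one below. It is a sketch under assumptions: the function name `is_valid_loc` is made up here, and accepting only http and https is a simplification chosen for this example.

```python
from urllib.parse import urlsplit

MAX_LOC_LENGTH = 2048  # the value must be shorter than this


def is_valid_loc(address: str) -> bool:
    """Check the protocol, host and length rules for a <loc> value."""
    parts = urlsplit(address)
    # Assumption for this sketch: only http and https are accepted.
    has_protocol = parts.scheme in ("http", "https")
    has_host = bool(parts.netloc)
    return has_protocol and has_host and len(address) < MAX_LOC_LENGTH


print(is_valid_loc("http://www.neesoo.cn/archives/71.html"))  # True
print(is_valid_loc("www.neesoo.cn/archives/71.html"))         # False
```

The second call fails because the address has no protocol in front of the host name.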
<lastmod>
Optional. The date the file was last modified. This date should be in W3C Datetime format. The time part can be omitted if you wish, leaving only YYYY-MM-DD. Example: 2009-03-06.
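Both forms of the date can be produced with Python's standard `datetime` module, whose `isoformat()` output matches the W3C Datetime layout for these cases. The time of day below is an arbitrary value chosen for illustration.

```python
from datetime import date, datetime, timezone

# Date only: YYYY-MM-DD
day = date(2009, 3, 6)
lastmod_date = day.isoformat()
print(lastmod_date)  # 2009-03-06

# Date plus time with a time zone designator (time of day is illustrative)
moment = datetime(2009, 3, 6, 14, 30, 0, tzinfo=timezone.utc)
lastmod_full = moment.isoformat()
print(lastmod_full)  # 2009-03-06T14:30:00+00:00
```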
<changefreq>
Optional. How frequently the page is likely to change. This value gives search engines a general hint and may not match exactly how often they actually crawl the page. Valid values:
always
hourly
daily
weekly
monthly
yearly
never
The value "always" should be used for documents that change each time they are accessed. The value "never" should be used for archived URLs.
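Because only these seven values are valid, and the XML example writes them in lowercase, it can help to normalize and check a value before putting it in the file. The helper `normalize_changefreq` below is a hypothetical name used only for this sketch.

```python
VALID_CHANGEFREQ = (
    "always", "hourly", "daily", "weekly", "monthly", "yearly", "never",
)


def normalize_changefreq(value: str) -> str:
    """Return the lowercase form of value, or raise if it is not allowed."""
    cleaned = value.strip().lower()
    if cleaned not in VALID_CHANGEFREQ:
        raise ValueError(f"invalid changefreq: {value!r}")
    return cleaned


print(normalize_changefreq("Yearly"))  # yearly
```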
<priority>
Optional. The priority of this URL relative to the other URLs on your website. Valid values range from 0.0 to 1.0. This value does not affect how your pages compare with pages on other websites. It only tells search engines which of your pages you consider most important, so that they can order the crawling of your pages the way you prefer.
The default priority of a webpage is 0.5.
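The range and the default can be captured in a few lines. This is a sketch only: the name `format_priority` and the choice to print one decimal place are assumptions for the example, not requirements stated in the text.

```python
DEFAULT_PRIORITY = 0.5  # used when no priority is given


def format_priority(value=None) -> str:
    """Return a priority between 0.0 and 1.0 as text for <priority>."""
    if value is None:
        value = DEFAULT_PRIORITY
    if not 0.0 <= value <= 1.0:
        raise ValueError(f"priority out of range: {value}")
    # One decimal place is a formatting choice made for this sketch.
    return f"{value:.1f}"


print(format_priority(0.2))  # 0.2
print(format_priority())     # 0.5
```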
Once we understand what these tags do, we can create a sitemap.xml that suits our own website.
Note:
The XML file must be UTF-8 encoded. You can open the XML file in Notepad and choose UTF-8 as the encoding when saving it.
Because building sitemap.xml by hand is a lot of work, you can use third-party software. Here we recommend the tiger sitemap generator. Google's webmaster tools also offer a sitemap generator. However, it has to be installed on the server, which requires server management permissions (for example, through IIS), so it is not very convenient.
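If you would rather not depend on Notepad or a separate generator, a short script can also write the file with the right encoding. The sketch below uses Python's standard `xml.etree.ElementTree`; writing to the system temp directory is just a choice for this example, and in practice you would write to your website's root directory.

```python
import tempfile
import xml.etree.ElementTree as ET
from pathlib import Path

# Build a one-entry Sitemap (namespace from the sitemaps.org 0.9 schema).
urlset = ET.Element(
    "urlset", xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"
)
url = ET.SubElement(urlset, "url")
ET.SubElement(url, "loc").text = "http://www.neesoo.cn/archives/71.html"

# Example output location; replace with your web root in practice.
out_path = Path(tempfile.gettempdir()) / "sitemap.xml"

# encoding="UTF-8" makes the file bytes UTF-8, and xml_declaration=True
# adds the <?xml ...?> line that names the encoding.
ET.ElementTree(urlset).write(out_path, encoding="UTF-8", xml_declaration=True)

written = out_path.read_text(encoding="utf-8")
print(written.splitlines()[0])
```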