Absrtact: Along with the search engine technology unceasing promotion, crawls the website content deeper and deeper, in addition the CMS system's vigorous promotion, the website structure more conforms to the search engine's request, in short search engine crawls content to become easier, the general situation
With the search engine technology continues to improve, crawl the depth of the site content more and more deeply, in addition to the CMS system to promote the structure of the site more in line with the requirements of search engines, in short, search engine crawling content becomes easier, in general, even if not to the search engine submitted site map, the same , so sitemap map to search engine crawling still help?
I personally think that Sitemap map to search engine crawling content is still helpful, especially for large stations and new sites to crawl content to help very big, here is worth noting, because the site size, structure, type is different, then we make Sitemap map will also be different, such as information-type web site , because the content is huge, it is suggested that the map of txt format may be better. Several matters to be noted are as follows:
1, under normal circumstances, we are in the production of Sitemap, we recommend that you use the XML format, do not recommend that you use HTML and TXT format, if your site is the electrical business or information type site, the information is very large, and the structure is also complex, the page is very much, This time you can consider using the TXT format. This mode is not recommended because TXT mode is not able to set the update frequency and weight.
2, if your site is not static, nor pseudo static, is purely Dynamic Web site, then sitemap map URL do not appear parameters, such as question marks, equal signs and other special symbols, that is, not with the parameters of these special symbols. We can solve special parameter symbols in the form of transformations, because characters such as question mark and equal sign need to be represented by corresponding code.
3, it is recommended to put some important URL URLs as far as possible, such as Column page URL, channel page URL, topic page URL, and so on, of course, what kind of URL page is important, is entirely determined by the site itself, I here just to cite a few examples, why should the important URL on the top? As we all know, Spiders are crawling from left to right, up and down, for example, before a peer to submit a deleted channel to the site map, and still put on the front, the result of this deletion of the column of all the pages are crawled, and the other did not delete the column because many pages have not been crawled.
4, if your site for some reason, some stations put the map after the production and has been submitted to the search engine page deleted, but the URL in the Sitamap map and the site is not synchronized with the deletion, then there will be 404 pages, so, in the deletion of the site page, Be sure to synchronize the deletion of the URLs in the Sitamap map.
5, the new station does not recommend the use of longer than the daily cycle, so as to avoid problems. All page update frequencies are synch cheating. Also do not all update frequency is identical, impossible. In addition, the weight should pay attention to, do not recommend all the site weights are 1.0, because this site is the weight of the site is the weight of the distribution, rather than the site outside the weight distribution, the total weight of the site set unchanged. SEO need to consider those pages need to promote, can be appropriate to give high weights, do not promote the page does not recommend setting high weights.