In site optimization, checking how many of a site's pages a search engine has indexed is usually one of the first tasks. We mostly query this with the search engine's site command, sometimes combined with inurl, intitle, and similar operators, but the counts these return are often inaccurate. So how can you get an exact (or at least relatively accurate) count of indexed pages? There are in fact several methods, and with a little effort you can get a much more accurate figure. Below, the A5 Webmaster Net SEO team (http://seo.admin5.com/) describes several ways to query indexing accurately, using Baidu and Google as examples.
1. Use Baidu Webmaster Platform or Baidu Statistics
Some time ago, Baidu began publishing accurate indexing data for each site on its Webmaster Platform, updated daily. Once you have registered for the Webmaster Platform or Baidu Statistics, you can see exactly how many of your site's pages are indexed, with a figure for every day. For easier viewing, the data is listed in reverse chronological order. According to Baidu, this is currently the most accurate indexing data available.
Most people probably know this method already, so it is nothing surprising. The really useful material comes next, so keep reading.
2. Use a sitemap to query the site's indexed data
Besides Google Webmaster Tools, the Baidu Webmaster Platform now also accepts sitemap files. The file can be in TXT format, XML format, or sitemap index format, and the effect on crawling after submission is very noticeable. Baidu has not fully opened this feature yet, so an invitation code is required to submit a sitemap file there. The explanation below therefore uses Google as the main example.
You can ask your technical staff to generate the sitemap file. The simplest option is the TXT format: list one URL per line, with no empty lines in between, and use absolute addresses. The most commonly used format is XML, because in addition to the URL it can carry priority, update frequency, last-modified time, and other hints that make the site easier for search engines to crawl. After a successful submission, Google Webmaster Tools shows how many of the submitted URLs have been indexed.
That indexed-URL count is the accurate figure Google arrives at after crawling the sitemap file. If you put every URL of the site into the sitemap and submit it, this method tells you the exact number of indexed pages. Note that when the site has many URLs you can submit several sitemap files: a single sitemap file may contain no more than 50,000 URLs and may be no larger than 50 MB before compression (for Baidu, no larger than 10 MB).
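The splitting rule above can be sketched in Python. This is a minimal illustration, not a complete sitemap generator: the function names and the example.com URLs are made up for this example, only the `<loc>` element is written, and the file-size limit is not checked.

```python
from xml.sax.saxutils import escape

MAX_URLS_PER_SITEMAP = 50_000  # per-file URL limit quoted in the text above


def chunk_urls(urls, size=MAX_URLS_PER_SITEMAP):
    """Split a list of URLs into consecutive chunks of at most `size` items."""
    return [urls[i:i + size] for i in range(0, len(urls), size)]


def build_sitemap_xml(urls):
    """Return one XML sitemap document listing the given absolute URLs."""
    lines = [
        '<?xml version="1.0" encoding="UTF-8"?>',
        '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">',
    ]
    for url in urls:
        # escape() turns &, < and > into XML entities.
        lines.append(f"  <url><loc>{escape(url)}</loc></url>")
    lines.append("</urlset>")
    return "\n".join(lines) + "\n"


def build_sitemap_txt(urls):
    """Return a TXT sitemap: one absolute URL per line, no empty lines."""
    cleaned = [u.strip() for u in urls if u.strip()]
    for u in cleaned:
        if not u.startswith(("http://", "https://")):
            raise ValueError(f"not an absolute URL: {u}")
    return "\n".join(cleaned) + "\n"


# Hypothetical site with 120,000 pages: it needs three sitemap files.
demo_urls = [f"http://www.example.com/page-{n}.html" for n in range(120_000)]
for index, chunk in enumerate(chunk_urls(demo_urls), start=1):
    print(f"sitemap-{index}.xml", len(chunk))
```

Each chunk can then be passed to `build_sitemap_xml` and saved as its own file before submission. A size check on the encoded output would be needed to honor the 50 MB (or, for Baidu, 10 MB) limit.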
3. Use the Rank Tracker tool to check indexing
Rank Tracker is an excellent keyword-ranking tool from abroad. It can look up rankings for tens of thousands of keywords in a batch, which makes it very powerful, and we can also use it to check a site's indexing. The method is to export the site's URLs and import them into Rank Tracker for a bulk query, treating each URL as one of the site's keywords. A URL that ranks first for itself has been indexed.
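The counting step can be sketched as follows. The CSV layout here (columns named `keyword` and `rank`) is a hypothetical export format chosen for illustration, not Rank Tracker's documented format, so adjust the column names to whatever the tool actually produces.

```python
import csv
import io


def count_indexed_from_ranks(csv_text):
    """Count rows whose URL-as-keyword ranks first.

    Assumes a hypothetical CSV with 'keyword' and 'rank' columns,
    where an empty rank means the URL was not found at all.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    total = 0
    indexed = 0
    for row in reader:
        total += 1
        if row["rank"].strip() == "1":
            indexed += 1
    return indexed, total


sample = (
    "keyword,rank\n"
    "http://www.example.com/a.html,1\n"
    "http://www.example.com/b.html,\n"
    "http://www.example.com/c.html,7\n"
)
indexed, total = count_indexed_from_ranks(sample)
print(f"{indexed} of {total} URLs rank first for themselves")
```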
4. Use the Locomotive scraping tool to check indexing
First export the site's URLs. Then, following the pattern of Baidu's search URLs, generate in bulk the Baidu query URL that searches for each page URL as a keyword. Use the Locomotive tool to fetch these result pages and scrape the phrases that characterize an empty result, such as "not found" or "sorry". A result page that contains such a phrase means the URL is not indexed. The total number of URLs minus the number of URLs that are not indexed is the site's indexed page count.
Original article by the A5 Webmaster Net SEO team.
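The arithmetic in method 4 can be sketched in Python. This is only an outline under stated assumptions: it assumes Baidu's web search takes the query in the `wd` parameter, the marker phrases are placeholders to be replaced with the wording on a real no-results page, and fetching the pages is left to the scraping tool.

```python
from urllib.parse import quote

# Assumption: Baidu's web search accepts the query in the `wd` parameter.
BAIDU_SEARCH = "https://www.baidu.com/s?wd="

# Placeholder phrases; copy the real ones from an actual no-results page.
NOT_INDEXED_MARKERS = ("not found", "sorry")


def build_query_urls(page_urls):
    """Map each site URL to the search URL that looks it up as a keyword."""
    return {url: BAIDU_SEARCH + quote(url, safe="") for url in page_urls}


def is_indexed(result_text, markers=NOT_INDEXED_MARKERS):
    """A result page counts as indexed if no marker phrase appears in it."""
    text = result_text.lower()
    return not any(marker in text for marker in markers)


def count_indexed(results):
    """results maps each site URL to the text of its fetched result page.

    Returns (indexed_count, list_of_urls_not_indexed).
    """
    not_indexed = [url for url, text in results.items() if not is_indexed(text)]
    return len(results) - len(not_indexed), not_indexed


# Hypothetical scraped results for three pages.
scraped = {
    "http://www.example.com/a.html": "<p>Sorry, no matching pages</p>",
    "http://www.example.com/b.html": "<p>Example page B</p>",
    "http://www.example.com/c.html": "<p>Example page C</p>",
}
indexed, missing = count_indexed(scraped)
print(indexed, "indexed;", len(missing), "not indexed")
```

Matching on generic words can misclassify a page whose own title contains one of them, so the marker phrases should be as specific as the real no-results message allows.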