Understanding of some search engine crawl information

Source: Internet
Author: User

Intermediary transaction http://www.aliyun.com/zixun/aggregation/6858.html ">seo diagnose Taobao guest cloud host technology Hall

Baidu and Google, Yahoo show problems and time and ranking relationship, search engine capture law:

Search engine is based on the value of a website to scan, valuable, updated quickly, good information site to crawl more opportunities, crawl time is also long. If the bandwidth is wide enough, the configuration is high enough, every day will usher in a lot of search engine spiders come crawling information. Because the site information can be modified, Baidu was included in your information when you have checked the value of your information ranking, and there is no illegal national security. When included, any changes will result in a drop in the ranking of information, so although your information on the homepage of Baidu, you do not change the information, because Baidu will also be a match for him two times, that is, Baidu take his snapshot to compare with your information. If the same old position does not move, this is also a lot of people do the site why are the main reasons for static. Favorable search engine to give a good ranking.

Dynamic website good or static site good?

Many experts believe that the static is good, because the top. The fact page is exactly the same. But from the day he was static he was on the right to locate the day, according to search engines to provide people with the latest information principles. He certainly doesn't have a long list. But dynamic every time to read from the database and easy to modify, so there are a lot of dynamic information on the first shot. In Baidu and Google soon displayed on the home page.

Then the Dynamic Web page is the best because he requires a fairly high server configuration. (The price of the server in hundreds of thousands of to tens of millions of prices, naturally there are many companies do not want to spend the money) we look at Baidu, GOOGLE, Yahoo these world-class companies are still using dynamic. Static Web: Advantage 1. Open fast, top. Disadvantage: The time is not long, the actual effect does not engage. Dynamic Web page: Advantages 1. High effectiveness, ranked first. Disadvantage: Requires high server and bandwidth.

Baidu and Google, Yahoo show the problem and time and ranking relationship?

Because the big search engine has a lot of server cluster composition, probably all have thousands of sets of components. To make it work fast. The server cluster forms a mirror and disk array, which turns the tens of thousands of hard drives into a hard drive so that you find the information you want in the search results quickly. Then our web page was photographed by Baidu, Baidu will put the information to thousands of hard drive separate storage, so you read faster. If we put our 1 million piece of information on a hard drive, it is impossible to display 1 million messages in 0.0001 seconds with approximately 30G of data. So our information by Baidu automatically put on many hard drives to store snapshots. For quick extraction.

When we use the site command to search, the display is 400,000 pieces of information, the database is generally 60.7 million pieces of data. Baidu and Google, like the garbage or useless information deleted, or automatically slowly deleted. When we use Baidu to site: The day's information shows very little, 1. Because there is a lot of information is not shown after the collection, or in the background of Baidu evaluation.

2. Because of Baidu's hard disk server, you use command site: When only show a host of information, there are a lot of not show up. So you see fewer results. Such as: The day included 20,000 pieces of information, distributed in Baidu's 2000 Server hard disk, you use site: the day of information may only show about 10-20 articles. But you see 1000 or more after 10 days or 20 days. Have time our site Baidu Day of information, see only 1 or 3 pieces, but in fact you see all the time is a lot, such as the input command site:www.dgsaiwen.net.

There is time for us to look at the information is 100,000 or 50,000 on the description of the daily information on Baidu will suddenly show up one day. So: site: Just a reference command. The results of 30–40% can be evaluated. The most critical core problem: Your site must have a lot of information and very useful data.

My current new station Www.dgsaiwen.net adhere to update every day, now included rate of more than 90%.

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.