Analyze the habits of Google and Baidu search engines

Source: Internet
Author: User

Google search engine habits

Google, as the world's largest multilingual search engine, has formed its own webpage indexing habits and established its own set of standards in its history. The study of Goolge's webpage indexing habits is conducive to better cater to Google's search engine taste, so as to improve the webpage indexing volume and indexing ranking.

For the moment, we will not study Google's indexing of other languages. For Chinese, Google indexing has the following features:

1. High sensitivity and rapid response

Google has a high level of knowledge about the newly created website. Of course, the newly created website must have external links or submit website logon information to Google. Otherwise, even if Google's search technology is even more powerful, it is difficult for Google to find a website that only the webmaster can see. Google provides two ways to create a website: First, external links to the website, and second, submitting website login data to Google. In general, the latter's indexing speed is relatively fast, and the former depends on Google's indexing frequency of new website external links. If Google has a high rating on external linked websites and a high indexing frequency, it will find that the speed of the new website is also high, and the date on which the new website is indexed will be advanced.

 

2. Repeat correlation and importance

 

Google uses PageRank technology to check the entire network link structure and determine which webpages are most important. Then perform hypertext match analysis to determine which webpages are related to the specific search being executed. After considering the overall importance and relevance to specific queries, Google puts the most relevant and reliable search results first. This is also one of the features of Google's webpage indexing.

3. Fast changes and high mobility

Google's roaming bot regularly captures the Web and indexed a large number of Web pages. The next capture completed later will notice the new website, changes to the existing website and invalid links, and adjust the content changes in the search results.


4. Text descriptions that emphasize links

Google will index the text description of the link as a keyword, so we must carefully design the text description of the link when making a link, so that it not only conforms to the positioning of the website but also has no relevance, to win the trust of Google.


5. Pay more attention to the description of webpage Meta tags

Most of the time, when Google displays the search results, the Description of the web page is displayed and occupies a heavier part.

Technologies used by Google:

 

PageRank technology: PageRank can objectively evaluate the importance of web pages. PageRank does not calculate the number of direct links. Instead, it interprets the link from webpage A to webpage B as one vote for webpage B. In this way, PageRank evaluates the importance of the page based on the number of votes received by page B.


Hypertext match analysis: Google's search engine also analyzes webpage content. However, Google's technology does not simply scan Web-based text (website publishers can control this type of text through meta tags, instead, it analyzes all the content of the web page, as well as the font, partition, and precise location of each text. Google also analyzes the content of adjacent webpages to ensure that the results most relevant to user queries are returned.

Baidu search engine indexing habits

Baidu is the world's largest Chinese search engine, and its search technology for Chinese web pages is somewhat ahead of Google. Baidu has the same or similar characteristics with Google in some aspects, and has the following features:

1. Pay more attention to the first impression

The first impression that a website gives Baidu is important. Compared with Google, Baidu's search engine has a high level of human engagement. That is to say, in some aspects, people may decide whether to include webpages rather than by machines. Therefore, before you log on to the Baidu search engine, you 'd better enrich the content, increase the original content, and increase the relevance between Web keywords and content so that you can get a better impression on Baidu.

 

2. Sensitive to webpage updates


Baidu is more sensitive to web page updates than Google, which may be related to Baidu's local character. The Baidu search engine is updated every week, and the website has different update rates based on their importance. The frequency ranges from several days to January. Therefore, the indexing time is basically indicated in Baidu's search results.

 

3. Pay more attention to the homepage

Baidu attaches more importance to the home page than Google does, which is in the same line with the aforementioned "attaching more importance to the first impression of indexing. When displaying search results, Baidu often displays the homepage of a website, instead of a specific content page (when it thinks it is not important enough ). Relatively speaking, its user experience is discounted, and its "Baidu snapshot" user volume is increased.

 

4. Links that place greater emphasis on absolute addresses

Baidu attaches great importance to the indexing of absolute addresses when indexing webpages. The web page snapshot function provided by Baidu does not parse the absolute path of relative addresses. I wonder whether this is the negligence of Baidu technology or a major manifestation of its preference.

5. Pay more attention to the date of collection


Baidu attaches great importance to the webpage's indexing date and is also a reference point for its search result ranking. The sooner the page is indexed, the higher the ranking, sometimes, we do not even consider relevance to put the content that is considered important first, but click it to find out outdated information or junk information. This is the technology that Baidu needs to improve.

 

Technologies used by Baidu:

 

Baidu uses the following technologies: "An Internet image and a quasi-Image website identification method", which solves the problem of repeated retrieval of similar information by search engines and saves network resources and local resources, improve the quality and efficiency of system services. "a vocabulary-based computer indexing and retrieval method", this method is used to analyze and process a piece of continuous text information, by adding invisible words, you can improve the retrieval quality of the vocabulary indexing and retrieval systems, so that users can obtain more accurate retrieval results; "One method of recording and analyzing online information using snapshots" is to take snapshots of a specific information on the internet multiple times, retain the current status of the information. Through analysis of a series of snapshot information, we can obtain valid data to conveniently obtain information changes on the Internet.

 

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.