A Must-Read for SEO Novices: How Search Engines Work (Part 2)

Source: Internet
Author: User

A search engine works in three stages:
1. Crawling and fetching
2. Preprocessing
3. Ranking

The day before yesterday I published the first part on A5, covering crawling and fetching: http://www.admin5.com/article/20110630/356286.shtml. Have a look if you are interested. This article continues with preprocessing. After crawling and fetching, the search engine stores the raw pages in its database, but those pages cannot be used directly for query ranking. Consider how many pages a search engine holds: running the ranking computation over all of them only after a user has typed a keyword would clearly be impractical. The pages are therefore preprocessed in advance, so that when a user enters a keyword, the ranking program reads the preprocessed data from the database, computes the ranking, and shows the results to the user.

Take Baidu as an example. The search engine first extracts the text content from the page file and then performs Chinese word segmentation on it. A phrase such as "pipe bender price", for instance, is split into three terms: "bent pipe", "pipe bender", and "price". This is why I advised in an earlier article against keyword stuffing: stuffing is treated as cheating, and because of segmentation you can achieve a similar effect without it. That is why understanding how search engines work matters so much.
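The segmentation step can be illustrated with a small dictionary-based sketch. It is not Baidu's actual algorithm, which the article does not describe. The dictionary and the unspaced English string are hypothetical stand-ins for Chinese text, chosen so that one phrase yields overlapping terms in the same way as the example above.

```python
def segment_full(text, dictionary):
    """Return every dictionary word found in text, scanning left to right."""
    max_len = max(len(word) for word in dictionary)
    found = []
    for start in range(len(text)):
        # Try every candidate length at this position, shortest first.
        for length in range(1, max_len + 1):
            candidate = text[start:start + length]
            # Skip slices that were cut short by the end of the text.
            if len(candidate) == length and candidate in dictionary:
                found.append(candidate)
    return found


# Hypothetical dictionary; spaces are removed to mimic unspaced Chinese text.
DICTIONARY = {"pipe", "pipebender", "price"}
terms = segment_full("pipebenderprice", DICTIONARY)
print(terms)
```

Because the phrase already produces several related terms on its own, repeating the keyword many times adds little that segmentation has not already captured.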

Chinese contains some words that appear very frequently but contribute nothing to the content, such as function words and interjections like "ah". These are called stop words, and the search engine removes them so that the page's topic stands out. Elements such as copyright notices and advertisements, which appear across a site, are generally removed as well.

After that, the search engine deduplicates pages: when the same article appears repeatedly on different sites, the duplicate copies are dropped. This is not absolute, and for various reasons duplicate content still exists in the index, but it is best to stick to original content, or at least pseudo-original content. How pseudo-original content should be done becomes clear once you understand deduplication. The basic method is to compute a fingerprint from the page's characteristic keywords: the most representative keywords in the main content are selected, which are usually the most frequent ones, typically about 10. Simply rewriting the first paragraph or shuffling the paragraph order therefore does not make an article original. The key is to change the keywords themselves. For example, if the article's key term is one word for "computer", swap in another word with the same meaning. In short, replacing the most frequent keywords is what makes the page register as original.
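A minimal sketch of stop-word removal and keyword-based deduplication, assuming the fingerprint is simply the set of the most frequent remaining words. The stop-word list, function names, and sample sentences are hypothetical, and real search engines may fingerprint pages quite differently.

```python
from collections import Counter

# Hypothetical stop-word list; a real engine would use a far larger one.
STOP_WORDS = {"the", "a", "of", "is", "and", "ah"}


def fingerprint(words, top_n=10):
    """Return the set of the top_n most frequent non-stop-words."""
    counts = Counter(w.lower() for w in words if w.lower() not in STOP_WORDS)
    # Sort by descending frequency, then alphabetically, so ties do not
    # depend on the order in which words appear on the page.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return frozenset(word for word, _ in ranked[:top_n])


def looks_duplicate(words_a, words_b, top_n=10):
    """Treat two pages as duplicates when their fingerprints match."""
    return fingerprint(words_a, top_n) == fingerprint(words_b, top_n)


original = "the computer is fast and the computer is cheap ah computer".split()
reordered = "the computer is cheap and the computer is fast ah computer".split()
reworded = "the laptop is fast and the laptop is cheap ah laptop".split()

print(looks_duplicate(original, reordered))  # same keywords, new order
print(looks_duplicate(original, reworded))   # most frequent keyword replaced
```

Under this assumption, reordering the text leaves the fingerprint unchanged, while replacing the most frequent keyword changes it, which matches the behavior described above.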

After the steps above, the search engine extracts the keywords from the page. Using the word segmentation program, it converts the page into a set of keywords and records each keyword's frequency, position, and other attributes on the page, so every page is stored as a set of keywords. The data is then rearranged by keyword, so that each keyword maps to a list of pages. When a user searches for a keyword, the ranking program looks up that keyword and immediately obtains all the pages that contain it.
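The two structures described here are commonly called a forward index (page to keywords) and an inverted index (keyword to pages). The following is a minimal sketch of both. The page names and word lists are hypothetical and assumed to be already segmented and stripped of stop words.

```python
from collections import defaultdict


def build_forward_index(pages):
    """Map each page to {keyword: [positions]}; frequency is len(positions)."""
    forward = {}
    for url, words in pages.items():
        positions = defaultdict(list)
        for pos, word in enumerate(words):
            positions[word].append(pos)
        forward[url] = dict(positions)
    return forward


def build_inverted_index(forward):
    """Map each keyword to {page: frequency}."""
    inverted = defaultdict(dict)
    for url, positions in forward.items():
        for word, where in positions.items():
            inverted[word][url] = len(where)
    return dict(inverted)


# Hypothetical pages, already segmented and stripped of stop words.
PAGES = {
    "page1": ["pipe", "bender", "price", "pipe"],
    "page2": ["bender", "repair"],
}
forward = build_forward_index(PAGES)
inverted = build_inverted_index(forward)
print(sorted(inverted["bender"]))  # pages that contain "bender"
```

At query time only the inverted index needs to be consulted, which is why the lookup is fast even when the number of pages is very large.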

One more thing to add is link relationships: which pages each page links to, which inbound links each page has, and what anchor text those links use. These link relationships combine to determine the weight of the site and of its pages. Covering this properly would take a lot of time, so I will write about it separately when I am free.
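As a rough sketch of the link data involved, the code below records outbound links, inbound links, and anchor text for a few hypothetical pages. The inbound-link count is shown only as a crude stand-in for weight. The article does not say how weight is actually computed, and real engines use far more elaborate link analysis.

```python
from collections import defaultdict

# Hypothetical link data: (source page, target page, anchor text).
LINKS = [
    ("a.html", "b.html", "pipe bender"),
    ("c.html", "b.html", "bender price"),
    ("b.html", "a.html", "home"),
]


def build_link_maps(links):
    """Return (outbound, inbound) maps built from link triples."""
    outbound = defaultdict(list)  # page -> pages it links to
    inbound = defaultdict(list)   # page -> (linking page, anchor text)
    for source, target, anchor in links:
        outbound[source].append(target)
        inbound[target].append((source, anchor))
    return dict(outbound), dict(inbound)


outbound, inbound = build_link_maps(LINKS)
# Crude proxy for weight: the number of inbound links per page.
inbound_counts = {page: len(entries) for page, entries in inbound.items()}
print(inbound_counts)
```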

If I have time tomorrow, I will write about how ranking works; today I still have a lot to do. This article was contributed by the webmaster of Zhangjiagang Pipe Bender, http://www.zjgjixie.com. Please keep the link when reposting. Owners of related business and machinery websites who would like to exchange links can add QQ: 26043721.


