An Analysis of How Google and Baidu Identify Pseudo-Original Articles

Source: Internet
Author: User
Tags: filter, keywords

First, I (Stone) wrote this article entirely from my own long-term observation of search engines and a summary of what I saw. If you think any of the analysis is wrong or off the mark, please correct me; I am happy to discuss and to accept criticism. I have been studying SEO for some time now, and although the highest level of SEO is to forget SEO, the technology is still very interesting. I study it purely out of personal interest, with no other motive, and I am writing this article as a reference for webmasters old and new.

1st, which words do spiders dislike? Let's take a look. In general, search engines filter out words with a very high repetition rate, such as "then" and "ah". Some people will ask why. It's very simple: such words are useless and do nothing to help ranking.
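As a rough sketch of this filtering step (not how any real search engine does it), the idea can be written in a few lines of Python. The stop-word list here is an assumption for illustration: only "then" and "ah" come from the text above, and the function name `remove_stop_words` is hypothetical.

```python
# Hypothetical stop-word list; real engines use their own, unpublished lists.
# Only "then" and "ah" come from the article; the rest are assumed examples.
STOP_WORDS = {"then", "ah", "the", "a", "of"}

def remove_stop_words(text: str) -> list[str]:
    """Split on whitespace, strip punctuation, lowercase, drop stop words."""
    tokens = (word.strip(".,!?\"'").lower() for word in text.split())
    return [t for t in tokens if t and t not in STOP_WORDS]

print(remove_stop_words("Ah, then the spider reads the page"))
# ['spider', 'reads', 'page']
```

Only the words that carry meaning survive, which is what the later steps work on.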

2nd, let's talk about how Baidu and Google algorithmically judge pseudo-original content, and why swapping in synonyms sometimes has no effect. What follows is summed up from my personal experience. We all know there is a pile of pseudo-original tools on the market that can rewrite words, for example replacing one word for "computer" with a synonym for it. So what reason is there to believe that a powerful search engine cannot do the same? It certainly can. When a search engine encounters two synonyms for "computer", it automatically converts both into the same thing, which we will call a here. That is why, in many cases, pseudo-original articles made by synonym replacement do not get indexed.
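To make the synonym idea concrete, here is a minimal sketch that assumes the engine keeps a table mapping every synonym to one canonical token. The table `CANONICAL`, its entries, and the function `normalize` are all hypothetical; the real mechanism is not public.

```python
# Hypothetical synonym table: every variant maps to one canonical token.
CANONICAL = {
    "computer": "A",
    "pc": "A",
    "website": "B",
    "site": "B",
}

def normalize(tokens: list[str]) -> list[str]:
    """Replace each known synonym with its canonical token; keep other words."""
    return [CANONICAL.get(t.lower(), t.lower()) for t in tokens]

original = ["my", "computer", "hosts", "the", "website"]
rewritten = ["my", "PC", "hosts", "the", "site"]
print(normalize(original) == normalize(rewritten))  # True
```

After normalization the "rewritten" sentence is indistinguishable from the original, so the synonym swap gains nothing.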

3rd, the key point: why sometimes not only synonym replacement but even shuffling sentences and paragraphs still has no effect. Once the search engine has filtered out the useless words and converted the various synonyms into a, b, c, d, it begins extracting the most important keywords on the page, say a, c, e. (This is only an example; the number of keywords actually extracted may not be three but anywhere from one to several dozen.) These keywords are then recorded as a fingerprint. This means that, to the search engine, an article whose synonyms have been swapped and whose paragraphs have been shuffled is identical to the original. If this is not clear, think it over carefully; my writing skills are not very good, so I hope you can follow.
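The fingerprint idea can be sketched as follows. This is only an illustration under stated assumptions: "importance" is approximated by raw frequency, the top three keywords are kept, and SHA-256 stands in for whatever hash an engine might use. The function name `fingerprint` and the parameter `top_n` are hypothetical.

```python
import hashlib
from collections import Counter

def fingerprint(tokens: list[str], top_n: int = 3) -> str:
    """Hash the top_n most frequent tokens, ignoring the order they appear in."""
    counts = Counter(tokens)
    # Sort by descending count, then alphabetically, so ties break deterministically.
    ranked = sorted(counts, key=lambda t: (-counts[t], t))
    keywords = sorted(ranked[:top_n])
    return hashlib.sha256(" ".join(keywords).encode("utf-8")).hexdigest()

# Normalized tokens of an article, and the same tokens in shuffled order.
article = ["A", "B", "A", "C", "E", "C", "E", "D"]
shuffled = ["E", "C", "D", "A", "B", "C", "A", "E"]
print(fingerprint(article) == fingerprint(shuffled))  # True
```

Both versions reduce to the keywords A, C, E, so shuffling paragraphs leaves the fingerprint unchanged.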

4th, a more in-depth explanation of why an article stitched together from paragraphs of several articles may still be identified by the search engine. Does that seem strange? First of all, since Baidu can produce fingerprints, it can naturally decode them too. Reorganizing paragraphs only adds or removes important keywords. Suppose the important keywords of the first article are A, B, C and those of the second are A, B. The search engine may then apply some internal similarity-recognition algorithm of its own: if the two differ by more than a certain percentage, the article is released and given weight; if they differ by less than that, the article is judged a duplicate and gets neither a snapshot nor weight. This is why an article reorganized from paragraphs of several articles is still likely to be identified by the search engine.
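The article does not say which similarity algorithm is used, so the sketch below swaps in a common stand-in, Jaccard similarity over keyword sets, applied to the A, B, C versus A, B example. The threshold value `DUPLICATE_THRESHOLD` and the function names are assumptions for illustration only.

```python
def jaccard(a: set[str], b: set[str]) -> float:
    """Share of keywords the two sets have in common (1.0 means identical)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)

# Hypothetical cut-off; the real value, if one exists, is not public.
DUPLICATE_THRESHOLD = 0.6

def is_duplicate(a: set[str], b: set[str]) -> bool:
    """Treat two articles as duplicates when their keyword overlap is high."""
    return jaccard(a, b) >= DUPLICATE_THRESHOLD

first = {"A", "B", "C"}
second = {"A", "B"}
print(round(jaccard(first, second), 2))  # 0.67
print(is_duplicate(first, second))       # True
```

With this assumed threshold, dropping a single keyword is not enough of a difference to escape the duplicate verdict.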

5th, I want to explain why some pseudo-original articles still get indexed very well. My reasoning above is only the general framework of how Baidu identifies pseudo-original content; in practice, the work Google and Baidu do to identify it is far larger and more complex. Google changes its algorithm some 200 times a year, which is enough to show how complex the algorithm is. So why do some pseudo-original articles still get indexed well? There are only two reasons:

NO1. The site itself carries very high weight, as with the large portal sites. Even if such a site simply copies other people's articles without any rewriting, they will still be indexed and given weight. There is nothing to discuss here; fretting over it will get you nowhere!

NO2. It is absolutely impossible for a search engine to filter out all pseudo-original content, just as the artificial intelligence Turing envisioned can never perfectly have human emotions. Does that make sense? Do you now have some understanding of how search engines judge pseudo-original content?

Summary: The above is the experience of Stone from Huamei net (http://www.huamiweb.com/) on how Baidu and Google identify pseudo-original content. If any webmaster finds what I have written unreliable, please point it out and correct me; after all, we are exploring more advanced SEO together. I believe every webmaster has done some research on search engines, so everyone is welcome to share their own insights on the same SEO stage. First published on A5; please credit the source when reposting.


