First of all I wrote this article entirely from my long-term observation summary, if there is anything wrong please correct me. After all, I study SEO also has a period of time, although said that the highest level of SEO is to forget seo, but the SEO technology is still very interesting, I study SEO technology is purely personal interest, write this article is to you webmaster do a good reference.
First, the search engine will filter out ", yes, ah," and so the repetition rate is very high to the rank of useless words that do not help.
Second, here's why it is sometimes ineffective to convert synonyms. Starting from here, it's my personal experience. Since there is a pile of false original tools on the market can be false original words such as "computer" pseudo original "computer", then what reason does not believe that a powerful search engine will not be false original? Therefore, the search engine must be synonymous with false original, when the search engine encountered "computer" and "Computer", will automatically convert them here. Suppose to be a, so many cases of synonyms false original is not included in the reason.
Third, here is the reason why sometimes not only the synonyms are converted and the sentences and passages are still not valid. When the search engine filters out useless words and converts all kinds of synonyms into a,b,c,d, it starts to extract the key words from the page a,c,e (for example, the actual number of keywords that can be extracted is not ace three but 1 to dozens of is possible). And the words are recorded in a fingerprint. This means that the articles and texts that have been converted by the synonyms and the paragraphs that have been disrupted are considered identical to the search engine.
Four, this deeper explanation of why a few paragraphs of the article reorganization is still likely to be identified by the search engine. First of all, since Baidu can produce fingerprints can naturally decode fingerprints, paragraph reorganization of the article is only important keyword increase or decrease, such as two articles the first important keyword is ABC, and the second is AB, then the search engine may use its own an internal similarity recognition algorithm, If the percentage difference is below a certain value to release the article and give weight, if the difference of the percentage is higher than a certain value then will be judged to repeat the article so as not to give a snapshot, also not given weight. This is why a few articles in the paragraph of the reorganization of the article is still likely to be identified by the search engine reasons.
Five, I want to explain why some false original articles can still be included in the good. My reasoning above is only for Baidu to identify pseudo original algorithm of the general framework, in fact, Google Baidu for the identification of false original work to be more large and more complex, Google will change 200 times a year algorithm enough to see the complexity of the algorithm. Why some false original articles can still be included in the good. There are only two reasons:
1. The weight of the site itself is high, even if not for the original copy other people's article or will be included to give weight.
2. Search engines are absolutely impossible to filter all pseudo original, it is impossible, just as the Turing of artificial intelligence can never be perfect to have human emotions.
Personal advice:
1 everyone to do the garbage collection of friends to pay attention to, you can make a sum of time to eat a pen. But I also hope that you can consider the future is not a different direction to do? If Baidu changes some algorithms to make the judgment false original more intelligent, even if some small changes may be your doom. In addition this year, Google also declared war on the garbage station, hehe you see it.
2 You are honest to write the original webmaster, you absolutely choose the road. But at the same time also pay attention to their own copyright issues Oh.
This article for my original finally also welcome you to have what good idea we together exchange my station is Jiangsu Enterprise SEO www.seohcit.com