Say original and pseudo original in search engine judgment

Source: Internet
Author: User

Intermediary transaction http://www.aliyun.com/zixun/aggregation/6858.html ">seo diagnose Taobao guest cloud host technology Hall

Original and pseudo original became an important topic in the post-Internet era. That is, how to protect the "proof of the content of the King" problem, for the portal type of large internet companies, perhaps they have professional editors and writers, but as far as I know, it is still not escape to reprint others ' articles. How to achieve a balance between original and non-original, is to do the site operators and editors must carry out a point of control.

How do search engines identify original and pseudo original?

In the current computer, it is impossible to achieve real AI identification content, perhaps the English department is a little better, after all, the English system is limited, each independent English meaning is independent or related. And the English language has the default habit of "-" to differentiate.

And Chinese is obviously different. The same meaning can be described in countless words, ever-changing. For example, the word "peach" has more meaning. So computers are impossible to discern. So how does the search engine determine original and pseudo original? The following is the realization of the idea.

First of all, the search engine to the two articles for organic screening, as compared to the object, then how to know this is more relevant to the article? Of course, the key words, according to the key words of the article, which is why the article to build a certain proportion of the keyword reasons, at least how to distinguish the article in the keyword, Search engine own algorithm to solve, no longer repeat.

After taking out two articles, the computer is analyzed:

1, set a ratio, such as the definition of M, callout is a factor of 0.5.

2, put a article, according to the number of words, paragraph is divided into three paragraphs. B Article paragraph is divided into three paragraphs, and then to compile the algorithm, can also be understood to become encryption and so on, that is, the text into a symbol. For example, a phrase is compiled to become a string such as AAACBDFBCDFSDAFEFASDFASD. Of course, it is not necessarily the use of ABCD characters, the advantage of this is because of the convenience of computer comparison and processing.

3, then put a, b two articles after the second step of processing, and then through the algorithm, the similarity between the two articles how much, (estimated this contrast algorithm is very complex, I can only guess.) Will get a value, that is, similar to the above 1 mentioned in the M coefficient, according to the standard, such as the above 0.5 means that the same, below the said is not the same, if the same as the use of search engine crawling other parameters to determine who is original, or is not original.

How do we deal with the original decision of the search engine?

Villains, outsmart, the Internet is never the absolute spear and shield, in the current computer can not really achieve artificial intelligence, so, original and false original is a topic for the moment. To do the most strong pseudo original can be as follows three steps:

1, the title must be changed, and to change the Acme. Chinese writing is very complex, the same meaning can be expressed in many ways, if you really can not change, then I tell you a way, is to write the title to 20-25 words, you must be very special.

2, if you have good writing skills, you read people's articles, you can immediately form a certain framework in the out, and then use your language to describe, plus pictures and other rich text to decorate, that is absolutely a rare false original article. For example, our car market China Network has a professional editorial staff, for the release of a variety of automotive news are a large number of false original effect.

3, content disorder. There are many rubbish stations on the Internet. Why people can get the ranking and flow of keywords, the reason is that the acquisition of information for false original, it can become original, the most important reason is that Chinese characters are too complex. Program to build a thesaurus, by matching synonymous words, you can basically achieve the fluency of the statement, and reduce a large number of similarities. As for the content of the article to express the author what the real image, the computer is not read.

Original and False original is a pair of angels and demons, you do not have to hate others to your article to False original, you most condemn others bad character. The so-called article a big copy. The real master is of course the high-end. Let false original come more crazy!

Article original, reprint please keep this trip Shenzhen auto Show: http://www.carixy.com/shenzhenchezhan/201009/

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.