Identifying original pages is a search engine's duty, not a gift

Source: Internet
Author: User

On April 13, 2010, in the Baidu Tieba Webmaster Club, the well-known Chinese SEO expert Zac asked in a post about original content that cannot be identified: "Original content often ranks below reprinted or plagiarized copies. What can webmasters do to prevent or improve this?" He also relayed one webmaster's complaint: "My site publishes original content every day, and Baidu updates its index of the site every day, but other people's reprints get indexed while my own articles cannot be found in search. I kept up original writing for nearly four months, yet Baidu still pushed me past position 500!"

Two years ago: Zac, on behalf of webmasters, put questions about the indexing of original content to Baidu's representative, Lee.

More than two years later, the situation described above has not changed and has even worsened. All kinds of "scraped pseudo-original" pages that carry no valuable original content are readily recommended to searchers by Baidu Web Search's keyword index, while the sites that first published the original content are nowhere in the rankings. In effect this has condoned the steady spread of a so-called SEO that targets this Baidu flaw and is built on scraped pseudo-original content.

Unsurprisingly, at Baidu's "Webmaster Clinic Open Day" on August 10 this year, the identification of original content was again a question that webmasters and SEO practitioners kept putting to Baidu search engineer Lee.

Equally predictably, Lee's answer was a rehash of what he said two years ago ("On this, I can only say that Baidu's strategy is not perfect, and we have been improving it"): "We are designing a better algorithm for recognizing original content."

Anyone who follows Baidu will readily notice that Lee's answer, "We are designing a better algorithm for recognizing original content," flatly contradicts the July 2 announcement by Baidu Web Search's anti-cheating team, "Measures against low-quality sites have taken effect," which stated that measures to combat low-quality sites (pseudo-original sites and sites with no original content) were already in force. We still remember that the announcement said: "To webmasters who provide high-quality, original resources: because we are lowering or even eliminating the rankings of low-quality sites, you will get more traffic from Baidu."

Yet less than two months later, Lee's answer completely negated the anti-cheating team's statement, which is truly surprising.

Moreover, on both occasions over these two years when he faced questions about identifying original content, Lee dodged by changing the subject. Two years ago his answer was: "From the user-experience point of view, some reprints may be no worse than the original... But many domestic reprints cut off the beginning and the end, which hurts the original author even more." That answer dwelt mostly on the lack of standards in domestic reprinting. This year his answer was: "More than 80% (of the complaints Baidu receives from sites claiming to be original) are invalid. There are even large numbers of sites where an 'old Chinese medicine doctor' claims to cure incurable diseases in 3-5 days, whose content is entirely unreadable, and which claim to be high-quality websites."

There is no denying that what Lee says is true. But an accumulation of true details does not add up to the whole truth. These widespread realities do not mean that the Chinese web has no high-quality original content, and they are not the reason Baidu cannot identify the site that first published an original article. As the saying goes, "Without a diamond drill, don't take on porcelain work." Statements like Lee's only prove that Baidu's ability to identify originals and remove duplicate pages has made no progress.

It must be stressed that many grassroots creators, knowing that identifying original pages is a weak point of every search engine, add a copyright notice at the end of each article to mark the site of first publication, and at the same time "synchronize content" by submitting it to high-quality industry sites, as a way of guiding both search engines and reprinting webmasters. Most of these links are plain-text links, but a statement by Baidu search engineer Lee gave them confidence: "Let's make it clear: is it possible to recognize and process links in plain text form (not tags)? The answer is yes. A search engine spider needs to discover and crawl links on the Internet in time; what form a link takes is not important."
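
To make the idea of a plain-text link concrete, here is a minimal sketch of how a crawler might pull URLs out of a copyright notice that contains no anchor markup. It is an illustration only, not Baidu's actual method: the regular expression and the function name extract_plain_text_links are assumptions made for this example, and a real crawler would need far more robust URL detection.

```python
import re

# Hypothetical pattern for illustration: "http(s)://" followed by any run of
# characters that are not whitespace, angle brackets, quotes or parentheses.
URL_PATTERN = re.compile(r"https?://[^\s<>\"'()]+")


def extract_plain_text_links(text: str) -> list[str]:
    """Return URLs written as plain text, in order of appearance, without repeats."""
    links: list[str] = []
    for match in URL_PATTERN.finditer(text):
        # Sentence punctuation directly after a URL is usually not part of it.
        url = match.group(0).rstrip(".,;:!?")
        if url not in links:
            links.append(url)
    return links


notice = (
    "(This article was first published by Gouyn12; when reprinting, please "
    "credit the source: http://www.gouyn12.com/cnnet/327.html)."
)
print(extract_plain_text_links(notice))
```

A link found this way carries no anchor text, so the surrounding copyright wording is the only hint that the URL points to the place of first publication.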

But to the dismay of these webmasters, even when their submissions to many authoritative industry sites involved none of the truncated reprinting Lee described, and even when the pages on those high-weight sites were created and indexed significantly earlier than the "scraped pseudo-original" sites, large numbers of original first-publication pages were still ignored by Baidu while "scraped pseudo-original" sites ranked high. Many of those pages excerpt only an arbitrary part of the article and do not fully convey its subject, so they simply cannot meet the "better user experience" standard that Baidu advertises.

It must also be noted that, although identifying original pages has always been a weakness of search engines, not every search engine performs as badly as Baidu when many high-weight URLs point to the original first-publication page. As the well-known domestic SEO practitioner Wang Tong has said, Google faces the same Chinese web, where scraped pseudo-original content runs rampant, and the same copyright notices pointing to the original first-publication page (plus signals such as publication time, breadth of links and the page weight of the linking sites). Yet Google has not shown the kind of collapse seen at Baidu, the search engine that claims to "understand Chinese best": the top positions for related searches occupied wholesale by scraped pseudo-original pages, with the original first-publication page nowhere to be found.

This shows that Baidu, which "understands Chinese best," has basically failed to complete the work that must be done before its keyword index recommends pages to searchers: identifying originals and removing duplicate pages, so as to single out the high-quality pages and important supplementary pages that deserve recommendation. Its technical level here is very low and it urgently needs to catch up, while Lee's arguments merely keep making excuses for Baidu.
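
The two tasks named above, removing duplicate pages and identifying the original, can be sketched with a textbook technique: compare pages by word shingles and Jaccard similarity, then treat the earliest-seen page in each group of near-duplicates as the candidate original. This is a sketch under stated assumptions, not the algorithm of Baidu or Google. The Page type, the 0.5 threshold and the example.com URLs are all hypothetical, and a crawler's first-seen time is itself only an approximation of the true publication time.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class Page:
    url: str              # hypothetical example URLs below
    first_seen: datetime  # when the crawler first saw the page (an assumption)
    text: str


def shingles(text: str, k: int = 3) -> set[tuple[str, ...]]:
    """Return the set of k-word shingles (overlapping word windows) of a text."""
    words = text.lower().split()
    if len(words) < k:
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a: set, b: set) -> float:
    """Jaccard similarity of two sets; defined as 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def group_near_duplicates(pages: list[Page], threshold: float = 0.5) -> list[list[Page]]:
    """Group pages into clusters of near-duplicates.

    Pages are visited in order of first_seen, so the first page of each
    cluster is the earliest-seen one, i.e. the candidate original.
    """
    clusters: list[list[Page]] = []
    for page in sorted(pages, key=lambda p: p.first_seen):
        signature = shingles(page.text)
        for cluster in clusters:
            if jaccard(signature, shingles(cluster[0].text)) >= threshold:
                cluster.append(page)
                break
        else:
            clusters.append([page])
    return clusters


pages = [
    Page("http://copy.example.com/a", datetime(2012, 8, 5),
         "identifying the original page is a duty of the search engine"),
    Page("http://other.example.com/b", datetime(2012, 8, 3),
         "a completely different article about cooking noodles at home"),
    Page("http://origin.example.com/a", datetime(2012, 8, 1),
         "identifying the original page is a duty of the search engine "
         "not a gift to webmasters"),
]
clusters = group_near_duplicates(pages)
for cluster in clusters:
    print("candidate original:", cluster[0].url,
          "| duplicates:", [p.url for p in cluster[1:]])
```

Note that the truncated copy still lands in the same cluster as the full article, which is the excerpting case this article complains about. Time alone is a weak signal, which is why the signals listed earlier (copyright notices, breadth of links, weight of linking pages) would have to be combined with it.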

By contrast, Google's better performance in identifying the address of first publication proves that Baidu does not care about the original first-publication page. It cares only about having more original content, and it lacks due copyright awareness. In my view, this is the most important reason why Baidu's algorithm for identifying original sites has lagged behind for so long: "It is not that it cannot, but that it will not."

No wonder the remark by the well-known domestic SEO practitioner Wang Tong, "Baidu's 628 (June 28) adjustment is an attack on original websites," has struck a chord with many webmasters and SEO practitioners.

In fact, if the technology for identifying the original first-publication page can be improved, it will greatly strengthen a search engine's anti-cheating ability, directly frustrate the schemes of SEO practitioners who deceive search engines in various ways for profit, and give confidence to those who are seriously committed to high-quality original content.

Only when Baidu respects, through concrete action, the labor of the many owners of small and medium-sized original sites, and encourages them to keep applying their wisdom and ingenuity to original work, can it guide more of the webmasters and SEO practitioners who spend their days on "scraped pseudo-original" content to focus instead on what "best reflects the core value of the site." For Baidu this step is very difficult, but it is a major move that will benefit the future development of the search engine.

Baidu Web Search must also be reminded that solving the "original content problem" raised by webmasters, as soon as possible and with a more reasonable algorithm, is not a gift from Baidu to the many grassroots webmasters (well-known sites pay Baidu no heed at all, and Taobao blocks Baidu outright). It is a "basic obligation" that the current Copyright Law and other relevant laws require Baidu to fulfill. Baidu should not think too highly of itself.

Which way to go is the search engine's own choice; the road lies at its feet. (This article was first published by Gouyn12; copyright, Wenzing. When reprinting, please credit the source with a link to the original article: http://www.gouyn12.com/cnnet/327.html)


