Can Baidu effectively distinguish the pseudo-originality of secondary sorting?

Source: Internet
Author: User

High-quality original articles have always been the killer of improving the website ranking index. The quality of original articles determines the popularity of webpages/websites at the user level, for seo/seo.html such as Baidu/Google "target =" _ blank "> search engines, a high-quality original article will give priority to display opportunities, which is fair, before the absence of "pseudo-Original", reprinting of articles was the main channel for Internet communication. In the face of a large number of reposts, how did Baidu distinguish the first publisher of original articles?

According to LEE of the Baidu webmaster platform, Baidu's original spark plan is being implemented. Baidu's original recognition system can quickly achieve repeated aggregation of webpages and Link pointing relationship analysis. Aggregated collection and originality based on content similarity, and similar webpages are aggregated together as candidate sets for original recognition; identify and determine the original webpage by hundreds of factors such as author, release time, link direction, user comment, author and Site Historical originality, and forwarding track. Finally, the value analysis system is used to determine the value of the original content and then appropriately guide the final sorting.

Baidu still cannot effectively distinguish pseudo-original articles
What is pseudo-original?
1/use software to collect articles from other websites. The article published with simple replacement of synonyms is the original pseudo-original article. this type of pseudo-original Baidu may use the Chinese word segmentation technology to find out the cloud computing matching technology.
2/articles that have been processed twice and reorganized, such pseudo-original articles are hard to be identified or cannot be identified at all.

The Baidu "origin" algorithm aims to find the source of the initial release. distinguishing originality and pseudo originality is still a problem that Baidu needs to solve, and I believe this problem is hard to solve in the short term, the indexing volume and traffic are the culprit of a large number of repeated/similar articles.

Pseudo-originality is a challenge for Baidu, and it also challenges the nerves of webmasters who rely on various collection and release software to survive. baidu may be able to differentiate the original pseudo-original replacement of simple synonyms, but Baidu cannot solve this problem at least within a certain period of time for the second-hand articles to be reorganized.

Note that the "Luoluo 2.0 algorithm" is specific to the soft texts of out-of-band links, because most of these soft texts contain a certain direction of the anchor, and the pseudo-original may have no connection, the purpose is to attract search engines. Some articles are automatically produced by software and have no value for reading.

Therefore, do not post too many external links/URLs. Appropriate intra-site connections can effectively increase the weight of the article.

Baidu's original spark program, Baidu official explanation:

We have been committed to the identification and sorting algorithm adjustment of original content. However, in the current Internet environment, the rapid identification of original content is indeed facing a great challenge, and the computing data is large, the collection methods are endless, the website construction methods and templates of different sites are very different, and the content extraction is complicated. These factors will affect the identification of original algorithms, and even lead to judgment errors. At this time, Baidu and the webmaster need to work together to maintain the ecological environment of the Internet. The webmaster recommends the original content, and the search engine gives preferential treatment to the original content through certain judgments, so as to jointly promote the improvement of the ecosystem and encourage originality, this is the original spark program ".

At the same time, LEE said that Baidu's original "origin" algorithm has made some progress through experiments and real online data, in Phase I, the high-quality original content of some key original news sites is marked and presented by the author in Baidu search results, and the sorting and traffic have also been reasonably improved. At present, the invitation mechanism is mainly used. Currently, only websites with tens of millions of traffic are invited. For example, large news websites such as sina and international online. Explanation of high-quality original articles

The following are the best quality original resources:

This website is the first, non-plagiarism and imitation, content and form all have unique resources;
This website is the first resource with social consensus value and complies with relevant national regulations;
Reposted and simply processed content is not in this scope;


For small and medium/micro websites. baidu gradually opened real-time push on the Baidu webmaster platform to ensure the first registration of original articles on these websites. "real-time push ping" does not ensure that the webpage is "received in seconds ", it is the release of original notifications to Baidu

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.