What beginners must know: how copied content affects SEO

Source: Internet
Author: User
Keywords: Copy, SEO


SEO is inseparable from content. Without content, pages are not indexed, and without indexing there are no rankings. Content is a headache for many webmasters and SEOs, and in some ways it is harder than building external links, especially when the industry is outside your expertise and you have to write copy for a field you do not know. Content can drive an SEO crazy, and it makes me feel that doing SEO really hurts...

Novice webmasters and rookie SEOs may wonder whether content is really that hard. There are so many excellent articles online, so why not just reprint them? This is the most direct and most obvious shortcut, but the result is a site full of copied content with nothing new. For search engines, a new site that starts out with a lot of such content has basically announced the failure of its SEO work. What impact duplicate content has on SEO, how it arises, and how to avoid it are the topics of this article. I hope it serves as a warning to beginners.

What is duplicate content?

Copied content is also called duplicate content. In theory it is defined as two or more URLs whose content is identical or highly similar. Such URLs may be on the same website, but more often they are on different websites. A brief introduction to the causes of duplicate content is useful so that SEOs know how to avoid it in their work:

1. Technical reasons. The site's URLs are not normalized. Non-normalized URLs cause a site to produce a large amount of duplicate content, with almost every piece of content existing in two or more copies.

2. Corporate sites and product sites. This was most obvious when Chengdu Red Land Studio was doing SEO consulting: visitors in different regions saw different URLs, yet the service content was almost identical apart from the price. On product sites, agents and retailers often reprint the manufacturer's product information intact. Leaving copyright aside, there is nothing wrong with that. The problem is that most agents and retailers copy it directly and change little apart from the contact details, so these sites are filled with highly repetitive content, which has a great impact on SEO.

3. Site structure. Many product sites sort their listing pages by price, upload time, interval, comments, and other factors, so the same products appear under different URLs, producing at least three or four duplicate pages. On blogs, the obvious case is date and category archives, which create multiple versions of a page and a large number of duplicates across the site.

4. News sites. A friend of mine used to run a news site and generated the content he needed directly from RSS feeds. He was proud of this, because with little effort he got a full range of news. However, this news had already appeared hundreds of times at the original source and on other sites. In the end the site was barely indexed and was abandoned.

5. Pages with little content. A site has a large amount of content that is common to all pages, such as advertising, copyright notices, description text, and top and bottom navigation bars. If the main text of a page is too short, the pages look highly repetitive to search engines.

6. Reprinting and plagiarism. This is probably the main cause of the mass duplication of content in the SEO industry today: all kinds of reprinting and plagiarism, as well as mirror sites, content scraping, and so on. This needs no explanation. I only hope that people in the industry develop a sense of copyright.

7. HTTP status code problems. For example, on a forum built with PHPWind, a specific post has the URL http://www.seo147.com/read.php?tid=137. Without technical handling, if the number after tid is changed to an arbitrary value in the tens of thousands, such as 100000, the server still returns a 200 status code with the same content as the original tid=137, which can cause an alarming amount of in-site duplication. Beginners must pay attention to this.
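The status code problem in item 7 can be illustrated with a small sketch. It is not PHPWind's actual code. The THREADS dictionary and the read_thread function are hypothetical stand-ins for the forum's database and its read.php handler, and the only point is that an unknown tid should produce a 404 rather than a 200 with some other thread's content.

```python
from http import HTTPStatus
from typing import Tuple

# Hypothetical in-memory store standing in for the forum's thread table.
THREADS = {137: "Original post body for thread 137"}


def read_thread(tid: int) -> Tuple[int, str]:
    """Return (status_code, body) for a request such as read.php?tid=<tid>."""
    body = THREADS.get(tid)
    if body is None:
        # Unknown thread: answer 404 so the URL is not indexed as a
        # duplicate of an existing page.
        return HTTPStatus.NOT_FOUND.value, "Thread not found"
    return HTTPStatus.OK.value, body


if __name__ == "__main__":
    print(read_thread(137))
    print(read_thread(100000))
```

With this kind of check in place, a request for tid=100000 no longer returns a copy of an existing post.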

With so many causes of duplicate content, how do we check whether the content we have already exists in copies elsewhere? It is simple: take a passage from the beginning of the text, wrap it in double quotes, and search for it on Baidu. The search results make duplicates of the article easy to spot. For example, I wrote an article on A5 titled "A junior high school graduate's SEO entrepreneurship experience, to inspire rookies who are still wandering". That title returned no hits before it was published on A5, and now a Google search finds nearly 1000 pages. Unfortunately, most of the reprints do not respect copyright, and some even replaced the author's name...
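When you already have two texts in hand, "identical or highly similar" can also be estimated locally. The sketch below is one common approach (word shingles compared with Jaccard similarity), offered as an illustration only. It is not a description of how Baidu or Google detect duplicates, and the function names and the shingle size of 3 are assumptions.

```python
import re
from typing import Set, Tuple


def shingles(text: str, size: int = 3) -> Set[Tuple[str, ...]]:
    """Split text into overlapping runs of `size` lowercase words."""
    words = re.findall(r"\w+", text.lower())
    if len(words) < size:
        # Very short text: treat the whole text as a single shingle.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + size]) for i in range(len(words) - size + 1)}


def similarity(a: str, b: str, size: int = 3) -> float:
    """Jaccard similarity of the two shingle sets, between 0.0 and 1.0."""
    sa, sb = shingles(a, size), shingles(b, size)
    union = sa | sb
    if not union:
        return 0.0  # both texts are empty, nothing to compare
    return len(sa & sb) / len(union)


if __name__ == "__main__":
    original = "a b c d e"
    reprint = "a b c d f"
    print(similarity(original, reprint))
```

A score near 1.0 suggests that one text is a copy or light edit of the other. The threshold for "highly similar" is a judgment call.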

What harm does duplicate content do?

This is what I most want to tell novice webmasters and rookie SEOs, because I worry that you will reprint in bulk at the start for the sake of convenience and speed. Read the following explanation carefully, and you will not want your newly launched site to be filled with reprints.

First, a misunderstanding needs to be cleared up. Many newcomers may believe that a site full of copied content will be penalized by search engines. The correct understanding is that search engines do not penalize a site merely because it has some duplicate content. What they do is identify the most original source among the several versions and give it the ranking. The copies tend to rank lower, or are dropped over time and end up with no ranking at all. Keep in mind, however, that this does not apply to giant, high-authority sites.

At present, search engines still have a fairly high error rate in judging which version is the original, which leaves many site owners feeling helpless. Original content they worked hard on is plagiarized and posted to other sites, where it is indexed almost instantly, while the original page on their own site looks as if it has been penalized, because the search engine failed to identify the original and gave it no credit.

Another harm of duplicate content appears when duplication within the site is serious. The webmaster's optimization and link-building efforts are easily spread across several versions, and the URL the search engine considers most suitable may differ from the one you have in mind, which is wasted cost. A site filled with repeated content disperses its weight and creates unnecessary internal competition. Externally it loses its best ranking opportunities, and the duplicates crowd out the chance for other pages to be indexed.

If more than 70% of a site's content duplicates other sites, it will attract the search engine's attention. The search engine will doubt the quality of the site, which leads to penalties. The effect first shows in snapshots and indexing, then keyword rankings drop and already indexed pages are deleted, and in the end the site is very likely to be removed from the index entirely. I wrote an article on A5, "Talking about site scraping and pseudo-original content: a road of no return", and its click rate was very high, so I can assert that webmasters know in their hearts that duplicate content is harmful, but still feel "helpless."

The next thing to consider is how to eliminate duplicate content.

Webmasters and novice SEOs can correct and avoid the causes of duplicate content listed above. For example, the URL normalization problem can be solved by technical means. For duplicate content that is not caused by URL normalization, you can 301-redirect duplicate pages that are already indexed, or use wildcards in the robots.txt file to ensure that only one version is indexed. You can also add the meta robots noindex tag to specific pages that you do not want indexed, and add nofollow to links that point to duplicate pages you do not want indexed.
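As a minimal sketch of URL normalization combined with a 301 redirect, the code below maps the common variants of a URL onto one preferred form and reports where a duplicate URL should redirect. The host www.example.com, the list of default documents, and both function names are assumptions made for illustration. A real site would usually configure this in the web server rather than in application code.

```python
from typing import Optional
from urllib.parse import urlsplit, urlunsplit

PREFERRED_HOST = "www.example.com"          # hypothetical preferred host
DEFAULT_DOCS = ("index.html", "index.php")  # hypothetical default documents


def normalize(url: str) -> str:
    """Map variants of a URL onto the single preferred form."""
    parts = urlsplit(url)
    path = parts.path or "/"
    for doc in DEFAULT_DOCS:
        if path.endswith("/" + doc):
            # /news/index.php and /news/ serve the same page: keep /news/
            path = path[: -len(doc)]
            break
    # Assumption: the site is served over http on one host only.
    return urlunsplit(("http", PREFERRED_HOST, path, parts.query, ""))


def redirect_for(url: str) -> Optional[str]:
    """Return the Location for a 301 redirect, or None if no redirect is needed."""
    target = normalize(url)
    return None if target == url else target


if __name__ == "__main__":
    print(redirect_for("http://example.com/index.html"))
    print(redirect_for("http://www.example.com/"))
```

Every variant then answers with a 301 pointing at the same address, so only one version remains for search engines to index.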

There is a better way to solve the replication content of the product station. This method is I in Zac predecessors seo actual password among the learned, that is the use of canonical tags. For example, a clothing site, the same style of clothing may have different sizes, the difference between the size of the color, so that the same size of clothing will be due to the color of multiple URLs, content is almost exactly the same. This time if you make canonical, the user in the browser to get the page will not turn to see the page, although it will be different, but the search engine will be weighted to a size, so as a whole to avoid duplication of content. Regrettably, however, Baidu does not seem to support the label.
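The canonical tag for the clothing example might be generated as in the sketch below. The color query parameter, the example.com URLs, and the function names are hypothetical. The sketch assumes that color is the only parameter that creates duplicate pages.

```python
from html import escape
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical: query parameters that change the URL but barely change the page.
VARIANT_PARAMS = {"color"}


def canonical_url(url: str) -> str:
    """Drop variant parameters so every color variant maps to one URL."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key not in VARIANT_PARAMS
    ]
    return urlunsplit((parts.scheme, parts.netloc, parts.path, urlencode(kept), ""))


def canonical_tag(url: str) -> str:
    """Build the <link> element to place in the <head> of the variant page."""
    return '<link rel="canonical" href="%s" />' % escape(canonical_url(url), quote=True)


if __name__ == "__main__":
    print(canonical_tag("http://www.example.com/shirt?size=m&color=red"))
    print(canonical_tag("http://www.example.com/shirt?size=m&color=blue"))
```

Both color variants produce the same tag, so a search engine that honors it treats them as one page while visitors still see the variant they asked for.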

Duplicate content across sites is actually very troublesome to solve, because you can control your own site but not other sites on the internet. As SEOs, we can only do the following two things:

1. Add a copyright notice to your original content, requiring that reprints link to the original page. The original version should then have more external links than the reprints, and for current search engine technology this is the most important signal for judging originality!

2. Stick to original content. Persisting with original content brings great benefits: as long as you keep it up for a certain time, the site's weight is bound to increase, and a site with good, unique content leaves a deep impression on search engines, which greatly increases the chance that its content is judged to be the original.

If SEOs and webmasters still cannot solve the duplication problem with the methods above, and even find a large number of sites copying their content so that the original content loses out entirely, they can also contact the other party on their own initiative through various means, such as communicating directly, complaining to the hosting provider, or even complaining to the search engines. This is indeed a thorny issue in the Chinese internet environment, where copyright is given little weight. In any case, I sincerely hope that our industry becomes more and more standardized. Do you know enough about duplicate content now? This article was contributed by the SEO consultant of Sichuan Art College Entrance Examination Training (http://www.htdart.com). Webmasters who reprint it are asked to keep the link. Thank you very much.
