A comprehensive explanation of issues related to "web page similarity"

Source: Internet
Author: User
Tags aliyun change compared content copy google + high host

Intermediary trading http://www.aliyun.com/zixun/aggregation/6858.html"> SEO diagnostic Taobao customer hosting technology hall

Webmasters in the construction site, sometimes encountered a problem, that is the issue of web page similarity. What is the page similarity? The so-called web page similarity, that is simply two pages Similarities between the two pages can be the same website on the web, it can not be the same website on the page, but also on other sites on the web.Search engine in the web page, usually two pages to be compared To see if the two pages are similar, the so-called similar, that is, two pages in most of the page content are the same, then you can think of two pages are similar.Search engine in the comparison of two pages, Is to use a certain algorithm to compare, search engines usually use two methods to compare: one is based on the page summary to compare, if multiple web page digests md5 value, prove that these pages have a high degree of similarity. The other is based on keywords that appear on the page, according to the word frequency, you can take N words high frequency, if its md5 value, you can recognize These pages have a high degree of similarity.The Google search engine to the webpage similarity setting is 60%, which means that if the similarity of two pages more than 60%, then the webpage is no longer included, if similar The degree is close to 60%, then the web pages to be compared may also be included, but the search engine gives the weight is relatively low.This is the attitude of the search engine to the web page similarity.Thus, web page similarity on our website The main thing is whether the web pages to be compared can be included, the other is basically no effect.

There are two main reasons causing web page similarity problems: ① In the same web site, copying the old web page into a new web page resulted in relatively few changes in the title, key words, description information, and content of the web page. As a result, The search engine determines that the similarity is high. ② between different sites, the original has been included in the contents of the page to bring, and make a slight change or pseudo-original, resulting in less change inside the content, or just paragraph adjustment, the content is not how the changes, which Similar with the copy, the search engine is judged as high similarity. Search engines to determine the similarity in the web when it is very smart, not what we imagine a simple comparison from start to finish, but intelligent analysis and comparison, we do not take this chance, think Copy the content of other web pages over, a simple change you can get away.

There is a problem, we need to make corrections, next time to avoid the same mistake again. For the web page similarity problem, we know the reason, we can prescribe the right medicine. The most effective way to solve the similarity of web pages is to make your web page truly original. If the content of your web page is original, as long as the content is of high quality, it will not be recorded because of the similarity of web pages. Can be included. If you do not have too much time as a webmaster to write original articles, or limited to the level of problems can not write high-quality original articles, then you can also be pseudo-original, but we suggest that you in order to avoid the web page similarity issues, you Need to make substantial changes to the original article, the extent of the amendment must be at least 50%, so that it may be included in the search engine. In addition, we suggest that webmasters, in doing the page, in order to save time, if you need to copy the original page, then we also suggest that you page title, keywords, description information and content also make substantial changes, or you The website is difficult to be included. In the revised time can use some different code to replace the original code, such as the use of iFrame framework to replace the previous part of the content and so on. The author engaged in the field of website construction done for a long time, met because of web page similarity problem is not included in the case of a few, Google webmaster management platform also has an html document tool, if the two pages of the title and description information Almost, webmaster tool will prompt you which two pages of high similarity, then we can modify it. Advice webmasters a lot into search engine management platform, using the features provided inside, and sometimes good for website construction.

This article from Yunnan Pacific Science and Technology Information Project (http://www.ynynyn.com/), respect for the author's labor results, please indicate the source reproduced.

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.