The signal-to-noise ratio of a Web page refers to the ratio of text content to all HTML code on a webpage, which is also the basic knowledge of SEO that we must understand. From the principle of the search engine, its crawl system is the first to download the entire Web page, and then the contents of the text extracted, after analysis to remove the HTML format, clear noise, and then participle, and finally deposited into the index library. In this process, search engine will also go through the process of denoising, we can clearly know that the higher the signal-to-noise ratio, the search engine spiders crawl more efficient, search the spider every day to deal with a lot of documents, how to quickly extract the subject information of the Web page is an important task.
In fact, it's not just the ratio of all the text to the code, but also the ratio of useful and unwanted information in the current page's textual content. What is useful information, such as the theme of my article is the Web page signal-to-noise ratio, the entire article has 1000 words, and the current page all the text content has 2000 text, and other text is with the signal-to-noise ratio, which is irrelevant information is noise. Therefore, improve the Web page Signal-to-noise score of two aspects: including optimizing the code and optimize the content.
First, the noise removal code
We know that the first step in search engine denoising is to clear the HTML format, so the first step to improve the signal-to-noise ratio of the Web page is to optimize the HTML code. Why we often say that the Web page code to meet the standards of the international standard, code to be concise, to use div+css, in fact, are based on this principle. In fact, a lot of friends just see the article on the Internet to write code, but do not know why to do so, this is what I suggest you first learn the reason for SEO principle (I know, practice is greater than theory, but if the theory is not, how to practice, not a starting point). The noise removal code includes the following:
To reduce the use of JS, you must use the JS code for encapsulation.
Encapsulate the CSS code.
Reduce the div layer nesting (many friends do not know the principle, blindly pursue div+css, but also produce a lot of redundant code. )
Reduce the use of pictures and flash
Ii. Removal of noise content
Similarly, the search engine extracts the content of the Web page, but also to analyze two times to noise, that is, to determine the current page theme. So in this process, how can we let search engines more accurately judge our web page theme (which is the problem of relevance), how to improve the relevance of the page? Then it is to reduce the page noise content.
We are very common is a number of E-commerce site Product detail page, may be part of the E-commerce site SEO personnel did not pay attention to the content of the product introduced below some information on the distribution method or help the description of the content, the existence of improved product page of the similarity, but also reduce the signal-to-noise ratio. This information from the user experience point of view is friendly, is to enhance the trust of the site, but from the point of view of the search engine is a certain disadvantage, so we can use the IFRAME or JS to encapsulate the call, so that both sides take care of. Mainly include the following aspects:
Repeating content for encapsulation calls
Export an unnecessary list of links for encapsulation calls
Streamline copyright information
Increase the text length of related content
The method is above several, how to realize also must see oneself to the technology understanding or grasps. Although we know that the search engine in the index preprocessing phase will be on the Web page denoising, but if we do a good job of the Web page signal-to-noise ratio, on the one hand, reduce the workload of the search engine, so as to enhance its index on our site to capture the efficiency of the search engine to improve the accuracy of judgment Well, it's conceivable that our pages are more trusted.
Author: Shanhan Starting Shanhan SEO blog (www.xiaohan86.com), reproduced please retain the Copyright information: http://www.xiaohan86.com/2011061188.html