How should the content of the website be solved?

Source: Internet
Author: User

Intermediary transaction http://www.aliyun.com/zixun/aggregation/6858.html ">seo diagnose Taobao guest cloud host technology Hall

Many novice webmaster in doing the site when ignoring the important step of SEO optimization, that is the principle of search engines, The working principle of search engine is divided into so several steps: The first step is to crawl → The second is to build a library → put into the database to sort → Baidu through the analysis of user needs to users most need to display the results before the user. If the site needs to be searched, then you must know that your station exists, crawl, filter, currently 4 million websites in China, the data is calculated in billion, Baidu is not all the pages are crawled. Of course, Baidu believes that the value of the index will be established, we often say that included, the premise is to know the existence of links.

So how to make the site content by search engines to crawl better and have a good collection? Here are two ways: 1. Proactively submit sitemap site map to webmaster platform; 2. Passive crawl. Hair outside the chain of spiders, a lot of people hair is the homepage of the link, this site weights and rankings are very influential, this point we must pay attention!

Proactive submission and passive crawl which is good? In fact, there is no difference between the two. First of all to know why your station does not crawl.

First, the analysis of the domain name has been punished before. If you are punished, it will take 4-6 weeks to check the period, this time period if the site normal operation, there will be no problem. If Baidu even know the existence of links, will not crawl. To do a domain name survey, the first domain name in Baidu or Google check, to see if this domain name has been used, there may be other people used the domain name, did not continue to renew, that the domain name before the violation of the operation.

Second, spiders can't come at all. Spiders visit this domain name when DNS resolution, domain name to IP, to find the IP server to visit, if the DNS did the hands and feet, or space business to tamper with, resulting in spiders can not catch. The space Trader Shields the spider, which creates pressure. Once a friend asked Chongqing SEO Zengxiaorong, heard that his site 20 days or more than 10 days new station began to collect, let me help him analyze why, I told him can go to verify Baidu Webmaster platform, and then will receive Baidu Webmaster platform information reminders, search Engine v. Crawl site, the site of the search engine to ban the whole station. This time need to change space, he changed space immediately after can be included. (SEO update Technology Group →_→ 138426856)

Baidu Webmaster platform inside pressure feedback, capture pressure that search engine in the unit time to a Web server access frequency and total number of times. If 0, prove not to go. The pressure value of 716, this value can only prove the search engine to go, but to which pages do not know. If you go to the homepage, or you want to be included in the page did not go, did not visit the inside page, you see this value is no use, that how to see whether to crawl it?

Server log

See what pages spiders visit. If there is a stand-alone server, VPS can do their own (light years log analysis is a very good tool), but also to judge the true and false spiders, because Baidu spider is not necessarily true, webmaster tools on the site when the query, will also produce false spiders.

1. Look at the log, need to judge the true and false spiders, some spiders are not true, some people simulate spiders on the site collection, this time will produce false spiders.

2. If the site is a dynamic program, the site is set to pseudo static, logging path is dynamic, will not record pseudo static, if it is pure static can see directly. Because it is difficult to parse the path for pseudo static.

Problem Analysis:

1, the permission set some is needs the member to be possible to enter, therefore the spider cannot enter. Robots have shielded these paths and can't crawl them.

2, the structure problem crawl difficulty. If a site's structure is very complex, messy, it is possible to crawl the search engine crawled to give up crawling, this for any seoer should be noted. This article "How to optimize the site to the top page ranking" on the analysis of the site structure and path optimization methods.

3, credit if found too much spam, crawling back to the page filter, sorting, and then filter, build index. It eliminates empty pages and meaningless pages. The entire page is a Flash landing Page registration page or product page is a picture, then some are empty pages, meaningless, such is not necessary to be included.

Judging page value Score

To achieve the standard, itself scored high and low, here is divided into two points:

1, these depend on the weight of the site itself high and low. High weight, included very easy, the standard also reduced a lot.

2, the quality of the page score content is original, or copy, is not a rare article, the site's customers useful. To know Baidu launched the Spark program to encourage the original and original will add points, and will have a good ranking. If you can't write the original, you can add additional content and value to the original content, which is also a good article.

3. Whether there are external links within the page. External links can be voted on the page, the better to achieve the inclusion standard.

If you want to write your own description to be caught, want to do the keyword as far as possible in the description.

In addition to the above, but also to consider the nature of the link, the requirements of the page is what? related needs and issues, small title to be attractive, to allow customers to see the reason, to attract users to quickly locate what he wants, the level of clarity.

Summary: To solve the content included in the problem, first see whether the domain name is punished, space business has no shielding spiders, often check the server log, to identify the true and false spiders, inside the page score to do a good job inside the page outside the chain.

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.