Six reasons search engines have difficulty indexing your Web site

Source: Internet
Author: User


For every webmaster, the most critical requirement is that search engines can index the site normally. If the site cannot be indexed, everything else is wishful thinking.

Why do indexing difficulties arise? If a site is built with errors, search engines will not be able to index it. A search engine robot is a very simple program: it has no ability to understand a site and judges its quality only against a set of fixed standards (for specific tutorials, see www.hngwyw.com). Several common causes of indexing difficulties are as follows:

Reason one: Session IDs are used in your URLs

Many search engines do not index Web pages whose URLs contain session IDs, because the same page then appears under many different URLs and looks like duplicate content. If possible, keep session IDs out of your URLs and store the session ID in a cookie instead.
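As a rough illustration of keeping session IDs out of URLs, the sketch below strips session parameters from a link before it is written into a page. The parameter names in SESSION_PARAMS and the function name strip_session_id are assumptions chosen for this example, not part of any particular framework.

```python
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode

# Assumed names of query parameters that carry a session ID.
SESSION_PARAMS = {"sid", "sessionid", "phpsessid", "jsessionid"}

def strip_session_id(url: str) -> str:
    """Return the URL with any session-ID query parameters removed."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in SESSION_PARAMS
    ]
    return urlunsplit(
        (parts.scheme, parts.netloc, parts.path, urlencode(kept), parts.fragment)
    )

print(strip_session_id("http://example.com/item?id=7&PHPSESSID=abc123"))
# prints: http://example.com/item?id=7
```

With the session ID removed from the link, every visitor and every robot sees the same URL for the same page.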

Reason two: Your Web pages contain too much redundant code

Web pages contain JavaScript code, CSS code, and other script code. This code is not directly related to the content, and it makes the actual content of the page harder for the robot to find, so pages heavy with such code are often difficult to get indexed.

Reason three: The Web site contains too many dynamic URLs

A site with too many dynamic URLs may be difficult for search engine robots to crawl. Those familiar with site construction may immediately think of dynamic pages, and that is indeed the problem: dynamically generated pages (including ASP and PHP pages) may not get indexed, and if a URL contains too many variables, search engine robots may ignore the page. The workaround is to use static pages.
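To make the idea of "too many variables" concrete, here is a minimal sketch that counts the query variables in a URL and shows what a static-looking equivalent might be. The helper names and the path layout are hypothetical; a real site would also need a matching rewrite rule on the server, which is not shown.

```python
import posixpath
from urllib.parse import urlsplit, parse_qsl, quote

def count_query_params(url: str) -> int:
    """Count the variables in the query string of a URL."""
    return len(parse_qsl(urlsplit(url).query, keep_blank_values=True))

def to_static_path(url: str) -> str:
    """Map /product.php?cat=shoes&id=42 to /product/shoes/42.html.

    Illustrative only: the layout is an assumption, not a standard.
    """
    parts = urlsplit(url)
    base, _extension = posixpath.splitext(parts.path)
    values = [quote(value, safe="") for _key, value in parse_qsl(parts.query)]
    return "/".join([base] + values) + ".html"

url = "http://example.com/product.php?cat=shoes&id=42"
print(count_query_params(url))  # prints: 2
print(to_static_path(url))      # prints: /product/shoes/42.html
```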

Reason four: The Web site went online before it was properly finished (for example, with many dead links)

This is easy to overlook. Before your site goes online, remove any useless dead links; do not keep them just because they make the site look good.
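A pre-launch check for dead links could look something like the sketch below. It works on an in-memory mapping of site-relative paths to HTML (an assumption made so the example needs no network access) and reports internal links whose target page does not exist. The names LinkCollector and find_dead_links are made up for this example.

```python
from html.parser import HTMLParser

class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

def find_dead_links(pages):
    """Return (page, href) pairs for internal links that point nowhere.

    `pages` maps a site-relative path such as "/about" to its HTML.
    Only links starting with "/" are checked.
    """
    dead = []
    for path, html in pages.items():
        collector = LinkCollector()
        collector.feed(html)
        for href in collector.links:
            if href.startswith("/") and href not in pages:
                dead.append((path, href))
    return dead

site = {
    "/": '<a href="/about">About</a> <a href="/old-promo">Promo</a>',
    "/about": '<a href="/">Home</a>',
}
print(find_dead_links(site))  # prints: [('/', '/old-promo')]
```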

Reason five: The site's robots.txt file is corrupted or contains errors

If the robots.txt file is corrupted or was written with an error (for example, a typo), search engine robots may misinterpret it and ignore your Web pages completely. The solution is to double-check your robots.txt file and make sure its directives are correct.
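One way to double-check a robots.txt file is to run it through a parser and see which pages it actually blocks. The sketch below uses Python's standard urllib.robotparser module; the sample rules, the page list, and the helper name blocked_paths are assumptions for illustration.

```python
from urllib.robotparser import RobotFileParser

def blocked_paths(robots_txt, paths, agent="*"):
    """Return the paths that the given robots.txt text forbids for `agent`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [path for path in paths if not parser.can_fetch(agent, path)]

# Hypothetical mistake: "Disallow: /" blocks the whole site,
# when the intent was to block only the /admin/ directory.
broken = "User-agent: *\nDisallow: /\n"
fixed = "User-agent: *\nDisallow: /admin/\n"
pages = ["/", "/articles/seo.html", "/admin/login"]

print(blocked_paths(broken, pages))  # prints: ['/', '/articles/seo.html', '/admin/login']
print(blocked_paths(fixed, pages))   # prints: ['/admin/login']
```

Different search engines may interpret edge cases differently, so a check like this is a sanity test rather than a guarantee.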

Reason six: Site navigation problems

Most search engine robots cannot parse JavaScript or DHTML menus, and Flash and Ajax menus are even worse.

As mentioned above, a search engine robot is a very simple program that follows HTML links; if those links are broken, indexing becomes difficult. The analysis above is only a general overview and is far from complete; additions from readers are welcome.
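The difference between a plain HTML menu and a script-driven one can be seen with a small audit like the sketch below. It treats a robot as something that only follows ordinary href links, which is a simplifying assumption, and the menu markup and the NavAudit class are invented for this example.

```python
from html.parser import HTMLParser

class NavAudit(HTMLParser):
    """Separate plain <a href> links from script-only navigation items."""

    def __init__(self):
        super().__init__()
        self.crawlable = []   # hrefs a link-following robot can use
        self.script_only = 0  # items that navigate only through onclick

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        href = attrs.get("href") or ""
        if tag == "a" and href and not href.startswith(("javascript:", "#")):
            self.crawlable.append(href)
        elif "onclick" in attrs:
            self.script_only += 1

menu = """
<ul>
  <li><a href="/products.html">Products</a></li>
  <li><span onclick="go('contact')">Contact</span></li>
  <li><a href="javascript:void(0)" onclick="go('news')">News</a></li>
</ul>
"""

audit = NavAudit()
audit.feed(menu)
print(audit.crawlable)    # prints: ['/products.html']
print(audit.script_only)  # prints: 2
```

Only the first menu item gives the robot a URL to follow; the other two depend on a script running.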

Reprints are welcome; please credit www.hngwyw.com.
