Website static page generation strategy 3: Anti-collection policy

Source: Internet
Author: User
Tags send cookies

Currently, there are many anti-collection methods. First, we will introduce the common anti-collection methods and their drawbacks and collection countermeasures:

1. determine the number of times an IP address accesses the website page within a certain period of time. If the access speed is significantly higher than that of normal users, access from this IP address is denied.

Disadvantages:
1. This method is only applicable to dynamic pages, such as asp, jsp, php, etc. .. static pages cannot determine the number of times a certain IP Address has accessed this site.
2. This method will seriously affect the indexing of search engine spider because the indexing speed of search engine spider is faster and multithreading. This method also rejects the search engine spider from indexing site files.
Collection countermeasure: The collection speed can only be slowed down or not collected
Suggestion: Create an IP address library for search engine spider, and only allow search engine spider to quickly browse site content. It is not easy to collect the IP address library of search engine spider. A search engine spider may not have only one fixed IP address.
Comment: This method is effective for data collection prevention, but may affect the search engine's indexing.

2. Use javascript to encrypt the content page

Disadvantage: This method is applicable to static pages, but it seriously affects the search engine's indexing status. The content received by the search engine is also encrypted.
Collection countermeasure: It is recommended that you do not collect it. If you need to collect it, you can also collect the JS script for password cracking.
Suggestion: no improvement suggestions currently
Comment: We recommend that you do not use this method for webmasters with search engine traffic.

3. Replace the specific mark on the content page with "specific mark + hidden copyright text"

Disadvantages: This method has minor drawbacks. It only increases the page file size a little bit, but is easy to perform reverse collection.
Collection countermeasure: replace or replace the collected copyrighted text containing the hidden copyrighted text with your own copyright.
Suggestion: no improvement suggestions currently
Comments: I feel that it is of little practical value. Even adding random hidden texts is also a perfect addition.

4. Users can only browse after login

Disadvantages: This method seriously affects search engine spider's indexing
Collection countermeasure: at present, some people have published countermeasure articles. For specific countermeasure, see ASP thief program how to use XMLHTTP to submit forms and send cookies or sessions.
Suggestion: no improvement suggestions currently
Comment: We recommend that you do not use this method for webmasters with search engine traffic. However, this method is somewhat effective in preventing general collection programs.

5. Paging using javascript and vbscript scripts

Disadvantages: Search Engine indexing is affected.
Collection countermeasure: Analyze javascript and vbscript scripts, find out their paging rules, and create a page set for this site.
Suggestion: no improvement suggestions currently
Comment: anyone who understands the scripting language can find out its paging rules.

6. You can only view the data through the link on this site, for example, Request. ServerVariables ("HTTP_REFERER ")

Disadvantages: Search Engine indexing is affected.
Collection countermeasure: I don't know if I can simulate the webpage source .... Currently, I have no collection countermeasures for this method.
Suggestion: no improvement suggestions currently
Comment: We recommend that you do not use this method for webmasters with search engine traffic. However, this method is somewhat effective in preventing general collection programs.

As can be seen from the above, the commonly used anti-collection methods will either have a great impact on Search Engine indexing, or the Anti-collection effect is not good, and the anti-collection effect is not effective. Is there any effective method to prevent collection without affecting search engine indexing? So proceed!

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.