Ip address banned during crawling

Source: Internet
Author: User
This post was last edited by zzf.pdf at 22:36:33, January 24 ,.

Recently, I need to capture the content of a website. I used snoopy to capture the content. When I first found that an ip address was blocked, I switched the user-agent to google's spider according to the solution on the Internet, and forged an ip address with snoopy (a random ip address is changed for every capture) however, after capturing more than one hundred pages, it is still blocked by the ip address, and thus cannot be crawled. Is there any good solution?


Reply to discussion (solution)

The access is too frequent.

The access is too frequent. What should we do? sleep, but the amount of data to be captured is large. if sleep is used, there is not much time to capture

Snoopy is used to forge an ip address.

This is a fool of yours ..

Can I forge an ip address? This...

Take care of who you are.

Snoopy is used to forge an ip address.

This is a fool of yours.

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.