Crawl the site of the code implementation a lot, if considering the crawl to download a lot of content scrapy framework is undoubtedly a good tool. Scrapy = Search+pyton. The installation process is briefly listed below. PS: Be sure to download the Python version, or you will be reminded that Python is not found when you install it. We recommend that you install 32 bits because some versions of the software must be 64 bit hard to find. (My is XP system)
1. Install Python
After installation remember to configure the environment to add the Python directory and the scripts directory under the Python directory to the System environment variable path. Enter Python in cmd if the version information description is configured (see below). Python.
2. Installing lxml
lxml is a library written in Python that allows you to work with XML quickly and flexibly. Click here to select the corresponding Python version to install. Verify that the installation is successful, such as.
3. Installing Setuptools
To install the egg file, click here to download the corresponding version of Setuptools for python2.7.
4. Installing Zope.interface
You can use the third step download setuptools to install the egg file, now also has the EXE version, click here to download.
5. Installing twisted
Twisted is an event-driven network engine framework implemented in Python, click here to download.
6. Installing Pyopenssl
Pyopenssl is a python-OpenSSL interface, click here to download.
7. Installing Win32py
To provide Win32API, click here to download
8. Installing Scrapy
Finally the turn to install Scrapy, directly in the cmd input easy_install scrapy enter. Verification of success or not after installation on the cmd command line.
Installation is complete, start using it!
Python+scrapy Installation