1. First you need some basic knowledge of Python and related development environment, no relevant basis of the students recommend to go to NetEase cloud Mooc to watch learning related tutorials
2. What is a web crawler?
We use the Internet to enter the connection in the browser, and then the server will return to us the relevant information, and the web crawler is a continuous connection to obtain the desired information program. So then we have two main parts of learning content: 1. Access to the Network 2. Information acquisition and extraction.
3. Purpose and method of study
It is important to note that this series emphasizes practical application skills, so for the study of skill classes, the individual thinks
1. Need constant practice, practice will find the problem
2. The basis of theory needs a little understanding, do not overemphasize the integrity of the theory, such as the HTTPA protocol HTML files, concrete content has a basic concept, like a license you can not dismantle the engine every day. Of course, if you want to be a qualified old driver, then you can re-learn the relevant knowledge
Python crawler (i)