I need to fetch data from URLs such as:
http://map.baidu.com/?newmap=1&reqflag=pcmap&biz=1&qt=s&wd=1&c=131&tn=B_NORMAL_MAP&nn=0&ie=utf-8&l=12&b=%2812925648.97,4823379.72;12990672.97,4828435.72%29&t=1368604536591
Changing the parameters returns different data. I just need to save the results to a file or a database.
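A minimal Python sketch of generating such URLs by varying the parameters. The parameter names and values are copied from the sample URL above. The helper name build_url is made up for illustration, and the t parameter is left out on the assumption that it is only a request timestamp.

```python
from urllib.parse import urlencode, parse_qs, urlsplit

BASE_URL = "http://map.baidu.com/"

# Parameters copied from the sample URL in the question. Which of them the
# server actually requires is not known here.
DEFAULT_PARAMS = {
    "newmap": "1",
    "reqflag": "pcmap",
    "biz": "1",
    "qt": "s",
    "wd": "1",
    "c": "131",
    "tn": "B_NORMAL_MAP",
    "nn": "0",
    "ie": "utf-8",
    "l": "12",
    "b": "(12925648.97,4823379.72;12990672.97,4828435.72)",
}


def build_url(**overrides):
    """Return the request URL with some query parameters replaced."""
    params = dict(DEFAULT_PARAMS)  # copy, so the defaults stay untouched
    params.update(overrides)
    return BASE_URL + "?" + urlencode(params)  # urlencode percent-encodes values


if __name__ == "__main__":
    # One URL per value of "wd"; the values here are placeholders.
    for keyword in ["1", "2", "3"]:
        print(build_url(wd=keyword))
```

urlencode percent-encodes the commas and the semicolon in b as well, which decodes to the same value on the server side.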
Originally I planned to write it myself: build the list of links to download, download them in a loop, then process and store the results.
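That plan (list of links, download loop, storage) could look roughly like the following in Python. This is only a sketch: the table name pages, the function names, and the choice of SQLite are assumptions, and the processing step is left out, so the raw response body is stored as-is.

```python
import sqlite3
import urllib.request


def fetch(url, timeout=10):
    """Download one URL and return the response body as text."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")


def crawl(urls, conn, fetcher=fetch):
    """Download each URL in turn and store the raw body in SQLite.

    Returns the number of pages stored. Failed downloads are skipped.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS pages (url TEXT PRIMARY KEY, body TEXT)"
    )
    stored = 0
    for url in urls:
        try:
            body = fetcher(url)
        except OSError as exc:  # URLError and socket timeouts are OSError subclasses
            print(f"skipped {url}: {exc}")
            continue
        conn.execute(
            "INSERT OR REPLACE INTO pages (url, body) VALUES (?, ?)",
            (url, body),
        )
        stored += 1
    conn.commit()
    return stored


if __name__ == "__main__":
    # "pages.db" is a placeholder file name.
    connection = sqlite3.connect("pages.db")
    print(crawl(["http://map.baidu.com/?newmap=1&qt=s&wd=1&c=131"], connection))
    connection.close()
```

The fetcher argument is there so the loop can be tried without network access by passing in a stand-in function.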
On top of that, I could add proxy support and multithreading.
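For the proxy part, Python's standard library can route requests through a proxy with urllib.request.ProxyHandler. A small sketch, where the proxy address is a placeholder and make_opener is a made-up helper name:

```python
import urllib.request


def make_opener(proxy_url=None):
    """Build a URL opener, optionally sending traffic through a proxy."""
    handlers = []
    if proxy_url:
        # Use the same proxy for both http and https requests.
        handlers.append(
            urllib.request.ProxyHandler({"http": proxy_url, "https": proxy_url})
        )
    return urllib.request.build_opener(*handlers)


if __name__ == "__main__":
    # 127.0.0.1:8080 is a placeholder; substitute a real proxy address.
    opener = make_opener("http://127.0.0.1:8080")
    with opener.open("http://map.baidu.com/", timeout=10) as resp:
        print(resp.status)
```

Without an explicit proxy_url, build_opener falls back to its default handlers, which pick up proxy settings from the environment if any are set.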
But my boss wants me to find an open-source crawler tool.
He also asked me to send him a link to the tool and explain how I would use it to process the data.
So I have no choice but to ask here: which crawler tools offer this kind of functionality? Thank you.
I only know PHP and a little Python, so I would like to use a tool written in one of those two languages. Thanks again.
Reply to discussion (solution)
If you pick an open-source tool you will still have to modify it, which is a hassle. PHP's curl extension is enough for this, and it supports proxies.
In Python, the socket module works well, and for multithreading the threading module is very convenient.
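As a rough illustration of the threading suggestion, here is a small worker pool built on threading and queue. The names run_workers and handle are made up. handle stands for whatever downloads and processes one URL, and the thread count of 4 is arbitrary.

```python
import queue
import threading


def run_workers(urls, handle, num_threads=4):
    """Call handle(url) for every URL using a fixed number of threads.

    Returns a dict mapping each URL to the value handle returned, or to
    the exception it raised.
    """
    tasks = queue.Queue()
    for url in urls:
        tasks.put(url)

    results = {}
    lock = threading.Lock()

    def worker():
        while True:
            try:
                url = tasks.get_nowait()
            except queue.Empty:
                return  # no work left, so this thread exits
            try:
                value = handle(url)
            except Exception as exc:
                value = exc  # keep the error so the caller can inspect it
            with lock:
                results[url] = value

    threads = [threading.Thread(target=worker) for _ in range(num_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results


if __name__ == "__main__":
    # len is only a stand-in for a real download function.
    print(run_workers(["http://example.com/a", "http://example.com/b"], len))
```

Since downloading is mostly waiting on the network, threads help here even with Python's global interpreter lock.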
Just write your own~
Then tell him it is the best tool.