Magpierss in the use of Snoopy, which makes me a bit interested to study this boom. On the SF, the source code was found. Incredibly is a class, but do not laugh at Oh, the function is very powerful.
The official profile, I translated the next (Khan ... have been acting as a translator lately)
Snoopy is a PHP class that mimics the functionality of a Web browser, which accomplishes the task of getting web content and sending forms.
Here are some of its features:
1, easy to crawl the content of the Web page
2, easy to crawl the text of the page (remove the HTML code)
3, easy to crawl web links
4, support Agent Host
5, support the basic user/password Authentication mode
6. Support custom user Agent,referer,cookies and header content
7. Support the browser steering, and can control the steering depth
8, can expand the link in the Web page into a high-quality URL (default)
9, convenient to submit data and get the return value
10. Support for tracking HTML frame (v0.92 added)
11. Pass cookies when supporting the re-turn
Here are some simple examples, such as the words we crawl my blog
Include "Snoopy.class.php";
$snoopy = new Snoopy;
$snoopy->fetchtext ("Http://www.phpobject.net/blog");
Echo $snoopy->results;
?>
^_^, good to be, in the like crawl links
Include "Snoopy.class.php";
$snoopy = new Snoopy;
$snoopy->fetchlinks ("Http://www.phpobject.net/blog");
Print_r ($snoopy->results);
?>
Hey, the effect is good, and all are the URLs we need, and there is no kind of/blog/read.php/85.htm that kind of things.
Submit data In addition those later test ...
yuanma/php/snoopy-1.2.3.tar.gz Local Download
The above describes the Lazysnoopy Snoopy PHP version of the network client to provide local download, including the lazysnoopy aspect of the content, I hope that the PHP tutorial interested in a friend to help.