Php uses simple_html_dom to parse the HTML example, simplehtmldom
This example describes how php parses HTML using simple_html_dom. We will share this with you for your reference. The details are as follows:
Today I wrote two crawlers, one using Python and the other using PHP. To be honest, the two implementation methods are very convenient.
The following describes how to use the simple_html_dom open-source class to parse HTML code:
1. Download simple_html_dom:
Http://sourceforge.net/projects/simplehtmldom/files/
Or click here to download.
2. Usage example:
<?php// example of how to use basic selector to retrieve HTML contentsinclude('../simple_html_dom.php');// get DOM from URL or file$html = file_get_html('http://www.google.com/');// find all linkforeach($html->find('a') as $e) echo $e->href . '<br>';// find all imageforeach($html->find('img') as $e) echo $e->src . '<br>';// find all image with full tagforeach($html->find('img') as $e) echo $e->outertext . '<br>';// find all div tags with id=gbarforeach($html->find('div#gbar') as $e) echo $e->innertext . '<br>';// find all span tags with class=gb1foreach($html->find('span.gb1') as $e) echo $e->outertext . '<br>';// find all td tags with attribite align=centerforeach($html->find('td[align=center]') as $e) echo $e->innertext . '<br>';// extract text from tableecho $html->find('td[align="center"]', 1)->plaintext.'<br>