web crawler gadget (IE)--Wu Jie
Http://club.excelhome.net/thread-1095707-1-1.html
The advantage of using IE to extract Web pages is that what you see is what you get, and the information you can see on a webpage is generally available.
This tool does not feature much, mainly to facilitate the extraction of the information displayed on the page of the elements of the code. Hope to have a little help to everyone.
Web crawling gadget. rar(22.91 KB, Downloads: 2426)
How to use this tool:
1, enter the URL in B1, can be open Web page, can also be open
2, A2 and B2 content do not change, the second row of other cells can enter the element's individual property names. Where the InnerText cell has a drop-down option
3. Click "Start Analysis" to analyze the page elements.
4, column A is the object code of each element.
5. After the InnerText column finds the content you want to extract, select the row and click "Generate Excel".
You can extract tables from table labels or download an img tag image.
6. In the newly generated Excel, click the "Execute Code" button to see if the required data can be generated.
If the generated data does not match the data that is being analyzed, the reason may be:
1, the Web page is not fully loaded, the corresponding label data has not been loaded, the code automatically extracted the label data thereafter.
Possible workarounds: Plus do...loop time delay.
2, the page is a Dynamic Web page, the number of the label variable.
Possible solution: The element has the ID name of the word with getElementById ("ID name") name to obtain, no words catch shifting XMLHTTP method extract it.
3, need to select the relevant options or need to log in to extract.
Possible workaround: First do a login or select the relevant options and then extract
This tool is intended for beginners, the browser can parse, but cannot give a specific element of the VBA code. The VBA code of the net catch can be generated directly in the tool. Ajax, frame content can also automatically generate code.
Web crawler widgets (ie method)