Use the axwebbrowser Control
Reference mshtml
[1) Remove HTML tags and their attributes
[2) axwebbrower opens a page
[3) retrieve html Source code
1) Remove the HTML Tag and its attributes.
Private string getonevalue (string tempstr)
{
If (tempstr. length> 0)
{
Tempstr = regularexpressionsofhtml (tempstr );
Tempstr = tempstr. substring (0, tempstr. Length-1 );
}
Return tempstr;
}
Public static string regularexpressionsofhtml (string tempcontent)
{
// Tempcontent = system. Text. regularexpressions. RegEx. Replace (tempcontent, "<[^>] +>", ""); // any number
Tempcontent = system. Text. regularexpressions. RegEx. Replace (tempcontent, "<[^>] *>", ""); // match
Return tempcontent;
}
2) axwebbrower opens a page
String url = "**********";
Object zero = 0;
Object emptystring = "";
Axwebbrowser. navigate (URL, ref zero, ref emptystring );
3) retrieve the HTML sourceCode
Better in axwebbrower_documentcomplete events
Reference: Using mshtml;
Ihtmldocument2 htmldocument = (ihtmldocument2) axwebbrowser1.document;
String strhtml = htmldocument. Body. innerhtml. tostring (); // GET html
String [] arhtml = strhtml. Split ('\ n ');
All HTML sources are saved in arhtml.