Is it because someone crawls the content on the webpage and places it on another website. The following describes a common method:
Use the HtmlAgilityPack component.
Public String GetHtml () {string url = "http://t.news.fx168.com/"; HttpWebRequest request = HttpWebRequest. create (url) as HttpWebRequest; using (HttpWebResponse response = request. getResponse () as HttpWebResponse) {using (Stream stream = response. getResponseStream () {HtmlDocument doc = new HtmlDocument (); doc. load (stream, System. text. encoding. UTF8); HtmlNode node = doc. documentNode. selectSingleNode ("// div [@ class = 'hzh _ FX168_news_main_left_listbg3 ']"); return node. innerHtml ;}}}
You can try it and capture the news list on the content page of FireWire Express. The crawling rule is to capture the content in the CLASS hzh_FX168_news_main_left_listbg3 of the div.