#-*-coding:utf-8-*-#Python 2.7#Xiaodeng#http://tieba.baidu.com/p/2460150866#Label Operations fromBs4ImportBeautifulSoupImporturllib.requestImportRe#if it is a URL, you can use this method to read the page#Html_doc = "http://tieba.baidu.com/p/2460150866"#req = urllib.request.Request (html_doc)#webpage = urllib.request.urlopen (req)#html = webpage.read ()HTML=""""""Soup= BeautifulSoup (HTML,'Html.parser')#Document Object#find a tag and find only a label#print (SOUP.A) #<a class= "sister" href= "Http://example.com/elsie" id= "Xiaodeng" ><!--Elsie--></a > forKinchSoup.find_all ('a'): Print(k)Print(k['class'])#check the Class property of a tag Print(k['ID'])#Check the ID value of the a tag Print(k['href'])#Check the href value of a tag Print(k.string)#Check the A-tag string #tag.get (' CALSS ') can also achieve this effect
Python3 using BeautifulSoup to crawl the data in a tag using a dictionary method