Topic Center

Contact Sales

首頁 > 開發者 > Python

Python爬蟲爬取OA幸運飛艇平台擷取資料

最後更新：2018-06-25 來源：互聯網

上載者：User

創建阿里雲帳戶，並獲得超過 40 款產品的免費試用版；而企業帳戶則可以享有總值 $1200 的免費試用版。立即註冊！

標籤：chrome瀏覽器 htm 代碼 ret set attr 函數 params ima

安裝BeautifulSoup以及requests

開啟window 的cmd視窗輸入命令pip install requests 執行安裝，等待他安裝完成就可以了

BeautifulSoup庫也是同樣的方法

我使用的編譯器的是sublime text 3，覺得是挺好用的一個編譯軟體

其他工具： Chrome瀏覽器

Python版本： Python3.6

運行平台： Windows

1、首先我們搜尋OA幸運飛艇平台熱門排行榜：【×××。com/h5】企娥:217 1793 408

擷取網頁的代碼：

[python] view plain copy
def getHTMLText(url,k):
try:
if(k==0):
a={}
else:
a={‘offset‘:k}
r = requests.get(url,params=a,headers={‘User-Agent‘: ‘Mozilla/4.0‘})
r.raise_for_status()
r.encoding = r.apparent_encoding
return r.text
except:
print("Failed!")
經過觀察其中因為每一頁的網址其offset都不相同，故只要改變offset=k便可擷取每一頁的資訊

通過main函數以改變URL：

[python] view plain copy
def main():
basicurl=‘×××。com/h5‘
k=0
while k<=100:
html=getHTMLText(basicurl,k)
k+=10
getname(html)
通過BeautifulSoup的方法層層擷取標籤中的資訊，並for迴圈輸出

[python] view plain copy
def getname(html):
soup = BeautifulSoup(html, "html.parser")
paihangList=soup.find(‘dl‘,attrs={‘class‘:‘board-wrapper‘})
mov=[]
actor=[]
for movlist in paihangList.find_all(‘dd‘):
movitem=movlist.find(‘div‘,attrs={‘class‘:‘movie-item-info‘})
movname=movitem.find(‘p‘,attrs={‘class‘:‘name‘}).getText()
actors=movlist.find(‘div‘,attrs={‘class‘:‘board-item-main‘})
actorname=actors.find(‘p‘,attrs={‘class‘:‘star‘}).getText()
b=actorname.replace(‘\n‘,‘‘)
c=b.replace(‘ ‘,‘‘)
actor.append(c)
mov.append(movname)
mode= "{0:<30}\t{1:<50}"
for i,j in zip(mov,actor):
print(mode.format(i,j,chr(12288)))

Python爬蟲爬取OA幸運飛艇平台擷取資料

本文章原先以中文撰寫並發佈於 aliyun.com，亦設英文版本，僅作資訊用途。本網站不對文章的準確性，完整性或可靠性或其任何翻譯作出任何明示或暗示的陳述或保證。如對該文章有任何疑慮或投訴，請傳送電郵至 info-contact@alibabacloud.com 並提供相關疑慮或投訴的詳細說明。職員會於 5 個工作天內與您聯絡，一經驗證之後，即會刪除該侵權內容。

相關關鍵詞：

Python中的底線的用法介紹 01-13

python讀寫ini檔案樣本(python讀寫檔案)_python 01-19

python CMDB開發 09-19

python：發送郵件 12-08

python學習筆記2-列（list） 12-08

python學習筆記1-賦值與字串 12-08

聯繫我們

該頁面正文內容均來源於網絡整理，並不代表阿里雲官方的觀點，該頁面所提到的產品和服務也與阿里云無關，如果該頁面內容對您造成了困擾，歡迎寫郵件給我們，收到郵件我們將在5個工作日內處理。

如果您發現本社區中有涉嫌抄襲的內容，歡迎發送郵件至： info-contact@alibabacloud.com 進行舉報並提供相關證據，工作人員會在 5 個工作天內聯絡您，一經查實，本站將立刻刪除涉嫌侵權內容。

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More

Python爬蟲爬取OA幸運飛艇平台擷取資料

聯繫我們

熱門內容

熱門主題

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support