Python的學習（三十） ---- Python實現檔案md5校正_

Python的學習（三十） ---- Python實現檔案md5校正__Python

最後更新：2018-07-24 來源：互聯網

上載者：User

創建阿里雲帳戶，並獲得超過 40 款產品的免費試用版；而企業帳戶則可以享有總值 $1200 的免費試用版。立即註冊！

Linux下校正檔案MD5值，最簡單的方法就是執行md5sum命令
md5sum filename
原本打算用subprocess調用系統命令來擷取md5值，

import subprocess,shlexcmd = "md5sum filename"p = subprocess(shlex.split(cmd), stdout=subprocess.PIPE)print p.stdout.read()

不過python有內建的MD5模組hashlib，用起來簡單很多，
Python Hashlib模組的使用說明 http://docs.python.org/2/library/hashlib.html
fd = hashlib.md5() #擷取一個MD5密碼編譯演算法對象
fd.update("string") #指定需要加密的字串
fd.hexdigest() #擷取加密後的16進位字串

執行個體

 #!/usr/bin/env python            #coding : utf-8 3  4 import sys  import hashlib                                                    def md5sum(filename):                fd = open(filename,"r")     fcont = fd.r     fd.close()              fmd5 = hashlib.md5(fcont)     return fmd5                                               if __name__ == "__main__":           fmd5 = md5sum(sys.argv[1])     print fmd5.hexdigest()

其中fmd5 = hashlib.md5(fcont)等同於
fmd5 = hashlib.md5(fcont)
fmd5.update(fcont)

需要注意的是，傳入 hashlib.md5() 的應該是檔案內容而不是檔案名稱，這樣才是對檔案內容產生md5校正碼；
另外，調用了 hashlib.md5() 後返回的是一個對象，想要獲得linux下 md5sum 同樣的效果，還要調用一下 hexdigest() 方法。

但是，這個方法有點過於粗暴，當檢驗大檔案時，一次將所有檔案內容讀入記憶體，實在耗費較大，
網上給出執行個體http://blog.csdn.net/shanliangliuxing/article/details/10115397，
根據檔案塊長度，依次擷取檔案內容讀入記憶體，通過update()逐次更新校正值，

#!/usr/bin/env python 2 #coding : utf-8 3 import hashlib    def md5hex(word):     """ MD5密碼編譯演算法，返回32位小寫16進位符號     """      if isinstance(word, unicode):         word = word.encode("utf-8")     elif not isinstance(word, str):         word = str(word)     m = hashlib.md5()     m.update(word)     return m.hexdigest()   def md5sum(fname):     """ 計算檔案的MD5值     """     def read_chunks(fh):         fh.seek(0)         chunk = fh.read(8096)         while chunk:             yield chunk             chunk = fh.read(8096)         else: #最後要將遊標放迴文件開頭             fh.seek(0)     m = hashlib.md5()     if isinstance(fname, basestring) \             and os.path.exists(fname):         with open(fname, "rb") as fh:             for chunk in read_chunks(fh):                 m.update(chunk)     #上傳的檔案快取 或 已開啟的檔案流     elif fname.__class__.__name__ in ["StringIO", "StringO"] \             or isinstance(fname, file):         for chunk in read_chunks(fname):             m.update(chunk)     else:         return ""40     return m.hexdigest()

本文章原先以中文撰寫並發佈於 aliyun.com，亦設英文版本，僅作資訊用途。本網站不對文章的準確性，完整性或可靠性或其任何翻譯作出任何明示或暗示的陳述或保證。如對該文章有任何疑慮或投訴，請傳送電郵至 info-contact@alibabacloud.com 並提供相關疑慮或投訴的詳細說明。職員會於 5 個工作天內與您聯絡，一經驗證之後，即會刪除該侵權內容。

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More