&lt;&lt;Python基礎教程&gt;&gt;學習筆記 | 第11章

<<Python基礎教程>>學習筆記 | 第11章 | 檔案和素材

最後更新：2017-04-21 來源：互聯網

上載者：User

創建阿里雲帳戶，並獲得超過 40 款產品的免費試用版；而企業帳戶則可以享有總值 $1200 的免費試用版。立即註冊！

標籤：檔案名稱 read -- 標準輸出迭代器佔用緩衝重用 stat

開啟檔案

open(name[mode[,buffing])

name: 是強制選項，模式和緩衝是可選的

#假設檔案不在。會報以下錯誤:

>>> f = open(r‘D:\text.txt‘,‘r‘)Traceback (most recent call last):  File "<stdin>", line 1, in <module>IOError: [Errno 2] No such file or directory: ‘D:\\text.txt‘

檔案模式

NOTE:

1. 預設的方式，比方說open(‘filename‘)是讀模式

2. r+, 則表示可讀寫

3. 假設是二進位檔案或圖形檔案，則必須用緩衝模式

4. 普通的w模式會覆蓋檔案的內容，a模式則不會.

5. rb則能夠用來讀取二進位檔案.

6, 通過參數模式中使用U參數，可以在開啟檔案時使用通用的分行符號支援模式，不管\r,\n\r,都會換成\n。而不用考慮執行的平台.

緩衝:

第三個參數。可選

0或者False: 無緩衝。全部操作直接針對硬碟

1或者True: 有緩衝，記憶體取代硬碟，速度快，僅僅有close,flush才寫入硬碟同步.

> 1 : 表示緩衝區的大小

-1 : 表示預設的緩衝區大小

基本檔案方法

NOTE: 類檔案對象是支援一些檔案的方法的對象，比方file方法。最重要的兩個方法,read,write方法.還有urllib.urlopen返回對象，他們支援的方法有: read,readline,readlines

三種標準的流

sys.stdin :標準輸入資料流,可通過輸入或者使用管道把它和其他程式的標準輸出連結起來提供文本

sys.stdout :放置input,raw_input寫入的資料，可在螢幕上顯示，也可通過|串連到其他程式的標準輸入

sys.stderr :錯誤資訊，比方說棧追蹤被寫入sys.stderr.

讀和寫

最重要的能力是提供讀寫

#對空檔案來說: 提供寫時。會在已在字串末尾追加,

>>> f = open(‘somefile.txt‘,‘w‘)>>> f.write(‘Hello,‘)>>> f.write(‘World!‘)>>> f.close()#somefile.txt檔案內容Hello,World!

#對於非空檔案:提供w方法時，會覆蓋檔案裡的內容

>>> f = open(‘somefile‘,‘w‘)>>> f.write(‘This is 1st line.\n‘)>>> f.write(‘This is 2nd line.‘)>>> f.close()#somefile.txt檔案內容This is 1st line.This is 2nd line.

簡單讀取的範例:

>>> f = open(‘somefile.txt‘,‘r‘)>>> f.read(16)#先讀取16個字元‘This is 1st line‘>>> f.read()  #會讀取剩下的內容，除非seek定位到0，又一次讀取‘.\nThis is 2nd line.‘>>> f.close()

管道輸出

$ cat somefile.txt | python somescript.py | sort

一個簡單範例: 統計一個文本中單詞的數量

$ cat somefile.txt
This is a book!
That is a dog!
Who are you?

指令碼清單

#somescript.pyimport systext  = sys.stdin.read()  words = text.split()print "Word Count:", len(words)

輸出結果:

# cat somefile.txt | python somescript.pyWord Count: 11

隨機訪問:

用seek和tell來訪問自己感興趣的部分

seek(offset[,whence]). offset，位移量,Whence值

0: 開始位置

1: 當前位置

2: 檔案末尾

簡單範例:

>>> f = open(‘somefile.txt‘,‘w‘)>>> f.write(‘01234567890123456789‘)>>> f.seek(5)>>> f.write(‘Hello,World!‘)>>> f.close()>>> f = open(‘somefile.txt‘)>>> f.read()‘01234Hello,World!789‘#用tell來返回當前檔案的位置>>> f = open(‘somefile.txt‘)>>> f.read(3)‘012‘>>> f.read(2)‘34‘>>> f.tell()5L

讀寫行:

readline : 讀取行,包含分行符號

readlines: 讀取全部行

write: 寫一行, 注意:沒有writeline方法

writelines: 寫多行

NOTE: 怎樣推斷不同的行以什麼結尾? os.linesep

#UNIX >>> import os>>> os.linesep‘\n‘#WINDOWS>>> import os>>> os.linesep‘\r\n‘

關閉檔案

時刻記得close()來關閉檔案，這樣做的目的:

1. 出於安全考慮,防止檔案由於某些原因崩潰。寫不進資料

2. 出於資料同步考慮,close(),才會往硬碟中寫資料

3. 出於效率的考慮，記憶體中的資料可清空一部分出來

為了確保程式結束時close(),能夠用try/finally結合使用

# Open your file heretry:    # Write data to your filefinally:    file.close()

NOTE: 一般檔案在close()之後才會寫入硬碟，假設想不運行close()方法。又能夠看到寫入的內容，那麼flush就派上用場了.

使用基本方法:

#測試文本somefile.txt

Welcome to this file

There is nothing here except

This stupid haiku

首先讀取指定字元

>>> f = open(r‘d:\Learn\Python\somefile.txt‘)>>> f.read(7)‘Welcome‘>>> f.read(4)‘ to ‘>>> f.close()

其次讀取全部的行

>>> f = open(r‘d:\Learn\Python\somefile.txt‘,‘r‘)>>> print f.read()Welcome to this fileThere is nothing here exceptThis stupid haiku

接著是讀取行

>>> f.close()>>> f = open(r‘d:\Learn\Python\somefile.txt‘)>>> for i in range(3):...     print str(i) + ‘:‘ + f.readline()...0:Welcome to this file1:There is nothing here except2:This stupid haiku

再讀取全部行:

>>> import pprint>>> pprint.pprint(open(‘somefile.txt‘).readlines())[‘Welcome to this file\n‘, ‘There is nothing here except\n‘, ‘This stupid haiku‘]

以下是寫檔案

>>> f = open(r‘somefile.txt‘,‘w‘)>>> f.write(‘this\nis no\nhaiku‘)>>> f.close()執行檔案後，內容例如以下:thisis nohaiku

最後是writelines

>>> f = open(r‘somefile.txt‘)>>> lines = f.readlines()>>> f.close()>>> lines[1] = "isn‘t a\n">>> f = open(‘somefile.txt‘,‘w‘)>>> f.writelines(lines)>>> f.close()執行後。檔案內容例如以下:this isn‘t ahaiku

對檔案內容進行迭代

主要的方法如: read,readline,readlines,還有xreadline和檔案迭代器

以下的範例都使用的虛擬函數process()，表示每一個字元或每行處理過程

def process(string):

print ‘Processing‘, string

更實用的實現是:在資料結構中儲存資料。計算和值。

用re模組來替代模式或者添加行號.假設要實現上面的功能。

則應該講filename變數設定為實際的檔案名稱.

按位元組處理

def process(string):    print ‘Processing...‘, stringf = open(‘somefile.txt‘)char = f.read(1)while char:    process(char)    char = f.read(1)f.close()

代碼重用一般是件壞事，懶惰是美德。重寫下代碼例如以下:

def process(string):        print ‘Processing...‘, stringf = open(‘somefile.txt‘)while True:        char = f.read(1)        if not char:                break        process(char)f.close()

NOTE: 這樣寫就比上面要好，避免了反覆的代碼.

按行操作

f = open(filename)while True:    line = f.readline()    if not line:        break    process(line)f.close()

讀取全部內容

假設檔案不是非常大，能夠用read()，或者readlines()讀取的內容作為字串來處理.

#用read來迭代每一個字串

f = open(r‘D:\Work\Python\somefile.txt‘)for char in f.read():    process(char)f.close()

#用readlines來迭代行

f = open(r‘D:\Work\Python\somefile.txt‘,‘r‘)for line in f.readlines():    process(line)f.close()

使用fileinput實現懶惰行迭代

在須要對一個大檔案進行迭代時。readlines會佔用太多的記憶體。

這個時候能夠使用while迴圈和readline方法來替代。

import fileinputdef process(string):    print ‘Processing...‘, stringfor line in fileinput.input(‘somefile.txt‘):    process(line)

檔案迭代器

#Python中檔案是能夠迭代的，寫起來也非常優雅

f = open(‘somefile.txt‘)for line in f:    print line,f.close()

#假設希望Python來完畢關閉的動作，對檔案進行迭代,而不使用變數儲存變數

#代碼能夠更加精簡

for line in open(‘somefile.txt‘):    print line,

sys.stdin也是能夠迭代的，簡單代碼例如以下:

import sysfor line in sys.stdin:    print line,執行結果:D:\Work\Python>python file.py#輸入以下兩行Hello,World!Hello,Jerry!^Z      #按下CTRL+Z鍵後。輸入的內容，顯示Hello,World!Hello,Jerry!

#能夠對檔案迭代器執行和普通迭代器同樣的操作。比方將它們轉換為字串列表，這樣所達到的效果和使用readlines一樣.例如以下例:

>>> f = open(‘somefile.txt‘,‘w‘)>>> f.write(‘First line\n‘)>>> f.write(‘Second line\n‘)>>> f.write(‘Third line\n‘)>>> f.close()>>> lines = list(open(‘somefile.txt‘))>>> lines[‘First line\n‘, ‘Second line\n‘, ‘Third line\n‘]>>> first,second,third = open(‘somefile.txt‘)>>> first‘First line\n‘>>> second‘Second line\n‘>>> third‘Third line\n‘

NOTE:

1.用序列來做解包操作很使用

2.讀檔案操作時，能夠不用close()

WITH語句的使用

with語句使用所謂的上下文管理器對代碼塊進行封裝。同意上下文管理器實現一些設定和清理操作。

比如：檔案能夠作為上下文管理器使用，它們能夠關閉自身作為清理的一部分。

NOTE：在PYTHON2.5中，須要使用from __future__ import with_statement進行with語句的匯入

with open(‘test.txt‘) as myfile:    while True:        line = myfile.readline()        if not line:            break        print line,#假設這樣寫的話。就無需關閉檔案了。

------

本章新函數

file(name[,mode[,buffering]]) 開啟一個檔案並返回一個檔案對象

open(name[,mode[,buffering]]) file的別名;在開啟檔案，使用open而不是file

<<Python基礎教程>>學習筆記 | 第11章 | 檔案和素材

本文章原先以中文撰寫並發佈於 aliyun.com，亦設英文版本，僅作資訊用途。本網站不對文章的準確性，完整性或可靠性或其任何翻譯作出任何明示或暗示的陳述或保證。如對該文章有任何疑慮或投訴，請傳送電郵至 info-contact@alibabacloud.com 並提供相關疑慮或投訴的詳細說明。職員會於 5 個工作天內與您聯絡，一經驗證之後，即會刪除該侵權內容。

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More