How to count the words in a text file with Python

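This article shows, by example, how to count the words in a text file with Python. The script below also counts lines, blank lines, and sentences. It is shared here for reference; the implementation is as follows: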

# count lines, sentences, and words of a text file
import sys

# set all the counters to zero
lines, blanklines, sentences, words = 0, 0, 0, 0

print('-' * 50)

# use a text file you have, or google for this one ...
filename = 'GettysburgAddress.txt'
try:
    textf = open(filename, 'r')
except IOError:
    print('Cannot open file %s for reading' % filename)
    sys.exit(1)

# reads one line at a time
for line in textf:
    print(line, end='')  # test
    lines += 1
    if line.startswith('\n'):
        blanklines += 1
    else:
        # assume that each sentence ends with . or ! or ?
        # so simply count these characters
        sentences += line.count('.') + line.count('!') + line.count('?')
        # create a list of words
        # use None to split at any whitespace regardless of length
        # so for instance double space counts as one space
        tempwords = line.split(None)
        print(tempwords)  # test
        # word total count
        words += len(tempwords)

textf.close()

print('-' * 50)
print("Lines      : ", lines)
print("Blank lines: ", blanklines)
print("Sentences  : ", sentences)
print("Words      : ", words)

# optional console wait for keypress (msvcrt exists only on Windows)
try:
    from msvcrt import getch
    getch()
except ImportError:
    pass
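The counting logic in the script can also be wrapped in a reusable function, which makes it easier to test without a file on disk. The following is a minimal sketch, not the original author's code: the name `count_text_stats` and the returned dictionary keys are illustrative assumptions, and unlike the script it treats whitespace-only lines as blank.

```python
import io


def count_text_stats(lines):
    """Count lines, blank lines, sentence terminators, and words.

    `lines` is any iterable of strings, such as an open text file.
    (Hypothetical helper; the name and return shape are assumptions.)
    """
    stats = {"lines": 0, "blank_lines": 0, "sentences": 0, "words": 0}
    for line in lines:
        stats["lines"] += 1
        # A line with no visible characters is counted as blank.
        if not line.strip():
            stats["blank_lines"] += 1
            continue
        # Same heuristic as the script: every . ! or ? ends a sentence.
        stats["sentences"] += sum(line.count(ch) for ch in ".!?")
        # split() with no argument splits on any run of whitespace,
        # so a double space does not produce an empty word.
        stats["words"] += len(line.split())
    return stats


# In-memory sample so the example runs without a file on disk.
sample = io.StringIO(
    "Four score and seven years ago.\n"
    "\n"
    "It is altogether fitting!  Is it?\n"
)
result = count_text_stats(sample)
print(result)
```

To use it on a real file, pass the open file object, for example `with open('GettysburgAddress.txt') as f: print(count_text_stats(f))`. Note that counting terminator characters overestimates sentences when the text contains abbreviations or ellipses.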

We hope this article helps with your Python programming.
