用python實(shí)現(xiàn)簡(jiǎn)單的行數(shù)統(tǒng)計(jì)功能(暫未排除注釋行姑曙,僅計(jì)算除卻空行以外的行數(shù))
步驟
- 遍歷目錄下文件名模叙,獲取后綴,指定要進(jìn)行統(tǒng)計(jì)的文件類型
- 對(duì)文件內(nèi)容進(jìn)行讀取新啼,排除空行內(nèi)容追城,除空行外,讀取到內(nèi)容則計(jì)數(shù)加1
代碼:
# coding=utf-8
import os
import time
basedir = os.path.abspath('test')
filelists = []
# 指定想要統(tǒng)計(jì)的文件類型
whitelist = ['txt', 'py']
#遍歷文件, 遞歸遍歷文件夾中的所有
def getFile(basedir):
global filelists
for parent, dirnames, filenames in os.walk(basedir):
#for dirname in dirnames:
# getFile(os.path.join(parent,dirname)) #遞歸
for filename in filenames:
ext = filename.split('.')[-1]
print(ext)
#只統(tǒng)計(jì)指定的文件類型燥撞,略過一些log和cache文件
if ext in whitelist:
filelists.append(os.path.join(parent,filename))
#統(tǒng)計(jì)一個(gè)文件的行數(shù)
def countLine(fname):
count = 0
for file_line in open(fname).readlines():
if file_line != '' and file_line != '\n': #過濾掉空行
count += 1
print(fname + '----' , count)
return count
if __name__ == '__main__' :
startTime = time.clock()
getFile(basedir)
print(filelists)
totalline = 0
for filelist in filelists:
totalline = totalline + countLine(filelist)
print('total lines:',totalline)
print('Done! Cost Time: %0.2f second' % (time.clock() - startTime))
- 更多代碼詳情參考我的Github