導語
Python有好多非常實用的函數總是讓人覺得相見恨晚6楞病(握拳)
String
- 判斷string的開頭是否含有某個字段startStr
str.startswith(startStr)
- 將string用指定格式輸出
print "{0} and {1} are couple.".format('Rose', 'Jack')
# which is equal to
print "%s and %s are couple." % (Rose', 'Jack')
# print string with equal width
print "{0:10} | {0:10} | {0:10}".format(str1, str2, str3)
List
- 將List用指定符號連接成String
" ".join(list) # use a space to join the list
Dict
- 有一種dict,當查詢其鍵值時芙沥,若查詢值不存在也不會報錯
from collections import defaultdict
"""
dataType can be null, or can be claimed with certain data type
"""
dict = defaultdict(dataType)
Date
- 拿到一個時間格式的字段,如何快速分離出year才沧、month奇颠、day、hour暴构、min、sec段磨?
dateutil.parser.Parse
可以解析大多數常見的日期格式
from dateutil.parser import parse
date = parse(dateString)
day = date.days
DataFrame
- Excel中的PivotTabel在統(tǒng)計分布的時候十分好用啊取逾,在
pandas.dataframe
中也有簡單快速有效的函數實現這一功能
"""
ATTENTION: return a dataframe with distinct values of this COLUMN serving as index
"""
df['COLUMN'].value_counts()
- 將dataframe保存成csv
df.to_csv('df.csv', index = False)
編碼
關于中文的編碼問題筆者只能感到惆悵……這時一定會提到一個包:codecs
- 寫入txt/csv文件時中文亂碼
import codecs
# open csv/txt file with utf-8 encoding
csvFile = codecs.open('csvFile.csv', encoding = 'utf-8')
csv.write('XXX')
- 寫入中文的csv用Excel打開會亂碼
這里涉及到了另一種概念——BOM頭,具體可參考UTF8 + BOM產生問題與小結如果希望寫入的文件含有BOM頭苹支,則可參照以下代碼:
with open('csvFile.csv', 'w') as csvFile:
csvFile.write(codecs.BOM_UTF8)
csvFile.write('XXX')
若希望在讀取文件之前刪除BOM頭:
csvFile = codecs.open('csvFile', 'r', 'utf_8_sig')
當然砾隅,若文件不大的話,也可以使用編輯器進行轉碼债蜜,例如:notepad++晴埂、ultraedit
Terminal
讓Terminal的輸出有字體顏色差異:
STYLE = {
'fore': {
'black': 30, 'red': 31, 'green': 32, 'yellow': 33,
'blue': 34, 'purple': 35, 'cyan': 36, 'white': 37,
},
'back': {
'black': 40, 'red': 41, 'green': 42, 'yellow': 43,
'blue': 44, 'purple': 45, 'cyan': 46, 'white': 47,
},
'mode': {
'bold': 1, 'underline': 4, 'blink': 5, 'invert': 7,
},
'default': {
'end': 0,
}
}
def use_style(string, mode='', fore='', back=''):
mode = '%s' % STYLE['mode'][mode] if STYLE['mode'].has_key(mode) else ''
fore = '%s' % STYLE['fore'][fore] if STYLE['fore'].has_key(fore) else ''
back = '%s' % STYLE['back'][back] if STYLE['back'].has_key(back) else ''
style = ';'.join([s for s in [mode, fore, back] if s])
style = '\033[%sm' % style if style else ''
end = '\033[%sm' % STYLE['default']['end'] if style else ''
return '%s%s%s' % (style, string, end)
# example print
print use_style('Hello World!', mode='bold', fore='red')