1. The approach to building a word cloud: collect the data, segment it into words, generate the word cloud
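To make the three stages concrete, here is a minimal sketch that uses only the standard library and assumes Python 3. The sample sentence and the helper names `tokenize` and `word_frequencies` are made up for illustration. This is not how wordcloud works internally (it also filters common stopwords, for example); it only shows the idea that word counts drive the size of each word.

```python
import re
from collections import Counter


def tokenize(text):
    """Split English text into lowercase words (a stand-in for real segmentation)."""
    return re.findall(r"[a-z']+", text.lower())


def word_frequencies(text):
    """Count how often each token occurs; a word cloud sizes words by such counts."""
    return Counter(tokenize(text))


# Stage 1: "collected" data (a hypothetical sample sentence)
sample = "Love is patient, love is kind."
# Stage 2 and the counting step that feeds stage 3
freqs = word_frequencies(sample)
print(freqs.most_common(2))
```

The more often a word occurs, the larger it would be drawn in the final image.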
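2. Module used: the wordcloud package. If it is not installed, open cmd, change to the Scripts folder under the Python installation directory, and install it with pip: pip install wordcloud. The installation failed with the following error: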
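Download Microsoft Visual C++ Compiler for Python 2.7 from https://www.microsoft.com/en-us/download/details.aspx?id=44266 and install it; that resolves the error. After that, whenever another package turns out to be missing, install it from cmd in the same way, for example pip install jieba (substitute the name of the package you need).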
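Before running the code, it can help to see which packages are still missing. The following is a small sketch, assuming Python 3 (it relies on `importlib.util.find_spec`, which Python 2.7 does not have); the helper name `missing_packages` is made up for this example.

```python
import importlib.util


def missing_packages(names):
    """Return the names from `names` that cannot be imported in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# The two packages this post relies on
for name in missing_packages(["wordcloud", "jieba"]):
    print("Missing package, install it with: pip install " + name)
```

If nothing is printed, both packages are already available.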
3. Start coding: import the modules
from wordcloud import WordCloud
import os

# Build the path to the "resource" folder next to this script
cur_path = os.path.join(os.path.dirname(os.path.abspath(__file__)), "resource")
print(cur_path)
# Open the data file to analyze
with open(os.path.join(cur_path, 'love_en.txt'), 'r') as fp:
    text = fp.read()
print(text)
# Count the word frequencies in the text and store them in the WordCloud object
wordCloud = WordCloud().generate(text)
image = wordCloud.to_image()
image.show()
The above is the complete code.
The result looks like this:
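4. wordcloud handles word segmentation for English text; jieba handles it for Chinese text
Use jieba to process the Chinese text and generate a word cloud in a custom shape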
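One possible way to combine the two libraries is sketched below, assuming Python 3 with jieba, numpy, Pillow and wordcloud installed. The helper names `join_tokens` and `make_shaped_cloud`, and all file names in the usage comment, are hypothetical. Two points matter for Chinese text: the segmented words are joined with spaces so that wordcloud can split them, and `font_path` has to point at a font that contains Chinese glyphs, otherwise the words are drawn as empty boxes.

```python
def join_tokens(tokens):
    """Join segmented tokens with spaces, dropping whitespace-only tokens."""
    return " ".join(t for t in tokens if t.strip())


def make_shaped_cloud(text_path, mask_path, font_path, out_path):
    """Segment Chinese text with jieba and draw it inside the shape of a mask image."""
    # Third-party imports are kept local so join_tokens works without them.
    import jieba
    import numpy as np
    from PIL import Image
    from wordcloud import WordCloud

    with open(text_path, "r", encoding="utf-8") as fp:
        text = fp.read()

    # jieba.cut yields the segmented words; WordCloud later splits on the spaces.
    segmented = join_tokens(jieba.cut(text))

    # Pure white pixels of the mask stay empty; words fill the remaining area.
    mask = np.array(Image.open(mask_path))

    cloud = WordCloud(font_path=font_path, mask=mask, background_color="white")
    cloud.generate(segmented)
    cloud.to_file(out_path)
    return cloud


# Hypothetical usage (none of these files are part of the original post):
# make_shaped_cloud("resource/love_cn.txt", "resource/heart.png",
#                   "simhei.ttf", "love_cn.png")
```

The mask image should show the desired shape in a dark color on a pure white background.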