Basic steps for scraping web page data with Python:
1. Fetch the data: Requests, Urllib
2. Parse the data: BeautifulSoup, XPath
3. Save the data: MongoDB, MySQL, SQLite, CSV, Excel ……
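The three steps above can be sketched end to end as follows. This is a minimal illustration, not a definitive implementation: it uses only the standard library so that it runs without installing anything, which means `urllib.request` handles the fetch, `html.parser.HTMLParser` stands in for BeautifulSoup or XPath in the parse step, and SQLite is the storage target. The names `fetch`, `LinkParser`, `parse_links`, and `save_sqlite`, the `links` table, and the choice of extracting `<a>` links are all assumptions made for the example.

```python
import sqlite3
import urllib.request
from html.parser import HTMLParser


def fetch(url, timeout=10):
    """Step 1: download a page and return its HTML as text."""
    # Some sites reject requests without a User-Agent header.
    request = urllib.request.Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urllib.request.urlopen(request, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


class LinkParser(HTMLParser):
    """Step 2: collect (text, href) pairs from every <a href="..."> element."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None   # href of the <a> currently open, if any
        self._text = []     # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self._href = href
                self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append(("".join(self._text).strip(), self._href))
            self._href = None


def parse_links(html):
    """Run the parser over an HTML string and return the collected links."""
    parser = LinkParser()
    parser.feed(html)
    parser.close()
    return parser.links


def save_sqlite(links, conn):
    """Step 3: store the (text, href) rows in a SQLite table."""
    conn.execute("CREATE TABLE IF NOT EXISTS links (text TEXT, href TEXT)")
    conn.executemany("INSERT INTO links (text, href) VALUES (?, ?)", links)
    conn.commit()
```

With a real page, the steps chain together as `save_sqlite(parse_links(fetch(url)), sqlite3.connect("links.db"))`, where `url` and `links.db` are placeholders. Swapping in Requests for the fetch or BeautifulSoup for the parse changes only the body of the matching function.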
Related references and documentation:
Awesome Python, Chinese edition: https://github.com/jobbole/awesome-python-cn
Selenium + Python Chinese documentation: https://python-selenium-zh.readthedocs.io/zh_CN/latest/
Requests official documentation: http://cn.python-requests.org/zh_CN/latest/
Requests quickstart: http://docs.python-requests.org/zh_CN/latest/user/quickstart.html
Urllib official documentation: https://docs.python.org/3/library/urllib.html
Python official documentation: https://docs.python.org/3/library/index.html
Learn Python the Hard Way: https://www.kancloud.cn/kancloud/learn-python-hard-way/49863
Python 3 tutorial (RUNOOB.COM): http://www.runoob.com/python3/python3-tutorial.html
Python tutorial (Liao Xuefeng's official website): https://www.liaoxuefeng.com/wiki/0014316089557264a6b348958f449949df42a6d3a2e542c000
HTTP tutorial (RUNOOB.COM): http://www.runoob.com/http/http-tutorial.html
Common pip commands for Python: https://blog.csdn.net/ouyanggengcheng/article/details/72821092
XPath tutorial: http://www.w3school.com.cn/xpath/
Web Scraping from Beginner to Expert: Parsing Web Pages (XPath): https://zhuanlan.zhihu.com/p/25572729
Python Scraping Tools, Part 3: XPath Syntax and Using the lxml Library: https://blog.csdn.net/freeking101/article/details/64461574
Python regular expressions: http://www.runoob.com/python/python-reg-expressions.html
Regular Expressions: A 30-Minute Introduction: http://deerchao.net/tutorials/regex/regex.htm
Beautiful Soup Chinese tutorial: http://www.pythonclub.org/modules/beautifulsoup/start
Beautiful Soup 4.2.0 documentation: https://www.crummy.com/software/BeautifulSoup/bs4/doc.zh/
Download location for the 64-bit MongoDB builds for Windows: http://dl.mongodb.org/dl/win32/x86_64
MongoDB Manual: https://docs.mongodb.com/manual/tutorial/install-mongodb-on-windows/
ECharts: https://blog.csdn.net/coraline_m/article/details/51418263
         http://pyecharts.org/#/zh-cn/api
         http://echarts.baidu.com/index.html
numpy: http://www.numpy.org/
       https://www.yiibai.com/numpy/
matplotlib: http://python.jobbole.com/89077/