urllib2 第一篇
資源訪問:urlopen()
數(shù)據(jù)讀缺馀骸:read()
urlopen()-->response
-->response-->read()抓取網(wǎng)頁數(shù)據(jù)
-->response-->info()抓去網(wǎng)頁請求報(bào)頭信息
-->response-->geturl()抓取訪問地址
-->response-->getcode()抓取訪問錯誤代碼
import urllib2
req = urllib2.Request('https://www.baidu.com')
response = urllib2.urlopen(req)
pg = response.read()
print (pg)
運(yùn)行結(jié)果:
urllib2 第二篇
偽造瀏覽器請求
請求對象Request
-->Reauest-->add_header(key,value):設(shè)置請求頭數(shù)據(jù)
-->Reauest-->add_header(key):根據(jù)名稱獲取請求頭數(shù)據(jù)
# -*- coding:utf-8 -*-
import urllib2
from urllib2 import Request
#偽造瀏覽器
Headers={"User-Agent":"Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/53.0.2785.101 Safari/537.36"}
req = Request('https://www.baidu.com',headers=Headers)
response = urllib2.urlopen(req)
pg = response.read()
print (pg)
運(yùn)行結(jié)果: