14. Scrapy Framework, Hands-On: High-Speed Download of zcool Featured Images (3)
settings.py configuration
ITEM_PIPELINES = {
    'imagedownload.pipelines.ImagedownloadPipeline': 300,
    'scrapy.pipelines.images.ImagesPipeline': 1
}
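The pipeline code reads `settings.IMAGES_STORE`, which does not appear in the snippet above, so it has to be defined in settings.py as well. A minimal sketch, assuming the images should land in an `images` folder under the directory the crawl is started from (the folder name and location are assumptions, not taken from the original project):

```python
import os

# Assumption: save downloads in an "images" folder inside the current
# working directory, i.e. wherever `scrapy crawl` is run from.
IMAGES_STORE = os.path.join(os.getcwd(), "images")
```

Scrapy's ImagesPipeline also needs the Pillow package installed, otherwise the pipeline is not enabled.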
pipelines.py code
from scrapy.pipelines.images import ImagesPipeline
from imagedownload import settings
import os
import re


class ImagedownloadPipeline(ImagesPipeline):
    def get_media_requests(self, item, info):
        # Attach the item to each request so file_path() can read its title.
        media_requests = super().get_media_requests(item, info)
        for media_request in media_requests:
            media_request.item = item
        return media_requests

    def file_path(self, request, response=None, info=None, *, item=None):
        # The default path looks like "full/<hash of the URL>.jpg".
        # The keyword-only item argument is passed by newer Scrapy versions; it is unused here.
        origin_path = super().file_path(request, response=response, info=info)
        title = request.item['title']
        # Strip characters that are not allowed in Windows folder names.
        title = re.sub(r'[\\/:*?"<>|]', "", title)

        # One sub-folder per title under IMAGES_STORE.
        save_path = os.path.join(settings.IMAGES_STORE, title)
        os.makedirs(save_path, exist_ok=True)
        image_name = origin_path.replace("full/", "")
        return os.path.join(save_path, image_name)
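The path logic in `file_path` can be tried on its own, without running a crawl. The sketch below mirrors it in a standalone helper; `build_save_path` and the sample title are hypothetical names made up for illustration, and the `|` character in the pattern is an assumption based on the set of characters Windows rejects in file names.

```python
import os
import re

# Characters that Windows does not allow in file or folder names.
_INVALID_CHARS = re.compile(r'[\\/:*?"<>|]')


def build_save_path(images_store, title, origin_path):
    """Hypothetical helper mirroring the path logic of file_path()."""
    # Remove characters that cannot appear in a folder name.
    safe_title = _INVALID_CHARS.sub("", title)
    # Drop the default "full/" prefix and keep only the file name.
    image_name = origin_path.replace("full/", "")
    return os.path.join(images_store, safe_title, image_name)


# The title 'Logo: "A/B" test?' becomes the folder name 'Logo AB test'.
print(build_save_path("images", 'Logo: "A/B" test?', "full/0a1b2c.jpg"))
```

Each work therefore gets its own folder named after its cleaned title, with the image files placed directly inside it.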
Previous article: Chapter 6, Scrapy Framework (13), 2020-03-15. Link:
http://www.reibang.com/p/23a56b78deee
Next article: Chapter 6, Scrapy Framework (15), 2020-03-17. Link:
http://www.reibang.com/p/5053c6dddbcc
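The material above comes from the internet and is shared for learning and discussion only. If it infringes your rights, please send me a private message and I will remove it. Thank you.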