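I've recently been playing with Scrapy, the Python scraping framework,
and trying to crawl all sorts of things.
The output came out as piles of what looked like garbage characters.
After a lot of trial and error,
it turned out that the scraped text was being written as Unicode escape sequences (\uXXXX), which have to be turned back into real characters.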
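Where the "garbage" comes from can be reproduced without Scrapy at all. The snippet below is a minimal sketch, assuming Python 3 and only the standard json module; the item dictionary is made-up sample data, not something from a real crawl.

```python
import json

item = {"content": "中文"}  # hypothetical scraped item

# By default json.dumps escapes every non-ASCII character as \uXXXX
escaped = json.dumps(item)

# ensure_ascii=False keeps the characters themselves
readable = json.dumps(item, ensure_ascii=False)

print(escaped)   # {"content": "\u4e2d\u6587"}
print(readable)  # {"content": "中文"}

# Both strings carry the same data; only the text representation differs
assert json.loads(escaped) == json.loads(readable) == item
```

So the escaped file is not actually corrupted, it is just hard for a human to read.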
------------------------------------------
Here is how to fix the encoding problem in Scrapy.
pipelines.py:
# -*- coding: utf-8 -*-
import json

# Define your item pipelines here
#
# Don't forget to add your pipeline to the ITEM_PIPELINES setting
# See: http://doc.scrapy.org/en/latest/topics/item-pipeline.html


class QiubaiPipeline(object):
    def __init__(self):
        # Open the output file as UTF-8 text
        self.file = open('qiubai.json', 'w', encoding='utf-8')

    def process_item(self, item, spider):
        # ensure_ascii=False keeps Chinese characters instead of \uXXXX escapes
        # (Python 3 replacement for the Python 2 call line.decode('unicode-escape'))
        line = json.dumps(dict(item), ensure_ascii=False)
        self.file.write(line + '\n')  # one JSON object per line
        return item

    def close_spider(self, spider):
        # Scrapy calls close_spider on each pipeline when the spider finishes
        self.file.close()
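The write step can be checked outside a crawl. This is only a sketch under stated assumptions: JsonLinesWriter is a hypothetical stand-in that mimics the pipeline's open / write / close steps without importing Scrapy, the item is made-up sample data, and the file goes to a temporary directory rather than the project folder.

```python
import json
import os
import tempfile


class JsonLinesWriter:
    """Hypothetical stand-in for the pipeline: one JSON object per line."""

    def __init__(self, path):
        self.file = open(path, "w", encoding="utf-8")

    def process_item(self, item):
        # ensure_ascii=False writes the characters themselves, not \uXXXX
        self.file.write(json.dumps(dict(item), ensure_ascii=False) + "\n")
        return item

    def close(self):
        self.file.close()


path = os.path.join(tempfile.mkdtemp(), "qiubai.json")
writer = JsonLinesWriter(path)
writer.process_item({"author": "小明", "content": "你好"})  # made-up sample item
writer.close()

# Read the file back as UTF-8 and look at what was actually written
with open(path, encoding="utf-8") as f:
    raw = f.read()

print(raw)
```

If the printed line shows the Chinese text directly and contains no \u sequences, the encoding side is working.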
settings.py:
ITEM_PIPELINES = {
    'qiubai.pipelines.QiubaiPipeline': 500,
}
After that, you get nicely readable Chinese in the output.
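If all you need is a readable JSON export and no custom pipeline logic, Scrapy's built-in feed exports may be enough on their own. A sketch of the relevant setting, assuming the export is produced with the -o command-line option instead of the pipeline above:

```python
# settings.py
# Write exported feeds as UTF-8 instead of \uXXXX escape sequences
FEED_EXPORT_ENCODING = 'utf-8'
```

This is an alternative to the pipeline, not an addition to it; the pipeline is still the way to go when you want control over how each item is written.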