子类重写def item_completed(self, results, item, info),可以实现文件重命名功能 #9

dingyuanhong2006 · 2019-08-27T05:14:49Z

from scrapy.pipelines.images import ImagesPipeline
from scrapy import Request
from ImageSpider.settings import IMAGES_STORE as images_store
import os

class ImagespiderPipeline(ImagesPipeline):

def get_media_requests(self, item, info):
    # 循环每一张图片地址下载，若传过来的不是集合则无需循环直接yield
    for image_url in item['imgurl']:
        yield Request(image_url)

# def file_path(self, request, response=None, info=None):
#     # 重命名，若不重写这函数，图片名为哈希，就是一串乱七八糟的名字
#     image_guid = request.url.split('/')[-1]  # 提取url前面名称作为图片名。
#     return image_guid

# def item_completed(self, results, item, info):
# 	#重命名文件,并把默认路径D:\ImageSpider\full\*图片 
# 	#修改为D:\ImageSpider\*.jpg,提取item['imgurl']中url前面名称作为图片名
# 	#功能上类似file_path
# 	image_path = [x["path"] for ok, x in results if ok]
# 	for i in range(len(image_path)):
# 		os.rename(images_store+'/'+image_path[i],images_store+'/'+item['imgurl'][i].split('/')[-1])

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

子类重写def item_completed(self, results, item, info),可以实现文件重命名功能 #9

子类重写def item_completed(self, results, item, info),可以实现文件重命名功能 #9

dingyuanhong2006 commented Aug 27, 2019

子类重写def item_completed(self, results, item, info),可以实现文件重命名功能 #9

子类重写def item_completed(self, results, item, info),可以实现文件重命名功能 #9

Comments

dingyuanhong2006 commented Aug 27, 2019