# pexels

**Repository Path**: HSCR7/pexels

## Basic Information

- **Project Name**: pexels
- **Description**: A crawler for the image site Pexels that saves the results to a database
- **Primary Language**: Python
- **License**: Not specified
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 0
- **Created**: 2019-12-26
- **Last Updated**: 2020-12-17

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README

# pexels

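#### Description

A crawler for the image site Pexels that downloads images and can optionally save the results to a database.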

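#### Usage

1. cd into the mypexel directory
2. If you want to save results to a database, run the SQL file first
3. If you do not want to save to a database, modify the code slightly to skip the database step
4. Start the crawl with: scrapy crawl mypexel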

#### Download settings

1. Set the storage path

    save_path = "F:\\pexels\\商务\\"
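2. If you are saving to a database, categry is the category currently being crawled (商务 means "business"); ignore this if you are not using a database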

    categry='商务'
    englishCategry='business'

3. Download URL (search/ followed by the search term); for example, to search for business:
    
    b_url='https://www.pexels.com/search/business/?format=html&seed=&page={}&type='

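4. Number of pages (each page holds roughly 30 items). Note that range(1, 50) covers pages 1 to 49; use range(1, 51) to fetch a full 50 pages

    for i in range(1, 50):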
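
As a minimal sketch of how the URL template and the page range from steps 3 and 4 fit together, the snippet below expands the template into one URL per page. The helper name `page_urls` and the constant name `B_URL` are hypothetical and do not come from the project; only the template string itself is taken from this README.

```python
# Template copied from this README; {} is filled with the page number.
B_URL = "https://www.pexels.com/search/business/?format=html&seed=&page={}&type="


def page_urls(template, first_page, last_page):
    """Return one URL per page, including both first_page and last_page.

    Hypothetical helper for illustration, not part of the project.
    """
    # range() excludes its stop value, hence the + 1.
    return [template.format(page) for page in range(first_page, last_page + 1)]


urls = page_urls(B_URL, 1, 50)
print(len(urls))  # 50
print(urls[0])
```

Making the last page inclusive avoids the off-by-one that comes from passing the page count directly as the stop value of range().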
        
***If saving to the database fails, the database character encoding may be wrong; set the character set to utf8mb4 (UTF-8 Unicode).***
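
One possible cause of such failures, assumed here rather than stated by the project, is text containing characters that need 4 bytes in UTF-8 (emoji, for example), which MySQL's legacy 3-byte utf8 character set cannot store. The sketch below uses a hypothetical helper, `needs_utf8mb4`, to check a string for such characters. Other encoding mismatches, such as a latin1 database receiving Chinese text, are not covered by this check.

```python
def needs_utf8mb4(text):
    """Return True if text has a character that takes 4 bytes in UTF-8.

    Hypothetical helper for illustration, not part of the project.
    """
    # Code points above U+FFFF lie outside the Basic Multilingual Plane
    # and encode to 4 bytes in UTF-8.
    return any(ord(ch) > 0xFFFF for ch in text)


print(needs_utf8mb4("商务"))               # False: each character is 3 bytes
print(needs_utf8mb4("sunset \U0001F305"))  # True: the emoji is 4 bytes
```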