
Python crawls Baidu beauty pictures

PHPz
Original
2017-04-04 10:34:21

Use Python to automatically batch-download beauty pictures from Baidu.

Result

[Image: result of running the script]

My code

import requests
import re

[Image: Baidu image search in page-flip view mode]

# Build the URL of every results page and crawl each one
def get_page():
    urls = ['http://image.baidu.com/search/flip?tn=baiduimage&ie=utf-8&word=%E7%BE%8E%E5%A5%B3%E5%9B%BE%E7%89%87&pn={}&gsm=3c00000000003c'.format(num)
            for num in range(0, 20000, 20)]
    for url in urls:
        print(url)
        get_img_link(url)

# Extract the link of every image from one results page
def get_img_link(url):
    r = requests.get(url)
    # Decode the response body as UTF-8
    r.encoding = 'utf-8'
    html_code = r.text
    # Non-greedy match of the value that follows "objURL"
    reg = re.compile(r'"objURL":"(.*?)"')
    imgs = reg.findall(html_code)
    for img in imgs:
        down_img(img)
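
# Download one image and save it locally
def down_img(url):
    web_data = requests.get(url)
    # Use the last segment of the URL as the filename
    filename = url.split('/')[-1]
    # The folder E:/pict_baidu must already exist
    targetfile = 'E:/pict_baidu/{}'.format(filename)
    with open(targetfile, 'wb') as f:
        f.write(web_data.content)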

if __name__ == '__main__':
    get_page()
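
One fragile spot in down_img is the filename: url.split('/')[-1] keeps any query string and is empty when the URL ends in a slash. The following is a minimal sketch of a more defensive alternative, not part of the original script. The helper name target_path and the fallback name image.jpg are hypothetical, and only the standard library is used.

```python
import os
from urllib.parse import urlsplit, unquote


def target_path(url, directory, fallback='image.jpg'):
    """Return a local path for the image at `url` inside `directory`.

    Hypothetical helper: it drops the query string, decodes percent-escapes,
    and falls back to a default name when the URL path has no last segment.
    """
    path = urlsplit(url).path                # URL path without the '?...' part
    name = os.path.basename(unquote(path))   # last path segment, may be ''
    return os.path.join(directory, name or fallback)


print(target_path('http://example.com/a/cat.jpg?size=big', 'pict_baidu'))
print(target_path('http://example.com/', 'pict_baidu'))
```

Calling os.makedirs(directory, exist_ok=True) once before downloading also avoids the FileNotFoundError that open raises when the target folder is missing.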
Summary

  • Crawler idea: get the links of multiple results pages -> get the image links on each page -> download the images;

  • Use of regular expressions;

  • Use of format and the with open ... as syntax;

  • Setting the response encoding;

  • Usage of the requests and re modules.
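
The first two steps of the crawler idea above can be tried without any network access. The sketch below is illustrative only: page_urls, extract_image_links, and the sample string are made up for this example, the gsm parameter is left out of the URL for brevity, and a real Baidu response may look different from the sample. It builds the page links by percent-encoding the keyword and stepping pn by 20, then applies the same non-greedy objURL pattern as the script.

```python
import re
from urllib.parse import quote

# Shortened form of the URL used in the script (gsm parameter omitted).
BASE = 'http://image.baidu.com/search/flip?tn=baiduimage&ie=utf-8&word={}&pn={}'
OBJ_URL = re.compile(r'"objURL":"(.*?)"')


def page_urls(keyword, pages, step=20):
    """Build one results-page URL per page; pn advances by `step` per page."""
    word = quote(keyword)  # UTF-8 percent-encoding of the keyword
    return [BASE.format(word, n * step) for n in range(pages)]


def extract_image_links(html):
    """Return every value captured by the non-greedy objURL pattern."""
    return OBJ_URL.findall(html)


# Made-up fragment imitating the data embedded in a results page.
sample = '{"objURL":"http://example.com/1.jpg"},{"objURL":"http://example.com/2.png"}'

print(page_urls('美女图片', 2))
print(extract_image_links(sample))
```

Because .*? is non-greedy, each match stops at the first closing quote, so the two links come back as separate items instead of one long match.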

