Community Learn Tools Library Leisure

English

Home > Backend Development > Python Tutorial > Detailed introduction to the simple crawler function based on Python3.4

Detailed introduction to the simple crawler function based on Python3.4

巴扎黑

Release： 2017-09-16 10:16:36

Original

1583 people have browsed it

This article mainly introduces Python3.4 programming to implement simple crawling and crawler functions, involving Python3.4 web page crawling and regular parsing related operating techniques. Friends in need can refer to the following

The examples of this article are described Python3.4 programming implements simple crawler function. Share it with everyone for your reference, the details are as follows:

import urllib.request
import urllib.parse
import re
import urllib.request,urllib.parse,http.cookiejar
import time
def getHtml(url):
  cj=http.cookiejar.CookieJar()
  opener=urllib.request.build_opener(urllib.request.HTTPCookieProcessor(cj))
  opener.addheaders=[(&#39;User-Agent&#39;,&#39;Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2272.101 Safari/537.36&#39;),(&#39;Cookie&#39;,&#39;4564564564564564565646540&#39;)]
  urllib.request.install_opener(opener)
  page = urllib.request.urlopen(url)
  html = page.read()
  return html
#print ( html)
#html = getHtml("http://weibo.com/")
def getimg(html):
  html = html.decode(&#39;utf-8&#39;)
  reg=&#39;"screen_name":"(.*?)"&#39;
  imgre = re.compile(reg)
  src=re.findall(imgre,html)
  return src
#print ("",getimg(html))
uid=[&#39;2808675432&#39;,&#39;3888405676&#39;,&#39;2628551531&#39;,&#39;2808587400&#39;]
for a in list(uid):
  print (getimg(getHtml("http://weibo.com/"+a)))
  time.sleep(1)

Copy after login

The above is the detailed content of Detailed introduction to the simple crawler function based on Python3.4. For more information, please follow other related articles on the PHP Chinese website!

Related labels：

crawl Simple

source：php.cn

Previous article：Summary of eight sorting algorithms implemented in Python (Part 1) Next article：Python development MapReduce series WordCount Demo

Statement of this Website

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Latest Articles by Author

How to add elements to php array

2023-03-14 15:58:02
Example showing JS implementing a simple multiple-choice assessment system

1970-01-01 08:00:00
PHP solution to restrict multiple submissions of the same IP

2023-03-15 07:38:01
Using regular expressions to implement form validation in HTML

1970-01-01 08:00:00
Detailed explanation of this pointing issue in JavaScript strict mode

1970-01-01 08:00:00
Example code for building a tree menu (including multi-level menu) in Java

1970-01-01 08:00:00
Detailed explanation of examples of CSS3 implementing smooth transition when hover leaves

1970-01-01 08:00:00
Swiper carousel image source code sharing analysis

1970-01-01 08:00:00
Summarize and organize VsCode plug-ins

1970-01-01 08:00:00
HttpUtils request tool class code

1970-01-01 08:00:00

Latest Issues

function_exists() cannot determine the custom function Function test () {return true;} if (function_exists ('test')) {echo "test is function...

From 2024-04-29 11:01:01

0

3

2287

How to display the mobile version of Google Chrome Hello teacher, how can I change Google Chrome into a mobile version?

From 2024-04-23 00:22:19

0

11

2423

The child window operates the parent window, but the output does not respond. The first two sentences are executable, but the last sentence cannot be implemented.

From 2024-04-19 15:37:47

0

1

2038

There is no output in the parent window document.onclick = function(){ window.opener.document.write('I am the output of the child ...

From 2024-04-18 23:52:34

0

1

1920

Where is the courseware about CSS mind mapping? Courseware

From 2024-04-16 10:10:18

0

0

1994

Related Topics

More>

Popular Recommendations

Popular Tutorials

More>

Related Tutorials

Popular Recommendations

Latest courses

Latest Downloads

More>

Web Effects

Website Source Code

Website Materials

Front End Template