Article Topic Learning Download Q&A Programming Dictionary Game Recent Updates

简体中文(ZH-CN) English(EN) 繁体中文(ZH-TW) 日本語(JA) 한국어(KO) Melayu(MS) Français(FR) Deutsch(DE)

Home> Backend Development> Python Tutorial> body text

python使用urllib模块和pyquery实现阿里巴巴排名查询

WBOY

Release： 2016-06-16 08:45:35

Original

1296 people have browsed it

urllib基础模块的应用，通过该类获取到url中的html文档信息，内部可以重写代理的获取方法

复制代码代码如下:

class ProxyScrapy(object):
def __init__(self):
self.proxy_robot = ProxyRobot()
self.current_proxy = None
self.cookie = cookielib.CookieJar()

def __builder_proxy_cookie_opener(self):
cookie_handler = urllib2.HTTPCookieProcessor(self.cookie)
handlers = [cookie_handler]

if PROXY_ENABLE:
self.current_proxy = ip_port = self.proxy_robot.get_random_proxy()
proxy_handler = urllib2.ProxyHandler({'http': ip_port[7:]})
handlers.append(proxy_handler)

opener = urllib2.build_opener(*handlers)
urllib2.install_opener(opener)
return opener

def get_html_body(self,url):
opener = self.__builder_proxy_cookie_opener()

request=urllib2.Request(url)
#request.add_header("Accept-Encoding", "gzip,deflate,sdch")
#request.add_header("Accept", "text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8")
#request.add_header("Cache-Control", "no-cache")
#request.add_header("Connection", "keep-alive")

try:
response = opener.open(request,timeout=2)

http_code = response.getcode()
if http_code == 200:
if PROXY_ENABLE:
self.proxy_robot.handle_success_proxy(self.current_proxy)
html = response.read()
return html
else:
if PROXY_ENABLE:
self.proxy_robot.handle_double_proxy(self.current_proxy)
return self.get_html_body(url)
except Exception as inst:
print inst,self.current_proxy
self.proxy_robot.handle_double_proxy(self.current_proxy)
return self.get_html_body(url)

Related labels：

urllib模块 pyquery 阿里巴巴排名查询

source：php.cn

Previous article：使用python的chardet库获得文件编码并修改编码 Next article：使用go和python递归删除.ds store文件的方法

Statement of this Website

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Latest Articles by Author

MATIC Will Be Upgraded to POL Tomorrow – Polygon | Aggregated

2024-09-05 03:46:10
Ethervista (VISTA) Surges 120% as Meme Coin Creation Platform Gains Momentum

2024-09-05 03:44:10
Dogecoin (DOGE) Price Prediction: Bears Remain in Control

2024-09-05 03:43:10
Popcat Value Prediction as POPCAT/USDT Perpetual Futures Trading Goes Live on OKX

2024-09-05 03:39:10
NFT Market Faces Unprecedented Crisis, with a Shocking 96% of NFTs Now Considered 'Dead'

2024-09-05 03:34:23
Polygon Replaces Its Native $MATIC Token With $POL to Improve Efficiency and Functionality

2024-09-05 03:32:10
Uniswap Labs Settles with CFTC after Violating Derivatives Trading Regulations

2024-09-05 03:31:10
DTX Exchange Presale Emerges as a Potential Lifeline for NEAR and SUI Traders Amidst Market Turbulence

2024-09-05 03:30:10
ETFSwap (ETFS) Presale Shakes Crypto Market As It Gears Up For Massive Rally

2024-09-05 03:28:10
VC Giants a16z, Union Square Ventures Get Subpoenaed by New York About Uniswap: Sources

2024-09-05 03:27:10

Latest Issues

Module is not defined in Vue project I just created a new Vue application by running npmini tvue@latest as specified in the off...

From 2023-11-17 12:38:53

0

2

394

How to set Laravel Spatie permission setting method to define a set of permissions for each user based on role? I have 4 types of users using my system: 1. Super Admins 2. Team Super Admins, 3. Administ...

From 2023-11-14 12:58:58

0

1

292

Sass error: File already loaded: @import '~src/css/quasar.variables.scss', 'quasar/src/css/variables.sass'; src\css\quasar.variables.scss I'm using the quasar framework and while compiling my project I encountered the above erro...

From 2023-11-06 21:38:55

0

1

219

When the React application is running, the module '@babel/plugin-proposal-private-property-in-object' cannot be found. I created a React application using npxcreate-react-appmy_app but when I run the applicati...

From 2023-11-03 17:03:48

0

1

307

Call a stub module function in the same module I can't find a way to stub a function called from the same module where the function is de...

From 2023-11-03 13:47:45

0

2

197

Related Topics

More>

Popular Recommendations

Popular Tutorials

More>

Related Tutorials

Popular Recommendations

Latest courses

Latest Downloads

More>

Web Effects

Website Source Code

Website Materials

Front End Template

About us Disclaimer Sitemap: php.cn：Public welfare online PHP training，Help PHP learners grow quickly！