python - Besides proxy IPs, is there a better way to crawl mainland China websites from a Hong Kong server?

Question

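I'm building a Taobao crawler, but it runs on a Hong Kong server, and I'm stuck: every time I fetch the Taobao home page, I get redirected automatically to the Hong Kong version of Taobao, so both the page source and the content are different. How should I handle this situation? Put simply, for example when scraping 58同城 (58.com), if I...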

PHP中文网 · Answer

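Disable redirect following so you can inspect the redirect yourself. Using requests as an example:

import requests

# Do not follow redirects automatically; return the 3xx response itself.
r = requests.get('http://github.com/', allow_redirects=False)
r.status_code  # a redirect status such as 301 or 302
r.url  # http://github.com/ (the URL that was requested, not https)
r.headers['Location']  # https://github.com/ -- the redirect destination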
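To tell a harmless redirect (such as http to https on the same host) from a regional one that sends you to a different site, you can compare hostnames. The following is only a sketch: the helper names `redirect_target` and `leaves_host` are made up for illustration, and it assumes the regional version lives on a different hostname, which the thread does not confirm for Taobao.

```python
from urllib.parse import urljoin, urlsplit

# HTTP status codes that carry a Location header pointing elsewhere.
REDIRECT_CODES = {301, 302, 303, 307, 308}


def redirect_target(request_url, status_code, headers):
    """Return the absolute redirect destination, or None if this is not a redirect."""
    if status_code not in REDIRECT_CODES:
        return None
    location = headers.get("Location")
    if not location:
        return None
    # Location may be relative, so resolve it against the requested URL.
    return urljoin(request_url, location)


def leaves_host(request_url, target_url):
    """True when the redirect points at a different hostname (hypothetical helper)."""
    return urlsplit(request_url).hostname != urlsplit(target_url).hostname
```

With the requests response from above, this could be called as `redirect_target(r.url, r.status_code, r.headers)`. For the github.com example the target stays on the same host, so `leaves_host` returns False.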

PHP中文网 · Answer

If you want to collect the Beijing listings, just put the city name in the URL, although the page is protected by a PGTID parameter:

http://bj.58.com/?PGTID=0d000...

I suggest using selenium.
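A rough sketch of that suggestion, assuming Selenium 4 with a local Chrome install. The function names `city_url` and `fetch_rendered_html` are hypothetical, and the idea that other cities use the same subdomain pattern as `bj` is an assumption; only the Beijing URL appears in this thread. The PGTID value is left out because it is truncated above.

```python
def city_url(city_code: str) -> str:
    """Build a 58.com city home page URL, e.g. 'bj' -> http://bj.58.com/.

    Assumption: every city follows the same subdomain pattern as Beijing.
    """
    return f"http://{city_code}.58.com/"


def fetch_rendered_html(url: str) -> str:
    """Load the page in headless Chrome and return the rendered HTML."""
    # Imported here so city_url() can be used without selenium installed.
    from selenium import webdriver

    options = webdriver.ChromeOptions()
    options.add_argument("--headless")
    driver = webdriver.Chrome(options=options)
    try:
        driver.get(url)
        return driver.page_source
    finally:
        # Always close the browser, even if loading the page fails.
        driver.quit()


if __name__ == "__main__":
    html = fetch_rendered_html(city_url("bj"))
    print(len(html))
```

A real browser executes the page's JavaScript and keeps cookies, which is probably why it copes better with parameters like PGTID than a bare HTTP request. It does not change which IP address the site sees.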

迷茫 · Answer

Sometimes the server redirects based on the geographic location associated with your IP address. In that case you probably have no option other than finding a proxy.
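For reference, this is roughly how a proxy is passed to requests. It is a minimal sketch: the proxy address is a placeholder from a documentation-only IP range, not a working proxy, and `build_proxies` and `fetch_via_proxy` are names invented for this example.

```python
def build_proxies(proxy_url: str) -> dict:
    """Route both http and https traffic through one proxy, in the mapping requests expects."""
    return {"http": proxy_url, "https": proxy_url}


def fetch_via_proxy(url: str, proxy_url: str, timeout: float = 10.0):
    """Fetch url through the proxy without following redirects."""
    # Imported here so build_proxies() can be used without requests installed.
    import requests

    return requests.get(
        url,
        proxies=build_proxies(proxy_url),
        timeout=timeout,
        allow_redirects=False,  # keep the redirect visible for inspection
    )


if __name__ == "__main__":
    # Placeholder address; replace with a proxy located in mainland China.
    r = fetch_via_proxy("http://bj.58.com/", "http://203.0.113.10:8080")
    print(r.status_code, r.headers.get("Location"))
```

Keeping `allow_redirects=False` here makes it easy to check whether the site still tries to redirect once the request goes out through the proxy.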
