Python crawler experts: how do I get past the anti-crawling measures on this website?
曾经蜡笔没有小新 2017-05-18 11:01:00
I am trying to crawl https://www.everysaving.co.uk with Python, but no data comes back. I added request headers and a proxy IP, and it still did not work. I hope someone can give it a try.
The results of accessing the site from different locations are shown in the picture below:
Testing with https://www.17ce.com/ shows that almost all of mainland China is blocked: the site returns HTTP status 403. The site's security policy is fairly strict. I recommend using a high-anonymity proxy, a VPN, or a server located in Europe or the United States, and reducing your crawl frequency.
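The advice above (route traffic through a proxy outside the blocked region and slow the crawl down) could be sketched roughly as follows. This is only a minimal illustration using the standard library: `PROXY_URL` is a hypothetical placeholder, the delay values are arbitrary assumptions, and the helper names `build_opener`, `polite_delay`, `fetch`, and `crawl` are made up for this example.

```python
import random
import time
import urllib.error
import urllib.request

# Hypothetical proxy endpoint located outside mainland China; replace with a real one.
PROXY_URL = "http://proxy.example.com:8080"
TARGET_URL = "https://www.everysaving.co.uk"


def build_opener(proxy_url):
    """Return an opener that routes HTTP and HTTPS traffic through proxy_url."""
    handler = urllib.request.ProxyHandler({"http": proxy_url, "https": proxy_url})
    return urllib.request.build_opener(handler)


def polite_delay(base_seconds=5.0, jitter_seconds=3.0):
    """Pick a randomized pause (assumed values) so requests are not evenly spaced."""
    return base_seconds + random.uniform(0.0, jitter_seconds)


def fetch(opener, url, timeout=20):
    """Return (status, body); an HTTP error such as 403 is returned, not raised."""
    try:
        with opener.open(url, timeout=timeout) as response:
            return response.status, response.read()
    except urllib.error.HTTPError as err:
        return err.code, b""


def crawl(opener, urls):
    """Fetch each URL in turn, pausing between requests to keep the rate low."""
    results = {}
    for url in urls:
        status, body = fetch(opener, url)
        results[url] = (status, len(body))
        if status == 403:
            # Still blocked: the proxy's exit IP is probably rejected as well.
            break
        time.sleep(polite_delay())
    return results
```

With a working proxy, `crawl(build_opener(PROXY_URL), [TARGET_URL])` would return the status code and body length for each URL. A proxy that cannot be reached raises `urllib.error.URLError`, which this sketch deliberately leaves unhandled.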
Capture the browser's traffic with Fiddler, then have your crawler send exactly what the browser sends.
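One way to follow this suggestion is to copy the raw request headers from the capture and replay them from Python. The sketch below assumes the headers were pasted as plain text. The values in `RAW_HEADERS` are illustrative placeholders, not headers actually captured from the site, and `parse_raw_headers` and `build_request` are hypothetical helper names.

```python
import urllib.request

# Header text as it might be pasted from a capture tool's raw view.
# These values are placeholders, not a real capture from the target site.
RAW_HEADERS = """\
GET https://www.everysaving.co.uk/ HTTP/1.1
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64)
Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
Accept-Language: en-GB,en;q=0.9
Accept-Encoding: gzip, deflate
"""

# Headers that should not be replayed verbatim: urllib does not decompress
# gzip responses on its own, and Content-Length depends on the request body.
SKIPPED = {"accept-encoding", "content-length"}


def parse_raw_headers(raw):
    """Turn pasted 'Name: value' lines into a dict, ignoring the request line."""
    headers = {}
    for line in raw.splitlines():
        name, sep, value = line.partition(":")
        name = name.strip()
        # Skip blank lines, lines without a colon, and the request line
        # (header names never contain spaces).
        if not sep or not name or " " in name:
            continue
        if name.lower() in SKIPPED:
            continue
        headers[name] = value.strip()
    return headers


def build_request(url, raw_headers):
    """Build a Request that carries the same headers the browser sent."""
    return urllib.request.Request(url, headers=parse_raw_headers(raw_headers))
```

The resulting `Request` can be passed to `urllib.request.urlopen` or to a proxy-aware opener. Cookies set by earlier responses would need to be copied the same way if the site checks them.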
I can't open that address directly in my browser either. Is it blocked?
I can't open it by clicking the link directly, but it loads when I go through a proxy in Singapore.