python - Problems encountered when crawling all items of a Taobao shop?
天蓬老师 2017-04-18 10:05:34

All replies (1)
阿神

After careful analysis I finally worked it out. I originally assumed Taobao encrypted these numbers with some algorithm to restrict crawlers, but in fact it does not.

First, the 152 in jsonp152 carries no meaning. It is a value related to the time the request is sent, but that has no effect on us: any number works. For example, callback=jsonp122 returns jsonp122("{data:...}"). Just include some callback value when you send the request and treat it as a random number.
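Because the callback name is arbitrary, a crawler only needs to peel off whatever wrapper comes back. The following is a minimal sketch of that step, not Taobao-specific code: the helper names strip_jsonp and load_jsonp are made up for illustration, and it assumes the payload inside the parentheses is valid JSON.

```python
import json
import re
from typing import Any, Tuple

# Matches `callbackName( ... )` with an optional trailing semicolon.
_JSONP_RE = re.compile(r"^\s*([A-Za-z_$][\w$]*)\s*\((.*)\)\s*;?\s*$", re.DOTALL)


def strip_jsonp(body: str) -> Tuple[str, str]:
    """Split a JSONP response into (callback_name, payload_text)."""
    match = _JSONP_RE.match(body)
    if match is None:
        raise ValueError("response is not wrapped in a JSONP callback")
    return match.group(1), match.group(2)


def load_jsonp(body: str) -> Any:
    """Decode the payload, assuming it is valid JSON (hypothetical helper)."""
    _, payload = strip_jsonp(body)
    return json.loads(payload)
```

If the payload turns out to be a quoted string, as in the jsonp122("{data:...}") example, load_jsonp returns that string and any further parsing of its contents is up to the caller.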

Second, 12133771151 is extracted from the page returned by requesting "https://tianziyujushangcheng.taobao.com/search.htm".

The overall flow is: the browser requests https://tianziyujushangcheng...., parses some JS files in the returned page, and then sends a request to https://tianziyujushangcheng.... to retrieve the product information. This is why you cannot get specific product information by crawling the store homepage directly. You need to analyze the entire sequence of network requests and find this XHR request to get the information you want.
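The two-step flow described above could be sketched roughly as follows. This is an illustration under assumptions, not the actual Taobao interface: the function names, the id_pattern regex, and the build_async_url callback are all hypothetical, and the real XHR URL (truncated above) has to be read from the browser's network panel and supplied by the caller.

```python
import re
import urllib.request
from typing import Callable


def fetch(url: str) -> str:
    """Download a URL as text using the standard library."""
    req = urllib.request.Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urllib.request.urlopen(req, timeout=10) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset, errors="replace")


def crawl_items(
    search_url: str,
    build_async_url: Callable[[str], str],
    id_pattern: str,
    fetcher: Callable[[str], str] = fetch,
) -> str:
    """Step 1: load the search page and pull the numeric ID out of it.
    Step 2: request the XHR URL built from that ID and return the raw body.

    id_pattern is an assumed regex with one capture group; how the ID
    actually appears in the page must be checked against the real HTML.
    """
    page = fetcher(search_url)
    match = re.search(id_pattern, page)
    if match is None:
        raise LookupError("ID not found in the search page")
    return fetcher(build_async_url(match.group(1)))
```

The body returned by the second request is still wrapped in the jsonpNNN(...) callback, so it needs to be unwrapped before the product data can be read. Passing fetcher as a parameter keeps the flow testable without network access.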
