This set of "Python Practical Crawler Video Tutorial" is a very powerful python practical video tutorial. Friends who already have a certain understanding of Python and have mastered Python and want to take a step further can learn this set of tutorials!
Course playback address: //m.sbmmt.com/course/603.html
The teacher’s teaching style:
The teacher’s lectures are simple, clear, layer-by-layer analysis, interlocking, rigorous argumentation, rigorous structure, and use the logical power of thinking to attract students’ attention Strength, use reason to control the classroom teaching process. By listening to the teacher's lectures, students not only learn knowledge, but also receive thinking training, and are also influenced and influenced by the teacher's rigorous academic attitude
The more difficult point in this video is the Python crawler:
When we browse the Internet every day, we often see some good-looking pictures. We want to save and download these pictures, or users can use Make desktop wallpaper, or use it as design material.
Our most common method is to right-click the mouse and select Save As. However, some pictures do not have a save as option when you right-click the mouse. The other way is to capture them with a screenshot tool, but this will reduce the clarity of the picture. Okay~! In fact, you are very good. Right-click to view the page source code.
We can use python to implement such a simple crawler function and crawl the code we want locally. Let's take a look at how to use python to implement such a function.
1. Get the entire page data
First we can get the entire page information of the image to be downloaded.
getjpg.py
#coding=utf-8 import urllib def getHtml(url): page = urllib.urlopen(url) html = page.read() return html html = getHtml("http://tieba.baidu.com/p/2738151262") print html
The Urllib module provides an interface for reading web page data. We can read it like a local file. Data on www and ftp. First, we define a getHtml() function:
The urllib.urlopen() method is used to open a URL address.
The read() method is used to read the data on the URL, pass a URL to the getHtml() function, and download the entire page. Executing the program will print out the entire web page.
The above is the detailed content of Recommended materials for Python practical crawler video tutorials. For more information, please follow other related articles on the PHP Chinese website!