How to crawl Baidu pages and parse the corresponding strings in PHP

墨辰丷
Release: 2023-03-31 16:24:01

This article shows how to use PHP to get a site's Baidu index count, Baidu's hot search keywords, and the date of a site's Baidu snapshot. Each example fetches a Baidu page and then parses the relevant strings out of it. The details are as follows:

Get the Baidu index count:
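
As a minimal sketch of this step, the count could be pulled out of an already-fetched results page with one regular expression. It assumes the page contains the phrase 找到相关结果数…个, which is quoted in a comment later in this article. The function names, the query URL format, and the sample markup are assumptions for illustration, not the author's code, and Baidu's current markup may differ.

```php
<?php
// Hypothetical helper: extracts the result count from the HTML of a Baidu
// "site:" results page. Returns 0 when the marker phrase is not found.
function extractIndexedCount(string $html): int
{
    // Skip any non-digit filler (such as "约"), then capture digits and commas.
    if (preg_match('/找到相关结果数\D*?([\d,]+)\s*个/u', $html, $m)) {
        return (int) str_replace(',', '', $m[1]);
    }
    return 0;
}

// Hypothetical helper: builds the query URL (assumed format). Fetching is
// left out so the parsing can be tried without network access.
function buildSiteQueryUrl(string $domain): string
{
    return 'https://www.baidu.com/s?wd=' . urlencode('site:' . $domain);
}

// Invented sample markup, for illustration only.
$sample = '<div>百度为您找到相关结果数约1,230个</div>';
echo extractIndexedCount($sample), "\n";
echo buildSiteQueryUrl('www.jb51.net'), "\n";
```

Keeping the parsing separate from the download means the pattern can be adjusted against a saved copy of the page whenever the page layout changes.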


Get Baidu's hot search keywords:

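<?php
function getBaiduHotKeyWord()
{
    // The opening of this function is missing from the source: the line that
    // fetches the hot-words page into $templateRss did not survive.
    // The start of the pattern below ('/<table>') is reconstructed from its closing tag.
    if (preg_match('/<table>(.*)<\/table>/is', $templateRss, $_description)) {
        $templateRss = $_description[0];
        // Escape ampersands so the fragment can be parsed as XML.
        $templateRss = str_replace("&", "&amp;", $templateRss);
    }
    // The string prepended here is empty in the source.
    $templateRss = "" . $templateRss;
    $xml = @simplexml_load_string($templateRss);
    $keyArray = [];
    // Collect the link text from every table row that has one.
    foreach ($xml->tbody->tr as $temp) {
        if (!empty($temp->td->a)) {
            $keyArray[] = trim((string) $temp->td->a);
        }
    }
    return $keyArray;
}
print_r(getBaiduHotKeyWord());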
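
The table-parsing technique used here (cut out the table, escape ampersands, load it with SimpleXML, read the link text in each row) can be tried offline. The following is a rough sketch under stated assumptions: the helper name `extractTableLinkTexts` and the sample markup are invented for illustration and are not taken from Baidu or from the author's code.

```php
<?php
// Hypothetical helper: returns the link texts found in the rows of the
// first <table> in $html, or an empty array if the table cannot be parsed.
function extractTableLinkTexts(string $html): array
{
    if (!preg_match('/<table>(.*)<\/table>/is', $html, $m)) {
        return [];
    }
    // Escape bare ampersands so the fragment is well-formed XML.
    $fragment = preg_replace('/&(?!(?:[a-z]+|#\d+);)/i', '&amp;', $m[0]);

    // Collect libxml errors internally instead of emitting warnings.
    $previous = libxml_use_internal_errors(true);
    $xml = simplexml_load_string($fragment);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);
    if ($xml === false || !isset($xml->tbody->tr)) {
        return [];
    }

    $keywords = [];
    foreach ($xml->tbody->tr as $row) {
        $text = trim((string) $row->td->a);
        if ($text !== '') {
            $keywords[] = $text;
        }
    }
    return $keywords;
}

// Invented sample markup, for illustration only.
$sample = '<html><table><tbody>'
    . '<tr><td><a href="/s?wd=php&x=1">PHP 8</a></td></tr>'
    . '<tr><td>no link here</td></tr>'
    . '<tr><td><a href="/s?wd=xml">SimpleXML</a></td></tr>'
    . '</tbody></table></html>';
print_r(extractTableLinkTexts($sample));
```

Checking the return value of `simplexml_load_string` avoids iterating over `false` when the fetched markup is not well-formed, which is the main way this approach fails on real pages.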

I found this code online and modified it slightly. Save the following code in a PHP file.
Baidu index count and Baidu snapshot date:

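<?php
// The opening of this script is missing from the source. It defined $all and
// $today (the Baidu query URLs), $utf_pattern (the result-count pattern), and
// $kz_pattern, of which only the ending (.*)/" survives.
// $kz_pattern is the string used to locate the snapshot date.
$times = "/\d{4}-\d{1,2}-\d{1,2}/"; /* regular expression matching the snapshot date, e.g. 2011-8-4 */
$s0 = @file_get_contents($all); /* load the site:www.jb51.net results page into $s0 */
$s1 = @file_get_contents($today);
preg_match($utf_pattern, $s0, $all_num); /* match "找到相关结果数*个" (the number of results found) */
preg_match($utf_pattern, $s1, $today_num);
preg_match($kz_pattern, $s0, $temp);
preg_match($times, $temp[0] ?? "", $screenshot);
if (empty($all_num[1])) $all_num[1] = 0;
if (empty($today_num[1])) $today_num[1] = 0;
if (empty($screenshot[0])) $screenshot[0] = "No snapshot yet";
?>
Test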
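Date | Baidu indexed pages | Baidu pages indexed today | Baidu snapshot date

Baidu indexed pages:

Baidu pages indexed today:

Baidu snapshot date: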
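
The snapshot-date step depends on the digit class being written as `\d`; a pattern written with a bare `d` would only match the literal letter. A small sketch of that step follows. The helper name `extractSnapshotDate`, its fallback text, and the sample fragments are assumptions for illustration, not the author's code.

```php
<?php
// Hypothetical helper: returns the first date such as 2011-8-4 found in a
// text fragment, or a fallback message when no date is present.
function extractSnapshotDate(string $fragment): string
{
    // \d{4} is the year; \d{1,2} allows months and days without a leading zero.
    if (preg_match('/\d{4}-\d{1,2}-\d{1,2}/', $fragment, $m)) {
        return $m[0];
    }
    return 'No snapshot yet';
}

// Invented sample fragments, for illustration only.
echo extractSnapshotDate('www.jb51.net/ 2011-8-4 - 百度快照'), "\n";
echo extractSnapshotDate('no date in this fragment'), "\n";
```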


Summary: That is all for this article. I hope it helps with your studies.


source:php.cn