Fetching URLs and web page content

WBOY

Released: 2016-06-23 14:38:32


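My skills are limited, so I spend my days browsing forums. I have seen many posts about fetching web page content (file_get_contents) and fetching URLs (I don't know what is used for that), and I'm quite interested. Could the experts here explain how this works? Ideally, give me some source code I can use as a reference.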

Replies (solutions)

Search Baidu yourself first.

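I have written working examples using PHP sockets and curl. file_get_contents is even simpler, and the principle is the same. Take a look, and please point out any shortcomings: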
http://blog.csdn.net/zkg510168343/article/details/12996699
http://blog.csdn.net/zkg510168343/article/details/16983161

curl
The manual has examples; you really do need to read the manual.
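
For readers who want something concrete next to the manual, here is a minimal sketch of a cURL fetch. The helper name `fetch_with_curl`, the timeout values, and the user agent string are assumptions made for illustration and do not come from the thread; the cURL functions and options are the standard ones from the PHP manual.

```php
<?php
// Hypothetical helper: fetch a URL with cURL and return the body, or null on failure.
function fetch_with_curl(string $url, int $timeout = 10): ?string
{
    $ch = curl_init($url);
    if ($ch === false) {
        return null;
    }
    curl_setopt_array($ch, [
        CURLOPT_RETURNTRANSFER => true,      // return the body instead of printing it
        CURLOPT_FOLLOWLOCATION => true,      // follow HTTP redirects
        CURLOPT_MAXREDIRS      => 5,
        CURLOPT_CONNECTTIMEOUT => $timeout,  // seconds allowed for connecting
        CURLOPT_TIMEOUT        => $timeout,  // seconds allowed for the whole request
        CURLOPT_USERAGENT      => 'Mozilla/5.0 (compatible; example-fetcher)',
    ]);
    $body   = curl_exec($ch);
    $status = curl_getinfo($ch, CURLINFO_HTTP_CODE);

    // Treat transport errors and HTTP error statuses as failure
    if ($body === false || $status >= 400) {
        return null;
    }
    return $body;
}
```

A call such as `$html = fetch_with_curl('http://www.iheima.com/');` would then return the page source, or null if the request failed.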

Search Baidu; you will find plenty.

Search Baidu for "php 采集" (PHP scraping).

Search Baidu for file_get_contents() and curl scraping.
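
A minimal sketch of the file_get_contents() route, assuming `allow_url_fopen` is enabled in php.ini (it is required for fetching http:// URLs this way). The helper name `fetch_page`, the timeout, and the user agent string are illustrative assumptions, not something taken from the thread.

```php
<?php
// Hypothetical helper: fetch a page with file_get_contents(), returning null on failure.
function fetch_page(string $url, int $timeout = 10): ?string
{
    // These options only take effect for http:// and https:// URLs
    $context = stream_context_create([
        'http' => [
            'method'          => 'GET',
            'timeout'         => $timeout,
            'follow_location' => 1,
            'user_agent'      => 'Mozilla/5.0 (compatible; example-fetcher)',
        ],
    ]);
    // @ suppresses the warning raised on failure; the false return value is checked instead
    $body = @file_get_contents($url, false, $context);
    return $body === false ? null : $body;
}
```

Compared with cURL this gives less control over the request, but for a simple GET it is usually enough.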

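$url = 'http://www.iheima.com/';
$con = file_get_contents($url);
if ($con) {
    // Pattern as posted: its HTML tags did not survive, so only two of the three
    // groups remain. Intended groups: 1 = article URL, 2 = title, 3 = description.
    preg_match_all('/

(.+).*
(.+)/isU', $con, $temp, PREG_SET_ORDER);
    foreach ($temp as $key => $v) {
        $title = $v[2];
        $v_url = $v[1];
        $des   = $v[3] ?? '';
        // Fetch each linked article page
        $con_url = file_get_contents($v_url);
        if ($con_url) {
            // Keywords from the meta tag, without leading or trailing commas
            $tags = '';
            if (preg_match('/keywords" content="(.+)"/isU', $con_url, $m)) {
                $tags = trim($m[1], ',');
            }
            // Article body from the element with class "txs_Content"
            $txt = '';
            if (preg_match('/class="txs_Content".*>(.+)/isU', $con_url, $m)) {
                $txt = $m[1];
            }
        }
    }
}
That should be clear enough.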
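
On the "fetching URLs" half of the question: regular expressions work for one known page layout, but a parser is usually less fragile. The sketch below swaps in PHP's built-in DOMDocument (not what the reply above uses) to collect every link in a page. The helper name `extract_links` is an assumption made for illustration.

```php
<?php
// Hypothetical helper: collect the distinct href values of all <a> elements in an HTML string.
function extract_links(string $html): array
{
    if ($html === '') {
        return [];
    }
    $doc = new DOMDocument();
    // Real-world HTML is rarely valid; collect parser errors instead of emitting warnings
    $previous = libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $links = [];
    foreach ($doc->getElementsByTagName('a') as $a) {
        $href = trim($a->getAttribute('href'));
        // Skip anchors without an href and hrefs already collected
        if ($href !== '' && !in_array($href, $links, true)) {
            $links[] = $href;
        }
    }
    return $links;
}
```

The hrefs come back exactly as written in the page, so relative ones such as `/news/1.html` still need to be resolved against the page URL before they can be fetched.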

There is also the open-source simple_html_dom library:
$html = file_get_html('http://www.baidu.com');
It lets you extract page content in various ways, for example by id or by CSS selector.
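
If installing a third-party library is not an option, selection by id or class can be approximated with PHP's built-in DOMDocument and DOMXPath. This is a sketch of that alternative, not of simple_html_dom itself; the helper name `select_text` and the sample HTML are assumptions made for illustration.

```php
<?php
// Hypothetical helper: return the trimmed text of every node matching an XPath query.
function select_text(string $html, string $xpath): array
{
    if ($html === '') {
        return [];
    }
    $doc = new DOMDocument();
    // Collect parser errors instead of emitting warnings for imperfect HTML
    $previous = libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $nodes = (new DOMXPath($doc))->query($xpath);
    if ($nodes === false) {
        return [];  // the XPath expression was invalid
    }
    $results = [];
    foreach ($nodes as $node) {
        $results[] = trim($node->textContent);
    }
    return $results;
}

// Sample markup, invented for this example
$html = '<div id="main"><p class="txs_Content">Hello</p><p class="other">World</p></div>';

// Select by id
$byId = select_text($html, '//*[@id="main"]');

// Select by CSS class (also matches when the element carries several classes)
$byClass = select_text(
    $html,
    '//*[contains(concat(" ", normalize-space(@class), " "), " txs_Content ")]'
);
```

XPath is more verbose than CSS selectors, which is a large part of why libraries such as simple_html_dom are popular for this kind of work.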


source：php.cn
