PHP使用Snoopy类实现页面抓取的方法-tutoriel php-php.cn

本篇文章主要介绍PHP使用Snoopy类实现页面抓取的方法，感兴趣的朋友参考下，希望对大家有所帮助。

本文实例讲述了php中Snoopy类用法，具体分析如下：

这里演示了php中如何通过Snoopy抓取网页信息

/* You need the snoopy.class.php from http://snoopy.sourceforge.net/ */ include("snoopy.class.php"); $snoopy = new Snoopy; // need an proxy?: //$snoopy->proxy_host = "my.proxy.host"; //$snoopy->proxy_port = "8080"; // set browser and referer: $snoopy->agent = "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"; $snoopy->referer = "http://www.jonasjohn.de/"; // set some cookies: $snoopy->cookies["SessionID"] = '238472834723489'; $snoopy->cookies["favoriteColor"] = "blue"; // set an raw-header: $snoopy->rawheaders["Pragma"] = "no-cache"; // set some internal variables: $snoopy->maxredirs = 2; $snoopy->offsiteok = false; $snoopy->expandlinks = false; // set username and password (optional) //$snoopy->user = "joe"; //$snoopy->pass = "bloe"; // fetch the text of the website www.google.com: if($snoopy->fetchtext("http://www.google.com")){ // other methods: fetch, fetchform, fetchlinks, submittext and submitlinks // response code: print "response code: ".$snoopy->response_code."
\n"; // print the headers: print "Headers:
"; while(list($key,$val) = each($snoopy->headers)){ print $key.": ".$val."
\n"; } print "
\n"; // print the texts of the website: print "".htmlspecialchars($snoopy->results)."\n"; } else { print "Snoopy: error while fetching document: ".$snoopy->error."\n"; }
        
         Copier après la connexion

总结：以上就是本篇文的全部内容，希望能对大家的学习有所帮助。

相关推荐：

PHP基于memcache实现环形队列的方法

php操作图片的大小修改、加水印、生成验证码、输出及保存

PHP读取配置文件类实例

Ce qui précède est le contenu détaillé de. pour plus d'informations, suivez d'autres articles connexes sur le site Web de PHP en chinois!