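Fixes for garbled characters when crawling web pages with PHP: 1. Convert the encoding with "mb_convert_encoding"; 2. Add the "curl_setopt($ch, CURLOPT_ENCODING, 'gzip');" option so gzip-compressed responses are decoded; 3. Send a Content-type header with the correct charset at the top of the script.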
Garbled characters when crawling pages with PHP
When garbled characters such as ������� appear in a crawled page, the fixes are as follows:
1. Convert the encoding
$str = mb_convert_encoding($str, "UTF-8", "GBK");
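The conversion above can be wrapped in a small helper, sketched below. The function name toUtf8 and the default source charset are assumptions made for illustration; in practice the source charset should come from the page itself (its Content-Type header or meta tag). The mb_check_encoding guard is a heuristic that avoids converting text that is already valid UTF-8.

```php
<?php
// Hypothetical helper: convert fetched page bytes to UTF-8.
// $sourceCharset is an assumption supplied by the caller; "GBK" mirrors the
// example in the text and is not detected automatically.
function toUtf8(string $html, string $sourceCharset = 'GBK'): string
{
    // If the bytes are already valid UTF-8, return them unchanged so the
    // text is not converted twice. This is a heuristic, not a guarantee.
    if (mb_check_encoding($html, 'UTF-8')) {
        return $html;
    }
    return mb_convert_encoding($html, 'UTF-8', $sourceCharset);
}
```

Calling toUtf8($data) on a GBK page returns UTF-8 text, while a page that is already UTF-8 passes through untouched.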
2. The data is gzip-compressed
When fetching the data with cURL, add the following option so the response is decompressed automatically:
curl_setopt($ch, CURLOPT_ENCODING, 'gzip');
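For context, here is a minimal sketch of a complete cURL request that uses this option. The function name fetchPage, the timeout value, and the redirect setting are assumptions chosen for illustration, not part of the original recipe.

```php
<?php
// Hypothetical helper: fetch a URL with cURL and let cURL decode gzip bodies.
function fetchPage(string $url): string
{
    $ch = curl_init($url);
    curl_setopt($ch, CURLOPT_RETURNTRANSFER, true); // return the body instead of printing it
    curl_setopt($ch, CURLOPT_FOLLOWLOCATION, true); // follow redirects (assumed to be wanted)
    curl_setopt($ch, CURLOPT_ENCODING, 'gzip');     // send Accept-Encoding: gzip and decode the response
    curl_setopt($ch, CURLOPT_TIMEOUT, 10);          // arbitrary timeout for this sketch

    $body = curl_exec($ch);
    if ($body === false) {
        throw new RuntimeException('cURL request failed: ' . curl_error($ch));
    }
    // The handle is released when $ch goes out of scope.
    return $body;
}
```

A call such as $html = fetchPage($url); then yields the decompressed page body, which may still need the encoding conversion from step 1.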
To use file_get_contents instead, the zlib extension must be installed; prefix the URL with the compress.zlib:// wrapper:
$data = file_get_contents("compress.zlib://".$url);
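If the body may or may not be compressed, one defensive option is to check for the gzip signature before decoding. The sketch below assumes the zlib extension is available; the helper name gunzipIfNeeded is hypothetical.

```php
<?php
// Hypothetical helper: decompress a body only when it looks like gzip data.
function gunzipIfNeeded(string $body): string
{
    // Every gzip stream begins with the two bytes 0x1f 0x8b.
    if (strncmp($body, "\x1f\x8b", 2) !== 0) {
        return $body;
    }
    $decoded = gzdecode($body);
    // gzdecode() returns false on corrupt input; fall back to the raw bytes.
    return $decoded === false ? $body : $decoded;
}
```

For example, $data = gunzipIfNeeded(file_get_contents($url)); leaves an uncompressed response unchanged and decodes a gzip-compressed one.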
3. The fetched data is displayed as garbled characters
Add the following code at the top of the script, before any output is sent:
header("Content-type: text/html; charset=utf-8");