This article mainly introduces the method of intercepting n characters starting at a certain position in the GBK document in PHP. Interested friends can refer to it. I hope it will be helpful to everyone.
cut.php:
#!/usr/bin/php <?php define('INPUT_FILE', 't.txt'); define('OUTPUT_FILE', 'a.txt'); $pos = max(intval($argv[1]), 0); $len = max(intval($argv[2]), 0); $file_size = filesize(INPUT_FILE); if($pos >= $file_size) exit; $fp = fopen(INPUT_FILE, 'rb'); $point = 0; //current byte position $string = ''; while(ftell($fp) < $file_size) { if($point >= $pos + $len) break;$byte = fread($fp, 1); //php version >= 5.4 $char = unpack('C', $byte)[1]; if($char <= 0x7f) { //single byte if($point >= $pos) $string .= $byte; $point += 1; continue; } else { //double bytes if($point >= $pos) { $string .= $byte.fread($fp, 1); } else { fseek($fp, 1, SEEK_CUR); } $point += 1; continue; } } fclose($fp); file_put_contents(OUTPUT_FILE, $string); ?>
Source file t.txt content:
dkei20王nnso
Test command:
./cut.php 6 1
View results:
hexdump -C t.txt && hexdump - C a.txt
The above is the entire content of this article, I hope it will be helpful to everyone's study.
Related recommendations:
Processing Sublime Text openingGBKFormat Chinese garbled characters
Details how to solve the problem of Chinese garbled characters in Sublime Text opening GBK format
php Convert the string from GBK# through iconv ##Method to convert to UTF8 character set
The above is the detailed content of PHP implements a method of intercepting n characters starting at a certain position in a GBK document. For more information, please follow other related articles on the PHP Chinese website!