In PHP, you can use the strlen() function to calculate the number of bytes in a string. This function calculates the length of a string in bytes and can count the character length (number of bytes) occupied by the string. The syntax is "strlen (string)". In the calculation of this function, English commas, numbers, English letters, decimal points, etc. only occupy one byte; while a GB2312-encoded Chinese character occupies two bytes, and a UTF-8-encoded Chinese character occupies three bytes.
The operating environment of this tutorial: Windows 7 system, PHP version 8.1, DELL G3 computer
Characters occupied by different encoding formats in php The bytes are different:
In the ANSI encoding format, one Chinese character occupies 2 bytes and one English character occupies 1 byte;
UTF-8 encoding format: One Chinese character occupies 3 bytes, and one English character occupies 1 byte.
#In PHP, you can use the strlen() function to calculate the number of bytes in a string.
The strlen() function calculates the length of a string in bytes.
strlen($string)
This function only accepts one parameter $string (the string to be calculated), and returns 0 if the value passed in this parameter is empty.
In the strlen() function, (in English mode) commas, numbers, English letters, decimal points.
, underscores, spaces, etc. only occupy one character in length ( byte).
'; ?>
Analysis: In the string in the above example, there are 10 English letters, a space, and an English exclamation mark; total: 12 bytes.
In the strlen() function, a GB2312-encoded Chinese character occupies two characters in length (bytes), and a UTF-8-encoded Chinese character occupies three characters in length (bytes).
'; ?>
In the above example, "!" is a Chinese exclamation mark, so it occupies three bytes.
'; ?>
#In the above example, it contains two Chinese characters, so it occupies 6 bytes.
'; ?>
In the above example, it contains 5 English characters, 6 Chinese characters, a Chinese exclamation mark and a Chinese comma; therefore it occupies 5 (8*3) = 29 characters Festival.
Note:When calculating the number of characters in a string, the strlen() function can only process English strings and is not friendly to Chinese or mixed Chinese and English strings. At this time, you need to use the
mb_strlen() function, which can calculate the string length based oncharacters
.
Extended knowledge: mb_strlen() function
mb_strlen() function can also return the length of the string. The syntax format is as follows:
mb_strlen($str [, $encoding = mb_internal_encoding()])
$str is the string whose length needs to be calculated
$encoding is an optional parameter, which is the character encoding. If omitted, the internal character encoding is used.
The return value of the mb_strlen() function is the number of characters contained in the string $str with $encoding encoding. If $encoding is invalid, false is returned.
Note: Unlike the strlen() function, in the mb_strlen() function, whether it is Chinese characters, English, numbers, decimal points, underscores and spaces, they only occupy one character in length.
[Example] Use the mb_strlen() function to get the string length.
'; $str = "欢迎来到PHP中文网!"; echo '
字符串 “'.$str.'” 的长度为:'.mb_strlen($str,'utf-8').'
'; ?>
It can be seen that we can well handle the length problem of Chinese strings by setting the corresponding$encoding
parameters.
Recommended learning: "PHP Video Tutorial"
The above is the detailed content of How to calculate how many bytes a string occupies in php. For more information, please follow other related articles on the PHP Chinese website!