php method to convert characters into entities: 1. Use the htmlentities() function to convert characters into HTML entities; 2. Use the htmlspecialchars() function to convert some predefined characters ("&" , "6745434bf92a2979d589ede8e8e7bea9", etc.) into HTML entities.
The operating environment of this tutorial: windows7 system, PHP7.1 version, DELL G3 computer
php Convert characters into entities
1. Use the htmlentities() function
htmlentities() function to convert characters into HTML entities.
Syntax:
htmlentities(string,flags,character-set,double_encode)
Parameters |
Description |
string |
Required. Specifies the string to be converted. |
flags |
Optional. Specifies how to handle quotes, invalid encodings, and which document type to use. Available quote types:
- ENT_COMPAT - Default. Only double quotes are encoded.
- ENT_QUOTES - Encodes double and single quotes.
- ENT_NOQUOTES - Do not encode any quotes.
Invalid encoding:
- ENT_IGNORE - Ignore invalid encodings instead of having the function return an empty string. This should be avoided as this may have an impact on security.
- ENT_SUBSTITUTE - Substitutes an invalid encoding with the specified character with the Unicode substitution character U FFFD (UTF-8) or FFFD; instead of returning an empty string.
- ENT_DISALLOWED - Replace invalid code points in the specified document type with the Unicode replacement character U FFFD (UTF-8) or FFFD;.
Additional flags specifying the document type to use:
- ENT_HTML401 - Default. Code processed as HTML 4.01.
- ENT_HTML5 - Process code as HTML 5.
- ENT_XML1 - Code processed as XML 1.
- ENT_XHTML - Processing code as XHTML.
|
character-set |
Optional. A string specifying the character set to be used. Allowed values:
- UTF-8 - Default. ASCII compatible multi-byte 8-bit Unicode
- ISO-8859-1 - Western Europe
- ISO-8859-15 - Western Europe (adds French and Finnish euro symbols missing from ISO-8859-1 Chinese letters)
- cp866 - DOS-specific Cyrillic character set
- cp1251 - Windows-specific Cyrillic character set
- cp1252 - Windows-specific Western European character set
- KOI8- R - Russian
- BIG5 - Traditional Chinese, mainly used in Taiwan
- GB2312 - Simplified Chinese, National Standard Character Set
- BIG5-HKSCS - Big5## with Hong Kong extension
#Shift_JIS - Japanese - EUC-JP - Japanese
- MacRoman - Character set used by Mac operating systems
-
Comments: In versions prior to PHP 5.4, unrecognized character sets were ignored and replaced by ISO-8859-1. As of PHP 5.4, unrecognized character sets are ignored and replaced by UTF-8.
|
double_encode
| Optional. A Boolean value that specifies whether to encode existing HTML entities. TRUE - Default. Each entity will be converted. - FALSE - Existing HTML entities will not be encoded.
-
|
示例:通过使用西欧字符集,把一些字符转换为 HTML 实体:
<?php
$str = "My name is Øyvind Åsane. I&#39;m Norwegian.";
echo htmlentities($str, ENT_QUOTES, "ISO-8859-1"); // Will only convert double quotes (not single quotes), and uses the character-set Western European
?>
上面代码的 HTML 输出如下(查看源代码):
<!DOCTYPE html>
<html>
<body>
My name is &Oslash;yvind &Aring;sane. I&#039;m Norwegian.
</body>
</html>
上面代码的浏览器输出如下:
My name is Øyvind Åsane. I&#39;m Norwegian.
2、使用htmlspecialchars()函数
htmlspecialchars() 函数把一些预定义的字符转换为 HTML 实体。
预定义的字符是:
语法:
htmlspecialchars(string,flags,character-set,double_encode)
参数 |
描述 |
string |
必需。规定要转换的字符串。 |
flags |
可选。规定如何处理引号、无效的编码以及使用哪种文档类型。 可用的引号类型:
- ENT_COMPAT - 默认。仅编码双引号。
- ENT_QUOTES - 编码双引号和单引号。
- ENT_NOQUOTES - 不编码任何引号。
无效的编码:
- ENT_IGNORE - 忽略无效的编码,而不是让函数返回一个空的字符串。应尽量避免,因为这可能对安全性有影响。
- ENT_SUBSTITUTE - 把无效的编码替代成一个指定的带有 Unicode 替代字符 U+FFFD(UTF-8)或者 FFFD; 的字符,而不是返回一个空的字符串。
- ENT_DISALLOWED - 把指定文档类型中的无效代码点替代成 Unicode 替代字符 U+FFFD(UTF-8)或者 FFFD;。
规定使用的文档类型的附加 flags:
- ENT_HTML401 - 默认。作为 HTML 4.01 处理代码。
- ENT_HTML5 - 作为 HTML 5 处理代码。
- ENT_XML1 - 作为 XML 1 处理代码。
- ENT_XHTML - 作为 XHTML 处理代码。
|
character-set |
可选。一个规定了要使用的字符集的字符串。 允许的值:
- UTF-8 - 默认。ASCII 兼容多字节的 8 位 Unicode
- ISO-8859-1 - 西欧
- ISO-8859-15 - 西欧(加入欧元符号 + ISO-8859-1 中丢失的法语和芬兰语字母)
- cp866 - DOS 专用 Cyrillic 字符集
- cp1251 - Windows 专用 Cyrillic 字符集
- cp1252 - Windows 专用西欧字符集
- KOI8-R - 俄语
- BIG5 - 繁体中文,主要在台湾使用
- GB2312 - 简体中文,国家标准字符集
- BIG5-HKSCS - 带香港扩展的 Big5
- Shift_JIS - 日语
- EUC-JP - 日语
- MacRoman - Mac 操作系统使用的字符集
注释:在 PHP 5.4 之前的版本,无法被识别的字符集将被忽略并由 ISO-8859-1 替代。自 PHP 5.4 起,无法被识别的字符集将被忽略并由 UTF-8 替代。
|
double_encode |
可选。一个规定了是否编码已存在的 HTML 实体的布尔值。
- TRUE - 默认。将对每个实体进行转换。
- FALSE - 不会对已存在的 HTML 实体进行编码。
|
返回值::
示例:把一些预定义的字符转换为 HTML 实体
<?php
$str = "Jane & &#39;Tarzan&#39;";
echo htmlspecialchars($str, ENT_COMPAT); // 默认,仅编码双引号
echo "<br>";
echo htmlspecialchars($str, ENT_QUOTES); // 编码双引号和单引号
echo "<br>";
echo htmlspecialchars($str, ENT_NOQUOTES); // 不编码任何引号
?>
输出结果:
Jane & &#39;Tarzan&#39;
Jane & &#39;Tarzan&#39;
Jane & &#39;Tarzan&#39;
推荐学习:《PHP视频教程》
The above is the detailed content of How to convert characters into entities in php. For more information, please follow other related articles on the PHP Chinese website!
Statement:The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn