I haven’t blogged for a few days. In fact, there is a lot that needs to be summarized because I have been busy working on projects recently. However, the problem of intercepting garbled strings containing a mixture of Chinese and English in the Smarty template that has puzzled me for several days has finally been solved. , so record it, friends who need it can take a look:
The reason for garbled characters:
For string interception, the truncate function is only suitable for English users. For Chinese users, using truncate will cause garbled characters. Moreover, for Chinese and English mixed strings, the actual display lengths of the same number of intercepted strings are different. The length of one Chinese character is roughly equivalent to the length of two English characters. In addition, truncate is not compatible with GB2312, UTF-8 and other encodings at the same time.
Solution: Write an extension class yourself and use
The class file location of smarty’s truncate variable regulator used by ThinkPHP: ThinkPHPLibraryVendorSmartyplugins, one of which is modifier.truncate.php, we don’t use this, we use it ourselves Write one to implement
File name: modifier.smartTruncate.php
<?<span>php </span><span>/*</span><span>* * 中英文多编码字符串截取 </span><span>*/</span><span>function</span> smartDetectUTF8(<span>$string</span><span>) { </span><span>static</span><span>$result</span> = <span>array</span><span>(); </span><span>if</span>(! <span>array_key_exists</span>(<span>$key</span> = <span>md5</span>(<span>$string</span>), <span>$result</span><span>)) { </span><span>$utf8</span> = "<span> /^(?: [\x09\x0A\x0D\x20-\x7E] # ASCII | [\xC2-\xDF][\x80-\xBF] # non-overlong 2-byte | \xE0[\xA0-\xBF][\x80-\xBF] # excluding overlongs | [\xE1-\xEC\xEE\xEF][\x80-\xBF]{2} # straight 3-byte | \xED[\x80-\x9F][\x80-\xBF] # excluding surrogates | \xF0[\x90-\xBF][\x80-\xBF]{2} # planes 1-3 | [\xF1-\xF3][\x80-\xBF]{3} # planes 4-15 | \xF4[\x80-\x8F][\x80-\xBF]{2} # plane 16 )+$/xs </span>"<span>; </span><span>$result</span>[<span>$key</span>] = <span>preg_match</span>(<span>trim</span>(<span>$utf8</span>), <span>$string</span><span>); } </span><span>return</span><span>$result</span>[<span>$key</span><span>]; } </span><span>function</span> smartStrlen(<span>$string</span><span>) { </span><span>$result</span> = 0<span>; </span><span>$number</span> = smartDetectUTF8(<span>$string</span>) ? 3 : 2<span>; </span><span>for</span>(<span>$i</span> = 0; <span>$i</span> < <span>strlen</span>(<span>$string</span>); <span>$i</span> += <span>$bytes</span><span>) { </span><span>$bytes</span> = <span>ord</span>(<span>substr</span>(<span>$string</span>, <span>$i</span>, 1)) > 127 ? <span>$number</span> : 1<span>; </span><span>$result</span> += <span>$bytes</span> > 1 ? 1.0 : 0.5<span>; } </span><span>return</span><span>$result</span><span>; } </span><span>function</span> smartSubstr(<span>$string</span>, <span>$start</span>, <span>$length</span> = <span>null</span><span>) { </span><span>$result</span> = ''<span>; </span><span>$number</span> = smartDetectUTF8(<span>$string</span>) ? 3 : 2<span>; </span><span>if</span>(<span>$start</span> < 0<span>) { </span><span>$start</span> = <span>max</span>(smartStrlen(<span>$string</span>) + <span>$start</span>, 0<span>); } </span><span>for</span>(<span>$i</span> = 0; <span>$i</span> < <span>strlen</span>(<span>$string</span>); <span>$i</span> += <span>$bytes</span><span>) { </span><span>if</span>(<span>$start</span> <= 0<span>) { </span><span>break</span><span>; } </span><span>$bytes</span> = <span>ord</span>(<span>substr</span>(<span>$string</span>, <span>$i</span>, 1)) > 127 ? <span>$number</span> : 1<span>; </span><span>$start</span> -= <span>$bytes</span> > 1 ? 1.0 : 0.5<span>; } </span><span>if</span>(<span>is_null</span>(<span>$length</span><span>)) { </span><span>$result</span> = <span>substr</span>(<span>$string</span>, <span>$i</span><span>); } </span><span>else</span><span> { </span><span>for</span>(<span>$j</span> = <span>$i</span>; <span>$j</span> < <span>strlen</span>(<span>$string</span>); <span>$j</span> += <span>$bytes</span><span>) { </span><span>if</span>(<span>$length</span> <= 0<span>) { </span><span>break</span><span>; } </span><span>if</span>((<span>$bytes</span> = <span>ord</span>(<span>substr</span>(<span>$string</span>, <span>$j</span>, 1)) > 127 ? <span>$number</span> : 1) > 1<span>) { </span><span>if</span>(<span>$length</span> < 1.0<span>) { </span><span>break</span><span>; } </span><span>$result</span> .= <span>substr</span>(<span>$string</span>, <span>$j</span>, <span>$bytes</span><span>); </span><span>$length</span> -= 1.0<span>; } </span><span>else</span><span> { </span><span>$result</span> .= <span>substr</span>(<span>$string</span>, <span>$j</span>, 1<span>); </span><span>$length</span> -= 0.5<span>; } } } </span><span>return</span><span>$result</span><span>; } </span><span>function</span> smarty_modifier_smartTruncate(<span>$string</span>, <span>$length</span> = 80, <span>$etc</span> = '...',<span>$break_words</span> = <span>false</span>, <span>$middle</span> = <span>false</span><span>) { </span><span>if</span> (<span>$length</span> == 0<span>) </span><span>return</span> ''<span>; </span><span>if</span> (smartStrlen(<span>$string</span>) > <span>$length</span><span>) { </span><span>$length</span> -= smartStrlen(<span>$etc</span><span>); </span><span>if</span> (!<span>$break_words</span> && !<span>$middle</span><span>) { </span><span>$string</span> = <span>preg_replace</span>('/\s+?(\S+)?$/', '', smartSubstr(<span>$string</span>, 0, <span>$length</span>+1<span>)); } </span><span>if</span>(!<span>$middle</span><span>) { </span><span>return</span> smartSubstr(<span>$string</span>, 0, <span>$length</span>).<span>$etc</span><span>; } </span><span>else</span><span> { </span><span>return</span> smartSubstr(<span>$string</span>, 0, <span>$length</span>/2) . <span>$etc</span> . smartSubstr(<span>$string</span>, -<span>$length</span>/2<span>); } } </span><span>else</span><span> { </span><span>return</span><span>$string</span><span>; } } </span>?>
Note: When judging the character length, a Chinese character counts as 1.0 and an English character counts as 0.5. When intercepting substrings, it will not There are uneven situations.
Usage:
{<span>$content</span>|smartTruncate:5:"..."}
ok, if there is no problem in the test, please point it out
The above introduces the solution for intercepting garbled strings containing mixed Chinese and English in the ThinkPHP+Smarty template, including smarty templates and thinkphp. I hope it will be helpful to friends who are interested in PHP tutorials.