国产av日韩一区二区三区精品,成人性爱视频在线观看,国产,欧美,日韩,一区,www.成色av久久成人,2222eeee成人天堂

Home php教程 php手冊 比較discuz和ecshop的截取字符串函數(shù)php版

比較discuz和ecshop的截取字符串函數(shù)php版

Jun 13, 2016 am 11:58 AM
discuz ecshop php under function and string intercept Compare source code Version Version of

下面先給出兩個版本函數(shù)的源代碼以及簡單測試,最后我會給出一個實用性更強的字符串截取函數(shù)。需要注意的是:這里討論的字符串截取問題都是針對UTF-8編碼的中文字符串。
discuz版本

復(fù)制代碼 代碼如下:


/**
* [discuz] 基于PHP沒有安裝 mb_substr 等擴展截取字符串,如果截取中文字則按2個字符計算
* @param $string 要截取的字符串
* @param $length 要截取的字符數(shù)
* @param $dot 替換截掉部分的結(jié)尾字符串
* @return 返回截取后的字符串
*/
function cutstr($string, $length, $dot = '...') {
// 如果字符串小于要截取的長度則直接返回
// 此處使用strlen獲取字符串長度有很大的弊病,比如對字符串“新年快樂”要截取4個中文字符,
// 那么必須知道這4個中文字符的字節(jié)數(shù),否則返回的字符串可能會是“新年快樂...”
if (strlen($string) return $string;
}
// 轉(zhuǎn)換原字符串中htmlspecialchars
$pre = chr(1);
$end = chr(1);
$string = str_replace ( array ('&', '"', '' ), array ($pre . '&' . $end, $pre . '"' . $end, $pre . '' . $end ), $string );
$strcut = ''; // 初始化返回值
// 如果是utf-8編碼(這個判斷有點不全,有可能是utf8)
if (strtolower ( CHARSET ) == 'utf-8') {
// 初始連續(xù)循環(huán)指針$n,最后一個字位數(shù)$tn,截取的字符數(shù)$noc
$n = $tn = $noc = 0;
while ( $n $t = ord ( $string [$n] );
if ($t == 9 || $t == 10 || (32 // 如果是英語半角符號等,$n指針后移1位,$tn最后字是1位
$tn = 1;
$n++;
$noc++;
} elseif (194 // 如果是二字節(jié)字符$n指針后移2位,$tn最后字是2位
$tn = 2;
$n += 2;
$noc += 2;
} elseif (224 // 如果是三字節(jié)(可以理解為中字詞),$n后移3位,$tn最后字是3位
$tn = 3;
$n += 3;
$noc += 2;
} elseif (240 $tn = 4;
$n += 4;
$noc += 2;
} elseif (248 $tn = 5;
$n += 5;
$noc += 2;
} elseif ($t == 252 || $t == 253) {
$tn = 6;
$n += 6;
$noc += 2;
} else {
$n++;
}
// 超過了要取的數(shù)就跳出連續(xù)循環(huán)
if ($noc >= $length) {
break;
}
}
// 這個地方是把最后一個字去掉,以備加$dot
if ($noc > $length) {
$n -= $tn;
}
$strcut = substr ( $string, 0, $n );
} else {
// 并非utf-8編碼的全角就后移2位
for ($i = 0; $i $strcut .= ord ( $string [$i] ) > 127 ? $string [$i] . $string [++ $i] : $string [$i];
}
}
// 再還原最初的htmlspecialchars
$strcut = str_replace( array ($pre . '&' . $end, $pre . '"' . $end, $pre . '' . $end ), array ('&', '"', '' ), $strcut );
$pos = strrpos ( $strcut, chr ( 1 ) );
if ($pos !== false) {
$strcut = substr ( $strcut, 0, $pos );
}
return $strcut . $dot; // 最后把截取加上$dot輸出
}


discuz版本的最大缺陷在于使用 strlen 獲取原始字符串的長度,并用來和傳入的要截取長度參數(shù)(字節(jié)數(shù))進行比較,由于UTF-8的中文字符的字節(jié)數(shù)是不固定的,所以就會面臨這樣的窘境:如果要截取4個中文字符應(yīng)該指定多大的截取長度呢?8字節(jié)還是12字節(jié)呢?。。。這是無法預(yù)計的,也正是因為這個問題discuz的cutstr實際是有bug的,通過下面的測試結(jié)果能看出:

復(fù)制代碼 代碼如下:


$str1 = "欲窮千里目";
echo my_cutstr($str1, 10, "...")."\n"; // 輸出:欲窮千里目... [這是一個bug,想想是什么原因?qū)е拢縘
echo my_cutstr($str1, 15, "...")."\n"; // 輸出:欲窮千里目


導(dǎo)致上述bug的原因在與cutstr函數(shù)在截取字符的時候是將一個中文字按2個字符算,那么5個中文字就是10字符,而原始字符串的長度是15字節(jié),所以cutstr認為“成功地”從15字符的串上截取了10個字符,然后加上了“尾巴”。要解決這個bug只要在判斷一下返回的子串是否和原始串相同,如果相同就不加“尾巴”。
ecshop版

復(fù)制代碼 代碼如下:


/**
* [ecshop] 基于PHP的 mb_substr,iconv_substr 這兩個擴展來截取字符串,中文字符都是按1個字符長度計算;
* 該函數(shù)僅適用于utf-8編碼的中文字符串。
*
* @param $str 原始字符串
* @param $length 截取的字符數(shù)
* @param $append 替換截掉部分的結(jié)尾字符串
* @return 返回截取后的字符串
*/
function sub_str($str, $length = 0, $append = '...') {
$str = trim($str);
$strlength = strlen($str);
if ($length == 0 || $length >= $strlength) {
return $str;
} elseif ($length $length = $strlength + $length;
if ($length $length = $strlength;
}
}
if ( function_exists('mb_substr') ) {
$newstr = mb_substr($str, 0, $length, 'utf-8');
} elseif ( function_exists('iconv_substr') ) {
$newstr = iconv_substr($str, 0, $length, 'utf-8');
} else {
//$newstr = trim_right(substr($str, 0, $length));
$newstr = substr($str, 0, $length);
}
if ($append && $str != $newstr) {
$newstr .= $append;
}
return $newstr;
}


ecshop版的特點和缺點都在于將中文字符算作一個字符,如果原始字符串中不含中文,比如:abcd1234,如果本意是要截取4個中文字符或者8個英文字符,那么使用ecshop的版本就得不到期望的結(jié)果,返回值的是:abcd。下面是簡單的測試結(jié)果:

復(fù)制代碼 代碼如下:


$str1 = "白日依山盡,黃河入海流";
echo $str1."\n";
echo my_sub_str($str1, 4, "...")."\n"; // 輸出:白日依山...
$str2 = "白1日2依3山4";
echo $str2."\n";
echo my_sub_str($str2, 4, "...")."\n"; // 輸出:白1日2...


優(yōu)化版
截取中文字符串的大部分應(yīng)用場景是“原始字符串可以是中文、英文、數(shù)字混雜的,中文字按2個字符算,英文數(shù)字按1個字符算”,針對這個需求下面給出一個實現(xiàn)版本:

復(fù)制代碼 代碼如下:


/**
* 字符串截取,中文字符按2個字符計算,同時支持GBK和UTF-8編碼
* @param $string 要截取的字符串
* @param $length 要截取的字符數(shù)
* @param $append 添加到子串后的尾巴
* @return 返回截取后的字符串
*/
function substring($string, $length, $append = false) {
if ( $length return '';
}
// 檢測原始字符串是否為UTF-8編碼
$is_utf8 = false;
$str1 = @iconv("UTF-8", "GBK", $string);
$str2 = @iconv("GBK", "UTF-8", $str1);
if ( $string == $str2 ) {
$is_utf8 = true;
// 如果是UTF-8編碼,則使用GBK編碼的
$string = $str1;
}
$newstr = '';
for ($i = 0; $i $newstr .= ord ($string[$i]) > 127 ? $string[$i] . $string[++$i] : $string[$i];
}
if ( $is_utf8 ) {
$newstr = @iconv("GBK", "UTF-8", $newstr);
}
if ($append && $newstr != $string) {
$newstr .= $append;
}
return $newstr;
}


測試結(jié)果見下(GBK和UTF-8的結(jié)果一致):

復(fù)制代碼 代碼如下:


$str1 = "白日依山盡,黃河入海流";
echo substring($str1, 4, "...")."\n"; // 輸出:白日...
echo substring($str1, 5, "...")."\n"; // 輸出:白日依...
$str2 = "12白34日56依78山";
echo substring($str2, 4, "...")."\n"; // 輸出:12白...
echo substring($str2, 5, "...")."\n"; // 輸出:12白3...


作者:edwardlost' blog
Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undress AI Tool

Undress AI Tool

Undress images for free

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Using std::chrono in C Using std::chrono in C Jul 15, 2025 am 01:30 AM

std::chrono is used in C to process time, including obtaining the current time, measuring execution time, operation time point and duration, and formatting analysis time. 1. Use std::chrono::system_clock::now() to obtain the current time, which can be converted into a readable string, but the system clock may not be monotonous; 2. Use std::chrono::steady_clock to measure the execution time to ensure monotony, and convert it into milliseconds, seconds and other units through duration_cast; 3. Time point (time_point) and duration (duration) can be interoperable, but attention should be paid to unit compatibility and clock epoch (epoch)

How does PHP handle Environment Variables? How does PHP handle Environment Variables? Jul 14, 2025 am 03:01 AM

ToaccessenvironmentvariablesinPHP,usegetenv()orthe$_ENVsuperglobal.1.getenv('VAR_NAME')retrievesaspecificvariable.2.$_ENV['VAR_NAME']accessesvariablesifvariables_orderinphp.iniincludes"E".SetvariablesviaCLIwithVAR=valuephpscript.php,inApach

Why We Comment: A PHP Guide Why We Comment: A PHP Guide Jul 15, 2025 am 02:48 AM

PHPhasthreecommentstyles://,#forsingle-lineand/.../formulti-line.Usecommentstoexplainwhycodeexists,notwhatitdoes.MarkTODO/FIXMEitemsanddisablecodetemporarilyduringdebugging.Avoidover-commentingsimplelogic.Writeconcise,grammaticallycorrectcommentsandu

PHP header redirect not working PHP header redirect not working Jul 14, 2025 am 01:59 AM

Reasons and solutions for the header function jump failure: 1. There is output before the header, and all pre-outputs need to be checked and removed or ob_start() buffer is used; 2. The failure to add exit causes subsequent code interference, and exit or die should be added immediately after the jump; 3. The path error should be used to ensure correctness by using absolute paths or dynamic splicing; 4. Server configuration or cache interference can be tried to clear the cache or replace the environment test.

PHP prepared statement get result PHP prepared statement get result Jul 14, 2025 am 02:12 AM

The method of using preprocessing statements to obtain database query results in PHP varies from extension. 1. When using mysqli, you can obtain the associative array through get_result() and fetch_assoc(), which is suitable for modern environments; 2. You can also use bind_result() to bind variables, which is suitable for situations where there are few fields and fixed structures, and it is good compatibility but there are many fields when there are many fields; 3. When using PDO, you can obtain the associative array through fetch (PDO::FETCH_ASSOC), or use fetchAll() to obtain all data at once, so the interface is unified and the error handling is clearer; in addition, you need to pay attention to parameter type matching, execution of execute(), timely release of resources and enable error reports.

PHP check if a string starts with a specific string PHP check if a string starts with a specific string Jul 14, 2025 am 02:44 AM

In PHP, you can use a variety of methods to determine whether a string starts with a specific string: 1. Use strncmp() to compare the first n characters. If 0 is returned, the beginning matches and is not case sensitive; 2. Use strpos() to check whether the substring position is 0, which is case sensitive. Stripos() can be used instead to achieve case insensitive; 3. You can encapsulate the startsWith() or str_starts_with() function to improve reusability; in addition, it is necessary to note that empty strings return true by default, encoding compatibility and performance differences, strncmp() is usually more efficient.

how to avoid undefined index error in PHP how to avoid undefined index error in PHP Jul 14, 2025 am 02:51 AM

There are three key ways to avoid the "undefinedindex" error: First, use isset() to check whether the array key exists and ensure that the value is not null, which is suitable for most common scenarios; second, use array_key_exists() to only determine whether the key exists, which is suitable for situations where the key does not exist and the value is null; finally, use the empty merge operator?? (PHP7) to concisely set the default value, which is recommended for modern PHP projects, and pay attention to the spelling of form field names, use extract() carefully, and check the array is not empty before traversing to further avoid risks.

PHP prepared statement with IN clause PHP prepared statement with IN clause Jul 14, 2025 am 02:56 AM

When using PHP preprocessing statements to execute queries with IN clauses, 1. Dynamically generate placeholders according to the length of the array; 2. When using PDO, you can directly pass in the array, and use array_values to ensure continuous indexes; 3. When using mysqli, you need to construct type strings and bind parameters, pay attention to the way of expanding the array and version compatibility; 4. Avoid splicing SQL, processing empty arrays, and ensuring data types match. The specific method is: first use implode and array_fill to generate placeholders, and then bind parameters according to the extended characteristics to safely execute IN queries.

See all articles