$str = '這是<p>一道<a ;
將以上字符串截取前7個字符顯示出來,最終應(yīng)該要這個結(jié)果:
'這是<p>一道<a
要求:
如果字符串中有HTML標(biāo)簽就略過不記數(shù)
如果截取完之后有HTML標(biāo)簽被截斷了,那么要在最后把截斷的標(biāo)簽再補(bǔ)上結(jié)束標(biāo)簽
學(xué)習(xí)是最好的投資!
沒有推測題目的用途,單純按要求一寫了個正則替換
function pure_cut($str, $len) {
$reg = '/' . str_repeat('[^<>]((?:<[^>]+>)+)?', $len) . '$/u';
$str = preg_replace_callback($reg, function($matches) {
array_shift($matches);
$replace = join('', $matches);
return $replace;
}, $str, 7);
return $str;
}
echo pure_cut($str, 7);
不過要求2 沒太明白。在要求1滿足的情況下,html 標(biāo)簽是不會被破壞的,不需要專門去修復(fù)啊。
應(yīng)該是截取富文本編輯框的內(nèi)容吧.
<?php
$str = '這是<p>一道<a ;
function truncate($text, $length = 100, $ending = '...', $exact = false, $considerHtml = true) {
if ($considerHtml) {
if (mb_strlen(strip_tags($text)) <= $length) {
return $text;
}
preg_match_all('/(<.+?>)?([^<>]*)/s', $text, $lines, PREG_SET_ORDER);
$total_length = mb_strlen($ending);
$open_tags = array();
$truncate = '';
foreach ($lines as $line_matchings) {
if (!empty($line_matchings[1])) {
if (preg_match('/^<(\s*.+?\/\s*|\s*(img|br|input|hr|area|base|basefont|col|frame|isindex|link|meta|param)(\s.+?)?)>$/is', $line_matchings[1])) {
} else if (preg_match('/^<\s*\/([^\s]+?)\s*>$/s', $line_matchings[1], $tag_matchings)) {
$pos = array_search($tag_matchings[1], $open_tags);
if ($pos !== false) {
unset($open_tags[$pos]);
}
} else if (preg_match('/^<\s*([^\s>!]+).*?>$/s', $line_matchings[1], $tag_matchings)) {
array_unshift($open_tags, strtolower($tag_matchings[1]));
}
$truncate .= $line_matchings[1];
}
$content_length = mb_strlen(preg_replace('/&[0-9a-z]{2,8};|&#[0-9]{1,7};|[0-9a-f]{1,6};/i', ' ', $line_matchings[2]));
if ($total_length+$content_length > $length) {
$left = $length - $total_length;
$entities_length = 0;
if (preg_match_all('/&[0-9a-z]{2,8};|&#[0-9]{1,7};|[0-9a-f]{1,6};/i', $line_matchings[2], $entities, PREG_OFFSET_CAPTURE)) {
foreach ($entities[0] as $entity) {
if ($entity[1]+1-$entities_length <= $left) {
$left--;
$entities_length += mb_strlen($entity[0]);
} else {
break;
}
}
}
$truncate .= mb_substr($line_matchings[2], 0, $left+$entities_length);
break;
} else {
$truncate .= $line_matchings[2];
$total_length += $content_length;
}
if($total_length >= $length) {
break;
}
}
} else {
if (mb_strlen($text) <= $length) {
return $text;
} else {
$truncate = mb_substr($text, 0, $length - mb_strlen($ending));
}
}
if (!$exact) {
$spacepos = mb_strrpos($truncate, ' ');
if (isset($spacepos)) {
$truncate = mb_substr($truncate, 0, $spacepos);
}
}
$truncate .= $ending;
if($considerHtml) {
foreach ($open_tags as $tag) {
$truncate .= '</' . $tag . '>';
}
}
return $truncate;
}
echo truncate($str, 7, '', true, true);