PHP中正则表达式函数（Perl兼容）

PHP为使用Perl兼容的正则表达式搜索字符串提供了7个函数，分别是preg_grep()、preg_match()、preg_match_all()、preg_quote()、preg_replace()、preg_replace_callback()和preg_split()

1， preg_grep()函数，将数组中符合要求的元素组成数组返回。函数样式如下，

array  preg_grep(string $pattern,array  $input)，$pattern为查询条件，$input待查询数组。

如：

<?php
$foods = array("pasta","steak","fish","potatoes");
$food = preg_grep("/^p/",$foods);
var_dump($food);
?>

输出：

array(2) { [0]=> string(5) "pasta" [3]=> string(8) "potatoes" }

2，preg_split()函数，将字符串按照要求分割成数组返回

array preg_split(string $pattern,string $input,int $limit)，$pattern分隔符，$input输入字符串，$limit为限定返回数组中元素个数。

如：

<?php
$delimitedText="Jason+++Gilmore++++++Columbus++++OH";
$fields = preg_split("/(\+)+/",$delimitedText);
var_dump($fields);
?>

输出：

array(4) { [0]=> string(5) "Jason" [1]=> string(7) "Gilmore" [2]=> string(8) "Columbus" [3]=> string(2) "OH" }

3，preg_match()函数，在字符串查找某个元素，存在返回true，否则返回false。

int preg_match(string $pattern,string $subject,$matches)，$pattern查询的字符串，$subject待查询的字符串，$matches存放查询结果的数组，$matches[0]包含完整模式匹配到的文本，$matches[1]包含第一个捕获子组匹配到的文本，依次类推。

如：

$line = "Vim is the greatest word processor ever created! vim is greate";
if(preg_match("/\bvim\b/i",$line,$matches)){
    echo "字符串存在元素！";
}
var_dump($matches);

输出：

字符串存在元素！array(1) { [0]=> string(3) "Vim" }

4，preg_match_all()函数，preg_match_all()与preg_match()功能类似，不过还是有差异。preg_match()结果匹配一次成功后停止匹配；preg_match_all()实现所有结果的匹配。

如：

$pattern = ‘/^(http:\/\/)?([^\/]+)/i‘;
$subject = ‘http://www.php.net/index.html‘;
preg_match_all($pattern,$subject,$match);
var_dump($match);

输出：

array (size=3)
  0 => 
    array (size=1)
      0 => string ‘http://www.php.net‘ (length=18)
  1 => 
    array (size=1)
      0 => string ‘http://‘ (length=7)
  2 => 
    array (size=1)
      0 => string ‘www.php.net‘ (length=11)

数组中第一个$matchs[0]就是整个$pattern匹配到的文本元素

$matchs[1] 匹配到的是子组 (http:\/\/) 匹配到的文本

$matchs[2] 匹配到的是子组(^\/+) 匹配到的文本

5，preg_replace()函数，在数组或字符串中查找并替换某部分。

mixed preg_replace(mixed $pattern,mixed $replacement,mixed $str,int $limit)，$pattern替换值，$replacement被替换值，$str查找替换的字符串，$limit指定应当发生多少次匹配，不设置或设置为-1将替换所有出现的情况。$pattern和$replacement可以使数组，即$replacement[i]替换$pattern[i]。

如：

<?php
$text = "This is a link to http://www.wjgilmore.com/.";
echo preg_replace("/http:\/\/(.*)\//","<a href=\"\${0}\">\${0}</a>",$text);
?>

输出：

This is a link to http://www.wjgilmore.com/.

如：

<?php
$draft = "In 2007 the company faced plummeting revenues and scandal";
$keywords = array("/faced/","/plummeting/","/scandal/");
$replacements = array("celebrated","skyrocketing","expansion");
echo preg_replace($keywords,$replacements,$draft);
?>

输出：

In 2007 the company celebrated skyrocketing revenues and expansion

6，preg_replace_callback()函数，此函数与preg_replace()函数类似，此处的被替换值$replacement可以用函数实现。

mixed preg_replace_callback(mixed $pattern,callback callback,mixed$str,int $limit)

如：

function acronym($matches){
    //通常: $matches[0]是完成的匹配
    //$matches[1]是第一个捕获子组的匹配
    //以此类推
    $acronyms = array(
        'WWW'=>'World Wide Web',
        'IRS'=>'Internal Revenue Service',
        'PDF'=>'Portable Document Format'
    );
    if(isset($acronyms[$matches[1]])){
        return $matches[1]."(".$acronyms[$matches[1]].")";
    }else{
        return $matches[1];
    }
}
$text = "The <acronym>IRS</acronym> offers tax forms in <acronym>PDF</acronym> format on the <acronym>WWW</acronym>";
$newtext = preg_replace_callback("/<acronym>(.*)<\/acronym>/U","acronym",$text);
print_r($newtext);
?>

输出：

The IRS(Internal Revenue Service) offers tax forms in PDF(Portable Document Format) format on the WWW(World Wide Web)

7，preg_quote()函数，在字符串中特殊字符前差一个反斜线。这些特殊字符包括$^*()+={}[]|\\:<>。其形式为：

string preg_quote(string $str,string $delimiter)，$delimiter用于指定用于正则表达式的定界符，使用它也要用反斜线转义。

如：

<?php
$text = "Tickets for the bout are going for $500.";
echo preg_quote($text);
?>

输出：

Tickets for the bout are going for \$500\.

秒客网

PHP中正则表达式函数（Perl兼容）

相关文章