PHP字符串总结

定义：一个字符串 string 就是由一系列的字符组成，其中每个字符等同于一个字节。这意味着 PHP 只能支持 256 的字符集，因此不支持 Unicode 。相信很多新手在官网看到这句话的时候都会觉得很疑惑，明明PHP不是可以打印出中文吗？为什么会说字符串不支持Unicode呢？实际上，这句话应该这样理解：字符串会原生支持ascii字符集，会直接保存不进行编码，但是对于其他字符集会自动根据文档的编码来进行编码，所以如果文档的编码设置正确的话，是可以正确将保存在字符串里面的编码正确读取出来的。

常用函数
因为字符串的函数非常多，我粗略的数了一下，接近100个，所以想要都记住是不太现实的，只能把常用记住，其他需要用到的时候再来看手册吧。

连接与分割

1.explode — 使用一个字符串分割另一个字符串

用法：

array explode ( string $delimiter , string $string [, int $limit ] )

示例：

$pizza  = "piece1 piece2 piece3 piece4 piece5 piece6";
$pieces = explode(" ", $pizza);
echo $pieces[0]; // piece1
echo $pieces[1]; // piece2

2.implode — 将一个一维数组的值转化为字符串

用法：

string implode ( string $glue , array $pieces )

示例：

$array = array('lastname', 'email', 'phone');
$comma_separated = implode(",", $array);

echo $comma_separated; // lastname,email,phone

3.str_split — 将字符串转换为数组

用法：

array str_split ( string $string [, int $split_length = 1 ] )

示例：

$str = "Hello Friend";

$arr1 = str_split($str);
$arr2 = str_split($str, 3);

print_r($arr1);
print_r($arr2);

//输出的结果
Array
(
    [0] => H
    [1] => e
    [2] => l
    [3] => l
    [4] => o
    [5] =>
    [6] => F
    [7] => r
    [8] => i
    [9] => e
    [10] => n
    [11] => d
)

Array
(
    [0] => Hel
    [1] => lo
    [2] => Fri
    [3] => end
)

4.mb_split — 使用正则表达式分割多字节字符串

用法：

array mb_split ( string $pattern , string $string [, int $limit = -1 ] )

示例：

To split an string like this: "日、に、本、ほん、語、ご" using the "、" delimiter i used:

     $v = mb_split('、',"日、に、本、ほん、語、ご");

but didn't work.

The solution was to set this before:

       mb_regex_encoding('UTF-8');
      mb_internal_encoding("UTF-8"); 
     $v = mb_split('、',"日、に、本、ほん、語、ご");

and now it's working:

Array
(
    [0] => 日
    [1] => に
    [2] => 本
    [3] => ほん
    [4] => 語
    [5] => ご
)

5.preg_split — 通过一个正则表达式分隔字符串

用法：

array preg_split ( string $pattern , string $subject [, int $limit = -1 [, int $flags = 0 ]] )

示例：

//使用逗号或空格(包含" ", \r, \t, \n, \f)分隔短语
$keywords = preg_split("/[\s,]+/", "hypertext language, programming");
print_r($keywords);

//输出
Array
(
    [0] => hypertext
    [1] => language
    [2] => programming
)

6.strstr — 查找字符串的首次出现，其忽略大小写版本为stristr

用法：

string strstr ( string $haystack , mixed $needle [, bool $before_needle = false ] )

示例：

$email  = 'name@example.com';
$domain = strstr($email, '@');
echo $domain; // 打印 @example.com

$user = strstr($email, '@', true); // 从 PHP 5.3.0 起
echo $user; // 打印 name

7.strrchr — 查找指定字符在字符串中的最后一次出现

用法：

string strrchr ( string $haystack , mixed $needle )

示例：

$path = '/www/public_html/index.html';
$filename = substr(strrchr($path, "/"), 1);
echo $filename; // "index.html"

8.strpos — 查找字符串首次出现的位置，对应的最后出现的函数为strrpos

用法：

mixed strpos ( string $haystack , mixed $needle [, int $offset = 0 ] )

9.substr — 返回字符串的子串

用法：

string substr ( string $string , int $start [, int $length ] )

示例：

$rest = substr("abcdef", -1);    // 返回 "f"
$rest = substr("abcdef", -2);    // 返回 "ef"
$rest = substr("abcdef", -3, 1); // 返回 "d"

查找与替换

1.preg_match — 执行一个正则表达式匹配

用法：

int preg_match ( string $pattern , string $subject [, array &$matches [, int $flags = 0 [, int $offset = 0 ]]] )

示例：

//从URL中获取主机名称
preg_match('@^(?:http://)?([^/]+)@i',
    "http://www.php.net/index.html", $matches);
$host = $matches[1];//$host为www.php.net

//获取主机名称的后面两部分
preg_match('/[^.]+\.[^.]+$/', $host, $matches);
echo "domain name is: {$matches[0]}\n";
//输出domain name is: php.net


Regex quick reference
[abc]     A single character: a, b or c
[^abc]     Any single character but a, b, or c
[a-z]     Any single character in the range a-z
[a-zA-Z]     Any single character in the range a-z or A-Z
^     Start of line
$     End of line
\A     Start of string
\z     End of string
.     Any single character
\s     Any whitespace character
\S     Any non-whitespace character
\d     Any digit
\D     Any non-digit
\w     Any word character (letter, number, underscore)
\W     Any non-word character
\b     Any word boundary character
(...)     Capture everything enclosed
(a|b)     a or b
a?     Zero or one of a
a*     Zero or more of a
a+     One or more of a
a{3}     Exactly 3 of a
a{3,}     3 or more of a
a{3,6}     Between 3 and 6 of a

options: i case insensitive m make dot match newlines x ignore whitespace in regex o perform #{...} substitutions only once

2.preg_replace — 执行一个正则表达式的搜索和替换，preg_filter()等价于preg_replace() 除了它仅仅返回(可能经过转化)与目标匹配的结果

用法：

mixed preg_replace ( mixed $pattern , mixed $replacement , mixed $subject [, int $limit = -1 [, int &$count ]] )

示例：

$string = 'The quick brown fox jumped over the lazy dog.';
$patterns = array();
$patterns[0] = '/quick/';
$patterns[1] = '/brown/';
$patterns[2] = '/fox/';
$replacements = array();
$replacements[2] = 'bear';
$replacements[1] = 'black';
$replacements[0] = 'slow';
echo preg_replace($patterns, $replacements, $string);

//输出The bear black slow jumped over the lazy dog.

3.wordwrap — 打断字符串为指定数量的字串

用法：

string wordwrap ( string $str [, int $width = 75 [, string $break = "\n" [, bool $cut = false ]]] )

示例：

$text = "A very long woooooooooooord.";
$newtext = wordwrap($text, 8, "\n", true);

echo "$newtext\n";

//输出
A very
long
wooooooo
ooooord.

4.nl2br — 在字符串所有新行之前插入 HTML 换行标记

用法：

string nl2br ( string $string [, bool $is_xhtml = true ] )

5.htmlspecialchars — Convert special characters to HTML entities htmlspecialchars_decode — 将特殊的 HTML 实体转换回普通字符

用法：

string htmlspecialchars ( string $string [, int $flags = ENT_COMPAT | ENT_HTML401 [, string $encoding = ini_get("default_charset") [, bool $double_encode = true ]]] )

6.htmlentities — Convert all applicable characters to HTML entities

用法：

string htmlentities ( string $string [, int $flags = ENT_COMPAT | ENT_HTML401 [, string $encoding = ini_get("default_charset") [, bool $double_encode = true ]]] )

示例：

$str = "A 'quote' is <b>bold</b>";

// Outputs: A 'quote' is <b>bold</b>
echo htmlentities($str);
//和htmlspecialchars的输出一样

// Outputs: A 'quote' is <b>bold</b>
echo htmlentities($str, ENT_QUOTES);

加密

1.md5 — 计算字符串的 MD5 散列值，md5_file — 计算指定文件的 MD5 散列值

用法：

string md5 ( string $str [, bool $raw_output = false ] )

2.sha1 — 计算字符串的 sha1 散列值,sha1_file — 计算文件的 sha1 散列值

用法：

string sha1 ( string $str [, bool $raw_output = false ] )

3.hash — 生成哈希值（消息摘要），hash_file — 使用给定文件的内容生成哈希值

用法：

string hash ( string $algo , string $data [, bool $raw_output = false ] )

4.crc32 — 计算一个字符串的 crc32 多项式

用法：

int crc32 ( string $str )

生成 str 的 32 位循环冗余校验码多项式。这通常用于检查传输的数据是否完整。

其他

1.addslashes — 使用反斜线引用字符串

用法：

string addslashes ( string $str )

2.lcfirst — 使一个字符串的第一个字符小写，ucfirst — 将字符串的首字母转换为大写

用法：

string lcfirst ( string $str )

3.ucwords — 将字符串中每个单词的首字母转换为大写

用法：

string ucwords ( string $str )

4.strtoupper — 将字符串转化为大写，strtolower — 将字符串转化为小写

用法：

string strtoupper ( string $string )

5.trim — 去除字符串首尾处的空白字符（或者其他字符），ltrim删除字符串开头的空白字符（或其他字符），rtrim删除字符串末端的空白字符（或者其他字符）

用法：

string trim ( string $str [, string $charlist = " \t\n\r\0\x0B" ] )

6.sprintf — 返回一个格式化的字符串

用法：

string sprintf ( string $format [, mixed $args [, mixed $... ]] )

//Possible format values:
%% - Returns a percent sign
%b - Binary number
%c - The character according to the ASCII value
%d - Signed decimal number (negative, zero or positive)
%e - Scientific notation using a lowercase (e.g. 1.2e+2)
%E - Scientific notation using a uppercase (e.g. 1.2E+2)
%u - Unsigned decimal number (equal to or greather than zero)
%f - Floating-point number (local settings aware)
%F - Floating-point number (not local settings aware)
%g - shorter of %e and %f
%G - shorter of %E and %f
%o - Octal number
%s - String
%x - Hexadecimal number (lowercase letters)
%X - Hexadecimal number (uppercase letters)

//Additional format values. These are placed between the % and the letter (example %.2f):

+ (Forces both + and - in front of numbers. By default, only negative numbers are marked)
' (Specifies what to use as padding. Default is space. Must be used together with the width specifier. Example: %'x20s (this uses "x" as padding)
- (Left-justifies the variable value)
[0-9] (Specifies the minimum width held of to the variable value)
.[0-9] (Specifies the number of decimal digits or maximum string length)

7.strcmp — 二进制安全字符串比较，strcasecmp — 二进制安全比较字符串（不区分大小写）

用法：

int strcmp ( string $str1 , string $str2 )

8.strrev — 反转字符串

用法：

string strrev ( string $string )

参考网站

最后编辑于：2017.12.03 10:23:25

人面猴
序言：七十年代末，一起剥皮案震惊了整个滨河市，随后出现的几起案子，更是在滨河造成了极大的恐慌，老刑警刘岩，带你破解...
沈念sama阅读 203,324评论 5赞 476
死咒
序言：滨河连续发生了三起死亡事件，死亡现场离奇诡异，居然都是意外死亡，警方通过查阅死者的电脑和手机，发现死者居然都...
沈念sama阅读 85,303评论 2赞 381
救了他两次的神仙让他今天三更去死
文/潘晓璐我一进店门，熙熙楼的掌柜王于贵愁眉苦脸地迎上来，“玉大人，你说我怎么就摊上这事。” “怎么了？”我有些...
开封第一讲书人阅读 150,192评论 0赞 337
道士缉凶录：失踪的卖姜人
文/不坏的土叔我叫张陵，是天一观的道长。经常有香客问我，道长，这世上最难降的妖魔是什么？我笑而不...
开封第一讲书人阅读 54,555评论 1赞 273
港岛之恋（遗憾婚礼）
正文为了忘掉前任，我火速办了婚礼，结果婚礼上，老公的妹妹穿的比我还像新娘。我一直安慰自己，他们只是感情好，可当我...
茶点故事阅读 63,569评论 5赞 365
恶毒庶女顶嫁案：这布局不是一般人想出来的
文/花漫我一把揭开白布。她就那样静静地躺着，像睡着了一般。火红的嫁衣衬着肌肤如雪。梳的纹丝不乱的头发上，一...
开封第一讲书人阅读 48,566评论 1赞 281
城市分裂传说
那天，我揣着相机与录音，去河边找鬼。笑死，一个胖子当着我的面吹牛，可吹牛的内容都是我干的。我是一名探鬼主播，决...
沈念sama阅读 37,927评论 3赞 395
双鸳鸯连环套：你想象不到人心有多黑
文/苍兰香墨我猛地睁开眼，长吁一口气：“原来是场噩梦啊……” “哼！你这毒妇竟也来了？” 一声冷哼从身侧响起，我...
开封第一讲书人阅读 36,583评论 0赞 257
万荣杀人案实录
序言：老挝万荣一对情侣失踪，失踪者是张志新（化名）和其女友刘颖，没想到半个月后，有当地人在树林里发现了一具尸体，经...
沈念sama阅读 40,827评论 1赞 297
护林员之死
正文独居荒郊野岭守林人离奇死亡，尸身上长有42处带血的脓包…… 初始之章·张勋以下内容为张勋视角年9月15日...
茶点故事阅读 35,590评论 2赞 320
白月光启示录
正文我和宋清朗相恋三年，在试婚纱的时候发现自己被绿了。大学时的朋友给我发了我未婚夫和他白月光在一起吃饭的照片。...
茶点故事阅读 37,669评论 1赞 329
活死人
序言：一个原本活蹦乱跳的男人离奇死亡，死状恐怖，灵堂内的尸体忽然破棺而出，到底是诈尸还是另有隐情，我是刑警宁泽，带...
沈念sama阅读 33,365评论 4赞 318
日本核电站爆炸内幕
正文年R本政府宣布，位于F岛的核电站，受9级特大地震影响，放射性物质发生泄漏。R本人自食恶果不足惜，却给世界环境...
茶点故事阅读 38,941评论 3赞 307
男人毒药：我在死后第九天来索命
文/蒙蒙一、第九天我趴在偏房一处隐蔽的房顶上张望。院中可真热闹，春花似锦、人声如沸。这庄子的主人今日做“春日...
开封第一讲书人阅读 29,928评论 0赞 19
一桩弑父案，背后竟有这般阴谋
文/苍兰香墨我抬头看了看天上的太阳。三九已至，却和暖如春，着一层夹袄步出监牢的瞬间，已是汗流浃背。一阵脚步声响...
开封第一讲书人阅读 31,159评论 1赞 259
情欲美人皮
我被黑心中介骗来泰国打工，没想到刚下飞机就差点儿被人妖公主榨干…… 1. 我叫王不留，地道东北人。一个月前我还...
沈念sama阅读 42,880评论 2赞 349
代替公主和亲
正文我出身青楼，却偏偏与公主长得像，于是被迫代替她去往敌国和亲。传闻我的和亲对象是个残疾皇子，可洞房花烛夜当晚...
茶点故事阅读 42,399评论 2赞 342

PHP字符串总结

连接与分割

查找与替换

加密

其他

推荐阅读更多精彩内容