grep, egrep, and Their Regular Expression Usage Explained

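Contents

1. Introduction to grep

    1.1. What grep is

    1.2. grep usage syntax

    1.3. Common grep options

2. Regular expressions and grep

    2.1. Introduction to regular expressions

    2.2. Basic regular expressions and grep

    2.3. Extended regular expressions: grep -E and egrep

    2.4. fgrep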


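1. Introduction to grep

    1.1. What grep is

                  grep (Global search REgular expression and Print out the line)

                    Purpose: a text search tool. It scans the target files line by line against a user-specified pattern (a combination of regular expression metacharacters and ordinary characters) and prints the lines that match.

                    Pattern: a filter condition written with regular expression metacharacters and literal text characters.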

    

    1.2. grep usage syntax

                    grep [OPTIONS] PATTERN [FILE...]

                            [root@localhost ~]# grep "UUID" /etc/fstab

                    grep [OPTIONS] [-e PATTERN | -f FILE] [FILE...]

                        [root@localhost ~]# grep -e "UUID" /etc/fstab
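The two usage forms above can be tried side by side. The sketch below is only an illustration: it assumes GNU grep, and the file names and the fstab-like contents are made up for the example rather than taken from a real system.

```shell
# Hypothetical sample data standing in for /etc/fstab.
tmpdir=$(mktemp -d)
cat > "$tmpdir/fstab.sample" <<'EOF'
UUID=1234-abcd /boot xfs defaults 0 0
/dev/mapper/centos-root / xfs defaults 0 0
/dev/mapper/centos-swap swap swap defaults 0 0
EOF

# Form 1: a single PATTERN given as the first non-option argument.
form1=$(grep "UUID" "$tmpdir/fstab.sample")

# Form 2a: -e may be repeated; a line matching any of the patterns is printed.
form2e=$(grep -e "UUID" -e "swap" "$tmpdir/fstab.sample")

# Form 2b: -f reads the patterns from a file, one pattern per line.
printf '%s\n' "UUID" "swap" > "$tmpdir/patterns.txt"
form2f=$(grep -f "$tmpdir/patterns.txt" "$tmpdir/fstab.sample")

printf '%s\n' "$form2e"
rm -rf "$tmpdir"
```

Repeating -e and using -f are two ways to supply several patterns at once, so in this sketch both select the same two lines.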

                 

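    1.3. Common grep options

                OPTIONS:

                    --color=auto: highlight the matched text in color;

                    -i, --ignore-case: ignore case distinctions;

                    -o: print only the matched string itself;

                    -v, --invert-match: print the lines that the pattern does not match;

                    -E: enable extended regular expression metacharacters;

                    -q, --quiet, --silent: quiet mode; print nothing and report the result only through the exit status;

                    -A #: after; also print the # lines following each matching line

                    -B #: before; also print the # lines preceding each matching line

                    -C #: context; also print # lines both before and after each matching line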
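A short sketch of how several of these options behave, assuming GNU grep. The sample text and the variable names are invented for the illustration.

```shell
# Hypothetical sample text; any multi-line input would do.
sample='alpha
Beta error
gamma
delta ERROR
epsilon'

# -i: match "error" regardless of case.
opt_i=$(printf '%s\n' "$sample" | grep -i "error")

# -o: print only the matched string, one match per output line.
opt_o=$(printf '%s\n' "$sample" | grep -o -i "error")

# -v: print the lines that do NOT match.
opt_v=$(printf '%s\n' "$sample" | grep -v -i "error")

# -q: no output at all; the exit status says whether anything matched.
if printf '%s\n' "$sample" | grep -q "gamma"; then
    opt_q="found"
else
    opt_q="missing"
fi

# -A 1: each matching line plus the one line after it.
opt_A=$(printf '%s\n' "$sample" | grep -A 1 "Beta")

printf '%s\n' "$opt_A"
```

Because -q prints nothing, it is mostly useful inside if or && / || constructs in scripts.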


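2. Regular expressions and grep

    2.1. Introduction to regular expressions

              Regular expression: Regular Expression, REGEXP

                    A pattern written with special characters and literal text characters. Some of these characters (the metacharacters) do not stand for their literal meaning; instead they express control or wildcard functions.

                Example of metacharacters in a pattern: \(hell0[[:space:]]\+\)\+

                Two flavors:

                        Basic regular expressions: BRE

                        Extended regular expressions: ERE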

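    2.2. Basic regular expressions and grep

                Basic regular expression metacharacters:

                                    Character matching:

                                            . : matches any single character;

                                            []: matches any single character within the specified set;

                                            [^]: matches any single character outside the specified set;

                                            [:digit:], [:lower:], [:upper:], [:alpha:], [:alnum:], [:punct:], [:space:]: character classes; they must be written inside a bracket expression, e.g. [[:digit:]];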
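The character-matching metacharacters above can be checked against a few lines of input. This is a minimal sketch assuming GNU grep; the sample lines are invented.

```shell
# Hypothetical sample lines (the fourth one starts with a space).
sample='a1
B2
c-3
 d
E'

# . matches any single character: "c", any one character, then "3".
dot=$(printf '%s\n' "$sample" | grep "c.3")

# [...] matches one character from the set: lines containing a digit.
digits=$(printf '%s\n' "$sample" | grep "[[:digit:]]")

# [^...] matches one character NOT in the set: lines containing a
# character that is neither a letter nor a digit.
non_alnum=$(printf '%s\n' "$sample" | grep "[^[:alnum:]]")

# Bracket expressions can be chained: an uppercase letter immediately
# followed by a digit.
upper_digit=$(printf '%s\n' "$sample" | grep "[[:upper:]][[:digit:]]")

printf '%s\n' "$digits"
```

Note the doubled brackets: the outer pair is the bracket expression, the inner [:digit:] is the class name.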


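                                    Quantifiers: placed after the character whose repetition count is being specified, to limit how many times that preceding character may occur; matching is greedy by default;

                                            *: matches the preceding character any number of times: 0, 1, or more;

                                                Example: grep "x*y" matches the lines

                                                           abxy

                                                           aby

                                                           xxxxxy

                                                           yab

                                            .*: matches any characters of any length

                                            \?: matches the preceding character 0 or 1 times; that is, the preceding character is optional (a GNU extension in BRE);

                                                Example: grep "x\?y" matches the lines

                                                           abxy

                                                           aby

                                                           xxxxxy

                                                           yab

                                            \+: matches the preceding character 1 or more times; that is, the preceding character must occur at least once (a GNU extension in BRE);

                                                Example: grep "x\+y" matches the lines

                                                           abxy

                                                           xxxxxy

                                            \{m\}: matches the preceding character exactly m times;

                                                Example: grep "x\{2\}y" matches the line

                                                           xxxxxy

                                            \{m,n\}: matches the preceding character at least m times and at most n times;

                                            \{0,n\}: at most n times

                                            \{m,\}: at least m times

                                                Example: grep "x\{1,5\}y" matches the lines

                                                           abxy

                                                           xxxxxy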
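The quantifier examples above can be reproduced with the same four sample lines. This sketch assumes GNU grep (for \? and \+ in BRE); the helper function match is a name made up for the illustration.

```shell
# The four sample lines used in the examples above.
sample='abxy
aby
xxxxxy
yab'

# Hypothetical helper: run one BRE pattern over the sample lines.
match() { printf '%s\n' "$sample" | grep "$1"; }

star=$(match 'x*y')          # zero or more x before y: every line has a y
opt=$(match 'x\?y')          # zero or one x before y: again every line
plus=$(match 'x\+y')         # at least one x directly before y
exact=$(match 'x\{2\}y')     # two x directly before y
range=$(match 'x\{1,5\}y')   # one to five x directly before y

# -o shows what was actually matched: greedy matching takes all five x.
greedy=$(printf '%s\n' 'xxxxxy' | grep -o 'x*y')

printf '%s\n' "$plus"
```

Since the patterns are not anchored, "x\{2\}y" matches xxxxxy through its last two x characters; an anchor such as ^ would be needed to require exactly two x on the whole line.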

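                                Position anchors:

                                            ^: beginning-of-line anchor; used at the leftmost end of the pattern;

                                            $: end-of-line anchor; used at the rightmost end of the pattern;

                                            ^PATTERN$: PATTERN must match the entire line;

                                            ^$: an empty line;

                                            ^[[:space:]]*$: an empty line, or a line containing only whitespace characters;

                                Word: a run of consecutive word characters (letters, digits, and the underscore in GNU grep) is called a word;

                                        \< or \b: word-start anchor, used on the left side of the word pattern;

                                        \> or \b: word-end anchor, used on the right side of the word pattern;

                                        \<PATTERN\>: matches a whole word;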
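The anchors can be compared on a small input. This is a sketch assuming GNU grep (\< and \> are GNU extensions); the sample lines and the function name sample_lines are invented. The -c option, which is not covered above, prints the number of matching lines instead of the lines themselves.

```shell
# Hypothetical sample: the fourth line is empty, the fifth is three spaces.
sample_lines() {
    printf '%s\n' 'root:x:0:0' 'chroot here' 'the root user' '' '   ' 'rooted'
}

# ^ anchors at the start of the line.
start=$(sample_lines | grep '^root')

# \< and \> anchor at word boundaries: "chroot" and "rooted" are excluded.
word=$(sample_lines | grep '\<root\>')

# ^PATTERN$ requires the pattern to cover the whole line.
whole=$(sample_lines | grep '^rooted$')

# ^$ counts only truly empty lines; ^[[:space:]]*$ also counts
# whitespace-only lines.
empty=$(sample_lines | grep -c '^$')
blank=$(sample_lines | grep -c '^[[:space:]]*$')

printf '%s\n' "$word"
```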


                                Exercises:

                                    1. Display the lines in /etc/passwd that do not end with /bin/bash;

                                        [root@localhost ~]# grep -v "/bin/bash$" /etc/passwd

                                    2. Find the two-digit or three-digit numbers in /etc/passwd;

                                        [root@localhost ~]# grep "\<[[:digit:]]\{2,3\}\>" /etc/passwd

                                        [root@localhost ~]# egrep "\<[0-9]{2,3}\>" /etc/passwd

                                    3. In /etc/rc.d/rc.sysinit or /etc/grub2.cfg, find the lines that begin with at least one whitespace character followed by a non-whitespace character;

                                        [root@localhost ~]# grep "^[[:space:]]\+[^[:space:]]" /etc/grub2.cfg

                                    4. In the output of "netstat -tan", find the lines that end with 'LISTEN' followed by zero, one, or more whitespace characters;

                                        [root@localhost ~]# netstat -tan | grep "LISTEN[[:space:]]*$"
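Exercises 1 and 2 can be tried without touching the real /etc/passwd. The sketch below assumes GNU grep and uses a made-up passwd excerpt; adding -o to exercise 2 is a choice made here to show only the numbers rather than the whole lines.

```shell
# Hypothetical excerpt in /etc/passwd format.
passwd_sample='root:x:0:0:root:/root:/bin/bash
bin:x:1:1:bin:/bin:/sbin/nologin
ftp:x:14:50:FTP User:/var/ftp:/sbin/nologin
user1:x:1000:1000::/home/user1:/bin/bash'

# Exercise 1: lines whose last field is not /bin/bash.
not_bash=$(printf '%s\n' "$passwd_sample" | grep -v "/bin/bash$")

# Exercise 2: with -o, print just the two- or three-digit numbers.
# The word anchors reject 1000 (four digits) and the "1" inside "user1".
nums=$(printf '%s\n' "$passwd_sample" | grep -o "\<[[:digit:]]\{2,3\}\>")

printf '%s\n' "$nums"
```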


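                                Grouping and references

                                    \(\): binds one or more characters together so they are treated as a single unit;

                                        \(xy\)*ab

                                    Note: the text matched by the pattern inside each pair of grouping parentheses is automatically recorded by the regular expression engine in internal variables. These variables are:

                                        \1: the text matched by the pattern between the first left parenthesis (counting from the left of the pattern) and its matching right parenthesis;

                                        \2: the text matched by the pattern between the second left parenthesis (counting from the left of the pattern) and its matching right parenthesis;

                                        \3

                                        …

                                Back-reference: refers to the text matched by the pattern in an earlier pair of grouping parentheses. For example, given a file lovers.txt containing the four lines:

                                        He likes his lover.

                                        He loves his lover.

                                        She likes her liker.

                                        She loves her lover.


                                        grep "\(l..e\).*\1" lovers.txt

                                    This prints the last three lines. The first line is not printed because its two l..e strings differ ("like" and "love").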
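The lovers.txt example can be run as follows. This is a sketch assuming GNU grep; the temporary directory and the comparison pattern without a back-reference are additions made for the illustration.

```shell
# Recreate lovers.txt in a temporary directory.
tmpdir=$(mktemp -d)
cat > "$tmpdir/lovers.txt" <<'EOF'
He likes his lover.
He loves his lover.
She likes her liker.
She loves her lover.
EOF

# \(l..e\) captures a four-character string such as "like" or "love";
# \1 then requires that same string to appear again later on the line.
backref=$(grep "\(l..e\).*\1" "$tmpdir/lovers.txt")

# Without the back-reference the second l..e may differ from the first,
# so the first line ("like" ... "love") is counted as well.
no_backref=$(grep -c "l..e.*l..e" "$tmpdir/lovers.txt")

# A group can also be repeated as a unit: zero or more "xy", then "ab".
group=$(printf '%s\n' 'xyxyab' 'xab' | grep -o "\(xy\)*ab")

printf '%s\n' "$backref"
rm -rf "$tmpdir"
```

The difference between the two patterns is the point of the back-reference: \1 repeats the matched text, not the pattern.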

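    2.3. Extended regular expressions: grep -E and egrep

        egrep:

                        Provides grep-like text filtering with support for extended regular expressions; equivalent to grep -E

                        egrep [OPTIONS] PATTERN [FILE...]

                    Options:

                        -i, -o, -v, -q, -A, -B, -C

                        -G: use basic regular expressions

                        Extended regular expression metacharacters:

                    Character matching

                        . : any single character

                        []: any single character within the specified set

                        [^]: any single character outside the specified set

                    Quantifiers

                        *: any number of times: 0, 1, or more;

                        ?: 0 or 1 times; the preceding character is optional;

                        +: the preceding character at least once

                        {m}: the preceding character exactly m times

                        {m,n}: at least m times and at most n times;

                        {0,n}: at most n times

                        {m,}: at least m times

                    Position anchors

                        ^: beginning-of-line anchor;

                        $: end-of-line anchor;

                        \<, \b: word-start anchor;

                        \>, \b: word-end anchor;

                    Grouping and references

                        (): grouping; the text matched by the pattern inside the parentheses is recorded in internal variables of the regular expression engine;

                        Back-references: \1, \2 …

                        Alternation:

                            a|b: a or b;

                                C|cat: C or cat

                                (C|c)at: Cat or cat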
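The alternation rules and the BRE/ERE spelling difference can be seen in a small sketch. It assumes GNU grep; the sample lines are invented, and -x (match the whole line only), which is not covered above, is used here so that partial matches do not blur the result.

```shell
# Hypothetical sample lines.
sample='Cat
cat
C
at'

# In ERE, | binds loosest: C|cat means "C" or "cat".
alt=$(printf '%s\n' "$sample" | grep -E -x "C|cat")

# Parentheses limit the alternation: (C|c)at means "Cat" or "cat".
grouped=$(printf '%s\n' "$sample" | grep -E -x "(C|c)at")

# The same interval written in BRE and in ERE selects the same lines.
bre=$(printf '%s\n' 'ab' 'aab' 'aaab' | grep 'a\{2,3\}b')
ere=$(printf '%s\n' 'ab' 'aab' 'aaab' | grep -E 'a{2,3}b')

printf '%s\n' "$alt"
```

In short, ERE drops the backslashes that BRE needs in front of ?, +, {}, and (), and adds |.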

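                        Exercises:

                            1. Find all lines in /proc/meminfo that begin with an uppercase or lowercase s, in at least three different ways;

                                ~]# grep -i "^s"  /proc/meminfo

                                ~]# grep "^[sS]" /proc/meminfo

                                ~]# grep -E "^(s|S)" /proc/meminfo

                            2. Display the information for the users root, centos, or user1 on the current system;

                                ~]# grep -E "^(root|centos|user1)\>" /etc/passwd

                            3. Find the lines in /etc/rc.d/init.d/functions in which a word is followed by a pair of parentheses;

                                ~]# grep -E "[_[:alnum:]]+\(\)" /etc/rc.d/init.d/functions

                            4. Use echo to output an absolute path, then use egrep to extract its base name;

                                ~]# echo /etc/sysconfig/ | egrep -o "[^/]+/?$"

                                ~]# echo /etc/sysconfig/network-scripts/ | egrep -o "[[:alnum:]-]+/?$"

                            Going further: extract its directory name, similar to the result of running the dirname command on it;

                                ~]# echo /etc/sysconfig/network-scripts/ | egrep -o "^/.*(/[[:alnum:]])" | egrep -o "^/.*/" | egrep -o  "^.*[-[:alnum:]]"

                            5. Find the values between 1 and 255 in the output of the ifconfig command;

                                ~]# ifconfig | egrep -o "\<([1-9]|[1-9][0-9]|1[0-9]{2}|2[0-4][0-9]|25[0-5])\>"

                            6. Homework: find the IP addresses in the output of the ifconfig command (the bracket expression must be [01]; writing [0,1] would also accept a comma);

                                ~]# ifconfig |egrep "(\<([01]?[0-9]?[0-9]|2[0-4][0-9]|25[0-5])\>\.){3}\<([01]?[0-9]?[0-9]|2[0-4][0-9]|25[0-5])\>"

                            7. Add the users bash, testbash, basher, and nologin (whose shell is /sbin/nologin); then find the lines in /etc/passwd where the user name is the same as the shell name. The first command accepts any word that is repeated at the end of the line; the second anchors the group to the user name at the start of the line, so it is the stricter of the two;

                                ~]# cat /etc/passwd | egrep "(\<[[:alnum:]]+\>).*\1$"

                                ~]# grep -E "^([^:]+\>).*\1$" /etc/passwd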
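Several of these exercises can be checked against fixed input instead of live ifconfig or /etc/passwd output. The sketch below assumes GNU grep; the sample data and the variable names (octet, ip_re, passwd_sample) are invented for the illustration, and grep -E is used in place of egrep.

```shell
# Exercise 4: base name of a path (this pattern keeps the trailing slash).
base=$(echo /etc/sysconfig/network-scripts/ | grep -E -o "[^/]+/?$")

# Exercise 5: whole numbers from 1 to 255 only; 0, 256 and 1000 are rejected.
octet='\<([1-9]|[1-9][0-9]|1[0-9]{2}|2[0-4][0-9]|25[0-5])\>'
nums=$(printf '%s\n' 0 7 42 199 255 256 1000 | grep -E -o "$octet")

# Exercise 6: dotted-quad addresses, extracted with -o from a made-up line.
part='\<([01]?[0-9]?[0-9]|2[0-4][0-9]|25[0-5])\>'
ip_re="(${part}\.){3}${part}"
ips=$(printf '%s\n' 'inet 192.168.1.10 netmask 255.255.255.0' | grep -E -o "$ip_re")

# Exercise 7: user name equal to the base name of the shell.
passwd_sample='root:x:0:0:root:/root:/bin/bash
bash:x:1001:1001::/home/bash:/bin/bash
testbash:x:1002:1002::/home/testbash:/bin/bash
basher:x:1003:1003::/home/basher:/bin/bash
nologin:x:1004:1004::/home/nologin:/sbin/nologin'
same=$(printf '%s\n' "$passwd_sample" | grep -E "^([^:]+\>).*\1$")

printf '%s\n' "$same"
```

Building the address pattern from a reusable part variable is only a readability choice; the resulting expression is the same as the one-line form in exercise 6.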

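     2.4. fgrep

                    fgrep: does not support regular expression metacharacters; every character in the pattern is taken literally (equivalent to grep -F);

                            When the pattern does not need any metacharacters, fgrep generally performs better;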
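The difference between literal and regular expression matching shows up as soon as the pattern contains a metacharacter. This is a sketch assuming GNU grep; it uses grep -F, the option form of fgrep, and invented sample lines.

```shell
# Hypothetical sample lines.
sample='192.168.1.1
192x168y1z1
a.*b
aXXb'

# As a regular expression, each "." matches any character, so the first
# two lines both match.
regex_hits=$(printf '%s\n' "$sample" | grep -c '192.168.1.1')

# With -F the dots are literal, so only the real address matches.
fixed_hits=$(printf '%s\n' "$sample" | grep -F -c '192.168.1.1')

# -F also makes it easy to search for text that looks like a pattern.
literal=$(printf '%s\n' "$sample" | grep -F 'a.*b')

printf '%s\n' "$literal"
```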


Original article by Future. If you repost it, please credit the source: http://www.178linux.com/9671
