grep,egrep及相应的正则表达式用法详解

目录

1、grep简介

    1.1、grep是什么

    1.2、grep使用格式

    1.3、grep常用选项

2、正则表达式与grep

    2.1、正则表达式简介

    2.2基本正则表达式grep

    2.3、扩展正则表达式grep -E、egrep

     2.4、fgrep


1、grep简介

    1.1、grep是什么

                  grep (Global search REgular expression and Print out the line)

                    作用:文本搜索工具,根据用户指定的文本模式(正则表达元字符以及正常字符组合而成)对目标文件进行逐行搜索,显示能匹配到的行;    

                    模式:由正则表达式的原字符及文本字符所编写的过滤条件;

    

    1.2、grep使用格式

                    grep [OPTIONS] PATTERN [FILE…]

                            [root@localhost ~]# grep "UUID" /etc/fstab 

                    grep [OPTIONS] [-e PATTERN | -f FILE] [FILE…]

                        [root@localhost ~]# grep -e "UUID" /etc/fstab 

                 

    1.3、grep常用选项

                OPTIONS:

                    –color=auto:对匹配到的文本着色后高粱显示;

                    -i, –ignore-case:忽略字符的大小写;

                    -o:仅显示匹配到的字符串本身;

                    -v, –invert-match:显示不能被模式匹配到的行;

                    -E:支持使用扩展的正则表达式元字符;

                    -q, –quiet, –silent:静默模式,即不输出任何信息;

                    -A #:after,后#行

                    -B #:before,前#行

                    -C #:context,前后各#行


2、正则表达式与grep

    2.1、正则表达式简介

              正则表达式:Regual Expression,REGEXP

                    由一类特殊字符及文本字符所编写的模式,其中有些字符不表示其字面意义(元字符),而是用于表示控制或通配的功能;

                元字符:\(hell0[[:space:]]\+\)\+

                分两类:

                        基本正则表达式:BRE

                        扩展正则表达式:ERE

    2.2基本正则表达式与grep

                基本正则表达式元字符:

                                    字符匹配:

                                            . :匹配任意单个字符;

                                            []:匹配指定范围内的任意单个字符;

                                            [^]:匹配指定范围外的任意单个字符;

                                            [:digit:]、[:lower:]、[:upper:]、[:alpha:]、[:alnum:]、[:punct:]、[:space:]


                                    匹配次数:用在要指定其出现的次数的字符的后面,用于限制其前面字符出现的次数;默认工作于贪婪模式;

                                            *:匹配其前面的字符任意次;0,1,多次;

                                                例如:grep "x*y"

                                                           abxy

                                                           aby

                                                           xxxxxy

                                                           yab

                                            .*:匹配任意长度的任意字符

                                            \?:匹配其前面的字符0次或1次;即其前面的字符是可有可无的;

                                                例如:grep "x\?y"

                                                           abxy

                                                           aby

                                                           xxxxxy

                                                           yab

                                            \+:匹配其前面的字符1次或多次;即其前面的字符要出现出现至少1次;

                                                例如:grep "x\+y"

                                                           abxy

                                                           xxxxxy

                                            \{m\}:匹配其前面的字符m次;

                                                例如:grep "x\{2\}y"

                                                           xxxxxy

                                            \{m,n\}:匹配其前面的字符至少m次,至多n次;

                                            \{0,n}:至多n次

                                            \{m,\}:至少m次

                                                例如:grep "x\{1,5\}y"

                                                           abxy

                                                           xxxxxy

                                位置锚定:

                                            ^:行首锚定;由于模式的最左侧;

                                            $:行尾锚定;用于模式的最右侧;

                                            ^PATTERN$:用PATTERN来匹配整行;

                                            ^$:空白行;

                                            ^[[:space:]]*$:空行或包含空白字符的行;

                                单词:非特殊字符组成的连续字符(字符串)都称为单词;

                                        \<或\b:词首锚定,用于单词模式的左侧;

                                        \>或\b:词尾锚定,用于单词模式的右侧;

                                        \<PATTERN\>:匹配完整单词;


                                练习:

                                    1、显示/ect/passwd文件中不以/bin/bash结尾的行;

                                        [root@localhost ~]# grep -v "/bin/bash$" /etc/passwd

                                    2、找出/etc/passwd文件中两位数或三位数;

                                        [root@localhost ~]# grep "\<[[:digit:]]\{2,3\}\>" /etc/passwd

                                        [root@localhost ~]# egrep "\<[0-9]{2,3}\>" /etc/passwd

                                    3、找出/etc/rc.d/rc.sysinit或/etc/group2.cfg文件中,以至少一个空白字符开头,且后面非空白字符的行;

                                        [root@localhost ~]# grep "^[[:space:]]\+[^[:space:]]" /etc/grub2.cfg

                                    4、找出"netstat -tan"命令的结构中以'LISTEN'后跟0、1或多个空白字符结尾的行;

                                        [root@localhost ~]# netstat -tan | grep "LISTEN[[:space:]]*$"


                                分组及引用

                                    \(\):将一个或多个字符捆绑在一起,当作一个整体进行处理;

                                        \(xy\)*ab

                                    Note:分组括号中的模式匹配到的内容会被正则表达式引擎自动记录于内部的变量中,这些变量为:

                                        \1:模式从左侧起,第一个左括号以及与之匹配的右括号之间的模式所匹配到的字符;

                                        \2:模式从左侧起,第二个左括号以及与之匹配的右括号之间的模式所匹配到的字符;

                                        \3

                                        …

                                        He likes his lover.

                                        He loves his lover.

                                        She likes her liker.

                                        She loves her lover.


                                        grep "\(l..e\).*\1" lovers.txt


                                后向引用:引用前面的分组括号中的模式所匹配到的字符;

    2.3、扩展正则表达式grep -E、egrep

        egrep:

                        支持扩展的正则表达式实现类似grep文本过滤功能;grep -E

                        egrep [OPTIONS] PATTERN [FILE…]

                    选项:

                        -i, -o, -v, -q, -A, -B, -C

                        -G:支持基本正则表达式

                        扩展正则表达式的元字符:

                    字符匹配

                        . :任意单个字符

                        []:指定范围内的任意单个字符

                        [^]:指定范围外的任意单个字符

                    次数匹配

                        *:任意次,0,1或多次;

                        ?:0次或1次,其前的字符是可有可无的;

                        +:其前字符至少1次

                        {m}:其前的字符m次

                        {m,n}:至少m次,至多n次;

                        {0,n}

                        {m,}

                    位置锚定

                        ^:行首锚定;

                        $:行尾锚定;

                        \<,\b:词首锚定;

                        \>,\b:词尾锚定;

                    分组及引用

                        ():分组;括号内的模式匹配到的字符会被记录于正则表达式引擎内部变量中;

                        后向引用:\1, \2 …

                        或:

                            a|b:a或者b;

                                C|cat:C或者cat

                                (C|c)at:Cat或者cat

                        练习:

                            1、找出/proc/meminfo文件中,所有以大写或小写s开头的行,至少有三种实现方式;

                                ~]# grep -i "^s"  /proc/meminfo

                                ~]# grep "^[sS]" /proc/meminfo

                                ~]# grep -E "^(s|S)" /proc/meminfo 

                            2、显示当前系统上root、centos或user1用户的相关信息;

                                ~]# grep -E "^(root|centos|user1)\>" /etc/passwd

                            3、找出/etc/rc.d/init.d/functions文件中某单词后面跟一个小括号的行;

                                ~]# grep -E "[_[:alnum:]]+\(\)" /etc/rc.d/init.d/functions

                            4、使用echo命令输出一绝对路径,使用egrep取出基名;

                                ~]# echo /etc/sysconfig/ | egrep -o "[^/]+/?$"

                                ~]# echo /etc/sysconfig/network-scripts/ | egrep -o "[[:alnum:]-]+/?$"

                            进一步:取出其路径名;类似于对其执行dirname命令的结果;

                                ~]# echo /etc/sysconfig/network-scripts/ | egrep -o "^/.*(/[[:alnum:]])" | egrep -o "^/.*/" | egrep -o  "^.*[-[:alnum:]]"

                            5、找出ifconfig命令结果中的1-255之间的数值;

                                ~]# ifconfig | egrep -o "\<([1-9]|[1-9][0-9]|1[0-9]{2}|2[0-4][0-9]|25[0-5])\>"

                            6、课外作业:找出ifconfig命令结果中的ip地址;

                                ~]# ifconfig |egrep "(\<([0,1]?[0-9]?[0-9]|2[0-4][0-9]|25[0-5])\>\.){3}\<([0,1]?[0-9]?[0-9]|2[0-4][0-9]|25[0-5])\>"

                            7、添加用户bash,testbash,basher以及nologin(其shell为/sbin/nologin);而后找出/etc/passwd文件中用户名同shell名的行;

                                ~]# cat /etc/passwd | egrep "(\<[[:alnum:]]+\>).*\1$"

                                ~]# grep -E "^([^:]+\>).*\1$" /etc/passwd

     2.4、fgrep

                    fgrep:不支持正则表达式元字符;

                            当无需要用到元字符去编写模式时,使用fgrep必能更好;


回目录











原创文章,作者:Future,如若转载,请注明出处:http://www.178linux.com/9671

(0)
FutureFuture
上一篇 2016-08-22
下一篇 2016-08-22

相关推荐

  • 推荐-File System manager

    文件系统(File system) :     文件系统概要    文件系统的分类    文件系统的管理工具             mkfs btrfs ext xfs&nbsp…

    Linux干货 2016-03-26
  • MariaDB数据类型总结

        数据类型是数据的一种属性,它决定了数据的存储格式、有效范围及其它相应的限制。MariaDB的数据类型包括:字符型、整型、浮点值、日期时间型、布尔型及内建类型。 一、字符型     1、CHAR和VARCHAR类型     &nbsp…

    Linux干货 2015-06-30
  • 关于文件权限管理了解和使用

                    文件权限管理   文件属性格式              文件属性操作 chown          设置文件的所有者…

    系统运维 2016-08-05
  • SQL优化大全

    1. 优化SQL步骤 1. 通过 show status和应用特点了解各种 SQL的执行频率        通过 SHOW STATUS 可以提供服务器状态信息,也可以使用 mysqladmin extende d-status 命令获得。 SHOW STATUS 可以根据需要显示 session 级别的统计结果和 g…

    Linux干货 2015-04-13
  • tcpdump诊断nginx问题

    自从上级公司离职后,快一年的时间没有写php程序和搭建LNMP环境,一直在做db运维和运维工具开发方面的事情。 最近决定开发开发一些自动话方面工具,重新拾起php,于是在测试机上搭建一个LNMP环境。 1、 nginx 404错误 一般的出现404错误,找不到页面资源。首先用strace排除了存在文件访问权限的问题,其次nginx的root参数设置也没有任何…

    Linux干货 2016-05-03
  • LAMP+NFS实现双web服务负载均衡

        一、实验拓扑          二、系统环境      1、主机A、主机B、主机C:CentOS 6.5        测试PC:         Windows 7 旗舰…

    Linux干货 2015-07-06