Selecting text within two patterns using the command line

Question

I am trying to extract text from within a large file, however I am only interested in the text between two patterns.

Sample text looks like this:

0409CharlesRactive

My desired output should be only the text within the name tag, nothing before and nothing after. In example:

Output: Charles

In this case the starting pattern is and ending pattern

How can I achieve this using grep/sed/awk?

Ed Morton · Accepted Answer

Using GNU awk for multi-char RS:

$ awk -v RS='' '!(NR%2)' file
Charles

The above will work whether or not there are newlines anywhere in your input file and no matter how many times ... appears on one line or split across lines, it only requires that and always appear as pairs in the input file:

$ cat file
CharlesWilliam
Edward

   John Boy Walton   
$ awk -v RS='' '!(NR%2)' file
Charles
William
Edward

   John Boy Walton

and if you want to strip any leading/trailing white space from the names it's a simple tweak:

$ awk -v RS='[[:space:]]*[[:space:]]*' '!(NR%2)' file
Charles
William
Edward
John Boy Walton

Selecting text within two patterns using the command line

Answers (2)

Related Questions