How to extract the word after pattern, but the word is in next line?

Question

I want to extract every word that comes after the pattern, however, I can only extract the word is in the same line with the pattern, if the word is come right after a line break I'm not able to get it. For example,

Gary is a college student.
Steve and John are college
teachers.

I want to extract "student" and "teachers", but I only got "student" back. My solution is

grep -oP '(?<=college )[\w+]*' | sort | uniq

Shawn · Accepted Answer

Tools like grep are fundamentally line oriented. GNU grep has a -z option to use 0 bytes as delimiters instead of newlines, though, which will let you treat the input file as a single big 'line':

$ grep -Pzo 'college\s+\K\w+' input.txt | tr '\0' '
'
student
teachers

How to extract the word after pattern, but the word is in next line?

Answers (2)

Related Questions