awk: how to set the record separator to consecutive empty lines or lines containing only space and/or tab characters?

Question

I know I can use RS="" to set the record separator to one or more consecutive empty lines. However, this does not work if those lines contain space or tab characters. I'm thinking of setting RS to some kind of regular expression to do the match, but that's hard, since in this case a newline will often be used as the field separator FS. Any suggestions?

Jotne · Accepted Answer

Here is a way to do it:

awk '!NF {$0=""}1' file | awk -v RS="" '{print NR,$0}'

The first awk counts the fields on each line. The count is 0 for blank lines and for lines containing only spaces and tabs, so those lines are replaced with an empty string. After that, you can use RS="" as usual.
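
As a small worked example of the two-stage pipeline above, here is a sketch that wraps it in a shell function and feeds it sample input. The function name split_records and the sample data are made up for illustration; the separators in the sample are one line of space plus tab followed by an empty line, and one line holding a single tab.

```shell
# Stage 1 blanks out lines with no fields; stage 2 splits on empty lines
# (paragraph mode). split_records is a hypothetical helper name.
split_records() {
  awk '!NF { $0 = "" } 1' | awk -v RS="" '{ print NR, $0 }'
}

# Sample input: "a b" and "c" form one record, then "d e", then "f".
printf 'a b\nc\n \t\n\nd e\n\t\nf\n' | split_records
# prints:
# 1 a b
# c
# 2 d e
# 3 f
```

This form should work with any POSIX awk, since it does not rely on a regular expression in RS.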

Here is a GNU awk version (it relies on a multi-character RS, which gawk treats as a regular expression):

awk -v RS="\n([[:space:]]*\n)+" '{print NR,$0}' file

It may work without the parentheses, but I am not sure whether all cases are covered then:

awk -v RS="\n[[:space:]]*\n+" '{print NR,$0}' file
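
One way to check the two patterns against each other is to count the records each one produces on the same input. The sketch below assumes gawk (or another awk that accepts a regular expression in RS); the helper names sample and count_records and the test data are invented for illustration. Because [[:space:]] itself matches a newline, both patterns appear to accept the same strings (a newline, optional whitespace, then a final newline), so the counts should agree.

```shell
# Hypothetical sample: two separators, one made of a " \t" line plus an
# empty line, the other a line holding a single tab.
sample() { printf 'a b\nc\n \t\n\nd e\n\t\nf\n'; }

# Count the records produced by a given RS regex (assumes regex RS support).
count_records() { awk -v RS="$1" 'END { print NR }'; }

sample | count_records '\n([[:space:]]*\n)+'   # with parentheses
sample | count_records '\n[[:space:]]*\n+'     # without parentheses
```

Note that, unlike RS="", a regex separator does not strip the newline at the very end of the file, so the last record may carry a trailing newline when printed.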
