filter multiline record file based if one of the lines meet condition ( word count)

Question

everyone

I am looking for a way to keep the records from txt file that meet the following condition:

This is the example of the data:

aa bb cc 
11 22 33 
44 55 66
77 88 99 

aa bb cc 
11 22 33 44 55 66 77
44 55 66 66
77 88 99

aa bb cc 
11 22 33 44 55
44 55 66 
77 88 99 77

...

Basically, it's a file where one record where there are total 5 lines, 4 lines contain strings/numbers with tab delimeter , and the last is the new line .

The first line of the record always has 3 elements, while the number of elements in 2nd 3rd and 4th line can be different.

What I need to do is to remove every record(5 lines block) where total number of elements in the second line > 3 ( and I don't care about the number of elements in all the rest lines) . The output of the example should look like this:

aa bb cc 
11 22 33 
44 55 66
77 88 99 

...

so only the record where i have 3 elements are kept and recorded in the new txt file.

I tried to do it with awk by modifying FS and RS values like this:

awk 'BEGIN {RS="

"; FS="
";}
{if(length($2)==3) print $2"

"; }' test_filter.txt

but if(length($2)==3) is not correct, as I should count the number of entries in 2nd field instead of counting the length, which I can't find how to do.. any help would be much appreaciated!

thanks in advance,

filter multiline record file based if one of the lines meet condition ( word count)

Answers (1)

Related Questions