Get first N occurrences of uniq lines, not only one

Question

I have a file whose rows each contain two fields separated by whitespace:

fieldA fieldX
fieldB fieldX
fieldC fieldX
fieldD fieldX
fieldE fieldX
fieldA fieldY
fieldB fieldY
fieldC fieldY

I need to get the first N rows for each value in the second column. What I do is sort -k2 | uniq -f1 --all-repeated=prepend | grep "^$" -A3, which should work, but uniq -f1 gives me something different from uniq -f1 --all-repeated=prepend. Do I understand correctly that prepend should only add an empty line before each unique chunk?

Or is there a better approach?

Thanks

twalberg · Accepted Answer

Here's one idea using awk (set maxlines to your N; 3 is used here only to match the -A3 in the question):

awk -v maxlines=3 ' ++count[$2] <= maxlines { print } '

That will not require sorting the file (but you could still sort it first if there are other reasons you want to...).
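
To make the idea concrete, here is a minimal sketch run against the sample rows from the question. The wrapper name first_n_per_key is hypothetical and only there so that N can be passed as an argument. The awk program keeps a counter per distinct value of the second field and prints a line only while that counter has not yet exceeded N.

```shell
# first_n_per_key N: print the first N lines seen for each distinct value
# of the second whitespace-separated field, preserving input order.
# (first_n_per_key is a hypothetical helper name, not part of the answer.)
first_n_per_key() {
    # count[$2] starts at 0 for an unseen key; a pattern with no action prints the line.
    awk -v maxlines="$1" '++count[$2] <= maxlines'
}

# Sample data from the question, limited to 2 rows per second-column value.
printf '%s\n' \
    'fieldA fieldX' 'fieldB fieldX' 'fieldC fieldX' 'fieldD fieldX' \
    'fieldE fieldX' 'fieldA fieldY' 'fieldB fieldY' 'fieldC fieldY' |
    first_n_per_key 2
```

With N=2 this should print fieldA and fieldB for fieldX, then fieldA and fieldB for fieldY. Because the counters are keyed on the field value, rows with the same second field do not need to be adjacent in the input.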

