AWK using file to remove csv rows

Question

I have the following csv:

old.csv

irrelevant,irrelevant,Abc@gmail.com,irrelevant
irrelevant,irrelevant,zyx@gmail.com,irrelevant
irrelevant,irrelevant,yZ@yahoo.com,irrelevant
irrelevant,irrelevant,that@email.com,irrelevant
irrelevant,irrelevant,this@email.com,irrelevant
irrelevant,irrelevant,def@gmail.com,irrelevant
irrelevant,irrelevant,anoTher@mydomain.com,irrelevant

that I need to remove the rows containing emails from this file:

remove.txt

abc@gmail.com
yz@yahoo.com
this@email.com
another@mydomain.com

And I need the output to be this:

new.csv

irrelevant,irrelevant,zyx@gmail.com,irrelevant
irrelevant,irrelevant,that@email.com,irrelevant
irrelevant,irrelevant,def@gmail.com,irrelevant

I've tried this, but it doesn't work. Can anyone help?

awk -F, 'BEGIN{IGNORECASE = 1};NR==FNR{remove[$1]++;next}!($1 in remove)' remove.txt old.csv > new.csv

Ed Morton · Accepted Answer

IGNORECASE is gawk-specific, you may not be using gawk.
You're testing the wrong field.
Incrementing the array element does nothing useful.

Try this:

awk -F, 'NR==FNR{remove[tolower($1)];next}!(tolower($3) in remove)' remove.txt old.csv > new.csv

AWK using file to remove csv rows

Answers (2)

Related Questions