Printing Columns of two files if content of first column in both file matches in Unix

Question

I have two files as below

file1
name|address office
AK|Victoria Street
BK|Admond Street
DK|Business Street

file2
name|address home
AK|Nilofer Villa
ck|Bluewaters
bk|Homingo Apartment

the command or line of code should compare the first column of the two files and merge the columns as name|address office|address home, and replace the NA wherever is not matched,content of the files can be huge. full output should be as below

file3
name|address office|address home
AK|Victoria Street |Nilofer Villa
BK|Admond Street|Homingo Apartment
DK|Business Street|NA
CK|NA|Bluewaters

here is what I have tried so far:

awk -F '|' 'NR==FNR{c[$1]++;next};c[$1] > 0' file1 file2

but above lines of code are not merging, just producing the output as difference based on column name. that too case sensitive
name|address home AK|Nilofer Villa

Please Help, have checked few questions also, but not solving my purpose.

Ed Morton · Accepted Answer

$ cat tst.awk
BEGIN { FS=OFS="|" }
{
    name = (FNR>1 ? toupper($1) : $1)
    if (!seen[name]++) {
        names[++numNames] = name
        vals[name,1] = vals[name,2] = "NA"
    }
    vals[name,ARGIND] = $2
}
END {
    for (nameNr=1; nameNr<=numNames; nameNr++) {
        name = names[nameNr]
        print name, vals[name,1], vals[name,2]
    }
}

$ awk -f tst.awk file1 file2
name|address office|address home
AK|Victoria Street|Nilofer Villa
BK|Admond Street|Homingo Apartment
DK|Business Street|NA
CK|NA|Bluewaters

The above uses GNU awk for ARGIND, with other awks just add FNR==1{ARGIND++} at the start of the script.

Printing Columns of two files if content of first column in both file matches in Unix

Answers (2)

Related Questions