Output the line number when there is a matching value, for each column

Question

Say I've got a file.txt

Position name1 name2 name3
       2     A     G     F
       4     G     S     D
       5     L     K     P
       7     G     A     A
       8     O     L     K
       9     E     A     G

and I need to get the output:

name1 name2 name3
    2     2     7
    4     7     9
    7     9

It outputs each name, and the position numbers where there is an A or G

In file.txt, the name1 column has an A in position 2, G's in positions 4 and 7... therefore in the output file: 2,4,7 is listed under name1 ...and so on

Strategy I've devised so far (not very efficient): reading each column one at a time, and outputting the position number when a match occurs. Then I'd get the result for each column and cbind them together using r.

I'm fairly certain there's a better way using awk or bash... ideas appreciated.

Ed Morton · Accepted Answer

$ cat tst.awk
NR==1 {
    for (nameNr=2;nameNr<=NF;nameNr++) {
        printf "%5s%s", $nameNr, (nameNr maxHits ? numHits[nameNr] : maxHits)
        }
    }
}
END {
    for (hitNr=1; hitNr<=maxHits; hitNr++) {
        for (nameNr=2;nameNr<=NF;nameNr++) {
            printf "%5s%s", hits[nameNr,hitNr], (nameNr

Output the line number when there is a matching value, for each column

Answers (2)

Related Questions