awk: find matching pattern1 in file containing pattern2

Question

I am parsing many files with awk and looking for matches across them. I am stuck on how to find the file that contains pattern1 and then search for pattern2 only in that file.

Example:

file1:  
text xyz 122e345a rxyc  
abc 25b57790c

file2:  
text tio 36e79a89 opgb  
abc b0894e35o  

file3:  
text diowps aaaacc  
abc 122e345a

The result I want is:

25b57790c

where the first pattern I start from is:

122e345a

The only solution I have so far does it in two steps:

FILE=$(awk '$3 == "122e345a" {print FILENAME}' * )  
awk '$1 == "abc" {print $2}' $FILE

I can also write it as a one-liner:

awk '$1 == "abc" {print $2}' $(awk '$3 == "122e345a" {print FILENAME}' * )

But I would like to avoid calling awk twice. Can this be done in a single awk command?

Kusalananda · Accepted Answer

file != FILENAME       { found = 0 }
         $3 == a       { found = 1; file = FILENAME }
found && $1 == b       { print $2  }

or, for GNU awk:

BEGINFILE              { found = 0 }
         $3 == a       { found = 1 }
found && $1 == b       { print $2  }

This is very similar to markp's solution (and makes similar assumptions), but may be run on any number of input files without the use of a shell loop:

$ awk -f script.awk a="122e345a" b="abc" file[123]
25b57790c

Both scripts also assume that the patterns you'd like to search for are actually fixed strings in specific columns (as indicated by the question).

Since there's no way of "rewinding" a file in awk, you need to pass over the file twice if the second string may occur before the first string in the file. The code at the end of the question itself is a solution for that.
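
One way to make both passes within a single awk invocation is to list the input files twice and change a variable between the two lists. The following is only a sketch and not part of the original answer: the names pass and hit are made up for illustration, the sample files are recreated in a temporary directory, and file1 has its two lines swapped on purpose so that the "abc" line comes before the line holding the first string.

```shell
# Recreate the sample data in a scratch directory (file1 deliberately reordered).
dir=$(mktemp -d)
cd "$dir"
printf '%s\n' 'abc 25b57790c' 'text xyz 122e345a rxyc' > file1
printf '%s\n' 'text tio 36e79a89 opgb' 'abc b0894e35o' > file2
printf '%s\n' 'text diowps aaaacc' 'abc 122e345a' > file3

# pass=1 and pass=2 are ordinary var=value operands; awk applies each one
# when it reaches that position in the argument list.
out=$(awk '
    # First pass: remember every file whose third field equals a.
    pass == 1 && $3 == a                      { hit[FILENAME] }
    # Second pass: print field 2 of the b lines, but only in remembered files.
    pass == 2 && $1 == b && (FILENAME in hit) { print $2 }
' a=122e345a b=abc pass=1 file1 file2 file3 pass=2 file1 file2 file3)

printf '%s\n' "$out"
```

This still reads every file twice, so it only saves the second awk process, and it assumes the inputs are regular files that can be opened again (it would not work on data arriving through a pipe).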

Alternatively, you may save the whole file in a variable and go through that once you find the first string (that solution is not included here).
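
A minimal sketch of that idea, not part of the original answer and assuming the same fixed-string, fixed-column matching as above. As a variation, it does not keep the whole file but only the second field of each line whose first field is b, and prints the saved values once the first string turns up. The names buf and n are made up here, and file1 is again written with its lines swapped so that the buffering actually matters.

```shell
# Recreate the sample data in a scratch directory (file1 deliberately reordered).
dir=$(mktemp -d)
cd "$dir"
printf '%s\n' 'abc 25b57790c' 'text xyz 122e345a rxyc' > file1
printf '%s\n' 'text tio 36e79a89 opgb' 'abc b0894e35o' > file2
printf '%s\n' 'text diowps aaaacc' 'abc 122e345a' > file3

out=$(awk '
    # New input file: clear the flag and discard anything buffered so far.
    FNR == 1          { found = 0; n = 0 }
    # First time a is seen in this file: flush the values saved earlier.
    $3 == a && !found { found = 1; for (i = 1; i <= n; i++) print buf[i]; n = 0 }
    # A b line is printed right away if a was already seen, else it is saved.
    $1 == b           { if (found) print $2; else buf[++n] = $2 }
' a=122e345a b=abc file1 file2 file3)

printf '%s\n' "$out"
```

Each file is read only once, at the cost of holding the candidate values of the current file in memory until the first string is found or the file ends.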
