Regex pattern as variable in AWK

Question

Let's say I have a file with multiple fields, and field 1 needs to be filtered on 2 conditions. I was thinking of turning those conditions into regex patterns and passing them as variables to the awk statement. For some reason, they are not filtering the records at all. Here is my attempt: it runs without errors, but it doesn't filter the results per the conditions. The same regexes do work when written directly in the awk script instead of being assigned to variables.

regex1="/abc|def/"; # match first field for abc or def;
regex2="/123|567/"; # and also match the first field for 123 or 567;

cat file_name \
| awk -v pat1="${regex1}" -v pat2="${regex2}" 'BEGIN{FS=OFS="	"} {if ( ($1~pat1) && ($1~pat2) ) print $0}'

Update: Fixed a syntax error caused by missing parentheses around the if conditions in the awk script. (I had it fixed in the code I ran.)

Sample data

abc:567    1
egf:888    2

Expected output

abc:567    1

The problem is that I am getting all the records instead of only the ones that satisfy both regexes on field 1.

Note that the match needs to be a partial (wildcard) match rather than an exact match, meaning that 567 as defined in the regex pattern should also match 567_1 if present.

Ed Morton · Accepted Answer

It seems like the way to implement what you want to do would be:

awk -F'	' '
($1 ~ /abc|def/) && 
($1 ~ /123|567/)
' file

or probably more robustly:

awk -F'	' '
{ split($1,a,/:/) }
(a[1] ~ /abc|def/) && 
(a[2] ~ /123|567/)
' file

What's wrong with that?
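To illustrate why the split version may be more robust, here is a small sketch. The function names and the 123:abc input line are hypothetical, added only for this illustration; the regexps are the ones above. Matching against the whole of $1 accepts a record where the two parts appear in swapped positions, while splitting on the colon does not.

```shell
#!/bin/sh
# Hypothetical wrappers around the two awk scripts shown above.
match_whole_field() {
    awk -F'\t' '($1 ~ /abc|def/) && ($1 ~ /123|567/)'
}

match_split_field() {
    awk -F'\t' '
    { split($1, a, /:/) }    # a[1] = part before the colon, a[2] = part after
    (a[1] ~ /abc|def/) && (a[2] ~ /123|567/)
    '
}

# First line is from the question; the second line is made up for this test.
printf 'abc:567\t1\n123:abc\t2\n' | match_whole_field   # prints both lines
printf 'abc:567\t1\n123:abc\t2\n' | match_split_field   # prints only abc:567
```

Whether the swapped record should be accepted depends on the real data, so the stricter form is only preferable if the part before the colon and the part after it carry different meanings.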

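EDIT: here is me running the OP's code before and after removing the regexp delimiters (/) from the dynamic regexp strings. When a regexp is stored in a string variable, the slashes are not delimiters but literal characters that would have to appear in the input: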

$ cat tst.sh
#!/usr/bin/env bash

regex1="/abc|def/"; #--match first field for abc or def;
regex2="/123|567/"; #--and also match the first field for 123 or 567;

cat file_name \
| awk -v pat1="${regex1}" -v pat2="${regex2}" 'BEGIN{FS=OFS="	"} $1~pat1 && $1~pat2'

echo "###################"

regex1="abc|def"; #--match first field for abc or def;
regex2="123|567"; #--and also match the first field for 123 or 567;

cat file_name \
| awk -v pat1="${regex1}" -v pat2="${regex2}" 'BEGIN{FS=OFS="	"} $1~pat1 && $1~pat2'
$
$ ./tst.sh
###################
abc:567    1
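
The same comparison can be sketched without a data file. This is a minimal example, assuming tab-separated input as in the question; the function name filter_first_field is hypothetical and not part of the original script. It passes the regexp strings to awk with -v and shows that the version without slashes selects the expected record, while the version with slashes selects nothing, because "/abc|def/" as a dynamic regexp means "the literal text /abc, or the literal text def/".

```shell
#!/bin/sh
# Hypothetical helper: print records whose first tab-separated field matches
# both dynamic regexps. $1 and $2 are regexp strings WITHOUT surrounding slashes.
filter_first_field() {
    awk -v pat1="$1" -v pat2="$2" 'BEGIN { FS = OFS = "\t" } $1 ~ pat1 && $1 ~ pat2'
}

# Sample data from the question, fed on stdin.
# Without slashes: prints the abc:567 record.
printf 'abc:567\t1\negf:888\t2\n' | filter_first_field 'abc|def' '123|567'

# With slashes: prints nothing, since no field contains "/abc" or "def/".
printf 'abc:567\t1\negf:888\t2\n' | filter_first_field '/abc|def/' '/123|567/'
```

One caveat with -v is that awk processes backslash escape sequences in the assigned value, so a regexp containing backslashes (for example \. for a literal dot) would need the backslashes doubled.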
