Regular expression in awk in bash shell script

Question

I'm totally a regular expression newbie and I think the problem of my code lies in the regular expression I use in match function of awk.

#!/bin/bash
...
line=$(sed -n '167p' models.html)
echo "line: $line"
cc=$(awk -v regex="[0-9]" 'BEGIN { match(line, regex); pattern_match=substr(line, RSTART, RLENGTH+1); print pattern_match}')
echo "cc: $cc"

The result is:

line:  0.97
cc:

In fact, I want to extract the numerical value 0.97 into variable cc.

jas · Accepted Answer

Three things:

You need to pass the value of line into awk with -v:

awk -v line="$line" ...

Your regular expression only matches a single digit. To match a float, you want something like

[0-9]+\.[0-9]+

No need to add 1 to the match length for the substring

substr(line, RSTART, RLENGTH)

Putting it all together:

line='0.97'
echo "line: $line"
cc=$(awk -v line="$line" -v regex="[0-9]+\.[0-9]+" 'BEGIN { match(line, regex); pattern_match=substr(line, RSTART, RLENGTH); print pattern_match}')
echo "cc: $cc"

Result:

line: 0.97
cc: 0.97

Regular expression in awk in bash shell script

Answers (2)

Related Questions