How to subtract the number of the wc -l output in bash script?

Question

I want the output to filter out the number of specific lines in a file, so I count both the content that I need and I don't need and do subtraction. But somehow the output is not changing.

Here is my script:

#!/bin/bash

for file in "$1"/*;
do
    cat "$file" | while read line;
do
    countContent1="$(grep '$$' | wc -l)"
    countContent2="$(grep '$showReview$' | wc -l)"
    valuableReviews="$(($countContent1-$countContent2))"
    echo "$(b=${file##*/}; echo ${b%.*})" $valuableReviews
done
done | sort -r -n -k 2

note that both and showReview are on the same line in the file. The output is only the number of the line contain , there's no subtraction.

Here is part of the file:

lass=
Empfehlenswert....   showReview(11348491, 'full');  
Sep 28, 2006
-1
-1
4
-1
4
-1
5
-1
4
-1

Charles Duffy · Accepted Answer

This makes more sense if you take out the inner while read loop:

#!/bin/bash

for file in "$1"/*; do
    countContent1=$(grep -c '[<]Content[>]' <"$file")
    countContent2=$(grep -c 'showReview' <"$file")
    valuableReviews=$((countContent1 - countContent2))
    b=${file##*/}; b=${b%.*}
    echo "$b $valuableReviews"
done | sort -r -n -k 2

Note:

We're redirecting "$file" into each copy of grep, so grep is counting content in the file instead of content on stdin.
We've removed the while read loop entirely, and are letting grep iterate over the individual lines of each file, rather than trying to do that in bash. (Consequently, we now run grep twice per file, not twice per line of each file).
We aren't using command substitutions unnecessarily. $(...) has a significant performance penalty (lower than running an external command, but still much higher than doing everything in the parent process).

It would be still faster to replace the entire program with just one copy of awk:

#!/bin/awk -f

/[<]Content[>]/ {
  ++allContent
  if ($0 ~ /showReview/) {
    ++valuableReviews
  }
}
FILENAME != fn {
  if(fn) { print(fn, ": ", (allContent - valuableReviews)); }
  allContent = 0; valuableReviews = 0; fn = FILENAME;
}
END {
  print(fn, ": ", (allContent - valuableReviews))
}

...called as ./theAwkScript "$1"/*

How to subtract the number of the wc -l output in bash script?

Answers (2)

Related Questions