RogueBaneling

Reputation: 4471

Grep files in between wget recursive downloads

I am trying to recursively download several files using wget -m, and I intend to grep all of the downloaded files for specific text. Currently, I wait for wget to complete fully and then run grep. However, wget is time-consuming because there are many files, so instead I would like to show progress by grepping each file as it downloads and printing matches to stdout, before the next file downloads.

Example:

download file1
  grep file1 >> output.txt
download file2
  grep file2 >> output.txt
...
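A minimal sketch of this flow, using a hypothetical fetch function as a stand-in for the real download step (fetch, file1, file2, and pattern are placeholders, not part of any real tool):

```shell
# fetch is a hypothetical stand-in for: wget -q -O "$f" "$url"
fetch() { printf 'contents of %s with pattern\n' "$1" > "$1"; }

: > output.txt                        # start with an empty results file
for f in file1 file2; do
  fetch "$f"                          # download the file
  grep 'pattern' "$f" >> output.txt   # grep it before the next download starts
done
rm -f file1 file2
```

This only works when the file list is known up front, which wget -m's recursive crawl does not provide; the answers below work around that.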

Thanks for any advice on how this could be achieved.

Upvotes: 5

Views: 2444

Answers (2)

RogueBaneling

Reputation: 4471

Based on Xorg's solution I was able to achieve my desired effect with some minor adjustments:

wget -m -O file.txt http://google.com 2> /dev/null & sleep 1 && tail -f -n1 file.txt | grep pattern

This prints every line containing pattern to stdout, while wget itself produces no output visible in the terminal. The sleep is included because otherwise file.txt would not exist by the time the tail command runs.

As a note, this command will miss any results that wget downloads within the first second.
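That one-second gap can be closed by creating file.txt before wget starts and telling tail to read from the first line with -n +1, so nothing is missed and no sleep is needed. A sketch using a background writer loop as a stand-in for the real wget job (which would be wget -m -O file.txt <url> 2> /dev/null &); --pid is a GNU tail extension:

```shell
touch file.txt                        # create the file up front; no sleep needed

# Stand-in for the wget job: appends a few lines over time.
( for i in 1 2 3; do echo "hit $i: pattern"; sleep 0.2; done ) >> file.txt &
writer=$!

# -n +1 reads from line 1, so lines written before tail attaches are not missed;
# --pid (GNU tail) makes tail exit once the writer process finishes.
tail -f -n +1 --pid="$writer" file.txt | grep --line-buffered 'pattern' > output.txt

rm -f file.txt
```

With the real wget in place of the writer loop, --pid would take wget's PID, and the pipeline ends when the mirror completes.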

Upvotes: 1

repzero

Reputation: 8402

As c4f4t0r pointed out

 wget -m -O - <websites>|grep --color 'pattern'

Using grep's --color option to highlight the matched patterns is helpful, especially when dealing with bulky output in the terminal.

EDIT:

Below is a command line you can use. It creates a file called file and saves wget's output messages to it; afterwards it tails the message file.

awk finds any line containing "saved" and extracts the filename from it, then grep searches that file for the pattern.

 wget -m websites  &> file &  tail -f -n1 file|awk -F "\'|\`"  '/saved/{system( ("grep  --colour pattern ") $2)}'
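A self-contained sketch of the awk step above, run against two fake log lines in place of a live wget session (the filenames are made up; the field separator splits on wget's old-style `name' quoting, and print is used here instead of the system("grep ...") call so the extraction itself is easy to see):

```shell
# Two fake wget log lines standing in for "wget -m websites &> file".
printf "%s\n" \
  "(1.2 MB/s) - \`site/a.html' saved [100/100]" \
  "(1.1 MB/s) - \`site/b.html' saved [200/200]" > file

# Split fields on backtick or single quote: $2 becomes the saved filename.
# Prints site/a.html and site/b.html, one per line.
awk -F "[\`']" '/saved/ { print $2 }' file

rm -f file
```

Note that newer wget versions quote filenames with Unicode quotation marks ('name') in some locales, so the separator may need adjusting.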

Upvotes: 1
