awk: Output to different processes

Question

I have awk script which splits big file into several files by some condition. Than I'm running another script over each file in parallel.

awk -f script.awk -v DEST_FOLDER=tmp input.file
find tmp/ -name "*.part" | xargs -P $ALLOWED_CPUS --replace --verbose /bin/bash -c "./process.sh {}"

The question is: are there any way to run ./process.sh:

before first script is done, because process.sh processes file line by line (one line too long to be passed to xargs directly);
each new file has a header (added in script.awk) that should be run before the rest of file;
limit amount of parallel processes;
GNU parallel,inotifywait is not an option;
assume dest folder is empty, files name are unknown.

The purpose of optimization to get rid of waiting until the awk is done while some files are ready to be processed.

hek2mgl · Accepted Answer

Once you have created a file, you can pass the filename to a process' or script's input:

awk '{print name_of_created_file | "./process.sh &"}'

& sends process.sh to the background, so that they can run in parallel. However, this is a gawk extension and not POSIX. Check the manual

Answers (2)