How to combine multiple sed and awk commands?

Question

I have a folder with about 2 million files in it. I need to run the following commands:

sed -i 's//<item><title>/g;s/rel="nofollow"//g;s/<\/a> •/]]><\/wp:meta_value><\/wp:postmeta><content:encoded><![CDATA[/g;s/By <a href="http:\/\/www.website.com\/authors.*itemprop="author">/<wp:postmeta><wp:meta_key><![CDATA[custom_author]]><\/wp:meta_key><wp:meta_value><![CDATA[/g' /home/testing/*

sed -i '$a]]></content:encoded><wp:status><![CDATA[draft]]></wp:status><wp:post_type><![CDATA[post]]></wp:post_type><dc:creator><![CDATA[Database]]></dc:creator></item>\' /home/testing/*

awk -i inplace 1 ORS=' ' /home/testing/*
</code></pre>

<p>The problem I'm having is that when I run the first command, it cycles through all 2 million files, then I move on to the second command and so on.  The problem is that I'm basically having to open files 6 million times in total.</p>

<p>I'd prefer that when each file is opened, all 3 commands are run on it and then it moves on to the next.  Hopefully that makes sense.</p>

mklement0 · Accepted Answer

Assuming that your files are small enough for a single file to fit into memory as a whole (and assuming GNU sed, which your use of -i without an option-argument implies):

sed -i -e ':a;$!{N;ba}; s/.../.../g; ...; $a...' -e 's/
/ /g' /home/testing/*

^{s/.../.../g; ...; and $a... in the command above represent your actual substitution and append commands.}

:a;$!{N;ba}; reads each input file as a whole, and then performs the desired substitutions, appending, and replacement of all newlines with a single space each.^[1]

This allows you to make do with a single sed command per input file.

^{[1] Your awk 1 ORS=' ' command actually creates output with a trailing space instead of a newline. By contrast, 's/
/ /g' applied to the whole input file will only place a space between lines, and terminate the overall file with a newline (assuming the input file ended in one).}

How to combine multiple sed and awk commands?

Answers (2)

Related Questions