Meaning of following sed lines used in bash script

Question

I recently came across the following line in a bash script:

sed -e :a -e '/^\n*$/{$d;N;ba' -e '}' | sed -e '$s/,$/\n/'

The input to the first part of the pipeline comes from another pipe and has the form `1,2.3,2.453,23.5345,`. What do these sed commands do?

Floris · Accepted Answer

Quite the expression. Let's try to pick it apart. The first few commands are

sed -e     invokes `sed` with the `-e` flag: "expression follows"
:a         a label - can be used with a branch statement (think "goto")
'/^\n*$/   address: the pattern space contains nothing but newlines (zero or more)
{$d;N;ba'  if this is the last input line, delete the pattern space; otherwise append the next line (N) and branch to label a
-e '}'     close the bracket

This can really be thought of as the one-line equivalent of a sed script file:

:a         # label a
/^\n*$/ {  # if the pattern space holds only newlines (only empty lines so far), run this group
$d         # on the last input line ($), delete the pattern space (d) and stop
N          # otherwise append the next input line to the pattern space
ba         # branch to a
}          # end of group of commands

At the end of this we have no empty lines left at the end of the input. You can test this with a file that has empty lines at the end: when you run it through this first part of the script, the trailing empty lines are gone, while empty lines in the middle of the file are kept.
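One way to try this out is the small sketch below. It wraps the first stage in a shell function (the name `strip_trailing_blank_lines` is made up for illustration, it is not part of the original script) and feeds it input with one empty line in the middle and two at the end.

```shell
# Hypothetical wrapper around the first stage of the pipeline.
strip_trailing_blank_lines() {
    sed -e :a -e '/^\n*$/{$d;N;ba' -e '}'
}

# Input: "a", an empty line, "b", then two empty lines at the end.
printf 'a\n\nb\n\n\n' | strip_trailing_blank_lines
# Output is three lines: "a", an empty line, "b".
# The interior empty line survives; the two trailing ones are deleted.
```

The interior empty line is kept because, after `N` pulls in `b`, the pattern space is no longer made up of newlines only, so the group is skipped and the accumulated lines are printed.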

Now let's look at the second (easier) bit:

sed -e     invoke sed on the output of the previous command
'$s        substitute in the last line
/,$/\n/    replace a comma at the end of the line with a newline
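The effect of the `$` address can be seen with a two-line input, as in the sketch below. The function name `replace_final_comma` is only illustrative, and the example assumes GNU sed, where `\n` in the replacement text produces a newline; other sed implementations may treat it differently.

```shell
# Hypothetical wrapper around the second stage; assumes GNU sed,
# where \n in the replacement text produces a newline.
replace_final_comma() {
    sed -e '$s/,$/\n/'
}

printf '1,2,\n3,4,\n' | replace_final_comma
# Output: "1,2," (comma kept, because this is not the last line),
# then "3,4", then an empty line created by the substituted newline.
```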

In other words, the whole script seems to do:

Remove all empty lines at the end of the input, then replace the comma at the end of the last remaining line with a newline.
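To see both stages together, here is a minimal sketch that runs the whole pipeline on the sample input from the question, followed by two empty lines. The function name `clean_tail` is invented for this example, and GNU sed is assumed for the `\n` replacement.

```shell
# Hypothetical function combining both stages of the pipeline;
# assumes GNU sed for the \n in the replacement text.
clean_tail() {
    sed -e :a -e '/^\n*$/{$d;N;ba' -e '}' | sed -e '$s/,$/\n/'
}

# Sample input from the question, followed by two empty lines.
printf '1,2.3,2.453,23.5345,\n\n\n' | clean_tail
# Output: "1,2.3,2.453,23.5345" followed by one empty line.
```

Note that because the comma is replaced by a newline rather than simply removed, the output still ends with one empty line.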
