Remove unnecessary blank lines

Question

Let's imagine that the following code example is valid:

begin
   statement1;

   begin


      statement2;

      statement3;
      statement4;

      statement5;

   end;

   statement6;

end;

I would like to remove all unnecessary blank lines in this code example:

begin
   statement1;

   begin
      statement2;

      statement3;
      statement4;

      statement5;
   end;

   statement6;
end;

So basically if a line ends with the keyword begin then all blank lines until the next line that contains a statement should be removed and if a line ends with the keyword end; then all blank lines until the previous line that contains a statement should be removed.

Using Sublime Text I created two regular expressions:

Find: begin( )* and Replace: begin
Find: ( )*([[:space:]])*end; and Replace: end;

My questions:

How can I convert both regular expression so that they can be used with sed (in-place)?
The second regular expression drops all existing blank spaces before the keyword end;. How could this problem be fixed?

Walter A · Accepted Answer

With GNU sed 4.2 you have the option -z:

sed -rz 's/begin
+/begin
/g;s/
+([^
]*end;)/
\1/g' file

Work-around with older sed (when original file is without )

tr '
' '
' < file | sed -r 's/begin
+/begin
/g;s/
+([^
]*end;)/
\1/g' | tr '
' '
'

Remove unnecessary blank lines

Answers (2)

Related Questions