Joining specific lines in file

Question

I have a text file (snippet below) containing some public-domain corporate earnings report data, formatted as follows:

Current assets:
Cash and cash equivalents
                                  $ 21,514       $ 21,120
Short-term marketable securities
                                    33,769         20,481
Accounts receivable
                                    12,229         16,849
Inventories
                                     2,281          2,349

and what I'm trying to do (with sed) is the following: if the current line starts with a capital letter, and the next line starts with whitespace, copy the last N characters from the next line into the last N columns of the current line, then delete the next line. I'm doing it this way, because there are other lines in the files that begin with whitespace that I want to ignore. The results should look like the following:

Current assets:
Cash and cash equivalents         $ 21,514       $ 21,120
Short-term marketable securities    33,769         20,481
Accounts receivable                 12,229         16,849
Inventories                          2,281          2,349

The closest I've come to getting what I want is:

sed -i -r ':a;N;$!ba;s/[^A-Z]*
([[:space:]])/\1/g' file.txt

and I believe I've got the pattern matching ok, but the subsequent substitution really messes up the alignment of the columns of numbers. When I first started this, this seemed like a simple operation, but hours of searching and experimenting haven't helped. I'm open to any solutions that use something else other than sed, but would prefer to keep it strictly bash. Thank you much!

Joining specific lines in file

Answers (1)

Related Questions