Awk splitting a line by spaces where there are spaces in each field

Question

I've got an R summary table like so:

       employee     salary        startdate
 John Doe  :1   Min.   :21000   Min.   :2007-03-14
 Jolie Hope:1   1st Qu.:22200   1st Qu.:2007-09-18
 Peter Gynn:1   Median :23400   Median :2008-03-25
                Mean   :23733   Mean   :2008-10-02
                3rd Qu.:25100   3rd Qu.:2009-07-13
                Max.   :26800   Max.   :2010-11-01

and I need to produce an output csv file like so:

employee,,salary,,startdate,,
John Doe,1,Min.,21000,Min.,2007-03-14
Jolie Hope,1,1st Qu.,22200,1st Qu.,2007-09-18
Peter Gynn,1,Median,23400,Median,2008-03-25
,,Mean,23733,Mean,2008-10-02
,,3rd Qu.,25100,3rd Qu.,2009-07-13
,,Max.,26800,Max.,2010-11-01

so that in excel it looks something like this:

output in excel

However it doesn't suffice to split the fields by one or more white spaces,

 awk -F "[ ]+" '{ print $3 }'

It works for the header, but not for the remaining lines:

salary
Doe
Hope:1
Gynn:1
:23733
Qu.:25100
:26800

Is this problem solvable using awk (and maybe sed)?

Ed Morton · Accepted Answer

This uses GNU awk for FIELDWIDTHS, etc. and relies on the first line of input after the header always having all fields populated. It includes the positions that are just :s as output fields, I expect you can figure out how to skip those if you do want to use this solution:

$ cat tst.awk
BEGIN { OFS="," }
NR==1 {
    for (i=1;i<=NF;i++) {
        printf "%s%s", $i, (i

Awk splitting a line by spaces where there are spaces in each field

Answers (2)

Related Questions