Append Columns to CSV in For Loop

Question

I'm looking for some help with my script. I'm trying to loop over a bunch of CSV files, cut out the 3rd column, and append that to an output file as a new column. Here's what I have so far:

#!/bin/bash

for n in ~/sampledir/*
do
    awk -F "," '{print $3","}' "$n" >> output.csv
done

The output looks like this:

Column3,
3,
33,
333,
3333,
33333,
Column3,
3,
33,
333,
3333,
33333,
Column3,
3,
33,
333,
3333,
33333,
Column3,
3,
33,
333,
3333,
33333,
Column3,
3,
33,
333,
3333,
33333,

What I want is for the new information to be appended to the CSV in columns, so rather than the output above, I want this:

Column3,Column3,Column3,Column3,Column3,Column3,
3,3,3,3,3,3,
33,33,33,33,33,33,
333,333,333,333,333,333,
3333,3333,3333,3333,3333,3333,
33333,33333,33333,33333,33333,33333,

Any guidance would be helpful. Thanks y'all.

Mitchell P · Accepted Answer

If the awk that you're using is gawk, you could have an awk script like this:

BEGIN { 
    FS="," 
    file_num = 0
    max_num_rows = 0
}

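# gawk-only: runs once before each input file is read
BEGINFILE { file_num++ }

{
    # store column 3, keyed by (row number within the file, file index)
    data[FNR SUBSEP file_num] = $3
    if (FNR > max_num_rows) { max_num_rows = FNR }
}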

END {

    for (i = 1; i <= max_num_rows; i++) {
        # pass the data as an argument so a "%" in it is not read as a format
        printf "%s", data[i SUBSEP 1]

        for (j = 2; j <= file_num; j++) {
            printf ",%s", data[i SUBSEP j]
        }
        printf "\n"
    }
}

and then run it like this:

awk -f script.awk ~/sampledir/* > output.csv

The basic idea is to store the data you want in a multidimensional awk array, then loop over the rows and columns and print the data. BEGINFILE is a gawk extension; if you don't have it available, you could add some extra logic to detect the start of each file, such as checking FNR == 1. This should also work if the files have different numbers of rows: shorter files just leave empty fields in the later rows.
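
For an awk without BEGINFILE, here is a minimal sketch of that "extra logic", assuming the start of each file can be detected with FNR == 1. The function name join_third_columns is made up for illustration. One difference from the gawk version: an empty input file has no first record, so it is skipped here instead of producing an empty column.

```shell
#!/bin/sh
# Hypothetical helper (name is illustrative): print the 3rd column of each
# CSV file given as an argument, side by side, using only POSIX awk features.
join_third_columns() {
    awk -F ',' '
        # The first record of each file marks the start of a new output column.
        FNR == 1 { file_num++ }
        {
            # data[a, b] is shorthand for data[a SUBSEP b]
            data[FNR, file_num] = $3
            if (FNR > max_num_rows) max_num_rows = FNR
        }
        END {
            for (i = 1; i <= max_num_rows; i++) {
                line = data[i, 1]
                # Cells missing from shorter files are empty strings.
                for (j = 2; j <= file_num; j++) line = line "," data[i, j]
                print line
            }
        }
    ' "$@"
}
```

It could then be called the same way as the script above:

join_third_columns ~/sampledir/* > output.csv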
