What linux commands can I use to sort columns in a tab-separated text file?

Question

I need to compare two versions of the same file. Both are tab-separated and have this form:

...
...

So each row has a different number of markers (the number varies between 1 and 10) and they all come from a small set of possible markers. So a file looks like this:

fileXZMA
fileBY
fileMMCBY

What I need is:

Sort the file by rows
Sort the markers in each row so that they are in alphabetical order

So for the example above, the result would be

fileBY
fileMBCMY
fileXAMZ

It's easy to do #1 using sort but how do I do #2?

UPDATE: It's not a duplicate of this post since my rows are of different length and I need each rows (the entries after the filename) sorted individually, i.e. the only column that gets preserved is the first one.

RomanPerekhrest · Accepted Answer

awk solution:

awk 'BEGIN{ FS=OFS="	"; PROCINFO["sorted_in"]="@ind_str_asc" }
     { split($0,b,FS); delete b[1]; asort(b); r=""; 
         for(i in b) r=(r!="")? r OFS b[i] : b[i]; a[$1] = r 
     }
     END{ for(i in a) print i,a[i] }' file

The output:

fileB   Y
fileM   B   C   M   Y
fileX   A   M   Z

PROCINFO["sorted_in"]="@ind_str_asc" - sort mode
split($0,b,FS); - split the line into array b by FS (field separator)
asort(b) - sort marker values

What linux commands can I use to sort columns in a tab-separated text file?

Answers (2)

Related Questions