How to remove dupes within lines of delimited text

Question

What's a smart and easy way to remove dupes (not necessarily consecutive) within delimited items on a line.

BEFORE:

apple,banana,apple,cherry,cherry
delta,epsilon,delta,epsilon
apple pie,delta,delta

AFTER:

apple,banana,cherry
delta,epsilon
apple pie,delta

Should work on a Mac. Allow unicode. Any shell method/language/command. Dupes are not necessarily consecutive.

Note: this question is a variation of How to remove dupes from blocks of text -- which is for blocks of text separated with blank lines.

bian · Accepted Answer

awk -F, '{ for(i=1;i<=NF;i++) if( split($0,t,$i)>2 ) sub($i",","") }1' file             
banana,apple,cherry
delta,epsilon
apple pie,delta

sed version:

sed -r 's/(.+)(.*),\1/\1\2,/g;s/,$//' file
apple,banana,cherry
delta,epsilon
apple pie,delta

Just Code.

Answers (2)