using sed for extracting multiple matches

Question

I have the following line:

echo AS:i:0  UQ:i:0  ZZ:Z:mus.sup  NM:i:0  MD:Z:50  ZZ:Z:cas.sup  CO:Z:endOfLine|sed 's/.*$ZZ:Z:.*[ ]$.*/\1/g'

which outputs:

ZZ:Z:cas.sup

I'd like to use sed for extracting both ZZ:Z entries from the given line, such as (please avoid awk since the position of ZZ:Z entries may differ per each line in my file):

preferable output:

ZZ:Z:mus.sup  ZZ:Z:cas.sup

Or possibly:

ZZ:Z:mus.sup  
ZZ:Z:cas.sup

Thanks.

SLePort · Accepted Answer

Try grep with the -o (or --only-matching) flag:

$ grep -o 'ZZ:Z:[^ ]* ' <<< "AS:i:0  UQ:i:0  ZZ:Z:mus.sup  NM:i:0  MD:Z:50  ZZ:Z:cas.sup  CO:Z:endOfLine"
ZZ:Z:mus.sup 
ZZ:Z:cas.sup

Or with sed, based on this @potong answer:

sed 's/ZZ:Z:/\n&/g;s/[^\n]*\n$ZZ:Z:[^ ]* $[^\n]*/\1 /g;s/.$//'

If you have only two occurrences of the pattern per line:

sed -n 's/.*$ZZ:Z[^ ]*$.*$ZZ:Z[^ ]*$.*/\1 \2/p' <<< "AS:i:0  UQ:i:0  ZZ:Z:mus.sup  NM:i:0  MD:Z:50  ZZ:Z:cas.sup  CO:Z:endOfLine"

using sed for extracting multiple matches

Answers (2)

Related Questions