Replace strings with a subset of it

Question

I have a data frame like below:

s1 AA AG AG GG AA
s2 GTTGTT GTTGTT GTTGTT GTTGTT GTTGTT
S3 TT CC TC TT TC
S3 AGTTAGTT AGTTAGTT AGTTAGTT AGTTAGTT AGTTAGTT
S3 GCGCGCGC GCGCGCGC GCGCGCGC GCGCGCGC GCGCGCGC

and I want to find every string in the dataframe which has more than two characters (like GTTGTT) , and divide the string in two parts (all the string are even) (GTT GTT) and then get the first character from each part (GG). so my dataframe will be like this:

s1 AA AG AG GG AA
s2 GG GG GG GG GG
S3 TT CC TC TT TC
S3 AA AA AA AA AA
S3 GG GG GG GG GG

Any suggestions is appreciated. Thank you in advance

Replace strings with a subset of it

Answers (1)

Related Questions