R: explode a character-string and get the last element (row-wise)

Question

I have the following data-frame

df <- data.frame(var1 = c("f253.02.ds.a01", "f253.02.ds.a02", "f253.02.ds.x.a01", "f253.02.ds.x.a02", "f253.02.ds.a10", "test"))
df

What's the easiest way to extract the last two digits of the variable var1? (e.g. 1, 2, 10, NA) I was experimenting with separate(), but the number of points in the characters is not always the same. Maybe with regular expressions?

akrun · Accepted Answer

With separate, we can use a regex lookaround

library(dplyr)
library(tidyr)
df %>% 
  separate(var1, into = c('prefix', 'suffix'),
      sep="(?<=[a-z])(?=\d+$)", remove = FALSE, convert = TRUE)

-output

#              var1         prefix suffix
#1   f253.02.ds.a01   f253.02.ds.a      1
#2   f253.02.ds.a02   f253.02.ds.a      2
#3 f253.02.ds.x.a01 f253.02.ds.x.a      1
#4 f253.02.ds.x.a02 f253.02.ds.x.a      2
#5   f253.02.ds.a10   f253.02.ds.a     10
#6             test           test     NA

R: explode a character-string and get the last element (row-wise)

Answers (2)

Related Questions