Use REGEX in R to extract specific string in value as a new column?

Question

I have a column that contains strings of characters/values that look like this:

Current

111111~24-JUL-17 10:43:36~6.14

Desired Output

24-JUL-17 10:43:36

I'm hoping to take everything between the two '~' characters (the date/time) and disregard everything else.

I have this code right now, but it only seems to take part of it:

df$Last <- gsub(".+\\s(.+)$", "\\1", df$col1)

akrun · Accepted Answer

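We can use sub in base R, capturing the run of non-'~' characters that sits between the two '~' and replacing the whole string with that capture group: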

df$c1 <- sub(".*~([^~]+)~.*", "\\1", df$c1)
df$c1
#[1] "24-JUL-17 10:43:36" "24-JUL-21 10:34:36"

Data

df <- data.frame(c1 = c('111111~24-JUL-17 10:43:36~6.14',
                        '111111~24-JUL-21 10:34:36~6.14'))
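For anyone who wants to run this end to end, here is a minimal sketch that writes the result to a new column instead of overwriting c1, which is closer to what the question asks for. The column name Last comes from the question; Last2 and the strsplit alternative are my own additions for illustration, not part of the answer above, and both assume every value contains exactly two '~' separators.

```r
# Sample data copied from the answer's example
df <- data.frame(c1 = c("111111~24-JUL-17 10:43:36~6.14",
                        "111111~24-JUL-21 10:34:36~6.14"),
                 stringsAsFactors = FALSE)

# Option 1: the answer's regex, stored in a new column.
# Note the doubled backslash: "\\1" in R source is the regex backreference \1.
df$Last <- sub(".*~([^~]+)~.*", "\\1", df$c1)

# Option 2 (assumed alternative): split on the literal "~" and keep
# the second piece. fixed = TRUE treats "~" as plain text, not a regex.
df$Last2 <- vapply(strsplit(df$c1, "~", fixed = TRUE),
                   function(parts) parts[2],
                   character(1))

print(df[, c("Last", "Last2")])
```

Both columns should hold the same date/time strings for this data. The split version may be easier to read when the field position is fixed, while the regex version keeps everything in a single sub call.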
