R how to match and extract character letters of different length in a string

Question

So I have a column of contract names df$name like below

FB210618C00280000
ADM210618C00280000 M210618P00280000

I would like to extract the FB, ADM and M. That is I want to extract characters in the string and they are of different length and stop once the first number occurs, and I don't want to extract the C or P.

The below code will give me the C or P

stri_extract_all_regex(df$name, "[a-z]+")

akrun · Accepted Answer

We can use stri_extract_first from stringi

library(stringi)
stri_extract_first(df$name, regex = "[A-Z]+")
#[1] "FB"  "ADM" "M"

Or we can use base R with sub

sub("\d+.*", "", df$name)
#[1] "FB"  "ADM" "M"

Or use trimws from base R

trimws(df$name, whitespace = "\d+.*")

data

df <- data.frame(name = c("FB210618C00280000", "ADM210618C00280000", 
    "M210618P00280000"))

R how to match and extract character letters of different length in a string

Answers (2)

data

Related Questions