user3538508
user3538508

Reputation: 21

how to remove special characters and number patterns from a string in R

I have a string "<U+7F85><U+934F><U+6DC7> <U+2730> Sascha Banks"

I want to exclude everything except the name "Sacha Banks".

I perform:

name1<-c("<U+7F85><U+934F><U+6DC7> <U+2730> Sascha Banks ")
name2<-str_replace_all(name1, "[^[:alnum:]]", " ")

Actual Output: " U 7F85 U 934F U 6DC7 U 2730 Sascha Banks "

Expected Output: " Sascha Banks "

Please correct me.

Upvotes: 0

Views: 353

Answers (3)

Gary Weissman
Gary Weissman

Reputation: 3627

Try

gsub("<[^>]*>", "", name1)
## [1] "  Sascha Banks "

Upvotes: 1

Tyler Rinker
Tyler Rinker

Reputation: 109844

If you don't care to learn the regex this is a pretty straight forward approach that removes all angle brackets:

library(qdap)
bracketX("<U+7F85><U+934F><U+6DC7> <U+2730> Sascha Banks", "angle")

## [1] "Sascha Banks"

Upvotes: 0

CHP
CHP

Reputation: 17189

Try

x <- "<U+7F85><U+934F><U+6DC7> <U+2730> Sascha Banks"
gsub("(<.*>)", "", x)
## [1] " Sascha Banks"

Upvotes: 3

Related Questions