Replace second space for \n if applies in R

Question

I have a vector of text, let's say:

vector <- c("20 DE NOVIEMBRE",  "CENTRO", "EL ARENAL 4A SECCION",     "IGNACIO ZARAGOZA", "JARDIN BALBUENA", "MOCTEZUMA 2A SECCION",    "MORELOS", "PEON DE LOS BAOS")

I want to substitute the second space, if it exists, with the special character "\n".

I've tried this:

  vector <- gsub(".* .*( ).*", "\\\n", vector)

But it didn't work.

This is the expected result:

c("20 DE\nNOVIEMBRE",  "CENTRO", "EL ARENAL\n4A SECCION",     "IGNACIO ZARAGOZA", "JARDIN BALBUENA", "MOCTEZUMA 2A\nSECCION",    "MORELOS", "PEON DE\nLOS BAOS")

How can I get it?

Tim Biegeleisen · Accepted Answer

One approach, using sub with capture groups:

vector <- sub("^(\\S+) (\\S+) ", "\\1 \\2\n", vector)
vector

[1] "20 DE\nNOVIEMBRE"      "CENTRO"                "EL ARENAL\n4A SECCION"
[4] "IGNACIO ZARAGOZA"      "JARDIN BALBUENA"       "MOCTEZUMA 2A\nSECCION"
[7] "MORELOS"               "PEON DE\nLOS BAOS"

Data:

vector <- c("20 DE NOVIEMBRE",  "CENTRO", "EL ARENAL 4A SECCION",
            "IGNACIO ZARAGOZA", "JARDIN BALBUENA", "MOCTEZUMA 2A SECCION",
            "MORELOS", "PEON DE LOS BAOS")

The regex logic here simply says to capture the first and second words, given by \S+, consuming the first and second space as well. Note that this would only match should the input in fact have a second space. Then, we replace with the same, but substituting a line feed in place of the second space.
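
To make the "only matches if there is a second space" point concrete, here is a minimal sketch that wraps the same sub call in a small function and runs it on strings with zero, one, and several spaces. The name replace_second_space and the short test vector x are hypothetical, chosen only for illustration.

```r
# Hypothetical helper (name is an assumption): wraps the sub() call above.
replace_second_space <- function(x) {
  # ^(\S+) (\S+) then a space: first word, first space, second word, second space.
  # Strings without a second space do not match, so sub() returns them unchanged.
  sub("^(\\S+) (\\S+) ", "\\1 \\2\n", x)
}

# Illustrative inputs with zero, one, and three spaces.
x <- c("CENTRO", "JARDIN BALBUENA", "EL ARENAL 4A SECCION")
result <- replace_second_space(x)

# print() shows the escaped form, i.e. "EL ARENAL\n4A SECCION"
print(result)
```

Passing the result to writeLines or cat shows the actual line break instead of the escaped \n. The pattern assumes words are separated by single spaces and that the string has no leading space; other inputs would need a different pattern.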
