Make readxl::read_excel rename only the second duplicate column in R

Question

In readr, the read_csv command handles duplicate column names by renaming the second duplicate and leaves the first unaltered. See the following example, taken from https://github.com/tidyverse/readxl/issues/53.

readr::read_csv("x,x,y
1,2,3
")
#> Warning: Duplicated column names deduplicated: 'x' => 'x_1' [2]
#> # A tibble: 1 × 3
#>       x   x_1     y
#>     
#> 1     1     2     3

How can I get readxl::read_excel handle duplicate columns the same way?

lroha · Accepted Answer

You can use the .name_repair argument and pass make.unique() as a function:

library(readxl)

read_excel(path = "temp.xlsx", .name_repair = ~make.unique(.x, sep = "_"))

# A tibble: 1 x 3
      x   x_1     y
    
1     1     2     3

Make readxl::read_excel rename only the second duplicate column in R

Answers (1)

Related Questions