writing combinations in R

Question

I have a data frame (df) like this:

name  col1   col2
pippo A;B;C  E;F;G;
pluto G;H    X;Y;Z;E;O;D

For each name, I'd like to write out every possible combination of one element of col1 with one element of col2, returned as a data frame. For example:

name     col1   col2
pippo      A       E
pippo      A       F
pippo      A       G
pippo      B       E
... and so on.

Given that the elements can be any letter of the alphabet and that the number of elements in col1 and col2 can vary (from 1 to 10), is this possible in R?

akrun · Accepted Answer

We can use crossing() from tidyr after splitting each column on ";".

library(dplyr)
library(tidyr)
library(purrr)

# Split both columns on ";", cross the pieces row by row, then unnest
df %>%
  transmute(name,
            new = map2(strsplit(col1, ";"), strsplit(col2, ";"),
                       ~ crossing(col1 = .x, col2 = .y))) %>%
  unnest(c(new))

output

# A tibble: 21 x 3
#   name  col1  col2 
#   <chr> <chr> <chr>
# 1 pippo A     E    
# 2 pippo A     F    
# 3 pippo A     G    
# 4 pippo B     E    
# 5 pippo B     F    
# 6 pippo B     G    
# 7 pippo C     E    
# 8 pippo C     F    
# 9 pippo C     G    
#10 pluto G     D    
# … with 11 more rows
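
Row 10 pairs G with D rather than X because crossing() de-duplicates and sorts its inputs before combining them. A minimal sketch of that behaviour on the pluto row alone (the vectors are typed in by hand here, and `pairs` is just an illustrative name):

```r
library(tidyr)

# Pieces of the "pluto" row, in the order they appear in the data
left  <- c("G", "H")
right <- c("X", "Y", "Z", "E", "O", "D")

# crossing() sorts and de-duplicates each input, then builds every pair
pairs <- crossing(col1 = left, col2 = right)
pairs
```

If the original order of the elements matters, tidyr's expand_grid() keeps the inputs as given instead of sorting them.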

data

df <- structure(list(name = c("pippo", "pluto"),
                     col1 = c("A;B;C", "G;H"),
                     col2 = c("E;F;G;", "X;Y;Z;E;O;D")),
                class = "data.frame", row.names = c(NA, -2L))
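
If installing the tidyverse is not an option, roughly the same result can be had in base R. This is only a sketch under the same assumptions as above (character columns, ";" as the separator); the names `parts1`, `parts2`, `blocks` and `result` are made up for illustration. Unlike crossing(), expand.grid() neither sorts nor de-duplicates, so the rows keep the order of the original strings.

```r
df <- data.frame(
  name = c("pippo", "pluto"),
  col1 = c("A;B;C", "G;H"),
  col2 = c("E;F;G;", "X;Y;Z;E;O;D"),
  stringsAsFactors = FALSE
)

# Split each cell on ";"; strsplit() drops the empty piece after a trailing ";"
parts1 <- strsplit(df$col1, ";", fixed = TRUE)
parts2 <- strsplit(df$col2, ";", fixed = TRUE)

# Build one block of combinations per input row.
# expand.grid() varies its first argument fastest, so col2 goes first
# to get A-E, A-F, A-G, B-E, ... as in the question.
blocks <- Map(function(nm, x, y) {
  grid <- expand.grid(col2 = y, col1 = x, stringsAsFactors = FALSE)
  data.frame(name = nm, col1 = grid$col1, col2 = grid$col2,
             stringsAsFactors = FALSE)
}, df$name, parts1, parts2)

# Stack the blocks and reset the row names
result <- do.call(rbind, blocks)
rownames(result) <- NULL
result
```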
