Reorganise data in R

Question

I have a table, for example:

House,Name1,Email1@xyz.com
Flat,Name2;Name3,Email2@xyz.com;Email3@xyz.com
Mobile Home,Name4,Email4@xyz.com
Camper-Van,Name5;Name6;Name7;Name8,Email5@xyz.com;Email6@xyz.com;Email7@xyz.com;Email8@xyz.com

and I need:

House,Name1,Email1@xyz.com
Flat,Name2,Email2@xyz.com
Flat,Name3,Email3@xyz.com
Mobile Home,Name4,Email4@xyz.com
Camper-Van,Name5,Email5@xyz.com
Camper-Van,Name6,Email6@xyz.com
Camper-Van,Name7,Email7@xyz.com
Camper-Van,Name8,Email8@xyz.com

The problem is, the number of names and emails for one kind of housing is unknown.

I generated three lists:

Housing:
House
Flat
Campervan

Names:
Name1
Name2
Name3
Name4
Name5
Name6
Name7
Name8

Email:
Email1@xyz.com
Email2@xyz.com
...
Email8@xyz.com

But I am stuck on how to repeat House, Flat and Campervan as many times as there are names or emails (both always exactly the same number) for each category in column 1. This would make all lists match each other in length.

If I were able to do this, I could generate the output I need. Any help is appreciated.

ATTENTION: the names and email addresses are not derived from each other. For example, Name1 might be Hans while his email is Peter@foo.org. By numbering the names and emails I tried to show that they are in matching order and cannot be listed in random order.

rg255 · Accepted Answer

With the data in a data.table (convert with setDT()), you can do this with the data.table tstrsplit() function, melt() and a data.table join:

library(data.table)
# Data for the demo (please provide this yourself in future questions)
dt1 <-
  data.table(type = c("House", "Flat", "Mobile", "Camper-van"),
             name = c("Name1", "Name2;Name3", "Name4", "Name5;Name6;Name7;Name8"),
             mail = c("Email1", "Email2;Email3", "Email4", "Email5;Email6;Email7;Email8"))

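# solution
# Names: split on ";" into one column per entry, melt to long form, drop the
# NA padding, and keep each entry's row index in the melted table (.I) as a key
names_long <- dt1[, c("type" = list(type), tstrsplit(name, ";"))][
  , melt(.SD, id.vars = "type")][!is.na(value), .(.I, type, "name" = value)]

# Emails: same steps, keeping only the key and the value
mails_long <- dt1[, c("type" = list(type), tstrsplit(mail, ";"))][
  , melt(.SD, id.vars = "type")][!is.na(value), .(.I, "mail" = value)]

# Join the two long tables on the shared key, then drop the key
names_long[mails_long, on = "I"][, -c("I")]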
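
The question itself asks how to repeat each housing type as many times as it has names. A minimal base-R sketch of that idea follows; it is an alternative to the join above, not part of the original answer. It assumes the same demo columns (type, name, mail) in a plain data.frame, and the helper names df, names_split, mails_split and long are made up for illustration.

```r
# Same demo data as above, as a plain data.frame (illustrative names)
df <- data.frame(
  type = c("House", "Flat", "Mobile", "Camper-van"),
  name = c("Name1", "Name2;Name3", "Name4", "Name5;Name6;Name7;Name8"),
  mail = c("Email1", "Email2;Email3", "Email4", "Email5;Email6;Email7;Email8"),
  stringsAsFactors = FALSE
)

# Split each cell on ";" into a list with one character vector per row
names_split <- strsplit(df$name, ";", fixed = TRUE)
mails_split <- strsplit(df$mail, ";", fixed = TRUE)

# The question states that names and emails always come in equal numbers;
# stop early if that assumption does not hold
stopifnot(identical(lengths(names_split), lengths(mails_split)))

# Repeat each type once per name, then flatten the split lists in row order
long <- data.frame(
  type = rep(df$type, times = lengths(names_split)),
  name = unlist(names_split),
  mail = unlist(mails_split),
  stringsAsFactors = FALSE
)

print(long)
```

Because rep() and unlist() both walk the rows in their original order, the output keeps each type's entries together and each name stays paired with the email at the same position.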
