Match the columns of a table from another table to get the desired column in the main table in R

Question

I have this data.table.

library(data.table)
class<- c("a","c","v","f","r","b","t","o");
value<-c(0.76,  0.91,   1.94,   0.37,   1.35,   0.75,   1.95,   1.69);
vehicle<-c("we",    "df",   "rt",   "yh",   "uj",   "er",   "ed","we")
carbon<-c(0.984,    0.27,   0.419,  0.469,  0.132,  0.865,  0.562,  0.133)
cap<-c(3,   2,  1,  6,  "y",    "t",    4,  6)
up<-c(4,    2,  3,  "d",    "t",    "y",    "u",    "i")
down<-c("t",    "e",    "r",    3,  4,  5,  2,  1)
amt<-c( 34, 23, 12, 67, 87, 43, 23, 12)
df<-data.table(class,value,vehicle,carbon,cap,up,down,amt)

and this another mapping table

up<-c("d","i",4)
vehicle<-c("yh",    "we",   "we")
exercise<-c("ty",45,    "k")
map<-data.table(cbind(vehicle,up,exercise))

i need the column exercise in the table df

I am currently using this code, which produces the desired results. and I am happy with it.

df[,names(map)[length(names(map))]:= 
                map$exercise[match(do.call(paste0,df[, which(names(df) %in% names(map)[1:(ncol(map)-1)]),with = FALSE]),
                                                   do.call(paste0,map[,1:(ncol(map)-1)]))] ]

So basically what this code does is.

identify the columns from mapping table in the main table.
concatenate those columns.
do a match of these concatenated columns with the concatenated columns of mapping table.
index the desired column from mapping table and fix it to the main table.

So the desired result is

> df$exercise
[1] "k"  NA   NA   "ty" NA   NA   NA   "45"

But sometimes columns of the mapping table order is changed.

for e.g. changed mapping table is Notice that now the order is up and then vehicle. and in this case the above code will not produce the desired result, infact it would be all NA.

up<-c("d","i",4)
vehicle<-c("yh",    "we",   "we")
exercise<-c("ty",45,    "k")
map<-as.data.frame(cbind(up,vehicle,exercise))
setDT(map)

So my code only works if the order of the columns in mapping table is same as in the main table. If my code can be changed to perform the same results but considering the order of the columns. ideally would want this as dynamic as possible.

mapping table can have as many columns as in the main table and an additional column which needs to be inserted in the main table.

Please comment if you need any further clarification. I would appreciate if my given code can be edited and provided. any other code is also welcome. I prefer data.table package use.

Mohit · Accepted Answer

The code below works perfectly and should always be preferred.

setcolorder(map[df, on = .NATURAL], union(names(df), names(map)))[]

The other code in the answers to this question doesn't considers any irregularities in the mapping.

Thank you Merijn-van-tilborg for your valuable contribution.

Match the columns of a table from another table to get the desired column in the main table in R

Answers (2)

Related Questions