Reputation: 49
I wanted to create dummy variables for every unique values in a column in R.
My data:
Desired o/p:
Any help would be highly appreciated.
Thanks in advance.
Upvotes: 1
Views: 1670
Reputation: 76565
Base R solution:
xtabs( ~ id + sku_name, df1)
# sku_name
#id AB AMART AMZ CLIQ FK GOOGLE JIOMART
# 1 1 0 0 0 1 1 0
# 2 0 0 1 0 0 0 1
# 3 0 0 0 1 0 0 0
# 4 0 1 0 0 0 0 0
Data.
df1 <- data.frame(id = c(1,2,1,1,2,3,4),
sku_id = c(234, 345, 213, 233, 456, 678, 657),
sku_name = c("GOOGLE", "AMZ", "FK", "AB", "JIOMART", "CLIQ", "AMART"))
Upvotes: 3
Reputation: 389135
You can create a dummy column and use pivot_wider
from tidyr
:
library(dplyr)
df %>%
mutate(n = 1) %>%
select(-sku_id) %>%
tidyr::pivot_wider(names_from = sku_name, values_from = n,
names_prefix = 'sku_', values_fill = list(n = 0))
# id sku_Google sku_AMZ sku_FK sku_AB sku_JIOMART sku_CLIQ sku_AMART
# <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
#1 1 1 0 1 1 0 0 0
#2 2 0 1 0 0 1 0 0
#3 3 0 0 0 0 0 1 0
#4 4 0 0 0 0 0 0 1
Data
df <- data.frame(id = c(1, 2, 1, 1:4), sku_id = c(234,345,213,233, 456, 678,657),
sku_name = c('Google', 'AMZ', 'FK', 'AB', 'JIOMART', 'CLIQ', 'AMART'))
Upvotes: 4
Reputation: 138
dcast
from package reshape2
.
df <- dcast(id ~sku_name,fun.aggregate="length")
Upvotes: 1