Grouping and Counting instances?

Question

Is it possible to group and count instances of all other columns using R (dplyr)? For example, The following dataframe

Turns to this (note: y is value that is being counted)

EDIT:- explaining the transformation, x is what I'm grouping by, for each number grouped, i want to count how many times 0 and 1 and 2 was mentioned, as in the first row in the transformed dataframe, we counted how many times x = 1 was equal to 0 in the other columns (y), so 0 was in column a one time, column b two times and column c one time

x  y  a  b  c
1  0  1  2  1
1  1  1  0  2
1  2  1  1  0
2  1  1  0  1
2  2  0  1  0

Paul Hiemstra · Accepted Answer

I'd use a combination of gather and spread from the tidyr package, and count from dplyr:

library(dplyr)
library(tidyr)
df = data.frame(x = c(1,1,1,2), a = c(0,1,2,1), b = c(0,0,2,2), c = c(0,1,1,1))
res = df %>% 
    gather(variable, value, -x) %>% 
    count(x, variable, value) %>% 
    spread(variable, n, fill = 0)
# Source: local data frame [5 x 5]
#
#   x value a b c
# 1 1     0 1 2 1
# 2 1     1 1 0 2
# 3 1     2 1 1 0
# 4 2     1 1 0 1
# 5 2     2 0 1 0

Essentially, you first change the format of the dataset to:

head(df %>% 
    gather(variable, value, -x))
#  x variable value
#1 1        a     0
#2 1        a     1
#3 1        a     2
#4 2        a     1
#5 1        b     0
#6 1        b     0

Which allows you to use count to get the information regarding how often certain values occur in columns a to c. After that, you reformat the dataset to your required format using spread.

Grouping and Counting instances?

Answers (2)

Related Questions