Find the number of times each unique value appears across multiple files and the number of files that contain it

Question

I have the 3 data frames below:

Name<-c("jack","jack","bob","david","mary")
n1<-data.frame(Name)

Name<-c("jack","bill","dean","mary","steven")
n2<-data.frame(Name)

Name<-c("fred","alex","mary")
n3<-data.frame(Name)

I would like to create a new data frame with 3 columns: Column 1 holds every unique name present across the 3 source files, Column 2 holds the number of source files in which that name appears, and Column 3 holds the total number of instances of that name across all files.

The result should look like this:

    Name Number_of_files Number_of_instances
1   jack               2                   3
2    bob               1                   1
3  david               1                   1
4   mary               3                   3
5   bill               1                   1
6   dean               1                   1
7 steven               1                   1
8   fred               1                   1
9   alex               1                   1

Is there an automated way to achieve all of this at once?

tmfmnk · Accepted Answer

One dplyr possibility could be:

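library(dplyr)

# Stack the data frames; .id = "ID" records which one each row came from
bind_rows(n1, n2, n3, .id = "ID") %>%
 group_by(Name) %>%
 summarise(Number_of_files = n_distinct(ID),   # distinct source data frames
           Number_of_instances = n())          # total rows for the name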

  Name   Number_of_files Number_of_instances
  <chr>            <int>               <int>
1 alex                 1                   1
2 bill                 1                   1
3 bob                  1                   1
4 david                1                   1
5 dean                 1                   1
6 fred                 1                   1
7 jack                 2                   3
8 mary                 3                   3
9 steven               1                   1
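
If you would rather not depend on dplyr, the same idea can be sketched in base R: stack the data frames, tag each row with its source, and cross-tabulate names against sources. This is only a minimal sketch under the assumption that every data frame has a single `Name` column; the names `files`, `all_names`, `counts` and `result` are illustrative and not part of the answer above.

```r
# Same sample data as in the question
n1 <- data.frame(Name = c("jack", "jack", "bob", "david", "mary"))
n2 <- data.frame(Name = c("jack", "bill", "dean", "mary", "steven"))
n3 <- data.frame(Name = c("fred", "alex", "mary"))

# Keep the data frames in a list so the code does not assume exactly three
files <- list(n1, n2, n3)

# Stack all rows and record which list element each row came from
all_names <- do.call(rbind, files)
all_names$ID <- rep(seq_along(files), times = sapply(files, nrow))

# Cross-tabulate: one row per name, one column per source data frame
counts <- table(all_names$Name, all_names$ID)

result <- data.frame(
  Name = rownames(counts),
  Number_of_files = as.vector(rowSums(counts > 0)),  # sources with at least one hit
  Number_of_instances = as.vector(rowSums(counts))   # total hits across sources
)
result
```

Because the inputs live in a list, adding a fourth data frame only means appending it to `files`. The rows come out sorted alphabetically by name, as in the dplyr output.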
