Keep the key that has only one specific value in R

Question

I have the following data frame:

filen <- c('510-1', '510-2', '510-2', '510-2', '510-3', '510-3', '510-4')
disp <- c('g', 'ng', 'ng', 'ng', 'g', 'ng', 'ng')

df <- data.frame(filen, disp)


  filen disp
1 510-1    g
2 510-2   ng
3 510-2   ng
4 510-2   ng
5 510-3    g
6 510-3   ng
7 510-4   ng

I want to isolate the file numbers where ng is the only type of disp associated with that filen, so that I get a dataset like the one below. How do I do this using dplyr?

filen disp
510-2  ng
510-4  ng

akrun · Accepted Answer

We can group by 'filen', keep only the groups in which every 'disp' value is 'ng', and then take the distinct rows:

library(dplyr)
df %>%
   group_by(filen) %>%
   filter( all(disp == 'ng')) %>%
   distinct()
# A tibble: 2 x 2
# Groups:   filen [2]
#  filen disp 
#1 510-2 ng   
#2 510-4 ng
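
To see why the filter keeps only those two groups, it can help to look at the per-group result of `all(disp == 'ng')` on its own. The following is a small illustrative sketch, not part of the original answer; the names `group_check` and `only_ng` are made up here for the example.

```r
library(dplyr)

# Rebuild the data frame from the question
filen <- c('510-1', '510-2', '510-2', '510-2', '510-3', '510-3', '510-4')
disp <- c('g', 'ng', 'ng', 'ng', 'g', 'ng', 'ng')
df <- data.frame(filen, disp)

# One row per 'filen': TRUE only when every 'disp' in that group is 'ng'
group_check <- df %>%
   group_by(filen) %>%
   summarise(only_ng = all(disp == 'ng'))

group_check
# only_ng is FALSE for 510-1 and 510-3, TRUE for 510-2 and 510-4
```

`filter()` evaluates the same condition per group, so a single TRUE or FALSE decides whether all rows of that group are kept or dropped.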

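Or, remove duplicate rows first, then keep the groups that have a single distinct 'disp' value and where that value is 'ng':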

df %>% 
   distinct() %>%
   group_by(filen) %>%
   filter(n_distinct(disp) == 1, disp == 'ng')

Or we can use data.table with the same logic:

library(data.table)
setDT(unique(df))[,  .SD[uniqueN(disp)==1 & disp == "ng"], filen]
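
If neither package is available, the same idea can probably be expressed in base R. This is a sketch under that assumption rather than something from the original answer: it uses `ave()` to compute the per-group `all()` for every row, and the names `only_ng` and `result` are chosen here for illustration.

```r
# Rebuild the data frame from the question
filen <- c('510-1', '510-2', '510-2', '510-2', '510-3', '510-3', '510-4')
disp <- c('g', 'ng', 'ng', 'ng', 'g', 'ng', 'ng')
df <- data.frame(filen, disp)

# For each row, flag whether every 'disp' in its 'filen' group is 'ng';
# ave() applies all() within each group and recycles the result per row
only_ng <- as.logical(ave(df$disp == 'ng', df$filen, FUN = all))

# Keep the flagged rows and drop duplicates
result <- unique(df[only_ng, ])
result
# leaves one row each for 510-2 and 510-4, both with disp 'ng'
```

The row names of `result` carry over from the original data frame; `rownames(result) <- NULL` resets them if that matters.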
