Fill zeros for missing values in R

Question

I am trying to deal with this problem. I have a df with a date column and I want to count the occurences per hour. Here is what I've done:

x <- df %>% 
  mutate(hora = hour(date)) %>% 
  select(hora) %>% 
  count(hora)

that gives as a result:

> x
# A tibble: 19 x 2
    hora     n
    
 1     0     1
 2     1     1
 3     3     1
 4     8     4
 5     9     7
 6    10    10
 7    11    14
 8    12    10
 9    13     8
10    14     4
11    15     5
12    16    12
13    17     4
14    18    12
15    19     9
16    20     5
17    21     2
18    22     4
19    23     4

As you can see, there are hours that don't show up that would have n=0, like 2 or 4:7. What I want is it to add the hours that are not in x with n=0 so the table is complete.

The expected output should be something like this:

I tried creating a table with hours 0:23 and all n=0 and trying to sum the two tables but obviously that didn't work. I also tried x$hour <- 0:23, thinking that the missing values would be added, but it didn't work as well.

Ronak Shah · Accepted Answer

You could convert hora to factor and use .drop = FALSE in count

library(dplyr)
library(lubridate)

df %>%  
   mutate(hora = factor(hour(date), levels = 0:23)) %>%  
   count(hora, .drop = FALSE)

Another option is to use complete :

df %>%  
  mutate(hora = hour(date)) %>%  
  count(hora) %>%
  tidyr::complete(hora = 0:23, fill = list(n = 0))

Fill zeros for missing values in R

Answers (2)

Related Questions