How does R's group_by exactly interact with other dplyr verbs?

Question

I'm coming from SQL and struggling to understand how R's group_by works. Reading the documentation that it simply "changes how it acts with the other dplyr verbs" does not explain anything. I'm specifically confused on how it interacts with the aggregate function max in the following snippet:

df <- db %>%
  tbl("data_table") %>% 
  group_by(site_id) %>%
  # get most recent started
  filter(start_date == max(start_date, na.rm=T), 
         end_date == max(end_date, na.rm=T)) %>%
  rename(field_name = name) %>%
  collect()

I'm translating this into SQL, so I'm having to do a sub-query to get the max start/end dates with the group by and then join that to a general query on data_table to get the field_name.

How exactly does group_by interact with other dplyr verbs, or max in this instance?

How does R's group_by exactly interact with other dplyr verbs?

Answers (1)

Related Questions

How does R&#39;s group_by exactly interact with other dplyr verbs?

Answers (1)

Related Questions

How does R's group_by exactly interact with other dplyr verbs?