How to sum specific rows in a dataframe using R?

Question

I'm working on a research paper and I've got a dataframe that includes some departments and their budgets over a period of time. Let's take the following dataframe as an example.

df
departments   budget
test1         100
test2         200
test3         300

In my case, "test1" and "test3" are two different names that actually refer to the same department, so I need to sum their budgets.

Here's the result that I expect:

df
departments   budget
test1         400
test2         200

Haritz Laboa · Accepted Answer

There is no need to use IDs. If your goal is to merge every test3 row into test1 and sum the budgets of the combined group, you can use dplyr functions like this:

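library(dplyr)

# Relabel "test3" as "test1" (as.character() guards against factor columns),
# then total the budget within each department
df <- df %>%
  mutate(departments = ifelse(departments == "test3", "test1", as.character(departments))) %>%
  group_by(departments) %>%
  summarise(budget = sum(budget))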

The code above will give you the result you are looking for.
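
If you would rather not depend on dplyr, the same idea can be sketched in base R. This is only an illustration under the assumption that the data frame looks like the example in the question (a character column `departments` and a numeric column `budget`); the name `result` is made up here.

```r
# Example data from the question (assumed structure)
df <- data.frame(
  departments = c("test1", "test2", "test3"),
  budget = c(100, 200, 300),
  stringsAsFactors = FALSE
)

# Treat "test3" as another name for "test1"
df$departments[df$departments == "test3"] <- "test1"

# Sum the budget within each department
result <- aggregate(budget ~ departments, data = df, FUN = sum)
print(result)
```

With the example data this should leave two rows, test1 with a budget of 400 and test2 with 200. If several names need merging, the relabelling line can be repeated for each alias before calling aggregate().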
