How can I subtract rows with matching value in one column in R?

Question

I have a dataframe that looks like this (truncated from real data):

   host month    score        se
1   V43     0 8.000000 0.4472136
2   V43     1 6.000000 0.0000000
3   V43     3 6.000000 0.0000000
4   V51     0 6.000000 0.0000000
5   V51     1 7.333333 0.4216370
6   V51     3 7.333333 0.2108185
7   V51     6 6.000000 0.0000000

I want to subtract the month 0 score for each host from score for each month for that host. Each host's month 0 score needs to be applied separately, so that it'd look like this:

   host month     score         se
1   V43     0  0.000000 0.4472136
2   V43     1 -2.000000 0.0000000
3   V43     3 -2.000000 0.0000000
4   V51     0  0.000000 0.0000000
5   V51     1  1.333333 0.4216370
6   V51     3  1.333333 0.2108185
7   V51     6  0.000000 0.0000000

In other words, I want to have each month show the difference from the starting point rather than absolute value.

Finding the month 0 rows is easy enough but I can't figure out how I can then match each row with the right host and do the subtraction. Is there a way to do this without using a for loop?

Hong Ooi · Accepted Answer

Use plyr, and order your data frame by host and month first.

ddply(df, .(host), transform, score=score-score[1])

How can I subtract rows with matching value in one column in R?

Answers (2)

Related Questions