R Find the average of all rows and create a new data frame for plotting

Question

I am new to R. I want to sum up all rows and create a new data frame. The new data frame will be used for a line chart. For example, if I have source data like this:

Date       ID  hour0 hour1 hour2 ... hour24
2015-01-01 X1  20     30    40         100
2015-01-01 X1  30     40    50         400
.......................................
2015-12-31 X1  40     50    60         400

I want to find the average of all rows (excluding Date and ID). So, in my example, the result would be a new data frame of (30, 40, 50, ..., 300). Is there a way to do this conversion?

After the conversion, I want to plot the numbers in a line chart, where the x axis can be just 0, 1, 2, 3, 4, 5, etc.

Can I get some help? Thanks!

Gregor Thomas · Accepted Answer

It seems like you want the sum or average of each column, not each row. That is, you want the average of the hour0 column, of the hour1 column, etc.

Here's a good solution:

# special functions for this purpose
colSums(df[, -(1:2)]) # sum all columns except the first two
colMeans(df[, -(1:2)]) # average all columns except the first two

# general purpose, works with any function
sapply(df[, -(1:2)], sum) # sum all columns except the first two
sapply(df[, -(1:2)], mean) # average all columns except the first two
sapply(df[, -(1:2)], sd) # standard deviation of all columns except the first two
             # because colSds() isn't a built-in function like colMeans or colSums

To plot any of these, assign the result (give it a name, say, my_sum <- ...), and then you can do plot(my_sum, type = "l") to generate a simple line plot.
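
As a rough end-to-end sketch of the steps above: the sample values below are made up to mirror the question's table (only three hour columns are shown), and the names plot_df, hour and mean are illustrative assumptions, not anything from the original data.

```r
# Hypothetical sample data shaped like the table in the question
df <- data.frame(
  Date  = as.Date(c("2015-01-01", "2015-01-01", "2015-12-31")),
  ID    = c("X1", "X1", "X1"),
  hour0 = c(20, 30, 40),
  hour1 = c(30, 40, 50),
  hour2 = c(40, 50, 60)
)

# Average each hour column, skipping the Date and ID columns
hour_means <- colMeans(df[, -(1:2)])

# Collect the averages in a new data frame; the x axis is just 0, 1, 2, ...
plot_df <- data.frame(
  hour = seq_along(hour_means) - 1,
  mean = unname(hour_means)
)

# Simple base-R line chart of the per-hour averages
plot(plot_df$hour, plot_df$mean, type = "l", xlab = "Hour", ylab = "Average")
```

If the real data contains missing values, colMeans(df[, -(1:2)], na.rm = TRUE) would skip them; whether that is appropriate depends on the data.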
