R - Sum numeric values in selected rows and columns based on specific factor values

Question

I have the following data.frame:

Engine      | MPG | Test_Distance
1. V6       | 17  |       751
2. V4       | 22  |       1850
3. V4-Hybrid| 26  |       210
4. V6-Hybrid| 24  |       85
5. Flat4    | 26  |       4560
6. V6-Hybrid| 28  |       124
7. Flat4    | 17  |       3455
8. V4       | 17  |       1642

Where Engine is a Factor vector, and MPG and Test_Distance are both numeric vectors.

Prior to making more complex stat calculations and plots, I want to simplify the data.frame by sorting:

the Engine column by types (creating new values/rows and removing old ones),
the MPG column with an average (mean) per Engine_type,
the Test_Distance column by adding numeric values per type,
add a new row with total averages.

Note: there are many other columns in this data.frame, but I only put three to simplify the approach.

Here's the resulting data.frame I'd like to have:

Engine_Type | MPG_avg | Test_Distance_total
1. Vx       |   18.7  |       4243
2. Vx_Hybrid|   26    |       419
3. Flatx    |   14.4  |       8015
4. TOTALS   |   19.7  |       12677

I tried using the dplyr and plyr packages and following functions: aggregate, rowSums, colSums, data.table. But to no avail. I thought of creating a temp data.frame, then re-integrate the new values in the original data.frame, but I'm hoping there's a quicker way to do it.

Any suggestion?

R - Sum numeric values in selected rows and columns based on specific factor values

Answers (1)

data

Related Questions