How best to index for max values in data frame?

Question

Here dataset in use is genotype from the cran package,MASS.

> names(genotype)
[1] "Litter" "Mother" "Wt"

> str(genotype)
'data.frame':   61 obs. of  3 variables:
 $ Litter: Factor w/ 4 levels "A","B","I","J": 1 1 1 1 1 1 1 1 1 1 ...
 $ Mother: Factor w/ 4 levels "A","B","I","J": 1 1 1 1 1 2 2 2 3 3 ...
 $ Wt    : num  61.5 68.2 64 65 59.7 55 42 60.2 52.5 61.8 ...

This was the given question from a tutorial: Exercise 6.7. Find the heaviest rats born to each mother in the genotype() data.

tapply, whence split by factor genotype$Mother gives:

> tapply(genotype$Wt, genotype$Mother, max)
   A    B    I    J 
68.2 69.8 61.8 61.0

Also:

> out <- tapply(genotype$Wt, genotype[,1:2],max)
> out
      Mother
Litter    A    B    I    J
     A 68.2 60.2 61.8 61.0
     B 60.3 64.7 59.0 51.3
     I 68.0 69.8 61.3 54.5
     J 59.0 59.5 61.4 54.0

First tapply gives the heaviest rats from each mother , and second (out) gives a table that allows me identify which type of litter of each mother was heaviest. Is there another way to match which Litter is has the most weight for each mother, for instance if the 2 dim table is real large.

Robert · Accepted Answer

From stats:

aggregate(. ~ Mother, data = genotype, max)

or

aggregate(Wt ~ Mother, data = genotype, max)

How best to index for max values in data frame?

Answers (2)

data

Related Questions