Data subsetting in R

Question

I have a data frame with thousands of rows and 3 columns: value, experiment and ratio. Value contains values (both positive and negative); experiment the experiment number (either E1, E2 or E3), and ratio contains one of three terms (X.Y, Y.Z or Z.X).

I need for each of the three ratios, extract all columns for the 50 values closest to 0, bearing in mind that this is very likely to be a mixture of positive and negative values.

The only (naive) way I can think of is to subset/extract the data for each ratio, then sort (order) it based on value, and subset again to get the 25 negative values closest to 0 and 25 positive values closest to 0.

Any better way?

Neal Fultz · Accepted Answer

My solution uses by to order and :

by(df, df$RATIO, function(x) x[ order(abs(x$VALUE))[1:50] , ] )

This will return a list, each element containing one subset.

Data subsetting in R

Answers (2)

Related Questions