Creating Subset Datasets in R based on distinct field values

Question

I am trying to split a dataset into multiple subsets based on distinct field values. For example:

SaleID Country
Sale1 US
Sale2 Mexico
Sale3 US
Sale4 Canada
Sale5 Canada
Sale6 Mexico
Sale7 Canada

Desired output (3 subsets)

Data 1
SaleID Country
Sale1 US
Sale3 US

Data 2
SaleID Country
Sale2 Mexico
Sale6 Mexico


Data 3
Sale4 Canada
Sale5 Canada
Sale7 Canada

Any ideas? My actual dataset has close to 100 distinct countries. I have explored using the sqldf package in R but no luck yet. Any help would be appreciated

G. Grothendieck · Accepted Answer

It would be best to create a list L of those data sets. Then you can refer to L$US or L[["US"]], say, and refer to all the names using names(L).

L <- split(DF, DF$Country)

It would be possible to copy them to individual data frames in the global environment using the following line but it is not really recommended:

list2env(L, .GlobalEnv)

Creating Subset Datasets in R based on distinct field values

Answers (2)

Related Questions