Split and save in new data.frames

Question

I have a big data.frame (144 columns). I would like to split it in groups of 3 columns each (subfile or sub data.frame), and then save the sub data.frames in separated files. In other words: file1 will contain columns from 1 to 3, file2 will contain columns from 6 to 9 and so on.

Any idea about?

Just an example:

  Hb1  Int1  Value1   Hb2  Int2  Value2         
   A     c     0.3     SW   n     0.34        
   V     sd    0.45    FG   b     0.345    
   N     wer   0.76    GH   m     0.67

So: File "output1" will contain:

  Hb1  Int1   Value1   
   A     c     0.3
   V     sd    0.45    
   N     wer   0.76

File "output2" will contain:

 Hb2    Int2  Value2     
 SW       n    0.34    
 FG       b    0.345    
 GH       m    0.67

and so on.

I tried to add a column to the transposed data.frame containing Index values such that:

Index = rep(1: 48, each = 3)

Then I tried to split the big data.frame according to the Index column but I'm not able to go on.

Jilber Urbina · Accepted Answer

Maybe this is useful for you:

# A simple function (EDIT: FIXED) 
Split_and_save_DF <- function(DF, split){
  # Spliting your data frame by columns to get several data.frames
  DFlist <-lapply(seq(1, ncol(DF), split), function(x, i){x[, i:(i+(split-1))]}, x=DF)
  # Saving each data.frames as .txt file
  invisible(sapply(1:length(DFlist), function(x, i) write.table(x[[i]], file=paste0('DF', i, '.txt')), x=DFlist))
}

Example

DF <- data.frame(matrix(rnorm(144*12, 100, 30), ncol=144))
dim(DF) # a dataframe with 12 rows and 144 cols
Split_and_save_DF(DF=DF, split=3) # will produce 48 DF's

Where DF is the data.frame, and split is the number of columns you want the dataframe to be split by.

It's not a nice answer but it does what you want.

This function will split your DF and will save each new DF in your current working directory with names such as: DF1.txt, DF2.txt, DF3.txt.... so that you can read each file by doing:

read.table("DF1.txt", header=TRUE) # and so on

In order to check the output:

dim(read.table("DF1.txt", header=TRUE)) # checking dims of new DF's
[1] 12  3

Split and save in new data.frames

Answers (2)

Example

Related Questions