Using a column index and loop to transform dataframe

Question

I am trying to write a function that uses indices that takes key-value pairs and stacks them.

Here is my data:

mydata<-structure(list(groupA = c("Rugby for Chipmunks", "Rugby for Chipmunks", "Rugby for Chipmunks", "Chafing Explained"), First = c(5, 3.57142857142857, 5, 4.5), groupB = c("Pylons for Priests", "Eating Creosote", "Eating Creosote", "Eating Creosote"), Second = c(4, 4, 3.16666666666667, 2.1666667), groupC = c("Wow for YOU!", "Advanced Cats for Bears", "Blue Paint Only", "Mockingbirds"), Third = c(5, 3, NaN, 4), groupD = c("How to Sell Pigeons", "How to Sell Pigeons", "How to Sell Pigoens", "Larger Boulders"), Fourth = c(4.3, 3, 4.1, 3.4), groupE = c("Making Money with Pears", "Making Money with Pears", "Why Walnuts?", "Responding to Idiots Part II"), Fifth = c(5, 3, 5, 4.16666666666667)), row.names = c(NA, -4L), class = c("tbl_df", "tbl", "data.frame"))

I want to use indices because my future tasks will have dataframes with different column names and widths. My approach uses a function to determine if a column is odd/even, and then extract pairs of columns until I reach the last odd numbered column. Note that the function must account for required order of odd-even indices for each extraction (group name and corresponding score):

odd <- function(x) x%%2 != 0
outfile<-list()

moveit<-function(df){
  for (i in 1:dim(df)[2])    # define number of loops
    if (  i==dim(df)[2]-1  )  {break} # stop at least odd-numbered column
    if ( odd(i)==FALSE) {next}  # skip when i is not an odd numbered index 
  print(i)
  outfile[[i+1]]<-df[ ,c(i,i+1)]
}

result<-moveit(mydata)
str(result)

You can see the result is only the last key-value pair. Why? How can I adjust the function to extract all key-value pairs into one dataframe?

akrun · Accepted Answer

We can create a numeric index with gl and split the dataset into list of data.frame, rename the list elements with map and join it rowwise

library(dplyr)
library(purrr)
split.default(mydata, as.integer(gl(ncol(mydata), 2, ncol(mydata)))) %>% 
      map_dfr(~ .x %>% 
                  rename_all(~ c('group', 'value')))

The above can also be made into a No package zone

lst1 <-  split.default(mydata, as.integer(gl(ncol(mydata), 2, ncol(mydata)))) 
do.call(rbind, lapply(lst1, setNames, c("group", "value")))

In the OP's code, the 'outfile' list is initialized with length 0. Instead it can be

odd <- function(x) x%%2 != 0
outfile <- vector('list', ncol(mydata))

moveit<-function(df){
  for (i in seq_along(df)) {   
    if(odd(i)){  
      outfile[[i]]<-df[ ,c(i,i+1)]
    }
 }
 Filter(Negate(is.null), outfile)
}

result <- moveit(mydata)

Also, the main issue is that the 'outfile' is not returned at the end

odd <- function(x) x%%2 != 0
outfile<-list()
moveit<-function(df){
  for (i in 1:dim(df)[2]) {   # define number of loops
    if (  i==dim(df)[2]-1  )  {break} # stop at least odd-numbered column
    if ( odd(i)==FALSE) {next}  # skip when i is not an odd numbered index 
  print(i)
  outfile[[i+1]]<-df[ ,c(i,i+1)]
 }
 outfile
}

result<-moveit(mydata)

NOTE: No packages are used here as well

Using a column index and loop to transform dataframe

Answers (2)

Revised movit

Related Questions