Get value of column as column name in R

Question

I have a df:

  product   store   store1  review   review1 
  book      A       B                 
  shirt     A       B                 
  pen       A       B       
  cd        A       B        0         2
  dress     A       B        2         1
  magazine  A       B        3         1

I want the values in the store columns to become column names and I want to insert the review values in that column, so the output looks like this:

  product   A       B 
  book      0       2
  shirt     2       1
  pen       3       1

There are two issues with this problem. First of all, the store names will change a lot in the future, so I can't use code like this:

 names(newdf)[names(newdf) == 'store'] <- 'a'

Secondly, I need the values from the review and review1 column to start from the first row in the A and B column, so to say. For example, column a = book = 0, shirt = 2, magazine = 3.

I'm really stuck on this, any help would be much appreciated!

Reproducible code:

df <- data.frame(product = c("book","shirt", "pen", "cd", "dress", "magazine"), store=c("A", "A", "A", "A", "A", "A"),
                 store1=c("B", "B", "B", "B", "B", "B"), review=c("", "", "", 0, 2, 3), review1 =c("", "", "", 2, 1, 1))

Alex Knorre · Accepted Answer

It is difficult to develop a universal method for this because it is unclear whether the number of columns will be the same, or can they be matched one-to-one all the time (say, for every storeN there is reviewerN) etc. Here's the code that does what you wanted, but I am not sure if it fits the purpose. You should have explained your problem more thoroughly.

df <- data.frame(product = c("book","shirt", "pen", "cd", "dress", "magazine"), store=c("A", "A", "A", "A", "A", "A"),
                 store1=c("B", "B", "B", "B", "B", "B"), review=c("", "", "", 0, 2, 3), review1 =c("", "", "", 2, 1, 1))


# Convert factors to character
df <- data.frame(lapply(df, as.character), stringsAsFactors=FALSE)

# Blanks to NAs
df[df==""] <- NA

# Indicate which columns contain values to rename other columns
cols_with_values <- c(2,3)

# save first row in these columns
new_columns_names <- as.character(df[1,cols_with_values])

# Kill them!
df[,cols_with_values] <- NULL

# Rename columns
names(df) <- c(names(df[1]), new_columns_names)

# Show rows without NAs
df[complete.cases(df), ]
   product A B
4       cd 0 2
5    dress 2 1
6 magazine 3 1

As for your second problem, it is a mad task because it renders your data as very, very messy. I mean, how do you know that book = 3, not shirt?

Get value of column as column name in R

Answers (1)

Related Questions