Convert from wide to long format by matching column suffixes in dplyr

Question

Let's say I have the following data:

dat <- read.table(text="pairing feelings_pre feelings_post ingroup_pre ingroup_post
0 22.0 22.6 66.3 67.5
1 22.0 28.5 63.2 64.6", header=T)

I am trying to transform this data from wide format to long so that I can plot pre and post scores as a line chart in ggplot. So I need a column that is "pre", that is set to 1 if the column of interest has the "_pre" suffix, and set to 0 if the column has a "_post" suffix.

The partial example of the resulting dataframe would look like:

dat <- read.table(text="pairing variable value pre
0 feelings_pre 22.0 1
0 feelings_post 22.6 0
0 ingroup_pre 66.3 1
0 ingrop_post 67.5 0", header=T)

I have been trying to use spread and separate with a regex matcher, but have not been able to get it to work. Any ideas?

arg0naut91 · Accepted Answer

Try:

library(dplyr)

dat %>% filter(pairing == 0) %>%
  gather(variable, value, -pairing) %>%
  mutate(pre = +(grepl("_pre", variable)))

Output:

  pairing      variable value pre
1       0  feelings_pre  22.0   1
2       0 feelings_post  22.6   0
3       0   ingroup_pre  66.3   1
4       0  ingroup_post  67.5   0

Note that this is if you'd like to filter out 0 pairing (as you don't have it in your example).

However, since you said this is partial, you'd just leave the filter part out and get also the results for pairing where it is equal to 1.

Convert from wide to long format by matching column suffixes in dplyr

Answers (2)

Related Questions