Calculating R squared from multiple columns

Question

I'm very new to R and have been trying to figure out how to calculate R^2 from a few columns within a large data set of approx 300+ columns.

Example:

rcalc <- data.frame('x1' = c(694, 702, 701), 'x2'=c(652, 659, 655),
 'x3'=c(614, 612, 613), 'y1'= c(17.97, 17.95, 17.96), 'y2' = c(12.03, 12.0,
 12.1), 'y3' = c(0.09, 0.1, 0.1))

From here I am stuck.

The formula in excel I can do, and looks like this:

RSQ(X1:X3, Y1:Y3) or RSQ(694:652:614, 17.97:12.03:0.09)

So, each row needs to be calculated for R^2. I was able to use the 'lm' command but was only able to do this for 1 row: I had to take the value from each column of x (x1:x3) and stack them into 1 column, then each value from each column y (y1:y3) and stack into 1 column. Then performed the following:

rsqrd = lm(x~y, data=rcalc)
summary(rsqrd)$r.squared

This worked but again, only for 1 row. I'm not sure how to do this for thousands of rows. I hope this wasn't too confusing. Any help is greatly appreciated.

Troubleshooting:

with pivot_longer:

row col obs value
1   c   300_0 DUT Ip2_comp  784.9775
1   c   300_12 DUT Ip2_comp 864.4234
1   c   300_18 DUT Ip2_comp 919.3384
1   c   300_0 REF O2    0.09
1   c   300_12 REF O2   11.95
1   c   300_18 REF O2   17.98
2   c   300_0 DUT Ip2_comp  781.5785
2   c   300_12 DUT Ip2_comp 865.5541
2   c   300_18 DUT Ip2_comp 921.0646
2   c   300_0 REF O2    0.09

With Pivot_wider:

row obs c
1   300_0 DUT Ip2_comp  784.9775
1   300_12 DUT Ip2_comp 864.4234
1   300_18 DUT Ip2_comp 919.3384
1   300_0 REF O2    0.09
1   300_12 REF O2   11.95
1   300_18 REF O2   17.98
2   300_0 DUT Ip2_comp  781.5785
2   300_12 DUT Ip2_comp 865.5541
2   300_18 DUT Ip2_comp 921.0646

Calculating R squared from multiple columns

Answers (1)

Related Questions