Medical physicist
Medical physicist

Reputation: 2594

Showing fitted values with R and dplyr

I have the data frame DF. I am using R and dplyr to analise it.

DF contains:

>   glimpse(DF)
Observations: 1244160
Variables:
$ Channel  (int) 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0...
$ Row      (int) 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17,...
$ Col      (int) 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1...
$ mean     (dbl) 776.0667, 786.6000, 833.4667, 752.3333, 831.6667, 772.9333...

I fit it with:

Fit <-  DF %>%
    group_by(Channel) %>% 
    do(fit = lm(mean ~ Col + poly(Row, 2), data = .))

How can I get another column in DF with the data (given Channel, Row and Col) fitted by Fit?

Upvotes: 3

Views: 737

Answers (2)

Medical physicist
Medical physicist

Reputation: 2594

I found an easy solution based on this question dplyr::do() requires named function?

Fit <-  DF %>%
    group_by(Channel) %>% 
    do({
        fit = lm(mean ~ Col + poly(Row, 2), data = .)
        pred <- predict(fit)
        data.frame(., pred)
    })

Upvotes: 3

vrajs5
vrajs5

Reputation: 4126

If you could have provided more fields apart from Channel, Col and Row we would have given better direction. Right now, I have prepared solution for given Channel & Col. You can always add Row and define lm using other fields you have.

I think following should work for you,

library(dplyr)
df = data.frame(Channel=c(rep(0,50),rep(1,50),rep(2,100)), 
                Row = 1:200, 
                Col = c(rep(1,50),rep(2,100),rep(3,50)), 
                mean = rnorm(200))
glimpse(df)
Fit <-  df %>%
  group_by(Channel,Col) %>% 
  do(fit = lm(mean ~ poly(Row, 2), data = .))
Fit    
Source: local data frame [4 x 3]
Groups: <by row>    
  Channel Col     fit
1       0   1 <S3:lm>
2       1   2 <S3:lm>
3       2   2 <S3:lm>
4       2   3 <S3:lm>
Fit$fit
[[1]]    
Call:
lm(formula = mean ~ poly(Row, 2), data = .)    
Coefficients:
  (Intercept)  poly(Row, 2)1  poly(Row, 2)2  
       0.1403         0.2171        -0.6281
[[2]]    
Call:
lm(formula = mean ~ poly(Row, 2), data = .)
Coefficients:
  (Intercept)  poly(Row, 2)1  poly(Row, 2)2  
     -0.07416       -0.39332        0.57889
[[3]]
Call:
lm(formula = mean ~ poly(Row, 2), data = .)
Coefficients:
  (Intercept)  poly(Row, 2)1  poly(Row, 2)2  
       0.1349        -0.3405         1.5679
[[4]]
Call:
lm(formula = mean ~ poly(Row, 2), data = .)
Coefficients:
  (Intercept)  poly(Row, 2)1  poly(Row, 2)2  
       0.0379         1.2867        -1.1028

Upvotes: 0

Related Questions