plyr parallel error-handling and warnings

Question

This is a common construction I use for error-handling:

x <- tryCatch(foo(), error=function(e){
    warning(e)
    NULL})

I run foo against many data objects, some of which may fail for whatever reason. When one fails, I want its result to be NULL so that the entire run doesn't stop, but I also want a warning so I can see what failed and why.

I often run these through plyr, like this. Suppose some of them fail:

library(plyr)

x <- llply(1:4, .fun = function(i) {
    result <- tryCatch({
        if (i %% 2 == 0) stop(i)  # even inputs fail
        i
    }, error = function(e) {
        warning(e)  # report what failed
        NULL        # and carry on with NULL
    })
    result
})
x

Result:

 Warning messages:
 1: In doTryCatch(return(expr), name, parentenv, handler) : 2
 2: In doTryCatch(return(expr), name, parentenv, handler) : 4

 > x
 [[1]]
 [1] 1

 [[2]]
 NULL

 [[3]]
 [1] 3

 [[4]]
 NULL

However, suppose I turn on parallel computing with the same code:

library(plyr)
library(doParallel)
registerDoParallel(cores = 4)

x <- llply(1:4, .parallel = TRUE, .fun = function(i) {
    result <- tryCatch({
        if (i %% 2 == 0) stop(i)  # even inputs fail
        i
    }, error = function(e) {
        warning(e)  # same handler as in the sequential run
        NULL
    })
    result
})

Result:

 Error in do.ply(i) : task 2 failed - "2"

The whole job fails as soon as any task errors, and no result is constructed. The warning(e) call was somehow converted into an error. I can get around this by commenting out warning(e), which gives the desired NULLs in my data structure wherever a task failed, but then I lose the information about what happened.
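
For what it's worth, the conversion can be reproduced in base R without plyr. The sketch below assumes (I have not confirmed this in the foreach/doParallel sources) that the parallel backend runs each task inside something like an outer tryCatch with an error handler. The helper name run is made up for illustration. It compares warning(e), which re-signals the original condition object, with warning(conditionMessage(e)), which builds a new warning from the message text.

```r
# Hypothetical helper: run the inner tryCatch pattern inside an outer
# tryCatch that reports which kind of condition reached it.
run <- function(handler) {
    tryCatch(
        tryCatch(stop("boom"), error = handler),
        error   = function(e) "outer saw an error",
        warning = function(w) "outer saw a warning"
    )
}

# warning(e) signals e itself, and e still has class "error"
resignal <- run(function(e) { warning(e); NULL })

# warning(conditionMessage(e)) creates a fresh simpleWarning
fresh <- run(function(e) { warning(conditionMessage(e)); NULL })

print(resignal)
print(fresh)
```

If that assumption about the backend holds, it would explain why the sequential run only prints warnings (nothing outside is listening for errors) while the parallel run reports a failed task.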

In fact, I don't know of any good way to throw warnings from parallel plyr. They seem to be squelched. If that's a limitation of parallelism, that makes sense. But the behaviour of warnings becoming errors seems weird to me, and I'd like to understand what's going on here.
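
One possible workaround that does not rely on warnings travelling back from the workers is to return the error message as data and raise the warnings afterwards in the calling process. This is only a sketch: safe_call is a made-up helper, foo here is a stand-in that fails on even inputs, and base lapply is used so the example runs on its own. I would expect the same function to work with llply(..., .parallel = TRUE), since the results are plain lists, but I have not verified that.

```r
# Hypothetical helper: return both the value and any error message
safe_call <- function(f, ...) {
    tryCatch(
        list(value = f(...), error = NULL),
        error = function(e) list(value = NULL, error = conditionMessage(e))
    )
}

# Stand-in for foo(): fails on even inputs, as in the question
foo <- function(i) {
    if (i %% 2 == 0) stop(i)
    i
}

out <- lapply(1:4, function(i) safe_call(foo, i))

# Results, with NULL where a task failed
x <- lapply(out, `[[`, "value")

# One message per task, NA where the task succeeded
errors <- vapply(out, function(r) {
    if (is.null(r$error)) NA_character_ else r$error
}, character(1))

# Raise the warnings in the calling process, after the loop has finished
for (i in which(!is.na(errors))) {
    warning(sprintf("task %d failed: %s", i, errors[i]), call. = FALSE)
}
```

The trade-off is that the warnings only appear once all tasks are done, and each result carries a little extra structure until x is extracted.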
