Starting c++11 thread with a lambda capturing local variable

Question

I have a rather basic problem and not sure where it originates from: lambda capture evaluation in a concurrent environment or misuse of boost filesystem library.
This is sample code:

#include 
#include 
#include 
#include 

using namespace std;
using namespace boost::filesystem;

void query(const string filename)
{
    cout << filename << " ";
}

int main() {
    path p("./");
    vector thrs;

    for(auto file = directory_iterator(p); file != directory_iterator(); ++file)
    {
        thread th( [file] {query(file->path().string());} );
        thrs.push_back(move(th));
    }

    for (auto& t : thrs)
        t.join();

    return 0;
}

which at runtime gives:

:~/workspace/sandbox/Release$ l
main.o  makefile  objects.mk  sandbox*  sources.mk  subdir.mk
:~/workspace/sandbox/Release$ LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/home/a/custom-libs/boost-1_59/lib/ ./sandbox 
./subdir.mk ./sources.mk ./sandbox ./objects.mk ./main.o ./main.o

Notice the race condition - not all files end up being passed to thread function (at this run makefile is missing).
I am able to find a workaround by extracting the argument in a local var, rewriting the loop body as:

    auto fn = file->path().string();
    thread th( [fn] {query(fn);} );
    thrs.push_back(move(th));

Where is the race condition coming from?
Isn't file->path().string() evaluated right at the time of thread creation?

David Schwartz · Accepted Answer

Isn't file->path().string() evaluated right at the time of thread creation?

No. Since query must be called on the new thread, the statement query(file->path().string()) must be executed on the new thread. So it's executed some time after thread creation when the thread gets around to doing stuff.

You captured file. Period.

It's conceptually no different from:

string * j = new string ("hello");
thread th( [j] { cout << *j << std::endl; } );
*j = "goodbye";
th.join();

You captured j. Not *j. And while j's value doesn't change, the value of the thing that j refers to changes. So who knows what it will be when the thread finally dereferences it.

You might think that you're capturing the iterator's value and therefore you'll be okay because that value won't change. Unfortunately, that's just not how this iterator is implemented. It's implemented in a way that allows it to irretrievably discard previous information when it's incremented, so incrementing this type of iterator has the effect of incrementing copies of that iterator too.

If you capture the value of something that refers to something whose value you don't capture, if that second value is changed, the thing you captured now refers to a different value. Always know exactly what you are capturing and how you are capturing it. Sadly, you cannot safely capture by value instances of classes you don't deeply understand.

Starting c++11 thread with a lambda capturing local variable

Answers (2)

Related Questions