Trying to understand python multithreading

Question

Please consider this code:

import threading

def printer():
    for i in range(2):
        with lock:
            print ['foo', 'bar', 'baz']

def main():
    global lock
    lock = threading.Lock()
    threads = [threading.Thread(target=printer) for x in xrange(2)]
    for t in threads:
        t.start()
        t.join()

main()

I can understand this code and it is clear: We create two threads and we run them sequentially - we run second thread only when first thread is finished. Ok, now consider another variant:

import threading

def printer():
    for i in range(2):
        with lock:
            print ['foo', 'bar', 'baz']

def main():
    global lock
    lock = threading.Lock()
    threads = [threading.Thread(target=printer) for x in xrange(2)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()

main()

What happens here? Ok, we run them in parallel, but what is the purpose of make main thread waiting for child threads in second variant? How it can influence on the output?

Sam Hartman · Accepted Answer

In the second variant, the ordering of execution is much less defined. The lock is released each time through the loop in printer. In both variants, you have two threads and two loops within a thread. In the first variant, since only one thread runs at a time, you know the total ordering. In the second variant, each time the lock is released, the thread running may change. So you might get

thread 1 loop 1
thread 1 loop 2
thread 2 loop 1
thread 2 loop 2

or perhaps * thread 2 loop 1 * thread 1 loop 1 * thread 1 loop 2 * thread 2 loop 2

The only constraint is that loop1 within a given thread runs before loop 2 within that thread and that the two print statements come together since the lock is held for both of them.

In this particular case I'm not sure the call to t.join() in the second variant has an observable effect. It guarantees that the main thread will be the last thread to end, but I'm not sure that in this code you can observe that in any way. In more complex code, joining the threads can be important so that cleanup actions are only performed after all threads terminate. This can also be very important if you have daemon threads, because the entire program will terminate when all non-daemon threads terminate.

Trying to understand python multithreading

Answers (2)

Related Questions