How to kill threads spawned when using the multiprocessing's Pool imap_unordered

Question

I'm trying to speed up a simple Python program using multiprocessing's Pool. Specifically: the imap_unordered function.

In my case I'm searching for a specific object with specific properties, and checking this property takes a long time, hence the reason I want to spread the load over my CPU cores.

I created the following code:

from multiprocessing import Pool as ThreadPool 
pool = ThreadPool(4) 

some_iterator = (create_item() for _ in range(100000))

results = pool.imap_unordered(my_function, some_iterator)

for result in results:
  if is_favourable(result):
    break

Unfortunately, after calling break, there is still a lot of activity in the threads (as can be observed in my computers activity monitor). How should I keep searching for results till I find a favourable one, or how can I stop iterating over all items using the imap_unordered iterator?

martineau · Accepted Answer

For starters, your example code is not using a multiprocessing ThreadPool because your import statement is wrong (it's just allowing access to the regular Pool class via that name).

Regardless, you can just use the Pool/ThreadPool as a context manager since Python 3.3 and put the loop inside it. This will cause its terminate() method to be called automatically when the context is exited (due to the break statement in the example below), and it will will immediately stop the working processes.

from multiprocessing import current_process
from multiprocessing.pool import ThreadPool
from random import randint
import time

def create_item():
    return randint(0, 20)

def is_favourable(value):
    return value < 20

def my_function(value):
    print(current_process().name, value)
    time.sleep(2)
    return value * 2

if __name__ == '__main__':
    with ThreadPool(4) as pool:  # Use as context manager (Python 3.3+)
        some_iterator = (create_item() for _ in range(10000))
        start = time.time()
        results = pool.imap_unordered(my_function, some_iterator)
        for result in results:
            print('result:', result)
            if is_favourable(result):
                break  # Stop loop and exit Pool context.

    print('done')
    print(time.time() - start)

If you're using an older version of Python, you can just explicitly call pool.terminate() immediately before the break statement (and not use a with statement).

How to kill threads spawned when using the multiprocessing's Pool imap_unordered

Answers (2)

Related Questions

How to kill threads spawned when using the multiprocessing&#39;s Pool imap_unordered

Answers (2)

Related Questions

How to kill threads spawned when using the multiprocessing's Pool imap_unordered