Reputation: 171
I have the matching() function below, with a for loop that iterates over a big generator (unique_combinations). It takes days to process, so I wanted to use multiprocessing on the elements of the loop to speed things up, but I just can't figure out how to do it. I find the logic behind concurrent.futures hard to follow in general.
from concurrent.futures import ProcessPoolExecutor
from fuzzywuzzy import fuzz  # fuzz.ratio; newer releases ship as "thefuzz"

results = []
match_score = []

def matching():
    for pair in unique_combinations:
        if fuzz.ratio(pair[0], pair[1]) > 90:
            results.append(pair)
            match_score.append(fuzz.ratio(pair[0], pair[1]))

def main():
    executor = ProcessPoolExecutor(max_workers=3)
    task1 = executor.submit(matching)
    task2 = executor.submit(matching)
    task3 = executor.submit(matching)

if __name__ == '__main__':
    main()
    print(results)
    print(match_score)
I am assuming this should speed up the execution.
Upvotes: 1
Views: 140
Reputation: 3721
If you're already using concurrent.futures, the nicest way, IMO, is to use map:
import concurrent.futures

from fuzzywuzzy import fuzz  # or "thefuzz"

def matching(pair):
    fuzz_ratio = fuzz.ratio(pair[0], pair[1])  # only calculate this once
    if fuzz_ratio > 90:
        return pair, fuzz_ratio
    else:
        return None

def main():
    unique_combinations = [('a', 'b'), ('b', 'c'), ('c', 'd')]  # your real string pairs here
    results = []
    match_score = []
    with concurrent.futures.ProcessPoolExecutor(max_workers=4) as executor:
        for result in executor.map(matching, unique_combinations, chunksize=100):
            if result:
                # handle the results somehow
                results.append(result[0])
                match_score.append(result[1])

if __name__ == '__main__':
    main()
There are lots of ways to handle the results, but the gist is that you return a value from matching and then retrieve it in the executor.map for loop in main. Docs here.
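For a fully self-contained version of the approach above, here is a sketch that swaps fuzz.ratio for a stand-in built on difflib.SequenceMatcher from the standard library (so it runs without fuzzywuzzy installed; the structure is otherwise the same):

```python
import concurrent.futures
from difflib import SequenceMatcher

def ratio(a, b):
    # Stand-in for fuzz.ratio: a 0-100 similarity score between two strings.
    return int(round(SequenceMatcher(None, a, b).ratio() * 100))

def matching(pair):
    score = ratio(pair[0], pair[1])  # compute the score once
    if score > 90:
        return pair, score
    return None

def main():
    # Toy data; in practice this is the big generator of string pairs.
    unique_combinations = [
        ('apple', 'apple'),    # identical -> 100
        ('apple', 'apples'),   # near match -> 91
        ('apple', 'orange'),   # poor match -> filtered out
    ]
    results, match_score = [], []
    with concurrent.futures.ProcessPoolExecutor(max_workers=4) as executor:
        for result in executor.map(matching, unique_combinations, chunksize=2):
            if result:
                results.append(result[0])
                match_score.append(result[1])
    return results, match_score

if __name__ == '__main__':
    results, match_score = main()
    print(results)      # [('apple', 'apple'), ('apple', 'apples')]
    print(match_score)  # [100, 91]
```

With a truly big generator, a generous chunksize matters: it controls how many pairs are shipped to a worker per inter-process round trip, and a value that is too small makes the IPC overhead swamp the actual work.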
Upvotes: 1