Ignasi

Reputation: 631

Parallelization of Python list comprehension not enhancing performance

Recently I have been trying to parallelize some list comprehensions to speed up my code, but I found that the parallelization leads to worse execution times... can someone help me understand why?

My computer is an i7 with 4 cores / 8 threads at around 3 GHz per core, and I am using Python 2.7.

Here is an example of my code:

import numpy as np
import multiprocessing as mulpro
import itertools

d1 = 0.1
d2 = 0.2
data = range(100000)  # Array of data

# Example of list comprehension
data2 = [i + np.random.uniform(d1, d2) for i in data]  # this is faster than the following

# Example of multiprocessing
def parAddRandom(array):
    array = list(array)
    return array[0] + np.random.uniform(array[1], array[2])

pool = mulpro.Pool(processes=8)
data3 = pool.map(parAddRandom, itertools.izip(data, itertools.repeat(d1), itertools.repeat(d2)))

I would expect the code to be faster with parallelization, as 8 cores are being used instead of just 1, but it is not...
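
For the comparison I time each version with something along these lines (just a minimal sketch, the exact harness does not matter):

import time

t0 = time.time()
data2 = [i + np.random.uniform(d1, d2) for i in data]
print("list comprehension: {:.3f} s".format(time.time() - t0))

t0 = time.time()
data3 = pool.map(parAddRandom, itertools.izip(data, itertools.repeat(d1), itertools.repeat(d2)))
print("pool.map: {:.3f} s".format(time.time() - t0))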

EDIT:

If I modify the code so that the function parAddRandom only accepts one value, then it is much faster...

import numpy as np
import multiprocessing as mulpro
import itertools

d1 = 0.1
d2 = 0.2
data = range(100000)  # Array of data

# Example of list comprehension
data2 = [i + np.random.uniform(d1, d2) for i in data]  # Now this is not faster than the following

# Example of multiprocessing
def parAddRandom(value):
    return value + np.random.uniform(0.1, 0.2)

pool = mulpro.Pool(processes=8)
data3 = pool.map(parAddRandom, data)

But I still need to be able to modify the parameters "d1" and "d2" as in the previous code...

Upvotes: 1

Views: 903

Answers (1)

Yoav Glazner

Reputation: 8066

Because your function is so small, the overhead of the function call (and the other multiprocessing machinery) is dominant:

import numpy as np
import timeit

d1 = 0.1
d2 = 0.2

def parAddRandom(array):
    return array[0] + np.random.uniform(array[1], array[2])

array = 45436, d1, d2

with_function_calling = timeit.timeit("parAddRandom(array)", globals=globals())
without_function_calling = timeit.timeit("array[0] + np.random.uniform(array[1],array[2])", globals=globals())

print("function call adds {:0.2f}% overhead :(".format(100.0 * with_function_calling / without_function_calling - 100.0))

function call adds 18.59% overhead :(

My guess is that the rest of the multiprocessing machinery adds almost another 100% in your example...

If you want this to be effective, you'll have to create a function that processes a bigger chunk of the data on each call, along the lines of the sketch below.
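
A rough sketch of what I mean (not your exact code; the chunk size of 1000 and the helper name parAddRandomChunk are just for illustration), which also keeps d1 and d2 as parameters:

import numpy as np
import multiprocessing as mulpro

def parAddRandomChunk(args):
    # Unpack one chunk of data plus the two bounds, and process the whole
    # chunk in a single call, so the per-call overhead is paid once per
    # chunk instead of once per element.
    chunk, d1, d2 = args
    return [value + np.random.uniform(d1, d2) for value in chunk]

d1 = 0.1
d2 = 0.2
data = range(100000)
chunks = [data[i:i + 1000] for i in range(0, len(data), 1000)]

pool = mulpro.Pool(processes=8)
chunk_results = pool.map(parAddRandomChunk, [(c, d1, d2) for c in chunks])
data3 = [value for chunk in chunk_results for value in chunk]  # flatten back to one list

Note that pool.map also accepts a chunksize argument, but that only batches the inter-process traffic; your function is still called once per element in the worker, so an explicitly chunked worker tends to help more here.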

Upvotes: 1
