Reputation: 905
I need to store a data structure like this:
{'x1,y1,z1': [[p11_x,p11_y,p11_z], [p12_x,p12_y,p12_z], ..., [p1n_x,p1n_y,p1n_z]],
'x2,y2,z2': [[p21_x,p21_y,p21_z], [p22_x,p22_y,p22_z], ..., [p2n_x,p2n_y,p2n_z]],
...
'xn,yn,zn': [[pn1_x,pn1_y,pn1_z], [pn2_x,pn2_y,pn2_z], ..., [pnm_x,pnm_y,pnm_z]]}
Every key is a grid cell index and the value is a list of classified points. The list can be variable length, but I can make it a fixed size, for example 1000 elements.
For now I tried something like this:
np.zeros(shape=(100,100,100,50,3))
But if I use numba.jit with that function, the execution time is a few times worse than with pure Python.
Simple Python example of what I want to do:
def split_into_grid_py(points: np.array):
    grid = {}
    for point in points:
        r_x = round(point[0])
        r_y = round(point[1])
        r_z = round(point[2])
        try:
            grid[(r_x, r_y, r_z)].append(point)
        except KeyError:
            grid[(r_x, r_y, r_z)] = [point]
    return grid
Is there any efficient way of doing that with Numba? Timed over 10 executions in a loop with the same data set, the Numba version is a few times slower, so it's poor optimization.
My numba code:
@numba.jit(nopython=True)
def split_into_grid(points: np.array):
    grid = np.zeros(shape=(100,100,100,50,3))
    for point in points:
        r_x = round(point[0])
        r_y = round(point[1])
        r_z = round(point[2])
        i = 0
        for cell in grid[r_x][r_y][r_z]:
            if not np.sum(cell):
                grid[r_x][r_y][r_z][i] = point
                break
            i += 1
    return grid
Upvotes: 1
Views: 87
Reputation: 50826
The pure Python version appends items in O(1) time thanks to the dictionary container, while the Numba version uses an O(n) array search (bounded by 50). Moreover, np.zeros(shape=(100,100,100,50,3)) allocates an array of about 1 GiB, which results in many cache misses since the computation has to be done in RAM. Meanwhile, the pure Python version may fit in the CPU caches. There are two strategies to solve that.
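For reference, the arithmetic behind that figure (float64, so 8 bytes per element):

    # 100*100*100 cells * 50 slots * 3 coordinates * 8 bytes
    >>> 100 * 100 * 100 * 50 * 3 * 8 / 2**30
    1.1175870895385742   # i.e. roughly 1.1 GiB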
The first strategy is to use 3 containers: an array keyGrid mapping each grid cell to an offset into a second array valueGrid (or -1 if there is no point associated with this cell); valueGrid, which contains all the points for a given grid cell; and finally countingGrid, which counts the number of points per grid cell. Here is an untested example:
import numpy as np
import numba

@numba.jit(nopython=True)
def split_into_grid(points: np.array):
    # Signed type so that -1 can mark empty cells; a smaller type
    # (e.g. np.int16) can be used if few enough grid cells are filled
    keyGrid = np.full((100, 100, 100), -1, dtype=np.int32)
    i = 0
    for point in points:
        r_x = round(point[0])
        r_y = round(point[1])
        r_z = round(point[2])
        if keyGrid[r_x, r_y, r_z] < 0:
            keyGrid[r_x, r_y, r_z] = i
            i += 1
    # Number of grid cells containing at least one point
    uniqueCloudPointCount = i
    # Note: the number of points per grid cell is also bounded by the type (255 here)
    countingGrid = np.zeros(uniqueCloudPointCount, dtype=np.uint8)
    # float64 so fractional coordinates are preserved; -1 marks unused slots
    valueGrid = np.full((uniqueCloudPointCount, 50, 3), -1, dtype=np.float64)
    for point in points:
        r_x = round(point[0])
        r_y = round(point[1])
        r_z = round(point[2])
        key = keyGrid[r_x, r_y, r_z]
        addingPos = countingGrid[key]
        valueGrid[key, addingPos] = point
        countingGrid[key] += 1
    return (keyGrid, valueGrid, countingGrid)
Note that the arrays are quite small as long as not all grid cells contain points, resulting in fewer cache misses. Moreover, the mapping of each point is done in (small) constant time, resulting in much faster code.
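As a usage sketch (not part of the code above; the helper name is made up), the returned triple could be queried for one cell like this:

    import numpy as np

    def points_in_cell(keyGrid, valueGrid, countingGrid, r_x, r_y, r_z):
        # keyGrid stores -1 for cells that never received a point
        key = keyGrid[r_x, r_y, r_z]
        if key < 0:
            return np.empty((0, 3), dtype=valueGrid.dtype)
        # Only the first countingGrid[key] slots of this cell are filled
        return valueGrid[key, :countingGrid[key]]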
The second strategy is to use the same method as in the pure Python implementation, but with Numba types. Indeed, Numba experimentally supports dictionaries. You can replace the exception with a dictionary check ((r_x, r_y, r_z) in grid), which causes fewer compilation issues and likely speeds up the resulting code. Note that Numba dicts are often only about as fast as CPython ones (if not slower), so the resulting code may not be much faster.
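As an untested sketch of that second strategy (it assumes a Numba version recent enough to support nested typed containers, i.e. a typed.Dict whose values are typed.Lists of point arrays), it could look like this:

    import numpy as np
    import numba
    from numba import types
    from numba.typed import Dict, List

    # One point is stored as a 1D float64 array
    point_type = types.float64[:]

    @numba.njit
    def split_into_grid_dict(points):
        grid = Dict.empty(
            key_type=types.UniTuple(types.int64, 3),
            value_type=types.ListType(point_type),
        )
        for i in range(points.shape[0]):
            point = points[i]
            # Build the key as a plain int64 tuple so it matches the declared key type
            key = (np.int64(round(point[0])),
                   np.int64(round(point[1])),
                   np.int64(round(point[2])))
            # Membership test instead of try/except
            if key in grid:
                grid[key].append(point)
            else:
                lst = List.empty_list(point_type)
                lst.append(point)
                grid[key] = lst
        return grid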
Upvotes: 2