Convert a pandas Series of lists into a numpy array

Question

I want to convert a pandas Series of strings of list of numbers into a numpy array. What I have is something like:

ds = pd.Series(['[1 -2 0 1.2 4.34]', '[3.3 4 0 -1 9.1]'])

My desired output:

arr = np.array([[1, -2, 0, 1.2, 4.34], [3.3, 4, 0, -1, 9.1]])

What I have done so far is to convert the pandas Series to a Series of a list of numbers as:

ds1 = ds.apply(lambda x: [float(number) for number in x.strip('[]').split(' ')])

but I don't know how to go from ds1 to arr.

Shubham Sharma · Accepted Answer

Use Series.str.strip + Series.str.split and create a new np.array with dtype=float:

arr = np.array(ds.str.strip('[]').str.split().tolist(), dtype='float')

Result:

print(arr)

array([[ 1.  , -2.  ,  0.  ,  1.2 ,  4.34],
       [ 3.3 ,  4.  ,  0.  , -1.  ,  9.1 ]])

Answers (2)