How to convert the "rows" of a pandas Series into columns of a DataFrame?

Question

I have the following pandas Series, ser1 of shape (100,).

import pandas as pd
ser1 = pd.Series(...)
print(len(ser1)) 
##  prints (100,)

The length of each ndarray within this Series is length 150000, where each element is a character.

len(print(ser1[0]))
##  prints 150000

ser1.head()
sample1       xhtrcuviuvjhgfsrexvuvhfgshgckgvghfsgfdsdsg...
sample2       jhkjhgkjvkjgfjyqerwqrbxcvmkoshfkhgjknlkdfk...
sample3       sdfgfdxcvybnjbvtcyuikjhbgfdftgyhujhghjkhjn...
sample4       bbbbbbadfashdwkjhhguhoadfopnpbfjhsaqeqjtyi...
sample5       gfjyqedxcvrexvuvcvmkoshdftgyhujhgcvmkoshfk...
dtype: object

I would like to covert this pandas Series into a pandas DataFrame such that each element of this pandas Series "row" is a DataFrame column. That is, each element of that Series array would be an individual column. In this case, ser1 would have 150000 columns.

print(type(df_ser1)) # DataFrame of ser1
## outputs 
df_ser1.head()
     samples    char1    char2    char3    char4    char5    char6
0    sample1    x        h        t        r        c        u
1    sample2    j        h        k        j        h        g
2    sample3    s        d        f        g        f        d
3    sample4    b        b        b        b        b        b
........

How would one convert a pandas Series to a DataFrame in this way?

The most obvious idea would be to do

df_ser = ser1.to_frame

but this does not separate elements into individual Dataframe columns:

df_ser = ser1.to_frame
df_ser.head()
                                                       0
sample1       xhtrcuviuvjhgfsrexvuvhfgshgckgvghfsgfdsdsg...
sample2       jhkjhgkjvkjgfjyqerwqrbxcvmkoshfkhgjknlkdfk...
sample3       sdfgfdxcvybnjbvtcyuikjhbgfdftgyhujhghjkhjn...
......

Somehow, one would iterate though each element of the "Series row" and create a column, though I'm not sure how computationally feasible that is. (It's not very pythonic.)

How would one do this?

piRSquared · Accepted Answer

Consider a sample series ser1

ser1 = pd.Series(
    'abc def ghi'.split(),
    'sample1 sample2 sample3'.split())

Apply with pd.Series after having made the string a list of chars.

ser1.apply(lambda x: pd.Series(list(x))) \
    .rename(columns=lambda x: 'char{}'.format(x + 1))

        char1 char2 char3
sample1     a     b     c
sample2     d     e     f
sample3     g     h     i

How to convert the "rows" of a pandas Series into columns of a DataFrame?

Answers (2)

Sample Data

Solution

Runtime

Related Questions

How to convert the &quot;rows&quot; of a pandas Series into columns of a DataFrame?

Answers (2)

Sample Data

Solution

Runtime

Related Questions

How to convert the "rows" of a pandas Series into columns of a DataFrame?