IPython + pandas oneliner not working

Question

I am trying to do a oneliner on IPython but I get SyntaxError: invalid syntax. The code is the following:

 for zzz in ddd.index: zzz1 = zzz.split('///'); zzz3 = [zzz2.strip() for zzz2 in zzz1 if len(zzz1) > 1]; for zzz4 in zzz3: ddd.ix[zzz4]['Class'] = ddd.ix[zzz]['Class']; del ddd.ix[zzz]

I can explain it as: For each value on the index of DataFrame ddd I split it using /// as a separator. Then, if there are multiple values returned, I create a row for each value and remove the original row. In example I have:

             Class
lal          1
eri /// iii  2
aks          3

I want to obtain

             Class
lal          1
eri          2
iii          2
aks          3

The first column (`lal', 'eri', ... ) is the index of dataframe.

How can I achieve this? I have searched through the documentation but I did not manage out how to do it.

Thanks

DSM · Accepted Answer

Here's a version at the opposite end of the spectrum from @Jeff's: horribly slow, but pretty clear, I think.

index_pairs = [(ind, subind.strip()) for ind in df.index for subind in ind.split("///")]
old_i, new_i = zip(*index_pairs)
df2 = df.ix[list(old_i)]
df2.index = new_i

Note that this assumes the original indices are unique.

Start with our frame:

>>> df
             Class
lal              1
eri /// iii      2
aks              3

Make a list of pairs connecting the original index with as many new subindices as needed:

>>> index_pairs = [(ind, subind.strip()) for ind in df.index for subind in ind.split("///")]
>>> index_pairs
[('lal', 'lal'), ('eri /// iii', 'eri'), ('eri /// iii', 'iii'), ('aks', 'aks')]

Transpose:

>>> old_i, new_i = zip(*index_pairs)
>>> old_i
('lal', 'eri /// iii', 'eri /// iii', 'aks')
>>> new_i
('lal', 'eri', 'iii', 'aks')

Use the old indices to index into df:

>>> df2 = df.ix[list(old_i)]
>>> df2
             Class
lal              1
eri /// iii      2
eri /// iii      2
aks              3

Reset the indices:

>>> df2.index = new_i
>>> df2
     Class
lal      1
eri      2
iii      2
aks      3

IPython + pandas oneliner not working

Answers (2)

Related Questions