Replace second item in sublist with row value of dataframe

Question

I have a nested list, and would like to replace the second item of each sublist with the row values of the dataframe. Here's my dataframe and list:

import pandas as pd
mydata = [{'id' : '12'},
          {'id' : '34'},
          {'id' : '56'},
          {'id' : '78'},]
df = pd.DataFrame(mydata)

L1 = [ ['elephant',0], ['zebra',1], ['lion',2], ['giraffe',3]  ]

The desired result would be: [ ['elephant',12], ['zebra',34], ['lion',56], ['giraffe',78] ]

This is my code:

for i in L1:
    for j, row in df.iterrows():
        i[1] = df["id"][j]

Which outputs: [['elephant', '78'], ['zebra', '78'], ['lion', '78'], ['giraffe','78']]

EdChum · Accepted Answer

Use a list comprehension to extract the first element of each sublist, then zip those with the values of the id column:

In[32]:
list(zip([x[0] for x in L1], df['id'].tolist()))

Out[32]: [('elephant', '12'), ('zebra', '34'), ('lion', '56'), ('giraffe', '78')]

Note that zip produces tuples. If you need a list of lists, convert each tuple to a list:

In[35]:
L2 = list(zip([x[0] for x in L1], df['id'].tolist()))
L2

Out[35]: [('elephant', '12'), ('zebra', '34'), ('lion', '56'), ('giraffe', '78')]

In[36]:
[list(x) for x in L2]

Out[36]: [['elephant', '12'], ['zebra', '34'], ['lion', '56'], ['giraffe', '78']]
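The two steps above can also be collapsed into a single comprehension. This is a minimal sketch that assumes, as in the question, that L1 and df have the same length and are in the same order; the name id_value is only illustrative.

```python
import pandas as pd

df = pd.DataFrame({'id': ['12', '34', '56', '78']})
L1 = [['elephant', 0], ['zebra', 1], ['lion', 2], ['giraffe', 3]]

# Pair each sublist with the id at the same position, discard the old
# second item (_), and build a new list of lists directly.
L2 = [[name, id_value] for (name, _), id_value in zip(L1, df['id'])]
print(L2)
```

This leaves L1 untouched and keeps the ids as strings, because that is how they are stored in the dataframe.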

A pure pandas method would be to construct a DataFrame from your list:

In[41]:
df2 = pd.DataFrame(L1)
df2

Out[41]: 
          0  1
0  elephant  0
1     zebra  1
2      lion  2
3   giraffe  3

Then concatenate the two dataframes column-wise:

In[43]:
merged = pd.concat([df,df2], axis=1)
merged

Out[43]: 
   id         0  1
0  12  elephant  0
1  34     zebra  1
2  56      lion  2
3  78   giraffe  3

Then select the columns of interest, call .values to get a NumPy array, and convert it with tolist():

In[46]:
merged[[0,'id']].values.tolist()

Out[46]: [['elephant', '12'], ['zebra', '34'], ['lion', '56'], ['giraffe', '78']]
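As for why the loop in the question ends up with '78' everywhere: the inner loop runs over every row of df for each sublist, so each sublist's second item is overwritten repeatedly and keeps only the last id. A minimal sketch of an in-place fix, assuming L1 and df line up position by position, is to walk both in lockstep with zip. The int() call is an assumption based on the desired result in the question showing numbers rather than strings.

```python
import pandas as pd

df = pd.DataFrame({'id': ['12', '34', '56', '78']})
L1 = [['elephant', 0], ['zebra', 1], ['lion', 2], ['giraffe', 3]]

# One pass: the n-th sublist receives the n-th id.
for sublist, id_value in zip(L1, df['id']):
    # int() converts '12' to 12; drop it to keep the ids as strings.
    sublist[1] = int(id_value)

print(L1)
```

Unlike the approaches above, this modifies the existing sublists in L1 rather than building a new list.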
