How to conditionally do a vlookup in Pandas dataframe

Question

I am trying to figure out how to do a vlookup to pick out the latest price to fill up a second table. An example below. For item #1, the latest price is at Month 6 (=$6) while item #2 is at Month 5 (=$4). What's the best way to fill up Table B? Note: There might be occasion that item_id cannot be found in Table A if the item is new.

Any guidance? Many Thanks.

Table A (Reference)

| Item_ID | Month | Price |
|---------|-------|-------|
| 1       | 4     | 10    |
| 1       | 5     | 8     |
| 1       | 6     | 6     |
| 2       | 5     | 4     |

Table B (To Fill)

| Shop_ID | Item_ID | Price |
|---------|---------|-------|
| 1       | 1       | 6     |
| 1       | 2       | 4     |

chthonicdaemon · Accepted Answer

You can first find the latest information, then merge it to create the table:

import pandas


tableA = pandas.DataFrame({'Item_ID': {0: 1, 1: 1, 2: 1, 3: 2},
                           'Month': {0: 4, 1: 5, 2: 6, 3: 5},
                           'Price': {0: 10, 1: 8, 2: 6, 3: 4}})
tableB = pandas.DataFrame({'Item_ID': {0: 1, 1: 2}, 
                           'Price': {0: 6, 1: 4}, 
                           'Shop_ID': {0: 1, 1: 1}})

latest = tableA.loc[tableA.groupby('Item_ID')['Month'].idxmax()]
result = tableB[['Shop_ID', 'Item_ID']].merge(latest[['Item_ID', 'Price']],
                                              on='Item_ID')

This yields

       Shop_ID  Item_ID  Price
0        1        1      6
1        1        2      4

How to conditionally do a vlookup in Pandas dataframe

Answers (2)

Related Questions