filter pandas rows by other dataframe columns

Question

I have 3 dataframes already sorted with date and p_id and with no null values as:

First DataFrame

df1 = pd.DataFrame([['2018-07-05',8.0,1],
                    ['2018-07-15',1.0,1],
                    ['2018-08-05',2.0,1],
                    ['2018-08-05',2.0,2]],
      columns=["purchase_date", "qty", "p_id"])

Second DataFrame

df2 = pd.DataFrame([['2018-07-15',2.0,1],
                    ['2018-08-04',7.0,1],
                    ['2018-08-15',1.0,2]], 
      columns=["sell_date", "qty", "p_id"])

Third DataFrame

df3 = pd.DataFrame([['2018-07-25',1.0,1],
                    ['2018-08-15',1.0,1]],
      columns=["expired_date", "qty", "p_id"])

dataframe looks like:

1st: (Holds Purchase details)

    purchase_date   qty     p_id
0   2018-07-05      8.0     1
1   2018-07-15      1.0     1
2   2018-08-05      2.0     1
3   2018-08-05      2.0     2

2nd: (Holds Sales Details)

    sell_date   qty    p_id
0   2018-07-15  2.0    1
1   2018-08-04  7.0    1
2   2018-08-15  1.0    2

3rd: (Holds Expiry Details)

    expired_date    qty   p_id
0   2018-07-25      1.0   1
1   2018-08-15      1.0   1

Now What I want to do is find when the product that has expired was bought
following FIFO (product first purchased will expire first)

Explanation: Consider product with id 1

By date 2018-07-15

We had 8+1 purchased quantity and -2 sold quantity i.e. total of 8+1-2 quantity in stock , -ve sign signify quantity deduction

By date 2018-07-25

1 quantity expired so first entry for our new when_product_expired dataframe will be:

purchase_date     expired_date    p_id
2018-07-05        2018-07-25      1

And then for next expiry entry

By date 2018-08-04

7 quantity were sold out so current quantity will be 8+1-2-7 = 0

By date 2018-08-05

2 quantity were bought so current quantity is 0+2

By date 2018-08-15

1 quantity expired

So a new and final entry will be:

purchase_date     expired_date    p_id
2018-07-05        2018-07-25      1
2018-08-05        2018-08-15      1

This time the product expired was one that was purchased on 2018-07-25

Actually I have date time, so purchase and sell time will never be equal (you may assume), also before selling and expire, there will always be some quantity of product in stock, i.e. data is consistent
And Thank you in advance :-)

Updated

What by now I am thinking is rename all date fields to same field name and append purchase, sell, expired dataframe with negative sign, but that won't help me

df2.qty = df2.qty*-1
df3.qty=df3.qty*-1
new = pd.concat([df1,df2, df3],sort=False)
      .sort_values(by=["purchase_date"],ascending=True)
      .reset_index(drop=True)

filter pandas rows by other dataframe columns

Explanation: Consider product with id 1

And then for next expiry entry

Updated

Answers (1)

Related Questions