Merge two DataFrames based on a column condition and values of a specific column with Pandas in Python 3.x

Question

refering to my a few days ago asked question i now have an additional problem with my data. I have following two DataFrames:

    >>> df1
        A  B   date
   0    1  1   2015-02
   1    1  1   2015-03
   2    2  2   2017-01
   3    2  2   2017-02

    >>> df2
        A  B  C            02-2015  03-2015   01-2017   02-2017
   0    1  1  2013-07-01   0.10     0.22      0.55      0.77
   1    1  1  2015-01-01   0.20     0.12      0.99      0.125
   2    2  2  2016-12-01   0.13     0.15      0.15      0.245
   3    2  2  2016-01-01   0.33     0.1       0.888     0.64

What i want is following DataFrame:

    >>> df1
        A  B   date      value
   0    1  1   2015-02   0.20
   1    1  1   2015-03   0.12
   2    2  2   2017-01   0.15
   3    2  2   2017-02   0.245

My current code looks like following:

df1['value'] = df2.set_index('A', 'B').lookup(
            df1.set_index('A', 'B').index, df1['date'])

This does not work and my df1 is a NoneType because in df2 are duplicate rows with condition A and B == 1. What I want is an additional condition where it first extracts the earliest date for each unqiue A and B, which would be for A and B == 1 the date 2015-02.

From df2 it should take row number 1 because the delta in months is only 1 instead of row 0 where the delta will be 18.

Many thanks in advance!

Merge two DataFrames based on a column condition and values of a specific column with Pandas in Python 3.x

Answers (1)

Related Questions