Using pytest with dataframes to test specific columns

Question

I am writing pytest tests that use panda's dataframes and I am trying to write the code as general as I can. (I can always check element by element but trying to avoid that)

so I have an input dataframe that contains some ID column like this

ID,othervalue, othervalue2
00001,  4,   3
00001,  3,   3
00001,  2,   0
00003,  5,   2
00003,  2,   1
00003,  2,   9

and I do

def test_df_against_angle(df, angle):
    result = do_some_calculation(df, angle)

Now, result is also a dataframe that contains a ID column and it also contains a decision column that can take a value like "plus", "minus" (or "pass", "fail" or something like that) Something like

ID, someresult,  decision, someotherresult
00001,   4,       plus,       3
00001,   2,       plus,       2
00002,   2,       minus,       2
00002,   1,       minus,       5
00002,   0,       minus,       9

I want to add an assertion (or several) that asserts the following (Not all at once, I mean, different assertions since I have not yet decide which would be better):

All decision values corresponding to an ID are the same
The decision values corresponding to an ID are different than the ones of the other ID
The decision of ID 00001 is plus and the one of 00002 is minus

I know that pandas have some assertion to compare equal dataframes but how can I go for this situation?

Using pytest with dataframes to test specific columns

Answers (1)

Related Questions