Maximum difference of values in the columns of a pandas dataframe

Question

I have a dataframe with 8 columns as follows:

Index
Name of the counties
Population from 2010 Census
Population from 2011 Census
Population from 2012 Census
Population from 2013 Census
Population from 2014 Census
Population from 2015 Census

I need to find the county that has had the largest absolute change in population within the period 2010-2015.

For example, if a county's population over the 5-year period is 100, 120, 80, 105, 100, 130, then its largest change in the period would be |130-80| = 50. I can come up with a solution using for loops and conditionals, but it doesn't seem to be the best way to solve the problem. How can I write this simply using pandas dataframe functions?

busybear · Accepted Answer

Use the dataframe's min and max methods with the axis parameter set to 1, so that they are computed across the columns of each row. If you set the column 'Name of the counties' as your index, it makes things a little easier, because the result is then labeled by county. Then you can use idxmax to find which county has the largest range.

df = df.set_index('Name of the counties')
# Row-wise range (max minus min) per county; idxmax returns the county with the largest range
(df.max(axis=1) - df.min(axis=1)).idxmax()
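
To see the approach end to end, here is a minimal sketch on made-up data. The county names 'A', 'B', 'C' and the shortened year column names are assumptions for illustration only; county 'A' uses the sample values from the question.

```python
import pandas as pd

# Hypothetical data: county names and column labels are illustrative only.
df = pd.DataFrame({
    'Name of the counties': ['A', 'B', 'C'],
    '2010': [100, 200, 50],
    '2011': [120, 210, 55],
    '2012': [80, 205, 60],
    '2013': [105, 215, 58],
    '2014': [100, 220, 62],
    '2015': [130, 225, 65],
}).set_index('Name of the counties')

# Range of each row: largest population minus smallest population.
pop_range = df.max(axis=1) - df.min(axis=1)

# Index label (county name) of the row with the largest range.
largest = pop_range.idxmax()

print(largest, pop_range[largest])
```

Here county 'A' has the largest range, |130-80| = 50. If the frame also holds numeric columns that are not populations, select only the population columns before taking max and min, otherwise they will be included in the row-wise range.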
