Pandas Groupby with Aggregate, and Quantiles

Question

I am attempting to groupby a pandas DataFrame and calculate quantiles and aggregates from a column.

Here's a sample DataFrame:

import pandas as pd
import numpy as np

df = pd.DataFrame({
                   'id': [1, 1, 1, 2],
                   'cat': ['p','p','p','n'],
                   'num': [5, 10, 10, 5],
                   'v': [np.nan, np.nan, np.nan, 'v2'],
                   'p': [1000, 1300, 1400, 1100]
                 })

I am looking for a solution that can scale with n # of categorical and numeric columns. For numeric and categorical columns, aggregate using mode function.

With p, create two new columns, range of .25 and .75 quantiles and min and max.

Expected output:

id  cat num  v    pquantile     min-max    

1   p   10   NaN  1075 - 1325   1000 - 1400  
2   n   5    v2   1100          1100

Also, aggregate function mode needs to be able to handle a tie.

Pandas Groupby with Aggregate, and Quantiles

Answers (1)

Related Questions