Merge column labels with Pandas MultiIndex

Question

I'm trying to work with data in a pandas dataframe which I am importing from an Excel spreadsheet.

I am importing the data so it has a multi-index structure.

blid_df = pd.read_excel('OriginalClean.xlsx', header=[0,1,2], index_col=None)

This produces this dataframe.

I want to index by Country which I am able to do using set_index however all my countries become tuples (e.g Australia,).

I also want to make the country type sit at the correct level and remove these unnamed level labels.

Below is an example of what I am trying to achieve:

Timeless · Accepted Answer

Maybe I'm wrong but I imagine/suppose that you're reading a multi-header spreadsheet this way :

df = pd.read_excel("file.xlsx", header=[0, 1, 2])

You can try this instead :

df = (
    pd.read_excel("file.xlsx", index_col=[0, 1], header=[0, 1, 2]) # 1st chain
        .rename_axis(index=["Country", "Country Type"], columns=[None]*3)
)

df.index.nlevels   # should be 2 (previously 1)
df.columns.nlevels # should be 3

If you're not dealing with an Excel file, replace the first chain with df.set_index(list(df.columns[:2])).

Merge column labels with Pandas MultiIndex

Answers (2)

Related Questions