Plot grid of histograms based on group variable using plotly

Question

I have a data frame that contains multiple variables where each variable is logically connected to a factor level of an additional group variable. I would like to plot a histogram of each variable in such a way that it is possible to show a grid of multiple histograms 'group-wise'.

Here's an example data frame df_melt (the variables var_1,var_2,var_3,var_4 are logically connected to the factor level 'foo', the variables var_5,var_6,var_7 belong to factor level 'bar'):

import numpy as np
import pandas as pd

# simulate data and create plot-ready dataframe
np.random.seed(42)
var_values = np.random.randint(low=1,high=100,size=(100,7))
var_names = ['var_1','var_2','var_3','var_4','var_5','var_6','var_7']
group_names = ['foo','foo','foo','foo','bar','bar','bar']

df = pd.DataFrame(var_values,columns=var_names)
multi_index = pd.MultiIndex.from_arrays([df.columns,group_names],names=['variable','group'])
df.columns = multi_index
df_melt = pd.melt(df)

The output should look like this:

These stackoverflow posts might help to provide an answer, but I was not able to come up with a solution on my own:

Plotting a grouped pandas data in plotly

Plotly equivalent for pd.DataFrame.hist

Plot grid of histograms based on group variable using plotly

Answers (1)

Related Questions