Normalizing to bin height with matplotlib

Question

I have a set of histograms, each one using a single column of a pandas dataframe and the matplotlib.pyplot.hist function. However, each set of data is a different length, so I want to normalize each histogram; using the built in density option does not make sense for my data, so I want to divide each bin height by the maximum bin height.

Overall I want to know how to 1- extract the bin heights from the histogram made by plt.hist 2- divide all the bin heights by the maximum (got confused by datatypes here, I think Im trying to divide two tuples?) 3- plot a new histogram with the normalized bin heights.

Ideally I want to do this in an order where I can tweak my choice of bin number in the original plot and then re-run to update both the original and normalized plot.

I tried naming what the plt.hist function returns and then dividing by the max, but the only version of this that did not throw an error gave me a plot that made no sense (I think I divided the values Im binning instead of the bin heights, I also don't really understand what n, bins, and patches are)

(n, bins, patches) = plt.hist(df['values'], bins=50)

plt.hist(df['values']/max(n), bins 50)

Normalizing to bin height with matplotlib

Answers (1)

Related Questions