Why does the plot of conditional means (conditional mode) or random effects look like this?

Question

I am fitting a random effects model with glmer from lme4 in R. The model looks OK to me.

My understanding is that the random effects come from a normal distribution with mean 0 and variance 1.632 (see above). So I was expecting the distribution of conditional means (or conditional modes, obtained by using getME(modelfit, 'b')) should more or less follow a bell curve. However, when I plot the histogram of the conditional means, I found it very strangel it looks like 2 separate distributions separated by 0. The plot is here:

The corresponding Q-Q plot of the conditional modes:

Does anyone know what this means? Is there some strong confounder? Or can it just behave like this?

Ben Bolker · Accepted Answer

@RomanLustrik is correct to distinguish between the underlying assumption of Normality of the conditional mode and the estimates of the conditional modes themselves. The estimates need not be Normal; see ?qqmath.ranef.mer for diagnostic plots of the distribution of the conditional modes. If the distribution of your conditional modes is far from Normal, then you may indeed have a problem. Unfortunately, relaxing the assumption of Normality makes the modeling somewhat harder. You might, for example, be able to use a latent mixture model where you assume that the conditional modes are drawn from a mixture of two Normals - but I don't know offhand of an R package that implements this; if I were going to do it I would probably implement it using a toolbox like JAGS or Stan.

Before you go that direction, it's important to note that the characteristics of your data (approximately 2 Bernoulli observations per group) are such that the default Laplace approximation is expected to be very bad. Try nAGQ=10 (or even higher); it will slow your fitting considerably, but may improve the results.

Why does the plot of conditional means (conditional mode) or random effects look like this?

Answers (2)

Related Questions