Calculating AIC number manually Given a distribution of data and some distribution string

Question

Suppose I have the following data:

 array([[0.88574245, 0.3749999 , 0.39727183, 0.50534724],
        [0.22034441, 0.81442653, 0.19313024, 0.47479565],
        [0.46585887, 0.68170517, 0.85030437, 0.34167736],
        [0.18960739, 0.25711086, 0.71884116, 0.38754042]])

and knowing that this data follows normal distribution. How do I calculate the AIC number ? The formula is

2K - 2log(L)

K is the total parameters, for normal distribution the parameter is 3(mean,variance and residual). i'm stuck on L, L is suppose to be the maximum likelihood function, I'm not sure what to pass in there for data that follows normal distribution, how about for Cauchy or exponential. Thank you.

Update: this question appeared in one of my coding interview.

StupidWolf · Accepted Answer

For a given normal distribution, the probability of y given

import scipy.stats

def prob( y = 0, mean = 0, sd = 1 ):
    return scipy.stats.norm( mean, sd ).pdf( y )

For example, given mean = 0 and sd = 1, the probability of value 0, is prob( 0, 0, 1 )

If we have a set of values 0 - 9, the log likelihood is the sum of the log of these probabilities, in this case the best parameters are the mean of x and StDev of x, as in :

import numpy as np
x = range( 9 )
logLik = sum( np.log( prob( x, np.mean( x ), np.std( x ) ) ) )

Then AIC is simply:

K = 2
2*K - 2*( logLik )

For the data you provide, I am not so sure what the three columns and row reflect. So do you have to calculate three means and three StDev-s? It's not very clear.

Hopefully this above can get you started

Calculating AIC number manually Given a distribution of data and some distribution string

Answers (2)

Related Questions