Scipy erfcinv unexpectedly blows up near 1e-16

Question

I've been using scipy.special.erfcinv to calculate Z scores from pvalues. However, when the pvalues become very small, erfcinv gets unexpectedly large. Any ideas?

Example:

In [1]: import numpy as np
In [2]: from scipy.special import erfcinv
In [3]: erfcinv(2e-16) * np.sqrt(2)
Out[3]: 8.2095361516013874
In [4]: erfcinv(1e-16) * np.sqrt(2)
Out[4]: 1.7976931348623155e+308

I'm running python 2.6.6 with scipy 0.10.1.

zero323 · Accepted Answer

Short answer: the closer you get to the limits of floating point arithmetic precision the weirder things happens: (http://docs.oracle.com/cd/E19957-01/806-3568/ncg_goldberg.html, http://www.seas.ucla.edu/~vandenbe/103/lectures/flpt.pdf)

A little bit longer. First lets look at erfcinv function:

def erfcinv(y):
    return ndtri((2-y)/2.0)/sqrt(2)

If we take y = 2e-16:

In [96]: (2 - 2e-16) / 2
Out[96]: 0.9999999999999999

When we take y = 1e-16:

In [97]: (2 - 1e-16) / 2
Out[97]: 1.0

Now we look at ndtri:

x=ndtri(y) returns the argument x for which the area udnder the
Gaussian probability density function (integrated from minus infinity
to x) is equal to y.

Now everything should be clear, am I right? As you can suspect:

In [99]: ndtri(1)
Out[99]: inf

Your results can be a little bit different - in my case:

In [101]: erfcinv(1e-16) * np.sqrt(2)
Out[101]: inf

Scipy erfcinv unexpectedly blows up near 1e-16

Answers (2)

Related Questions