Rank an array: Exclude NaN and assign lowest rank to highest number

Question

I have an array/pandas row:

array = [0.8, np.nan, 0.1, -0.5, 0.7]

I want this output:

array = [1, np.nan, 3, 4, 2]

These functions rank in the wrong direction for me (the smallest number gets rank 1):

scipy.stats.mstats.rankdata
scipy.stats.rankdata

user2285236 · Accepted Answer

Since you mentioned pandas, you can use the Series.rank method with ascending=False. NaN values are left as NaN by default:

import numpy as np
import pandas as pd

arr = [0.8, np.nan, 0.1, -0.5, 0.7]
pd.Series(arr).rank(ascending=False)
Out: 
0    1.0
1    NaN
2    3.0
3    4.0
4    2.0
dtype: float64
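
Since the question mentions a pandas row, the same idea should carry over to a whole DataFrame by ranking along axis=1. This is only a sketch: the second row and the column names are made up for illustration and are not from the question.

```python
import numpy as np
import pandas as pd

# Hypothetical frame: the first row is the array from the question,
# the second row is invented purely for illustration.
df = pd.DataFrame(
    [[0.8, np.nan, 0.1, -0.5, 0.7],
     [0.2, 0.9, np.nan, 0.4, 0.1]],
    columns=list("abcde"),
)

# Rank within each row; the largest value gets rank 1, NaN stays NaN.
row_ranks = df.rank(axis=1, ascending=False)

# Pull a single row back out as a plain ndarray if needed.
first_row = row_ranks.iloc[0].to_numpy()
print(row_ranks)
```

Ties get the average of their ranks by default; the method argument of rank changes that if you need something else.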

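This creates and returns a pandas Series. If you want to avoid creating a Series, as @ajcr noted in the comments, you can call the lower-level rank function directly, which returns an ndarray. It is a pandas internal rather than part of the public API, so it may not be available in every pandas version: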

pd.algos.rank_1d_float64(arr, ascending=False)
Out: array([  1.,  nan,   3.,   4.,   2.])
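
If you would rather stay with scipy.stats.rankdata from the question, one possible workaround is to negate the values and rank only the non-NaN entries. This is a rough sketch, not something from the original answer; rank_descending is a hypothetical helper name.

```python
import numpy as np
from scipy.stats import rankdata


def rank_descending(values):
    """Rank so the largest value gets rank 1; NaN entries stay NaN.

    Hypothetical helper for illustration only.
    """
    arr = np.asarray(values, dtype=float)
    ranks = np.full(arr.shape, np.nan)
    mask = ~np.isnan(arr)
    # rankdata ranks ascending, so negating the input flips the direction.
    ranks[mask] = rankdata(-arr[mask])
    return ranks


print(rank_descending([0.8, np.nan, 0.1, -0.5, 0.7]))
```

For the array in the question this should give the same ranks as the Series.rank call above, since rankdata also averages tied ranks by default.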
