find the index that gives the second largest difference between two lists in Python

Question

I would like to identify the index between two equal length lists that gives the second maximum absolute value of the difference between each row.

import random
import pandas as pd
random.seed(2)
l1 = pd.DataFrame([random.randrange(100) for _ in range(10)])
l2 = pd.DataFrame([random.randrange(100) for _ in range(10)])

l1-l2

0
0   -20
1   -66
2   6
3   -28
4   -66
5   74
6   30
7   -42
8   -18
9   -15

Now, I can use idxmax() to get the index giving me the largest absolute value difference, which is row 5. My question is how can I get the index giving the second largest difference value?

(l1 - l2).abs().idxmax()
0    5
dtype: int64

Quang Hoang · Accepted Answer

Option 1: The easy way: sort, then slice (complexity O(n log n))

(l1 - l2).abs().sort_values([0], ascending=False).index[1]

Option 2: nlargest, then idxmin (complexity O(n) ):

(l1 - l2).abs().nlargest(2, columns=[0]).idxmin()

Note your data actually have two rows with value 66 so you might get random answer between 1 and 4.

find the index that gives the second largest difference between two lists in Python

Answers (2)

Related Questions