Find indices in one DataFrame column of each match in a second column

Question

I have a DataFrame that looks this way:

I want to find, for each row, the index of the match between the current row's previous value in the current column, such that I get a new series called idx_previous as follows:

So far I have tried using the Pandas.Series.where() function to see the location. If I do:

import pandas as pd
df = pd.DataFrame({'current':['a','aa','ab','aaa','aab','aba','abb'],
    'previous':['','a','a','aa','aa','ab','ab']})

df['idx_previous'] = ''
for previous in df.previous[1:]:
    df.loc[df.previous==previous, 'idx_previous'] = df.loc[df.current == 
previous].index[0]

I can get what I want, but this seems like an un-elegant workaround. Is there some method that would be better suited for this task? Thanks.

Note: previous is, by definition, the string in current to element N-1. And current is made up of all unique values.

Find indices in one DataFrame column of each match in a second column

Answers (1)

Related Questions