How to get all 'first' instances of grouped and recurring values?

Question

I have a large dataframe (1.5 million rows, 13 columns) and I want to retrieve the index of every first occurrence of grouped events.

The events repeat in groups of varying lengths, as in my example data.

How can I get a list with all the first 'a' events, and all the first 'b' events?

Example data:

import pandas as pd

data = {'event': ['a','a','a','a','a','b','b','b','b','a','a','a','b','b','b','b','b','a','a','a','b','b','b','b']}
df = pd.DataFrame(data, columns=['event'])

Valdi_Bo · Accepted Answer

As I understand it, you want the first row of each sequence of consecutive rows with the same value in the event column.

The code to get this result is:

df[df.event != df.event.shift()]

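(This compares each value with the previous one, marks the rows where they differ, and uses that boolean mask for indexing. The first row is always kept, because shift() leaves a missing value there and the comparison with it is True.)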

For your data sample, the result is the rows at index 0, 5, 9, 12, 17 and 20, with event values a, b, a, b, a, b.
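Since the question asks for separate lists of the first 'a' events and the first 'b' events, the filtered rows can be split by event value afterwards. The following is a minimal sketch of one way to do that, not part of the original answer; the names starts, first_rows, first_a and first_b are made up for illustration, and the sample data is the same 24 values written as a string.

```python
import pandas as pd

# Same sample data as in the question, written compactly
data = {'event': list('aaaaabbbbaaabbbbbaaabbbb')}
df = pd.DataFrame(data, columns=['event'])

# True where a row differs from the previous row, i.e. where a new run starts
starts = df['event'] != df['event'].shift()

# Keep only the first row of each run
first_rows = df[starts]

# Split the run starts by event value and collect their index labels
first_a = first_rows.index[first_rows['event'] == 'a'].tolist()
first_b = first_rows.index[first_rows['event'] == 'b'].tolist()

print(first_a)  # [0, 9, 17]
print(first_b)  # [5, 12, 20]
```

The comparison and the filtering are both vectorised, so this approach should scale to a frame with 1.5 million rows without an explicit Python loop.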
