Find the occurence of events

Question

I have an array with five different events, each event occurs for different intervals more than one time.

Ex:.

array(['walking', 'walking', 'walking', 'walking', 'Running', 'Running',
       'Running', 'Running', 'walking', 'walking', 'walking', 'walking',
       'walking', 'Standing', 'Standing', 'Standing', 'walking', 'walking',
       'walking'], dtype='



.... (3245 long)

I want to extract an array for each event that indicates the intervals for each event. 

The results should be as the following for the example above: 

Walking_occurence = [
(0,3),
(8,12),
(16,18)
]

Chris Adams · Accepted Answer

Here is a potential approach using pandas.Series with cumsum and groupby:

import pandas as pd

a = np.array(['walking', 'walking', 'walking', 'walking', 'Running',
              'Running', 'Running', 'Running', 'walking', 'walking',
              'walking', 'walking', 'walking', 'Standing', 'Standing',
              'Standing', 'walking', 'walking', 'walking'])

s = pd.Series(a)

s_out = ((s != s.shift()).cumsum().reset_index()
          .groupby([0, s])['index']
          .agg(['min', 'max'])
          .apply(tuple, axis=1))

# print(s_out)
# 1  walking       (0, 3)
# 2  Running       (4, 7)
# 3  walking      (8, 12)
# 4  Standing    (13, 15)
# 5  walking     (16, 18)

You could then do a further groupby opperation to get your desired results:

s_out = s_out.groupby(level=1, sort=False).apply(np.array)

[out]

walking     [(0, 3), (8, 12), (16, 18)]
Running                        [(4, 7)]
Standing                     [(13, 15)]
dtype: object

Find the occurence of events

Answers (2)

Related Questions