Split dataset when time intervals exceed specific value and assign a new trip ID to new groups

Question

I have a dataset of GPS locations with corresponding Trip ID, date and time, and time intervals in minutes between successive points within each trip:

> example
 TripID            DATIM INTV
1   522 22/05/2010 11:05  120
2   522 22/05/2010 13:05  120
3   522 22/05/2010 15:05  120
4   522 22/05/2010 17:05  120
5   522 22/05/2010 19:05  120
6   522 22/05/2010 21:05  120
7    10 28/05/2010 11:05  120
8    10 28/05/2010 13:05  120
9    10 29/05/2010 09:05 1200
10   10 29/05/2010 11:05  120
11   10 29/05/2010 13:05  120
12   10 29/05/2010 15:05  120
13   10 29/05/2010 17:05  120
14  657 04/06/2010 11:05  120
15  657 04/06/2010 13:05  120
16  657 04/06/2010 15:05  120

I want to split the data within trips when time intervals exceed 240 min, and assign a new TripID to the new group. In my example, I want to assign a new trip ID to the rows 9 to 13, as the time interval between row 8 and 9 exceeds 240 min, to obtain the following dataset:

 TripID            DATIM INTV
1   522 22/05/2010 11:05  120
2   522 22/05/2010 13:05  120
3   522 22/05/2010 15:05  120
4   522 22/05/2010 17:05  120
5   522 22/05/2010 19:05  120
6   522 22/05/2010 21:05  120
7    10 28/05/2010 11:05  120
8    10 28/05/2010 13:05  120
9   333 29/05/2010 09:05 1200
10  333 29/05/2010 11:05  120
11  333 29/05/2010 13:05  120
12  333 29/05/2010 15:05  120
13  333 29/05/2010 17:05  120
14  657 04/06/2010 11:05  120
15  657 04/06/2010 13:05  120
16  657 04/06/2010 15:05  120

Here is the bit of code I started to write:

TripIDs<-unique(example$TripID)

for (i in length(TripIDs)){
  Trip<-example[which(example$TripID == TripIDs[i]),] #split by trip
  breaks<-Trip$INTV[Trip$INTV>=1200] #define the breaks
  groups<-cut(Trip$INTV,breaks = breaks) #cut the trip at defined breaks
  ddply(Trip,"groups",**function()**) # assign a new name to each group of the trip
}

My problem is using the ddply function, which requires a function to assign a unique name to each new group of the trip. I am not sure the ddply function is appropriate here, and wanted to ask if anybody had an idea on how to split the data within my trip when time intervals exceed 240 min and assign a unique Trip ID to each new created group.

Many thanks

Split dataset when time intervals exceed specific value and assign a new trip ID to new groups

Answers (1)

Related Questions