Segmenting data frame by row based on specific pattern

Question

Sorry for not precisely titling the problem. Let me elaborate as follows:

I have data frame like this:

I want to segment it into three parts:

First part:

second part:

third part:

12     v
13     v
14     x

That is, each part will contain at least two "state" ("v" and "x") regardless of the number and part with pattern "v,v,x,x,v" (x followed with v) should not occur.

Sven Hohenstein · Accepted Answer

You can create a unique identifier with cumsum and use split to split the data frame.

split(dat, c(0, cumsum(with(dat, state[-1] == "v" & head(state, -1) == "x"))))

where dat is the name of your data frame.

The result is a list including three data frames.

$`0`
  state
1     v
2     v
3     v
4     v
5     x
6     x
7     x

$`1`
   state
8      v
9      v
10     x
11     x

$`2`
   state
12     v
13     v
14     x

Segmenting data frame by row based on specific pattern

Answers (2)

Related Questions