Create new number variable using pandas?

Question

I have df given below.

df

I want to create a new variable named FLAG1 using VAR and FLAG. The final output given below.

Final Output: -

VAR FLAG    FLAG1
A      1        1
A      1        1
A      1        1
B      1        1
B      1        1
B      0        2
B      0        3
B      1        4
B      1        4
B      1        4
B      0        5
C      0        1
C      0        2

Scott Boston · Accepted Answer

You can using this bit of logic:

df['FLAG1'] =  (df.groupby('VAR')['FLAG'] 
                  .transform(lambda x: ((x != x.shift()) | (x == 0)).cumsum()))

Output:

   VAR  FLAG  FLAG1
0    A     1      1
1    A     1      1
2    A     1      1
3    B     1      1
4    B     1      1
5    B     0      2
6    B     0      3
7    B     1      4
8    B     1      4
9    B     1      4
10   B     0      5
11   C     0      1
12   C     0      2

Create new number variable using pandas?

Answers (2)

Related Questions