Allotting unique identifier to a group of groups in pandas dataframe

Question

Given a frame like this

import pandas as pd
df = pd.DataFrame({'A':[1,2,3,4,6,3,7,3,2,11,13,10,1,5],'B':[1,1,1,2,2,2,2,3,3,3,3,3,4,4], 
                   'C':[1,1,1,1,1,1,1,2,2,2,2,2,3,3]})

I want to allot a unique identifier to multiple groups in column B. For example, going from top for every two groups allot a unique identifier as shown in red boxes in below image. The end result would look like below:

Currently I am doing like below but it seems to be over kill. It's taking too much time to update even 70,000 rows:

b_unique_cnt = df['B'].nunique()
the_list = list(range(1, b_unique_cnt+1))
slice_size = 2
list_of_slices = zip(*(iter(the_list),) * slice_size)
counter = 1
df['D'] = -1
for i in list_of_slices:
    df.loc[df['B'].isin(i), 'D'] = counter
    counter = counter + 1

df.head(15)

Allotting unique identifier to a group of groups in pandas dataframe

Answers (1)

Related Questions