grouping rows in a list of lists in pandas

Question

I have a dataframe that looks like this:

ID Description
1  A
1  B
1  C
2  A
2  C
3  A

I would like to group by the ID column and get the description as a list of list like this:

ID Description
1  [["A"],["B"],["C"]]
2  [["A"],["C"]]
3  [["A"]]

The df.groupby('ID')['Description'].apply(list) but this create only the "first level" of lists.

jpp · Accepted Answer

This is slightly different to @jezrael in that the listifying of strings is done via map. In addition call reset_index() adds "Description" explicitly to output.

import pandas as pd

df = pd.DataFrame([[1, 'A'], [1, 'B'], [1, 'C'], [2, 'A'], [2, 'C'], [3, 'A']], columns=['ID', 'Description'])

df.groupby('ID')['Description'].apply(list).apply(lambda x: list(map(list, x))).reset_index()

# ID Description
# 1 [[A], [B], [C]] 
# 2 [[A], [C]] 
# 3 [[A]]

grouping rows in a list of lists in pandas

Answers (2)

Related Questions