Pandas boolean dataframe creation from sets

Question

I want to create a boolean Dataframe from sets,

So there are 4 sets, each containing a collection of names

a = { a collection of names }
b = { another collection of names}
c = { ... } 
d = { ... }

And the result should be a Dataframe that looks like this:

 Name   |   a   |   b   |  c    |   d 
 --------------------------------------
'John'  | True  | True  | False | True
'Mike'  | False | True  | False | False
   .
   .
   .

I want a way to do this in Python using Pandas and in an efficient manner.

One way to do is to pick each name and see if it's in each set and then add that name to the Dataframe. But there should be faster ways like merging the sets and applying some function.

Andrew L · Accepted Answer

I've put together some random sample data that should scale:

a = ['foo', 'bob']
b = ['foo', 'john', 'jeff']

df
   name
0  jeff
1  john
2   bob

df['a'] = df.name.isin(a)
df['b'] = df.name.isin(b)

df
   name      a      b
0  jeff  False   True
1  john  False   True
2   bob   True  False

Pandas boolean dataframe creation from sets

Answers (2)

Related Questions