Creating python data frame from list of dictionary

Question

I have the following data:

sentences = [{'mary':'N', 'jane':'N', 'can':'M', 'see':'V','will':'N'},
     {'spot':'N','will':'M','see':'V','mary':'N'},
     {'will':'M','jane':'N','spot':'V','mary':'N'},
     {'mary':'N','will':'M','pat':'V','spot':'N'}]

I want to create a data frame where each key (from the pairs above) will be the column name and each value (from above) will be the index of the row. The values in the data frame will be counting of each matching point between the key and the value.

The expected result should be:

df = pd.DataFrame([(4,0,0),
                   (2,0,0),
                   (0,1,0),
                   (0,0,2),
                   (1,3,0),
                   (2,0,1),
                   (0,0,1)],
                  index=['mary', 'jane', 'can', 'see', 'will', 'spot', 'pat'],
                  columns=('N','M','V'))

Sreeram TP · Accepted Answer

pd.DataFrame(sentences).T.stack().groupby(level=0).value_counts().unstack().fillna(0)

       M    N   V
can   1.0   0.0 0.0
jane  0.0   2.0 0.0
mary  0.0   3.0 1.0
pat   0.0   0.0 1.0
see   0.0   0.0 2.0
spot  0.0   2.0 1.0
will  3.0   1.0 0.0

Cast as int if needed to.

pd.DataFrame(sentences).T.stack().groupby(level=0).value_counts().unstack().fillna(0).cast("int")

Creating python data frame from list of dictionary

Answers (2)

Related Questions