Calculating conditional probability from list

Question

I am trying to calculate conditional probabilities from a given list. Say I have the following list of lists:

[[ 1, 0, 0, 0, 1, 5],
 [ 0, 1, 0, 1, 0, 3],
 [ 1, 0, 0, 0, 1, 5],
 [ 0, 0, 1, 1, 0, 2],
 [ 0, 0, 1, 0, 1, 1]]

Each 'column' represents a binary attribute; the last 'column' is the class attribute. To find the conditional probability of an attribute, I need to calculate P(X|Y).

Using a plain Python list, how can I

count the frequency of the attribute given it is a Y class?
count the total frequency for the class?

The above is easily doable in pandas, but I am actually clueless on how to tackle it with a Python list.

Mustafa Aydın · Accepted Answer

frequency of the attribute given it is a Y class?

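# list_of_lists is the list from the question
class_ = 3        # class label to condition on (Y)
attr_index = 1    # column index of the attribute (X)
# Sum the attribute values over the rows whose label equals class_
attr_freq_given_cls = sum(a_list[attr_index]
                          for a_list in list_of_lists
                          if a_list[-1] == class_)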

Since attributes are from {0, 1}, summing yields the number of occurrences; indexing with -1 gives the label.
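
For a quick check, here is the same count run against the list from the question. This is only a sketch: the function name `attr_count_given_class` is made up for illustration, and it assumes every attribute is 0 or 1 and the label is the last element of each row.

```python
# Data copied from the question; the last element of each row is the class label.
list_of_lists = [
    [1, 0, 0, 0, 1, 5],
    [0, 1, 0, 1, 0, 3],
    [1, 0, 0, 0, 1, 5],
    [0, 0, 1, 1, 0, 2],
    [0, 0, 1, 0, 1, 1],
]


def attr_count_given_class(rows, attr_index, class_):
    """Count rows of the given class whose attribute at attr_index is 1.

    Hypothetical helper: relies on attributes being binary (0 or 1).
    """
    return sum(row[attr_index] for row in rows if row[-1] == class_)


count_a = attr_count_given_class(list_of_lists, 1, 3)  # attribute 1 within class 3
count_b = attr_count_given_class(list_of_lists, 0, 5)  # attribute 0 within class 5
print(count_a, count_b)
```

Only one row has class 3 and its attribute 1 is set, while both class-5 rows have attribute 0 set, so the two counts differ.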

count the total frequency for the class?

from collections import Counter
# Maps each class label to the number of rows that carry it
class_freqs = Counter(a_list[-1] for a_list in list_of_lists)
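
Dividing the first count by the second gives an estimate of P(X=1|Y). A minimal sketch, assuming binary attributes and the label in the last position; `conditional_probability` is a hypothetical helper name, and no smoothing is applied, so an attribute never seen within a class gets probability 0.

```python
from collections import Counter

# Data copied from the question; the last element of each row is the class label.
list_of_lists = [
    [1, 0, 0, 0, 1, 5],
    [0, 1, 0, 1, 0, 3],
    [1, 0, 0, 0, 1, 5],
    [0, 0, 1, 1, 0, 2],
    [0, 0, 1, 0, 1, 1],
]


def conditional_probability(rows, attr_index, class_):
    """Estimate P(attribute == 1 | class) as a plain ratio of counts."""
    class_freqs = Counter(row[-1] for row in rows)
    n_class = class_freqs[class_]  # Counter returns 0 for unseen labels
    if n_class == 0:
        raise ValueError(f"class {class_!r} does not occur in the data")
    # Attributes are 0/1, so the sum is the number of rows with the attribute set.
    n_attr = sum(row[attr_index] for row in rows if row[-1] == class_)
    return n_attr / n_class


p_set = conditional_probability(list_of_lists, 0, 5)    # 2 of 2 class-5 rows
p_unset = conditional_probability(list_of_lists, 1, 5)  # 0 of 2 class-5 rows
print(p_set, p_unset)
```

With such a small list every estimate is either 0 or 1; on real data the ratios would fall in between.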

good luck with that naive Bayes :)
