Bayesian network algorithm for dummies?

Question

Is there a simple and easy explanation for the algorithm for Bayesian networks without all the bombastic terms? I am not allowed to use libraries, so please do not just give me a library and tell me how easy it is to use it.

So I understand we have conditional probability tables (CPT), and we are supposed to multiply them together and perform variable elimination, but how do I implement this in code? So I tried representing each CPT as a list of dictionaries:

{
            {
                "Cavity": "True",
                "own_value": "True",
                "probability": 0.9
            },
            {
                "Cavity": "True",
                "own_value": "False",
                "probability": 0.1
            },
            {
                "Cavity": "False",
                "own_value": "True",
                "probability": 0.4
            },
            {
                "Cavity": "False",
                "own_value": "False",
                "probability": 0.6
            }
}

And then inserting these CPTs into another dict.

However, I am unable to go forward as to how to multiply these CPTs and where to keep them? E.g. P(A|B, C, D) * P(D|E, F) = P(A, D|B, C, E, F)

The product of 2 CPTs can result in a factor that has multiple heads, so a dict no longer works...

Basically the following code snippet is the algorithm that I can come up with so far, where I store a queue of nodes and pass them into this function until all known probabilities of all nodes are found. And of course, the queue is already in topological order such that the parents is always guaranteed to be a known probability.

But as you have realized right now, this algorithm is only able to answer P(A = true) or P(C = false), etc

        cpt = self.conditional_probabilities[nodeName] #get my cpt
        for parent in self.dependencies[nodeName]:
            parentProb = self.knownProbabilities[parent]

            #matching of the parent and multiplying the probability
            for probability in parentProb:
                varName = probability
                varProb = parentProb[probability]
                for item in cpt:
                    if parent in item and (item[parent] == varName):
                        item["probability"] = item["probability"] * varProb

            #eliminate the parent node
            #some code here

        #insert known probability into my known probabilities table 
        #e.g. insert P(A) into a dict() of P(B), P(C), ... where A, B, C are keys
        nodeProbability = dict()
        for item in cpt:
            state = item["own_value"]
            nodeProbability[state] = item["probability"]

        self.knownProbabilities[nodeName] = nodeProbability

And this is unable to perform Bayesian network queries like P(A = true, B = false| C = false, D = "donkey", E = "cat").

Where D, E has a domain of ["donkey", "cat", "dog", "mouse"] and the rest are Boolean.

TL;DR Can anyone explain to me using simple data structures (list, pq, dict, etc), how to implement a Bayesian Network?

Bayesian network algorithm for dummies?

Answers (1)

Related Questions