A Bayesian network is a probabilistic model represented by a direct acyclic graph G = {V, E}, where the vertices are random variables Xi, and the edges determine a conditional dependence among them. In the following diagram, there's an example of simple Bayesian networks with four variables:
The variable x4 is dependent on x3, which is dependent on x1 and x2. To describe the network, we need the marginal probabilities P(x1) and P(x2) and the conditional probabilities P(x3|x1,x2) and P(x4|x3). In fact, using the chain rule, we can derive the full joint probability as:
The previous expression shows an important concept: as the graph is direct and acyclic, each variable is conditionally independent of all other variables that are not successors given its predecessors. To formalize this concept, we can define the function...