Download the notebook here! Interactive online version: binder

Causal Graphs

Definitions, patterns, and strategies

Definitions

  • A node represents a random variable labeled by letter. Observed random variables are marked by solid circle • and unobserved - by hollow circle ◦.

  • An edge shows dependence between joining variables.

  • Adjacent variables are connected by an edge.

  • Adjacent edges meet at a variable.

  • A directed edge represents the cause by a single-headed arrow.

  • A parent/child is the starting(tail)/ending(head) variable. Therefore, a directed edge represents a direct effect of a parent on a child.

  • A root is a variable that has no parent. In other words, it is an exogenous variable determined only by forces outside of the graph.

  • A sink is a variable with no children.

  • A path is a sequence of adjacent edges.

  • A directed path is a path traced out entirely along arrows tail-to-head. If there is a directed path from \(A\) to \(B\), \(A\) is an ancestor of \(B\); \(B\) is a descendant of \(A\).

  • A directed acyclic graph (DAG) is a graph with only arrows for edges and no feedback loops (i.e. no variable is its own ancestor or its own descendant):

ac4a3465f37e40328095cabb3ddaa3bf

  • Joint dependence of two variables on one or more common causes is shown either with unobservable variable or with bidirected dashed curved edge:

f912fb9935a64b15bd970f85ecabf9e1

2b0168028521479baa5198d9475f0f9f

Patterns

  • Chain of mediation is a relationship when \(A\) affects \(B\) through \(A\)’s causal effect on \(C\) and \(C\)’s causal effect on \(B\):

cc84831cce7c4c49a41e4e176bcfd3f0

  • Mutual dependence is a relationship when \(A\) and \(B\) are both caused by \(C\) (a variable \(C\) that affects both the dependent and independent variable is called a confounding variable):

4cb7c6124dd24b36a73e0f9d4e8d32ad

  • Mutual causation is a relationship when \(A\) and \(B\) are both causes of \(C\) (a variable \(C\) that has two arrows running into it is called a collider):

3bee3f75f32d43f2933592916e4a6033

  • A back-door path is a path between any causally ordered sequence of two variables that include a directed edge that points to the first variable.

  • Conditioning as a modeling strategy means transforming one graph into a simpler set of component graphs where fewer causes are represented.

Strategies

A back-door criterion is a set of conditions used to determine whether or not conditioning on a given set of observed variable will identify the causal effect. The causal effect is identified by conditioning on a set of variables \(Z\) if and only if all back-door paths between the causal variable and the outcome variable are blocked after conditioning on \(Z\). All back-door paths are blocked by \(Z\) if and only if each back-door path: - contains a chain of mediation \(A → C → B\) where the middle variable \(C\) is in \(Z\), or - contains a fork of mutual dependence \(A ← C → B\), where the middle variable \(C\) is in \(Z\), or - contains an inverted fork of mutual causation \(A → C ← B\), where the middle variable \(C\) and all of \(C\)’s decendents are not in \(Z\).

A front-door criterion is an empirical strategy used to identify the causal relationship flowing from \(A\) to \(B\) if one can find a mechanism C which: - lies on the causal path between \(A\) and \(B\), and - it is the only such mechanism, and - it is not affected by the unobserved confounder \(U\):

96b27b9aa79d4551a858d2b99cdfb824

You can find more on front-door criterion application in the Bellemare & Bloem (2020) paper.

References