Download the notebook here! Interactive online version: binder

Causal graphs

Introduction

Graph notation less general than potential outcome framework, but

  • thinking about causal systems

  • uncover identification strategies

    It is useful to separate the inferential problem into statistical and identification components. Studies of identification seek to characterize the conclusions that could be drawn if one could use the sampling process to obtain an unlimited number of observations. (Manski, 1995)

The two most crucial ingredients for an identification analysis are:

  • The set of assumptions about causal relationships that the analysis is willing to assert based on theory and past research, including assumptions about relationships between variables that have not been observed but that are related both to the cause and outcome of interest.

  • The pattern of information one can assume would be contained in the joint distribution of the variables (associations) in the observed dataset if all members of the population had been included in the sample that generated the dataset.

\(\rightarrow\) causal graphs offer an effective and efficient representation for both

Basic elements of causal graphs

  • nodes

  • edges

  • directed paths

    • parent and child

    • descendant

5027bf4ccff140e3a7d195e07166213a

Two representations of the joint dependence of \(A\) and \(B\) on an unobserved common cause.

d13c544dbe844701a3e81533a0e53d9b

Let’s look at some basic patterns that will turn out to appear frequently.

  • chain of mediation

  • fork of mutual causation

  • inverted fork of mutual dependence

d638b10688f64ef89eefec7586f5ed24

What about the unconditional and conditional association of \(A\) and \(B\) in each of these cases?

  • While there is unconditional dependence between them in the first two cases, there is not in the third.

The collider variable \(C\) in the third setting does not generate an unconditional association between \(A\) and \(B\). However, as we will revisit in more detail later, it can create a conditional association that needs to be handled with care.

Conditioning and confounding

14e29d30d7bd4fd1a6da17c4a08f39f7

The causal effects \(C \rightarrow D\) and \(C \rightarrow Y\) render the total association between \(D\) and \(Y\) unequal to the causal effect \(D \rightarrow Y\).

  • \(C\) is a confounding variable that affects both the dependent and independent variable.

  • Conditioning is a modelig strategy that allows to determine causal effects in the presence of observed confounders.

\(\rightarrow\) What happens if \(C\) is unobserved?

How about an example from educational choice where we have observed and unobserved confounders?

31b9fb7b16204ae7a9dadb611bc76e83

What identification strategies come to mind?

Resources