I just covered instrumental variables in my course on causal inference, and so I have two-stage least squares (2SLS) estimation on the brain. In this post I’ll share something I realized in the course of prepping for class: that standard errors from 2SLS estimation are equivalent to delta method standard errors based on the Wald IV estimator. (I’m no econometrician, so this had never occurred to me before. Perhaps it will be interesting to other non-econometrician readers. And perhaps the econometricians can point me to the relevant page in Wooldridge or Angrist and Pischke or whomever that explains this better than I have.)
Let’s consider a system with an outcome
With a single-instrument, the 2SLS estimator of
where
The delta-method approximation for
Substituting the estimators in place of parameters, and using heteroskedasticity-consistent (HC0, to be precise) estimators for
Connecting delta-method and 2SLS
To demonstrate this claim, let’s first partial out the covariates, taking
The HC0 variance and covariance estimators for these coefficients have the usual sandwich form:
where
Remember this formula because we’ll return to it shortly.
Now consider the 2SLS estimator. To calculate this, we begin by taking the fitted values from the regression of
We then regress
The HC0 variance estimator corresponding to the 2SLS estimator is
where
But
The 2SLS variance estimator is therefore
which agrees with
So what?
If you’ve continued reading this far…I’m slightly amazed…but if you have, you may be wondering why it’s worth knowing about this relationship. The equivalence between the 2SLS variance estimator and the delta method interests me for a couple of reasons.
- First is that I had always taken the 2SLS variance estimator as being conditional on
–that is, not accounting for random variation in the treatment assignment. The delta-method form of the variance makes it crystal clear that this isn’t the case—the variance does include terms for and . - On the other hand, there’s perhaps a sense that equivalence with the 2SLS variance estimator (the more familiar form) validates the delta method variance estimator—that is, we wouldn’t be doing something fundamentally different by using the delta method variance with a Wald estimator. For instance, we might want to estimate
and/or by some other means (e.g., by estimating as a marginal effect from a logistic regression or estimating with a multi-level model). It would make good sense in this instance to use the Wald estimator and to estimate its variance using the delta method form. - One last reason I’m interested in this is that writing out the variance estimators will likely help in understanding how to approach small-sample corrections to
. But I’ll save that for another day.