Instrumental variables estimation

In statistics, econometrics, epidemiology and related disciplines, the method of instrumental variables (IV) is used to estimate causal relationships when controlled experiments are not feasible or when a treatment is not successfully delivered to every unit in a randomized experiment.[1] Intuitively, IVs are used when an explanatory variable of interest is correlated with the error term (endogenous), in which case ordinary least squares and ANOVA give biased results. A valid instrument induces changes in the explanatory variable (is correlated with the endogenous variable) but has no independent effect on the dependent variable and is not correlated with the error term, allowing a researcher to uncover the causal effect of the explanatory variable on the dependent variable.

Instrumental variable methods allow for consistent estimation when the explanatory variables (covariates) are correlated with the error terms in a regression model. Such correlation may occur when:

  1. changes in the dependent variable change the value of at least one of the covariates ("reverse" causation),
  2. there are omitted variables that affect both the dependent and explanatory variables, or
  3. the covariates are subject to measurement error.

Explanatory variables that suffer from one or more of these issues in the context of a regression are sometimes referred to as endogenous. In this situation, ordinary least squares produces biased and inconsistent estimates.[2] However, if an instrument is available, consistent estimates may still be obtained. An instrument is a variable that does not itself belong in the explanatory equation but is correlated with the endogenous explanatory variables, conditionally on the value of other covariates.

In linear models, there are two main requirements for using IVs:

  • The instrument must be correlated with the endogenous explanatory variables, conditionally on the other covariates. If this correlation is strong, then the instrument is said to have a strong first stage. A weak correlation may provide misleading inferences about parameter estimates and standard errors.[3][4]
  • The instrument cannot be correlated with the error term in the explanatory equation, conditionally on the other covariates. In other words, the instrument cannot suffer from the same problem as the original predicting variable. If this condition is met, then the instrument is said to satisfy the exclusion restriction.
  1. ^ Imbens, G.; Angrist, J. (1994). "Identification and estimation of local average treatment effects". Econometrica. 62 (2): 467–476. doi:10.2307/2951620. JSTOR 2951620. S2CID 153123153.
  2. ^ Bullock, J. G.; Green, D. P.; Ha, S. E. (2010). "Yes, But What's the Mechanism? (Don't Expect an Easy Answer)". Journal of Personality and Social Psychology. 98 (4): 550–558. CiteSeerX 10.1.1.169.5465. doi:10.1037/a0018933. PMID 20307128. S2CID 7913867.
  3. ^ https://www.stata.com/meeting/5nasug/wiv.pdf[full citation needed]
  4. ^ Nichols, Austin (2006-07-23). "Weak Instruments: An Overview and New Techniques". {{cite journal}}: Cite journal requires |journal= (help)