Truncated regression model

Truncated regression models are a class of models in which the sample has been truncated for certain ranges of the dependent variable. That is, observations whose values of the dependent variable fall below or above certain thresholds are systematically excluded from the sample. Whole observations are therefore missing, so that neither the dependent nor the independent variables are known for them. This is in contrast to censored regression models, where only the value of the dependent variable is clustered at a lower threshold, an upper threshold, or both, while the values of the independent variables remain available.^[1]
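The distinction between truncation and censoring can be illustrated on simulated data. The following is a minimal sketch, not taken from the cited sources: the function names, the linear data-generating process, and the threshold are all assumptions chosen for illustration.

```python
import random


def draw_population(n, seed=0):
    """Simulate (x, y) pairs from y = 1 + 0.5*x + noise (an assumed toy model)."""
    rng = random.Random(seed)
    population = []
    for _ in range(n):
        x = rng.uniform(0.0, 10.0)
        y = 1.0 + 0.5 * x + rng.gauss(0.0, 1.0)
        population.append((x, y))
    return population


def truncate_below(data, threshold):
    """Truncation: records with y at or below the threshold vanish entirely,
    so neither x nor y is available for them."""
    return [(x, y) for x, y in data if y > threshold]


def censor_below(data, threshold):
    """Censoring: every record is kept with its x, but y is recorded
    as the threshold whenever it falls below it."""
    return [(x, max(y, threshold)) for x, y in data]


if __name__ == "__main__":
    population = draw_population(1000)
    truncated = truncate_below(population, threshold=3.0)
    censored = censor_below(population, threshold=3.0)
    print("population size:", len(population))
    print("truncated sample size:", len(truncated))
    print("censored sample size:", len(censored))
```

In the truncated sample the analyst cannot even tell how many observations were lost, whereas the censored sample retains the full set of independent-variable values.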

Sample truncation is a pervasive issue in the quantitative social sciences when using observational data, and consequently the development of suitable estimation techniques has long been of interest in econometrics and related disciplines.^[2] In the 1970s, James Heckman noted the similarity between truncated and otherwise non-randomly selected samples, and developed the Heckman correction.^[3]^[4]

Estimation of truncated regression models is usually done by the parametric maximum likelihood method. More recently, various semi-parametric and non-parametric generalisations have been proposed in the literature, e.g., based on the local least squares approach^[5] or the local maximum likelihood approach,^[6] both of which are kernel-based methods.
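As an illustration of the parametric approach, the sketch below fits a linear model with normal errors to a sample truncated from below at a known threshold. Each observation contributes the normal density divided by the probability of being observed, Φ((x'β − c)/σ). The single-regressor model, the function names, the simulated data, and the simple compass-search optimiser are all assumptions made for this example; it is not the estimator of any particular cited paper, and in practice a dedicated statistical package would normally be used.

```python
import math
import random


def truncated_loglik(params, xs, ys, lower):
    """Log-likelihood of y = b0 + b1*x + e, e ~ N(0, sigma^2), with y seen only if y > lower.

    params = (b0, b1, log_sigma); the log keeps sigma positive during the search.
    """
    b0, b1, log_sigma = params
    sigma = math.exp(log_sigma)
    total = 0.0
    for x, y in zip(xs, ys):
        mean = b0 + b1 * x
        # P(y > lower | x) = Phi((mean - lower) / sigma), written with erfc.
        prob_observed = 0.5 * math.erfc((lower - mean) / (sigma * math.sqrt(2.0)))
        if prob_observed <= 0.0:
            return -math.inf  # parameters under which this observation is impossible
        z = (y - mean) / sigma
        log_density = -0.5 * math.log(2.0 * math.pi) - log_sigma - 0.5 * z * z
        total += log_density - math.log(prob_observed)
    return total


def ols_start(xs, ys):
    """Ordinary least squares on the truncated sample, used only as a starting point."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b1 = sxy / sxx
    b0 = mean_y - b1 * mean_x
    resid_var = sum((y - b0 - b1 * x) ** 2 for x, y in zip(xs, ys)) / n
    return [b0, b1, 0.5 * math.log(resid_var)]


def compass_search(f, start, step=0.5, tol=1e-6, max_rounds=10000):
    """Derivative-free maximiser: try +/- step on each coordinate, halve step on failure."""
    point, best = list(start), f(start)
    for _ in range(max_rounds):
        if step <= tol:
            break
        improved = False
        for i in range(len(point)):
            for delta in (step, -step):
                trial = point.copy()
                trial[i] += delta
                value = f(trial)
                if value > best:
                    point, best, improved = trial, value, True
        if not improved:
            step /= 2.0
    return point, best


def fit_truncated(xs, ys, lower):
    """Maximise the truncated-normal log-likelihood, starting from the OLS estimates."""
    return compass_search(lambda p: truncated_loglik(p, xs, ys, lower), ols_start(xs, ys))


def simulate_truncated(n, b0, b1, sigma, lower, seed=0):
    """Draw from the assumed linear model and keep only observations with y > lower."""
    rng = random.Random(seed)
    xs, ys = [], []
    while len(xs) < n:
        x = rng.uniform(0.0, 2.0)
        y = b0 + b1 * x + rng.gauss(0.0, sigma)
        if y > lower:
            xs.append(x)
            ys.append(y)
    return xs, ys


if __name__ == "__main__":
    xs, ys = simulate_truncated(2000, b0=1.0, b1=2.0, sigma=1.0, lower=2.0)
    params, loglik = fit_truncated(xs, ys, lower=2.0)
    print("OLS start (b0, b1, log sigma):", ols_start(xs, ys))
    print("ML estimate (b0, b1, sigma):", params[0], params[1], math.exp(params[2]))
```

Least squares applied directly to the truncated sample ignores the normalising probability and is therefore biased in general, which is why it serves here only as a starting value for the likelihood search.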

[1] Breen, Richard (1996). Regression Models: Censored, Sample Selected, or Truncated Data. Thousand Oaks: Sage. pp. 2–4. ISBN 0-8039-5710-6.

[2] Amemiya, T. (1973). "Regression Analysis When the Dependent Variable is Truncated Normal". Econometrica. 41 (6): 997–1016. doi:10.2307/1914031. JSTOR 1914031.

[3] Heckman, James J. (1976). "The Common Structure of Statistical Models of Truncation, Sample Selection, and Limited Dependent Variables and a Simple Estimator for Such Models". Annals of Economic and Social Measurement. 5 (4): 475–492.

[4] Heckman, James J. (1979). "Sample Selection Bias as a Specification Error". Econometrica. 47 (1): 153–161. doi:10.2307/1912352. JSTOR 1912352.

[5] Lewbel, A.; Linton, O. (2002). "Nonparametric Censored and Truncated Regression" (PDF). Econometrica. 70 (2): 765–779. doi:10.1111/1468-0262.00304. S2CID 120113700.

[6] Park, B. U.; Simar, L.; Zelenyuk, V. (2008). "Local Likelihood Estimation of Truncated Regression and its Partial Derivatives: Theory and Application" (PDF). Journal of Econometrics. 146 (1): 185–198. doi:10.1016/j.jeconom.2008.08.007. S2CID 55496460.
