Censoring, Left and Right
Censoring, Left and Right
Censoring occurs when values of a variable within a certain range are unobserved, but it is known that the variable falls within this range. This differs from truncation, where values of a variable within a certain range are unobserved and it is unknown when the variable falls within this range. Both phenomena represent a loss of information, but the loss is less with censoring than with truncation. The two are sometimes confused in the literature; some examples are given in Léopold Simar and Paul Wilson (2007). George Maddala (1983) and Takeshi Amemiya (1984) list a number of empirical applications where censoring occurs.
Consider a sample of n draws Y _{i }, i = 1 …, n from a distribution function F(y ) = P (Y≤ y ). If the sample is left censored at c _{1}, then the values Y_{i} are not observed; instead, values are observed, where otherwise. For the cases where , all that is known about the underlying corresponding values Y_{i} is that they are less than or equal to c_{1}. Alternatively, if the sample is rightcensored at c _{2}, then values Y_{i}^{*} are observed, where if Y_{i} < c_{2}, and otherwise. In this scenario, for the cases where all that is kinown about the Y_{i} is that they are greater than or equal to c_{2}. Samples can also be both left and rightcensored.
In models of duration, rightcensoring often occurs, but leftcensoring can also occur. For example, if agents are observed in some state (e.g., unemployment, in the case of individuals, or solvency, in the case of firms) until either they are observed to exit the state or until the period of observation ends, then some agents may still be in the given state at the end of the observation window. Observations on these agents will be rightcensored. Similarly, at the beginning of the study, some (perhaps all) agents are observed to be already in the state of interest; for any agents whose time of entry into the state is unknown, their duration in the given state is leftcensored (and perhaps also rightcensored).
To illustrate censoring in a regression context, suppose
where E(ε_{i}) = 0. If Y_{i} is censored, then one must estimate the model
after replacing Y_{i} in (1) with , which necessarily results in a new error term in (2). Unless the censoring occurs in the extreme tails of the distribution of Y, ordinary least squares (OLS) estimation of the coefficients in (2) will yield biased and inconsistent estimates since OLS does not account for the censoring.
Censored regression models are typically estimated by the maximum likelihood method. If the errors in model (1) are assumed normally distributed with mean 0 and variance σ^{2}, then in the case of leftcensoring at c_{1} the likelihood function is given by
where ψ and Φ denote the standard normal density and distribution functions, respectively. This model was first proposed by James Tobin (1958), and is sometimes called the tobit model. The first product in (3) gives, for each observed value Y_{i}^{*} equal to c _{1}, the probability of obtaining a draw Y from F(y ) less than c_{1}.
The models presented above potentially suffer from several problems. Heteroskedasticity in the error terms can lead to inconsistent estimation. D. Petersen and Donald Waldman (1981) proposed modifications of the tobittype models involving specification of particular models for the error variances. John Cragg (1971) proposed a generalized version of the tobit model that allows the probability of censoring to be independent of the regression model for the uncensored data. Perhaps the most vexing problem is the requirement of a distributional assumption for the errors in (2). It is straightforward to assume distributions other than the normal distribution and then work out the resulting likelihood functions, but rather more difficult to avoid such assumptions altogether by using semi or nonparametric methods. Adrian Pagan and Aman Ullah (1999) discuss several proposals, but these involve significant increases in computational burden or data requirements.
SEE ALSO Censoring, Sample; Heckman Selection Correction Procedure; Heteroskedasticity; Logistic Regression; Probabilistic Regression; Properties of Estimators (Asymptotic and Exact)
BIBLIOGRAPHY
Amemiya, Takeshi. 1984. Tobit Models: A Survey. Journal of Econometrics 24: 3–61.
Cragg, John G. 1971. Some Statistical Models for Limited Dependent Variables with Application to the Demand for Durable Goods. Econometrica 39 (5): 829–844.
Maddala, George S. 1983. LimitedDependent and Qualitative Variables in Econometrics. Cambridge, U.K.: Cambridge University Press.
Pagan, Adrian, and Aman Ullah. 1999. Nonparametric Econometrics. Cambridge, U.K.: Cambridge University Press.
Petersen, D., and Donald Waldman. 1981. The Treatment of Heteroskedasticity in the Limited Dependent Variable Model. Unpublished working paper. Department of Economics. Chapel Hill: University of North Carolina.
Simar, Léopold, and Paul W. Wilson. 2007. Estimation and Inference in Twostage, Semiparametric Models of Productive Efficiency. Journal of Econometrics 136 (1): 31–64.
Tobin, James. 1958. Estimation of Relationships for Limited Dependent Variables. Econometrica 26 (1): 24–36.
Paul W. Wilson
Cite this article
Pick a style below, and copy the text for your bibliography.

MLA

Chicago

APA
"Censoring, Left and Right." International Encyclopedia of the Social Sciences. . Encyclopedia.com. 13 Nov. 2018 <https://www.encyclopedia.com>.
"Censoring, Left and Right." International Encyclopedia of the Social Sciences. . Encyclopedia.com. (November 13, 2018). https://www.encyclopedia.com/socialsciences/appliedandsocialsciencesmagazines/censoringleftandright
"Censoring, Left and Right." International Encyclopedia of the Social Sciences. . Retrieved November 13, 2018 from Encyclopedia.com: https://www.encyclopedia.com/socialsciences/appliedandsocialsciencesmagazines/censoringleftandright
Citation styles
Encyclopedia.com gives you the ability to cite reference entries and articles according to common styles from the Modern Language Association (MLA), The Chicago Manual of Style, and the American Psychological Association (APA).
Within the “Cite this article” tool, pick a style to see how all available information looks when formatted according to that style. Then, copy and paste the text into your bibliography or works cited list.
Because each style has its own formatting nuances that evolve over time and not all information is available for every reference entry or article, Encyclopedia.com cannot guarantee each citation it generates. Therefore, it’s best to use Encyclopedia.com citations as a starting point before checking the style against your school or publication’s requirements and the mostrecent information available at these sites:
Modern Language Association
The Chicago Manual of Style
http://www.chicagomanualofstyle.org/tools_citationguide.html
American Psychological Association
Notes:
 Most online reference entries and articles do not have page numbers. Therefore, that information is unavailable for most Encyclopedia.com content. However, the date of retrieval is often important. Refer to each style’s convention regarding the best way to format page numbers and retrieval dates.
 In addition to the MLA, Chicago, and APA styles, your school, university, publication, or institution may have its own requirements for citations. Therefore, be sure to refer to those guidelines when editing your bibliography or works cited list.