Skip to main content

Censoring, Sample

Censoring, Sample



In order to define censoring in sample data, researchers first distinguish between latent data, the data that are hidden from the observer, and observed data, which are their measured counterparts. The latent data are modified between the original data creation and the observation by the analyst. Censoring occurs when certain values in the latent data are transformed so that their identities in the original data are masked or hidden from the observer.

A natural, familiar case is institutional censoring. Certain government statistics, such as income data on individuals and line of business data on firms, are censored to mask the identities of the persons or businesses. Thus income might be reported not as their original values, but only as being in a certain bracket. Censoring also occurs naturally in the way that certain data are observed. A leading example is the observations on durations in medical statistics. Observed data on the longevity after the surgery of heart transplant patients, or the length of survival after onset of a disease, are naturally censored if the individual leaves the observation setting before the transition takes place. Thus the hospital may at some point lose contact with the heart transplant patient. The observation consists of the knowledge that the patient was still alive at the time of his or her exit from the study, but not how long he or she survived. In another familiar case, the true levels of demand for sporting and entertainment events are not revealed by ticket sales because the venue may sell out. The observed reflection of the demand is only ticket sales, limited by the capacity of the venue. Some economic phenomena, known as corner solutions, also lead to censoring when the observed counterpart to a variable of interest has a boundary value. Thus the amount of insurance an individual desires may be censored at zero. The amount of investment that a business undertakes might be recorded as zero if the assets of the business are allowed to depreciate, such that the true investment is actually negative.


Models of statistical phenomena usually describe relationships between, or co-movements of, variables. Censoring interferes with this sort of modeling. (See Greene 2003 and Maddala 1983 for analysis and extensions.) Suppose that the relationship occurs between the latent variables of interest. If a variable x* is expected to explain the movement of variable y, and x* is censored to reveal x, then the analyst will measure movements in y that are associated with movements in x* when x does not change. In the opposite case, when it is y* that is censored and x that is not, the analyst will observe movement in x that should be associated with movement in y, but is not. Either case leads to a distortion in the measured relationship between a censored variable and an uncensored one. Contemporary model builders accommodate this type of distortion by building specific models for the censoring process along with the relationship of interest. Controversy arises over the many assumptions that must be made in order to make reasonable analysis feasible. Ultimately, the estimated relationship, such as that between prices and ticket sales for sporting events, is also a function about what is assumed about the underlying process whereby true underlying data are translated into observed, censored data, for example, how true demand is translated into ticket sales.

SEE ALSO Censoring, Left and Right


Greene, William H. 2003. Econometric Analysis. 5th ed. Upper Saddle River, NJ: Prentice Hall, 2003.

Maddala, G. S. 1983. Limited-Dependent and Qualitative Variables in Econometrics. Cambridge, U.K.: Cambridge University Press.

William Greene

Cite this article
Pick a style below, and copy the text for your bibliography.

  • MLA
  • Chicago
  • APA

"Censoring, Sample." International Encyclopedia of the Social Sciences. . 20 Apr. 2019 <>.

"Censoring, Sample." International Encyclopedia of the Social Sciences. . (April 20, 2019).

"Censoring, Sample." International Encyclopedia of the Social Sciences. . Retrieved April 20, 2019 from

Learn more about citation styles

Citation styles gives you the ability to cite reference entries and articles according to common styles from the Modern Language Association (MLA), The Chicago Manual of Style, and the American Psychological Association (APA).

Within the “Cite this article” tool, pick a style to see how all available information looks when formatted according to that style. Then, copy and paste the text into your bibliography or works cited list.

Because each style has its own formatting nuances that evolve over time and not all information is available for every reference entry or article, cannot guarantee each citation it generates. Therefore, it’s best to use citations as a starting point before checking the style against your school or publication’s requirements and the most-recent information available at these sites:

Modern Language Association

The Chicago Manual of Style

American Psychological Association

  • Most online reference entries and articles do not have page numbers. Therefore, that information is unavailable for most content. However, the date of retrieval is often important. Refer to each style’s convention regarding the best way to format page numbers and retrieval dates.
  • In addition to the MLA, Chicago, and APA styles, your school, university, publication, or institution may have its own requirements for citations. Therefore, be sure to refer to those guidelines when editing your bibliography or works cited list.