Skip to main content

Unit Root and Cointegration Regression

Unit Root and Cointegration Regression




The study of the evolution through time of a variable or group of variables has existed since the dawn of empirical analysis in the mid-seventeenth century. The formulation of explicit statistical models for a time series process {yt } is typically in the form of an autoregressive moving average model

where the innovation or shock εt is assumed independently, identically distributed with mean zero and variance σ 2.

When the roots of lie outside the unit circle, the process {yt } is stationary in the sense that yt has a fixed mean and variance for all t, and the covariance between any pair of values depends only on their distance apart in time, not on their absolute location. When the roots lie inside the unit circle, the process {yt } is explosive in the sense that it quickly becomes unbounded as t increases. If there are d unit roots for the solution , and others are outside the unit circle, the process is called integrated of order d, I (d ), because the process can be interpreted as a d -fold partial sum of stationary process for d = 1, where yd-1, t = (1-L )dyt, L denotes the backward shift operator, Lsyt = yt-s (Box and Jenkins 1970). When there is only one unit root (d = 1), it is called an I(1) process or a unit root process ; thus when all the roots are outside the unit circle, it is also called an I(0) process.

Unit root process is intermediate between stationary and explosive process. Nelson and Plosser (1982) have found that many economic and financial time series are I (1) or I (2). There are many reasons that a time series of a variable is integrated. For instance, real business-cycle models predict that many real economic variables would contain a unit root. There is also the efficient market hypothesis in which the best predictor of tomorrows price y t +1 is todays price yt (Fama 1970).

When y t + 1 = yt + εt, it is commonly referred to as a random walk process. Halls consumption function theory (1978) implies that, under a simple version of the permanent income hypothesis, future changes in consumption are unpredictable, so consumption follows a random walk. The predictions of these theories often extend to multivariate relations. For example, if income has a unit root, then consumption as a function of income will also have a unit root. The residuals that can be viewed as a linear combination of consumption and income will not have a unit root. When some linear combination of I (d ) variables has a lower order of integration, these variables are cointegrated, in Engle and Grangers terminology (1987).


If a variable, say yt, has a unit root, the shock to the variable never dies out. A unit root process can be written as the sum of past shocks plus an I (0) current shock. Because the sum of past shock is a dominant term in yt, this permanent component constitutes a stochastic trend. Information on the degree of persistence in a time series and, in particular, on its order of integration, can help to guide the construction or testing of economic theories.

Box and Jenkins (1970) proposed an informal data-analytic basis for the choice of d. Formal testing of a unit root is usually conducted by fitting a p -th order autoregressive form

or its error-correction representation,

where δ = (1- L ), and The null that Σaj = 1 is equivalent to the null ρ = 0.

Time series regressions that include integrated variables have statistical properties that are very different from the time series regressions with stationary variables. The least squares estimator of ρ when the regressor is I (1) converges to its true value 0 at the speed T, not at speed T 1/2, and has a limiting distribution that is a functional of Brownian motion (Chan and Wei 1988). Dickey and Fuller (1979) derive the t -statistics for the null hypothesis ρ = 0 by considering the limiting behavior of quadratic forms. The distribution of Dickey-Fuller t -statistic is tabulated by Dickey (1976).

Applying Dickey-Fuller procedure to the first difference of yt provides a test of I (2) against I (1). When the process {yt } may possess two or more unit roots, Dickey and Pantula (1987) recommend a downward, sequential t -test procedure starting with the greatest suspected number of unit roots, d (i.e., transform (2) into equivalent formulation (3) in terms of δdyt ), because the upward testing procedure starting with a test for a single unit root is an inconsistent test. Pantula (1989) proves that the distribution of the relevant t -statistic under each null has the standard Dickey-Fuller (1979) distribution.

From the perspective of empirical work, (2) may be overly restrictive. The process {y t} may also be a function of trend and serially correlated I (0) errors. Moreover, {y t} may be subject to trend breaks or structural breaks with unknown breakdays, or trend may be misspecified (Perron 1989). The bulk of the large literature on tests for unit roots has been to propose tests that (1) are asymptotically similar under the general I (1) null, in the sense that the null distribution depends neither on the parameters of the trend process nor the parameters describing the short-run dynamics of εt (Phillips and Perron 1988); (2) have good power in large samples; and (3) exhibit small size distortions. Although the asymptotic size and power vary greatly across the proposed test statistics, the Monte Carlo findings for finite sample yield similar conclusions: The proposed tests have relatively low power against I (0) alternatives, and there are substantial size distortions for the tests. For instance, with 100 stationary observations, the one-sided 5 percent significance level Dickey-Fuller t -test has a power of only 0.19. On the other hand, processes that are I (1) but which have moderate negative autocorrelation in first difference are incorrectly rejected with high probability. (For detail, see Stock 1994).


Multivariate time series methods are widely used by empirical economists. One important implication of economic theory is that certain ratios are stable. For instance, in response to growth in productivity and population, neoclassical growth models (King, Plosser, and Rebelo 1988) predict that output (y ), consumption (c ), and investment (i ) will grow in a balanced way. That is, even though yt, ct, and it increase permanently in response to increases in productivity and population, there are not permanent shifts in ct yt or it ty. In other words, these variables are cointegrated.

For ease of exposition, here we shall only focus on the classical analysis of I (1) and I (0) systems. Let {w̰ t} be a sequence of m × 1 vector of I (1) time series observations. Vector autoregressive model (VAR) of the form

provides a flexible and tractable framework for analyzing w̰ t (as in Hsiao 1979 and Sims 1980). Transforming (4) into an errorcorrection form provides a framework for analyzing both longrun and short-run economic relations

whereand j = 1, , p, and Im denotes

the m -rowed identity matrix. If there are m unit roots, , each series in the system is governed by a different stochastic trend. When there are fewer than m unit roots in (4), say n, then rank (*p) = r, r = m n, 1 n m. When rank , one can write where α̰ and β̰ are m × r full column rank matrices. Since w̰t is I (1), δ w̰ is I (0), β̰w̰t-p must be I (0) to ensure compatibility between the left-hand and right-hand side of (5). Therefore, the matrix, β̰ is called the cointegrating matrix that describes equilibrium or long-run relations within a fully dynamic framework. The matrix α̰ transmits the deviations from the long-run relations, ḛt = β̰w̰t, into respective elements of w̰t. The matrix , j = 1, , p 1, describes the short-run adjustment pattern that provides information on how soon the long-run equilibrium is restored after any of the variables in the system are perturbed by a shock. In economics, the existence of long-run relations and the strength of attraction to such a state depends on the actions of a market or on government intervention. The concept of cointegration has been applied in a variety of economic models including the relationship between capital and output, real wages and labor productivity, and so on (King, Plosser, Stock, and Watson).

Several issues arise from the cointegration analysis. First, because β̰w̰t is I (0), w̰t must be driven by some common trends. In other words, cointegration signifies comovements among trending variables. However, only the rank of cointegration can be uniquely determined. The cointegrating matrix β̰ is not uniquely defined. Any linear combination of cointegrating vector is a cointegrating vector or for any r × r nonsingular constant matrix C. (Therefore, the maximum number of linearly independent cointegrating vectors in a system of m I (1) variable, is (m 1). Normalization rule for the unique determination of the cointegrating matrix has to be used.

Second, although the least squares estimator with integrated regressor is consistent, it has very different statistical properties from the least squares estimator with stationary regressors (Chan and Wei 1988; Phillips and Durlauf 1986). Some of the estimated coefficients of (4) converge to the true values at speed T 1/2 and are asymptotically normally distributed; some converge to the true values at speed T and have non-normal asymptotic distributions (Sims, Stock, and Watson 1990). This raises the issue of statistical inference because in some instances the usual test statistics can be approximated by chi-square distributions while in other circumstances, they cannot.

Third, to overcome the miscentering and skewness of the usual test statistics because of the issues of unit root distribution, one can either condition on the innovations driving the common stochastic trends of the system (Phillips 1991) or use the reduced rank regression technique (Johansen 1988; 1991). In a system of m I (1) variables, the possible common trends vary between 1 and m. However, the knowledge of the rank of cointegration or nonstationarity (location of unit roots) is not generally known a priori. Johansen (1991) proposes likelihood-based tests for the rank of cointegration by considering the likelihood function of (5), whereas Stock and Watson (1988) propose a rank test for based on the fact that if there are r linearly independent cointegrating relations, then w̰t are driven by m r common trends, hence the sum of cross-products of w̰t divided by T 2 can only have rank m r. Unfortunately, neither method provides reliable inference in finite sample. Many Monte Carlo studies show that there are serious size and power distortions (Ho and Soresen 1996; Gonzalo and Pitarakis 1999).

The literature on unit root and cointegration analysis has greatly enhanced our understanding of dynamic econometric modeling of economic time series and provides a useful repertoire of tools for empirical analysis. However, unit root tests and cointegration analysis also raise serious finite sample issues. Tests with better finite sample size and power remain to be developed. More attention to economic theory and integrating economic analysis with time series analysis could also be fruitful.

SEE ALSO White Noise


Banerjee, Anindya, Juan J. Dolado, John W. Galbraith, and David F. Hendry. 1993. Cointegration, Error-Correction, and the Econometric Analysis of Non-Stationary Data. Oxford: Oxford University Press.

Bierens, Herman J. 2001. Unit Roots. In A Companion to Theoretical Econometrics, ed. Badi H. Baltagi, 610633. Oxford: Blackwell.

Box, George E. P., and Gwilym M. Jenkins. 1970. Time Series Analysis, Forecasting, and Control. San Francisco: Holden Day.

Chan, Ngai H., and Ching Z. Wei. 1988. Limiting Distributions of Least Squares Estimates of Unstable Autoregressive Processes. Annals of Statistics 16: 367401.

Dickey, David A. 1976. Estimation and Testing of Nonstationary Time Series. PhD diss., Iowa State University.

Dickey, David A., and Wayne A. Fuller. 1979. Distribution of the Estimators for Autoregressive Time Series with a Unit Root. Journal of the American Statistical Association 74: 427431.

Dickey, David A., and Sastry G. Pantula. 1987. Determining the Order of Differencing in Autoregressive Processes. Journal of Business and Economic Statistics 5: 455462.

Diebold, Francis X., and Marc Nerlove. 1989. Unit Roots in Economic Time Series: A Selective Survey. In Advances in Econometrics: Cointegration, Spurious Regressions, and Unit Roots, vol. 8, eds. Thomas B. Fomby and George F. Rhodes, 369. Greenwich, CT: JAI Press.

Engle, Robert F., and Clive W. J. Granger. 1987. Cointegration and Error Correction: Representation, Estimation, and Testing. Econometrica 55: 251276.

Fama, Eugene F. 1970. Efficient Capital Markets: A Review of Theory and Empirical Work. Journal of Finance 25: 383417.

Gonzalo, Jesus, and J. Y. Pitarakis. 1999. Dimensionality Effect in Cointegrated Systems. In Granger Festschrift, eds. Robert Engle and H. White, 212229. Oxford: Oxford University Press.

Hall, Robert E. 1978. Stochastic Implication of the Life Cycle-Permanent Income Hypothesis: Theory and Evidence. Journal of Political Economy 86: 971987.

Ho, Mun, and Bent Sørensen. 1996. Finding Cointegration Rank in High Dimensional Systems Using the Johansen Test: An Illustration Using Data Based on Monte Carlo Simulations. Review of Economics and Statistics 78: 726732.

Hsiao, Cheng. 1979. Autoregressive Modelling of Canadian Money and Income Data. Journal of the American Statistical Association 74: 553560.

Johansen, Søren. 1988. Statistical Analysis of Cointegration Vectors. Journal of Economic Dynamics and Control 12: 231254.

Johansen, Søren. 1991. Estimation and Hypothesis Testing of Cointegration Vectors in Gaussian Vector Autoregressive Models. Econometrica 59: 15511580.

King, Robert G., Charles I. Plosser, and Sergio T. Rebelo. 1988. Production Growth and Business Cycles II: New Directions. Journal of Monetary Economics 21: 309342.

Nelson, Charles R., and Charles I. Plosser. 1982. Trends and Random Walks in Macroeconomic Time Series: Some Evidence and Implications. Journal of Monetary Economics 10: 139162.

Pantula, Sastry G. 1989. Testing for Unit Roots in Time Series Data. Econometric Theory 5: 256271.

Perron, Pierre. 1989. The Great Crash, the Oil Price Shock, and the Unit Root Hypothesis. Econometrica 57: 13611401.

Phillips, Peter C. B. 1991. Optimal Inference in Cointegrating Systems. Econometrica 59: 283306.

Phillips, Peter C. B., and Pierre Perron. 1988. Testing for a Unit Root in a Time Series Regression. Biometrika 75: 335346.

Sims, Christopher A., James H. Stock, and Mark W. Watson. 1990. Inference in Linear Time Series Models with Some Unit Roots. Econometrica 58: 113144.

Stock, James H. 1994. Unit Roots, Structural Breaks, and Trends. In Handbook of Econometrics, vol. 4, eds. Robert F. Engle and Daniel L. McFadden, 27392841. Amsterdam: North-Holland.

Stock, James H., and Mark W. Watson. 1988. Testing for Common Trends. Journal of the American Statistical Association 83: 10971107.

Watson, Mark W. 1994. Vector Autoregressions and Cointegration. In Handbook of Econometrics, vol. 4, eds. Robert F. Engle and Daniel L. McFadden, 28442915. Amsterdam: North-Holland.

Cheng Hsiao

Cite this article
Pick a style below, and copy the text for your bibliography.

  • MLA
  • Chicago
  • APA

"Unit Root and Cointegration Regression." International Encyclopedia of the Social Sciences. . 22 Jan. 2019 <>.

"Unit Root and Cointegration Regression." International Encyclopedia of the Social Sciences. . (January 22, 2019).

"Unit Root and Cointegration Regression." International Encyclopedia of the Social Sciences. . Retrieved January 22, 2019 from

Learn more about citation styles

Citation styles gives you the ability to cite reference entries and articles according to common styles from the Modern Language Association (MLA), The Chicago Manual of Style, and the American Psychological Association (APA).

Within the “Cite this article” tool, pick a style to see how all available information looks when formatted according to that style. Then, copy and paste the text into your bibliography or works cited list.

Because each style has its own formatting nuances that evolve over time and not all information is available for every reference entry or article, cannot guarantee each citation it generates. Therefore, it’s best to use citations as a starting point before checking the style against your school or publication’s requirements and the most-recent information available at these sites:

Modern Language Association

The Chicago Manual of Style

American Psychological Association

  • Most online reference entries and articles do not have page numbers. Therefore, that information is unavailable for most content. However, the date of retrieval is often important. Refer to each style’s convention regarding the best way to format page numbers and retrieval dates.
  • In addition to the MLA, Chicago, and APA styles, your school, university, publication, or institution may have its own requirements for citations. Therefore, be sure to refer to those guidelines when editing your bibliography or works cited list.