Serial Correlation
Serial Correlation
Serial correlation is a statistical term that refers to the linear dynamics of a random variable. Economic variables tend to evolve parsimoniously over time and that creates temporal dependence. For instance, as the economy grows, the level of gross national product (GNP) today depends on the level of GNP yesterday; or the present inflation rate is a function of the level of inflation in previous periods since it may take some time for the economy to adjust to a new monetary policy.
Consider a time series data set {Y_{t}, X _{1t }. …, X_{kt} } for t = 1, 2, …, T. Our interest is to estimate a regression model like Y =1 β_{0}+β_{1}X_{1t} + …, β_{k}X_{kt} + ε_{t}. For instance, Y_{t } is the inflation rate, and X _{1t }, …, X_{kt} is a set of regressors such as unemployment and other macroeconomic variables. Under the classical set of assumptions, the GaussMarkov theorem holds, and the ordinary least squares (OLS) estimator of the β _{i }’s is the best, linear, and unbiased estimator (BLUE). Serial correlation is a violation of one of the classical assumptions. Technically, we say that there is serial correlation when the error term is linearly dependent across time, that is, the cov(ε_{t },ε_{s }) ≠ 0 for t ≠ s. We also say that the error term is autocorrelated. The covariance is positive when on average positive (negative) errors tend to be followed by positive (negative) errors; and the covariance is negative when positive (negative) errors are followed by negative (positive) errors. In either case, a covariance that is different from zero will happen when the dependent variable Y_{t} is correlated over time and the regression model does not include enough lagged dependent variables to account for the serial correlation in Y_{t}. The presence of serial correlation invalidates the Gauss Markov theorem. The OLS estimator can still be unbiased and consistent (large sample property), but it is no longer the best estimator, the minimum variance estimator. More importantly, the OLS standard errors are not correct, and consequently the ttests and Ftests are invalid.
There are several models that can take into account the serial correlation of ε_{t }: the autoregressive model AR(p), that is, ε_{t} = ρ_{1}ε_{t – 1} + ρ_{2}ε_{t – 2} + … ρ_{p}ε_{t – p} + v_{t}, where v_{t} is now uncorrelated with zero mean and constant variance; the moving average MA(q) ε_{t} = θ_{q}v_{t – q} + … θ_{1}v _{t – 1 } + v_{t} ; or a mixture model ARMA(p,q). Testing for serial correlation in the error term of a regression model amounts to assessing whether the parameters ρ_{i }’s and θ_{i }’s are statistically different from zero. A popular model within economics is the AR(1) model: ε_{t} = ρ_{1}ε_{t – 1} + v_{t}. Within this model, the null hypothesis to test is H_{0} :ρ_{1} = 0. If we reject the null hypothesis, we conclude that there is serial correlation in the error term. To implement the test, we proceed by running OLS in the regression model. We retrieve the OLS residuals ε̂_{t } and, assuming that the regressors X _{1t }, X _{2t }, …, X_{kt } are strictly exogenous, we regress ε̂_{t } on ε̂_{t  1 }. A tstatistic for H_{0} : ρ_{1} = 0 will be asymptotically valid. If the autoregressive model is of a large order, an Ftest for a joint hypothesis as H_{0} : ρ_{1} = ρ_{2} = … = ρ_{p} = 0 will also be valid. If the regressors are not strictly exogenous, the auxiliary regression of ε̂_{t1 } on ε̂_{t – 1} should be augmented with the set of regressors X _{1t }, X _{2t }, …, X_{kt} for the ttest and Ftest to be valid. There is also a very popular statistic, the DurbinWatson, which also requires strict exogeneity that tests for AR(1) serial correlation. The main shortcoming of this test is the difficulty in obtaining its null distribution. Though there are tabulated critical values, the test leads to inconclusive results in many instances.
Once we conclude that there is serial correlation in the error term, we have two ways to proceed depending upon the exogeneity of the regressors. If the regressors are strictly exogenous, we proceed to model the serial correlation and to transform the data accordingly. A regression model based on the transformed data is estimated with generalized least squares (GLS). If the regressors are not exogenous, we proceed to make the OLS standard errors robust against serial correlation. In the first case, let us assume that there is serial correlation of the AR(1) type, that is, ε_{t} = ρ_{1}ε_{t – 1} + v_{t}. In order to eliminate the serial correlation, we proceed to transform the data by quasidifferencing. For simplicity, suppose that the regression model is Y_{t} = β_{0} + β_{1} X_{1t} + ε_{t}. The following transformation will produce a regression model with an uncorrelated error term:
Y_{t} – ρ_{1} Y_{t – 1} = β_{0}(1 – ρ_{1}) + β_{1}(X_{1t} – ρ_{1}X_{1t – 1}) + ε_{t} – ρ_{1}ε_{t – 1} = β_{0}(1 – ρ_{1}) + β_{1}(X_{1t} – ρ_{1}X_{1t – 1}) + v_{t}
If ρ_{1} is known, it is easy to obtain the quasidifferenced data, that is, Y_{t} – ρ _{1} Y_{t1} and X_{t} – ρ_{1}X _{t1 } and proceed to run OLS in the model with the transformed data. This will produce a GLS estimator of the β _{i }’s that now will be BLUE as the new error term v_{t} is free of serial correlation. In practice, ρ _{1} is not known and needs to be consistently estimated. The estimate ρ̂_{1} is obtained from the auxiliary regression of ε̂_{t} on ε̂_{t}. We proceed by quasidifferencing the data, Y_{t} – ρ̂_{1}Y_{t –1} and X_{t} – ρ̂ _{1}X _{t – 1}, and as before, running OLS with the transformed data. The estimator of the β_{i }’s now is called the feasible GLS estimator (FGLS), which is a biased estimator, though asymptotically is still consistent. In practice, the FGLS estimator is obtained by iterative procedures known as the Cochrane Orcutt procedure, which does not consider the first observation, or the PraisWinsten procedure, which includes the first observation. When the sample size is large, the difference between the two procedures is negligible.
In the second case, when the regressors are not strictly exogenous, we should not apply FGLS estimation because the estimator will not be even consistent. In this instance, we modify the OLS standard errors to make them robust against any form of serial correlation. There is no need to transform the data as we just run OLS with the original data. The formulas for the robust standard errors, which are known as the HAC (heteroscedasticity and autocorrelation consistent) standard errors, are provided by Whitney Newey and Kenneth West (1987). Nowadays, most of the econometric software calculates the HAC standard errors, though the researcher must input the value of a parameter that controls how much serial correlation should be accounted for. Theoretically, the value of this parameter should grow with the sample size. Newey and West advised researchers to choose the integer part of 4(T /100)^{2/9} where T is the sample size. Although by computing the HAC standard errors we avoid the explicit modeling of serial correlation in the error term, it should be said that they could be inefficient, in particular when the serial correlation is strong and the sample size is small.
SEE ALSO Least Squares, Ordinary; Pooled Time Series and Crosssectional Data; Properties of Estimators (Asymptotic and Exact); Time Series Regression; Unit Root and Cointegration Regression
BIBLIOGRAPHY
Newey, Whitney K., and Kenneth D. West. 1987. A Simple, Positive, SemiDefinite Heteroskedasticity and Autocorrelation Consistent Covariance Matrix. Econometrica 55: 703708.
Gloria GonzálezRivera
Cite this article
Pick a style below, and copy the text for your bibliography.

MLA

Chicago

APA
"Serial Correlation." International Encyclopedia of the Social Sciences. . Encyclopedia.com. 24 Aug. 2019 <https://www.encyclopedia.com>.
"Serial Correlation." International Encyclopedia of the Social Sciences. . Encyclopedia.com. (August 24, 2019). https://www.encyclopedia.com/socialsciences/appliedandsocialsciencesmagazines/serialcorrelation
"Serial Correlation." International Encyclopedia of the Social Sciences. . Retrieved August 24, 2019 from Encyclopedia.com: https://www.encyclopedia.com/socialsciences/appliedandsocialsciencesmagazines/serialcorrelation
Citation styles
Encyclopedia.com gives you the ability to cite reference entries and articles according to common styles from the Modern Language Association (MLA), The Chicago Manual of Style, and the American Psychological Association (APA).
Within the “Cite this article” tool, pick a style to see how all available information looks when formatted according to that style. Then, copy and paste the text into your bibliography or works cited list.
Because each style has its own formatting nuances that evolve over time and not all information is available for every reference entry or article, Encyclopedia.com cannot guarantee each citation it generates. Therefore, it’s best to use Encyclopedia.com citations as a starting point before checking the style against your school or publication’s requirements and the mostrecent information available at these sites:
Modern Language Association
The Chicago Manual of Style
http://www.chicagomanualofstyle.org/tools_citationguide.html
American Psychological Association
Notes:
 Most online reference entries and articles do not have page numbers. Therefore, that information is unavailable for most Encyclopedia.com content. However, the date of retrieval is often important. Refer to each style’s convention regarding the best way to format page numbers and retrieval dates.
 In addition to the MLA, Chicago, and APA styles, your school, university, publication, or institution may have its own requirements for citations. Therefore, be sure to refer to those guidelines when editing your bibliography or works cited list.