[Home]  [Syllabus]  [Statnotes]  [Links]  [Lab]  [Instructor]  [Home]

Time Series Analysis


  • Autocorrelation is the serial correlation of error terms for estimates of a time series variable, resulting from the fact that the value of a datum in time t in the series is dependent on the value of that datum in time t - 1 (or some higher lag). Autocorrelation can be detected visually by examining a regression line scatterplot. Distances from the regression line to the actual values represent error. When no autocorrelation exists, dots representing actual values will be randomly scattered around the full length of the series. Negative autocorrelation exists when, as one moves along the x axis, the next observation tends to be lower than the previous one, then the next one higher, then lower, and so on, indicating a negative or reactive dependency exists for observations in the series. A positive autocorrelation extis when, as one moves along the x axis, there are series of above-the-line observations, then series of below-the-line observations, then more highs, etc., indicating that a positive or inertial dependency exists for the observations.

  • Decomposition refers to separating a time series into trend, cyclical, and irregular effects. Decomposition may be linked to de-trending and de-seasonalizing data so as to leave only irregular effects, which are the main focus of time series analysis.

  • Model order refers to the model's lag order. A lag order of -1 is common, indicating variables on the right hand side are lagged one time unit (that is, the variable on the left hand side of the equation -- the variable being forecast -- is matched with values of the independents one time period previous. Testing for model order follows identification and specification, and is the first step in estimation. Model order tests include the likelihood ratio test (LR), the final prediction error test (FPE), and the autoregressive transfer function test (CAT), to name a few.


    Assumptions

    1. Stationarity is a critical assumption of time series analysis, stipulating that statistical descriptors of the time series are invariant for different ranges of the series. Weak stationarity assumes only that the mean and variance are invariant. Strict stationarity also requires that the series is normally distributed. Stationarity is tested by the following tests: Durbin-Watson, Dickey-Fuller, Augmented D-F, and Root Examination for univariate time series. There is also a test (Fountis-Dickey) for joint stationarity when modeling two time series together. Stationarity may also be inspected graphically in SPSS by selecting Graphs, Sequence, to visually inspect for a linear or quadratic slope. Testing stationarity is a first step in time series modeling. These may be followed by tests for normality: the normal distribution test, Jarqua-Bear, or studentized range tests.

    2. Uncontrolled autocorrelation. Time series analysis requires stationarity be established through differencing or some other technique. If two variables trend upward in raw data, as do GNP and entertainment expenditures, they will tend to correlate highly when a linear technique such as OLS (ordinary least-squares) regression is applied. In fact, many if not most nationally aggregated variables are of this type. For data in such series, the value of any given datum is largely determined by the value of the preceding datum in the series. This autocorrelation must be controlled before inferences may be made about correlation with other variables. Failure to control autocorrelation is vary apt to lead to spurious results, thinking there is a strong effect of, say, entertainment expenditures on GNP.

      More technically, significance tests of OLS regression estimates assume non-autocorrelation of the error terms. Error terms at sequential points in the series should constitute a random series. It is also assumed that the mean of the error terms will be zero (because estimates are half are above and half below the actual values), and the variance of the error terms will be constant throughout the time series. When, as in many time series, the value of a datum in time t largely determines the value of the subsequent datum in time t + 1, a dependency exists linking the error terms and the non-autocorrelation assumption is violated. The practical effect is that the significance of OLS estimates is computed to be far better than actual, leading the researcher to think that significant relationships exist when they do not. The Durbin-Watson test is the standard test for autocorrelation.

    3. Applying Linear Techniques to Nonlinear Data. OLS regression assumes linear relationships. Applying linear techniques to nonlinear data will underestimate relationships and increase error of estimate. As with other uses of OLS regression, the linearity assumption is not violated by adding power or other nonlinear transform terms to the equation (ex., income-squared). The researcher must conduct tests for linearity. A common test is Ramsey's RESET test, discussed in the section on data assumptions. There are a variety of other tests for linear or nonlinear dependence, including the Keenan, Luukkonen, McLeod-Li, and Hsieh tests. If non-linearity is present, it may be possible to eliminate it by double differencing or data transformation (ex., logarithmic).

    4. Arbitrary model lag order. Model lag order can have great effects on results. While tests exist to determine the optimal model order, these tests are purely statistical in nature. The researcher should have a theoretical basis establishing the face validity of the order of the model he or she has put forward.

    5. No outliers. As in other forms of regression, outliers may affect conclusions strongly and misleadingly.

    6. Random shocks. If shocks are present in the time series, they are assumed to be randomly distributed with a mean of 0 and a constant variance.

    7. Uncorrelated random error. Residuals in a good time series model will be randomly distributed, exhibit a normal distribution, have non-significant autocorrelations and partial autocorrelations, and have a mean of 0 and homogeneity of variance over time. Correlated error does not bias estimates but does inflate standard errors, making statistical inference problematic. The Durbin-Watson test is the standard test for correlated error.



    Frequently Asked Questions


    Bibliography



    @c 2006, 2008 G. David Garson
    last update: 1/30/08.