Model Evaluation Statistics

Contents

13 Post-estimation Evaluation Statistics
  13.1 Introduction
  13.2 Graphic analysis
  13.3 Recursive graphics
  13.4 Dynamic analysis
  13.4.1 Static long-run solution
  13.4.2 Roots of the autoregressive lag polynomial
  13.5 Collinearity analysis
  13.6 Forecasts
  13.6.1 Constancy tests
  13.7 Diagnostic tests
  13.7.1 Portmanteau statistic
  13.7.2 Test for autocorrelated residuals
  13.7.3 Test for autocorrelated squared residuals
  13.7.4 Test for normality
  13.7.5 Test for heteroscedasticity using squares
  13.7.6 Test for heteroscedasticity using squares and cross-products
  13.7.7 Small-sample properties of the mis-specification tests
  13.8 Linear restrictions test
  13.9 Exclusion restrictions
  13.10 Tests for omitted variables
  13.11 Encompassing tests
List of figures
  Figure:13.1 Selecting mis-specification tests: QQ plots for T=100
  Figure:13.2 Mis-specification test distributions in large and small samples
List of tables
  Table:13.1 Test battery

Chapter 13 Post-estimation Evaluation Statistics

13.1 Introduction

To check that the GUM is a congruent specification (formally defined in Ch. 9), PcGets computes a battery of mis-specification tests, selected by the user. These tests seek to ascertain that the residuals approximate homoscedastic, normal, white noise, so are not autocorrelated, heteroscedastic (on several measures), or non-normal. There are also checks for parameter constancy. Empirical congruence is shown by satisfactory performance on all these checks.

It is important that congruence is tested and confirmed prior to model selection, as only then will conventional inference procedures be reliable. However, the larger the number of tests that are computed, and the looser the significance levels, the higher the probability that one will reject by chance (see Ch. 10 for a detailed discussion). Conversely, the fewer the tests and the tighter their significance levels, the lower their power to detect mis-specification. The user must balance such considerations, or select one of the two pre-programmed strategies.

Less obviously, but of equal importance, the nominal significance levels of the tests must be close to their actual null rejection frequencies for the models being analyzed. For some model types and sample sizes, some tests are `badly behaved', in that they reject far more often under the null than their nominal significance levels: examples noted below include the portmanteau statistic, and the general heteroscedasticity test (see §13.7.7).

It must be stressed that the tests are only applied once for mis-specification in the GUM. Thereafter, they act as diagnostics for invalid model simplifications; thus, their repeated application does not alter their status. However, re-specifying the GUM after a mis-specification test rejects does alter its characteristics, as it now becomes a design criterion -- although, of course, it has by then served its purpose of revealing model mis-specification.

The remaining sections explain the evaluation tests computed in PcGets, concluding with a brief examination of their behaviour in finite samples.

13.2 Graphic analysis

Graphic analysis focuses on graphical inspection of individual equations. Let yt, y^t denote respectively the actual (that is, observed) values and the fitted values of the selected equation, with residuals e^t=yt-y^t, t=1,...,T. When H observations are retained for forecasting, then y^T+1,...,y^T+H are the 1-step forecasts.

Eight different types of graph are available:

  1. Actual and Fitted Values

    (yt,y^t) over t. This is a graph showing the fitted (y^t) and actual values (yt) of the dependent variable over time, including the forecast period;

  2. Cross-plot of Actual and Fitted Values

    y^t against yt, also including the forecast period;

  3. Scaled Residuals

    ( e^t/s^e) over t, where s^e2=(T-k)-1RSS is the full-sample equation error variance.

  4. Forecasts and Outcomes

    The static or dynamic forecasts can be plotted in a graph over time: y^T+h is shown with error bars, bands or fans of ±2SE( eT+h) centered on y^T+h (that is, an approximate 95% confidence interval for the forecast); yT+h is shown in-sample.

  5. Residual Correlogram

    This plots the correlogram using e^t as the xt variable in (eq:13.15).

  6. Residual Spectrum

    This plots the estimated spectral density (see the GiveWin book) using e^t as the xt variable.

  7. Residual Density Optionally with Histogram and Normal

    By default the histogram of the standardized residuals e^t/Ö(T-1RSS), t=1,...,T, and the estimated density fe^(.)^ are graphed using the settings described in the GiveWin book.

  8. Residual QQ Plot

    The estimated QQ plot of the residuals is shown with a standard normal for comparison.

The residuals can be saved to the database for further inspection.

13.3 Recursive graphics

Recursive methods estimate the model at each t for t=M-1,...,T. For forward recursion, the process is initialized by estimation over 1,...,M-1, which is followed by recursive updating over M,...,T: essentially the order is reversed for backward recursion. The output generated by the recursive procedures is most easily studied graphically, viewing multiple graphs together on screen. Recursive estimation throws the spotlight on parameter constancy.

Let b^t denote the k parameters estimated from a sample of size t, and yj-xj'b^t the residuals at time j evaluated at the parameter estimates based on the sample 1,...,t.

We now consider the generated output:

  1. Coefficients ±2 Standard Errors

    The graph shows b^it±2SE[b^i,t] for each selected coefficient i  ( i=1,...,k) over t=M,...,T.

  2. t-values

    b^it/SE[b^i,t] for each selected coefficient i  ( i=1,...,k) over t=M,...,T.

  3. Residual Sum of Squares

    The residual sum of squares at each t is RSSt=åj=1t(yj-xj'b^t)2 for t=M,...,T.

  4. Standardized Innovations

    The standardized innovations (or standardized recursive residuals) for RLS are:

    nt=(yt-xt'b^t-1)/(wt)1/2 where wt=1+xt'( Xt-1'Xt-1) -1xt for t=M,...,T.

    As pointed out in §12.62, s2wt is the 1-step forecast error variance of (12.4), and b^M-1 are the coefficient estimates from the initializing OLS estimation.

  5. 1-Step Residuals ±2s^e

    The 1-step residuals yt-xt'b^t are shown bordered by 0±2s^e,t over M,...,T. Points outside the 2 standard-error region are either outliers or are associated with coefficient changes.

  6. 1-Step F-Tests(1-Step Chow-Tests)

    1-step forecast tests are F( 1,t-k-1) under the null of constant parameters, for t=M,...,T. A typical statistic is calculated as:

    ( RSSt-RSSt-1) ( t-k-1)

    RSSt-1
    =
    nt2/wt

    s^e,t-12
    .
    (eq:13.1)

    Normality of yt is needed for this statistic to be distributed as an F.

  7. Break-point F-Tests (N¯-Step Chow-Tests)

    Break-point F-tests are F( T-t+1,t-k-1) for t=M,...,T. These are, therefore, sequences of Chow tests and are called N¯ because the number of forecasts goes from N=T-M+1 to 1. When the forecast period exceeds the estimation period, this test is not necessarily optimal relative to the covariance test based on fitting the model separately to the split samples. A typical statistic is calculated as:

    ( RSST-RSSt-1) ( t-k-1)

    RSSt-1(T-t+1)
    . =
    1

    T-t+1
    åm=tT nm2/ wm

    s^ e,t-12
    .
    (eq:13.2)

    This test is closely related to the CUSUMSQ statistic in Brown, Durbin and Evans (1975).

  8. Forecast F-Tests (N­-Step Chow- Tests)

    Forecast F-tests are F( t-M+1,M-k-1) for t=M,...,T, and are called N­ as the forecast horizon increases from M to T. This tests the model over 1 to M-1 against an alternative which allows any form of change over M to T. A typical statistic is calculated as:

    ( RSSt-RSSM-1) ( M-k-1)

    RSSM-1(t-M+1)
    .
    (eq:13.3)

The statistics in (eq:13.1)--(eq:13.3) are variants of Chow (1960) tests: they are scaled by 1-off critical values from the F-distribution at any selected probability level as an adjustment for changing degrees of freedom, so that the significant critical values become a straight line at unity. Note that the first and last values of (eq:13.1) respectively equal the first value of (eq:13.3) and the last value of (eq:13.2).

Test statistics from (eq:13.1)--(eq:13.3) are not calculated if instrumental variables are present.

13.4 Dynamic analysis

The general class of models estimable in PcGets has the form:

b0( L) yt=åi=1qbi( L) xi,t+et
(eq:13.4)

where b0( L) and the bi( L) are polynomials in the lag operator L, and q is the number of regressors: k remains the number of estimated coefficients. For simplicity, we take all polynomials to be of equal length m:

bi( L) =åj=0mbijLj,  i=0,...,q,

with b00=1. Letting a(L)=-åj=1majLj-1, we can write (eq:13.4) as:

yt=a( L) yt-1+åi=1qbi( L) xi,t+et.
(eq:13.5)

Finally, we use a=(a1,...,am)' and bi=(bi0,...,bim), i=1,...,q.

After estimation or selection, PcGets can analyze the polynomials involved, and it computes such functions as their roots and sums.

13.4.1 Static long-run solution

When working with dynamic models, concepts such as equilibrium solutions, steady-state growth paths, mean lags of responses etc., are generally of interest. In the simple model:

yt=a1yt-1+b10x1,t+b11x1,t-1+et,
(eq:13.6)

where all the variables are stationary, a static equilibrium is defined by:

E[ x1,t] =x1* for all t,
(eq:13.7)

in which case, E[ yt] =y* will also be constant if |a1|<1, and yt will converge to:

E[ yt] =kE[ x1,t] or y*=kx1* where k=
( b10+b11)

( 1-a1)
(eq:13.8)

For non-stationary, but cointegrated data, the first expression (eq:13.8) can be re-interpreted as E[ yt-kx1,t*] =0.

PcGets computes estimates of k, called static long-run parameters, and associated standard errors. If b0( 1) ¹0, the general long-run solution of (eq:13.4) is given by:

y*=åi=1q
bi( 1)

b0( 1)
xi*=åi=1qkixi*.
(eq:13.9)

The expression yt-Skixi,t is called the equilibrium-correction mechanism (EqCM) and can be stored in the data set.

The standard errors of k^=(k^1...k^q)' are calculated from:

V[ k^] ^=J^V[ b^] ^J^' when J=
k

b'
.
(eq:13.10)

PcGets calculates J analytically (see Bårdsen, 1989).

13.4.2 Roots of the autoregressive lag polynomial

Consider the simple autoregressive model:

yt=a1yt-1+a2yt-2+et,

which can be written as:

( 1-a1L-a2L2) yt=et.
(eq:13.11)

The eigenvalues of the dynamics li (inverses of the roots) are such that:

( 1-a1L-a2L2) =( 1-l1L) ( 1-l2L) ,

so a1=l1+l2 and a2=-l1l2. More generally:

åi=0naiLi=Õi=1n( 1-liL) .

PcGets reports the li.

13.5 Collinearity analysis

A collinearity analysis is provided for both the general and specific models. The second-moment matrix MXX=T-1X'X , and the matrix of correlations:

ri,j=
mi,j

(mi,imj,j)½
,

are reported, together with the eigenvalues of the former, which should all be real and positive. A comparison of the eigenvalues before and after selection can be useful, as can large ratios of the biggest to the smallest eigenvalues. Eigenvalues of R close to zero indicate the presence of collinearity.

13.6 Forecasts

If dynamic forecasts are requested, a tabular presentation provides the forecasts and their standard errors; if the forecasting is in-sample, the actual outcome and the forecast error with the t-value thereof are also reported. As noted above, forecasts and their standard-error based bounds can be graphed.

13.6.1 Constancy tests

The test of parameter constancy has the form of a Chow (1960) test:

h3=
( RSST-RSST1) /( T-T1)

RSST1/(T1-k)
app~   F( T-T1,T1-k) on H0
(eq:13.12)

where H0 is parameter constancy. In (eq:13.12), RSS T is the full-sample residual sum of squares, RSST1 is for the relevant sub-sample, and there are k regressors. For fixed regressors, the Chow (1960) test is exactly distributed as an F, but is only approximately (or asymptotically) so in dynamic models.

The (Chow) test of parameter constancy out-of-sample has the form:

h3=
( RSST+H-RSST) /H

RSST/(T-k)
app~   F( H,T-k) on H0
(eq:13.13)

where H0 is parameter constancy.

13.7 Diagnostic tests

Irrespective of the estimator selected, a wide range of model evaluation criteria are automatically computed. Tests are available for residual autocorrelation, conditional heteroscedasticity, non-normality, unconditional heteroscedasticity, functional-form mis-specification and omitted variables. The diagnostic tests of this section concern the past (checking that the errors are a homoscedastic, normal, innovation process relative to the information available), whereas the forecast statistics discussed above concern the future (and encompassing tests concern information specific to rival models).

Many test statistics in PcGets have either a c2 distribution or an F distribution. Diagnostic tests are usually reported as:

                   value      prob
normality test    7.5674    0.0227
AR   1-4 test     1.7694    0.1428

where the two tests shown are a c2(2) and an F(4,T-4) (with four degrees of freedom in the numerator, and T-4 in the denominator). The observed values are 7.5674 and 1.7694, and the probabilities of getting values as large (or larger) under their null distributions are 0.0227 and 0.1428. Test outcomes that exceed the user-selected significance levels are shown by two stars.

Probability values for F-tests are calculated using an algorithm based on Majunder and Bhattacharjee (1973a) and Cran, Martin and Thomas (1977).

[Note: As recommended in Cran, Martin and Thomas (1977), the approach in Pike and Hill (1966) is used for the logarithm of the gamma function.]

Those for the c2 are based on Shea (1988). The significance points of the F-distribution derive from Majunder and Bhattacharjee (1973b).

Many of the diagnostic tests are calculated using an auxiliary regression (see Pagan, 1984), so take the form of TR2 for the auxiliary regression: they are asymptotically distributed as c2( s) under their nulls, and have the usual additive property for independent c2s. Following Kiviet (1986) and Harvey (1990), F-approximations of the form:

R2

1-R2
.
T-k-s

s
~F( s,T-k-s)
(eq:13.14)

are calculated, as these seem to be better behaved in small samples. The precise statistics calculated are now described.

13.7.1 Portmanteau statistic

The correlogram of a variable xt is the series {rj} where rj is the correlation coefficient between xt and xt-j for j=1,...,s:

rj=
åt=j+1T( xt-x0) ( xt-j-xj)

(åt=j+1T( xt-x0) 2åt=j+1T( xt-j-xj) 2)½
.
(eq:13.15)

Here, xi=åt=j+1Txt-i/(T-j) is the sample mean of xt-i (t=j+1,...,T), where i=0,1,...,s, so that rj corresponds to a correlation coefficient (note the difference from the definition in many time-series textbooks, where the denominator is defined in terms of å1T(xt-x)2: however, this difference tends to be small, and vanishes asymptotically).

The residual correlogram is defined as in (eq:13.15), but using the residuals from the fitted model, rather than the original data, i.e., correlations {rj} are computed between the residuals ût and ût-j. The portmanteau statistic is defined by:

LB( s) =T2åj=1s
rj2

T-j
(eq:13.16)

as in Box and Pierce (1970), but with the degrees-of-freedom correction suggested by Ljung and Box (1978). It was designed as a goodness-of-fit test for stationary, autoregressive moving-average models. Under the assumptions of the test, LB(s) is asymptotically distributed as c2(s-n) after fitting an AR(n) model. A value LB(s)³2s for large s suggests that the residuals are not white noise. However, Monte Carlo simulations of its behaviour in fitted conditional econometric equations suggest it should not form part of the test battery in model selection: see e.g., Teräsvirta (1976) and §13.7.7. An appropriate test for autocorrelated residuals is provided in §13.7.2 following.

13.7.2 Test for autocorrelated residuals

This is the Lagrange-multiplier (LM) test for rth-order residual autocorrelation, distributed as c2( r) in large samples, under the null hypothesis that there is no autocorrelation (that is, that the errors are white noise). In standard usage, r~= 1/2 s for s in §13.7.1 above, so this provides a type of portmanteau test (see Godfrey, 1978). However, any orders from 1 up to 12 can be selected to test against:

ut=åi=praiut-i+et where 0£p£r.

The F-form suggested in (e.g.) Kiviet (1986) and Harvey (1990) is the recommended diagnostic test. The LM test is calculated by regressing the residuals on all the regressors of the original model and the lagged residuals for lags p to r (missing residuals are set to zero). The LM test c2(r-p+1) is TR2 from this regression (or the F-equivalent), and the error autocorrelation coefficients are estimated by the coefficients of the lagged residuals. For an excellent exposition, see Pagan (1984).

13.7.3 Test for autocorrelated squared residuals

This is the ARCH (autoregressive conditional heteroscedasticity test: see Engle, 1982a) which in its present form tests the hypothesisg=0 in the model:

E[ ut2½ut-1,...,ut-r] =c0+åi=1rgiut-i2

where g=( g1,...,gr) ' . Then TR2 from the regression of ût2 on a constant and ût-12 to u^t-r2 (called the ARCH test) is asymptotically distributed as c2( r) on H0: g=0: as usual, the F-form is reported. Both first-order and higher-order lag forms are easily calculated (see Engle, Hendry and Trumbull, 1985, for small-sample evidence on the former).

13.7.4 Test for normality

This is a test for whether the skewness and kurtosis of the residuals correspond to that of a normal distribution: see Doornik and Hansen (1994). For a variable xt, let m and sx2 denote its mean and variance, and write mi=E[ xt-m] i, so that sx2=m2 . The skewness and kurtosis are defined respectively as:

Öb1=
m3

m23/2
and b2=
m4

m22
.
(eq:13.17)

A normal variate has Öb1=0 and b2=3.

Sample estimates of these four parameters are given by

x= 1/T åt=1Txt,    mi= 1/T åt=1T( xt-x) i,    Öb1=
m3

m23/2
and b2=
m4

m22
.
(eq:13.18)

Bowman and Shenton (1975) consider a c2( 2) test based on Öb1 and b2 but deem it unsuitable unless the sample size is very large: the statistics Öb1 and b2 are not independently distributed, and the sample kurtosis approaches normality very slowly (see Shenton and Bowman, 1977, and D'Agostino, 1970). Instead, let z1 and z2 denote transformations of skewness and kurtosis, designed to make these statistics closer to the standard normal. The resulting test is (see Doornik and Hansen, 1994):

z12+z22 app~ c2( 2) .
(eq:13.19)

Applied to residuals, this test gives appropriate rejection frequencies under the null of normality, using c2(2) critical values: see §13.7.7.

13.7.5 Test for heteroscedasticity using squares

This test (called the hetero test in PcGets' output) is based on White (1980), and involves an auxiliary regression of {e^t2} on the original regressors {xi,t} and all their squares {xi,t2}. The null is unconditional homoscedasticity, and the alternative is that the variance of the {et} process depends on xt and on the xi,t2. Variables that are redundant when squared are automatically removed. The output comprises the F-test based on TR2.

13.7.6 Test for heteroscedasticity using squares and cross-products

White (1980) also suggested this test (called the hetero-x test in PcGets' output) which is only calculated where there is a sufficiently-large sample relative to the number of regressors. It is based on an auxiliary regression of the squared residuals {ût2} on all squares and cross-products of the original regressors (that is, on r=½k( k+1) variables). Then, if T>>k( k+1) , the test is calculated; redundant variables are automatically removed. The usual F-approximation to the c2 based on TR2 is reported. This is a general test for heteroscedastic errors: H0 is that the errors are homoscedastic or, if heteroscedasticity is present, it is unrelated to the xs.

Unfortunately, Monte Carlo simulations of its behaviour in fitted conditional econometric equations suggest it should not form part of the test battery in model selection, even in relatively large samples. Godfrey and Orme (1994) also show that this test does not have power against omitted variables.

13.7.7 Small-sample properties of the mis-specification tests

Figure Figure:13.1 displays quantile--quantile (QQ) plots of the empirical distributions of seven potential mis-specification tests when estimating the correct specification, the general model, and the finally-selected model in a Monte Carlo study for 1000 replications with T=100 (from Krolzig and Hendry, 2001).

MCdiag7

Figure:13.1 Selecting mis-specification tests: QQ plots for T=100

Some strong deviations are evident from the theoretical distributions (shown by the diagonals): the portmanteau statistic (see Box and Pierce, 1970) rejects serial independence of the errors too often in the correct specification, never in the general, and too rarely in the final model. The hetero-x test (see White, 1980) often has degrees-of-freedom problems for the GUM, but anyway performs badly for both the true and final models. Such incorrect finite-sample sizes of these two diagnostic tests would induce excess rejections of congruent GUMs, resulting in an increased overall size: thus, the portmanteau and the hetero-x diagnostics should be excluded from any battery of test statistics. Overall, therefore, we recommend the five mis-specification tests shown in table 13.1.

Table:13.1 Test battery

Test Alternative Statistic Sources
Chow(tT) Predictive failure over a subset Chow (1960, p.594-595),
of (1-t) T obs.F((1-t) T,tT-k) Hendry (1979)
portmanteau(r) r-th order residual autocorrelation c2(r) Box and Pierce (1970)
normality test Skewness and excess kurtosis c2(2) Jarque and Bera (1980),
Doornik and Hansen (1994)
AR 1-p test p-th order residual autocorrelation F(p,T-k-p) Godfrey (1978),
Harvey (1981, p.173)
ARCH 1-p test p-th order autoregressive Engle (1982b),
conditional heteroskedasticity F(p,T-k-p) Engle, Hendry and Trumbull (1985)
hetero test Heteroscedasticity quadratic White (1980),
in regressors xi2 F(q,T-k-q-1) Nicholls and Pagan (1983)
There are T observations and k regressors in the model under the null.
The value of q may differ across statistics, as may those of k and T across models.
By default, PcGets sets p=4, r=12, and compute two Chow tests at t1=[0.5T]/T and t2=[0.9T]/T.

Figure:13.2 compares the empirical distributions of these test statistics in large and small samples (T=1000 and T=100) for the DGP and GUM. The first two rows of graphs demonstrate that the test distributions are unaffected by the presence of (strongly-exogenous) nuisance regressors (i.e., variables that enter the GUM but not the DGP). For small samples (the second two rows), the properties of the mis-specification tests are still satisfactory -- except for the hetero test -- and are close to the distributions of those tests under the null of the DGP.

For any given model, PcNaive could be used to simulate the rejection frequencies under the null of the tests to be applied, simulating a GUM similar to that under analysis.

MCdiag

Figure:13.2 Mis-specification test distributions in large and small samples

13.8 Linear restrictions test

Writing the model in matrix form as y=Xb+e, the null hypothesis of p linear restrictions can be expressed as H0: Rb=r, with R a (p×k) matrix and r a p×1 vector. This test is well explained in most econometrics textbooks, and uses the unrestricted estimates (that is, it is a Wald test).

13.9 Exclusion restrictions

This subset form of the linear restrictions tests is for: H0: bi=...=bj=0: any choice of coefficients can be made, so a wide range of specification hypothesis can be tested.

13.10 Tests for omitted variables

This tests if any variables of the GUM which has been dropped by GETS or GETSIVE should be added to the selected model. The model itself remains unchanged. If the model is written in matrix form as

y=Xb+Wg+e,

then H0: g=0 is being tested. The test exploits the fact that on H0:

(T)½g^  ®  D    Np[ 0,s2( W'MXW/T) -1] where MX=IT-X( X'X) -1X',
(eq:13.20)

then for p added variables.:

g^'( W'MXW) g^

s^e2
.
T-k-p

p
~F( p,T-k-p) .
(eq:13.21)

Since ( X'X) -1 is already calculated, the F-statistic is easily computed by partitioned inversion. Computations for IVE are only slightly more involved.

13.11 Encompassing tests

Encompassing tests are used to select between final-selected models from path searches (see Cox, 1961, Pesaran, 1974, Ericsson, 1983, Mizon, 1984, Mizon and Richard, 1986, and Hendry and Richard, 1989). The F-test is for each model parsimoniously encompassing their union. This is the only encompassing test which is invariant to the choice of common regressors in models. Thus, the F -test yields the same numerical outcome for the first model parsimoniously encompassing either the union of the two models under consideration, or the orthogonal complement to the first model relative to the union.

Let the first model (M1) under consideration have k1+k2 regressors ( x1,t,x2,t) and the second (M2) have k2+k3 ( x2,t,x3,t) , so the x2,t are in common, and their union (MU) comprises the k=k1+k2+k3 non-redundant set:

x1,t,x2,t,x3,t.

Let RSS 1, RSS2 and RSSU denote the residual sums of squares from M1, M2 and MU respectively; and RSSG from the GUM. Then, the encompassing test of M1E  M2 is equivalent to testing the parsimonious encompassing M1EpMU. For a sample of size T, PcGets uses the F-test:

h4=
( RSS1-RSSU) /k3

RSSU/(T-k)
app~   F( T-k,k3) on H0.
(eq:13.22)

A similar construction is used for the other encompassing tests.

References

Akaike, A. (1973). "Information theory and an extension of the maximum likelihood principle" In Petrov, B. N., and Saki, F. L.(eds.), Second International Symposium of Information Theory. Budapest.

Akaike, H. (1985). "Prediction and entropy" In Atkinson, A. C., and Fienberg, S. E.(eds.), A Celebration of Statistics, pp. 1--24. New York: Springer-Verlag.

Akerlof, G. A. (1979). "Irving Fisher on his head: The consequences of constant target-threshold monitoring of money holdings" Quarterly Journal of Economics, 93, 169--188.

Amemiya, T. (1980). "Selection of regressors" International Economic Review, 21, 331--354.

Anderson, T. W. (1971). The Statistical Analysis of Time Series. New York: John Wiley & Sons.

Andrews, D. W. K. (1991). "Heteroskedasticity and autocorrelation consistent covariance matrix estimation" Econometrica, 59, 817--858.

Banerjee, A., Dolado, J. J., Galbraith, J. W., and Hendry, D. F. (1993). Co-integration, Error Correction and the Econometric Analysis of Non-Stationary Data. Oxford: Oxford University Press.

Bårdsen, G. (1989). "The estimation of long run coefficients from error correction models" Oxford Bulletin of Economics and Statistics, 50.

Bean, C. R. (1977). "More consumers' expenditure equations" Academic panel paper (77)35, H.M. Treasury, London.

Bean, C. R. (1978). "The determination of consumers' expenditure in the UK" Government economic service working paper 4, H.M. Treasury, London.

Bontemps, C., and Mizon, G. E. (2001). "Congruence and encompassing" In Stigum, B.(ed.), Studies in Economic Methodology. Cambridge, Mass.: MIT Press.

Boswijk, H. P. (1992). Cointegration, Identification and Exogeneity, Vol. 37 of Tinbergen Institute Research Series. Amsterdam: Thesis Publishers.

Bowman, K. O., and Shenton, L. R. (1975). "Omnibus test contours for departures from normality based on Öb1 and b2" Biometrika, 62, 243--250.

Box, G. E. P., and Jenkins, G. M. (1976). Time Series Analysis, Forecasting and Control. San Francisco: Holden-Day. First published, 1970.

Box, G. E. P., and Pierce, D. A. (1970). "Distribution of residual autocorrelations in autoregressive-integrated moving average time series models" Journal of the American Statistical Association, 65, 1509--1526.

Breusch, T. S. (1990). "Simplified extreme bounds" in Granger 1990, pp. 72--81.

Brown, R. L., Durbin, J., and Evans, J. M. (1975). "Techniques for testing the constancy of regression relationships over time (with discussion)" Journal of the Royal Statistical Society B, 37, 149--192.

Burns, A. F., and Mitchell, W. C. (1946). Measuring Business Cycles. New York: NBER.

Campos, J., and Ericsson, N. R. (1999). "Constructive data mining: Modeling consumers' expenditure in Venezuela" Econometrics Journal, 2, 226--240.

Carruth, A., and Henley, A. (1990). "Can existing consumption functions forecast consumer spending in the late 1980s?" Oxford Bulletin of Economics and Statistics, 52, 211--222.

Chatfield, C. (1995). "Model uncertainty, data mining and statistical inference" Journal of the Royal Statistical Society, A, 158, 419--466. With discussion.

Chow, G. C. (1960). "Tests of equality between sets of coefficients in two linear regressions" Econometrica, 28, 591--605.

Chow, G. C. (1981). "Selection of econometric models by the information criteria" In Charatsis, E. G.(ed.), Proceedings of the Econometric Society European Meeting 1979, Ch. 8. Amsterdam: North-Holland.

Clayton, M. K., Geisser, S., and Jennings, D. E. (1986). "A comparison of several model selection procedures" In Goel, P., and Zellner, A.(eds.), Bayesian Inference and Decision Techniques: Elsevier Science.

Clements, M. P., and Hendry, D. F. (1998). Forecasting Economic Time Series. Cambridge: Cambridge University Press.

Clements, M. P., and Hendry, D. F. (1999a). Forecasting Non-stationary Economic Time Series. Cambridge, Mass.: MIT Press.

Clements, M. P., and Hendry, D. F. (1999b). "Modelling methodology and forecast failure" Unpublished typescript, Economics Department, University of Oxford.

Coen, P. G., Gomme, E. D., and Kendall, M. G. (1969). "Lagged relationships in economic forecasting" Journal of the Royal Statistical Society A, 132, 133--163.

Cox, D. R. (1961). "Tests of separate families of hypotheses" In Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Vol. 1, pp. 105--123 Berkeley: University of California Press.

Cox, D. R. (1962). "Further results on tests of separate families of hypotheses" Journal of the Royal Statistical Society B, 24, 406--424.

Cran, G. W., Martin, K. J., and Thomas, G. E. (1977). "A remark on algorithms. AS 63: The incomplete beta integral. AS 64: Inverse of the incomplete beta function ratio" Applied Statistics, 26, 111--112.

D'Agostino, R. B. (1970). "Transformation to normality of the null distribution of g1" Biometrika, 57, 679--681.

Davidson, J. E. H., and Hendry, D. F. (1981). "Interpreting econometric evidence: The behaviour of consumers' expenditure in the UK" European Economic Review, 16, 177--192. Reprinted in Hendry, D. F., op. cit., (1993) and (2000).

Davidson, J. E. H., Hendry, D. F., Srba, F., and Yeo, J. S. (1978). "Econometric modelling of the aggregate time-series relationship between consumers' expenditure and income in the United Kingdom" Economic Journal, 88, 661--692. Reprinted in Hendry, D. F., op. cit., (1993) and (2000).

Deaton, A. S. (1982). "Model selection procedures or, does the consumption function exist" In Chow, G. C., and Corsi, P.(eds.), Evaluating the Reliability of Macro-Economic Models, Ch. 5. New York: John Wiley.

Doan, T., Litterman, R., and Sims, C. A. (1984). "Forecasting and conditional projection using realistic prior distributions" Econometric Reviews, 3, 1--100.

Doornik, J. A. (1999). Object-Oriented Matrix Programming using Ox 3rd ed. London: Timberlake Consultants Press.

Doornik, J. A., and Hansen, H. (1994). "A practical test for univariate and multivariate normality" Discussion paper, Nuffield College.

Doornik, J. A., and Hendry, D. F. (1996). GiveWin: An Interactive Empirical Modelling Program. London: Timberlake Consultants Press.

Doornik, J. A., and Hendry, D. F. (2001a). Econometric Modelling using PcGive 10, Volume II. London: Timberlake Consultants Press.

Doornik, J. A., and Hendry, D. F. (2001b). Interactive Monte Carlo Experimentation in Econometrics using PcNaive. London: Timberlake Consultants Press.

Engle, R. F. (1982a). "Autoregressive conditional heteroscedasticity, with estimates of the variance of United Kingdom inflation" Econometrica, 50, 987--1007.

Engle, R. F. (1982b). "Autoregressive conditional heteroskedasticity with estimates of the variance of UK inflation". 50, 987--1008.

Engle, R. F., Hendry, D. F., and Richard, J.-F. (1983). "Exogeneity" Econometrica, 51, 277--304. Reprinted in Hendry, D. F., Econometrics: Alchemy or Science? Oxford: Blackwell Publishers, 1993, and Oxford University Press, 2000; and in Ericsson, N. R. and Irons, J. S. (eds.) Testing Exogeneity, Oxford: Oxford University Press, 1994.

Engle, R. F., Hendry, D. F., and Trumbull, D. (1985). "Small sample properties of ARCH estimators and tests" Canadian Journal of Economics, 43, 66--93.

Engle, R. F., and White, H.(eds.)(1999). Cointegration, Causality and Forecasting. Oxford: Oxford University Press.

Ericsson, N. R. (1983). "Asymptotic properties of instrumental variables statistics for testing non-nested hypotheses" Review of Economic Studies, 50, 287--303.

Ericsson, N. R., Campos, J., and Tran, H.-A. (1990). "PC-GIVE and David Hendry's econometric methodology" Revista De Econometria, 10, 7--117.

Ericsson, N. R., Hendry, D. F., and Prestwich, K. M. (1998). "The demand for broad money in the United Kingdom, 1878--1993" Scandinavian Journal of Economics, 100, 289--324.

Ericsson, N. R., and Irons, J. S.(eds.)(1994). Testing Exogeneity. Oxford: Oxford University Press.

Faust, J., and Whiteman, C. H. (1997). "General-to-specific procedures for fitting a data-admissible, theory-inspired, congruent, parsimonious, encompassing, weakly-exogenous, identified, structural model of the DGP: A translation and critique" Carnegie--Rochester Conference Series on Public Policy, 47, 121--161.

Friedman, M., and Schwartz, A. J. (1982). Monetary Trends in the United States and the United Kingdom: Their Relation to Income, Prices, and Interest Rates, 1867--1975. Chicago: University of Chicago Press.

Frisch, R., and Waugh, F. V. (1933). "Partial time regression as compared with individual trends" Econometrica, 1, 221--223.

Gilbert, C. L. (1986). "Professor Hendry's econometric methodology" Oxford Bulletin of Economics and Statistics, 48, 283--307. Reprinted in Granger, C. W. J. (ed.) (1990), Modelling Economic Series. Oxford: Clarendon Press.

Godfrey, L. G. (1978). "Testing for higher order serial correlation in regression equations when the regressors include lagged dependent variables" Econometrica, 46, 1303--1313.

Godfrey, L. G., and Orme, C. D. (1994). "The sensitivity of some general checks to omitted variables in the linear model" International Economic Review, 35, 489--506.

Godfrey, L. G., and Veale, M. R. (1999). "Alternative approaches to testing by variable addition" Mimeo, York University, UK.

Gourieroux, C., and Monfort, A. (1995). "Testing, encompassing, and simulating dynamic econometric models" Econometric Theory, 11, 195--228.

Granger, C. W. J. (1969). "Investigating causal relations by econometric models and cross-spectral methods" Econometrica, 37, 424--438.

Granger, C. W. J.(ed.)(1990). Modelling Economic Series. Oxford: Clarendon Press.

Haavelmo, T. (1944). "The probability approach in econometrics" Econometrica, 12, 1--118. Supplement.

Hannan, E. J., and Quinn, B. G. (1979). "The determination of the order of an autoregression" Journal of the Royal Statistical Society, B, 41, 190--195.

Harris, R. I. D. (1995). Using Cointegration Analysis in Econometric Modelling. London: Prentice Hall.

Harvey, A. C. (1981). The Econometric Analysis of Time Series. Deddington: Philip Allan.

Harvey, A. C. (1990). The Econometric Analysis of Time Series, 2nd ed. Hemel Hempstead: Philip Allan.

Hendry, D. F. (1976). "The structure of simultaneous equations estimators" Journal of Econometrics, 4, 51--88. Reprinted in Hendry, D. F., op. cit., (1993) and (2000).

Hendry, D. F. (1979). "Predictive failure and econometric modelling in macro-economics: The transactions demand for money" In Ormerod, P.(ed.), Economic Modelling, pp. 217--242. London: Heinemann. Reprinted in Hendry, D. F., op. cit., (1993) and (2000).

Hendry, D. F. (1980). "Econometrics: Alchemy or science?" Economica, 47, 387--406. Reprinted in Hendry, D. F., Econometrics: Alchemy or Science? Oxford: Blackwell Publishers, 1993, and Oxford University Press, 2000.

Hendry, D. F. (1985). "Monetary economic myth and econometric reality" Oxford Review of Economic Policy, 1, 72--84. Reprinted in Hendry, D. F., op. cit., (1993) and (2000).

Hendry, D. F. (1994). "HUS revisited" Oxford Review of Economic Policy, 10, 86--106.

Hendry, D. F. (1995a). Dynamic Econometrics. Oxford: Oxford University Press.

Hendry, D. F. (1995b). "Econometrics and business cycle empirics" Economic Journal, 105, 1622--1636.

Hendry, D. F. (1996). "On the constancy of time-series econometric equations" Economic and Social Review, 27, 401--422.

Hendry, D. F. (1997). "On congruent econometric relations: A comment" Carnegie--Rochester Conference Series on Public Policy, 47, 163--190.

Hendry, D. F. (1999). "An econometric analysis of US food expenditure, 1931--1989" in Magnus, and Morgan 1999, pp. 341--361.

Hendry, D. F. (2000a). Econometrics: Alchemy or Science? Oxford: Oxford University Press. New Edition.

Hendry, D. F. (2000b). "Epilogue: The success of general-to-specific model selection" In Econometrics: Alchemy or Science?, pp. 467--490. Oxford: Oxford University Press. New Edition.

Hendry, D. F., and Doornik, J. A. (1994). "Modelling linear dynamic econometric systems" Scottish Journal of Political Economy, 41, 1--33.

Hendry, D. F., and Doornik, J. A. (1999). "The impact of computational tools on time-series econometrics" In Coppock, T.(ed.), Information Technology and Scholarship, pp. 257--269. Oxford: Oxford University Press.

Hendry, D. F., and Doornik, J. A. (2001). Econometric Modelling using PcGive 10: Volume I. London: Timberlake Consultants Press.

Hendry, D. F., and Ericsson, N. R. (1991a). "An econometric analysis of UK money demand in `Monetary Trends in the United States and the United Kingdom' by Milton Friedman and Anna J. Schwartz" American Economic Review, 81, 8--38.

Hendry, D. F., and Ericsson, N. R. (1991b). "Modeling the demand for narrow money in the United Kingdom and the United States" European Economic Review, 35, 833--886.

Hendry, D. F., and Krolzig, H.-M. (1999a). "General-to-specific model selection using PcGets for Ox" Unpublished paper, Economics Department, Oxford University.

Hendry, D. F., and Krolzig, H.-M. (1999b). "Improving on `Data mining reconsidered' by K.D. Hoover and S.J. Perez" Econometrics Journal, 2, 202--219.

Hendry, D. F., and Krolzig, H.-M. (2000). "The econometrics of general-to-simple modelling" Mimeo, Economics Department, Oxford University.

Hendry, D. F., Leamer, E. E., and Poirier, D. J. (1990). "A conversation on econometric methodology" Econometric Theory, 6, 171--261.

Hendry, D. F., and Mizon, G. E. (1978). "Serial correlation as a convenient simplification, not a nuisance: A comment on a study of the demand for money by the Bank of England" Economic Journal, 88, 549--563. Reprinted in Hendry, D. F., op. cit., (1993) and (2000).

Hendry, D. F., and Mizon, G. E. (1990). "Procrustean econometrics: or stretching and squeezing data" in Granger 1990, pp. 121--136.

Hendry, D. F., and Mizon, G. E. (1993). "Evaluating dynamic econometric models by encompassing the VAR" In Phillips, P. C. B.(ed.), Models, Methods and Applications of Econometrics, pp. 272--300. Oxford: Basil Blackwell.

Hendry, D. F., and Mizon, G. E. (1999). "The pervasiveness of Granger causality in econometrics" in Engle, and White 1999.

Hendry, D. F., and Mizon, G. E. (2000). "Reformulating empirical macro-econometric modelling" Oxford Review of Economic Policy, 16, 138--159.

Hendry, D. F., and Morgan, M. S. (1995). The Foundations of Econometric Analysis. Cambridge: Cambridge University Press.

Hendry, D. F., Muellbauer, J. N. J., and Murphy, T. A. (1990). "The econometrics of DHSY" In Hey, J. D., and Winch, D.(eds.), A Century of Economics, pp. 298--334. Oxford: Basil Blackwell.

Hendry, D. F., and Neale, A. J. (1987). "Monte Carlo experimentation using PC-NAIVE" In Fomby, T., and Rhodes, G. F.(eds.), Advances in Econometrics, Vol. 6, pp. 91--125. Greenwich, Connecticut: Jai Press Inc.

Hendry, D. F., and Richard, J.-F. (1982). "On the formulation of empirical models in dynamic econometrics" Journal of Econometrics, 20, 3--33. Reprinted in Granger, C. W. J. (ed.) (1990), Modelling Economic Series. Oxford: Clarendon Press and in Hendry D. F., op. cit., (1993) and (2000).

Hendry, D. F., and Richard, J.-F. (1989). "Recent developments in the theory of encompassing" In Cornet, B., and Tulkens, H.(eds.), Contributions to Operations Research and Economics. The XXth Anniversary of CORE, pp. 393--440. Cambridge, MA: MIT Press.

Hendry, D. F., and von Ungern-Sternberg, T. (1981). "Liquidity and inflation effects on consumers' expenditure" In Deaton, A. S.(ed.), Essays in the Theory and Measurement of Consumers' Behaviour, pp. 237--261. Cambridge: Cambridge University Press. Reprinted in Hendry, D. F., op. cit., (1993) and (2000).

Hendry, D. F., and Wallis, K. F.(eds.)(1984). Econometrics and Quantitative Economics. Oxford: Basil Blackwell.

Hoover, K. D., and Perez, S. J. (1999). "Data mining reconsidered: Encompassing and the general-to-specific approach to specification search" Econometrics Journal, 2, 167--191.

Jarque, C. M., and Bera, A. K. (1980). "Efficient tests for normality, homoscedasticity and serial independence of regression residuals" Economics Letters, 6, 255--259.

Johansen, S. (1988). "Statistical analysis of cointegration vectors" Journal of Economic Dynamics and Control, 12, 231--254. Reprinted in R.F. Engle and C.W.J. Granger (eds), Long-Run Economic Relationships, Oxford: Oxford University Press, 1991, 131--52.

Johansen, S. (1992). "Testing weak exogeneity and the order of cointegration in UK money demand" Journal of Policy Modeling, 14, 313--334.

Judge, G. G., and Bock, M. E. (1978). The Statistical Implications of Pre-Test and Stein-Rule Estimators in Econometrics. Amsterdam: North Holland Publishing Company.

Judge, G. G., Griffiths, W. E., Hill, R. C., Lütkepohl, H., and Lee, T.-C. (1985). The Theory and Practice of Econometrics, 2nd ed. New York: John Wiley.

Kent, J. T. (1986). "The underlying nature of nonnested hypothesis tests" Biometrika, 73, 333--343.

Keynes, J. M. (1939). "Professor Tinbergen's method" Economic Journal, 44, 558--568.

Keynes, J. M. (1940). "Comment" Economic Journal, 50, 154--156.

Kiviet, J. F. (1985). "Model selection test procedures in a single linear equation of a dynamic simultaneous system and their defects in small samples" Journal of Econometrics, 28, 327--362.

Kiviet, J. F. (1986). "On the rigor of some mis-specification tests for modelling dynamic relationships" Review of Economic Studies, 53, 241--261.

Koopmans, T. C. (1947). "Measurement without theory" Review of Economics and Statistics, 29, 161--179.

Koopmans, T. C., Rubin, H., and Leipnik, R. B. (1950). "Measuring the equation systems of dynamic economics" In Koopmans, T. C.(ed.), Statistical Inference in Dynamic Economic Models, No. 10 in Cowles Commission Monograph, Ch. 2. New York: John Wiley & Sons.

Krolzig, H.-M. (2001). "General-to-specific reductions of vector autoregressive processes" Economics discussion paper 2000-W34, Nuffield College, Oxford..

Krolzig, H.-M., and Hendry, D. F. (2001). "Computer automation of general-to-specific model selection procedures" Journal of Economic Dynamics and Control, 25, 831--866.

Leamer, E. E. (1978). Specification Searches. Ad-Hoc Inference with Non-Experimental Data. New York: John Wiley.

Leamer, E. E. (1983a). "Let's take the con out of econometrics" American Economic Review, 73, 31--43. Reprinted in Granger, C. W. J. (ed.) (1990), Modelling Economic Series. Oxford: Clarendon Press.

Leamer, E. E. (1983b). "Model choice and specification analysis" In Griliches, Z., and Intriligator, M. D.(eds.), Handbook of Econometrics, Vol. 1, Ch. 5. Amsterdam: North-Holland.

Leamer, E. E. (1984). "Global sensitivity results for generalized least squares estimates" Journal of the American Statistical Association, 79, 867--870.

Leamer, E. E. (1990). "Sensitivity analyses would help" in Granger 1990, pp. 88--96.

Ljung, G. M., and Box, G. E. P. (1978). "On a measure of lack of fit in time series models" Biometrika, 65, 297--303.

Lovell, M. C. (1983). "Data mining" Review of Economics and Statistics, 65, 1--12.

Lütkepohl, H. (1991). Introduction to Multiple Time Series Analysis. Berlin: Springer.

Magnus, J. R., and Morgan, M. S.(eds.)(1999). Methodology and Tacit Knowledge: Two Experiments in Econometrics. Chichester: John Wiley and Sons.

Majunder, K. L., and Bhattacharjee, G. P. (1973a). "Algorithm AS 63. The incomplete beta integral" Applied Statistics, 22, 409--411.

Majunder, K. L., and Bhattacharjee, G. P. (1973b). "Algorithm AS 64. Inverse of the incomplete beta function ratio" Applied Statistics, 22, 411--414.

Mayo, D. (1981). "Testing statistical testing" In Pitt, J. C.(ed.), Philosophy in Economics, pp. 175--230: D. Reidel Publishing Co. Reprinted as pp. 45--73 in Caldwell B. J. (1993), The Philosophy and Methodology of Economics, Vol. 2, Aldershot: Edward Elgar.

McAleer, M., Pagan, A. R., and Volker, P. A. (1985). "What will take the con out of econometrics?" American Economic Review, 95, 293--301. Reprinted in Granger, C. W. J. (ed.) (1990), Modelling Economic Series. Oxford: Clarendon Press.

Mizon, G. E. (1977a). "Inferential procedures in nonlinear models: An application in a UK industrial cross section study of factor substitution and returns to scale" Econometrica, 45, 1221--1242.

Mizon, G. E. (1977b). "Model selection procedures" In Artis, M. J., and Nobay, A. R.(eds.), Studies in Modern Economic Analysis, pp. P97--120. Oxford: Basil Blackwell.

Mizon, G. E. (1984). "The encompassing approach in econometrics" in Hendry, and Wallis 1984, pp. 135--172.

Mizon, G. E. (1995). "Progressive modelling of macroeconomic time series: the LSE methodology" In Hoover, K. D.(ed.), Macroeconometrics: Developments, Tensions and Prospects, pp. 107--169. Dordrecht: Kluwer Academic Press.

Mizon, G. E., and Richard, J.-F. (1986). "The encompassing principle and its application to non-nested hypothesis tests" Econometrica, 54, 657--678.

Moore, H. L. (1914). Economic Cycles -- Their Law and Cause. New York: MacMillan.

Muellbauer, J. N. J. (1994). "The assessment: Consumer expenditure" Oxford Review of Economic Policy, 10, 1--41.

Nicholls, D. F., and Pagan, A. R. (1983). "Heteroscedasticity in models with lagged dependent variables" Econometrica, 51, 1233--1242.

Pagan, A. R. (1984). "Model evaluation by variable addition" in Hendry, and Wallis 1984, pp. 103--135.

Pagan, A. R. (1987). "Three econometric methodologies: A critical appraisal" Journal of Economic Surveys, 1, 3--24. Reprinted in Granger, C. W. J. (ed.) (1990), Modelling Economic Series. Oxford: Clarendon Press.

Paroulo, P. (1996). "On the determination of integration indices in I(2) systems" Journal of Econometrics, 72, 313--356.

Pesaran, M. H. (1974). "On the general problem of model selection" Review of Economic Studies, 41, 153--171.

Pesaran, M. H.(ed.)(1987). The Limits of Rational Expectations. Oxford: Basil Blackwell.

Pike, M. C., and Hill, I. D. (1966). "Logarithm of the gamma function" Communications of the ACM, 9, 684.

Rahbek, A., Kongsted, H. C., and Jørgensen, C. (1999). "Trend-stationarity in the I(2) cointegration model" Journal of Econometrics, 90, 265--289.

Sargan, J. D. (1964). "Wages and prices in the United Kingdom: A study in econometric methodology (with discussion)" In Hart, P. E., Mills, G., and Whitaker, J. K.(eds.), Econometric Analysis for National Economic Planning, Vol. 16 of Colston Papers, pp. 25--63. London: Butterworth Co. Reprinted as pp. 275--314 in Hendry D. F. and Wallis K. F. (eds.) (1984). Econometrics and Quantitative Economics. Oxford: Basil Blackwell, and as pp. 124--169 in Sargan J. D. (1988), Contributions to Econometrics, Vol. 1, Cambridge: Cambridge University Press.

Sargan, J. D. (1973). "Model building and data mining" Discussion paper, London School of Economics. Presented to the Association of University Teachers of Economics, Meeting, Manchester, April 1973.

Sargan, J. D. (1980). "Some tests of dynamic specification for a single equation" Econometrica, 48, 879--897. Reprinted as pp. 191--212 in Sargan J. D. (1988), Contributions to Econometrics, Vol. 1, Cambridge: Cambridge University Press.

Sargan, J. D. (1981). "The choice between sets of regressors" Mimeo, Economics Department, London School of Economics.

Savin, N. E. (1984). "Multiple hypothesis testing" In Griliches, Z., and Intriligator, M. D.(eds.), Handbook of Econometrics, Vol. 2--3, Ch. 14. Amsterdam: North-Holland.

Sawa, T. (1978). "Information criteria for discriminating among alternative regression models" Econometrica, 46, 1273--1292.

Schwarz, G. (1978). "Estimating the dimension of a model" Annals of Statistics, 6, 461--464.

Shea, B. L. (1988). "Algorithm AS 239: Chi-squared and incomplete gamma integral" Applied Statistics, 37, 466--473.

Shenton, L. R., and Bowman, K. O. (1977). "A bivariate model for the distribution of Öb1 and b2" Journal of the American Statistical Association, 72, 206--211.

Shibata, R. (1980). "Asymptotically efficient selection of the order of the model for estimating parameters of a linear process" Annals of Statistics, 8, 147--164.

Sims, C. A. (1980). "Macroeconomics and reality" Econometrica, 48, 1--48. Reprinted in Granger, C. W. J. (ed.) (1990), Modelling Economic Series. Oxford: Clarendon Press.

Sims, C. A., Stock, J. H., and Watson, M. W. (1990). "Inference in linear time series models with some unit roots" Econometrica, 58, 113--144.

Smith, G. W. (1986). "A dynamic Baumol-Tobin model of money demand" Review of Economic Studies, 53, 465--469.

Spanos, A. (1989). "On re-reading Haavelmo: A retrospective view of econometric modeling" Econometric Theory, 5, 405--429.

Sullivan, R., Timmermann, A., and White, H. (1998). "Dangers of data-driven inference: The case of calendar effects in stock returns" Mimeo, Economics Department, University of California at San Diego.

Summers, L. H. (1991). "The scientific illusion in empirical macroeconomics" Scandinavian Journal of Economics, 93, 129--148.

Teräsvirta, T. (1976). "Effect of feedback on the distribution of the portmanteau statistic" Manuscript, London School of Economics.

Theil, H. (1971). Principles of Econometrics. London: John Wiley.

Tinbergen, J. (1940a). Statistical Testing of Business-Cycle Theories. Geneva: League of Nations. Vol. I: A Method and its application to Investment Activity.

Tinbergen, J. (1940b). Statistical Testing of Business-Cycle Theories. Geneva: League of Nations. Vol. II: Business Cycles in the United States of America, 1919--1932.

Vuong, Q. H. (1989). "Likelihood ratio tests for model selection and nonnested hypotheses" Econometrica, 50, 1--25.

White, H. (1980). "A heteroskedastic-consistent covariance matrix estimator and a direct test for heteroskedasticity" Econometrica, 48, 817--838.

White, H. (1984). Asymptotic Theory for Econometricians. London: Academic Press.

White, H. (1990). "A consistent model selection" in Granger 1990, pp. 369--383.

Wooldridge, J. M. (1999). "Asymptotic properties of some specification tests in linear models with integrated processes" in Engle, and White 1999, pp. 366--384.

Wright, P. G. (1915). "Moore's economic cycles" Quarterly Journal of Economics, 29, 631--641.

Yancey, T. A., and Judge, G. G. (1976). "A Monte Carlo comparison of traditional and stein-rule estimators under squared error loss" Journal of Econometrics, 4, 285--294.