Model Estimation Statistics

Contents

12 Model Estimation Statistics
  12.1 Introduction
  12.2 Model formulation
  12.3 OLS estimation
  12.3.1 The algebra of ordinary least squares
  12.3.2 *Correlations
  12.3.3 Estimated regression equation
  12.3.4 Variances
  12.3.4.1 Standard errors of the regression coefficients
  12.3.5 Some intermediate algebra
  12.3.6 Functions of normal variables: c2, tand F distributions
  12.3.7 Distributional results
  12.3.8 t-values and t-probability
  12.3.9 Subsets of parameters
  12.3.10 *Parameter reliability statistics
  12.3.11 Log-likelihood (LogLik)
  12.3.12 R2: squared multiple correlation coefficient
  12.3.13 Equation standard error (s^e)
  12.3.14 Residual sum of squares (RSS)
  12.3.15 Information criteria
  12.4 Recursive OLS estimation
  12.5 Forecasting
  12.5.1 Analysis of 1-step forecasts
  12.5.2 Dynamic forecasting
  12.6 Instrumental variables estimation
  12.6.1 The algebra of IVE
  12.6.2 Testing hypotheses on b
  12.6.3 Specification c2
  12.6.4 Recursive IV estimation

Chapter 12 Model Estimation Statistics

12.1 Introduction

Equation estimation is allowed by:

OLS Ordinary Least Squares Estimation
GETS Testimation
IVE Instrumental Variables Estimation
GETSIVE Instrumental Variables Testimation

Once a model has been specified, a sample period selected, and an estimation method chosen, the equation can be estimated. For ease of notation, the sample period is denoted t=1,...,T+H, after allowing for any lagged variables created, where H is the forecast horizon. The data used for estimation are X=( x1...xT) whereas the H retained observations XH=( xT+1...xT+H) are used for in-sample forecasts. Recursive estimation methods are initialized by a direct estimation over t=1,...,M-1, followed by recursive estimation over t=M,...,T (or the reverse for backward recursion).

This chapter discusses the statistics reported by PcGets following model estimation. The next chapter describes the evaluation tools. Sections marked with * denote information that can be requested.

12.2 Model formulation

We first consider the estimation of linear equations by ordinary least squares (OLS) and instrumental variables (IVs), as these are the main methods offered by PcGets. Chapter 10 considered the econometrics of model selection: here we are primarily concerned with the formulae needed to estimate any given specification by OLS or IVE. Below, matrices and vectors are in bold. For distributional results, we assume the data are either stationary or have been transformed to a non-integrated representation.

A regression is the mean of the conditional distribution of one random variable, denoted y, given the values of k others, denoted x, where x=( x1...xk) '. In a multivariate normal distribution, regressions are linear (and homoscedastic), so have the form:

E[ yt½xt] =b'xt=åi=1kbixi,t.
(eq:12.1)

One of the xi,t will generally be a constant (equal to unity), with the corresponding parameter being the intercept in (eq:12.1). b=( b1...bk) 'ÎÂk, is the k×1 parameter vector of interest. Usually, a relation like (eq:12.1) is expressed in model form as (for t=1,...,T):

yt=b'xt+et
(eq:12.2)

where et~IN[ 0,se2] , and is distributed independently of xt, so:

Dy|x( yt½xt) =N[ b'xt,se2] .
(eq:12.3)

The unknown parameters b and se2 (the equation standard error) must be estimated from the available sample of data. Section 12.3 describes how.

Equations like (eq:12.2) could also be postulated without the justification that they are the conditional mean of a normal distribution. If, nevertheless, E[ yt|xt] =b'xt then OLS remains consistent. However, when et is not independent of xt, then sufficient instrumental variables, denoted zt, are required such that E[ et|zt] =0, when rankE[ ztzt'] ³n for n endogenous right-hand side regressors. IV estimators are considered in section 12.6 below.

Since linear models play a major role in Gets, §12.3 formulates regression estimation, beginning with the algebra of ordinary least squares in §12.3.1. Some intermediate results on idempotent matrices and distributions of functions of normal variables in §12.3.5 and §12.3.6 respectively lead in §12.3.7 to the distributional results which underpin inference. Tests on subsets of estimates are developed in §12.3.9, and recursive estimation is noted in §12.4.

Section 12.6 then develops the algebra of IVE estimation and inference, and concludes by noting recursive estimation.

12.3 OLS estimation

The linear model under analysis is given by (eq:12.2). Grouping the observations, so that y'=( y1...yT) and X'=( x1...xT) , which is a T×k matrix with rank( X) =k and e'=( e1...eT) , then:

y=Xb+e with e~NT[ 0,se2I] ,
(eq:12.4)

where E[ X'e] =0. Although conditioning on X is too strong to be justifiable in economics, and essentially entails an experimental setting, most of the results recorded below hold in large samples under much weaker assumptions (see e.g., Hendry, 1995a), and can be extended to integrated processes (see e.g., Sims, Stock and Watson, 1990, and Banerjee, Dolado, Galbraith and Hendry, 1993). The assumptions about e are almost equally strong, but less objectionable in practice given the ability of investigators to formulate congruent models: see the discussion in Chapters 9 and 10.

12.3.1 The algebra of ordinary least squares

The algebra of OLS estimation does not depend on the statistical status of X, and the formulae below -- which are programmed in computer packages -- are often applied even when conditioning is invalid. However, statistical results do depend on the status of X, and the assumptions about e.

The OLS estimator b^ of b is given by:

b^=( X'X) -1X'y.
(eq:12.5)

Letting e^=y-Xb^ then X'(y-Xb^)=X'e^=0. To determine the properties of b^ as an estimator of b, when (eq:12.2) holds, substitute for y from (eq:12.4):

b^=b+( X'X) -1X'e
(eq:12.6)

then taking expectations conditional on X:

E[ ( b^-b) ½X] =E[ ( X'X) -1X'e½X] =( X'X) -1X'E[ e] =0.
(eq:12.7)

Thus, b^ is unbiased for b. More generally, when xt is weakly exogenous for b in (eq:12.2), then b^ is consistent for b.

12.3.2 *Correlations

The correlation matrix (with diagonal equal to unity) is reported for both the general and specific models. The second-moment matrix MXX=T-1X'X, and the matrix of correlations:

ri,j=
mi,j

(mi,imj,j)½
,
(eq:12.8)

are calculated. The eigenvalues of the correlation matrix are also reported.

12.3.3 Estimated regression equation

The first column of output records the names of the variables and the second, the estimated regression coefficients b^ in (eq:12.5). PcGets uses the QR decomposition with partial pivoting rather than the formula in (eq:12.5) to calculate b^, which analytically gives the same result, but is numerically more stable. The QR decomposition of X is X=QR, where Q is T×T and orthogonal (so Q'Q=IT), and R is T×k and upper triangular, so X'X=R'R.

The following three columns give further information about each of the magnitudes described below in §12.3.4 and §12.3.10.

12.3.4 Variances

Next, V[b^] is given by:

E[ ( b^-b) ( b^-b) '½X] = E[ ( X'X) -1X'ee'X( X'X) -1½X]
= ( X'X) -1X'E[ ee'] X( X'X) -1
= se2( X'X) -1.
(eq:12.9)

Further, se2 can be estimated unbiasedly by:

s^e2=
e^'e^

( T-k)
(eq:12.10)

when:

e^=y-Xb^,
(eq:12.11)

and e^'e^= RSS (an acronym for residual sum of squares). In turn, letting V[ .] denote variance, V[b^] can be estimated by:

V[ b^] ^=s^e2( X'X) -1.
(eq:12.12)

12.3.4.1 Standard errors of the regression coefficients

These are obtained from the variance-covariance matrix:

SE[ b^i] =(V[ b^i] ^)½=s^e(dii)½
(eq:12.13)

where dii is the ith diagonal element of ( X'X) -1 and s^u is the standard error of the regression, defined in §12.3.13 below.

12.3.5 Some intermediate algebra

Let M=IT-X( X'X) -1X', which is a symmetric and idempotent T×T matrix, such that M=M', M=M2 and M( IT-M) =0. Further, M annihilates X since MX=0, so that:

My=y-X( X'X) -1X'y=y-Xb^=e^=Me,
(eq:12.14)

where the last equality follows from pre-multiplying (eq:12.4) by M. Consequently:

RSS=y'My=e'Me.
(eq:12.15)

Since M is real and symmetric, let M=HLH' where L is the diagonal matrix of eigenvalues and H is the non-singular matrix of eigenvectors with H'H=IT. By idempotency:

M2=HLH'HLH'=HL2H'=M=HLH',

so L2=L and all the eigenvalues of M are either zero or unity. Thus, rank ( M) =tr( M) =tr( L) so:

rank( M) =tr( IT-X( X'X) -1X') =tr( IT) -tr( ( X'X) -1X'X) =( T-k) .
(eq:12.16)

There are ( T-k) unit and k zero eigenvalues, and M is singular of rank( T-k) .

12.3.6 Functions of normal variables: c2, t and F distributions

Three important functions of normally distributed random variables are distributed as the c2, t, and F distributions. First, we define these three distributions, then consider generalizations. Let Z~N[ 0,1] then:

Z2~c2( 1) ,

where c2( 1) is the chi-squared distribution with one degree of freedom. For a set of k independent random variables Zi~IN[ 0,1] :

åi=1kZi2~c2( k) ,
(eq:12.17)

where c2( k) is the c2-distribution with k degrees of freedom.

Let X~N[ mx,sx2] and h( k) be a c2( k) independently distributed from X:

t=
( X-mx) (k)½

sx(h( k) )½
~t( k) ,

where t( k) is Student's t-distribution with k degrees of freedom.

Thirdly, let h1( k1) and h2( k2) be two independent chi-squareds of k1 and k2 degrees of freedom, then:

f=
[ h1( k1) /k1]

[ h2( k2) /k2]
~F( k1,k2)

where F(k1,k2) is the F-distribution with k1 and k2 degrees of freedom. Note that t(k)2~F(1,k) by using these three results. All of these distributions have been tabulated (in more modern terms, programmed into computer packages), and occur frequently in empirical research, in the sense that an underlying normal distribution is often assumed.

Next, let V~Nk[ m,S] , then:

h( k) =( V-m) 'S-1( V-m) ~c2( k) .
(eq:12.18)

This result follows from the definition of a c2 by noting that any positive definite matrix S can be written as S=HH' where H is a non-singular lower triangular matrix, so that:

H-1( V-m) =z~Nk[ 0,I] ,
(eq:12.19)

and hence from (eq:12.18):

h( k) =z'z=åi=1kzi2~c2( k) .
(eq:12.20)

Partition z' into the k1 and k2 independent components ( z1':z2') , each of which is normal, then for k1+k2=k:

h1( k1) =z1'z1~c2( k1) and h2( k2) =z2'z2~c2( k2)
(eq:12.21)

are also independent.

12.3.7 Distributional results

Since e~NT[0,su2I], then e'e/su2~c2( T) . Because M is singular, we cannot apply the theorems of §12.3.6 on the distributions of functions of normal variables to Me or e'Me. However, collect all the unit eigenvalues of M in the first ( T-k) diagonal elements of L, with the last k diagonal elements being zeros. Let:

n=H'e~NT[ 0,se2H'H] =NT[ 0,se2I]

and consider the quadratic form:

e'Me=e'HLH'e=n'Ln=n1'n1,
(eq:12.22)

where n'=( n1':n2') and n1 and n2 correspond to the unit and zero roots respectively in L, so that n1 denotes the first ( T-k) elements of n corresponding to the unit eigenvalues of M. Then n1~NT-k[0,su2I], and since n~NT[0,su2I], n1 and n2 are distributed independently. Hence:

h2=
e'Me

se2
=
n1'n1

se2
~c2( T-k) .
(eq:12.23)

Thus, an idempotent quadratic form in standardized normal variables is distributed as a c2 with degrees of freedom equal to the rank of the idempotent matrix. Also:

h2=
( T-k) s^e2

se2
so that s^e2~
se2

( T-k)
c2( T-k) .
(eq:12.24)

The properties of s^e2 can be calculated from this last result using the c2-distribution. As h2~c2( T-k) , then E[ h2] =T-k and V[ h2] =2( T-k) .

Next, b^ is a linear function of the normally distributed vector e from (eq:12.4)--(eq:12.9:

b^~Nk[ b,se2( X'X) -1] ,
(eq:12.25)

and letting h1=(b^-b)'( X'X) (b^-b)/se2:

se2h1 = [ e'X( X'X) -1] ( X'X) [ ( X'X) -1X'e]
= e'X( X'X) -1X'e
= e'( IT-M) e.
(eq:12.26)

Since ( IT-M) =H( IT-L) H':

e'( IT-M) e=n'( IT-L) n=n2'n2,

so h1=n2'n2/se2. As n1 and n2 are distributed independently, h1 and h2 are also independent, matching their being c2( k) and c2( T-k) respectively.

Tests of H0: b=0 (or components thereof) follow from these results using the F-distribution, since y=e when b=0, so that on H0:

e'( IT-M) e=y'( IT-M) y=b^'( X'X) b^,
(eq:12.27)

with:

h1=
b^'( X'X) b^

se2
  H0~ c2( k)
(eq:12.28)

and hence:

hb=
( T-k) h1

kh2
=
( T-k) y'( I-M) y

ky'My
=
( T-k) b^'( X'X) b^

kRSS
  H0~ F( k,T-k) .
(eq:12.29)

The last expression for hb is the statistic which is actually computed to test H0 for the GUM. If b¹0 , then from (eq:12.25) the numerator of hb becomes a non-central c2 with non-centrality parameter b'X'Xb³0, whereas the denominator is unchanged. Thus, the statistic hb will on average lead to values larger than the F( k,T-k) anticipated under the null.

Finally, from (eq:12.25) each element b^i of b^ is normally distributed with variance given by su2 times the ith diagonal element dii of ( X'X) -1 and is independent of h2. Thus:

( b^i-bi)

s^e(dii)½
~t( T-k) ,
(eq:12.30)

where t( T-k) denotes Student's t-distribution with ( T-k) degrees of freedom and:

s^e(dii)½=SE[ b^i]
(eq:12.31)

is the standard error of b^i. On the hypothesis H0: bi=0:

ti=
b^i

SE[ b^i]
  H0  ~ t( T-k)
(eq:12.32)

which is a computable statistic from sample evidence alone.

12.3.8 t-values and t-probability

These statistics are conventionally calculated to determine whether individual coefficients are significantly different from zero:

tvalue=
b^i

SE[ b^i]
(eq:12.33)

where the null hypothesis H0 is bi=0. The null hypothesis is rejected if the probability of getting a t-value at least as large is less than the chosen significance level. This probability is given as:

tprob=1-Prob( | t| £| tvalue| )
(eq:12.34)

in which t has a Student t-distribution with T-k degrees of freedom.

When H0 is true (and the model is otherwise correctly specified in a stationary process), a Student t-distribution is used since the sample size is often small, and we only have an estimate of the parameter's standard error: however, as the sample size increases, t tends to a standard normal distribution under H0. Large t-values reject H0; but, in many situations, H0 may be of little interest to test.

12.3.9 Subsets of parameters

Consider estimating a subset of kb parameters bb of b where ka+kb=k. Partition X=( Xa:Xb) and b'=(ba':bb') so that:

y=Xaba+Xbbb+e.
(eq:12.35)

Let Ma=IT-Xa( Xa'Xa) -1Xa' which implies that MaXa=0 then:

May=MaXbbb+Mae
(eq:12.36)

and hence:

b^b=( Xb'MaXb) -1Xb'May.
(eq:12.37)

Consequently, b^b can be calculated by first regressing Xb on Xa and saving the residuals (MaXb), then regressing y on those residuals. Notice that the regressors need `corrected', but the regressand does not. Thus, procedures for `detrending' (for example), can be justified as equivalent to adding a trend to the model (see Frisch and Waugh, 1933, who first proved this famous result). From (eq:12.36) and (eq:12.37):

b^b=bb+( Xb'MaXb) -1Xb'Mae
(eq:12.38)

so that

V[ b^b] =se2( Xb'MaXb) -1.
(eq:12.39)

Thus:

b^b~Nkb[ bb,se2( Xb'MaXb) -1] .
(eq:12.40)

Hypothesis tests about bb follow analogously to the previous section. In particular, from §12.3.7 and (eq:12.40):

hb=
( b^b-bb) '( Xb'MaXb) ( b^b-bb)

kbs^e2
~F( kb,T-k) .
(eq:12.41)

When kb=1, this matches (eq:12.32) under H0: bb=0, and when kb=k, (eq:12.41) reproduces (eq:12.29) under H0: b=0. A useful case of (eq:12.41) is kb=( k-1) and Xa=i (a T×1 vector of ones), so all coefficients other than the intercept are tested.

If, instead of estimating (eq:12.35), Xa is omitted from the model in the incorrect belief that ba=0, the equation to be estimated becomes:

y=Xbbb+e
(eq:12.42)

The resulting estimator of bb, denoted by b~b=( Xb'Xb) -1Xb'y, confounds the effects of Xa and Xb:

b~b = ( Xb'Xb) -1Xb'( Xaba+Xbbb+e)
= Bbaba+bb+( Xb'Xb) -1Xb'e
(eq:12.43)

where Bba=( Xb'Xb) -1Xb'Xa. Thus:

E[ b~b] =Bbaba+bb,
(eq:12.44)

which equals bb if and only if Bbaba=0. Moreover, let Mb have the same form as Ma but using Xb then:

V[ b~b] ^=s~e2( Xb'Xb) -1 where s~e2=
y'Mby

( T-kb)
=
e~'e~

( T-kb)
(eq:12.45)

when:

e~=y-Xbb~b
(eq:12.46)

and:

E[ s~e2] =se2+
ba'Xa'MbXaba

( T-kb)
³se2.
(eq:12.47)

Conventionally, b~b is interpreted as a biased estimator of bb with bias given by Bbaba. The estimated variance matrix in (eq:12.45) may exceed, or be less than, that given by the relevant sub-matrix of (eq:12.12), in the sense that the difference could be positive or negative semi-definite.

12.3.10 *Parameter reliability statistics

If the `sample split analysis' option of the Model Settings dialog is marked, for each parameter ( b1,...,bk) in the final model, a reliability statistic is computed, based on its significance in two overlapping sub-samples as well as the whole sample. The final column records these measures for the bs. Low values may indicate a variable that is significant by chance in the whole sample, or one that is retained to offset a potential diagnostic-test problem.

Beneath the columnar presentation an array of summary statistics is also provided as follows:

12.3.11 Log-likelihood (LogLik)

The log of the value of the likelihood is reported.

12.3.12 R2: squared multiple correlation coefficient

R2=
åt=1T( y^t-y) 2

åt=1T( yt-y) 2
=1-
åt=1Te^t2

åt=1T( yt-y) 2
,
(eq:12.48)

assuming a constant is included. Intercepts should usually be included in the initial selection of xt, and PcGets will decide on its final retention.

12.3.13 Equation standard error ( s^e)

The equation standard error is the square root of the residual variance, which is defined as:

s^e2=
1

T-k
åt=1Te^t2,
(eq:12.49)

where the residuals are defined in (eq:12.11).

12.3.14 Residual sum of squares (RSS)

RSS =åt=1Te^t2.
(eq:12.50)

12.3.15 Information criteria

The three statistics reported are the Schwarz criterion (SC), the Hannan--Quinn (HQ) criterion, and the Akaike criterion (AIC). Here:

AIC = ln s~e2+2k/T,
HQ = ln s~e2+2k ( ln ( ln T) ) /T,
SC = ln s~e2+k ( ln T) /T,
(eq:12.51)

using the maximum likelihood estimate of su2:

s~e2=
T-k

T
s^e2= 1/T åt=1Te^t2.

For a discussion of the use of these and related scalar measures to choose between alternative models in a class, see Judge, Griffiths, Hill, Lütkepohl and Lee (1985).

12.4 Recursive OLS estimation

For recursive estimation, denote the linear equation by:

yt=b'xt+et
(eq:12.52)

where b is assumed to be constant, E[ xtet] =0"t, E[ et2] =se2, and E[ etes] =0 if t¹s. Let the complete sample period be ( 1,...,T) , and consider the least-squares outcome on a sub-sample up to t-1 (for t>k when there are k regressors in xt-1):

b^t-1=( Xt-1'Xt-1) -1Xt-1'yt-1,
(eq:12.53)

with Xt-1=( x1...xt-1) ' and yt-1=( y1...yt-1) '. When the sample is increased by one observation, then:

Xt'Xt =Xt-1'Xt-1+xtxt'
Xt'yt =Xt-1'yt-1+xtyt.

Thus, b^t and RSS t follow directly, from which equation and parameter standard errors are readily calculated:

s^t2=
RSSt

t-k
(eq:12.54)

and:

V[ b^t] =s^t2( Xt'Xt) -1.
(eq:12.55)

A new calculation is the set of innovations, which are the one-step ahead forecast errors:

nt=yt-xt'b^t-1.
(eq:12.56)

These are mean zero, independent random variables, but with heteroscedastic variance.

Finally, from the sequence of {RSSt-1}, sequences of tests (for example, for parameter constancy) can be calculated, based on Chow (1960). Both forwards and backwards recursive estimates are provided, as are moving window estimates (i.e., a fixed sample size).

12.5 Forecasting

Forecasting may be done 1-step ahead or h-steps ahead. The former are ex post (or static): any lagged information required to form forecasts is based on observed values. The latter are ex ante (or dynamic) and will reuse forecasts from previous period(s) if required. Consider the model:

yt=b1yt-1+b2yt-2+b3zt+vt

estimated over t=1,...,T conditional on zt. Assuming a forecast horizon of H=3, we find:

static forecast dynamic forecast
T+1 y^T+1=b^1yT+b^2yT-1+b^3zT+1 y^T+1=b^1yT+b^2yT-1+b^3zT+1
T+2 y^T+2=b^1yT+1+b^2yT+b^3zT+2 y^T+2=b^1y^T+1+b^2yT+b^3zT+2
T+3 y^T+3=b^1yT+2+b^2yT+1+b^3zT+3 y^T+3=b^1y^T+2+b^2y^T+1
                    +b^3zT+3
(eq:12.57)

Both types of forecast require data for t=1,...,T to obtain the coefficient estimates. Beyond that, static forecasts require z for t=T+1,...,T+H and y up to T+H-1, whereas dynamic forecasts only need z for t=T+1,...,T+H. In both types of forecast, the parameter estimates are fixed over the forecast period.

To distinguish the two types of forecast, let y^T+i,h denote the h-step forecast made for period T+i (i³h) and based on parameter estimation up to T. The h-step forecasts use actual values for lagged ys which go further back than h periods. Thus, 1-step forecasts, y^T+h,1 (h=1,...,H), always use actual values for lagged ys. Dynamic forecasts, y^T+h,h (h=1,...,H), never use actual values beyond T. Intermediate (h) step forecasts, using the new notation and omitting the zs, are:

1-step       2-step        3-step
T+1  y^T+1,1         -         -
T+2  y^T+2,1 y^T+2,2=b^1y^T+1,1+b^2yT         -
T+3  y^T+3,1 y^T+3,2=b^1y^T+2,1+b^2yT+1 y^T+3,3=b^1y^T+2,2+b^2y^T+1,1
T+i  y^T+i,1  y^T+i,2=b^1y^T+i-1,1+b^2yT+i-2  y^T+i,3=b^1y^T+i-1,2+b^2y^T+i-2,1
(eq:12.58)

Consequently, dynamic forecasts are the sequence of 1,2,...,H step forecasts.

12.5.1 Analysis of 1-step forecasts

Following estimation over t=1,...,T, 1-step forecasts are given by:

[Note: Dynamic forecasts are needed if the xs must also be predicted for the forecast period, necessitating system procedures.]

y^t=xt'b^,  t=T+1,...,T+H,
(eq:12.59)

which requires the observations XH'=(xT+1,...,xT+H). The 1-step forecast error is the mistake made each period:

et=yt-xt'b^,  t=T+1,...,T+H,
(eq:12.60)

which can be written as:

et=xt'b+ut-xt'b^=xt'( b-b^) +ut.
(eq:12.61)

Assuming that E[b^]=b, then E[et]=0 and:

V[ et] =E[ et2] =E[ ( xt'( b-b^) ) 2+ut2] =su2xt'( X'X) -1xt+su2.
(eq:12.62)

This corresponds to the results given for the innovations in recursive estimation, see §12.4. The whole vector of forecast errors is e=( eT+1,...,eT+H) '. V[e] is derived in a similar way:

V[ e] =s2uIH+XHV[ b^] XH'=su2( IH+XH( X'X) -1XH') .
(eq:12.63)

Estimated variances are obtained after replacing su2 by s^u2.

The columns respectively report the date for which the forecast is made, the realized outcome (yt), the forecast (y^t), the forecast error (et=yt-y^t), the standard error of the 1-step forecast (SE[ et] =ÖV[ et] ^), and a t-value (that is, the standardized forecast error et/SE[ et] ).

12.5.2 Dynamic forecasting

To derive expressions for dynamic forecasts, we need to take the lag structure of y into account. Consider a simple model with one lag of the dependent variable:

yt=b1yt-1+b2zt+ut
(eq:12.64)

Using backward substitution for any h ( h=1,...,H) commencing at T we find:

yT+h = b1yT+h-1+b2zT+h+uT+h
= b1( yT+h-2+b2zT+h-1+uT+h-1) +b2zT+h+uT+h
= b1hyT+åj=0h-1b1jb2zT+h-j+åj=0h-1b1juT+h-j.
(eq:12.65)

The forecast at h is:

y^T+h,h=b^1hyT+åj=0h-1 b^1jb^2zT+h-j,
(eq:12.66)

so that the forecast error at h is:

eT+h,h =yT+h- y^T+h,h
=(b1h-b^1h)yT+åj=0h-1(b1jb2- b^1jb^2)zT+h-j+ åj=0h-1b1juT+h-j.
(eq:12.67)

When the parameter uncertainty is negligible, the forecast error is:

eT+h,h=åj=0h-1b1juT+h-j
(eq:12.68)

with variance:

V[ eT+h,h] =E[ åi=0h-1åj=0h-1b1iuT+h-i uT+h-jb1j] =åj=0h-1 b12jsu2
(eq:12.69)

since E[ uiuj] =0 for i¹j and E[ ui2] =su2. Similar expressions hold when there are m lags of the dependent variable. For appropriate formulae when parameter-estimation variances are included, see Doornik and Hendry (2001a).

The model must characterize the economy as accurately in the forecast period as it did over the estimation sample if the forecast errors are to be from the same distribution as that assumed in-sample. This is a strong requirement, and seems unlikely to be met unless the parameters are constant within sample. Clements and Hendry (1998) give forecast-error variances for the whole sequence of forecasts, and Clements and Hendry (1999a) discuss forecasting with non-stationary series.

12.6 Instrumental variables estimation

The method of instrumental variables was originally developed as a computationally-simple approach for handling endogenous regressors in simultaneous-equations models. Although system procedures now abound (see the estimator generating approach in Hendry, 1976), it remains convenient to have methods that are viable for an individual equation.

Here we write the model as:

yt=b0'yt*+b1'wt+et,
(eq:12.70)

in which we have n-1 endogenous variables yt* and q1 non-modelled variables wt on the right-hand side (the latter may include lagged endogenous variables). We assume that we have q2 additional instruments, labelled wt*. Let zt denote the set of all instrumental variables (non-endogenous included regressors, plus additional instruments): zt=(wt':wt*')', which is a vector of length q=q1+q2. Thus, there are k=n-1+q1 coefficients of interest b=(b0':b1')'. Letting xt=(yt*':wt')', with X'=( x1...xT) , y=(y1...yT)', then the model is:

yt=b'xt+et,

or:

y=Xb+e,

to be estimated using q instruments Z'=( z1...zT) .

12.6.1 The algebra of IVE

When E[ yt|xt] ¹xt'b so that E[ xtet] ¹0 in (eq:12.2), then OLS estimation is inconsistent for b. If there is a n×1 vector zt such that n³k and:

E[ yt½zt] =E[ xt½zt] 'b
(eq:12.71)

so E[ zt'et] =0, then zt can be used as an instrumental variable in estimating b (b is not estimable when there are fewer than k instruments). Collecting all the observations:

y=Xb+e.
(eq:12.72)

Pre-multiply (eq:12.72) by Z':

Z'y=Z'Xb+Z'e.
(eq:12.73)

When rank ( T-1Z'X) =k, which is also sufficient to identify b (see White, 1984), and rank( T-1Z'Z) =n, then setting Z'e=0 (noting E[ Z'e] =0):

Z'y=Z'Xb~.

The solution to these n equations in k unknowns is:

b~=( X'Z( Z'Z) -1Z'X) -1X'Z( Z'Z) -1Z'y
(eq:12.74)

which is the generalized instrumental variables estimator. As for OLS, PcGets does not use expression (eq:12.74) directly, but instead uses the QR decomposition to ensure the computation is numerically more stable. If n=k, equation (eq:12.74) simplifies to:

b~=( Z'X) -1Z'y.
(eq:12.75)

Let:

X'=PZ'+V'
(eq:12.76)

denote the `reduced form' of the regressors as a linear function of the instruments, then, as E[ Z'V] =0:

P^=X'Z( Z'Z) -1,

so rank(P^)=k ensures identification (the `quality' of the estimates depends on how `large' P^ is, using a small number of additional instruments).

In large samples, for stationary variables:

(T)½( b~-b)   D®  Nk[ 0,se2H-1] ,

where:

H= plim T®¥[ T-1X'Z] ( T-1Z'Z) -1[ T-1Z'X] .

The variance matrix of b~ can be estimated by:

V[ b~] ~=s~e2( X'Z( Z'Z) -1Z'X) -1
(eq:12.77)

where:

e~=y-Xb~ and s~e2=
e~'e~

T-k
.
(eq:12.78)

However, statistics like R2 are not so easily defined.

The output is closely related to that reported for least squares.

12.6.2 Testing hypotheses on b

A c2 test of b=0 (other than the intercept) can be calculated which has a crude correspondence to the earlier F-test for OLS. On H0: b=0, the reported statistic behaves asymptotically as a c2( k-1) . Let:

xb=b~'( X'Z( Z'Z) -1Z'X) b~.
(eq:12.79)

Then:

xb

s~e2
    app  ~  c2(k)

would test whether all k coefficients are zero. To keep the intercept separate, we compute:

xb-Ty2

s~e2
    app  ~  c2(k-1).
(eq:12.80)

This amounts to using the formula for b~ in xb with y-yi instead of y, where i is a vector of unit elements. An approximate F-test can be formed as in Kiviet (1986).

Tests on subsets follow similarly. Individual `t'-statistics can be based on the ratios of estimated coefficients to their estimated standard errors from (eq:12.77).

12.6.3 Specification c2

This tests for the validity of the choice of the instrumental variables as discussed by Sargan (1964). It is asymptotically distributed as c2(q) when the q over-identifying instruments are independent of the equation error. It is also interpretable as a test of whether the restricted reduced form of the structural model (yt on xt combined with xt on zt) parsimoniously encompasses the unrestricted reduced form (yt on zt directly). Let:

yt=p'zt+ut,

with p^=( Z'Z) -1Z'y being the `reduced form' estimates. From (eq:12.2) and (eq:12.76):

yt=b'xt+et=b'Pzt+et+b'vt,

entailing:

p'=b'P.

The test is, therefore:

p^'( Z'Z) p^-b~'P^( Z'Z) P^'b~

e~'e~/T
    app  ~  c2(q).
(eq:12.81)

12.6.4 Recursive IV estimation

When instrumental variables estimators are used, the recursive formulae are similar to OLS but more cumbersome (see Hendry and Neale, 1987). PcGets simply recalculates the estimates as the sample size is changed. Like OLS, both forwards and backwards recursive estimates are provided, as are moving window.

References

Akaike, A. (1973). "Information theory and an extension of the maximum likelihood principle" In Petrov, B. N., and Saki, F. L.(eds.), Second International Symposium of Information Theory. Budapest.

Akaike, H. (1985). "Prediction and entropy" In Atkinson, A. C., and Fienberg, S. E.(eds.), A Celebration of Statistics, pp. 1--24. New York: Springer-Verlag.

Akerlof, G. A. (1979). "Irving Fisher on his head: The consequences of constant target-threshold monitoring of money holdings" Quarterly Journal of Economics, 93, 169--188.

Amemiya, T. (1980). "Selection of regressors" International Economic Review, 21, 331--354.

Anderson, T. W. (1971). The Statistical Analysis of Time Series. New York: John Wiley & Sons.

Andrews, D. W. K. (1991). "Heteroskedasticity and autocorrelation consistent covariance matrix estimation" Econometrica, 59, 817--858.

Banerjee, A., Dolado, J. J., Galbraith, J. W., and Hendry, D. F. (1993). Co-integration, Error Correction and the Econometric Analysis of Non-Stationary Data. Oxford: Oxford University Press.

Bårdsen, G. (1989). "The estimation of long run coefficients from error correction models" Oxford Bulletin of Economics and Statistics, 50.

Bean, C. R. (1977). "More consumers' expenditure equations" Academic panel paper (77)35, H.M. Treasury, London.

Bean, C. R. (1978). "The determination of consumers' expenditure in the UK" Government economic service working paper 4, H.M. Treasury, London.

Bontemps, C., and Mizon, G. E. (2001). "Congruence and encompassing" In Stigum, B.(ed.), Studies in Economic Methodology. Cambridge, Mass.: MIT Press.

Boswijk, H. P. (1992). Cointegration, Identification and Exogeneity, Vol. 37 of Tinbergen Institute Research Series. Amsterdam: Thesis Publishers.

Bowman, K. O., and Shenton, L. R. (1975). "Omnibus test contours for departures from normality based on Öb1 and b2" Biometrika, 62, 243--250.

Box, G. E. P., and Jenkins, G. M. (1976). Time Series Analysis, Forecasting and Control. San Francisco: Holden-Day. First published, 1970.

Box, G. E. P., and Pierce, D. A. (1970). "Distribution of residual autocorrelations in autoregressive-integrated moving average time series models" Journal of the American Statistical Association, 65, 1509--1526.

Breusch, T. S. (1990). "Simplified extreme bounds" in Granger 1990, pp. 72--81.

Brown, R. L., Durbin, J., and Evans, J. M. (1975). "Techniques for testing the constancy of regression relationships over time (with discussion)" Journal of the Royal Statistical Society B, 37, 149--192.

Burns, A. F., and Mitchell, W. C. (1946). Measuring Business Cycles. New York: NBER.

Campos, J., and Ericsson, N. R. (1999). "Constructive data mining: Modeling consumers' expenditure in Venezuela" Econometrics Journal, 2, 226--240.

Carruth, A., and Henley, A. (1990). "Can existing consumption functions forecast consumer spending in the late 1980s?" Oxford Bulletin of Economics and Statistics, 52, 211--222.

Chatfield, C. (1995). "Model uncertainty, data mining and statistical inference" Journal of the Royal Statistical Society, A, 158, 419--466. With discussion.

Chow, G. C. (1960). "Tests of equality between sets of coefficients in two linear regressions" Econometrica, 28, 591--605.

Chow, G. C. (1981). "Selection of econometric models by the information criteria" In Charatsis, E. G.(ed.), Proceedings of the Econometric Society European Meeting 1979, Ch. 8. Amsterdam: North-Holland.

Clayton, M. K., Geisser, S., and Jennings, D. E. (1986). "A comparison of several model selection procedures" In Goel, P., and Zellner, A.(eds.), Bayesian Inference and Decision Techniques: Elsevier Science.

Clements, M. P., and Hendry, D. F. (1998). Forecasting Economic Time Series. Cambridge: Cambridge University Press.

Clements, M. P., and Hendry, D. F. (1999a). Forecasting Non-stationary Economic Time Series. Cambridge, Mass.: MIT Press.

Clements, M. P., and Hendry, D. F. (1999b). "Modelling methodology and forecast failure" Unpublished typescript, Economics Department, University of Oxford.

Coen, P. G., Gomme, E. D., and Kendall, M. G. (1969). "Lagged relationships in economic forecasting" Journal of the Royal Statistical Society A, 132, 133--163.

Cox, D. R. (1961). "Tests of separate families of hypotheses" In Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Vol. 1, pp. 105--123 Berkeley: University of California Press.

Cox, D. R. (1962). "Further results on tests of separate families of hypotheses" Journal of the Royal Statistical Society B, 24, 406--424.

Cran, G. W., Martin, K. J., and Thomas, G. E. (1977). "A remark on algorithms. AS 63: The incomplete beta integral. AS 64: Inverse of the incomplete beta function ratio" Applied Statistics, 26, 111--112.

D'Agostino, R. B. (1970). "Transformation to normality of the null distribution of g1" Biometrika, 57, 679--681.

Davidson, J. E. H., and Hendry, D. F. (1981). "Interpreting econometric evidence: The behaviour of consumers' expenditure in the UK" European Economic Review, 16, 177--192. Reprinted in Hendry, D. F., op. cit., (1993) and (2000).

Davidson, J. E. H., Hendry, D. F., Srba, F., and Yeo, J. S. (1978). "Econometric modelling of the aggregate time-series relationship between consumers' expenditure and income in the United Kingdom" Economic Journal, 88, 661--692. Reprinted in Hendry, D. F., op. cit., (1993) and (2000).

Deaton, A. S. (1982). "Model selection procedures or, does the consumption function exist" In Chow, G. C., and Corsi, P.(eds.), Evaluating the Reliability of Macro-Economic Models, Ch. 5. New York: John Wiley.

Doan, T., Litterman, R., and Sims, C. A. (1984). "Forecasting and conditional projection using realistic prior distributions" Econometric Reviews, 3, 1--100.

Doornik, J. A. (1999). Object-Oriented Matrix Programming using Ox 3rd ed. London: Timberlake Consultants Press.

Doornik, J. A., and Hansen, H. (1994). "A practical test for univariate and multivariate normality" Discussion paper, Nuffield College.

Doornik, J. A., and Hendry, D. F. (1996). GiveWin: An Interactive Empirical Modelling Program. London: Timberlake Consultants Press.

Doornik, J. A., and Hendry, D. F. (2001a). Econometric Modelling using PcGive 10, Volume II. London: Timberlake Consultants Press.

Doornik, J. A., and Hendry, D. F. (2001b). Interactive Monte Carlo Experimentation in Econometrics using PcNaive. London: Timberlake Consultants Press.

Engle, R. F. (1982a). "Autoregressive conditional heteroscedasticity, with estimates of the variance of United Kingdom inflation" Econometrica, 50, 987--1007.

Engle, R. F. (1982b). "Autoregressive conditional heteroskedasticity with estimates of the variance of UK inflation". 50, 987--1008.

Engle, R. F., Hendry, D. F., and Richard, J.-F. (1983). "Exogeneity" Econometrica, 51, 277--304. Reprinted in Hendry, D. F., Econometrics: Alchemy or Science? Oxford: Blackwell Publishers, 1993, and Oxford University Press, 2000; and in Ericsson, N. R. and Irons, J. S. (eds.) Testing Exogeneity, Oxford: Oxford University Press, 1994.

Engle, R. F., Hendry, D. F., and Trumbull, D. (1985). "Small sample properties of ARCH estimators and tests" Canadian Journal of Economics, 43, 66--93.

Engle, R. F., and White, H.(eds.)(1999). Cointegration, Causality and Forecasting. Oxford: Oxford University Press.

Ericsson, N. R. (1983). "Asymptotic properties of instrumental variables statistics for testing non-nested hypotheses" Review of Economic Studies, 50, 287--303.

Ericsson, N. R., Campos, J., and Tran, H.-A. (1990). "PC-GIVE and David Hendry's econometric methodology" Revista De Econometria, 10, 7--117.

Ericsson, N. R., Hendry, D. F., and Prestwich, K. M. (1998). "The demand for broad money in the United Kingdom, 1878--1993" Scandinavian Journal of Economics, 100, 289--324.

Ericsson, N. R., and Irons, J. S.(eds.)(1994). Testing Exogeneity. Oxford: Oxford University Press.

Faust, J., and Whiteman, C. H. (1997). "General-to-specific procedures for fitting a data-admissible, theory-inspired, congruent, parsimonious, encompassing, weakly-exogenous, identified, structural model of the DGP: A translation and critique" Carnegie--Rochester Conference Series on Public Policy, 47, 121--161.

Friedman, M., and Schwartz, A. J. (1982). Monetary Trends in the United States and the United Kingdom: Their Relation to Income, Prices, and Interest Rates, 1867--1975. Chicago: University of Chicago Press.

Frisch, R., and Waugh, F. V. (1933). "Partial time regression as compared with individual trends" Econometrica, 1, 221--223.

Gilbert, C. L. (1986). "Professor Hendry's econometric methodology" Oxford Bulletin of Economics and Statistics, 48, 283--307. Reprinted in Granger, C. W. J. (ed.) (1990), Modelling Economic Series. Oxford: Clarendon Press.

Godfrey, L. G. (1978). "Testing for higher order serial correlation in regression equations when the regressors include lagged dependent variables" Econometrica, 46, 1303--1313.

Godfrey, L. G., and Orme, C. D. (1994). "The sensitivity of some general checks to omitted variables in the linear model" International Economic Review, 35, 489--506.

Godfrey, L. G., and Veale, M. R. (1999). "Alternative approaches to testing by variable addition" Mimeo, York University, UK.

Gourieroux, C., and Monfort, A. (1995). "Testing, encompassing, and simulating dynamic econometric models" Econometric Theory, 11, 195--228.

Granger, C. W. J. (1969). "Investigating causal relations by econometric models and cross-spectral methods" Econometrica, 37, 424--438.

Granger, C. W. J.(ed.)(1990). Modelling Economic Series. Oxford: Clarendon Press.

Haavelmo, T. (1944). "The probability approach in econometrics" Econometrica, 12, 1--118. Supplement.

Hannan, E. J., and Quinn, B. G. (1979). "The determination of the order of an autoregression" Journal of the Royal Statistical Society, B, 41, 190--195.

Harris, R. I. D. (1995). Using Cointegration Analysis in Econometric Modelling. London: Prentice Hall.

Harvey, A. C. (1981). The Econometric Analysis of Time Series. Deddington: Philip Allan.

Harvey, A. C. (1990). The Econometric Analysis of Time Series, 2nd ed. Hemel Hempstead: Philip Allan.

Hendry, D. F. (1976). "The structure of simultaneous equations estimators" Journal of Econometrics, 4, 51--88. Reprinted in Hendry, D. F., op. cit., (1993) and (2000).

Hendry, D. F. (1979). "Predictive failure and econometric modelling in macro-economics: The transactions demand for money" In Ormerod, P.(ed.), Economic Modelling, pp. 217--242. London: Heinemann. Reprinted in Hendry, D. F., op. cit., (1993) and (2000).

Hendry, D. F. (1980). "Econometrics: Alchemy or science?" Economica, 47, 387--406. Reprinted in Hendry, D. F., Econometrics: Alchemy or Science? Oxford: Blackwell Publishers, 1993, and Oxford University Press, 2000.

Hendry, D. F. (1985). "Monetary economic myth and econometric reality" Oxford Review of Economic Policy, 1, 72--84. Reprinted in Hendry, D. F., op. cit., (1993) and (2000).

Hendry, D. F. (1994). "HUS revisited" Oxford Review of Economic Policy, 10, 86--106.

Hendry, D. F. (1995a). Dynamic Econometrics. Oxford: Oxford University Press.

Hendry, D. F. (1995b). "Econometrics and business cycle empirics" Economic Journal, 105, 1622--1636.

Hendry, D. F. (1996). "On the constancy of time-series econometric equations" Economic and Social Review, 27, 401--422.

Hendry, D. F. (1997). "On congruent econometric relations: A comment" Carnegie--Rochester Conference Series on Public Policy, 47, 163--190.

Hendry, D. F. (1999). "An econometric analysis of US food expenditure, 1931--1989" in Magnus, and Morgan 1999, pp. 341--361.

Hendry, D. F. (2000a). Econometrics: Alchemy or Science? Oxford: Oxford University Press. New Edition.

Hendry, D. F. (2000b). "Epilogue: The success of general-to-specific model selection" In Econometrics: Alchemy or Science?, pp. 467--490. Oxford: Oxford University Press. New Edition.

Hendry, D. F., and Doornik, J. A. (1994). "Modelling linear dynamic econometric systems" Scottish Journal of Political Economy, 41, 1--33.

Hendry, D. F., and Doornik, J. A. (1999). "The impact of computational tools on time-series econometrics" In Coppock, T.(ed.), Information Technology and Scholarship, pp. 257--269. Oxford: Oxford University Press.

Hendry, D. F., and Doornik, J. A. (2001). Econometric Modelling using PcGive 10: Volume I. London: Timberlake Consultants Press.

Hendry, D. F., and Ericsson, N. R. (1991a). "An econometric analysis of UK money demand in `Monetary Trends in the United States and the United Kingdom' by Milton Friedman and Anna J. Schwartz" American Economic Review, 81, 8--38.

Hendry, D. F., and Ericsson, N. R. (1991b). "Modeling the demand for narrow money in the United Kingdom and the United States" European Economic Review, 35, 833--886.

Hendry, D. F., and Krolzig, H.-M. (1999a). "General-to-specific model selection using PcGets for Ox" Unpublished paper, Economics Department, Oxford University.

Hendry, D. F., and Krolzig, H.-M. (1999b). "Improving on `Data mining reconsidered' by K.D. Hoover and S.J. Perez" Econometrics Journal, 2, 202--219.

Hendry, D. F., and Krolzig, H.-M. (2000). "The econometrics of general-to-simple modelling" Mimeo, Economics Department, Oxford University.

Hendry, D. F., Leamer, E. E., and Poirier, D. J. (1990). "A conversation on econometric methodology" Econometric Theory, 6, 171--261.

Hendry, D. F., and Mizon, G. E. (1978). "Serial correlation as a convenient simplification, not a nuisance: A comment on a study of the demand for money by the Bank of England" Economic Journal, 88, 549--563. Reprinted in Hendry, D. F., op. cit., (1993) and (2000).

Hendry, D. F., and Mizon, G. E. (1990). "Procrustean econometrics: or stretching and squeezing data" in Granger 1990, pp. 121--136.

Hendry, D. F., and Mizon, G. E. (1993). "Evaluating dynamic econometric models by encompassing the VAR" In Phillips, P. C. B.(ed.), Models, Methods and Applications of Econometrics, pp. 272--300. Oxford: Basil Blackwell.

Hendry, D. F., and Mizon, G. E. (1999). "The pervasiveness of Granger causality in econometrics" in Engle, and White 1999.

Hendry, D. F., and Mizon, G. E. (2000). "Reformulating empirical macro-econometric modelling" Oxford Review of Economic Policy, 16, 138--159.

Hendry, D. F., and Morgan, M. S. (1995). The Foundations of Econometric Analysis. Cambridge: Cambridge University Press.

Hendry, D. F., Muellbauer, J. N. J., and Murphy, T. A. (1990). "The econometrics of DHSY" In Hey, J. D., and Winch, D.(eds.), A Century of Economics, pp. 298--334. Oxford: Basil Blackwell.

Hendry, D. F., and Neale, A. J. (1987). "Monte Carlo experimentation using PC-NAIVE" In Fomby, T., and Rhodes, G. F.(eds.), Advances in Econometrics, Vol. 6, pp. 91--125. Greenwich, Connecticut: Jai Press Inc.

Hendry, D. F., and Richard, J.-F. (1982). "On the formulation of empirical models in dynamic econometrics" Journal of Econometrics, 20, 3--33. Reprinted in Granger, C. W. J. (ed.) (1990), Modelling Economic Series. Oxford: Clarendon Press and in Hendry D. F., op. cit., (1993) and (2000).

Hendry, D. F., and Richard, J.-F. (1989). "Recent developments in the theory of encompassing" In Cornet, B., and Tulkens, H.(eds.), Contributions to Operations Research and Economics. The XXth Anniversary of CORE, pp. 393--440. Cambridge, MA: MIT Press.

Hendry, D. F., and von Ungern-Sternberg, T. (1981). "Liquidity and inflation effects on consumers' expenditure" In Deaton, A. S.(ed.), Essays in the Theory and Measurement of Consumers' Behaviour, pp. 237--261. Cambridge: Cambridge University Press. Reprinted in Hendry, D. F., op. cit., (1993) and (2000).

Hendry, D. F., and Wallis, K. F.(eds.)(1984). Econometrics and Quantitative Economics. Oxford: Basil Blackwell.

Hoover, K. D., and Perez, S. J. (1999). "Data mining reconsidered: Encompassing and the general-to-specific approach to specification search" Econometrics Journal, 2, 167--191.

Jarque, C. M., and Bera, A. K. (1980). "Efficient tests for normality, homoscedasticity and serial independence of regression residuals" Economics Letters, 6, 255--259.

Johansen, S. (1988). "Statistical analysis of cointegration vectors" Journal of Economic Dynamics and Control, 12, 231--254. Reprinted in R.F. Engle and C.W.J. Granger (eds), Long-Run Economic Relationships, Oxford: Oxford University Press, 1991, 131--52.

Johansen, S. (1992). "Testing weak exogeneity and the order of cointegration in UK money demand" Journal of Policy Modeling, 14, 313--334.

Judge, G. G., and Bock, M. E. (1978). The Statistical Implications of Pre-Test and Stein-Rule Estimators in Econometrics. Amsterdam: North Holland Publishing Company.

Judge, G. G., Griffiths, W. E., Hill, R. C., Lütkepohl, H., and Lee, T.-C. (1985). The Theory and Practice of Econometrics, 2nd ed. New York: John Wiley.

Kent, J. T. (1986). "The underlying nature of nonnested hypothesis tests" Biometrika, 73, 333--343.

Keynes, J. M. (1939). "Professor Tinbergen's method" Economic Journal, 44, 558--568.

Keynes, J. M. (1940). "Comment" Economic Journal, 50, 154--156.

Kiviet, J. F. (1985). "Model selection test procedures in a single linear equation of a dynamic simultaneous system and their defects in small samples" Journal of Econometrics, 28, 327--362.

Kiviet, J. F. (1986). "On the rigor of some mis-specification tests for modelling dynamic relationships" Review of Economic Studies, 53, 241--261.

Koopmans, T. C. (1947). "Measurement without theory" Review of Economics and Statistics, 29, 161--179.

Koopmans, T. C., Rubin, H., and Leipnik, R. B. (1950). "Measuring the equation systems of dynamic economics" In Koopmans, T. C.(ed.), Statistical Inference in Dynamic Economic Models, No. 10 in Cowles Commission Monograph, Ch. 2. New York: John Wiley & Sons.

Krolzig, H.-M. (2001). "General-to-specific reductions of vector autoregressive processes" Economics discussion paper 2000-W34, Nuffield College, Oxford.

Krolzig, H.-M., and Hendry, D. F. (2001). "Computer automation of general-to-specific model selection procedures" Journal of Economic Dynamics and Control, 25, 831--866.

Leamer, E. E. (1978). Specification Searches. Ad-Hoc Inference with Non-Experimental Data. New York: John Wiley.

Leamer, E. E. (1983a). "Let's take the con out of econometrics" American Economic Review, 73, 31--43. Reprinted in Granger, C. W. J. (ed.) (1990), Modelling Economic Series. Oxford: Clarendon Press.

Leamer, E. E. (1983b). "Model choice and specification analysis" In Griliches, Z., and Intriligator, M. D.(eds.), Handbook of Econometrics, Vol. 1, Ch. 5. Amsterdam: North-Holland.

Leamer, E. E. (1984). "Global sensitivity results for generalized least squares estimates" Journal of the American Statistical Association, 79, 867--870.

Leamer, E. E. (1990). "Sensitivity analyses would help" in Granger 1990, pp. 88--96.

Ljung, G. M., and Box, G. E. P. (1978). "On a measure of lack of fit in time series models" Biometrika, 65, 297--303.

Lovell, M. C. (1983). "Data mining" Review of Economics and Statistics, 65, 1--12.

Lütkepohl, H. (1991). Introduction to Multiple Time Series Analysis. Berlin: Springer.

Magnus, J. R., and Morgan, M. S.(eds.)(1999). Methodology and Tacit Knowledge: Two Experiments in Econometrics. Chichester: John Wiley and Sons.

Majunder, K. L., and Bhattacharjee, G. P. (1973a). "Algorithm AS 63. The incomplete beta integral" Applied Statistics, 22, 409--411.

Majunder, K. L., and Bhattacharjee, G. P. (1973b). "Algorithm AS 64. Inverse of the incomplete beta function ratio" Applied Statistics, 22, 411--414.

Mayo, D. (1981). "Testing statistical testing" In Pitt, J. C.(ed.), Philosophy in Economics, pp. 175--230: D. Reidel Publishing Co. Reprinted as pp. 45--73 in Caldwell B. J. (1993), The Philosophy and Methodology of Economics, Vol. 2, Aldershot: Edward Elgar.

McAleer, M., Pagan, A. R., and Volker, P. A. (1985). "What will take the con out of econometrics?" American Economic Review, 95, 293--301. Reprinted in Granger, C. W. J. (ed.) (1990), Modelling Economic Series. Oxford: Clarendon Press.

Mizon, G. E. (1977a). "Inferential procedures in nonlinear models: An application in a UK industrial cross section study of factor substitution and returns to scale" Econometrica, 45, 1221--1242.

Mizon, G. E. (1977b). "Model selection procedures" In Artis, M. J., and Nobay, A. R.(eds.), Studies in Modern Economic Analysis, pp. P97--120. Oxford: Basil Blackwell.

Mizon, G. E. (1984). "The encompassing approach in econometrics" in Hendry, and Wallis 1984, pp. 135--172.

Mizon, G. E. (1995). "Progressive modelling of macroeconomic time series: the LSE methodology" In Hoover, K. D.(ed.), Macroeconometrics: Developments, Tensions and Prospects, pp. 107--169. Dordrecht: Kluwer Academic Press.

Mizon, G. E., and Richard, J.-F. (1986). "The encompassing principle and its application to non-nested hypothesis tests" Econometrica, 54, 657--678.

Moore, H. L. (1914). Economic Cycles -- Their Law and Cause. New York: MacMillan.

Muellbauer, J. N. J. (1994). "The assessment: Consumer expenditure" Oxford Review of Economic Policy, 10, 1--41.

Nicholls, D. F., and Pagan, A. R. (1983). "Heteroscedasticity in models with lagged dependent variables" Econometrica, 51, 1233--1242.

Pagan, A. R. (1984). "Model evaluation by variable addition" in Hendry, and Wallis 1984, pp. 103--135.

Pagan, A. R. (1987). "Three econometric methodologies: A critical appraisal" Journal of Economic Surveys, 1, 3--24. Reprinted in Granger, C. W. J. (ed.) (1990), Modelling Economic Series. Oxford: Clarendon Press.

Paroulo, P. (1996). "On the determination of integration indices in I(2) systems" Journal of Econometrics, 72, 313--356.

Pesaran, M. H. (1974). "On the general problem of model selection" Review of Economic Studies, 41, 153--171.

Pesaran, M. H.(ed.)(1987). The Limits of Rational Expectations. Oxford: Basil Blackwell.

Pike, M. C., and Hill, I. D. (1966). "Logarithm of the gamma function" Communications of the ACM, 9, 684.

Rahbek, A., Kongsted, H. C., and Jørgensen, C. (1999). "Trend-stationarity in the I(2) cointegration model" Journal of Econometrics, 90, 265--289.

Sargan, J. D. (1964). "Wages and prices in the United Kingdom: A study in econometric methodology (with discussion)" In Hart, P. E., Mills, G., and Whitaker, J. K.(eds.), Econometric Analysis for National Economic Planning, Vol. 16 of Colston Papers, pp. 25--63. London: Butterworth Co. Reprinted as pp. 275--314 in Hendry D. F. and Wallis K. F. (eds.) (1984). Econometrics and Quantitative Economics. Oxford: Basil Blackwell, and as pp. 124--169 in Sargan J. D. (1988), Contributions to Econometrics, Vol. 1, Cambridge: Cambridge University Press.

Sargan, J. D. (1973). "Model building and data mining" Discussion paper, London School of Economics. Presented to the Association of University Teachers of Economics, Meeting, Manchester, April 1973.

Sargan, J. D. (1980). "Some tests of dynamic specification for a single equation" Econometrica, 48, 879--897. Reprinted as pp. 191--212 in Sargan J. D. (1988), Contributions to Econometrics, Vol. 1, Cambridge: Cambridge University Press.

Sargan, J. D. (1981). "The choice between sets of regressors" Mimeo, Economics Department, London School of