Forecasting with Bayesian vector autoregressive models: comparison of direct and iterated multistep methods

Katsuhiro Sugita (Faculty of Global and Regional Studies, University of the Ryukyus, Okinawa, Japan)

Asian Journal of Economics and Banking

ISSN: 2615-9821

Article publication date: 6 June 2022

Issue publication date: 4 August 2022

601

Abstract

Purpose

The paper compares multi-period forecasting performances by direct and iterated method using Bayesian vector autoregressive (VAR) models.

Design/methodology/approach

The paper adopts Bayesian VAR models with three different priors – independent Normal-Wishart prior, the Minnesota prior and the stochastic search variable selection (SSVS). Monte Carlo simulations are conducted to compare forecasting performances. An empirical study using US macroeconomic data are shown as an illustration.

Findings

In theory direct forecasts are more efficient asymptotically and more robust to model misspecification than iterated forecasts, and iterated forecasts tend to bias but more efficient if the one-period ahead model is correctly specified. From the results of the Monte Carlo simulations, iterated forecasts tend to outperform direct forecasts, particularly with longer lag model and with longer forecast horizons. Implementing SSVS prior generally improves forecasting performance over unrestricted VAR model for either nonstationary or stationary data.

Originality/value

The paper finds that iterated forecasts using model with the SSVS prior generally best outperform, suggesting that the SSVS restrictions on insignificant parameters alleviates over-parameterized problem of VAR in one-step ahead forecast and thus offers an appreciable improvement in forecast performance of iterated forecasts.

Keywords

Citation

Sugita, K. (2022), "Forecasting with Bayesian vector autoregressive models: comparison of direct and iterated multistep methods", Asian Journal of Economics and Banking, Vol. 6 No. 2, pp. 142-154. https://doi.org/10.1108/AJEB-04-2022-0044

Publisher

:

Emerald Publishing Limited

Copyright © 2022, Katsuhiro Sugita

License

Published in Asian Journal of Economics and Banking. Published by Emerald Publishing Limited. This article is published under the Creative Commons Attribution (CC BY 4.0) licence. Anyone may reproduce, distribute, translate and create derivative works of this article (for both commercial and non-commercial purposes), subject to full attribution to the original publication and authors. The full terms of this licence may be seen at http://creativecommons.org/licences/by/4.0/legalcode.


1. Introduction

Vector autoregressive (VAR) models have been widely used to forecast macroeconomic variables and to analyze macroeconomics and policy. For one-period ahead forecasting, one has to just estimate the model. However, it is often the case that more than one-period forecasting is of interest. In making a multi-period forecast, there are two methods – direct forecast method and iterated forecast method, and there have been several theoretical research about which method is better for multi-period forecasting such as Bhansali (1996, 1997), Clements and Hendry (1996), Kang (2003), Chevillon and Hendry (2005), Ing (2003) and among others. These literature tend to conclude that direct forecasts are more robust to model specification and more efficient asymptotically, and thus the direct forecast method is preferable compared with the iterated forecast method, while the iterated forecast method can be more efficient only if the one-period ahead model is correctly specified. However, some empirical research studies show that iterated forecasts outperform direct forecasts. Ang et al. (2006) find that the iterated forecasts of the US GDP growth perform better than the direct forecasts. Marcellino et al. (2006) show that iterated forecasts outperform direct forecasts, especially with longer lag and longer forecast horizon; this paper uses 170 US monthly macroeconomic time series for either univariate or multivariate models. Pesaran et al. (2011) state that whether direct or iterated method is better in multi-period forecasting depends upon the sample size, forecast horizon, the underlying data generating process (DGP) and the methods used to select lag length for the model, and thus it is ultimately an empirical matter.

For multivariate VAR models, there exists an over-parameterization problem, which leads to imprecise inference and thus deteriorates the forecast performance. Some Bayesian approaches to VAR models have been increasingly popular since Bayesian method can shrink VAR models by restricting its prior distributions. In this paper we investigate whether restricted parsimonious VAR models can mitigate misspecification problem and thus improve the forecasting performance of iterated method. Here, an independent Normal-Wishart prior is used for the unrestricted VAR and the Minnesota prior (Minn) and the stochastic search variable selection (SSVS) prior are used for the restricted prior to compare multiperiod forecasting performance between the direct and iterated forecast method.

We conduct numerical simulations using both stationary and nonstationary data generating processes (DGPs) to evaluate forecasting performances with 2-, 4-, 8- and 12-step ahead horizons, and compute the mean squared forecast error (MSFE) to compare direct forecasts with iterated forecasts using Bayesian VAR models with unrestricted and restricted priors. Iterated forecasts are found to outperform direct forecasts for both unrestricted and restricted VAR models, particularly with long-lag model and with long forecasting horizon. Implementing SSVS in VAR is found to generally improve forecasting performance appreciably. With relatively long lag length and thus a large number of parameters in the model, it seems that SSVS can effectively restrict insignificant parameters in the model and thus improve forecasting performance.

The plan of this paper is as follows. In Section 2 multi-period forecasting using VAR model is described, and method to evaluate forecasting performances. Section 3 reviews Bayesian VAR models with three different priors – the independent Normal-Wishart prior, the Minnesota prior and the SSVS prior. Section 4 illustrates numerical experiments with artificially generated data, and then examines the results of the numerical simulations. Section 5 illustrates an application to a simple three variables VAR of US macroeconomics. Section 6 concludes. This paper is based on preliminary working papers, Sugita (2018), Sugita (2019a) and Sugita (2019b). All results reported in this paper are generated using Ox version 7.2 for Linux (see Doornik, 2013).

2. Iterated and direct multi-period forecasts for VAR models

This section describes iterated and direct forecasting methods for VAR models. Let yt be an n × 1 vector of observations at time t, then a VAR model with p lag is written as

(1)yt=μ+i=1pytiΘi+ετ
for t = 1, …, T, where μ is a n × 1 vector of an intercept term; Θi are n × n matrices of coefficients for i = 1, …, p; ɛt are n × 1 independent Nn0,Σ errors; and the covariance matrix Σ is an n × n positive definite matrix.

The one-step ahead forecast yˆt+1|t of the VAR model is obtained by estimating the parameters in eq. (1) as yˆt+1|t=μˆ(I)+i=1pyt+1i|tΘˆ(I),i, where μˆ(I) and Θˆ(I),i are the estimators for μ and Θi in eq. (1). To make forecasting further than one-period ahead into the future, there are two methods for making multi-period forecasts iterated forecasts and direct forecasts methods. Iterated forecasts for the h-period forecasts are obtained recursively as

(2)yˆt+h=μˆ(I)+i=1pyˆt+hiΘˆ(I),i
where yˆi|t=yj for j ≤ t.

Direct forecasts for the multi-period forecasting are obtained by estimating the model

(3)yt=μ+i=1pythiΘi+εt,
Then using the estimated coefficients directly to make the forecast of
(4)yˆt+h=μˆ(D)+i=1pytiΘˆ(D),i
where μˆ(D) and Θˆ(D),i are the estimators for μ and Θi in eq. (3). Thus, the relative forecast accuracy depends on how accurate Θˆ(I),i and Θˆ(D),i are estimated. If Θˆ(I),i is badly estimated with large errors, then its powered values diverge increasingly from Θi. Since the iterated method depends on one-period ahead coefficients Θˆ(I),i, the direct method is preferable when the one-period ahead model is not specified correctly. Chevillon and Hendry (2005) evaluate the asymptotic and finite-sample properties of direct forecasting method, and show that, compared with iterated method, the direct method is more efficient asymptotically, more precise in finite samples and more robust against model misspecification. The theoretical advantages of the direct forecasting method over the iterated method are shown by Bhansali (1996, 1997), Clements and Hendry (1996), Kang (2003) and Ing (2003) among others. However, Marcellino et al. (2006) evaluates a large-scale empirical comparison of iterated and direct forecasts using US macroeconomic time series data, and finds that iterated forecasts tend to have smaller MSFEs than direct forecasts, contrary to the theoretical preference of direct forecasts.

To evaluate the forecasting performances among several different models, the MSFE is the most widely used. Let yτ+h is a vector of observations at time τ + h for τ = τ0, …, Th − 1, and h = 2-, 4-, 8- and 12-step ahead forecasts. Then, Φˆ=μˆ,Θˆ1,,Θˆp is estimated for both the direct and iterated method, using information up to τ − 1 to forecast values yˆτ+h starting from τ = τ0 up to τ = Th − 1, and calculate the MSFE defined as:

(5)MSFE=1Thτ0+1τ=τ0Thyτ+hyˆτ+h|Φˆ,Yτ12.
where Yτ1=Xτ1,Xτ2,,X1.

3. Bayesian VARs

This section presents Bayesian VAR models with three different priors – independent Normal-Wishart prior, the Minnesota prior and the SSVS prior. The VAR model in eq. (1) can be written in matrix form as follows:

(6)Y=XΦ+ε
where the T × n matrix Y is defined as Y=(y1,,yT); the T × (1 + np) matrix X is defined as X=(x1,,xT); the (1 + np) × 1 vector is defined as xt=1,yt1,,ytp, the (1 + np) × n matrix Φ is defined as Φ=(μ,Θ1,,Θp); and the ɛ is a T × n matrix with ε=(ε1,,εT). Based on the VAR model in eq. (1) or eq. (6), we describe briefly the three priors in the following subsections.

3.1 Independent Normal-Wishart prior

The VAR model in eq. (6) with the independent Normal-Wishart prior

(7)vecΦMNvecΦ0,V0
(8)ΣIWΣ0,ν0
where MN refers to a multivariate normal with mean vec0) and covariance-variance matrix V0: IW refers to an inverted Wishart distribution with parameters Σ0 and degrees of freedom, ν0. Unlike the natural conjugate priors, prior for Φ in eq. (7) and Σ in eq. (8) are independently specified. With the joint prior and the likelihood, the conditional posterior densities of vec(Φ) and Σ are derived as follows:
(9)vec(Φ)|Σ,YMN(vec(Φ*),V*)
(10)Σ|Φ,YIWΣ*,ν*
where V*=V01+ΣXX1 and vecB=V*V01vecΦ0+ΣIκ1vecXY,

Σ*=YXΦYXΦ+Σ0, and ν* = T + ν0. Given these conditional posterior specifications above, the Gibbs sampler generates sample draws.

Note that, with zero prior mean Φ0 = 0 and large prior variance V0 in eq. (7), the posterior mean for Φ is almost identical to the Maximum likelihood estimator. In this paper the hyperparameters are set at vec0) = 0 and V0 = 100 in eq. (7), Σ0 = 0.1I, and ν = 5 in eq. (8).

3.2 Minnesota prior

Litterman (1986) proposes what we call the Minnesota prior which is shrinkage prior for a Bayesian VAR model with random walk components. For a VAR model with p-the lag in eq. (1), the Minnesota prior for the coefficients assumes that the importance of the lagged variables is shrinking with the lag length, so that the prior is tighter around zero with lag length such that ΘiN(Θi¯,V(Θi)) where the expected values of Θi is defined as Θ¯1=In and Θ¯2==Θ¯p=0n, and the variance of Θi is given as:

(11)V(Θi)=λ2i21θσ1ˆ2/σ2ˆ2θσ1ˆ2/σnˆ2θσ2ˆ2/σ1ˆ21θσ2ˆ2/σnˆ2θσnˆ2/σ1ˆ2θσnˆ2/σ2ˆ21,
where 0 < θ < 1, and Σ=diagσˆ12,,σˆn2. In this paper, the hyperparameters in eq. (11) are set at λ = 0.05 and θ = 0.1.

3.3 SSVS prior

Without any restriction on the regression coefficients and the covariance matrix in eq. (1), VAR models usually has over-parameterization problem. They contain a very large number of parameters, leading to imprecise inference and deterioration of the forecast performance. To overcome this problem, George et al. (2008) apply the Bayesian SSVS method in a VAR. The SSVS method, proposed by George et al. (2008) and George and McCulloch (1997), restricts the parameters of the model by using a hierarchical prior on the parameters.

SSVS defines the prior for the VAR coefficient Φ for each element in Φ. Let ϕj be each element in Φ, then the prior for ϕj is a hierarchical prior with mixture of two normal distributions with different variance conditional on an unknown dummy variable γj that takes zero or one:

(12)ϕj|γj(1γj)N(0,τ0,j2)+γjN(0,τ1,j2)
where τ0,j2 is small and τ0,j2<τ1,j2. This implies that if γj = 0, that is, the element ϕj is restricted to be close to 0 as ϕj|γjN(0,τ0,j2), the prior for ϕj|γj is virtually zero with small variance, on the other hand, if γj = 1, that is, the element ϕj is unrestricted as ϕj|γjN(0,τ1,j2), the prior is relatively non-informative with large variance. The priors on γj are assumed to be independent Bernoulli pi ∈ (0, 1) random variables as follows:
P(γj=1)=pj
(13)P(γj=0)=1pj
where pj is the prior parameter and pj = 0.5 for a natural default choice.

George and McCulloch (1997) and George et al. (2008) use a default semiautomatic approach that sets τkj=ckσˆϕj for k = 0, 1, where σˆϕj is the OLS estimates of the standard error of ϕj in an unrestricted VAR and pre-selected constants c0 and c1 must be c0 < c1 e.g. c0 = 0.1 and c1 = 10 as used by George et al. (2008), Jochmann et al. (2010) and Jochmann et al. (2013). In this paper, we follow these values for the hyperparameters.

4. Monte Carlo simulations

This section presents Monte Carlo simulations to illustrate forecasting performances for both iterated and direct forecast methods using VAR models. Two DGPs are considered: one follows non-stationary process and the other follows stationary process. For each DGPs, 100 samples of size T = 150 were simulated, and then for each sample, three types of priors are compared: (1) inverted Normal-Wishart (INW) prior, (2) Minnesota (Minn) prior and (3) SSVS prior.

The following two DGPs for VARs are considered for this experiment. Both DGPs contain intercept term. DGP 1 is a four-variable VAR with four lags, containing unit roots with parameters

DGP1:Φ(DGP1)=0.20.20.20.20.800000.400000.4000000.20000000000.300000.4000000.30000000000.3000000.300000.300000.3,andΨ(DGP1)=10.50.50.5010000100001
where Ψ is upper-triangular of the Choleski decomposition of Σ−1 = ΨΨ−1.

Next, DGP 2 is also a four-variable VAR with four lags, but stationary data with parameters

DGP2:Φ(DGP2)=0.50.50.50.50.60000.30.60000.30.60000.30.600000.200000.200000.200.3000.200.300000.300000.30.300000.300000.300000.3,andΨ(DGP2)=Ψ(DGP1) .

Each DGP is repeated 100 times to obtain 100 samples. As for determination of the lag length p, three different methods are used as (1) p = 4 (fixed), (2) p = 8 (fixed), and (3) p chosen by the Akaike information criterion (AIC) with 0 ≤ p ≤ 12. The first method has the fixed lag length as p = 4 is the true lag length. We do not use the Bayesian information criterion (BIC) for the lag length determination since the BIC is generally choosing short lag length, and the use of SSVS means that short lag model is not required to consider. For the selection of lag by the AIC, the AIC is computed at each date τ, where τ0 ≤ τ ≤ T − h, based on the one-step ahead regression for the iterated forecasts, and on the h-step ahead regression eq. (4) for the direct forecasts. For each τ in eq. (5), MCMC is run with 20,000 draws after 5,000 burn-in from τ = τ0 up to τ = T − h − 1 to compute the MSFEs in eq. (5) for each estimator by a recursive forecasting exercise of both an iterated and a direct multi-period forecasting method.

The Monte Carlo simulations for the multi-step forecasting are examined. Table 1 summarizes the MSFEs of both iterated and direct forecasts methods with forecast horizon 2-, 4-, 8- and 12-steps ahead. The MSFE in the table are the sum of the MSFE for each variable. For all series, pseudo-out-of-sample forecasts yˆτ+h are computed for τ = 80 to τ = 150 − h − 1, then we calculate the MSFE defined as eq. (5). Each figure in Table 1 is the average over 100 sample MSFEs. Inspection of Table 1 suggests the following:

  1. Among the three estimators by the INW, the Minn and the SSVS, the SSVS produces the lowest MSFE in most cases, though in a very few cases of direct forecasts the Minn shows barely better performances than the SSVS.

  2. The forecast performances by SSVS prior tends to be insensitive to the choice of the lag length, while the INW estimator considerably deteriorates the performances as the lag length is longer. That is, even if the lag length is more than 4 (that is the true lag length), the SSVS treats the coefficients on longer lags to be zero, while the forecast performances of other two models are largely depend upon the selection of the lag length. The Minnesota prior effectively provides shrinkage in parameters of the longer lags.

  3. For the INW and the SSVS, the iterated method of forecasts is better than the direct method, though for the Minn the results by iterated method are better for DGP2 than those by the direct method, but in some cases worse for DGP1.

  4. For these DGPs, the SSVS model with iterated forecast performs best for any forecast horizon.

Table 2 illustrates the distributions of the relative MSFE, that is the ratios of the MSFE of the direct forecast to the MSFE of the iterated forecast for different forecast horizons, MSFEdirectMSFEiterated. The table shows the mean, standard deviations, 95% highest posterior density intervals (HPDI) of the relative MSFE, and pr.(<1), which is probability that the ratio is less than 1 (the direct forecasts performs better than the iterated forecasts). The following results are found:

  1. For the INW and the SSVS, the mean values of the relative MSFEs are always greater than 1 (means that the iterated forecasts outperform the direct forecasts), while for the Minn the ratios are either greater or less than 1.

  2. For the INW and the SSVS, the mean values of the relative MSFEs are getting large as the forecast horizons are longer, meaning that the relative performance of the iterated forecasts improves with the forecast horizon.

  3. The MSFE ratios by the INW are quite sensitive to the choice of the lag length. As the lag length is longer, the relative MSFEs by the INW are getting larger. However, the relative MSFE by the SSVS is not affected by the choice of the lag length due to the insensitivities of the SSVS to the lag length.

  4. For all three estimators, the standard deviations of the relative MSFE are larger as the forecasts horizon is longer.

  5. Except for the Minn, the probability that the ratio is less than 1 is generally smaller with long-lag length.

  6. In the case of the Minn with non-stationary data, direct forecasts tend to have lower MSFEs than iterated forecasts, though with stationary data, the results are opposite as iterated forecasts lead to better performance than direct forecasts.

These findings show that for given DGPs the SSVS also has almost same properties in forecasting as the INW, as suggested by Marcellino et al. (2006). That is, forecasting performance by the SSVS also shows that the iterated forecast tends to outperform the direct forecast, especially with long-lag and longer forecast horizon. For the Minn, the results are ambiguous. Since the Minnesota prior set its prior mean for the coefficients on the first own lag to be 1 and other coefficients to be zero, the Minnesota prior prone to produce misspecified parameter estimates.

5. An empirical analysis

For an empirical example of comparison of the direct and iterated forecasts using Bayesian VAR models, this section considers multivariate model of US macroeconomics that uses three variables – unemployment rate, inflation rate and interest rate. A VAR model that uses these variables has been analyzed by Cogley and Sargent (2005), Primiceri (2005), Koop et al. (2009) and Jochmann et al. (2010), among many others. Our US data are quarterly, from 1953:1 to 2020:1 with sample size T = 268. Unemployment rate is measured by the civilian unemployment rate, inflation by the 400 times the difference of the log of CPI, which is the GDP chain-type price index, and interest rate by the three-month Treasury bill. These data are obtained from the Federal Reserve Bank of St. Louis [1], and are plotted in Figure 1.

The selection of the number of lags in a VAR affects efficiency in estimation and thus forecasting performances. Cogley and Sargent (2005) and Primiceri (2005) work with VAR(2) to analyze US macroeconomy with the three variables without mentioning any particular reason how the lag length is chosen. Jochmann et al. (2010) use VAR(4) for their SSVS VAR model because the SSVS can find zero restrictions on the parameters of longer lags even if the true lag length is less than 4. However, the true lag length might be larger than 4. With our data set, the number of lags is scattered depending upon which criterion we use – VAR(10) by the AIC, VAR(4) by the Hannan–Quinn criterion and VAR(2) by the BIC. Even if the true lag length is less than 10, the SSVS can set zero restrictions on the longer lags, thus we consider VAR (12) and VAR (AIC) where the lag length is chosen by the AIC. Forecast horizons are 2-, 4-, 8- and 12-period ahead. We work with a recursive forecasting exercise using both direct and iterated multistep forecasting method, with data up to time τ − 1, where τ = τ0, …, Th − 1, and τ0 = 80.

Table 3 presents the MSFEs eq. (5) for the three-variable VAR with the lag-length 12 and chosen by the AIC for the INW, the Minn and the SSVS estimators. For any forecast horizon, iterated forecasts have lower MSFE than direct forecasts. With enough long lag length of 12 the SSVS improves the forecast performance among other methods. However, with the lag selected by the AIC, almost half of the MSFEs by VAR with the Minnesota prior have the lowest MSFEs. Compared with the fixed lag length of 12, the lag length chosen by the AIC is shorter than 12 and the MSFEs are smaller than the MSFEs by the models with lag length 12. This indicates that the lag length 12 may be too long, containing unnecessary lags or parameters, though the SSVS is supposed to restrict insignificant coefficients to be zero. This indicates that SSVS is effective in ensuring parsimony in over-parameterized VAR(12) model.

The three variables used in this empirical analysis appear to be nonstationary, and thus transformation to stationary data by taking their first difference is also considered. For this case, the forecasting models are estimated using Δyt instead of yt in eq. (1), then these models are used to compute the forecast of the level of yt + h such as yˆt+h=yt+i=1hΔyˆt+i|t for the iterated forecast, and yˆt+h=yt+Δyˆt+h for the direct forecasts. All elements of the prior mean for the Minnesota prior are set to be zero as Θ¯1=0n since all series are transformed to be stationary by the first differencing. Table 4 presents the MSFEs of the case of the first difference data. The MSFEs of the first difference data tend to have lower MSFEs than the case of the level data, particularly for the inflation rates. These results also indicate that iterated forecasts outperform direct forecasts and the SSVS improves forecast performances than other models though the Minn produces better in some cases.

6. Conclusions

This paper examines comparison of direct and iterated multistep forecasting performance using three estimators for VAR model – the inverted Normal-Wishart (INW) prior, the Minnesota prior and the SSVS prior. Theoretically, direct method is preferable since the direct forecasts are prone to be efficient and more robust to model misspecification. Iterated forecasts are more efficient if the one-step ahead model is not misspecified. Since George et al. (2008) show VAR with SSVS prior greatly improves the one-step ahead forecast, the coefficients are estimated more efficiently and thus an iterated multi-period forecast method would be more efficient than the direct method. So, it is of interest if direct forecasts are compared with iterated forecasts using SSVS VAR model. Although Pesaran et al. (2011) noted that whether the direct or iterated method produced better forecasts is ultimately an empirical question; this paper considers the case of three estimators of VAR for comparison of direct and iterated method using two DGPs and US macroeconomics data. The results are exactly same as Marcellino et al. (2006), that is, iterated forecasts for the INW and SSVS estimators have lower MSFEs than direct forecasts, particularly if the models are with long-lag and longer forecast horizon, while it is ambiguous for the case of the Minnesota prior. The SSVS estimator tends to appreciably improve the forecast performance against other estimators by the INW and the Minnesota prior in most cases.

As an empirical example an application of US macroeconomics is studied to show a benefit of using SSVS prior in a VAR. With longer lags and thus large number of parameters that may include many insignificant, it seems that SSVS alleviates over-parameterization problem in VAR model by restricting insignificant parameters of the model, and enables to improve forecasting performance, although the Minnesota prior also produces smaller MSFEs in some case than SSVS since the Minnesota prior provides shrinkages on the longer lags.

With these results, iterated forecasts are found to produce better forecast performances than direct forecasts, and Bayesian method such as the Minnesota prior model and the SSVS model outperform the INW, particularly with longer lag and longer forecast horizon.

Figures

Data: US unemployment rate, interest rate, inflation rate

Figure 1

Data: US unemployment rate, interest rate, inflation rate

Monte Carlo simulation: average MSFEs

DGP 1DGP 2
Forecast horizonForecast horizon
ModelMethod2481224812
Lag = 4
INWDirect7.35211.6225.3340.508.53613.4819.4024.76
Iterated7.04710.2919.4629.398.27312.3415.8019.39
MinnDirect7.10310.5021.7734.758.85813.5818.4023.15
Iterated7.11410.5221.3135.108.34912.2716.1620.37
SSVSDirect6.51710.1421.9234.288.09112.4717.7422.72
Iterated6.3819.40517.4326.037.72911.7215.3818.87
Lag = 8
INWDirect9.57915.4934.1555.8511.1117.6725.5032.35
Iterated9.03513.1824.5936.9510.5415.6119.0222.29
MinnDirect7.76811.4924.2639.239.82514.8320.1724.58
Iterated8.01312.4327.1240.949.45213.7718.4323.84
SSVSDirect6.78810.8023.8737.538.40713.0518.5023.81
Iterated6.6219.83818.2327.157.96212.0415.7519.26
Lag by AIC
INWDirect9.76616.7444.7479.9511.8618.9827.8840.70
Iterated9.18313.3125.5542.8410.4315.3119.4624.07
MinnDirect7.34011.2525.1941.819.28014.0018.4823.99
Iterated7.55011.5024.1545.298.84113.0117.2521.99
SSVSDirect6.68110.7824.1637.888.43713.1518.6524.22
Iterated6.5959.75018.2327.367.93711.9715.7119.28

MSFE ratio

DGP 1DGP 2
Forecast horizonForecast horizon
Model 2481224812
Lag = 4
INWMean1.0431.1251.2771.3351.0311.0901.2211.269
St dev0.0270.0790.2060.3200.0290.0630.1170.172
HPDI (L)0.9910.9380.9120.8210.9740.9821.0310.915
HPDI (H)1.1001.2721.7902.1621.0901.2421.4861.676
Pr.(<1)0.060.050.070.120.070.090.020.05
MinnMean0.9990.9971.0160.9991.0621.1071.1371.132
St dev0.0310.0710.1930.2940.0350.0630.0970.154
HPDI (L)0.9440.8500.6600.5070.9930.9870.9800.895
HPDI (H)1.0521.1231.4621.6071.1411.2491.3631.410
Pr.(<1)0.500.490.490.510.030.040.040.22
SSVSmean1.0211.0741.2411.2931.0471.0611.1361.173
St dev0.0260.0730.2080.2890.0410.0630.1220.180
HPDI (L)0.9640.9510.8620.8030.9710.9380.9610.892
HPDI (H)1.0731.2481.7222.0661.1351.2031.4691.583
Pr.(<1)0.230.120.080.130.110.170.060.12
Lag = 8
INWMean1.0601.1661.3571.4461.0541.1321.3451.467
St dev0.0400.0960.2530.4210.0340.0720.1800.312
HPDI (L)0.9820.9880.9420.8030.9861.0211.0221.097
HPDI (H)1.1461.3961.9762.8711.1191.3041.7722.059
Pr.(<1)0.070.040.080.100.060.010.010.01
MinnMean0.9701.1660.9050.8341.0401.0791.0921.030
St dev0.0340.0960.2240.3280.0350.0610.1080.176
HPDI (L)0.8990.7500.5370.3070.9660.9750.9150.705
HPDI (H)1.0441.1261.3241.5701.1051.2061.3231.381
Pr.(<1)0.810.840.690.750.150.100.200.41
SSVSMean1.0241.0911.2891.3491.0551.0811.1591.206
St dev0.0270.0730.2110.2990.0430.0620.1220.207
HPDI (L)0.9740.9650.9420.8210.9810.9860.9610.876
HPDI (H)1.0801.2421.7222.0541.1371.2351.4611.774
Pr.(<1)0.140.090.050.090.050.050.050.12
Lag by AIC
INWMean1.0681.2751.7291.8751.1401.2531.4561.687
St dev0.1630.3130.5960.8610.1550.2900.5190.695
HPDI (L)0.7600.7910.9050.8450.8710.7940.6900.787
HPDI (H)1.4661.9743.1364.5561.4591.9572.6263.614
Pr.(<1)0.350.200.040.090.220.200.220.12
MinnMean0.9750.9901.0731.0661.0521.0801.0761.091
St dev0.0550.1300.3050.4540.0610.1040.1280.190
HPDI (L)0.8710.7530.6070.3730.9280.9260.8390.750
HPDI (H)1.0751.2351.7182.2181.2111.3731.3631.500
Pr.(<1)0.720.540.470.500.160.170.290.34
SSVSMean1.0141.1061.3131.3671.0621.0971.1711.224
St dev0.0410.1100.2370.3530.0510.0770.1390.237
HPDI (L)0.9460.9270.9790.7950.9650.9730.9820.914
HPDI (H)1.0871.3481.8772.2511.1871.2731.4601.883
Pr.(<1)0.370.150.070.120.100.120.040.12

MSFEs for US data: level data

Forecast horizon
24812
ModelVariableDirectIteratedDirectIteratedDirectIteratedDirectIterated
Lag = 12
INWUnemp.0.5140.1332.2990.9883.5133.1175.5854.771
Inflation3.6002.0635.8293.79110.938.50720.4911.32
Interest2.7581.4146.0263.30812.6010.7217.2314.87
MinnUnemp.0.5050.1392.2340.9943.3863.0725.5365.274
Inflation2.3791.1844.3722.6889.0926.16116.999.681
Interest1.9540.8854.3432.5499.3157.90813.5611.48
SSVSUnemp.0.3590.0731.3460.6343.8392.1664.2432.808
Inflation2.2521.1364.0852.5149.4436.09815.578.937
Interest2.1331.0014.1122.53010.107.56317.9411.23
Lag by AIC
Average Lag9.2209.9587.4469.9589.7689.9585.8639.958
INWUnemp.0.4490.1131.9100.8013.3192.5075.5713.670
Inflation3.2921.9544.6343.09210.746.00918.536.859
Interest2.5161.2605.3332.89711.668.71813.8111.51
MinnUnemp.0.4350.1201.8890.8183.2232.6565.2954.354
Inflation2.0341.1383.5002.2979.1045.10313.847.159
Interest1.9030.8383.8022.3118.9676.94710.4010.30
SSVSUnemp.0.3510.0721.3890.6193.8272.1424.1292.817
Inflation2.0381.0793.1822.3159.5535.52515.267.755
Interest2.0200.9663.7902.5809.7997.69114.4811.60

MSFEs for US data: first difference data

Forecast horizon
24812
ModelVariableDirectIteratedDirectIteratedDirectIteratedDirectIterated
Lag = 11
INWUnemp.0.3520.1141.2060.7873.3942.6135.3614.104
Inflation3.4271.7233.6122.5824.8294.2688.3255.246
Interest2.0351.4394.2563.1438.9039.31413.6912.37
MinnUnemp.0.3430.1121.1840.7823.3752.6425.2643.898
Inflation1.7390.9652.5791.4693.9202.3365.4212.889
Interest1.7770.8793.4792.3447.1466.48410.068.915
SSVSUnemp.0.3650.0691.2710.5653.2042.3614.6693.913
Inflation1.6110.9062.2171.3683.6512.3854.6842.975
Interest1.7771.0633.6102.4306.9306.8779.78710.08
Lag by AIC
Average Lag7.85110.546.04210.543.49410.541.00010.54
INWUnemp.0.3550.1141.1500.7863.1242.6074.7984.431
Inflation2.0991.9162.6712.5333.8463.9674.7575.200
Interest1.8271.6223.9303.2767.2789.5479.85812.08
MinnUnemp.0.3480.1121.1450.7903.1472.6654.7974.421
Inflation1.6950.9412.4711.3673.4532.5084.5523.249
Interest1.8430.8693.6422.4457.1817.1419.83410.23
SSVSUnemp.0.3600.0671.2870.5743.1962.3174.6343.901
Inflation1.6160.9042.2651.3353.6472.4134.4063.253
Interest1.6441.0223.4942.4936.9796.9719.70410.67

Note

References

Ang, A., Piazzesi, M. and Wei, M. (2006), “What does the yield curve tell us about GDP growth?”, Journal of Econometrics, Vol. 131 Nos 1-2, pp. 359-403.

Bhansali, R.J. (1996), “Asymptotically efficient autoregressive model selection for multistep prediction”, Annals of the Institute of Statistical Mathematics, Vol. 48 No. 3, pp. 577-602.

Bhansali, R. (1997), “Direct autoregressive predictors for multistep prediction: order selection and performance relative to the plug in predictors”, Statistica Sinica, Vol. 7, pp. 425-449.

Chevillon, G. and Hendry, D.F. (2005), “Non-parametric direct multi-step estimation for forecasting economic processes”, International Journal of Forecasting, Vol. 21 No. 2, pp. 201-218.

Clements, M.P. and Hendry, D.F. (1996), “Multi-step estimation for forecasting”, Oxford Bulletin of Economics and Statistics, Vol. 58 No. 4, pp. 657-684.

Cogley, T. and Sargent, T.J. (2005), “Drifts and volatilities: monetary policies and outcomes in the post WWII US”, Review of Economic Dynamics, Vol. 8 No. 2, pp. 262-302.

Doornik, J.A. (2013), Object-Oriented Matrix Programming Using Ox, Timberlake Consultants Press, London.

George, E.I. and McCulloch, R.E. (1997), “Approaches for Bayesian variable selection”, Statistica Sinica, Vol. 7, pp. 339-373.

George, E.I., Sun, D. and Ni, S. (2008), “Bayesian stochastic search for VAR model restrictions”, Journal of Econometrics, Vol. 142 No. 1, pp. 553-580.

Ing, C.-k. (2003), “Multistep prediction in autoregressive processes”, Econometric Theory, Vol. 19 No. 2, pp. 254-279.

Jochmann, M., Koop, G. and Strachan, R.W. (2010), “Bayesian forecasting using stochastic search variable selection in a VAR subject to breaks”, International Journal of Forecasting, Vol. 26 No. 2, pp. 326-347, doi: 10.1016/j.ijforecast.2009.11.002.

Jochmann, M., Koop, G., León-González, R. and Strachan, R.W. (2013), “Stochastic search variable selection in vector error correction models with an application to a model of the UK macroeconomy”, Journal of Applied Econometrics, Vol. 28 No. 4, pp. 62-81.

Kang, I.B. (2003), “Multi-period forecasting using different models for different horizons: an application to US economic time series data”, International Journal of Forecasting, Vol. 19 No. 3, pp. 387-400.

Koop, G., Leon-Gonzalez, R. and Strachan, R.W. (2009), “On the evolution of the monetary policy transmission mechanism”, Journal of Economic Dynamics and Control, Vol. 33 No. 4, pp. 997-1017.

Litterman, R.B. (1986), “Forecasting with Bayesian vector autoregressions: five year experience”, Journal of Business Economic Statistics, Vol. 4 No. 1, pp. 25-38.

Marcellino, M., Stock, J.H. and Watson, M.W. (2006), “A comparison of direct and iterated multistep AR methods for forecasting macroeconomic time series”, Journal of Econometrics, Vol. 135 No. 1, pp. 499-526.

Pesaran, M.H., Pick, A. and Timmermann, A. (2011), “Variable selection, estimation and inference for multi-period forecasting problems”, Journal of Econometrics, Vol. 164 No. 1, pp. 173-187.

Primiceri, G.E. (2005), “Time varying structural vector autoregressions and monetary policy”, Review of Economic Studies, Vol. 72 No. 3, pp. 821-852.

Sugita, K. (2018), Evaluation of forecasting performance using Bayesian stochastic search variable selection in a vector autoregression, Ryukyu Economics Working Paper #1, University of the Ryukyus.

Sugita, K. (2019a), Forecasting with vector autoregressions by Bayesian model averaging, Ryukyu Economics Working Paper #3, University of the Ryukyus.

Sugita, K. (2019b), Forecasting with vector autoregressions using Bayesian variable selection methods: comparison of direct and iterated methods, Ryukyu Economics Working Paper #2, University of the Ryukyus.

Acknowledgements

This work was supported by JSPS KAKENHI grant number 20K01591.

Corresponding author

Katsuhiro Sugita can be contacted at: ksugita@grs.u-ryukyu.ac.jp

Related articles