Granger causality is one of most representative methods to analyze causality between economic variables. It is based on linear vector autoregressive (VAR) models and investigates whether past information is effective for prediction. Although Granger causality is used for various studies, it can be applied to examine only stable linear relationships in the long run. The relationship between economic variables is not necessarily stable in the long run and frequently has time-varying properties. This implies that a causality relationship can also be time-varying, and hence we should take into account the time-varying properties when analyzing a causality relationship.
One method to introduce time-varying properties to Granger causality is through the use of a logistic smooth transition (LST) function. By using an LST function with time as the transition variable, we can test for both smooth and abrupt causalities. When a causality has such nonlinearity, the usual Granger causality tests based on a linear VAR model have low power and tend to give the misleading result of having no causalities in the system.   and  proposed nonlinear causality tests. Their analyses also showed significant nonlinear causality.
While time-varying causality is significant for the precise analysis of variables, heteroskedastic variances influence the tests for causality and nonlinearity such as time- varying properties. For example,  provided Monte Carlo evidence that causality tests have size distortions under heteroskedastic variances. In addition,  and  showed that heteroskedastic variances lead to spurious nonlinearity. Several economic variables investigated using Granger causality have heteroskedastic variances such as stochastic volatility (SV) (e.g.,  and  ). Therefore, if we do not deal appropriately with heteroskedastic variances in the tests for causality, we would not be able to obtain reliable results when examining for time-varying causality. However, previous studies have not clarified the influences of heteroskedastic variances on time-varying causality tests.
This paper investigates the statistical properties of time-varying causality tests when the disturbance terms have SV. The investigated tests include asymptotic tests based on first-order and third-order Taylor approximation and their counterparts with the heteroskedasticity-consistent covariance matrix estimators (HCCME) as introduced by  . As pointed out by  , the order of Taylor approximation affects the performance of linearity tests. We reveal the impact of the order of Taylor approximation on time- varying causality tests in the presence of SV. We also examine the time-varying causality tests using wild bootstrap. Wild bootstrap was proposed by  and replicates a sampling that does not depend on the form of heteroskedastic variances.  and  examine the properties of tests using wild bootstrap. We show which tests perform well even under SV by analyzing the size and power of the tests.
Our simulation results provide evidence that asymptotic time-varying causality tests and their counterparts with HCCME over-reject the null hypothesis of no causality in the presence of SV. This implies that their tests tend to yield misleading and unreliable results. In particular, their tests based on third-order Taylor approximation have larger distortions than those based on first-order Taylor approximation. In contrast, we find that time-varying causality tests using wild bootstrap have reasonable empirical sizes and sufficient power. The results of this paper would enable appropriate and reliable time-varying causality tests.
The rest of this paper is organized as follows. Section 2 presents time-varying causality tests. Section 3 provides the size and power properties of tests. Finally, Section 4 concludes the paper.
2 Time-Varying Causality Tests
We consider the following bivariate vector autoregressive system to test for time-varying causality relationship.
where and are zero mean errors and is a logistic smooth transition function to model time-varying causality. The transition function can be given by
where is a parameter determining the function’s smoothness, t is a transition variable, and c is the point where a regime changes from one to another. We assume that, , and. The system has a causality from to when and. means a time-varying constant. When, the causality from to does not appear because becomes zero. The value of the logistic smooth transition function is bounded between −1/2 and 1/2. When, we
have and. Meanwhile, when, we have and. moves toward −1/2
when and small, and toward 1/2 when and large. The causality is time-varying when depending on the value of. In addition, the time-varying causality using includes abrupt structural changes of causality because is the indicator function taking the value of −1/2 or 1/2 when.
The null and alternative hypotheses to test for time-varying causality in the system are
If, Equation (1) has no causality from to. However, the test is not simple and easy because the null hypothesis has an identification problem about and. They are identified only under the alternative hypothesis with. The identification problem was considered by  and  . To conduct the test in the presence of the identification problem,  proposed a Taylor series approximation. We use first-order and third-order Taylor series approximation around because the performance of the tests depends on the order of Taylor series approximation (e.g.,  ).
The regression models for (1) using the first-order and third-order Taylor series approximation are given by
where is an error term including a remainder term of Taylor series approximation. implies in (5) or in (6). Since we cannot test for directly, we instead test for or for . Denoting, , and as, , and, (5) and (6) can be rewritten respectively as
where, , and.
Testing for time-varying causality is expressed as
The Wald statistics to test for time-varying causality are derived as
where and, and are estimates of and, and is the estimate of the residual variance in each regression. and are matrixes that satisfy and, where and. Under the null hypothesis of no time-varying causality, (11) and (12) follow F distributions with degrees of freedom and, respectively. When we use HCCME for statistics (11) and (12), they are given by
where represents the residual in each regression. Statistics (13) and (14) using HCCME asymptotically have the same distributions as (11) and (12).
Wild bootstrap is also used for regression models with heteroskedastic variances to obtain reliable results. The method can simply resample heteroskedastic variances like SV. This paper employs the recursive-design wild bootstrap. The testing procedure is as follows.
Step 1. Compute test statistics (11) and (12) by applying (7) and (8) to the data.
Step 2. Estimate the system using the restricted model with in (7) and in (8) and obtain the estimate of and residuals denoted as.
Step 3. Obtain the estimates and and the residual, where is the residual of (2).
Step 4. Generate the bootstrapped sample as
where is the estimate of in (7) or (8), , , and and are i.i.d. N(0, 1). and are data generated recursively.
Step 5. Compute test statistics (11) and (12), denoted as WB1 and WB3, by applying (7) and (8) to the generated bootstrap sample.
Step 6. Repeat the bootstrap iterations M for steps 4 and 5. We obtain M statistics WB1 and WB3.
Step 7. Compute the bootstrap p-values as follows:
is an indicator function such that is 1 if is true and 0 otherwise. The null hypothesis is rejected if the p-value is smaller than a significant level.
3. Size and Power Properties
This section conducts Monte Carlo simulations to compare the size and power properties of causality tests under multivariate SV. The nominal size of the tests is 0.05, and we consider sample sizes and 400. Causality tests using wild bootstrap have 1000 bootstrap replications. The number of replications of simulations for all the tests is 10,000. We generate data with and use the data with sample size T. The initial 100 samples are discarded to avoid the effect of initial conditions. We denote the tests compared in this section as F1, F3, HC1, HC3, WB1, and WB3; we also denote the linear Granger causality test as F0, its test with HCCME as HC0, and its test using wild bootstrap as WB0.
We first investigate the size properties based on data generating process (DGP) given as
where and are error terms. We set and with normal error to the following.
The correlation parameter between and is set to and. DGP do not have any causality from to in the system from (19) to (21).
Table 1(a) presents the size properties of tests for normal error. We investigate two cases of and. The results in Table 1(a) indicate that for all the tests, the correlation parameter does not have any influence on size. Linear Granger causality test F0 and its time-varying causality version F1 perform well regardless of the value of. Their rejection frequencies are near the nominal size 0.05. However, we find that F3 has small over-rejections. For example, the size of F3 for, , and is 0.090. Compared with the results between and, F3 has a larger over-rejection for than for. The high persistence of affects the empirical size of F3. The property that F3 has additional regression parameters may lead to size distortions. We find that HC0, HC1, and HC3 perform worse than F0, F1, and F3. The rejection frequencies are larger than 0.05. In particular, the rejection frequency of HC3 is more than 0.1. These results imply that causality tests with HCCME are not useful under normal error. In contrast, the empirical sizes of WB0, WB1, and WB3 using wild bootstrap are close to the nominal size 0.05. While WB3 has small under-rejections, small size distortions are acceptable. Causality tests with wild bootstrap have reasonable empirical sizes.
We next examine the empirical sizes of tests under multivariate SV. The property of SV is that volatility is influenced by an error and changes stochastically. Multivariate SV allows for the correlation between errors of volatilities. and in (21) with SV are generated by
Here, and are given by
The regression parameter in (19) is set to 0.2. and describe the size of volatility persistence. High and indicate persistent volatility. is a parameter to determine the asymmetry of SV. While asymmetric volatility has, symmetric multivariate volatility has the restriction of all. For example, the volatility with and increases when is minus. The multivariate stochastic model is based on  and  .
Table 1(b) presents the size performance of tests under symmetric multivariate SV with and. All are set to zero in (24) and (25). Multivariate SV clearly leads to over-rejection. For example, the empirical sizes of F1, F3, HC1, and HC3 for, , , and are respectively 0.059, 0.090, 0.078, and 0.153 in Table 1(a) but 0.088, 0.127, 0.129, and 0.225 in Table 1(b). In addition, we observe that the correlation between errors affects the size performance of F0, F1, and F3. When compared with the empirical sizes of F0, F1, and F3 for
Table 1. (a) Size properties under normal error; (b) Size properties under symmetric multivariate stochastic volatility; (c) Size properties under asymmetric multivariate stochastic volatility.
and, they have larger over-rejections for and. This is different from the results in Table 1(a). Thus, the correlation between errors increases the over-rejections when the errors have multivariate SV. The results imply that multivariate SV causes size distortions in time-varying causality tests. Possibly, they provide misleading results that indicate a time-varying causality relationship. However, note that WB0, WB1, and WB3 perform better even under SV. The empirical sizes of WB0, WB1, and WB3 for, , and are 0.039, 0.058, and 0.053, respectively.
Asymmetric multivariate SV also results in size distortions for causality tests. We set and to and and to. From Table 1(c), while HC0, HC1, and HC3 have larger rejection frequencies than F0, F1, and F3, their size distortions are smaller than those for symmetric multivariate SV. WB0, WB1, and WB3 show reasonable size performances, as in Table 1(a) and Table 1(b). A comparison of Table 1(b) and Table 1(c) shows that asymmetry of SV does not have a large impact on time-varying causality tests. From the results of empirical sizes, time-varying causality tests using HCCME have a negative influence on empirical sizes. Furthermore, time-varying causality tests F3 and HC3 based on third-order Taylor approximation is inferior to tests F1 and HC1 based on first-order Taylor approximation. Although WB3 tends to have slight under-rejection, WB0, WB1, and WB3 are superior to other tests. In particular, WB0 and WB1 perform best irrespective of type of error.
We next investigate the power properties based on DGP, given as
where c is the point at which a regime changes from one to another. We set c to. We compare the cases of and .
Table 2(a) reports the power performance of tests under normal errors and with in (21). All the tests have a larger power when or than when. All the tests find it more difficult to detect changes only in a constant than only in AR parameter or in both a constant and AR parameter. In addition, the power of most of the tests increases when is large, because a large provides a sharp change in the smooth transition function. F0, HC0, and WB0 have smaller power compared to other tests regardless of the value of, , and. This shows that it is difficult for linear Granger causality tests to detect time-varying causality. F1, HC1, HC3, and WB1 apparently outperform other tests. However, HC1 and HC3 have over-rejections, as shown in Table 1(a). The better power performance of HC1 and HC3 is attributed to over-rejection of the null hypothesis; moreover, they tend to lead to spurious time-varying causality. Although F3 has size distortions under the null hypothesis with normal error, F3 has similar or lower power compared to F1. WB3 has a small under-rejection of the null hypothesis and lower power. These results indicate that time-varying causality tests with third-order Taylor approximation are not advantageous. Accordingly, F1 and WB1 obtain reasonable empirical sizes and better power performance when a variable has time-varying causality.
Table 2(b) presents the power properties under multivariate SV. SV is generated by (22) with and. HC0 and WB0 are naturally inferior to other tests. F0 performs well, unlike the results of Table 2(a). This performance is attributed to the size distortions under SV, as in Table 1(b). The same is true of the results of F1, F3, HC1, and HC3. They outperform other tests, but over-reject the null hypothesis. When DGP have stochastic volatility, they are likely to reject the null hypothesis of no time-varying causality and yield misleading results. It is important to have reasonable and acceptable empirical sizes in order to avoid misleading results. Although the power of WB1 and WB3 are lower than that of F1, F3, HC1, and HC3, they have reasonable and acceptable empirical sizes and lead to reliable results. We see that WB1 has higher power than WB3. The simulation results provide clear evidence that WB1 is more reliable from the perspective of controlling the size and obtaining sufficient power to find time-varying causality regardless of the presence of SV.
This paper investigated the statistical properties of time-varying causality tests when the errors of variables have multivariate SV. It is important to clarify the statistical properties of time-varying causality tests under SV, because economic variables often have SV and the relationship between them is time-varying. The tests we compared include the standard linear Granger causality and the time-varying causality tests, their tests with HCCME, and their tests using wild bootstrap. Simulation results indicate that time-varying causality tests and their counterparts with HCCME have size distortions
Table 2. (a) Power properties under normal error; (b) Power properties under multivariate stochastic volatility.
under highly persistent SV. Standard linear Granger causality tests perform relatively well under SV but has low power under time-varying causality. In contrast, time-varying causality tests using wild bootstrap have better size properties regardless of type of error. In particular, the time-varying causality test with first-order Taylor approximation and wild bootstrap has better statistical properties. These results indicate that the time-varying causality test with first-order Taylor approximation and wild bootstrap is reliable and useful to test for time-varying causality.
*This research was supported by KAKENHI (Grant number: 15K03527).
 Christopoulos, D.K. and León-Ledesma, M.A. (2008) Testing for Granger (Non-)Causality in a Time-Varying Coefficient VAR Model. Journal of Forecasting, 27, 293-303.
 Li, Y. and Shukur, G. (2011) Linear and Nonlinear Causality Tests in an LSTAR Model: Wavelet Decomposition in a Nonlinear Environment. Journal of Statistical Computation and Simulation, 81, 1913-1925.
 Hatemi-J, A. (2012) Asymmetric Causality Tests with an Application. Empirical Economics, 43, 447-456.
 Vilasuso, J. (2001) Causality Tests and Conditional Heteroskedasticity: Monte Carlo Evidence. Journal of Econometrics, 101, 25-35.
 Pavlidis, E., Paya, I. and Peel, D. (2010) Specifying Smooth Transition Regression Models in the Presence of Conditional Heteroskedasticity of Unknown Form. Studies in Nonlinear Dynamics and Econometrics, 14, Article 3.
 Asai, M., McAleer, M. and Yu, J. (2006) Multivariate Stochastic Volatility: A Review. Econometric Reviews, 25, 145-175.
 Vo, M. (2011) Oil and Stock Market Volatility: A Multivariate Stochastic Volatility Perspective. Energy Economics, 33, 956-965.
 White, H. (1980) A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity. Econometrica, 14, 1261-1295.
 Van Dijk, D., Terasvirta, T. and Franses, P.H. (2002) Smooth Transition Autoregressive Models—A Survey of Recent Developments. Econometric Reviews, 21, 1-47.
 Liu, R.Y. (1988) Bootstrap Procedure under Some Non-i.i.d. Models. Annals of Statistics, 16, 1696-1708.
 Davidson, R. and Flachaire, E. (2008) The Wild Bootstrap, Tamed at Last. Journal of Econometrics, 146, 162-169.
 Grobys, K. (2015) Size Distortions of the Wild Bootstrapped HCCME-Based LM Test for Serial Correlation in the Presence of Asymmetric Conditional Heteroscedasticity. Empirical Economics, 48, 1189-1202.
 Davies, R.B. (1977) Hypothesis Testing When a Nuisance Parameter Is Present Only under the Alternative. Biometrika, 64, 247-254.
 Luukkonen, R., Saikkonen, P. and Terasvirta, T. (1988) Testing Linearity against Smooth Transition Autoregressive Models. Biometrika, 70, 491-499.
 Harvey, A., Ruiz, E. and Shephard, N. (1994) Multivariate Stochastic Variance Models. Review of Economic Studies, 61, 247-264.
 Asai, M. and McAleer, M. (2006) Asymmetric Multivariate Stochastic Volatility. Econometric Reviews, 25, 453-473.