Journal Issue

This Time They Are Different: Heterogeneity and Nonlinearity in the Relationship Between Debt and Growth1

Markus Eberhardt, and Andrea Presbitero
Published Date:
December 2013
  • ShareShare
Show Summary Details

1 Introduction

The presence of a common threshold or turning point beyond which the detrimental impact of debt on growth is significant or significantly increases is currently taken as a given within many policy circles: in the United States, although many political battles impinge on the Congressional debate over the debt ceiling and the resulting government shutdown of October 2013, this state of affair at least in parts reflects a widespread belief that ‘debt is dangerous’ and that fiscal austerity represents the only way towards restoring sustainable growth for the world’s largest economy. In the United Kingdom Chancellor George Osborne displays a similar sentiment when telling his annual party conference that dealing with the repercussions of the financial crisis is not over “[u]ntil we’ve fixed the addiction to debt that got this country into this mess in the first place” (official Conservative Party Conference speech, Manchester, September 30, 2013). These strong convictions and ensuing actions were strongly influenced by the work of Carmen Reinhart and Kenneth Rogoff, who were among the first to suggest a debt-to-GDP threshold of around 90% beyond which economic growth is seriously affected by the debt burden (Reinhart & Rogoff, 2009, 2010a,b, 2011; Reinhart, Reinhart & Rogoff, 2012).1

This study is not about the analysis in Reinhart & Rogoff (2010b) – which is primarily descriptive – but about the substantial empirical literature these authors point to as a means of support for their findings (Kumar & Woo, 2010; Cecchetti, Mohanty & Zampolli, 2011; Checherita-Westphal & Rother, 2012). We build on this literature and approach the issue of nonlinearity in the debt-growth relationship with a number of alternative empirical strategies which enable us to distinguish a nonlinearity across countries from a within-country nonlinearity, a key distinction which has so far been entirely absent from the empirical literature. Identifying a within-country threshold effect would indeed inform policy makers of the presence of a country-specific tipping point, which may guide macroeconomic policies and fiscal adjustments. If debt-growth dynamics differ across countries, then the assumption of a common threshold across countries, as is the practice in the existing literature, leads to one-size-fits-all policies which are misleading at best and growth-retarding at worst.

We analyse the debt-growth nexus within a standard neoclassical growth model for aggregate economy data, employing an empirical framework which allows for different long-run equilibrium relationships between debt and growth across countries, while simultaneously accounting for short-run effects and the impact of unobserved global shocks and local spillover effects.2 Using total public debt data from 105 developing, emerging and developed economies over the 1972 to 2009 time horizon we find that long-run debt coefficients differ across countries and provide tentative evidence that countries with higher average debt-to-GDP ratios are more likely to see a negative effect on their long-run growth performance. We can however not find any evidence that a specific debt threshold common to all countries triggers a systematic parameter shift for individual countries as is widely suggested in the existing literature.

Four features of our empirical approach distinguish this study from the literature on debt and growth. First, we employ a flexible dynamic empirical framework which allows us to distinguish the long-run from the short-run relationship between debt and growth. We estimate the long-run and short-run parameters in a standard error correction model (ECM), test for the existence of a long-run equilibrium relationship (cointegration) and investigate concerns over endogeneity in the panel using recent panel time series methods.

Second, we put particular emphasis on modelling the debt-growth relationship as potentially differing across economies in an a priori unspecified way. Given theoretical arguments for structural (parameter) differences across countries, model and specification uncertainty, as well as serious shortcomings in the available data on public debt, we argue that flexibility in the cross-section dimensions of our panel econometric framework represents a crucial requirement and considerable strength when investigating complex entities such as national economies.

Third, we identify the structural parameters of the relationship between debt and growth by accounting for the distorting impact of cross-section dependence in the form of unobserved global shocks and local spillover effects,3 both of which are likely to affect different economies in the sample to a different extent.

Fourth, we investigate the issue of nonlinearities in both the cross- and within-country dimensions, employing novel approaches and diagnostics from the time-series literature adapted for use in the panel. The presence of a nonlinearity across countries is studied by estimating a heterogeneous dynamic ECM and subsequently analysing the cross-country patterns of short-run and long-run debt coefficients. Our analysis of the within-country type of nonlinearity includes two approaches: (i) we investigate an asymmetric dynamic model, where we pick a range of threshold values, including the 90% debt-to-GDP ratio, as potential ‘tipping point’ for our debt-growth analysis; (ii) we present results from static regression models with squared and cubed debt terms. These empirical specifications are informed by testing procedures for variable summability, as well as for balance and co-summability of the empirical specifications: since integration and cointegration are linear concepts we cannot apply these conventional tests to diagnose our nonlinear empirical models and instead adapt these novel time series methodologies for the panel.

The theoretical foundations for a negative and/or possibly non-linear relationship between debt and growth are rather tenuous (see Panizza & Presbitero, 2013, for a recent survey). While some models arrive at a negative long-run relationship (Elmendorf & Mankiw, 1999) which may be more pronounced if higher debt stocks lead to uncertainty or expectations of future financial repression (Cochrane, 2011), there are alternatives which suggest that in the presence of wage rigidities and unemployment this negative relationship disappears (Greiner, 2011). A nonlinearity or debt threshold can be motivated in developing countries by the presence of debt overhang (Krugman, 1988; Sachs, 1989) but it is difficult to extend this argument to advanced economies. Nonlinearities may also arise if there is a tipping point of fiscal sustainability (Ghosh et al., 2013;Greenlaw et al., 2013). However we are not aware of any theoretical models incorporating such debt tipping points in a growth framework.

Given the recent interest in this topic, our paper is naturally far from alone in studying the effect of the fiscal stance on growth in a cross-country regression framework (recent studies include Cordella, Ricci & Ruiz-Arranz, 2010; Kumar & Woo, 2010; Cecchetti, Mohanty & Zampolli, 2011; Checherita-Westphal & Rother, 2012; Panizza & Presbitero, 2012) – we provide a synthetic review of this literature in a Technical Appendix.4 Although individually quite rich in empirical results and proposed robustness checks, four features can broadly distinguish the analysis in these existing studies: (a) the data used (external or total debt) and country coverage (Euro area, OECD economies, developing countries, or emerging and developed countries); (b) the modelling of the hypothesised debt-growth nonlinearity/threshold (linear and squared debt terms in the regression, spline regression using preconceived thresholds, endogenous threshold regression); (c) the proposed time horizon of the results (short-run, long-run debt-growth relationship) depending on static or dynamic empirical specifications or, supposedly, the use of time-averaged or annual data; and (d) the identification strategy (standard IV/2SLS estimators, Arellano & Bond (1991)-type estimators). None of these studies however address more than one or arguably two of the four features we highlighted above (long-run versus short-run, cross-section dependence, cross-country heterogeneity, nonlinearity and asymmetry in integrated and cross-sectionally dependent macro panels), which we argue are of great importance for identification, analysis and interpretation.

In empirical spirit this study is closest to that of Kraay & Nehru (2006, p.342) investigating debt sustainability and arguing that “a common single debt sustainability threshold is not appropriate because it fails to recognize the role of institutions and policies that matter for the likelihood of debt distress”. In a similar vein, Reinhart, Rogoff & Savastano (2003) and Reinhart & Rogoff (2010c, p. 24) suggested that “debt thresholds are importantly country-specific”, while recent papers which emphasise the heterogeneity of the debt-growth nexus across countries (Kourtellos, Stengos & Tan, 2014). Within the wider growth empirics literature, we add to the recent work employing more flexible empirical specifications to account for cross-country correlations in order to identify the substantive relationship of interest (Pedroni, 2007; Eberhardt, Helmers & Strauss, 2013; Eberhardt & Teal, 2014), adopting empirical methods from the panel time series literature (Pesaran, 2006; Kapetanios, Pesaran & Yamagata, 2011; Chudik & Pesaran, 2013). We also provide methodological innovation in transferring the asymmetric cointegration framework (Shin, Yu & Greenwood-Nimmo, 2013) from the single time series setup to the panel and similarly for the analysis of summability, balance and co-summability (Berenguer-Rico & Gonzalo, 2013a,b).

The remainder of this article organised as follows: Section 2 considers how the complexities of the economic theory and data realities should inform our empirical analysis. Section 3 describes our data and provides an overview of the econometric methods we apply. In Section 4 we present our empirical results and detailed analysis of heterogeneity and nonlinearity in the debt-growth relationship across and within countries. Section 5 concludes.

2 Linking Theory and Empirics

2.1 Commonality and Heterogeneity

Two aspects of our approach are related to the modelling of empirical processes as common or different across countries. However, our interpretation of ‘common’ is somewhat different from what one may expect, in that we are concerned about common shocks (examples include the 1970s oil crises or the recent global financial crisis) and their distorting impact on identifying the debt-growth nexus in the data.

We start by providing some simple descriptive analysis highlighting the cross-sectional dependence of debt accumulation across countries. The data and sources are described in detail in sections 3.1 and the Data Appendix. Figure 1 provides a histogram for the years in which countries in our sample reach their debt-to-GDP ratio peak: although there is some heterogeneity as to the sample coverage for this period, it is notable that in over one-third of countries these peaks occurred in only three years, namely 1985, 1994 and 2009. Given that the data stretches over forty years, it is a remarkable indication of common effects across countries that the debt-to-GDP ratio peaks are clustered around a much smaller number of dates.

A second illustration, provided in Figure 2, links the debt-to-GDP ratio peaks for each country to the deviation of per capita GDP growth rate in the ‘peak years’ (defined ad hoc as running from two years prior to two years after the debt-to-GDP maximum) from that of the full time horizon (excluding the five peak years).5 We again highlight observations for the three years 1985, 1994 and 2009, as well as a small number of outliers. We can make a number of observations regarding this crude depiction of our empirical relationship of interest: first, there seems to be a negative correlation between the maximum debt level and relative growth performance between peak debt and other years. However, this negative relationship is not statistically significant (linear regression result reported in the figure footnote). Second, the figure highlights considerable heterogeneity across countries: for instance, among the countries for which debt-to-GDP peaked in 1994 (blue squares), one country experienced growth at around 2% above its growth rate in all other years, while another country experienced a ‘peak years’ average growth rate which was 4% lower.6 Third, and perhaps with view to the present debate in the literature most important, we note the dashed vertical line marking a debt-to-GDP ratio of 90%: a considerable number of countries had better growth performance in their peak debt years than at any other point since 1972, even at what some commentators refer to as ‘dangerous’ levels of debt.

In order to move away from matching single debt observations and average growth rates, we provide further descriptive analysis using interquartile ranges (IQR) for debt and growth. Here we focus on the cross-section variation in these two variables over time, further differentiating countries by income levels (high, middle, low). Figure 3 provides IQRs for debt (grey shading, right axis—three debt peak years highlighted in black) and growth (black whiskers, left axis) across all countries and the three income categories. Note that with the exception of 2009 in the High-Income Country sample, none of the three debt-peak years highlighted before look in any way remarkable, both in terms of debt or growth distribution: clearly while there is some commonality in terms of the timing of debt peaks across a number of countries, there are also other countries for which these years are in no way remarkable, thus reducing the spread of the debt IQR—a clear indication of the heterogeneity of the relationship between debt levels and growth. We can however deduce a pattern whereby the growth rate distribution seems to follow an inverted U-shape over time—this is apparent in the full sample and all three sub-samples—while the distribution of debt, perhaps apart from an initial decline in the early 1970s, does not show any clearly discernable patterns.

In econometric terms we are interested in accounting for the impact of ‘cross-section dependence,’ both in the unobservable as well as the observable parts of our empirical model. The conventional empirical approach adopted in the literature however assumes cross-section independence in the panel, i.e. that regression residuals show no systematic patterns of correlation across countries. The problems arising from such correlation are well-known in the econometric literature (Phillips & Sul, 2003; Andrews, 2005; Pesaran, 2006; Bai, 2009;Pesaran & Tosetti, 2011) but have found only comparatively limited recognition in applied work. We briefly sketch the standard framework from the panel time series literature, the common factor model, and indicate the identification problem arising if common shocks are present. For simplicity we adopt a static model with a single covariate x and a single unobserved common factor f with heterogeneous factor loadings λi

Cross-sectional dependence arises from this error structure and further from the assumption (or rather generalisation) that the same unobserved factors are also affecting the evolution of the covariate (ε it above and eit below are stochastic shocks)

Applied to a discussion of the debt-growth nexus, this setup suggests, quite uncontroversially, that there are unobservable time-invariant (ψt) and time-varying (ft) processes driving output (y), possibly including geography or climate amongst the former and institutions, business environment or intangible capital amongst the latter. ft also represents shocks, such as the 1970s oil crises, which affect all countries in the world (albeit to a different extent) and more localised spillover effects, e.g. productivity spillovers between neighbouring countries. Further, these unobserved processes are also suggested to affect the evolution of the determinants of output (x), including in our model the stock of debt. This is a particularly salient point given the recent experience of the global financial crisis and ensuing debt crises for a number of European economies.

We can now illustrate how the parameter of interest (βi or an average thereof) is unidentified unless the unobserved common factors are accounted for. Solving equation (2) for ft and plugging into (1) obtains

where in principle ζiβi. This idea extends to multiple factors and the multivariate context: if the unobservable ft is merely a ‘weak’ factor (representing only local spillovers between a small number of countries) then the estimate of the βi coefficients or their average may not be seriously biased; however, if we have multiple factors of the ‘weak’ and ‘strong’ type (the latter affecting all countries in the sample), the β coefficient is not identified.7 We can extend this setup by arguing for the following relationship in any observable variable which potentially could be employed as an instrument:

Some observable z is correlated with x and thus a potentially (depending on φi) informative instrument but at the same time via ft is correlated with the unobservables in equation (1) and therefore invalid. The notion that a small number of unobserved common factors drive all the macroeconomic variables underlies the application of principal component analysis in the macro forecasting literature (e.g. Stock & Watson, 2002) and thus does not seem far-fetched at all. The same sentiment is expressed in recent work of applied economists investigating macro panel data (Durlauf, Johnson & Temple, 2005; Clemens & Bazzi, 2013) and the general lack of robustness of IV results in the cross-country growth literature has seriously weakened this literature.

Finally, even ignoring the instrumentation issue, we can see that the specification in equation (1) also creates difficulties for standard pooled estimators which lead to heterogeneity bias (Pesaran & Smith, 1995). One problem here is that pooling introduces data dependencies in the residual terms if variable series are integrated, a data property typically assigned to macro data such as those employed in the analysis of the debt-growth nexus (Lee, Pesaran & Smith, 1997; Pedroni, 2007; Bond, Leblebicioglu & Schiantarelli, 2010). Heterogeneity misspecification enters linear combinations of integrated variables into the error term, raising the potential for spurious regression (Kao, 1999; Phillips & Sul, 2003). Existing research has found very different results when moving away from full sample analysis in homogeneous parameter regression models and investigating sub-samples along geographic, institutional or income lines (International Monetary Fund, 2012; Kourtellos, Stengos & Tan, 2014).

There are a number of reasons to assume the equilibrium relationship between debt and growth differs across countries. First, in line with the ‘new growth’ literature (see Temple, 1999) production technology may differ across countries, and in the same vein the relationship between debt and growth.8 Second, vulnerability to public debt depends not only on debt levels, but also on debt composition (Inter-American Development Bank, 2006). Unfortunately, existing data for the analysis of debt and growth represent a mixture of information relating to general and central government debt, debt in different denominations and with different terms attached (be they explicit or implicit). All of this implies that comparability of the debt data across countries may be compromised (Panizza & Presbitero, 2013). In addition, even assuming that debt stocks are comparable across countries and over time, the possible effect of public debt on GDP may depend on the reason why debt has been accumulated and on whether it has been consumed or invested (and in which economic activities). Third, different stock of debt may impinge differently on economic growth. In particular, one could argue that debt could hinder GDP growth when it becomes unsustainable, affecting interest rates and triggering a financial crisis, thus affecting the level of GDP. However, the capacity to tolerate high debts depends on a number of country-specific characteristics, related to past crises and the macro and institutional framework (Reinhart, Rogoff & Savastano, 2003; Kraay & Nehru, 2006; Manasse & Roubini, 2009). The argument of heterogeneity in the debt-growth relationship is a simple extension, which provides greater modelling flexibility and further allows for empirical testing of the validity of this assumption via residual diagnostics.

2.2 Heterogeneity and Nonlinearity

Following the standard strategy in the microeconomic literature, many empirical studies on the debt-growth nexus either include squared debt terms or use spline specifications in their empirical framework to capture the heterogeneous impact of debt across different levels of indebtedness (recent examples include Cordella, Ricci & Ruiz-Arranz, 2010; Pattillo, Poirson & Ricci, 2011; Checherita-Westphal & Rother, 2012). It is notable that this specification is part of a model which assumes common parameters across countries (pooled model), whereas we have just developed a number of arguments why we may want to investigate the possibility that each country follows a different long-run relationship between debt and growth. Given this innovation, the notion of a non-linearity and/or debt threshold becomes a question about the appropriate data dimension: does the nonlinearity distinguish the debt-growth nexus across different countries or within countries over time?

Beginning with the former, Haque, Pesaran & Sharma (1999) provide a detailed discussion of the consequences of neglected parameter heterogeneity in the context of static and dynamic cross-country savings regressions. They particularly indicate the potential for seeming nonlinear relations as a result of mis-specification, concluding that “[t]he linearity hypothesis may be rejected not because of the existence of a genuine nonlinearity between yit and xit, but due to slope heterogeneity” (Haque, Pesaran & Sharma, 1999, p.11).9

We provide a number of illustrations regarding the potential for heterogeneity misspecification in the debt-growth relationship. Figure 4 plots a fractional polynomial regression line (as well as a 95% confidence interval) for per capita GDP against the debt-to-GDP ratio (both variables in logs) – the former is taken in deviation from the country-specific means (‘within’ transformation) to take account of different income levels across countries and thus focus on changes relative to the country mean.10 As can be seen there is clearly a nonlinear relationship between these two variables, in line with the standard arguments advanced by Reinhart and Rogoff as well as many others discussed above, with a ‘threshold’ of 4.5 log points (equivalent to 90% debt-to-GDP) a distinct possibility: higher debt burden is associated with lower per capita GDP, although this is obviously not a statement regarding causality. In a second plot in the same figure we add the actual observations for this regression in form of a scatter graph—the intention here is to cast some doubt over the ‘very obvious’ nonlinear relationship just discussed. In a third plot we provide country-specific fractional polynomial regression lines for all countries in our sample, while a fourth plot randomly selects thirty countries from the previous plot. In our view this highlights that the seeming nonlinearity assuming a pooled empirical model (black regression line and shaded confidence intervals) is far from obvious when we assume an empirical model which allows the relationship to differ across countries.

Our descriptive analysis thus suggests that the raw data (adopting levels variables to elicit the long-run relationship) shows a clear non-linearity or threshold between the debt-to-GDP ratio and income at around 90% debt burden, provided we assume that all countries in the sample follow the same equilibrium path. However, relaxing this assumption in line with the motivation provided in the previous section seriously challenges this conclusion.

Of course this form of descriptive analysis is highly stylised, not to mention that there are other determinants of economic development and that such plots cannot provide any insights into any potentially causal relationship, be it from debt to growth or vice versa. Although our discussion is by no means conclusive, we feel that the illustrations provided above cast some doubt over the stringent implicit assumptions adopted in most of the existing literature: first, that we can carry out empirical analysis assuming that correlation across countries does not matter when running standard panel regression analysis which assumes cross-section independence. Second, the assumption that all countries, regardless of their level of economic development, their industrial structure or institutional environment, follow the same equilibrium relationship between debt and growth. Third, the notion that all countries are subject to the same debt threshold, beyond which growth is affected detrimentally, which is econometrically implemented by use of exogenous or endogenous debt thresholds or by adopting a polynomial specification for debt within a pooled empirical model, thus providing no insights whether the nonlinearity hides heterogeneity across countries or heterogeneity within countries over time.

Thus our empirical analysis of nonlinearity in the debt-growth nexus begins by considering a nonlinearity across countries. We adopt standard linear regression models, albeit of a fashion which accounts for both observed and unobserved heterogeneity. Identification of the long-run and short-run coefficients on debt is achieved by use of the Pesaran (2006) common correlated effects (CCE) estimator, which accounts for the presence of unobserved heterogeneity through a simple augmentation of the regression equation. Due to the dynamic setup and thus the presence of a lagged dependent variable it is necessary to adjust this augmentation following the suggestions in Chudik & Pesaran (2013). We then analyse the relationship between the estimated long-run coefficients and country-specific averages of debt levels, of debt-to-GDP ratios as well as peak debt-to-GDP ratios.

Next, we consider nonlinearity in the debt-growth nexus at the country-level. We are not the first to consider such an empirical setup: Caner, Grennes & Koehler-Geib (2010) argue that provided a debt-threshold exists, this would arguably differ across countries given the heterogeneity in financial market development, openness, institutional development amongst other causes. Kourtellos, Stengos & Tan (2014) argue that if there exist heterogeneities in the debt-growth relationship (thresholds) then there may be other nonlinearities inherent in the empirical model employed to investigate this phenomenon. They find that while there does not exist a generic threshold or tipping point beyond which debt has a detrimental effect on growth, there does exist such a threshold determined by countries’ level of democracy. The main concern for our empirical analysis here is the most appropriate specification with regards to the time-series properties of the data: reliable inference on a relationship involving variable series which are nonstationary involves establishing that these variables are cointegrated, and within both time series and panel time series econometrics a number of alternative approaches are available to test for this property. Crucially, however, cointegration defines a linear combination of variables integrated of order one (in our case) which is stationary (i.e. integrated of order zero). Difficulties for the analysis of potentially nonlinear relationships such as that between debt and growth arise given that the order of integration of the square or cube of an integrated variable is not defined within the linear integration and cointegration framework. We apply novel methods on the order of summability and the concept of co-summability from the time series econometric literature (Berenguer-Rico & Gonzalo, 2013a,b) to provide pre-estimation testing as to the validity of our empirical equation incorporating country-specific nonlinearities. To the best of our knowledge our study is the first to adopt these methods in the panel context, further addressing the concerns over cross-section dependence.

We adopt two approaches to investigating a nonlinearity at the country level: first we employ the nonlinear dynamic model by Shin, Yu & Greenwood-Nimmo (2013), where following selection of an exogenously given threshold (we focus on 52%, 75% and 90% in the debt-to-GDP ratio) we are able to investigate heterogeneous growth regimes (below and above the threshold) whilst accounting for cross-section dependence. Informed by our (co-)summability analysis our second approach will employ the familiar microeconometric practice of including polynomial terms of the debt stock variable in a static regression model whilst accounting for cross-section dependence.

3 Data and Empirical Strategy

3.1 Data

Our main variables are GDP, capital stock, constructed from gross fixed capital formation using the standard perpetual inventory method and assuming a common and constant 5% depreciation rate, and total public debt stock (all in logarithms of real US$). Data are taken from the World Bank World Development Indicators (WDI) database and, in the case of the debt stock, from an update to Panizza (2008). The debt variable is total (external and domestic) general government debt in nominal terms (face value), in the raw data expressed as a share of GDP. We express all variables in per capita terms, including the debt stock. Our empirical setup thus imposes constant returns to scale, which we believe is a reasonable assumption.

For a small number of empirical results we further make use of cross-section averages for data on trade openness and financial development. Trade openness (imports plus exports as a share of GDP) is taken from the NYU Global Development Network Growth Database – in turn based on the World Bank’s WDI and Global Development Finance databases – while financial development (ratio of bank credit to bank deposits) is taken from Thorsten Beck and Asli Demirguc-Kunt’s Financial Structure Database, updated in 2010.11 A Data Appendix provides more details on the construction of our variables and descriptive statistics. Detailed information about the sample make-up is confined to a Technical Appendix.

In the following we introduce our linear dynamic and asymmetric dynamic models of the debt-growth nexus as well as the concepts of summability, balance and co-summability in some more detail.

3.2 Empirical Specification: Linear Dynamic Model

The basic equation of interest in our analysis of the debt-development nexus is a static neoclassical production function augmented with a debt stock term:

where y is aggregate GDP, cap is capital stock and debt is the total debt stock – all variables are in logarithms of per capita terms, imposing constant returns to scale. Our specification of endogenous TFP in the form of common factors does however allow for externalities at the local and/or global level. These variables constitute the observable processes captured in our model, with their parameter coefficients βij (for j=K, D) allowed to differ across countries12—this heterogeneity is a central feature of our empirical setup as motivated in the previous section. In addition we include country-specific Total Factor Productivity (TFP) levels (αt) and a set of common factors ft with country-specific ‘factor loadings’ λt to account for the evolution of unobservable TFP over time. The common factors can be a combination of ‘strong’ factors, representing global shocks such as the recent financial crisis, the 1970s oil crises or the emergence of China as a major economic power; and ‘weak’ factors, capturing local spillover effects following channels determined by shared culture heritage, geographic proximity, economic and social interaction (Chudik, Pesaran & Tosetti, 2011). We assume that these unobservable factors not only drive our measure of output, but also the other covariates in the above model:13 this provides for a standard endogeneity problem whereby the βij parameters are not identified unless some means to account for the unobservable factors in the error term u is found. At the same time this endogenises TFP and allows for externalities of production. We will return to the identification strategy in our discussion of the empirical implementation below. Suffice to highlight that standard instrumentation in a pooled empirical framework is invalid in the present setup as we cannot obtain instruments which are both informative and valid due to the omnipresence of unobserved factors, and/or the underlying equilibrium relationship differing across countries. Finally, we allow for the unobserved factors to be nonstationary, which has important implications for empirical analysis since all observable and unobservable processes in the model are now integrated and standard inference is invalid (Kao, 1999).

Given the importance of time series properties and dynamics in macro panel analysis, we employ an error correction model (ECM) representation of the above substantive equation of interest. This offers at least three advantages over a static model such as the above or restricted dynamic specifications such as those commonly investigated in the literature:14 (i) we can readily distinguish short-run from long-run behaviour; (ii) we can investigate the error correction term and deduce the speed of adjustment for the economy to the long-run equilibrium; and (iii) we can test for cointegration in the ECM by closer investigation of the statistical significance of the error correction term. The ECM representation of the above model is as follows:

where the βij in equation (6) represent the long-run equilibrium relationship between GDP (y) and the measures for capital and debt in our model, while the γij represent the short-run relations. ρi indicates the speed of convergence of the economy to its long-run equilibrium. The term in round brackets represents the candidate cointegrating relationship we seek to identify in our panel time series approach. In equation (7) we have simply relaxed the ‘common factor restriction’ implicit in the nonlinear relationship between parameters in equation (6) and reparameterized the model to highlight that from the coefficients on the ‘levels’ terms (πij for j = K, D) we can back out the long-run parameters

whereas from the coefficient on the terms in first difference (πim for m = k, d, lowercase to distinguish from the long-run coefficients) we can read off the short-run parameters directly. πiEC indicates the speed at which the economy returns to the long-run equilibrium, with a half-life (in our data: in years) computable as (log(0.5)/log(1+πiEC)). Inference on this πiEC parameter will provide insights into the presence of a long-run equilibrium relationship: if πiEC=ρi=0 we have no cointegration and the model reduces to a regression with variables in first differences (i.e. the term in brackets in equation (6) drops out). If πiEC=ρi0 we observe ‘error correction’, i.e. following a shock the economy returns to the long-run equilibrium path, and thus cointegration between the variables and processes in round brackets/levels. Note that we have included the unobservable common factors f in our long-run equation: this implies that we seek to investigate cointegration between output, capital, debt and TFP.

In the spirit of Banerjee & Carrion-i-Silvestre (2011) we employ cross-section averages of all variables in the model to replace unobservables as well as omitted elements of the cointegration relationship.

Recent work by Chudik & Pesaran (2013) has highlighted that this approach is subject to small sample bias, in particular for moderate time series dimensions. Furthermore, these authors relax the assumption of strict exogeneity and thus allow for feedback between (in our application) debt, capital stock and output, which provides a more serious challenge to the original Pesaran (2006) approach:

If αi1 = 0 we maintain the assumption of strict exogeneity and we can proceed with the standard CCE augmentation, whereas if αi1 ≠ 0 this approach is only valid if and only if the dynamic common factor restrictions hold. Chudik & Pesaran (2013) provide the following empirical strategy employing cross-section averages in the presence of weakly exogenous regressors: in addition to the cross-section averages detailed in equation (8) they suggest (i) the inclusion of lags of the cross-section averages, in our ECM setup

and (ii) the inclusion of cross-section averages of one or more further covariates (other than cap and debt) which may help identify the unobserved common factors in the spirit of Pesaran, Smith & Yamagata (2013), in our ECM setup

for covariate z and similarly for further covariates. Chudik & Pesaran (2013) show that once augmented with a sufficient number of lagged cross-section averages (p = T can be employed as a rule of thumb) the CCE mean group estimator performs well even in a dynamic model with weakly exogenous regressors.

Our empirical framework and implementation thus provide a great deal of flexibility to aid our attempts in capturing the long-run and short-run relationships between debt and growth across a set of diverse economies. We do not claim that our empirical approach is ‘superior’ to the existing literature by pointing to asymptotic results in econometric theory or Monte Carlo simulation studies of known data generating processes. Instead, we highlight a set of assumptions which different empirical implementations make and provide diagnostic tests as to the validity of these assumptions. The comparison of results from different empirical estimators presented below does not constitute an exercise in data mining until the desired result emerges, but an attempt at testing the explicit and implicit assumptions made in each empirical model.

An important feature of the empirical implementation adopted here is that all models are estimated by OLS: modelling features such as nonstationarity, cross-section correlation, heterogeneity in the equilibrium relationship across countries and nonlinearity/asymmetry in the long-run and/or short-run relationship are captured by the empirical specification and the use of additional terms in the regression equation. While estimation is thus relatively straightforward, we rely on simulated critical values for various inferential and diagnostic statistics.

3.3 Empirical Specification: Weak Exogeneity Testing

In our factor model setup we have emphasised one type of endogeneity, whereby common factors drive both inputs and output, leading to identification issues unless the factors are accounted for. In the present context, a second form of endogeneity which implies reverse causality is deemed of particular importance for the interpretation of the empirical results: can we argue our empirical model derived from a neoclassical production function augmented with debt is correctly specified, or do we estimate a disguised demand equation for debt or investment?15 We adjust the basic common factor model introduced in equations (1) and (2) accordingly:

for a single covariate x and single factor f contained in both the y – and x-equations. Due to the presence of ψiεit in the second equation we should be concerned over whether y ‘causes’ x or the reverse being the case or both. The standard approach in the literature has been to instrument for x using one or a set of variables z which satisfy the conditions of informativeness (𝔼[zx] ≠ 0) and validity (𝔼[] = 0). Having adopted a panel time series approach the issues of endogeneity and direction of causation can here take an alternative pathway: provided our variables are nonstationary and cointegrated, we can then apply a test for weak exogeneity. This test, described in detail below, can help us determine whether our empirical results can be interpreted as arising from a production function, rather than a misspecified input demand function. This identification strategy is not as clean as microeconometric alternatives, such as a controlled or natural experiment and instrumental variable estimation.16 We argue that neither of these strategies are suitable in a macroeconomic context: experiments may provide insights into a unique episode or a single country experience, but arguably lack the external validity by necessity required in answering our research question. We already argued above that instrument validity is difficult to justify in macro panel analyses of a globalised world. With the empirical questions addressed in this study in mind we believe our empirical strategy is the best we can do.

Provided there exists a cointegrating relationship between variables the Granger Representation Theorem (Engle & Granger, 1987) states that these series can be represented in the form of a dynamic ECM. Generically, for a pair of cointegrated variables x and y we can write

where êt−1 represents the ‘disequilibrium term’ e^=yβ^ixd^ constructed using the estimated cointegrating relationship between these two variables (d represents deterministic terms). Equations (12) and (13) further include lagged differences of the variables in the cointegrating relationship. In the above example there are only two equations, since we have two variables in the cointegrating relationship. The Granger Representation Theorem implies that for a long-run equilibrium relationship to exist between y and x at least one of λ1i and λ2i must be non-zero: if (and only if) λ1i = 0 then x has a causal impact on y, if (and only if) λ2i ≠ 0 then the causal impact is reversed. If both λ1i and λ2i are non-zero they determine each other jointly.

3.4 Empirical Specification: Asymmetric Dynamic Model

We follow the discussion in Shin, Yu & Greenwood-Nimmo (2013) and define the asymmetric long-run regression model

where we again assume observable and unobservable processes are nonstationary and debt stock has been decomposed into debtit=debti0+debtit++debtit. The latter two terms are partial sums of values above and below a specific threshold, debti0 has been subsumed into the constant term. For instance, if we assume a threshold of zero then they define positive and negative changes in debt accumulation. For each country i let

This setup would suit the analysis of an asymmetric response to debt accumulation and relief, whereby the hypothesised substantial growth benefits of debt relief could be shown to be questionable given a the differential relationship between debt accumulation and growth on the one hand and debt reduction and growth on the other. In our present study we instead create partial sums for debt stock below and above a number of (exogenously determined) debt-to-GDP ratio thresholds, namely 52% (sample median), 75% and the ‘canonical’ 90%. Thus the partial sums are constructed from the per capita debt stock variable, while the assignment to one or the other regime is determined by the debt-to-GDP ratio – we adopt this practice in order to be able to compare our results with those in the literature adopting the debt-to-GDP ratio as the primary variable of interest.

The ECM version of our asymmetric dynamic model is thus

The dynamic asymmetry can be included in the long-run relationship (lagged levels terms), in the short-run behaviour (first difference terms) or both. As before we allow for cross-country heterogeneity in all long-run and short-run parameters and account for the presence of unobserved time-varying heterogeneity by augmenting the country regressions with cross-section averages of the dependent and independent variables. While in the original Shin, Yu & Greenwood-Nimmo (2013) time series approach the parameter estimates are identified by augmentation of the empirical equation with additional lagged differences, our panel approach relies on the common factor framework as developed in Section 2 above for identification. The same issues as highlighted in Section 3.2 apply and we shall augment the estimation equation with further lags of the cross-section averages (Chudik & Pesaran, 2013). Note that the implementation raises a number of problems in the case where the debt threshold is relatively high: if only a very small number/share of observations for a specific country are above the threshold, then the estimated coefficient may be very imprecise. In order to guard against this we present results of the estimated long-run debt parameters in the low and high debt regimes only for those countries where at least 20% of all time series observations are in one regime. For the 90% debt/GDP threshold this amounts to a total of 30 countries, 45 countries in case of the 75% threshold and 55 countries for the 52% threshold.17

3.5 Empirical Specification: Order of Summability, Balancedness and Co-Summability

The previous two sections provided empirical specifications either without country-specific non-linearities or where the within-country non-linearity was modelled as an asymmetry. In this section we discuss the fundamental difficulties arising for conventional empirical analysis when assuming a non-linear model in the presence of integrated variables and introduce a novel time series approach to deal with these issues. Suppose a single time series relationship yt = f (xt, θ) + ut for a nonstationary covariate xtI (1), stationary ut and some non-linear function f (·).18 In this context, it becomes difficult to apply our standard notion of integration to f (·), given that integration is a linear concept: although we may be able to determine the order of integration of xt, the order of integration of f (xt, θ) (and thus yt) may not be well defined for many non-linear transformations f (·). Assuming for illustration f(xt)=θxt2 we can make this point somewhat clearer: let xt = xt−1 + εt and st ∼ i.i.d.(0,σɛ2), then we know that

In words, we can show that the Engle & Granger (1987) characterisation of a stationary process holds for Δxt (finite variance is one of five characteristics, albeit the crucial one for our illustration), such that xt can be concluded to follow an I(1) process. Now investigate the same property for Δxt2:

We can see that the finite variance characteristic is violated, given that it is a function of time t – further differencing does not change this outcome. Although we can define xt within the integration framework, we cannot state the order of integration of xt2, which creates fundamental problems if the empirical analysis of yt = f (xt, θ) + ut is to be based on arguments of cointegration.

Berenguer-Rico & Gonzalo (2013b) develop an alternative approach, based on the ‘order of summability’S (δ) of linear or non-linear processes: “[t]he order of summability, 5, gives a summary measure of the stochastic properties – such as persistence – of the time series without relying on linear structures” (p.3). Using OLS we estimate for each country i

where k=1,,T,Yik*=YikYi1,Uik*=UikUi1andYik=log(t=1k(yitmt))2, with mt the country-specific partial mean of yit, namely mt=(1/t)j=1tyj. This is the definition for mt in the ‘intercept only’ case. Given the trending nature of our data we further investigate the ‘constant and linear trend’ case, where mt=(1/t)j=1tyij(2/t)j=1t(yij(1/j)l=1jyil). This implies

from which we then obtain our estimate of the order of summability δ^i*=(β^i*1)/2. This approach essentially investigates the rate of convergence of a rescaled sum constructed from the variable series yit. In the single time series inference can be established using confidence intervals constructed via estimation in subsamples; here, in the panel, where there is no natural ordering of countries in the cross-section dimension we take random draws of N countries (and in each country the full time series T), each time capturing the mean and median summability statistic, to create subsample estimates for inference.

It bears emphasising that summability is a more general concept than integration, but that that latter is closely related to the former in the following fashion: if a time series xt is integrated of order d, I(d) with d ≥ 0, then it is also summable of order d, S (d). It is the breakdown of the reverse of this condition in cases where xt is a nonlinear transformation which necessitates our adoption of the concept of summability. In our empirical application we will analyse the order of summability of all variables entering the polynomial specifications.

Next, in analogy to the analysis of integrated variables, the ‘balance’ of the empirical relationship needs to be tested, namely the condition that both sides of the empirical equation of interest have the same order of summability: S (δy) = S (δz) for z = f (xt, θ) = θf (xt) – see below for a comment on the linearity in parameters we assume here. Such a test of balance is equivalent to testing the null of βni ≡ (βyiβzi) = 0 in the country-specific regression

where Yyik* is for the LHS variable y and defined as in the summability analysis above, and Yzik* is the partially demeaned sum of all RHS processes Yzik=log(t=1k(zitmt))2, accounting for initial conditions in the same fashion as above by taking the deviation from the first observation. In practice, all elements of z (RHS variables) are summed, appropriately partially demeaned and their estimated order of summability is subtracted from that for y and the result divided by 2.19 Again inference in the single time series test is based on subsample estimation. In the panel we employ the same strategy to create subsample estimates and thus confidence bands as detailed above. Under the null of balance the resulting confidence interval includes zero and balancedness is a necessary but not sufficient condition for a valid empirical specification.20

Finally, let êt be the OLS residuals from a balanced country-specific regression yit=θ^g(xit)+e^it, then ‘strong co-summability’ will imply the order of summability of e^it,S(δe^it), is statistically close to zero. We employ the above approach to estimate the order of summability for êit which enables us to determine whether our balanced model is co-summable or not. Note that the residual series êit as defined above will sum to zero by default of the least squares principle, we therefore in practice do not subtract the estimate for the intercept term in each country regression. Inference in the original time series and in our panel application follow the same principles as the previous two testing procedures.

The above routines imply a sequence of tests (summability, balance, co-summability) which in principle bear close resemblance to the integration-cointegration concepts and testing procedures. The simplicity of the above approach is marred by the presence of deterministic components in the variable evolution. Intercept and trend terms are addressed by repeated partial demeaning of the variable series as suggested in Berenguer-Rico & Gonzalo (2013b).21 We assume non-linearity in variables but not in parameters:

The econometric theory of the approach is at present being extended to nonlinearity in parameters. However, the restriction to linearity in parameters is in line with the standard implementations in the literature adopting debt thresholds (endogenous or endogenous debt/GDP threshold with subsequent analysis splitting observations into separate below/above threshold values/terms) or nonlinearities through polynomial functions (linear, squared and cubed debt terms).

We provide an extension to the above panel versions of the balance and co-summability tests, whereby in the spirit of the recent panel time series literature we include the cross-section averages (CA) of all variables in the specification of the empirical test (Pesaran, 2006; Chudik & Pesaran, 2013). The motivation for this approach is the same we provided for our panel models above: country-by-country investigation of the variable and specification properties assumes these to be cross-sectionally independent. Both theorising and empirical practice have shown that in a globalising world where countries trade and are subject to similar social, economic and/or cultural heritage this assumption is likely to be violated.

We adopt two variants of the cross-section average augmentation: (i) a standard approach such as that outlined above, (ii) an approach where in addition to the CA of all model variables we also include the CA of ‘other covariates,’ similar to the approach in the dynamic heterogeneous panel estimations (Chudik & Pesaran, 2013).

3.6 Empirical Specification: Nonlinear Static Model

Following the investigation of summability, balance and co-summability in a flexible nonlinear model we estimate static models with polynomial approximations to unknown nonlinear functions:

We begin by reporting results for a static linear model, then introduce the squared debt term, and finally the cubed debt term. Due to the assumption of cross-country parameter heterogeneity on the one hand, and the reliance of the Mean Group-type estimation approach on parameter averages across a subset of countries on the other, the averaged results from this exercise do not tell the whole picture, given that we average across heterogeneous linear, U-shaped, inverted-U-shaped etc. specifications at the country-level. We provide the average results as a benchmark for comparison with existing work, along with diagnostic tests and descriptive information as to the general patterns of debt-growth relationships contained in the panel.

4 Empirical Results

4.1 Initial Analysis

We carried out panel unit root tests following Pesaran (2007) and investigated the cross-section correlation properties of the raw data including formal CD tests following Pesaran (2004). Results are provided in a technical appendix and indicate that the levels variable series are integrated of order 1 and subject to considerable cross-section dependence.

We conduct a similar descriptive analysis to that pursued in Reinhart & Rogoff (2010b) for our sample of countries, with results presented in Figure 5: within each income group (High, Upper- and Lower-Middle, Low Income) all observations are divided into four bins based on the debt-to-GDP ratio. Although arguably not as clearcut as these authors’ illustration for a set of developed and emerging economies, the means (dark grey bars) and medians (light grey) for different income groups by level of indebtedness may be taken as evidence for a differential growth performance beyond a 90% debt-to-GDP threshold, at least for the high-, lower middle- and low-income samples. We now provide empirical evidence that this descriptive result is misleading.

4.2 Results: Linear Dynamic Models

Table 1 presents results derived from an ECM specification, with results for a standard two-way fixed effects and pooled CCE in columns [1] and [2] assuming parameter homogeneity across countries and all other models in columns [3]-[10] allowing for differential relationships (we report robust mean estimates). The models in columns [4] and [5] represent the standard CCE estimator in the Mean Group version, while models in columns [6], [7] and [9] add further lags of the cross-section averages as suggested in Chudik & Pesaran (2013). Models in columns [8]-[10] experiment with the cross-section averages and lags of additional covariates outside the model: we adopt proxies for trade openness (‘open’) and financial development (‘findev’), both in logs. These variables only enter the empirical model in form of their cross-section averages. The aim here is to help identify the unobserved common factors ft, which represent global shocks and local spillover effects,22 so that adopting variables which are directly linked to globalisation was deemed a suitable choice here.

In each model we focus on the long-run estimates as well as the coefficient on the lagged level of GDP to investigate error correction – full ECM results are available on request. LRA refers to the ‘long-run average’ coefficient, which is calculated directly from the pooled model ECM results in [1] and [2] and the weighted averages – we follow standard practice in this literature and employ robust regression (see Hamilton, 1992) to weigh down outliers in the computation of the averages – of the heterogeneous model ECM results in [3]-[10]. LRA standard errors are computed via the Delta method. ALR refers to the ‘average long-run’ coefficient in the heterogeneous models, whereby the long-run coefficients are computed from the ECM results in each country and then averaged across the panel.23 In the ALR case standard errors are constructed following Pesaran & Smith (1995). For all heterogeneous models which address concerns over cross-section dependence there is evidence of error correction – the lagged GDP pc levels variable is highly statistically significant – and the average long-run coefficients appear statistically significant and positive throughout, whereas short-run coefficients are insignificant. The latter does not imply the absence of any significant effects, but rather highlights the heterogeneity across countries with dynamics on average cancelling out. Coefficients on lagged per capita GDP levels imply reasonable estimates for the speed of convergence, with a half-life of just under a year in most CMG specifications.24 Diagnostic tests highlight that the use of cross-section averages considerably reduces residual cross-section dependence – the CD statistic drops from 18 in the MG to between 2 and 3 in the CMG models. Based on work by Bailey, Kapetanios & Pesaran (2012) it is suggested that the implicit null hypothesis of this test is weak (rather than strong) cross-section dependence (Pesaran, 2013) – recall that dependence of the weak type only affects inference, whereas strong dependence can lead to an identification problem.25

Once we move from a pooled to a heterogenous parameter specification, statistically significant positive average long-run coefficients as we find in our sample only provide insights regarding the central tendency of the panel. This result may indicate that, on average, the countries in our sample are on the ‘right’ side of an hypothetical Debt Laffer curve. This hardly surprising as the median debt-to-GDP ratio is around 50% (Table A1), a value well below the ‘tipping points’ identified by the literature on developing and advanced economies (see Table TA1). In Figure 6 we therefore provide a number of plots indicating the cross-section dispersion of the long-run debt coefficients, primarily focusing on the estimates in the dynamic CCE model with one additional lag (column [6] of Table 1) given its favourable diagnostic results. With the exception of panel (b) all plots capture the country-specific average debt-to-GDP ratio over the entire sample period (in logs) on the x-axis and estimated debt-coefficients on the y-axis (all long-run except for panel (f), which plots short-run coefficients). Panel (a) suggests that there is a nonlinear relationship between the debt-to-GDP ratio and the long-run impact of debt, which around 90% debt-to-GDP turns negative. Panel (c) makes the same point grouping countries into quintiles based on average debt/GDP ratio and providing distributional plots for each of them (group #5 represents debt burden over 90% of GDP).

Panel (b) however cautions against this conclusion: instead of average debt-to-GDP ratio we plot here the debt-to-GDP ratio peak for each country. It is notable that many countries still have positive coefficients despite peak debt-to-GDP ratios in excess of 90%. Panel (d) splits the data into the 25% richest countries and the rest – the nonlinearity between debt burden and the long-run debt coefficient across countries seems to primarily be driven by the poorer countries in the sample. Panels (e) and (f) provide fitted fractional polynomial regression lines for the CMG models in Table 1 for which the residual CD test is below 3: [4]-[7] and [10]. With regard to long-run results in panel (e), the average relationship emerging seems to be fairly robust to the choice of empirical specification. There is no evidence for any systematic heterogeneity in the short-run coefficients presented in panel (f).

We thus find some tentative evidence for a nonlinearity in the long-run relationship between debt and growth across countries. We can be reasonably certain that these empirical models represent cointegrating relationships between debt and income, but this does not rule out the possibility of feedback from income to debt, which would question the validity of our empirical results. As a next step we therefore turn to weak exogeneity testings for all of our heterogeneous parameter models.

4.3 Results: Weak Exogeneity Testing

In Table 2 we present the results for the MG and various CMG models – models refer to the column numbering in Table 1. For each estimator we provide weak exogeneity tests using specifications with one or two lags, in each case providing three sets of results: for an output equation, a capital stock equation and a debt stock equation. If our suggestion that the empirical models analysed represent augmented production functions, rather than investment demand or debt demand equations, thus (informally) allowing us to argue for a causal relation from capital and debt stock to output and not vice-versa, we would expect a pattern whereby the various test statistics for the output equation reject the null of no causal relation from ‘inputs’ to output, whereas those in the two ‘input’ equations cannot reject their respective nulls. Taking in the results as a whole, there appears to be fairly strong evidence for the setup described: p-values for the statistic constructed from averaged t-statistics are typically below 10 percent in the output and close to unity in the input equations; the t-statistics on the averaged λi coefficients are typically very large in the former and typically below 1.96 in the latter.

The purpose of the analysis in this section was to investigate the possibility of a nonlinear relationship between the debt burden and the long-run debt coefficient in the cross-country dimension. A number of empirical models including dynamic CCE which allows for cross-section dependence in a dynamic model were evaluated and while the empirical results are somewhat fragile in a moderate-T sample, one might conclude that on balance there is some evidence for heterogeneity in the long-run coefficients across countries. We now turn to empirical models which allow for heterogeneous long-run relations across countries while at the same time allowing for thresholds in the relationship within countries over time, which represents a direct test of the consensus of a common threshold effect as propagated in the existing empirical literature.

4.4 Results: Asymmetric Dynamic Models

In Figure 7 we present results from the asymmetric (heterogeneous) dynamic regressions where we account for unobserved common factors by inclusion of cross-section averages of all covariates as well as one further lag of the cross-section averages. The three plots correspond to subsamples for an adopted threshold of 52% (top), 75% (middle) and 90% (bottom) for the debt-to-GDP ratio – in each case we only include countries which have at least 20% of their observations in one of the two regimes (below/above threshold), amounting to 55, 45 and 30 for the three thresholds, respectively. Empirical results on which these graphs are based can be found in Table 3: model [5] with one additional lag and asymmetry in the long- and short-run specification.

The x-axis in each plot represents the average debt burden over the entire time horizon, expressed as the average debt-to-GDP ratio (in logs) in the left column and, like in our regressions, as the total debt stock per worker (in logs) in the right column – in both sets of plots the left tip of each arrow represents the average value for the ‘low debt’ regime where debt is below 52%, 75% or 90% of GDP, while the right arrow tip marks the average value for the ‘high debt’ regime above these thresholds. The y-axis in each plot captures the estimated long-run debt coefficient which by construction is allowed to differ across regimes (and countries). Under the working hypothesis that a shift to the ‘high debt’ regime would have a negative, step-change type impact on long-run growth, we would expect most arrows to indicate a negative relationship. As can be seen, this hypothesis is not borne out by the empirical results: there is no evidence for any systematic change in the relationship between debt and growth when countries shift from a ‘low’ to ‘high’ debt regime, with only around one in two countries experiencing an increase in the debt coefficient.26 Average coefficient changes in each of the three cases are statistically insignificant (standard or robust means).

Thus our first test of within-country threshold effects in the debt-growth relationship suggests that the consensus in the empirical literature of a common debt threshold does not hold up for the cutoffs tested if we allow for observed and unobserved heterogeneity across countries. Before we move on to investigate the same issue with an alternative approach, adopting static polynomial specifications of the debt-growth nexus, we first provide results from the summability, balance and co-summability analysis which will determine the preferred polynomial specification.

4.5 Results: Summability, Balancedness and Co-Summability

Table 4 presents the summability results, with models assuming a constant term in the left panel and constant and trend terms in the right panel, with the latter a more natural choice given the trending nature of our data. It appears that all of the variables investigated reject summability of order 0, S(0), which justifies our concern about time series properties – recall the analogy with unit root tests, whereby integrated data of order 1 or higher provides evidence for nonstationarity. In the lower panel we carry out summability testing for the growth rates of per capita GDP, debt stock and capital stock. For the former two we can broadly conclude that these first difference series are S(0), while the capital stock growth rate appears to reject this null hypothesis.

Table 5 presents the results from balance tests, with (unaugmented) ‘standard’ specifications in Panel A, specifications augmented in the common correlated effect fashion in Panel B and specifications which further add cross-section averages from two ‘openness’ variables in Panel C. Recall that for the two sides of the equation to be balanced, i.e. be made up of variables with the same order of summability, the balance statistic should be close to zero. We highlight all those specifications where this requirement is statistically rejected by underlining the estimate and 95% confidence bands. In each of the three panels we provide results for a specification with a constant and a specification with a constant and trend term, where again the latter appears a priori the more suitable choice. Across all three panels there is relatively strong evidence for the linear specification to represent a balanced model, with mean and median estimates for the balance statistics close to zero. There is comparatively less evidence for the two nonlinear specifications, with only the median estimates and 95% confidence intervals for the Model with linear, squared and cubed debt terms containing zero. Having said that, the rejection of the null of equal order of summability on both sides of the model equation is marginal in the specification with linear and squared debt terms of both Panels B and C.

In the co-summability results presented in Table 6 we highlight those specifications for which balance was somewhat uncertain by printing them in grey, whereas the specifications which were confirmed as balanced are printed in black. We again have three blocks of results, for a standard panel version of co-summability (equivalent to Panel A in the balance results in Table 5), for a version which includes cross-section averages of all model variables (Panel B) and for a version which in addition to these cross-section averages includes those from ‘other covariates’ (Panel C). None of the specifications without cross-section averages is co-summable, and the estimated test statistics – summability statistics for model residuals – are some distance away from zero (which would signify co-summability). Results for the specifications with cross-section averages are noticeably closer to zero, but still reject co-summability in the linear model. Results for the final set of specifications which include further cross-section averages in the empirical model then move even closer to zero, with the linear specification now co-summable if we focus on the median statistic. The nonlinear models in this case also appear to be co-summable, however it bears reminding that there was comparatively less evidence for these models to be balanced, which as a prerequisite for co-summability renders these models at best as uncertain with regard to the presence or absence of a long-run equilibrium relationship.

We draw three conclusions from this analysis: first, there is strong evidence for significant persistence in the data investigated, which as argued above may seriously impact estimation and inference. Second, it appears that results from an approach which assumes cross-section independence yields very different results from one which relaxes this assumption. In the context of the recent panel econometric literature this finding is not at all surprising, given the importance of accounting for cross-section correlation in the analysis of macro panel datasets. Further investigation of this result is beyond the scope of this article and left for future research. Third, the only empirical model tested for which we found fairly convincing evidence of it representing a balanced and co-summable specification is the linear model augmented with standard and additional cross-section averages. There is less convincing evidence for nonlinear models, even though some only fail the balance tests marginally. It bears reminding that the purpose of this exercise was to identify linear or nonlinear specifications which represent long-run equilibrium relationships.27 Whatever the identification strategy of existing studies in the literature, these results suggest that the adoption of linear and squared debt terms in a flexible specification to model debt thresholds may represent a seriously misspecified empirical model which could lead to spurious regression results.

4.6 Results: Nonlinear Static Models

We present the averaged debt coefficients from static production function models in various specifications in Table 7.28 All of the models presented are heterogeneous parameter specifications, but we also investigated various pooled model specifications (Pooled OLS, Fixed Effects, CCE Pooled) and found strong evidence of nonstationary residuals in these models, thus highlighting the potentially spurious nature of estimates from pooled empirical models (results available on request).

The nine models for which results are presented in Table 7 correspond to the same nine tested for balance and co-summability (Table 6), namely standard Mean Group (MG) and Common Correlated Effects Mean Group (CMG) estimates, alongside CMG estimates where we further added cross-section averages of two ‘openness’ variables in the empirical specification (CMG+). Of these we only found strong evidence for the linear model in column [7] to represent a balanced and co-summable specification. Average estimates for the linear specifications in Table 7 indicate a negative relationship in the MG and no substantive relationship in the two CMG models, while the models with linear and squared debt terms on average indicate a concave relationship in all three models. The nonlinear model including a cubed debt term is on average statistically insignificant in the MG and CMG models but not in the CMG+ model.

Based on residual diagnostic – residual series from all models were found to be stationarity – we can see how the MG models suffer from very serious residual cross-section dependence (CD test statistics in excess of 20), which is dramatically reduced for the CMG models (around 3.5 to 4.5, thus still rejecting cross-section independence) and the CMG+ models. The latter could be argued to be largely free from cross-section dependence of the strong type, so that any remaining error dependence is not likely to seriously affect estimation (Pesaran, 2013). We also report the number of countries in each estimator for which we found statistically significant debt coefficients. Although we cannot trust country-specific estimates in this empirical approach, this certainly highlights the heterogeneity in the country-specific results, with no linear or nonlinear relationship for the debt-growth nexus emerging as clearly dominant: for instance, in the linear models we find similar numbers of positive and negative slope coefficients on debt once we account for the distorting impact of cross-section correlation. In the models with linear and squared debt there is more evidence for concave relations – in line with the Reinhart & Rogoff (2010b) debt threshold story – but it would be difficult to claim that that this result is uniform across all countries. In the models with three debt terms the averaged coefficients hide a great deal of heterogeneity across countries. Thus on the whole we cannot provide any support for the notion that countries possess similar or even identical nonlinearities in the debt-growth relationship over time once we relax the assumption of common parameters across countries.

5 Concluding Remarks

This article empirically investigates the relationship between public debt and long-run growth and provides important insights for the current debate on threshold effects in the debt-growth nexus sparked by the work of Carmen Reinhart and Kenneth Rogoff (Reinhart & Rogoff, 2009, 2010a,b, 2011). Our paper makes three contributions to this empirical literature: first, we investigated the long-run relationship by means of a dynamic empirical model and adopted time series arguments to establish the presence of a long-run equilibrium, taking into account possible endogeneity issues. Since estimation results are likely to be spurious and seriously biased if these well-known data properties are not recognised and addressed in the empirical analysis our approach signals a significant departure from the standard empirical modelling in this literature and arguably provides more reliable estimates. Second, we adopted empirical specifications which allowed for heterogeneity in the long-run relationship across countries, thus reflecting a host of theoretical and empirical arguments. This heterogeneity in the specification extends to the relevant unobservable determinants of growth and debt burden, which we have addressed by means of a flexible common factor model framework. Ours is the first panel study on debt and growth to address parameter heterogeneity and cross-section dependence, thus allowing for a closer match between economic theory and data restrictions on the one hand and empirical modelling on the other. Third, we used a number of novel empirical estimators and testing procedures to shed light on the potential nonlinearity in the debt-growth relationship, focusing on both the possibility of a debt-growth nonlinearity across and within countries, a distinction previously entirely absent from the empirical literature. It bears emphasising that no empirical study modelling the debt-growth relationship in a pooled panel model can claim to be able to distinguish these two types of nonlinearity.

Our empirical analysis provided some evidence for systematic differences in the debt-growth relationship across countries, but no evidence for systematic within-country nonlinearities in the debt-growth relationship for all countries in our sample. With regard to the first result we observed that long-run debt coefficients appeared to be lower in countries with higher average debt burden, although the average long-run debt coefficient across countries was positive. Regarding the second result, empirical tests seemed to support a linear specification rather than the polynomial specifications popular in the empirical literature. When we employed piecewise linear specifications adopting various pre-specified thresholds, the change in the debt coefficient at the threshold was just as likely to be positive as negative. These findings imply that whatever the shape and form of the debt-growth relationship, it differs across countries, so that appropriate policies for one country may be seriously misguided in another. The commonly found 90% debt threshold is likely to be the outcome of empirical misspecification – a pooled instead of heterogeneous model – and subsequently a misinterpretation of the results, whereby it is assumed that pooled model estimates – obtained from polynomial or piecewise linear specifications for debt – imply that a common nonlinearity detected applies within all countries over time.


Figure 1:Peak Debt/GDP Ratio Distribution

Notes: The histogram indicates the distribution of peak years for the debt-to-GDP ratio in our sample of 105 countries. Three years, 1985, 1994 and 2009 account for over one third of all debt/GDP peaks.

Figure 2:Peak Debt/GDP Ratio and Relative Growth

Notes: Along the x-axis we arrange countries by the value of the maximum debt-to-GDP ratio (in logarithms), highlighting three years in particular: 1985 (Triangles: 9/105 countries), 1994 (Squares: 17/105), and 2009 (Diamonds: 11/105). All other years (68/105) are indicated with hollow circles. Along the y-axis we plot the deviation of countries’ (i) average per capita growth rate in the five years around their peak debt year (i.e. peak debt occurs in year 3) from (ii) their average per capita growth rate over the entire time horizon 1972-2009 excluding the five ‘peak debt years.’ For peak debt year 2009 we only construct the growth average from 2007-2009 observations, similarly for 1972 and 1973 peak debt years. A simple (outlier-robust) linear regression of average per capita growth rates on debt-to-GDP peaks (in logarithms) yields the following result (absolute t-ratios in brackets): .011 [0.54] – .005 [1.12] log(debt/GDP)imax

Figure 3:Interquartile ranges—Growth and Indebtedness

Notes: Each plot shows the interquartile range for GDP per capita growth and the debt/GDP ratio in the year indicated. The three years during which debt/GDP ratios peaked in a substantial number of countries are highlighted. Note that data coverage varies across the sample: for High Income Countries (N = 29) the plot covers 70% of countries in the early 1970s, rising to 80% or more in the late 1970s and early 1980s, and in excess of 90% for the remainder of the period of observation; For Middle Income Countries (N = 53) the 1970s cover a minimum of 60% of countries, rising to more than 80% in the 1980s and in excess of 90% from the early 1990s onwards; for Low Income Countries (N = 23) coverage is poor till 1982 (32%-56%) when over 70% of countries are covered, rising to 90% by the mid-1980s and beyond until 2007 when coverage drops to 68% and then 60% until the end of the sample period.

Figure 4:Nonlinearities in the Country-Specific Debt-Income Nexus

Notes: This figure plots the unconditional relation between debt/GDP ratio and within-transformed per capita GDP (both in logs). We employ fractional polynomial regression (solid regression line; shaded 95% confidence intervals) for all observations – see Footnote 10 for details on how we limit the sample to aid presentation. In the right plot of the first row we also provide a scatter of all observations; in the left plot of the second row we instead add fractional polynomial regression lines estimated for each country separately, while in the right plot of the same row we pick these for 30 countries at random.

Figure 5:The Rogoff and Reinhart (2010) approach in our dataset

Notes: In each plot the light-grey bars represent median growth rates, the dark-grey bars the mean growth rates (both left axis), the black line the share of total observations (right axis) for each group respectively. For High Income Countries we have a total of 1,001 observations (29 countries), for the Upper Middle Income, Lower Middle Income and Low Income Countries these figures are 752 (23 countries), 1,021 (30 countries) and 685 (23 countries), respectively. Income classification follows the World Bank approach.

Figure 6:Patterns for CMG debt coefficients

Notes: We plot the country specific long-run coefficients for debt in each country, taken from the dynamic CMG model with one additional lag (in column [6] of Table 1) against (a) the country-specific average debt/GDP ratio (in logs), and (b) the country-specific peak value for debt/GDP (in logs)—for both plots we reduce the number of countries as detailed below to improve illustration. In both cases we added fitted fractional polynomial regression lines along with 5% and 95% confidence bands (shaded area). We further provide (c) box plots for all 105 country-estimates divided into quintiles of the average country debt/GDP ratio distribution—outliers are omitted from these box plots and we focus on the medians and interquartile ranges (shaded). In (d) we split the sample into the top 25% and bottom 75% by average income and fit fractional polynomial regression lines alongside 5% and 95% confidence bands for each grouping (reduced sample in the plot for illustration). The final set of plots in (e) and (f) presents fitted fractional polynomial regression lines of long-run and short-run debt coefficients against average debt/GDP ratio for all CMG models (columns [4]-[10]), respectively. In each case (as in the first two scatter plots) we omit the observations with average debt/GDP ratio below 12% (ARE, CHN, LUX) as well as countries with absolute long-run debt coefficients (ALR) over 0.5 resulting in 92 [4], 95 [5], 92 [6], 91 [7], 95 [8], 94 [9] and 93 [10] out of 105 observations. This practice excludes the following country estimates in four or more of the seven models: GNB, GUY, HUN, IRL, SLE, SYC, TGO. In all plots we add a horizontal line to mark zero, in most plots we also add a vertical line at 4.5 log points (=90%) of the debt/GDP ratio.

Figure 7:Debt Coefficient Comparison: three debt-to-GDP thresholds

Notes: We plot the long-run debt coefficients in the low and high debt regime for (top) 52%, (middle) 75% and (bottom) 90% debt/GDP thresholds. In each case we use the CCEMG results with one additional lag of cross-section averages (model [5] in Table 3) for 55, 44 and 29 countries, respectively—countries are only included if they have at least 20% of their observations in one of the two regimes (below/above threshold). The values on the x-axis represents the average debt/GDP ratio and the average debt stock per capita (both in logarithms) for the lower and higher regimes (average over all years in each regime), in the left and right plot respectively. We carried out empirical tests for statistical significance of average coefficient changes at each threshold and report the mean and robust mean estimates together with respective t-ratios.

Table 1:Linear Dynamic Models
additional covariate(s) †openopenopen, findev
additional lagged CAone lagtwo lagsone lag
country trends ‡××
Debt coefficients
ALR threshold96.3 %90.2%73.4%22.5%0.3%126.7%28.4%18.5%
EC coefficient
Implied half-life (years)6.061.671.040.850.650.780.650.740.620.69
CD test−0.8912.5718.
Notes: Results for full sample of JV = 105 countries, based on an error correction model with the first difference of log real GDP per worker as dependent variable. We report the robust mean of coefficients across countries in the heterogeneous parameter models in [3]-[10] (Hamilton, 1992) unless indicated; standard errors in these models are constructed following Pesaran & Smith (1995). † The CMG estimator (Pesaran, 2006; Chudik & Pesaran, 2013) is implemented using further cross-section averages (CA) of (a) additional lags and/or (b) other variables (open -trade/GDP, findev – financial development, both in logs) as indicated – see main text for details. $ These models are augmented with country-specific linear trend terms. ‘LRA’ refers to the long-run average coefficient, which is calculated directly from the pooled model ECM results in [1] and [2] and the averages of the heterogeneous model ECM results (standard errors computed via the Delta method) in [3]-[10]. ALR’ refers to the average long-run coefficient in the heterogeneous models, whereby the long-run coefficients are computed from the ECM results in each country and then averaged across the panel. ‘SR’ refers to the short-run coefficients. ALR threshold’ indicates the cross-country implied threshold (i.e. where the impact of debt becomes negative) derived from a linear regression of the long-run debt coefficient on average debt-to-GDP ratio over the entire time period, b The first set of t-statistics are non-parametric statistics derived from the country-specific coefficients following Pesaran & Smith (1995). The second set represent averages across country-specific t-statistics. *, ** and *** indicate significance at 10%, 5% and 1% level respectively. RMSE is the root mean squared error, CD test reports the Pesaran (2004) test, which under the null of cross-section independence is distributed standard normal.
Table 2:Weak Exogeneity Testing
Panel A: Without CAPanel B: With CA
ModelEquationlagsGM-tpAvgλ^it-statGM-tpAvg λ^it-stat
MG [3]Output1−2.540.01−0.928−21.98
Debt stock10.090.930.2541.71
Debt stock20.120.900.2391.11
CMG [4]Output1−2.230.03−0.842−19.32−2.060.04−0.800−17.00
Debt stock10.300.760.6493.700.170.860.4902.56
Debt stock20.280.780.7123.33−0.010.990.4051.54
CMG with trend [5]Output1−2.560.01−1.000−24.87−2.440.01−0.956−20.92
Debt stock10.240.810.4542.800.160.870.3691.92
Debt stock20.210.840.5352.620.020.980.3060.98
CMG with 1 add. lag [6]Output1−1.880.06−0.773−18.04−1.730.08−0.742−16.72
Debt stock10.290.770.5252.630.230.820.5442.36
Debt stock20.290.770.6623.080.050.960.3581.51
CMG with 2 add. lags [7]Output1−1.730.08−0.815−17.08−1.680.09−0.813−19.46
Debt stock10.160.870.3251.520.160.880.3861.58
Debt stock20.160.870.3481.450.011.000.0120.05
CMG with 1 add. covariate [8]Output1−2.060.04−0.815−18.06−1.930.05−0.764−16.25
Debt stock10.320.750.5913.150.210.840.4562.22
Debt stock20.280.780.6082.750.030.970.3080.99
CMG with 1 add. covariate & lag [9]Output1−1.710.09−0.802−15.00−1.610.11−0.737−13.93
Debt stock10.300.770.3711.740.270.790.4361.91
Debt stock20.310.760.5052.140.110.910.2340.78
CMG with 2 add. covariates [10]Output1−1.930.05−0.826−17.07−1.800.07−0.740−15.31
Debt stock10.340.740.6633.410.240.810.5602.77
Debt stock20.310.760.5872.490.060.960.2240.81
Notes: Numbers in brackets correspond to the columns in Table 1. For the tests in Panel B cross-section averages of all variables are added to the estimation equation, whereas in Panel A we do not include these. All results are for N = 105. Equation refers to the ECM regression where the named variable is on the LHS, lags reports the number of lagged differences included in the regression. GM-t gives the group-mean average of country-specific t-ratios for the coefficient on the disequilibrium term (λ^i) which is distributed N (0,1), p indicates the corresponding p-value. Avg λ^i refers to the robust mean coefficient on the ECM term, t-stat the corresponding t-statistic. Underlined p-values or ‘robust’ t-statistics indicate evidence against the hypothesis of a well-specified production function.
Table 3:Asymmetric Dynamic Models
country trends××
lagged CA1 lag2 lags
52% threshold
ALR debt >52% GDP−0.038−0.004−0.049−0.0140.0080.028
ALR debt <52% GDP0.002−0.009−0.0150.0140.0120.020
CD Test10.973.953.853.455.035.88
Obs (N)1,873 (55)1,873 (55)1,873 (55)1,873 (55)1,804 (55)1,768 (55)
75% threshold
ALR debt >75% GDP−0.053−0.032−0.040−0.0230.0130.013
ALR debt <75% GDP−0.0460.002−0.014−0.0030.018−0.004
CD Test3.461.78−0.321.721.360.97
Obs (N)1,509 (45)1,509 (45)1,509 (45)1,509 (45)1,434 (44)1,402 (44)
90% threshold
ALR debt >90% GDP−0.0010.003−0.021−0.0100.0690.037
ALR debt <90% GDP−0.0050.0540.0010.0840.0490.120
CD Test0.780.42−0.58−0.25−1.35−1.06
Obs (N)996 (30)996 (30)996 (30)996 (30)940 (29)921 (29)
Notes: We present average long-run coefficients (based on country-specific long-run results) for debt from models which allow for asymmetry in the debt coefficients, adapted to the panel from the time-series approach by Shin, Yu & Greenwood-Nimmo (2013). The dependent variable is the GDP per capita growth rate. Three thresholds are adopted to split the data into two (high/low debt) ‘regimes’: 52% (sample median), 75% and 90% debt/GDP ratio. Countries are only included in the analysis if they have at least 20% of their observations in one of the two regimes (below/above threshold), resulting in 55, 45 and 30 countries, respectively. Models [2]-[6] add cross-section averages to the regressions, those in [5] and [6] further add lags of the cross-section averages in the spirit of Chudik & Pesaran (2013). All models allow for long-run (LR) and short-run (SR) asymmetry, with the exception of model [4], which only allows for long-run asymmetry. ♭ For the coefficient on lagged GDP per capita we report the Pesaran & Smith (1995) nonparametric t-statistics as well as the average of country-specific t-statistics (t), the CD Test is distributed N (0,1) under the null of cross-section independence. RMSE is the root-mean squared error.
Table 4:Estimated Order of Summability
Deterministics: ConstantDeterministics: Constant and Trend
Lower CI band0.9481.0111.0271.0451.2170.3920.6380.6120.4680.745
Upper CI band1.2431.3251.3341.3641.4971.1801.1011.0481.0771.411
Lower CI band0.9600.9991.0261.0341.0840.4140.6260.6140.4680.719
Upper CI band1.2571.2721.2861.3151.4871.1231.0201.0251.0981.376
Notes: The table presents the panel statistics for N = 105 country-specific estimates of the order of summability δ^* (see main text for further details). All variables are in logarithms. We account for the constant term by partial demeaning, and for the additional linear trend term by double partial demeaning as detailed in Berenguer-Rico & Gonzalo (2013b). For each variable we present two sets of statistics: the upper (lower) panel presents mean (median) δ^* across the panel as well as the mean-(median-)based subsampling results (lower and upper 95% confidence bands). Each of the N−b + 1 = 95 subsamples of size b=int(N)+1=11 countries is a random draw of countries from our full sample of N = 105.
Deterministics: ConstantDeterministics: Constant and Trend
Lower CI band−0.0480.0900.094−0.725−0.5370.202
Upper CI band0.1710.3510.4020.0790.1900.580
Lower CI band−0.1460.0170.066−0.751−0.6300.083
Upper CI band0.1460.3470.3590.0430.2200.484
Notes: The table presents the panel statistics for N = 105 country-specific estimates of the order of summability δ^* (see main text for further details). All variables are in logarithms. We account for the constant term by partial demeaning, and for the additional linear trend term by double partial demeaning as detailed in Berenguer-Rico & Gonzalo (2013b). For each variable we present two sets of statistics: the upper (lower) panel presents mean (median) δ^* across the panel as well as the mean-(median-)based subsampling results (lower and upper 95% confidence bands). Each of the N−b + 1 = 95 subsamples of size b=int(N)+1=11 countries is a random draw of countries from our full sample of N = 105.
Table 5:Estimated Balance
Panel A – Standard Specification
DeterministicsConstantConstant & Trend
Nonlinearity-debtit2debtit2, debtit3-debtit2debtit2, debtit3
Lower CI band−0.316−0.274−0.278Lower CI band−0.1730.0390.078
Upper CI band0.0230.0790.066Upper CI band0.3380.7330.908
Lower CI band-0.348−0.295−0.357Lower CI band−0.1340.0030.014
Upper CI band-0.0330.1000.069Upper CI band0.3570.6880.893
Notes: The table presents distributional statistics for N = 105 country-specific estimates of the balance in the indicated regression models (δ^yδ^g) (see main text for further details). The RHS of each model always includes capit and debtit. All variables are in logarithms. We account for a constant term by partial demeaning, and for an additional linear trend term by repeated partial demeaning as detailed in Berenguer-Rico & Gonzalo (2013b). CA refers to the augmentation of the static country regression with cross-section averages following Pesaran (2006): in Panel B we include the model variables, in Panel C we include CA of the openness and financial development variables in addition to the model variables. Underlined mean or median balance statistics indicate evidence against the hypothesis of a balanced regression model, (δ^yδ^g)=0.
Panel B – Specification with CA
DeterministicsConstantConstant & Trend
Nonlinearity-debtit2debtit2, debtit3-debtit2debtit2, debtit3
Lower CI band-0.265-0.260-0.281Lower CI band−0.2180.0020.023
Upper CI band-0.029-0.012-0.017Upper CI band0.3510.8050.918
Lower CI band-0.301-0.312-0.342Lower CI band−0.1410.030−0.009
Upper CI band-0.031-0.045-0.089Upper CI band0.3640.7640.908
Notes: The table presents distributional statistics for N = 105 country-specific estimates of the balance in the indicated regression models (δ^yδ^g) (see main text for further details). The RHS of each model always includes capit and debtit. All variables are in logarithms. We account for a constant term by partial demeaning, and for an additional linear trend term by repeated partial demeaning as detailed in Berenguer-Rico & Gonzalo (2013b). CA refers to the augmentation of the static country regression with cross-section averages following Pesaran (2006): in Panel B we include the model variables, in Panel C we include CA of the openness and financial development variables in addition to the model variables. Underlined mean or median balance statistics indicate evidence against the hypothesis of a balanced regression model, (δ^yδ^g)=0.
Panel C – Specification with Additional CA
DeterministicsConstantConstant & Trend
Nonlinearity-debtit2debtit2, debtit3-debtit2debtit2, debtit3
Lower CI band-0.354-0.264-0.281Lower CI band−0.2200.0010.022
Upper CI band-0.110-0.021-0.017Upper CI band0.3160.7990.917
Lower CI band-0.317-0.317-0.343Lower CI band−0.1670.004−0.008
Upper CI band-0.060-0.050-0.089Upper CI band0.3330.7770.907
Notes: The table presents distributional statistics for N = 105 country-specific estimates of the balance in the indicated regression models (δ^yδ^g) (see main text for further details). The RHS of each model always includes capit and debtit. All variables are in logarithms. We account for a constant term by partial demeaning, and for an additional linear trend term by repeated partial demeaning as detailed in Berenguer-Rico & Gonzalo (2013b). CA refers to the augmentation of the static country regression with cross-section averages following Pesaran (2006): in Panel B we include the model variables, in Panel C we include CA of the openness and financial development variables in addition to the model variables. Underlined mean or median balance statistics indicate evidence against the hypothesis of a balanced regression model, (δ^yδ^g)=0.
Table 6:Co-Summability
Standard SpecificationSpecification with CA
Nonlinearity-debtit2debtit2, debtit3-debtit2debtit2, debtit3
Lower CI band0.7920.7750.979Lower CI band0.1090.030−0.012
Upper CI band1.0651.0381.396Upper CI band0.4310.3900.456
Lower CI band0.6700.7890.951Lower CI band0.093−0.118−0.264
Upper CI band1.0751.1141.309Upper CI band0.4610.3090.200
Notes: The table presents distributional statistics for N = 105 country-specific order of summability estimates for the respective model residuals. The RHS of each model always includes capit and debtit. All variables are in logarithms. CA refers to the augmentation of the static country regression with cross-section averages following Pesaran (2006), ‘Additional CA’ refer to the CA for the openness and financial development variables (in logs) as described in the main text. We account for a constant term by partial demeaning as detailed in Berenguer-Rico & Gonzalo (2013b). Underlined mean or median co-summability statistics indicate evidence against the hypothesis of a co-summable model specification, δ^=0. Since co-summability is conditional on balance, we only print those specifications in black for which we have convincing evidence from the balance testing in Table 5, whereas all other specifications are printed in grey.
Specification with Additional CA
Nonlinearity-debtit2debtit2, debtit3
Lower CI band0.052−0.051−0.089
Upper CI band0.3600.3220.374
Lower CI band−0.016−0.244−0.314
Upper CI band0.3360.1200.060
Notes: The table presents distributional statistics for N = 105 country-specific order of summability estimates for the respective model residuals. The RHS of each model always includes capit and debtit. All variables are in logarithms. CA refers to the augmentation of the static country regression with cross-section averages following Pesaran (2006), ‘Additional CA’ refer to the CA for the openness and financial development variables (in logs) as described in the main text. We account for a constant term by partial demeaning as detailed in Berenguer-Rico & Gonzalo (2013b). Underlined mean or median co-summability statistics indicate evidence against the hypothesis of a co-summable model specification, δ^=0. Since co-summability is conditional on balance, we only print those specifications in black for which we have convincing evidence from the balance testing in Table 5, whereas all other specifications are printed in grey.
Table 7:Static Linear and Nonlinear Models
CANo augmentationStandard CA
Bal & Co-Sum
# of Countries/10 \46⋃19 ⋂38~32/18 \22⋃13 ⋂27~28
Diagnostics ‡
CD Test25.6925.8522.974.484.063.48
CAAdd CA of additional covariates
Bal & Co-Sum×
# of Countries/18 \24⋃16 ⋂28~25
CD Test1.871.691.98
Notes: We report the estimates and diagnostic tests for static production functions with linear and polynomial debt terms. All estimates are robust means (see Table 1). The MG models further include trend terms, we also omitted to report the averaged capital stock coefficients and constant terms in all models (available on request). ‘Bal & Co-Sum’ indicates the specification which was found to be balanced and co-summable (Tables 5 and 6). † We report the number of countries with statistically significant (5% level) positive and negative debt coefficient using / and \, respectively; ⋃ and ⋂ report the number of countries with statistically significant (5% level) convex and concave debt-growth relationships, respectively. ~ reports the number of countries for which all three debt terms are statistically significant at the 5% level. ‡ All residual series were found to be stationary.
Data Appendix

A-I Data construction

The principle data sources for our empirical analysis are the World Bank World Development Indicators and an update to the dataset provided by Panizza (2008). From the former we take real GDP in year 2000 US$ values, the per capita series of the same variable, population as well as gross fixed capital formation (investment) as a share of GDP. The Panizza data provides total debt series, comprising domestic and external debt, in face value terms as a percentage of GDP, enabling us to construct the real debt stock series.

With the investment series we can construct real capital stock by adopting the perpetual inventory method with a standard annual depreciation rate of 5%. If country series contained gaps of less than three years length we used cubic spline interpolation. This resulted in changes to a total of 53 observations in 19 country series,29 thus an average of 2.8 per country. This amounts to a total of 1.36% of our full sample data. Note that this interpolation does not affect the overall sample size, since the observations in question are also missing for GDP and other variables; it does however aid the construction of the capital stock series.

In the process of constructing the capital stock series we investigated a number of basic magnitudes, including the investment-to-GDP ratio in 1970 (found to be between 10 and 40%) and the capital-output ratio in 1970 (found to be between 1.5 and 4.7), which were all within reasonable bounds and thus no adjustments were made. We did however limit our analysis to countries with at least 21 years of data, which effectively included any transition economy as well as a small number of African and Latin American countries.30 The final sample contains 3,485 observations (dynamic specification) from 105 countries (23 Low-Income, 30 Lower Middle-Income, 23 Upper Middle-Income and 29 High Income countries based on World Bank classification), thus on average 33.2 years per country (range of 21 to 38 country observations) from 1972 to 2009.

Some of the empirical models make use of the cross-section averages of two proxies for trade openness and financial development: we use the data series provided by Bill Easterly for the former (exports plus imports as a share of GDP – data is from the WDI and Global Development Finance) and by Thorsten Beck and Asli Demirguc-Kunt (Financial Structure Database) for the latter (ratio of bank credit to bank deposits). Both raw variables are in logs.

We present descriptive statistics for our sample in Table A1. Detailed information about the sample makeup is confined to a Technical Appendix.

A-II Descriptive Statistics

Table A1:Descriptive Statistics
Panel A: Raw Variables and Transformations
GDP growth%age growth rate3.4293.6734.821−69.81233.280
GDP per capitalevel6,8441,9509,57210257,215
GDP pc growth%age growth rate1.6372.0504.780−63.28532.091
Population growth%age growth rate1.7921.8241.265−8.27114.734
Investment/GDP ratio%age share of GDP21.73021.1077.1583.41276.693
Capital Stocklevel7.01E+112.97E+102.55E+124.21E+082.91E+13
Capital Stock growth%age growth rate3.5573.1582.761−3.62928.813
Capital Stock per capitalevel1.99E+045.22E+032.88E+041.89E+021.56E+05
Capital Stock pc growth%age growth rate1.7651.6902.783−9.60725.157
Debt (total)level1.54E+116.98E+097.35E+119.34E+061.06E+13
Debt growth%age growth rate4.9244.03417.748−142.053136.954
Debt (total) per capitalevel3.52E+031.01E+036.37E+032.56E+008.33E+04
Debt pc growth%age growth rate3.1322.49217.749−144.855133.551
Debt/GDP ratio%age share of GDP62.17751.72849.6180.971470.610
Panel B: Regression Variables (in logs or first differences of logs)
Δ yit0.0160.0200.048−0.6330.321
Notes: We present descriptive statistics for the full sample of 3,485 observations from N = 105 countries (average T=33.2). In Panel A we added a number of standard transformations of the data applied, e.g. the debt/GDP ratio and the investment/GDP ratio as well as per capita GDP and its growth rate. Some of these variables are applied in the post-estimation analysis. in Panel B we present descriptives for the error correction model regression variables, namely Δyit—GDP per capita growth rate, yit - 1—lagged level of GDP per capita (in logs), capit-1—lagged level of capital stock per capita (in logs), debtit-1—lagged level of debt stock per capita (in logs), Δcapit—growth rate of capital stock per capita, Δdebtit—growth rate of debt stock per capita.
Technical Appendix—not intended for publication

TA-I Selective Review of the Empirical Literature

In the following we provide a selective review of recent studies in the empirical literature of debt and growth. Table TA1 provides an overview of characteristics related to the sample (N) and its makeup, the period and time-series dimension of the data (T) and whether and how any data aggregation over time was carried out: until the most recent contributions which use annual data all studies investigated averaged the data over time, in line with the standard practice in the cross-country growth literature. Further details provided cover the empirical model setup, namely the dependent variable and covariates (including proxies for debt stock and debt service), as well as the empirical specification. We focus on the most general parametric and semi-parametric results in each paper. Most studies reviewed carried out a large number of regressions (robustness checks), including some adopting nonparametric methods; results for these are sometimes indicated but we abstract from a more detailed discussion for conciseness. The final column of the table indicates which specific regression results we base our discussion on. Regarding the variables entering the model there are minor differences across studies, although investment, trade openness (trade/GDP) and a measure of human capital are typically included. With regard to the latter, it is particularly notable that all papers reviewed adopt a pooled partial adjustment model (PAM) which includes some lagged level of GDP or GDP per capita as covariate and GDP growth or the per capita equivalent as the dependent variable. Pooled here indicates that it is assumed the equilibrium relationship is the same across all countries in the sample. Implementations are again typical of the standard in the cross-country growth literature, including OLS, FE and various instrumentation strategies (including Arellano and Bond (1991)-type estimators). All studies consider nonlinearities in the debt-growth relationship.

Table TA1:Selective Review of the Literature
AuthorsSamplePeriodAveragingDep. Var.Debt stockDebt serviceCovariatesfModelEstimator(s)NonlinearityReference
Clements, Bhattacharya & Nguyen (2003)55 LICs (n≈270)1970-993-yearΔln(Y/L)itln(ED/Y)it, ln(ED/Y)it2 or ln(ED/EX)it, ln(ED/EX)it2 (FX NPV)(TDS/EX)itlnYi,t-1, ΔTOT, ΔL, HC, Inv/Y, FB/Y, [EX+IM]/YPooled PAM*FE, SGMMSquared debt termTable 1
significant, concaveinsignificantnegative beyond 30-37% (ED/Y), 115-120% (ED/EX) threshold
Pattillo, Poirson & Ricci (2004)61 LDCs (n≈450)1969-983-yearΔln(Y/L)itln(ED/Y)it, ln(ED/Y)it ×D (threshold dummy)(TDS/EX)itln(Y/L)i,t-1; ln(Inv/Y), ln(HC), ln(ΔL), ΔTOT, [EX+IM]/Y, FB/YPooled PAMOLS, IV (lags), FE, DGMM, SGMMspline regression (dummy), determined by R2Table 2
significant, negative beyond thresholdinsignificantnegative beyond 18% (ED/Y) threshold
Imbs & Ranciere (2005)87 LDCs (n≈600 in 1&2)1969-20023/5-yearΔln(Y/L)itln(ED/Y)it or ln(ED/EX)it (FV, different versions NPV)ln(Y/L)i,t-1; ln(ΔL), ln(Inv/Y), ln(HC), ΔTOT, [EX+IM]/YPooled PAMOLS, FE, SGMM; kernel estimatorskernel estimationTables 1-3, Figures 1-5
mostly insignificanterror bands commonly contain zero
Presbitero (2008)110 LDCs (n≈380)1980-20045-yearΔln(Y/L)itln(ED/Y)it, ln(ED/Y)it2; or ln(ED/EX)it, ln(ED/EX)it2; CPIA interactions (NPV; filtered Y)ln(Y/L)it-1, ln(Inv), ln(HC), ΔL, ΔTOT, ln([EX+IM]/Y), CPIA, σinfPooled PAMSGMMSquared debt term; interactions with ‘good’ CPIA scoreTables 4, 5
negative significant (linear) only in high CPIA; concave relation washed out by CPIA interactioninverted-U relation disappears with CPIA-interaction
Caner, Grennes & Koehler-Geib (2010)26 HICs & 75 LDCs (n = 101)1980-2008LR averageΔln (Y)i (LR average)(TD/Y); (LR average)ln(Y/L)i,1970 (pre-sample levels value), [EX+IM]/Y, infl (both LR averages)MRWOLSendogenous threshold regression (cross-section); restricted LDC sampleTables 1, 3
significant positive (negative) before (beyond) thresholdthreshold at 77% debt/GDP; 64% in LDC sample
Cordelia, Ricci & Ruiz-Arranz (2010)79 LDCs (n ≈ 700)1970-20023-yearΔln(Y/L)itln(ED/Y)it, ln(ED/Y)it2, ln(ED/Y)it ×D, ln(ED/Y)it ×D (NPV)(TDS/EX)it, (TDS/EX)it ×Dln(Y/L)i,t-1; ln(Inv/Y), ln(HC) ln(ΔL), ΔTOT, [EX+IM]/Y, FB/Y, Aid/Y, High-Debt DummyPooled PAMOLS, SGMMSquared debt term; spline regression (dummy), split at medianTable 2
OLS: overhang → irrelevance as ED ↑ (SGMM all insignificant); differs by institutional qualityinsignificant[later endogenous threshold regressions: 18%, 72% (ED/Y)]
Kumar & Woo (2010)38 HICs and EMEs (n ≈ 179)1970-20075-year LD, 4-year lagsln(Y/L)it -ln(Y/L)i,tln(TD/Y)i,t (average) and interactions (< 30%, 30-90%, > 90%)ln(Y/L)it, ΔTOT, (HC)i,t, Dcrisis, ln(l+inf)i,t-τ,(Cgov/Y);i,t-τ, FD, ln([EX+IM]/Y)i,t, ln(LL/Y)i,t-τ; also: ln(Inv)i,t, Li,t-τ, ….Pooled PAMOLS, BE, FE, SGMMInteractions (< 30%, 30 - 90%, > 90% of GDP)Tables 1, 5
negative significant (linear); some evidence of overhang >90%some evidence of overhang >90%
Cecchetti, Mohanty & Zampolli (2011)18 HICs (n ≈ 360)1980-20105-year (OL)Δln(Y/L)it(TD/Y)it, also corporate, hh and total (all three) debt; threshold interactionsln(Y/Li,t-1 (Sav/Y), HC, ΔL, DR, Dcrisis, inf, ([EX+IM]/Y), (LL/Y)Pooled PAMFEthreshold regressionTables 5, 6 (gov debt)
negative linear effect driven by thresholddebt detrimental beyond threshold at 96% debt/GDP
Pattillo, Poirson & Ricci (2011)93 LDCs (n ≈ 630)1969-983-yearΔln(Y/L)itln(ED/Y)it, ln(ED/Y)it2; ln(ED/EX)it, ln(ED/EX)it2 (FV or NPV)(TDS/EX)itln(Y/L)i,t-1; ln(HC), ln(Inv/Y), ln(ΔL), ΔTOT, [EX+IM]/Y, FB/YPooled PAMFE, SGMMSquared debt term (also spline regression)Table 4
significant, concave relationship in most modelsinsignificantgrowth-retarding threshold >35% (ED/Y), >160% (ED/EX)
Calderon & Fuentes (2012)116 countries(n ≈ 740)1970-20105-yearΔln(Y/L)itln(TD/Y)itln(Y/L)i,t-1 ln(HC), ln([Credit/GDP]*100), ln(Inst), ln(inf)it, (FB/Y)*100, ln(100*[EX+IM]/Y), ln([fin/Y]*100); σY, debt-interactionsPooled PAMSGMMSquared debt term; debt-interactionsTables 2, 3
negative significant, mitigated in rich, financially-developed countries with good institutionsinverted-U; debt negative for Y/L>$7k
Checherita-Westphal & Rother (2012)12 EAC (n ≈ 390)1970-2008annual (also: 5-year)Δln(Y)it(TD/Y)it, (TD/Y)it2ln(Y/L)i,t-1; Inv/Y, ΔL, tax, FB/Y, IR, REER, [EX+IM]/Y, ΔTOT, inf, DRPooled PAMFE, (lags); GMM)2SLS (alsoSquared debt termTables 1, 3
significant, concave relationshipinverted-U with 90-100% debt/GDP threshold
Minea & Parent (2012)20 HICs (n ≈ 1,300)1945-2009annualΔln(Y)itln(TD/Y)it-PooledPSTRmultiple smooth thresholdsTable 1 & Figure 2
nonlinear debt-growth relation, +ve above 115%
Presbitero (2012)92 LICs & MICs (n ≈ 320)1990-20073-yearΔln(Y/L)itln(TD/Y)it; ln(TD/Y)it2; also at t—1ln(Y/L)i,t-1; ln(Inv), In (HQ, ΔTOT, ln([EX+IM]/Y), CPIA, σinfPooled PAMSGMMSquared debt term; spline regression (dummy); CPIA interactionsTables 3, 4
debt irrelevance threshold >90%; overhang only in high CPIA economiesinverted-U relation driven by a few observations
Panizza & Presbitero (2012)17 OECD1970-2008annualforward Δln(Y/L)it (t + 1 to t + 6)(TD/Y)it, also corporate, hh and total (all three) debt; threshold interactionsln(Y/L)it, (Sav/Y), HC, ΔL, DR, Dcrisis, inf, ([EX+IM]/Y), (LL/Y), (FCDebt/Debt)i,t-1 REERi,t-1Pooled PAMOLS, IV (valuation effect on debt)threshold regressionTable 3
negative linear effect disappears with instrumentationno significant effect found
Afonso & Jalles (2013)155 countries (n ≈ 1,600)1970-2008annual (also: 5-year)Δln(Y/L)itln(TD/Y)it, ln(TD/Y)it × C/Yit (FV); vast number of proxies and debt interactionsln(Y/L)it-1 Inv/Y, (HC), lnvpub/Y, ln(ΔL), [EX+Im]/YPooled PAMOLS, LAD, FE, DGMM, SGMMSample splitting; threshold dummies at various cut-offs; threshold regressionsTable 1
negative significant; later results interpreted as nonlinear debt-growth relationendog. threshold 59%, +ve <30%, -ve >90%
Baum, Checherita-Westphal & Rother (2013)12 EAC (n ≈ 250)1990-2010annualΔln(Y)it(TD/Y)i,t-1, threshold interactionΔln(Y/L)i,t-1 Inv/Y, [EX+IM]/Y, EMU dummy; add. covariates for robustnessPooled growth AR(1)GMM Δln(Y/L)i,t-1 instrumented with lagsendogenous panel threshold regressionTables 2, 3
static and dynamic models indicate debt threshold(s), leading to inverted-U in most general model+ve (-ve) up to (beyond) 96%; two thresholds: +ve <66%, -ve >96%
Egert (2013)20 HICs (n ≈ 1,000)1960-2009annualΔln(Y)itln (TD/Y)i,t-1 Δln (TD/Y)it-1-endogenous threshold regressionsTable 7
threshold between 50% and 90%
Kourtellos, Stengos & Tan (2014)82 countries (n = 246)1980-2009decadal*Δln(Y/L)itln(TD/Y)itln(Y/L)i,t-1, ln(Inv/Y), ln(HC), ln(ΔL+0.05)MRWSTR-OLS, STR-SGMM, linear versionsendogenous threshold regressionsTable 4
negative in low democracy sample (ln(Y/L)i,t-1 insign.!)endogenous threshold, growth-retarding for low democracy sample
Notes: We use subscript i for countries and t for time periods, also for averaged time periods (see column on averaging; in few cases time-periods overlap (indicated as OL), but typically periods are non-overlapping). LD refers to ‘long differences’ (e.g. t-(t—30)).Sample: LDCs - less developed economics; LICs - low income countries; MICs - middle income countries; HICs - high income countries; EMEs - emerging economies; EAC - Euro Area countries, n refers to the number of observations in the regression (may be time-averaged); n refers to the number of observations; since this at times differs across specifications we provide ballpark figures. Averaging: Time-period over which data is averaged; annual - no averaging. Note that the ‘convergence term’ (ln(Y/L)i,t-1, i.e. lagged level of log per capita GDP) is typically the value for the first year in a three or five-year period rather than an average value (annual data obviously excepted).Dependent Variables: Y - real GDP; Y/L - real GDP per capita; Δln(Y/L) - real GDP per capita growth.Debt Variables: ED/Y - external debt to GDP ratio; ED/EX - external debt to exports ratio; similarly for Total Debt (TD). FV and NPV refer to face- and net present value of debt.Other Variables: Inv/Y - investment to GDP ratio; HC - human capital proxy (typically Barro & Lee, 2013); ΔL - population growth; ΔTOT - changes in terms of trade; [EX+IM]/Y - trade openness; (LL/Y) - liquid liabilities to GDP ratio; fin/Y—financial openness; FB/Y - fiscal balance over GDP; IR - interest rate; REER - real effective exchange rate; inf - (CPI) inflation; DR - dependency ratio; FCDebt/Debt - foreign currency debt to total debt ratio; D - dummy for certain events (crises; EMU membership) amongst others; σinf - inflation volatility; σY - GDP pc growth volatility; Credit/GDP - private credit to GDP ratio; Aid/Y- foreign aid to GDP ratio, inst—institutional quality/good governance. * Values are decadal averages with the exception of the ‘initial income’ variable, which is the value for the first year of each decade; ‘lags’ are 5-year averages for the 5 years immediately preceding a decade.Model: PAM - partial adjustment model (growth regressed on lagged level of dependent variables and contemporaneous covariates); MRW - cross-section convergence regression model following Mankiw, Romer & Weil (1992) without panel aspect. Estimators: OLS - ordinary least squares; FE - one-way fixed effects; BE - between groups estimator (cross-section of averages); DGMM - Arellano & Bond (1991); SGMM - Blundell & Bond (1998);—STR-GMM - structural threshold regression model combined with GMM approach; PTSR - Panel Smooth Transition Regression.

TA-II Sample Makeup

Table TA2:Sample details
AREUnited Arab EmiratesMENAHIC non-OECD311978-2007
ARGArgentinaLACUpper MIC231973-20091980-95
AUSAustraliaEAPHigh income: OECD371973-2008
AUTAustriaEEEHigh income: OECD381973-2009
BELBelgiumEEEHigh income: OECD351973-20091989-92
BFABurkina FasoSSALIC261982-2006
BHRBahrainEAPHIC non-OECD241983-20081990-93
BLZBelizeLACLower MIC271983-2008
BOLBoliviaLACLower MIC381973-2009
BRABrazilLACUpper MIC301981-2009
BRBBarbadosLACHIC non-OECD261973-20021974-79
BWABotswanaSSAUpper MIC381973-2009
CAFCentral African RepublicSSALIC311980-2008
CANCanadaNAHigh income: OECD381973-2009
CHESwitzerlandEEEHigh income: OECD381973-2009
CHLChileLACUpper MIC381973-2009
CHNChinaEAPLower MIC271984-2009
CIVCote d’IvoireSSALower MIC381973-2009
CMRCameroonSSALower MIC311978-2007
COGCongo, Rep.SSALower MIC341977-2008
CPVCape VerdeSSALower MIC221989-2008
CRICosta RicaLACUpper MIC381973-2009
CYPCyprusEEEHIC non-OECD321978-2008
DEUGermanyEEEHigh income: OECD381973-2009
DMADominicaLACUpper MIC311983-2006
DNKDenmarkEEEHigh income: OECD291979-20091997-2000
DOMDominican RepublicLACUpper MIC381973-2009
DZAAlgeriaMENAUpper MIC381973-2009
ECUEcuadorLACLower MIC381973-2009
EGYEgyptMENALower MIC381973-2009
ESPSpainEEEHigh income: OECD381973-2009
FINFinlandEEEHigh income: OECD341973-20091979-83
FJIFijiEAPUpper MIC371973-2008
FRAFranceEEEHigh income: OECD341973-20091978-82
GABGabonSSAUpper MIC381973-2009
GBRUnited KingdomEEEHigh income: OECD381973-2009
GMBThe GambiaSSALIC221984-2004
GRCGreeceEEEHigh income: OECD331973-20091976-81
GRDGrenadaLACUpper MIC311980-2008
GTMGuatemalaLACLower MIC381973-2009
GUYGuyanaLACLower MIC371973-2008
HNDHondurasLACLower MIC381973-2009
HUNHungaryLACLower MIC211984-2008
IDNIndonesiaSALower MIC291982-20091993-96
INDIndiaEAPLower MIC241973-20091974-78, 83-93
IRLIrelandEEEHigh income: OECD381973-2009
IRNIranMENAUpper MIC261983-20071991-95
ISLIcelandEEEHigh income: OECD361975-2009
ISRIsraelMENAHigh income: OECD381973-2009
ITAItalyEEEHigh income: OECD381973-2009
JAMJamaicaLACUpper MIC261972-1997
JORJordanMENALower MIC321979-2009
JPNJapanEAPHigh income: OECD381973-2009
KORKoreaEAPHigh income: OECD381973-2009
LCASt. LuciaLACUpper MIC271984-2009
LKASri LankaSALower MIC381973-2009
LSOLesothoSSALower MIC381973-2009
LUXLuxembourgEEEHigh income: OECD311977-20091990-93
MARMoroccoMENALower MIC381973-2009
MEXMexicoLACUpper MIC381973-2009
MUSMauritiusSSAUpper MIC321983-2009
MYSMalaysiaEAPUpper MIC381973-2009
NICNicaraguaLACLower MIC331973-20091988-93
NLDNetherlandsEEEHigh income: OECD351973-20092000-03
NORNorwayEEEHigh income: OECD341973-20091981-85
NZLNew ZealandEAPHigh income: OECD351973-20091999-02
PAKPakistanSALower MIC331973-20091991-96
PANPanamaLACUpper MIC281983-2009
PERPeruLACUpper MIC381973-2009
PHLPhilippinesEAPLower MIC381973-2009
PNGPapua New GuineaEAPLower MIC351976-2009
PRYParaguayLACLower MIC381992-2009
SENSenegalSSALower MIC381973-2009
SLESierra LeoneSSALIC261983-2007
SLVEl SalvadorLACLower MIC381973-2009
SWESwedenEEEHigh income: OECD381973-2009
SWZSwazilandSSALower MIC381973-2009
SYCSeychellesSAUpper MIC281983-2009
SYRSyrian Arab RepublicMENALower MIC381973-2009
THAThailandEAPLower MIC381973-2009
TUNTunisiaMENALower MIC381973-2009
TURTurkeyEEEUpper MIC331979-2009
URYUruguayLACUpper MIC381972-2009
USAUnited StatesNAHigh income: OECD381973-2009
VENVenezuelaLACUpper MIC341973-20091992-96
VUTVanuatuEAPLower MIC231986-2007
ZAFSouth AfricaSSAUpper MIC381973-2009
Notes: Regional codes are EAP—East Asia & Pacific; EEE—Emerging Economies in Europe; LAC—Latin America & Caribbean; MENA—Middle East & North Africa; SA—South Asia; SSA—Sub-Saharan Africa.Economies are divided among income groups according to 2009 gross national income (GNI) per capita, calculated using the World Bank Atlas method (see LIC—Low-Income Country ($995 or less); Lower MIC—Lower Middle-Income Country ($996-3,945); Upper MIC—Upper Middle-Income Country ($3,946-12,195). ‘Obs’ indicates the time-series observations available.

TA-III Stationarity testing

Table TA3:Panel Stationarity Testing
Maddala and Wu (1999) Fisher test (deterministics: constant)
LagsGDP pc(p)Debt pc(p)Cap pc(p)
Notes: All variables in logarithms. For the Maddala and Wu (1999) test we report the Fisher statistic and associated p-value, for the Pesaran (2007) test the standardised Z-tbar statistic and its p-value. The null hypothesis for both tests is that all series are nonstationary. Lags indicates the lag augmentation in the Dickey Fuller regression employed. Augmentation of the Dickey Fuller regressions with a constant or a constant and trend as indicated. We used the Stata routine multipurt by Markus Eberhardt, which wraps the routines xtfisher and pescadf written by Scott Merryman and Piotr Lewandowski respectively.
Maddala and Wu (1999) Fisher test (deterministics: constant and trend term)
LagsGDP pc(p)Debt pc(p)Cap pc(p)
Notes: All variables in logarithms. For the Maddala and Wu (1999) test we report the Fisher statistic and associated p-value, for the Pesaran (2007) test the standardised Z-tbar statistic and its p-value. The null hypothesis for both tests is that all series are nonstationary. Lags indicates the lag augmentation in the Dickey Fuller regression employed. Augmentation of the Dickey Fuller regressions with a constant or a constant and trend as indicated. We used the Stata routine multipurt by Markus Eberhardt, which wraps the routines xtfisher and pescadf written by Scott Merryman and Piotr Lewandowski respectively.
Pesaran (2007) CIPS test (deterministics: constant)
LagsGDP pc(p)Debt pc(p)Cap pc(p)
Notes: All variables in logarithms. For the Maddala and Wu (1999) test we report the Fisher statistic and associated p-value, for the Pesaran (2007) test the standardised Z-tbar statistic and its p-value. The null hypothesis for both tests is that all series are nonstationary. Lags indicates the lag augmentation in the Dickey Fuller regression employed. Augmentation of the Dickey Fuller regressions with a constant or a constant and trend as indicated. We used the Stata routine multipurt by Markus Eberhardt, which wraps the routines xtfisher and pescadf written by Scott Merryman and Piotr Lewandowski respectively.
Pesaran (2007) CIPS test (deterministics: constant and trend)
LagsGDP pc(p)Debt pc(p)Cap pc(p)
Notes: All variables in logarithms. For the Maddala and Wu (1999) test we report the Fisher statistic and associated p-value, for the Pesaran (2007) test the standardised Z-tbar statistic and its p-value. The null hypothesis for both tests is that all series are nonstationary. Lags indicates the lag augmentation in the Dickey Fuller regression employed. Augmentation of the Dickey Fuller regressions with a constant or a constant and trend as indicated. We used the Stata routine multipurt by Markus Eberhardt, which wraps the routines xtfisher and pescadf written by Scott Merryman and Piotr Lewandowski respectively.

TA-IV Cross-section dependence

Table TA4:Cross-Section Correlation
Panel A: LevelsPanel B: First Differences
avg ρ0.390.360.38avg ρ0.070.070.05
avg |ρ|0.660.520.75avg |ρ|
Panel C: Heterog. AR(2)Panel D: Heterog. AR(2) CCE
avg ρ0.090.130.08avg ρ0.000.000.00
avg |ρ| |ρ|
Notes: We present the average and average absolute correlation coefficients across the N (N−1) sets of correlations. CD reports the Pesaran (2004) cross-section dependence statistic, which is distributed N (0,1) under the null of cross-section independence. Panels A and B test the variable series in levels and first differences respectively. In Panel C each of the three variables in levels is entered into a time-series regression zit=π0,i+π1,izi,t1+π2,izi,t2+π3,it+ɛit, conducted separately for each country i. In Panel D the country-regressions are augmented with cross-section averages of all variables (in the Pesaran (2006) CCE fashion) instead of a linear trend. The correlations and cross-section dependence statistic in Panels C and D are then based on the residuals from these AR(2) regressions. We used the Stata routine xtcd written by Markus Eberhardt. The contrast between results in Panel C and D show the power of the simple cross-section average approach in addressing residual cross-section dependence.

    Afonso, Antonio & JoaoTovar Jalles. 2013. “Growth and productivity: The role of government debt.International Review of Economics & Finance25:384407.

    Andrews, Donald W. K.2005. “Cross-Section Regression with Common Shocks.Econometrica73 (5):15511585.

    Arellano, Manuel & StephenBond. 1991. “Some Tests of Specification for Panel Data: Monte Carlo Evidence and an Application to Employment Equations.Review of Economic Studies58 (2):27797.

    Bai, Jushan. 2009. “Panel Data Models with Interactive Fixed Effects.” Econometrica77 (4):12291279.

    Bailey, Natalia, GeorgeKapetanios & M.Hashem Pesaran. 2012. “Exponent of Cross-sectional Dependence: Estimation and Inference.” Cambridge University, unpublished working paper, January2012.

    Banerjee, Anindya & JosepCarrion-i-Silvestre. 2011. Testing for Panel Cointegration using Common Correlated Effects Estimators. Paper presented at the 14th Applied Economics Meeting, Huelva, June.

    Barro, Robert J. & JongWha Lee. 2013. “A new data set of educational attainment in the world, 1950-2010.” Journal of Development Economics104:184198.

    Baum, Anja, CristinaChecherita-Westphal & PhilippRother. 2013. “Debt and growth: New evidence for the euro area.” Journal of International Money and Finance32:809821.

    Berenguer-Rico, Vanessa & JesusGonzalo. 2013a. “Co-summability: from linear to non-linear co-integration.” Universidad Carlos III de Madrid Working Paper1312.

    Berenguer-Rico, Vanessa & JesusGonzalo. 2013b. “Summability of stochastic processes: a generalization of integration and co-integration valid for non-linear processes.” Journal of Econometrics forthcoming.

    Blundell, Richard & StephenBond. 1998. “Initial conditions and moment restrictions in dynamic panel data models.” Journal of Econometrics87 (1):115143.

    Bond, Stephen R., AsliLeblebicioglu & FabioSchiantarelli. 2010. “Capital Accumulation and Growth: A New Look at the Empirical Evidence.” Journal of Applied Econometrics15 (7):10731099.

    Caner, Mehmet, ThomasGrennes & FritziKoehler-Geib. 2010. Finding the tipping point – when sovereign debt turns bad. Policy Research Working Paper Series 5391 The World Bank.

    Canning, David & PeterPedroni. 2008. “Infrastructure, Long-Run Economic Growth And Causality Tests For Cointegrated Panels.” Manchester School76 (5):504527.

    Cecchetti, Stephen, MadhusudanMohanty & FabrizioZampolli. 2011. Achieving Growth Amid Fiscal Imbalances: The Real Effects of Debt. In Economic Symposium Conference Proceedings.Federal Reserve Bank of Kansas City pp. 145196.

    Chari, Varadarajan V, Patrick JKehoe & Ellen RMcGrattan. 2000. “Sticky price models of the business cycle: can the contract multiplier solve the persistence problem?Econometrica68 (5):11511179.

    Checherita-Westphal, Cristina & PhilippRother. 2012. “The Impact of High Government Debt on Economic Growth and its Channels: An Empirical Investigation for the Euro Area.” European Economic Review56 (7):13921405.

    Chudik, Alexander & M.Hashem Pesaran. 2013. “Common Correlated Effects Estimation of Heterogeneous Dynamic Panel Data Models with Weakly Exogenous Regressors.” Federal Reserve Bank of Dallas Globalization and Monetary Policy Institute Working Paper No. 146.

    Chudik, Alexander, M.Hashem Pesaran & ElisaTosetti. 2011. “Weak and Strong Cross Section Dependence and Estimation of Large Panels.” Econometrics Journal14 (1):C45C90.

    Clemens, Michael A. & SamuelBazzi. 2013. “Blunt Instruments: Avoiding Common Pitfalls in Identifying the Causes of Economic Growth.” American Economic Journal: Macroeconomics5 (2):15286.

    Clements, Benedict J., RinaBhattacharya & ToanQuoc Nguyen. 2003. External Debt, Public Investment, and Growth in Low-Income Countries. IMF Working Papers 03/249 International Monetary Fund.

    Cochrane, John H. 2011. “Understanding policy in the great recession: Some unpleasant fiscal arithmetic.” European Economic Review55 (1):230.

    Cordella, Tito, LucaAntonio Ricci & MartaRuiz-Arranz. 2010. “Debt Overhang or Debt Irrelevance?IMF Staff Papers57 (1):124.

    Durlauf, Steven N., Paul A.Johnson & Jonathan R.W.Temple. 2005. Growth Econometrics. In Handbook of Economic Growth, ed. PhilippeAghion & StevenDurlauf. Vol. 1 of Handbook of Economic GrowthElsevier chapter 8, pp. 555677.

    Eberhardt, Markus. 2013. “Nonlinearities in the Relationship between Debt and Growth: Evidence from Co-Summability Testing.” unpublished working paper, October.

    Eberhardt, Markus & ChristianHelmers. 2010. “Untested Assumptions and Data Slicing: A Critical Review of Firm-Level Production Function Estimators.” Oxford University, Department of Economics Discussion Paper Series #513.

    Eberhardt, Markus, ChristianHelmers & HubertStrauss. 2013. “Do spillovers matter when estimating private returns to R&D?The Review of Economics and Statistics95 (2):436448.

    Eberhardt, Markus & FrancisTeal. 2011. “Econometrics for Grumblers: A New Look at the Literature on Cross-Country Growth Empirics.” Journal of Economic Surveys25 (1):109155.

    Eberhardt, Markus & FrancisTeal. 2013. “Structural Change and Cross-Country Growth Empirics.” World Bank Economic Review27 (2):229271.

    Eberhardt, Markus & FrancisTeal. 2014. “No Mangoes in the Tundra: Spatial Heterogeneity in Agricultural Productivity Analysis.” Oxford Bulletin of Economics and Statistics forthcoming.

    Egert, Balazs. 2013. The 90% Public Debt Threshold: The Rise and Fall of a Stylised Fact. CESifo Working Paper 4242 CESifo.

    Elmendorf, Douglas W. & Gregory N.Mankiw. 1999. Government debt. In Handbook of Macroeconomics,ed. J. B.Taylor & M.Woodford. Vol. 1 of Handbook of MacroeconomicsElsevier chapter 25, pp. 16151669.

    Engle, Robert F. & Clive W. J.Granger. 1987. “Cointegration and Error correction: representations, estimation and testing.” Econometrica55 (2):252276.

    Ghosh, Atish R., Jun I.Kim, Enrique G.Mendoza, Jonathan D.Ostry & Mahvash S.Qureshi. 2013. “Fiscal Fatigue, Fiscal Space and Debt Sustainability in Advanced Economies.” The Economic Journal123 (566):F4F30.

    Granger, Clive W. J. & PaulNewbold. 1974. “Spurious regressions in econometrics.” Journal of Econometrics2 (2):111120.

    Greenlaw, David, JamesHamilton, PeterHooper & Frederic SMishkin. 2013. Crunch Time: Fiscal Crises and the Role of Monetary Policy. NBER Working Papers 19297 National Bureau of Economic Research.

    Greiner, Alfred. 2011. “Economic Growth, Public Debt and Welfare: Comparing Three Budgetary Rules.” German Economic Review12 (2):205222.

    Hamilton, Lawrence C. 1992. “How Robust is Robust Regression?Stata Technical Bulletin1 (2).

    Haque, Nadeem U., M.Hashem Pesaran & SunilSharma. 1999. “Neglected Heterogeneity and Dynamics in Cross-Country Savings Regressions.” unpublished working paper.

    Hendry, David. 1995. Dynamic Econometrics.Advanced Texts in Econometrics Oxford University Press.

    Imbs, Jean & RomainRanciere. 2005. The overhang hangover. Policy Research Working Paper Series 3673 The World Bank.

    Inter-American Development Bank. 2006. Living with Debt: How to Limit the Risks of Sovereign Finance.Cambridge, MA: Harvard University Press.

    International Monetary Fund. 2012. The Good, the Bad, and the Ugly: 100 Years of Dealing with Public Debt Overhangs. In World Ecoomic Outlook - Coping with High Debt and Sluggish Growth.International Monetary Fund.

    Islam, Nazrul. 1995. “Growth Empirics: A Panel Data Approach.” Quarterly Journal of Economics110 (4):112770.

    Kao, Chihwa. 1999. “Spurious regression and residual-based tests for cointegration in panel data.” Journal of Econometrics65 (1):915.

    Kapetanios, George, M.Hashem Pesaran & TakashiYamagata. 2011. “Panels with Nonstationary Multifac-tor Error Structures.” Journal of Econometrics160 (2):326348.

    Kourtellos, Andros, ThanasisStengos & ChihMing Tan. 2014. “The Effect of Public Debt on Growth in Multiple Regimes.” Journal of Macroeconomics forthcoming.

    Kraay, Aart & VikramNehru. 2006. “When Is External Debt Sustainable?World Bank Economic Review20 (3):341365.

    Krugman, Paul. 1988. “Financing vs. forgiving a debt overhang.” Journal of Development Economics29 (3):253268.

    Kumar, Manmohan S. & JaejoonWoo. 2010. Public Debt and Growth. IMF Working Papers 10/174 International Monetary Fund.

    Lee, Kevin, M.Hashem Pesaran & Ron P.Smith. 1997. “Growth and Convergence in a Multi-country Empirical Stochastic Solow Model.” Journal of Applied Econometrics12 (4):35792.

    Manasse, Paolo & NourielRoubini. 2009. “Rules of thumb for sovereign debt crises.” Journal of International Economics78 (2):192205.

    Mankiw, N Gregory, DavidRomer & David NWeil. 1992. “A Contribution to the Empirics of Economic Growth.” The Quarterly Journal of Economics107 (2):40737.

    Minea, Alexandru & AntoineParent. 2012. Is high public debt always harmful to economic growth? Rein-hart and Rogoff and some complex nonlinearities. Working Papers 8 Association Francaise de Cliometrie.

    Mundlak, Yair, RitaButzer & Donald F.Larson. 2012. “Heterogeneous technology and panel data: The case of the agricultural production function.” Journal of Development Economics99 (1):139149.

    Panizza, Ugo. 2008. Domestic And External Public Debt In Developing Countries. UNCTAD Discussion Papers 188 United Nations Conference on Trade and Development.

    Panizza, Ugo & AndreaFilippo Presbitero. 2012. Public Debt and Economic Growth: Is There a Causal Effect? Mo. Fi.R. Working Papers 65.

    Panizza, Ugo & AndreaFilippo Presbitero. 2013. “Public Debt and Economic Growth in Advanced Economies: A Survey.” Swiss Journal of Economics and Statistics149 (2):175204.

    Pattillo, Catherine A., HelenePoirson & LucaAntonio Ricci. 2004. What Are the Channels Through Which External Debt Affects Growth? IMF Working Papers 04/15 International Monetary Fund.

    Pattillo, Catherine A., HelenePoirson & LucaAntonio Ricci. 2011. “External Debt and Growth.” Review of Economics and Institutions2 (3). Retrieved from

    Pedroni, Peter. 2007. “Social capital, barriers to production and capital shares: implications for the importance of parameter heterogeneity from a nonstationary panel approach.” Journal of Applied Econometrics22 (2):429451.

    Pesaran, M. Hashem. 2004. “General Diagnostic Tests for Cross Section Dependence in Panels.” unpublished working paper.

    Pesaran, M. Hashem. 2006. “Estimation and inference in large heterogeneous panels with a multifactor error structure.” Econometrica74 (4):9671012.

    Pesaran, M. Hashem. 2007. “A simple panel unit root test in the presence of cross-section dependence.” Journal of Applied Econometrics22 (2):265312.

    Pesaran, M. Hashem. 2013. “Testing Weak Cross-Sectional Dependence in Large Panels.” Econometrics Reviews (forthcoming).

    Pesaran, M. Hashem & ElisaTosetti. 2011. “Large Panels with Common Factors and Spatial Correlations.” Journal of Econometrics161 (2):182202.

    Pesaran, M. Hashem & Ron P.Smith. 1995. “Estimating long-run relationships from dynamic heterogeneous panels.” Journal of Econometrics68 (1):79113.

    Pesaran, M. Hashem, VanessaSmith & TakashiYamagata. 2013. “Panel Unit Root Tests in the Presence of a Multifactor Error Structure.” Journal of Econometrics175 (2):94115.

    Phillips, Peter C. B. & DonggyuSul. 2003. “Dynamic panel estimation and homogeneity testing under cross section dependence.” Econometrics Journal6 (1):217259.

    Presbitero, Andrea F. 2008. “The Debt-Growth Nexus in Poor Countries: A Reassessment.” Economics: The Open-Access, Open-Assessment E-Journal2 (30).

    Presbitero, Andrea F.2012. “Total Public Debt and Growth in Developing Countries.” European Journal of Development Research24 (4):606626.

    Reinhart, Carmen M. & Kenneth S.Rogoff. 2009. This Time is Different - Eight Centuries of Financial Folly.Princeton, New Jersey: Princeton University Press.

    Reinhart, Carmen M. & Kenneth S.Rogoff. 2010a. Debt and Growth Revisited. MPRA Paper 24376University Library of Munich, Germany.

    Reinhart, Carmen M. & Kenneth S.Rogoff. 2010b. “Growth in a Time of Debt.American Economic Review Papers and Proceedings100 (2):57378.

    Reinhart, Carmen M. & Kenneth S.Rogoff. 2010c. Growth in a Time of Debt. Working Paper 15639 NBER.

    Reinhart, Carmen M. & Kenneth S.Rogoff. 2011. A Decade of Debt. NBER Working Papers 16827National Bureau of Economic Research, Inc.

    Reinhart, Carmen M., Kenneth S.Rogoff & Miguel A.Savastano. 2003. “Debt Intolerance.” Brookings Papers on Economic Activity34 (1):174.

    Reinhart, Carmen M., Vincent R.Reinhart & Kenneth S.Rogoff. 2012. “Public Debt Overhangs: Advanced-Economy Episodes Since 1800.” Journal of Economic Perspectives26 (3):6986.

    Sachs, Jeffrey D.1989. The Debt Overhang of Developing Countries. In Debt, Stabilization and Development, ed. GuillermoA. Calvo, RonaldFindlay, PenttiKouri & JorgeBraga de Macedo. Oxford: Blackwell.

    Shin, Yongcheol, ByungchulYu & MatthewGreenwood-Nimmo. 2013. Modelling Asymmetric Cointegration and Dynamic Multipliers in a Nonlinear ARDL Framework. In Festschrift in Honor of Peter Schmidt, ed. WilliamC. Horrace & Robin C.Sickles. New York (NY): Springer.

    Smith, Ron P.2001. “Estimation and Inference with Non-Stationary Panel Time-Series Data.” Paper prepared for the RC33 Conference.

    Stock, James H. & Mark WWatson. 2002. “Forecasting using principal components from a large number of predictors.” Journal of the American Statistical Association97 (460):11671179.

    Temple, Jonathan. 1999. “The New Growth Evidence.” Journal of Economic Literature37 (1):112156.

Affiliations: IMF (Presbitero); University of Nottingham (Eberhardt). We are grateful to Thorsten Beck, Vanessa Berenguer-Rico, Andrew Berg, Gianluca Cafiso, Tito Cordella, Panicos Demetriades, Jerry Hausman, George Kapetanios, M. Hashem Pesaran, Kenneth Rogoff, Nicolas van de Sijpe, Francesco Venturini, and participants to several seminars and conferences for helpful comments and suggestions on earlier drafts of the paper. All remaining errors are our own. This working paper is part of a research project on macroeconomic policy in low-income countries supported by the U.K.’s Department for International Development.

Note that while recently some details of their research and therefore the conclusions drawn have been seriously questioned (for a recent review see Panizza & Presbitero, 2013), the message of a common 90% debt threshold is still widely perceived in policy circles, among academics and in the media. This is the case notwithstanding that Reinhart & Rogoff (2010a) explicitly stated that they “do not pretend to argue that growth will be normal at 89% and subpar (about 1% lower) at 91% debt/GDP any more than a car crash is unlikely at 54mph and near certain at 56mph” and that causality is difficult to assess, as “for low-to-moderate levels of debt there may or may not be one; the issue is an empirical one, which merits study. For high levels of debt the evidence points to bi-directional causality.”

Our analysis, like that in most of this literature, emphasises the long-run; we therefore employ a levels specification explaining income (per capita GDP), rather than the growth rate of income. The popularity of the ‘growth’ specification in the cross-country empirical literature is justified by the presence of a lagged level of per capita GDP as additional regressor. This restricted error correction (partial adjustment) specification, which provides estimates for a long-run levels relationship, is however frequently misinterpreted as a growth equation (see Eberhardt & Teal, 2011, for a discussion). For consistency with the literature we use ‘growth’ instead of ‘income’ or ‘development’ in the following.

Examples for the former include the oil crises in the 1970s or the recent financial crisis, while knowledge spillovers from R&D investments in developed economies (Eberhardt, Helmers & Strauss, 2013) can be seen as illustrations for the latter.

Panizza & Presbitero (2013) also provide a detailed discussion of the recent empirical literature on the relationship between debt and growth in advanced economies, focusing on causality and non-linearities.

For peaks at the start (end) of our sample we limit these averages to the peak year and the two years after (before).

Interestingly the green diamonds indicating 2009 show that all countries in which debt peaked in that year (all High-Income countries bar GRD and LCA) had worse growth performance in 2007-2009 than in all other years (average growth rates, respectively).

Of course the above setup as described thus far is nothing new to any applied econometrician, in that it resembles the standard identification problem encountered, for instance, in productivity analysis at the firm-level. Here this phenomenon is referred to as ‘transmission bias’, which arises from firms’ reaction to (for the econometrician) unobservable productivity realisations when making input choices. Solutions to this problem are typically sought via instrumentation of one form or another (for a recent survey of the micro literature see Eberhardt & Helmers, 2010).

In this vein, some recent work looks at single episodes of debt overhangs (Reinhart, Reinhart & Rogoff, 2012; International Monetary Fund, 2012).

This discussion refers to the static case. In the dynamic setup they do not explicitly discuss the nonlinearity appearance, although there is no reason to assume this is limited to the static case.

The same pattern emerges when we use untransformed per capita GDP. In order to aid presentation in Figure 4 we exclude ‘extreme’ values (10% of observations) from this descriptive graph: the ‘full’ sample fractional polynomial regressions exclude all observations for debt-to-GDP ratio below 2 log points (<7.3% debt-to-GDP ratio), which amounts to 85 observations (primarily ARE, CHN and LUX). The scatter plots and country-specific fractional polynomial regressions further exclude all observations for which the within-transformed income change year-on-year exceeds 40%, which amounts to 281 observations (primarily BWA, IRL, KOR, THA and other fast-growing Middle-Income Countries).

If these two variables are included in the empirical analysis the resulting estimates are for the data ending in 2008.

We assume these parameter coefficients are fixed (Pesaran & Smith, 1995). Their moments and conditional distribution present a central interest in this study.

A formal motivation for this setup from economic theory can be found in Mundlak, Butzer & Larson (2012) and Eberhardt & Teal (2013). Note that covariates are not assumed to be only driven by common factors also contained in the estimation equation (ft) but can have additional factors gt exclusive to their evolution.

It is somewhat of a tradition in the cross-country growth literature to follow Mankiw, Romer & Weil (1992) in the specification of ‘convergence’ regression models in the panel, regressing output growth Δy at time t (or averaged over a certain time horizon) on a lagged (or initial) level of output at time t − 1 as well inputs at time t (or averaged over a certain time horizon). This is despite the clear link Islam (1995) made between a cross-country growth model and a dynamic panel data model, which represents an Autoregressive Distributed Lag (ARDL) model of order (1,1). See Eberhardt & Teal (2011) for a survey of the cross-country growth empirics literature. Hendry (1995, 212) provides an insightful discussion on the encompassing nature of the ARDL model, for which “virtually every type of single equation model in empirical time-series econometrics is a special case”.

The discussion in this section is based on Eberhardt & Teal (2014), which follows the methodology of Canning & Pedroni (2008).

We implicitly assume that debt accumulation is orthogonal to any expectations of future growth accelerations or slowdowns. It however bears reminding that our common factor framework allows us to account for global and local economic circumstances and business cycles. These global and local economic conditions and trends arguably play a dominant role in forming expectations about an individual country’s future growth trajectory and the orthogonality condition thus applies much more narrowly to any expectations formed on the basis of exogenous, country-specific shocks.

In principle the implementation of the asymmetric dynamic model by Shin, Yu & Greenwood-Nimmo (2013) is subject to the same criticism we level against the polynomial models in the following sub-section. However, given the limited number of countries which have substantial observations for debt-to-GDP in both regimes (above, below threshold), it is infeasible to adopt the subsampling strategy (working with random sets of countries) that we propose below to obtain inferential statistics.

Our discussion in this section as well as the implementation follow Berenguer-Rico & Gonzalo (2013a) and Berenguer-Rico & Gonzalo (2013b).

Given δ^y*=(β^y*1)/2andδ^z*=(β^z*1)/2 it is easy to see that δ^y*δ^z*=(β^y*β^z*)/2.

Again the parallels with the theory of integration and cointegration may help to illustrate this point: the seminal Granger & Newbold (1974) paper investigated spurious regression by regressing two independent random walks, Yt and Xt. Since both processes are I(1) the regression equation Yt = β0 + β1Xt is balanced. Since they constitute independent processes, later work by Engle & Granger (1987) would suggest that the residual series from this regression are not I(0), so that Yt and Xt are not cointegrated. Similarly, balancedness of yt = f (xt, θ) is a necessary prerequisite for co-summability between yt and f (xt, θ).

We do not pursue the analysis of a quadratic trend here due to the limited time-series dimension of our panel.

The country-series for ‘open’ and ‘findev are not additional covariates in our regression model.

For more details on these concepts see Smith (2001).

The half-life indicates “the length of time after a shock before the deviation in output shrinks to half of its impact value” (Chari, Kehoe & McGrattan, 2000, 1161).

For models [8] and [9] we also investigated the use of ‘findev’ instead of’open’ and found qualitatively identical results. This includes the cross-country plots in Panels (e) and (f) of Figure 6.

This simple count does not take statistical significance into account.

Eberhardt (2013) applies these methods to the long time series data provided by Reinhart & Rogoff (2009) and finds no evidence for a non-linear long-run relationship between debt and growth in the data for the US, Great Britain, Sweden and Japan as well as 23 other (primarily High-Income) countries.

We only focus on static models given the serious difficulties arising in reconciling nonlinearities with cross-section dependence, parameter heterogeneity and a dynamic specification within a panel of moderate time series dimension.

The affected countries are BEL, BEN, BHR, DNK, FIN, FRA, GHA, HUN, IND, KEN, LUX, MDG, MWI, NLD, NOR, NZL, TON, TTO, VEN – see Technical Appendix for country iso-codes. Note that of these Bosnia and Herzegovina (BHR), Tonga (TON) and Trinidad & Tobago (TTO) were later dropped due to insufficiently long time series.

The following 67 countries for which some debt-to-GDP information was available in the Panizza data (covering 172 countries) were thus dropped from the analysis, either due to lack of observations for all variable series or due to the time series restriction: Albania, Angola, Armenia, Azerbaijan, Bahamas, Belarus, Bhutan, Bosnia and Herzegovina, Brunei Darussalam, Bulgaria, Cambodia, Colombia, Croatia, Czech Republic, Djibouti, Equatorial Guinea, Eritrea, Estonia, Georgia, Guinea, Haiti, Hong Kong SAR, Iraq, Kazakhstan, Kuwait, Kyrgyz Republic, Lao PDR, Latvia, Lebanon, Liberia, Lithuania, FYR Macedonia, Maldives, Mauritania, Moldova, Mongolia, Myanmar, Namibia, Nigeria, Oman, Poland, Portugal, Qatar, Romania, Russian Federation, Samoa, San Marino, Sao Tome and Principe, Saudi Arabia, Slovak Republic, Slovenia, Solomon Islands, St. Kitts and Nevis, St. Vincent and the Grenadines, Sudan, Taiwan, China, Tajikistan, Tanzania, Tonga, Trinidad and Tobago, Turkmenistan, Uganda, Ukraine, Uzbekistan, Vietnam, Yemen (Rep).

Other Resources Citing This Publication