The Impact of Gray-Listing on Capital Flows: An Analysis Using Machine Learning
Author:
Mizuho Kida
Search for other papers by Mizuho Kida in
Current site
Google Scholar
Close
and
Simon Paetzold https://isni.org/isni/0000000404811396 International Monetary Fund

Search for other papers by Simon Paetzold in
Current site
Google Scholar
Close

The Financial Action Task Force’s gray list publicly identifies countries with strategic deficiencies in their AML/CFT regimes (i.e., in their policies to prevent money laundering and the financing of terrorism). How much gray-listing affects a country’s capital flows is of interest to policy makers, investors, and the Fund. This paper estimates the magnitude of the effect using an inferential machine learning technique. It finds that gray-listing results in a large and statistically significant reduction in capital inflows.

Abstract

The Financial Action Task Force’s gray list publicly identifies countries with strategic deficiencies in their AML/CFT regimes (i.e., in their policies to prevent money laundering and the financing of terrorism). How much gray-listing affects a country’s capital flows is of interest to policy makers, investors, and the Fund. This paper estimates the magnitude of the effect using an inferential machine learning technique. It finds that gray-listing results in a large and statistically significant reduction in capital inflows.

1 Introduction

This paper examines what happens to a country’s capital flows after it is identified in the Financial Action Task Force (FATF)’s list of countries or jurisdictions with strategic deficiencies in their framework for Anti-Money Laundering/Combating the Financing of Terrorism (AML/CFT). The list of countries with such deficiencies is known as the gray list. How much gray-listing affects a country’s capital flows is of interest to policy makers and investors. It is also of interest to the Fund in relation to country surveillance, prioritizing capacity development effort, and in relation to forecasts of demand for Fund-supported programs and estimates of capacity to repay.

The FATF maintains two public lists of countries with weak AML/CFT regimes: “jurisdictions under increased monitoring” that “are actively working with the FATF to address strategic deficiencies in their regimes”1 and “high-risk jurisdictions subject to a call for action” that are not actively engaging with FATF to address these deficiencies.2 The former is known as the gray list and the latter as the blacklist.

As of now there are only two countries on the blacklist—Iran and North Korea—whereas there are nineteen on the gray list—Albania, Barbados, Botswana, Burkina Faso, Cambodia, Cayman Islands, Ghana, Jamaica, Mauritius, Morocco, Myanmar, Nicaragua, Pakistan, Panama, Senegal, Syria, Uganda, Yemen, and Zimbabwe.3 Countries that are not currently on the gray list could be gray-listed in the future as the process is ongoing and the FATF standards are periodically updated. Appendix A describes the gray-listing process.

For a country on the blacklist, FATF calls on other countries to apply enhanced due diligence and countermeasures, increasing the cost of doing business with the country and in some cases severing business relations altogether. Even countries placed on the gray list could experience a disruption in capital flows. One possible mechanism is de-risking, whereby banks exit relationships with customers that are based in high-risk countries to reduce compliance costs (Alleyne et al., 2017; Collin, Cook, and Soramaki, 2016). Another is market enforcement, whereby investors use gray-listing as a heuristic for evaluating the risk of doing business with a country, and therefore reallocate resources to reduce their exposure to the country (Morse, 2019).

Nevertheless, the empirical evidence for these effects is weak. There have only been a few studies, some of which found little or no effect.

Kudrle (2009) searches for the impact of gray-listing on cross-border bank flows in 38 tax havens over the period 2000 to 2007. Using the jurisdiction’s share in the total value of the corresponding Bank for International Settlements (BIS) category for Offshore Centers as a dependent variable, he finds little systematic effect of gray-listing.

Farias and de Almeida (2014) examine the impact of gray-listing on the ability to attract and retain FDI in a sample of 36 Latin America and Caribbean countries (of which 7 were gray-listed) over the period 2000 to 2006. They find a significant but small reduction in the ratio of FDI to GDP (on average, 0.3–0.4 percentage points).

Balakina, D’Andrea, and Masciandaro (2016) examine how cross-border bank flows respond to gray-listing in a sample of 126 countries (of which 29 have been gray-listed) over the period 2000 to 2015. Using annual growth of total foreign claims (i.e., net bank flows) from BIS data, they find no consistent effect of gray-listing. However, when total foreign claims are split between bank outflows (growth in total foreign assets) and inflows (growth in total foreign liabilities), they find a small but statistically significant increase in outflows (about 23 percent a year on average while on the gray list) but not in inflows.

Collin, Cook, and Soramaki (2016) examine the effect of gray-listing on cross-border payments to and from countries. Using SWIFT monthly numbers of cross-border payments between customers in every country connected to the SWIFT network between 2004 and 2014 and linking these numbers with data on the timing of gray-listing, they find that being added to the gray list results in a 7–10 percent reduction in the number of payments sent to an affected country by the rest of the world. But they find that it has no consistent effect on the number of payments sent from an affected country. They also find no evidence of reduction in bilateral payments between gray-listed countries.

Finally, Morse (2019) analyzes the effect of gray-listing on bank inflows (growth in cross-border liabilities) for 39 countries (10 of which were gray listed) in 2010–2015 and finds a large significant impact on the bank inflows of around 15–16 percent.

This paper contributes to the literature on the effect on gray-listing on capital flows in three ways. First, to the best of our knowledge, this is the first study that looks at the impact of gray-listing on all components of capital flows: FDI, portfolio flows, and banking and other flows. Second, it uses more recent data on gray-listing and includes a larger sample of gray-listed countries. Third, to the best of our knowledge, this is also the first study that studies the problem using machine learning.

More specifically, the paper analyzes the impact of gray-listing on capital flows in a sample of 89 emerging and developing countries in 2000–2017. We use quarterly data on capital flows from the IMF Financial Flows Analytics (FFA) database. The data on gray-listing come from FATF public statements, which are issued three times a year (February, June, and October) and FATF annual reports. During this period, 78 countries have been gray-listed at least once (not all of them remain in the estimation sample due to missing values). For empirical execution, we use a machine learning technique known as lasso.

The paper finds a large, significant negative effect of gray-listing on capital inflows. The empirical results suggest that capital inflows decline on average by 7.6 percent of GDP when the country is gray-listed. The results also suggest that FDI inflows decline on average by -3.0 percent of GDP, portfolio inflows decline on average by -2.9 percent of GDP, and other investment inflows decline on average by -3.6 percent of GDP. The estimated impacts are all statistically significant.

The remainder of the paper is organized as follows. Section 2 presents

stylized facts, using an event study approach to describe how capital flows evolved around the time of gray-listing. Section 3 discusses the intuition behind the lasso technique and applies it to the estimation of the impact of gray-listing on capital flows. Section 4 presents robustness checks. Section 5 concludes.

2 Event Analysis

We use an event history analysis (e.g., see Gourinchas and Obstfeld, 2012) to explore the evolution of capital flows around the time of gray-listing in affected countries. Formally, we estimate the following fixed-effect panel regression:

yi,t=αi+γλt+Σsβsδs,t+ϵi,t(1)

where yi,t is capital flows for country i in time t; αi is a country fixed effect, At is a dummy variable that equals 1 in each quarter during the global financial crisis (GFC) and 0 otherwise, and δs,t is a dummy variable that equals 1 when a country i is s periods away from gray-listing in time t. We set s to nine quarters (four quarters before, four quarters after) to see how capital flows evolve before and after gray-listing. The country fixed effect (a.i) measures the average capital flow for a country i in periods outside the GFC and the gray-listing event (“tranquil times”). The coefficient 7 measures the average deviation of capital flows in country i in each quarter during the GFC. The coefficient βs captures how capital flows evolved within an s-quarter window surrounding gray-listing. The error εi,t captures all the remaining variation in the capital flows.

The data on capital flows come from the IMF’s FFA database, which provide quarterly data on FDI, portfolio flows (debt and equity), and other investments (private, official, bank, and non-bank) on a gross basis (inflows and outflows) and a net basis. The data are available in both nominal terms (i.e., in millions of USD) and as a share of (quarterly) GDP. We use the latter for this analysis. Our focus is on the sample of 89 emerging and developing countries (EMDCs) for which the data are available for the period between 2000q1 and 2017q4.4 See appendix B for the list of countries. The list of EMDCs is taken from the World Economic Outlook (WEO) in 2000, to which we add nine EMDCs not included at the time (Afghanistan, Brunei Darussalam, Iraq, Korea, Kosovo, Liberia, Montenegro, Serbia, and Timor-Leste) and six advanced economies (Greece, Hong Kong, Iceland, Ireland, Israel, and Singapore) mainly for the purpose of maintaining a reasonable sample size.5 We exclude China from the analysis because of its unique characteristics including its size. Extreme values in capital flows are also excluded from the data before running the model in equation (1).6

The dates of gray-listing come from FATF public statements, issued three times a year (currently in February, June, and October) and FATF annual reports, and are provided in Appendix A. As explained in the appendix, the timing of the news does not necessarily follow an automatic calendar because gray-listing is an outcome of a case-by-case evaluation process that is subject to debate and repeated extensions. Moreover, some types of capital flows may respond to the announcements more quickly than others (Becker and Noone, 2008).

To capture responses of capital flows to an event, the exact timing of which is uncertain, we follow the approach in the literature (e.g., Bussiere and Fratzscher, 2006, Gourinchas and Obstfeld, 2012, and more recently, Choi and Hashimoto, 2017) and define the event as a window [to — k,to + k], where to is a date of the FATF announcement and k is allowed to vary between 1 and 3 quarters.7 Figures 1 and 2 plot the estimates of βs in equation (1) for each type of capital flow.

Figure 1:
Figure 1:

Evolution of Capital Flows Around Gray-Listing (Percent of GDP)

Citation: IMF Working Papers 2021, 153; 10.5089/9781513582436.001.A001

The figure presents results from estimation of the fixed-effect panel regression in equation (1) on quarterly data between 2000q1 and 2017q4. Each heading corresponds to the dependent variable in the model. The estimates of conditional means of each variable are reported on the vertical axis. The horizontal axis represents the number of quarters before (negative sign) and after a shock. Excludes extreme values. See text for detail.
Figure 2:
Figure 2:

Evolution of Components of Other Flows Around Gray-Listing (Percent of GDP)

Citation: IMF Working Papers 2021, 153; 10.5089/9781513582436.001.A001

The figure presents results from estimation of the fixed-effect panel regression in equation (1) on quarterly data between 2000q1 and 2017q4. Each heading corresponds to the dependent variable in the model. The estimates of conditional means of each variable are reported on the vertical axis. The horizontal axis represents the number of quarters before (negative sign) and after a shock. Excludes extreme values. See text for detail.

Capital inflow declines sharply at time of gray-listing (Figure 1). Gross inflow is on average over 6 percentage points of GDP lower compared to the historical norms. All components of capital inflow experience a decline at the time of gray-listing. FDI inflow, on average, declines by 3.2 percent of GDP, portfolio inflow by 3.3 percent of GDP, and other inflow by 3.1 percent of GDP. Although the magnitude of the decline is similar for both FDI and portfolio inflows, dynamics around the time of gray-listing show that portfolio flows tend to be more sensitive and prone to reversal after a shock, which is in line with the conventional wisdom (Becker and Noone, 2008).

Looking at the behavior of capital outflows, gross outflow declines, on average by around 3 percentage points at the time of gray-listing, which partially offsets the drop in gross infows.8 However, there is a surge in capital outflow a few quarters ahead of the announcement. Such behavior is well-documented in the literature and consistent with information asymmetry, where domestic investors know more about the country than foreign investors.9

Figure 2 shows components of “other” investment flows, which consist of interbank loans, suppliers’ credits, trade credits, and other more difficult to classify items. On average, their inflows decline by 2 to 3 percent of GDP (Figure 2). Inflows in the private sector are more severely affected (-3.3 percent of GDP) than those in the official sector (-0.5 percent); bank flows are somewhat less affected (-2.1 percent) than nonbank flows (-2.4 percent). We, again, observe the anticipatory behavior in the outflows, which seem to increase few periods ahead of the announcement of gray-listing, especially among private sector flows (both bank and non-bank).

What explains the V-shaped recovery after the initial fall in capital inflow? A possible explanation is that the initial decline in capital flow represents an “overshooting” and is followed by a period of large volatility because of incomplete information and learning by foreign investors (Bacchetta and van Wincoop, 2000). Gray-listing creates a large degree of uncertainty, especially in the early period, when foreign investors are unsure about how domestic firms will cope and how other investors will respond in the new environment. Using a theoretical model and simulations, Bacchetta and van Wincoop (2000) show that in such an environment capital flow could overshoot in response to news and then fluctuate widely in early periods as investors update their perceptions.

Another question is whether the decline in capital flow is genuine or if it merely reflects migration of capital to informal channels? To see this we re-run the equation (1) using net errors and omissions as the dependent variable. The result is reported in Appendix C. Net errors and omissions show a slight increase at time of gray-listing, but not as much to fully explain the extent of declines in capital inflow, suggesting that while some of the declines are explained by diversion of the inflow to shadow (informal) flows, much of the declines in capital inflows are a genuine decline.

3 Econometric Analysis

The event history analysis in the previous section shows that gray-listing is associated with a sharp decline in capital inflow. While suggestive, the analysis does not allow us to attribute all or even most of the decline to gray-listing, because other factors may also be driving the results. That is, there are many possible confounding factors. In order to isolate and estimate the treatment effect of gray-listing, we need to use a regression analysis to control for the confounding factors.

3.1 Methodology

We estimate a simple model of capital flow:

yi,t=αdi,t+xi,tβ+μi+ϵi,t,i=1,....n;t=1,...,T(2)

where di,t is the causal variable of interest or treatment effect, xi,t is the set of controls or confounding factors, µi is a country fixed effect, and εi,t is the error.

There is potentially a large number of confounding factors. According to the literature on the determinants of capital flows (including sudden stops or capital reversals), there are two broad categories of factors that need to

be accounted for—global and domestic. The global factors are included to account for important co-movements in capital flows across emerging and developing economies. Commonly used variables for this purpose include VIX (as a proxy for global risk aversion), changes in money supply of major advanced economies (as a proxy for global liquidity), growth rates of major advanced economies (as a proxy for the strength of the global economy and perhaps investors’ risk appetite), and the Federal Funds Rate (to account for the special role of the U.S. dollar as a source of liquidity to the global financial system). The domestic factors are country-specific factors that may explain differences across countries in capital inflows. Commonly used variables for this purpose include the level of per capita income, real GDP growth, trade openness, financial openness, the exchange rate regime, exchange rates, foreign reserves, current account balances, bank credit, and sovereign credit ratings.

The large number of potential confounding variables poses a challenge to model selection and estimation. First, where there are many potential confounding variables, it is likely that only some of them are relevant, but empirical studies often do not agree because of their differences in data, models, methodology, as well as researchers’ preoccupations. Second, there is a problem of estimation. Including too many variables in the regression leads to inefficiency; because variables tend to be highly correlated with each other, they need to be included parsimoniously (Eichengreen and Gupta, 2016). Yet failing to control for key confounders leads to omitted variable bias. Third and finally, there is a problem of saturation (or “high dimensionality”). It is possible that the influence of these variables is nonlinear and that there are important interactions effects. But using several permutations of each variable—such as levels, lags, differences at various horizons, and polynomials—or including all pairwise interactions and temporal and spatial effects would be infeasible as it would quickly run into a problem of degrees of freedom.

Faced with these dilemmas, economists are increasingly using machine learning for model selection and estimation. The proposed estimation strategy in this paper is lasso.10 Lasso is probably the most familiar machine learning technique among economists (Mullainathan and Spiess, 2017). As such, it has a relatively well-developed literature on the statistical properties of its estimates. In the current context, it has the following attractive properties.

First, it is a straightforward extension of linear regression but is good at dealing with the dimensionality problem. Just like ordinary least squares (OLS), lasso minimizes the sum of squared deviations between observed and model predicted values. Unlike OLS, to achieve a dimension reduction, lasso imposes a penalty on model complexity by limiting the sum of the absolute values of the coefficients and forces all but the most important ones to zero. Simply put, lasso does the model selection for us by systematically selecting the most important features and throwing out others that contribute little to the model’s fit (Ahrens, Hansen, and Schaffer, 2019).

Second, it allows causal interpretation of the estimated coefficients for a subset of variables. Since the primary purpose of most supervised machine learning techniques is prediction, the techniques usually do not produce estimates that can be interpreted as causal (Tiffin, 2019). Similarly, lasso, in general, does not produce unbiased estimates of coefficients and standard errors required to perform valid inference.11 However, an extension of lasso, known as double-selection lasso (Belloni, Chernozhukov, and Hansen, 2014a; Urminsky, Hansen, and Chernozhukov, 2016), addresses the problem by adopting a modified estimation procedure and generates unbiased coefficients and standard errors for a subset of variables that are suitable for making causal inferences.

Finally, a recent paper by Belloni, Chernozhukov, and Hansen (2014a) provides a panel data extension of double-selection lasso, accommodating common forms of within-group correlation and heteroskedasticity in errors expected in a panel data context.

3.2 Model

Formally, for the general model

yi=xiβ+ϵi,i=1,...,n(3)

lasso finds a vector of coefficients, β, such that

1nΣi=1n(yixiβ)2+λΣj=1p|βj|(4)

is minimized for a given value of λ. The last term distinguishes lasso from OLS: there is a cost to including many regressors and we can reduce the objective function by setting βj = 0 for variables that contribute little to the fit. Thus, lasso does the model selection for us.12 The parameter λ is called a tuning parameter, which controls the overall penalty level. The larger is λ, the more coefficients are shrunk to zero. If λ = 0 we have an OLS regression (Ahrens, Hansen, and Schaffer, 2019 and Fonti and Belitser, 2017).

Lasso estimator can also be written as

β^lasso(λ)=argβmin1nΣi=1n(yixiβ)2+λnλΣj=1pψj|βj|(5)

where λ is the tuning parameter that controls the overall penalty level as before and ψj are predictor-specific penalty loadings. The latter are predictor-specific to account for the fact that the predictors xj are not of equal variance. Under a homoskedasticity assumption, lasso sets the penalty loading to

ψj=1nΣi=1nxi,j2(6)

In order to allow for valid inference on the coefficient of a variable of interest (in our case, the dummy variable for gray-listing), we follow the double-selection approach developed by Belloni, Chernozhukov, and Hansen (2014a). Rewriting equation (3), we split the vector xi into a variable of interest, di, and other controls:

yi=β0di+β1xi,1+...+βpxi,p+ϵi(7)

We want to be able make a valid inference on the parameter β0. The double-selection approach estimates the equation (7) in three steps:

1. Use lasso to estimate yi on x’s without di. Denote the set of lasso-selected controls by A.

2. Use lasso to estimate di on x’s. Denote the set of lasso-selected controls by B.

3. Estimate the following equation using OLS:

yi=β0di+wiβ+ϵi(8)

where wi = A∪B, i.e., the union of the selected controls from Steps 1 and 2 (Ahrens, Hansen, and Schaffer, 2018). Thus, the double-selection approach first uses lasso to select covariates correlated with the outcome (yi), and second, uses lasso to select covariates correlated with the treatment (di). By taking the union of both sets of covariates and including them as controls in a standard OLS regression (in the third stage), it takes account of the joint association that is the source of bias for the treatment effect (Belloni, Chernozhukov, and Hansen, 2014a.)

Finally, the panel data extension of double-selection lasso developed by Belloni et al. (2014b) extends the general model to the panel framework:13

yi,t=xi,tβ+αi+ϵi.t,i=1,...,n;t=1,...,T(9)

Eliminating the fixed effects by subtracting the individual specific intercept leads to the “within model”:

y¨i,t=x¨i,tβ+ϵ¨i.t,(10)

where x¨i,t=xi,t1TΣt=1Txi,t. The coefficient estimate is defined by the solution to the penalized minimization problem of the within model:

β^lasso(λ)=argβmin1nTΣi=1nΣt=1T(y¨i,tx¨i,tβ)2+λnTΣi=1pψj|βj|(11)

The approach by Belloni et al. (2014b) to accommodate the non-homoskedastic errors, according to Ahrens, Hansen, and Schaffer (2019), is similar to standard clustered-error adjustments used in OLS models, whereby “observations within clusters are aggregated to create ‘super-observations,’ which are assumed independent, and ... are treated similarly to cross-sectional observations in the non-clustered case” (p. 21). Accordingly, the penalty loading ψj is given by

ψj=1nTΣinuij2(12)

where uij is the “super-observation”, defined as uij = Σtxijtεit for the ith cluster and jth regressor.

3.3 Data

For the empirical implementation of the model in equation (10), the dependent variables are gross capital inflow, FDI inflow, portfolio inflow, and other inflows, all as a share of GDP. The data on capital flows come from the FFA database, as before. We supplement the data on quarterly GDP where it is possible using the data from the IFS and Haver, and exclude extreme values from the data before running the model in equation (10) (see Section 2 for detail).

The explanatory variable of interest is an indicator variable dit (= 1 for gray-listing), defined over a window [to — k,to + k], where to is the date of the FATF announcement and k is varied between 1 and 3 quarters (see Section 2). Following a standard practice in the literature, we drop post-gray-listing observations for four quarters, so as to avoid the post-gray-listing dynamics in capital flow biasing our estimation of βo.14

With regard to control variables xi,t, there are 15 domestic primary variables: six exchange rate variables (exchange rate vis-a-vis the SDR, nominal effective exchange rate, real effective exchange rate, all for period average and end of period); two official reserves variables (in U.S. dollars and as a share of GDP); two current account balance variables (in U.S. dollars and as a share of GDP); one-year ahead forecast of real GDP growth rate; a measure of capital account openness; a measure of exchange rate flexibility; and credit ratings. Because of endogeneity concerns, all domestic variables are included in the model with four-quarter lags. Missing values in the primary domestic variables reduce the sample size to 81 countries, of which 28 have been gray-listed. See Appendix B for the list of countries. The domestic variables are complemented by 22 international variables: VIX, the U.S. short-term rate, the U.S. 10-year rate, the U.S. Federal Funds rate, two commodity price indexes (one including and one excluding gold), the real GDP growth rate of G4 countries (the U.S., the Euro Area, the U.K., and Japan) in total and individually, the money supply of G4 countries as a share of GDP in total and individually, the growth rate of the money supply of G4 countries in total and individually, and a dummy variable for the GFC. Furthermore, the analysis uses several permutations of each variable—such as levels and differences—as well as interaction effects of each variable with the three categorical variables in the model (a measure of exchange rate flexibility, credit ratings, and the dummy variable for the GFC). Finally, the model includes a full set of year fixed effects. Overall, this yields 2,842 variables. Appendix D describes the variables and sources.

3.4 Results

Table 1 reports the estimates of the impact of gray-listing on capital inflow, with results for different types of capital inflow presented across the columns. With double-selection lasso, the estimates of the coefficient of the variable of interest (β0) can be interpreted just as they are in a standard linear regression. However, the coefficients of the control variables cannot and therefore are not reported here.15

Table 1:

Effect of Gray-Listing on Capital Inflows

article image
The table presents estimates of the effect of gray-listing on the type of capital flow, in percent of GDP. Each specification includes additional controls as described in the text and Appendix D and a full set of year and country fixed effects. Robust standard errors clustered by country are in parentheses. Confidence intervals are at the 95 percent confidence level. *, **, *** mean, respectively, significant at the 10, 5, and 1 percent level.

Our point estimate for the effect of gray-listing on total capital inflow is -7.6, meaning that we expect a country’s capital inflow to decline by 7.6 percent of GDP when the country is gray-listed. The coefficient estimate is statistically significant at the 1 percent level based on the robust standard errors. The 95 percent confidence interval is between -10.6 percent of GDP and -4.6 percent of GDP, meaning that most countries experience a decline within this range when gray-listed.16

Moving on to different components of capital flows, the effect of gray-listing on FDI inflow is estimated to be -3.0 percent of GDP and the effect on portfolio flow is estimated to be -2.9 percent of GDP, both are significant at the 1 percent confidence level (Table 1, columns 2 and 3). The relative magnitude of the coefficients is thus similar for FDI and portfolio inflows– consistent with the finding in the event analysis–although it is possible that dynamics around the time of gray-listing could show different sensitivity both before and after the event (see Section 2).

Looking at different categories of other flows (Table 2), the estimated impact of gray-listing is largest for non-official sector flows (-3.8 percent of GDP). Banking sector flows are somewhat less negatively affected than other (non-bank) private flows (-2.0 percent of GDP and -2.4 percent of GDP). Official sector flows are the least severely affected (-0.9 percent of GDP). All estimates are significant at the 5 percent level or better.

Table 2:

Effect of Gray-Listing on Categories of Other Inflows

article image
The table presents estimates of the effect of gray-listing on the components of other investment flows, in percent of GDP. Each specification includes additional controls as described in the text and Appendix D and a full set of year and country fixed effects. Robust standard errors clustered by country are in parentheses. Confidence intervals are at the 95 percent confidence level. *, **, *** mean, respectively, significant at the 10, 5, and 1 percent level.

4 Robustness

4.1 Event Window

Our analysis focuses on the response of capital flows in a period around gray-listing, and to capture the response, we define gray-listing as an event with widow [t0k,t0 + k] where t0 is the date of the FATF public announcement and k is varied between 1 and 3 quarters, allowing for the possibility that gray-listing may be foreseen by market participants and that some investors may delay their response due to uncertainty (see Section 2). The size of the window in an event study is an open choice (Choi and Hashimoto, 2017). If the window is too narrow, it may preclude any proper identification of the effect of an event that takes time to materialize. If the window is too wide, other confounding factors may affect the dependent variable. In the baseline, we use k = 3. To check robustness of our estimates reported in Table 1, we re-estimate the model using k = 1, 2, and 4. Results are reported in Table 3.

Table 3:

Robustness: Effect of Gray-Listing using Different Event Windows

article image
The table presents estimates of the effect of gray-listing on total capital inflow, in percent of GDP. Each column corresponds to the size of k in an event window [t0 -k, t0 +k]. The column k=3 corresponds to the baseline results in the column “Total” in Table 1. Each specification includes additional controls as described in the text and Appendix D and a full set of year and country fixed effects. Robust standard errors clustered by country are in parentheses. Confidence intervals are at the 95 percent confidence level. *, **, *** mean, respectively, significant at the 10, 5, and 1 percent level.

For all values of k, we consistently find negative and statistically significant effects of gray-listing on capital inflows. The absolute size of the coefficient is larger when using a wider event window (k = 4), perhaps as expected. The coefficient becomes smaller in absolute value using a narrower window (k = 2 or k = 1) but remains negative and significant. It is possible that the larger impact we find using the wider window reflects influences of confounding factors; it is also possible that we may not see the full extent of impact on capital inflow using a narrower window, for example, because gray-listing is foreseen by market participants and they adjust their behavior earlier or, conversely, because gray-listing does not initially trigger de-risking among the investors in sufficient numbers within the first few months.

4.2 Small Number of Gray-Listing Observations

In our data, gray-listing observations are a small minority of the sample— about 0.7 percent—and this reduces the model’s accuracy for estimating the impact of gray-listing on capital flows. To check robustness of our estimates reported in Table 1, we re-estimate the model using different resampling techniques to increase the share of gray-listing observations in our sample. The techniques we used are oversampling, which involves adding more observations of the minority class sample; undersampling, which involves removing the majority class observations; and using a combination of the two. In all cases, we boost the share of gray-listing observations up to about 2 percent of the data (for example, see Tiffin, 2019).

The simplest oversampling technique would involve replicating the existing minority class observations (i.e., gray-listing=1) until the desired mix of majority and minority class in the sample is achieved. However, we use an alternative technique known as SMOTE (Synthetic Minority Oversampling Technique).17 Instead of replicating exact copies of the existing minority class observations, SMOTE creates synthetic copies that are similar to, but not exactly the same as, the existing minority class observations (see Fernandez et al., 2018 for details).

Next, we use random undersampling (RU), which involves randomly selecting and deleting the majority class observations, until the share of gray-listing observations in the data reaches 2 percent.18

Finally, we use a combination of SMOTE and random undersampling. According to the machine learning literature, the combined approach often results in the model with better out-of-sample performance (Brownlee, 2020).

The results are reported in Table 4. The estimated coefficients of gray-listing remain negative and similar in size to that obtained in the baseline. The standard errors become smaller where a number of the minority-class observations are substantially increased (i.e., under SMOTE) as expected but not where only the share of the minority-class observations is increased (i.e., under random undersampling or a combination of SMOTE and random undersampling). The 95 percent confidence intervals are therefore smaller under SMOTE, ranging from -9 to -4 percent of GDP, but remains similar to the baseline in others (-11 to -4 percent of GDP).

Table 4:

Robustness: Effect of Gray-Listing using Different Event Windows

article image

5 Concluding Remarks

How much gray-listing affects a country’s capital flows is of interest to policy makers, investors, and the Fund. It tells the policy makers about potential economic costs of gray-listing and may encourage them to make appropriate investments to address gaps in their countries’ AML/CFT regimes. It tells investors how others may respond to the gray-listing of countries of interest. It is of interest to the Fund in relation to country surveillance, capacity development, forecasts of demand for Fund programs, and estimates of capacity to repay. For example, a sudden loss of capital inflows after gray-listing could lead to loss of external reserves, and for vulnerable countries, it could mean a Balance of Payments crisis and demand for an IMF program; if the country already has a Fund program, it could mean delays in implementing agreed reforms, or a need to recalibrate policy targets and timeline for achieving them; if a country has only recently completed the Fund program, a sudden loss of capital inflows or external reserves could signal risk to capacity to repay the Fund and call for a closer monitoring.

The table presents estimates of the effect of gray-listing on total capital inflow, in percent of GDP. Each column corresponds to the re-sampling method used to increase the share of minority class data (gray-listing=1). Each specification includes additional controls as described in the text and Appendix D and a full set of year and country fixed effects. Robust standard errors clustered by country are in parentheses. Confidence intervals are at the 95 percent confidence level. *, **, *** mean, respectively, significant at the 10, 5, and 1 percent level.

The paper presents a relatively novel approach—machine learning—to estimating the effect of gray-listing on capital flows, where there is a large number of potentially confounding factors. The particular machine learning technique used, lasso, provides a systematic approach to model selection and estimation while still allowing for causal inference on the estimated coefficient of the treatment effect.

The paper finds that gray-listing has a significant negative impact on a country’s capital flows. The magnitude of the negative effect is large—on average -7.6 percent of GDP. It varies, however, by type of capital flow. The results are robust despite the small number of gray-listings in the sample and are robust to a narrower definition of the timing of gray-listing.

References

  • Ahrens, Achim, Christian B Hansen, and Mark E Schaffer. 2018. “LAS-SOPACK and PDSLASSO: Prediction, model selection and causal inference with regularized regression.” London stata conference 2018, Stata Users Group.

    • Search Google Scholar
    • Export Citation
  • Ahrens, Achim, Christian B. Hansen, and Mark E Schaffer. 2019. “lassopack: Model Selection and Prediction with Regularized Regression in Stata.” IZA Discussion Papers 12081, Institute of Labor Economics (IZA).

    • Search Google Scholar
    • Export Citation
  • Alleyne, Trevor, Jacques Bouhga-Hagbe, Thomas Dowling, Dmitriy Kov-tun, Alla Myrvoda, Joel Okwuokei, and Jarkko Turunen. 2017. “Loss of Correspondent Banking Relationships in the Caribbean: Trends, Impact, and Policy Options.” Working Paper WP/17/209, International Monetary Fund.

    • Search Google Scholar
    • Export Citation
  • Bacchetta, Philippe and Eric van Wincoop. 2000. Capital Flows to Emerging Markets: Liberalization, Overshooting, and Volatility. University of Chicago Press, 61–98. URL http://www.nber.org/chapters/c6165.

    • Search Google Scholar
    • Export Citation
  • Balakina, Olga, Angelo D’Andrea, and Donato Masciandaro. 2016. “Bank Secrecy in Offshore Centres and Capital Flows: Does Blacklisting Matter?Working Paper 2016–20, Centre for Applied Research on International Markets, Banking, Finance and Regulation.

    • Search Google Scholar
    • Export Citation
  • Becker, Chris and Clare Noone. 2008. “Volatility and Persistence of Capital Flows.” In Regional Financial Integration in Asia: Present and Future, BIS Papers 42. Bank for International Settlements, 159180.

    • Search Google Scholar
    • Export Citation
  • Belloni, Alexandre, Victor Chernozhukov, and Christian Hansen. 2014a. “High-Dimensional Methods and Inference on Structural and Treatment Effects.” Journal of Economic Perspectives 28 (2):2950.

    • Search Google Scholar
    • Export Citation
  • Belloni, Alexandre, Victor Chernozhukov, Christian Hansen, and Damian Kozbur. 2014b. “Inference in High Dimensional Panel Models with an Application to Gun Control.” Journal of Business and Economic Statistics 34 (4):590605.

    • Search Google Scholar
    • Export Citation
  • Broner, Fernando, Tatiana Didier, Aitor Erce, and Sergio L. Schmukler. 2013. “Gross capital flows: Dynamics and crises.” Journal of Monetary Economics 60 (1):113133. URL http://www.sciencedirect.com/science/article/pii/S0304393212001675Carnegie-NYU-RochesterConference.

    • Search Google Scholar
    • Export Citation
  • Brownlee, Jason. 2020. “Undersampling Algorithms for Im-balanced Classification.” Article dated on January 20, 2020. Available at: https://machinelearningmastery.com/undersampling-algorithms-for-imbalanced-classification/. Accessed: 10-152020.

    • Search Google Scholar
    • Export Citation
  • Bussiere, Matthieu and Marcel Fratzscher. 2006. “Towards a new early warning system of financial crises.” Journal of International Money and Finance 25 (6):953973.

    • Search Google Scholar
    • Export Citation
  • Chawla, N., K. Bowyer, L. Hall, and P. Kegelmeyer. 2002. “SMOTE: Synthetic Minority Over-Sampling Technique.” Journal of Artificial Intelligence Research 16:321357.

    • Search Google Scholar
    • Export Citation
  • Choi, Sangyup and Yuko Hashimoto. 2017. “The Effects of Data Transparency Policy Reforms on Emerging market Sovereign Bond Spreads.” Working Paper 17/74, International Monetary Fund.

    • Search Google Scholar
    • Export Citation
  • Chow, Gregory C and An-loh Lin. 1971. “Best Linear Unbiased Interpolation, Distribution, and Extrapolation of Time Series by Related Series.” The Review of Economics and Statistics 53 (4):372375. URL https://ideas.repec.org/a/tpr/restat/v53y1971i4p372-75.html.

    • Search Google Scholar
    • Export Citation
  • Collin, Matthew, Samantha Cook, and Kimmo Soramaki. 2016. “The Impact of Anti-Money Laundering Regulation on Payment Flows: Evidences from SWIFT Data.” Working Paper 445, Center for Global Development.

    • Search Google Scholar
    • Export Citation
  • Detragiache, Enrica and Asli Demirguc-Kunt. 1998. “Financial Liberalization and Financial Fragility.” IMF Working Papers 1998/083, International Monetary Fund.

    • Search Google Scholar
    • Export Citation
  • Eichengreen, Barry and Poonam Gupta. 2016. “Managing Sudden Stops.” Policy Research Working Paper 7639, World Bank.

  • Farias, Maria Elisa and Monica Arruda de Almeida. 2014. “Does Saying “Yes” to Capital Inflows Necessarily Mean Good Business? The Effect of Antimoney Laundering Regulations in the Latin American and the Caribbean Economies.” Economics & Politics 26:96127.

    • Search Google Scholar
    • Export Citation
  • Fernandez, A., S. Garcia, F. Herrera, and N. Chawla. 2018. “SMOTE for Learning from Imbalanced Data: Progress, Challenges, Making the 15-year Anniversary.” Journal of Artificial Intelligence Research 61:863905.

    • Search Google Scholar
    • Export Citation
  • Fonti, Valeria and E. Belitser. 2017. “Paper in Business Analytics Feature Selection using LASSO.”

  • Forbes, Kristin J. and Francis E. Warnock. 2012. “Capital flow waves: Surges, stops, flight, and retrenchment.” Journal of International Economics 88 (2):235251.

    • Search Google Scholar
    • Export Citation
  • Frank, Ildiko E. and Jerome H. Friedman. 1993. “A Statistical View of Some Chemometrics Regression Tools.” Technometrics 35 (2):109135.

    • Search Google Scholar
    • Export Citation
  • Giles, Judith A. and David E. A. Giles. 1993. “Pre-test Estimation and Testing in Econometrics: Recent Developments.” Journal of Economic Surveys 7 (2):145197.

    • Search Google Scholar
    • Export Citation
  • Gourinchas, Pierre-Olivier and Maurice Obstfeld. 2012. “Stories of the Twentieth Century for the Twenty-First.” American Economic Journal: Macroeconomics 4 (1):22665.

    • Search Google Scholar
    • Export Citation
  • IMF. 2013. “Yin and Yang of Capital Flow Management: Balancing Capital INflows with Capital Outflows.” In World Economic Outlook, chap. 4. Washington, D.C.: International Monetary Fund, 113132.

    • Search Google Scholar
    • Export Citation
  • Kudrle, Robert Thomas. 2009. “Did Blacklisting Hurt the Tax Havens?Journal of Money Laundering Control 12 (1):3349.

  • Morse, Julia C. 2019. “Blacklists, Market Enforcement, and the Global Regime to Combat Terrorist Financing.” International Organization 73 (3):511545.

    • Search Google Scholar
    • Export Citation
  • Mullainathan, Sendhil and Jann Spiess. 2017. “Machine Learning: An Applied Econometric Approach.” Journal of Economic Perspectives 31 (2):87106.

    • Search Google Scholar
    • Export Citation
  • Simmons, Joseph P., Leif D. Nelson, and Uri Simonsohn. 2011. “False-Positive Psychology: Undisclosed Flexibility in Data Collection and Analysis Allows Presenting Anything as Significant.” Psychological Science 22 (11):13591366.

    • Search Google Scholar
    • Export Citation
  • Tibshirani, Robert. 1996. “Regression Shrinkage and Selection via the Lasso.” Journal of the Royal Statistical Society. Series B (Methodological) 58 (1):267288.

    • Search Google Scholar
    • Export Citation
  • Tiffin, Andrew. 2019. “Machine Learning and Causality: The Impact of Financial Crises on Growth.” Working Paper WP/19/228, International Monetary Fund.

    • Search Google Scholar
    • Export Citation
  • Tille, Cdric and Eric van Wincoop. 2014. “International capital flows under dispersed private information.” Journal of International Economics 93 (1):3149. URL http://www.sciencedirect.com/science/article/pii/S0022199614000294.

    • Search Google Scholar
    • Export Citation
  • Torgo, L., R. P. Ribeiro, B. Pfahringer, and P. Branco. 2013. “SMOTE for Regression.” In Progress in Artificial Intelligence, Vol. 8154, edited by L. Correia, L. P. Reis, and J. Cascalho. Springer, Berlin, Heidelberg, 378389.

    • Search Google Scholar
    • Export Citation
  • Urminsky, Oleg, Christian Hansen, and Victor Chernozhukov. 2016. “Using Double-Lasso Regression for Principled Variable Selection.” Available at SSRN: https://ssrn.com/abstract=2733374.

    • Search Google Scholar
    • Export Citation

Appendices

A FATF Gray-Listing

The Financial Action Task Force (FATF), established in 1989 by the G-7, is an intergovernmental body that develops and promotes policies to protect the global financial system against money laundering and terrorist financing (https://www.fatf-gafi.org/about/). It has issued 40 recommendations on how states should combat the problems of money laundering, terrorist financing, and proliferation financing through legal and regulatory action. Since 2000, it has periodically issued lists of countries and jurisdictions that it judges to have strategic AML/CFT deficiencies.19

The FATF uses a mutual evaluation model. The evaluations are carried out by the FATF and nine FATF-style regional bodies, and occasionally, by the IMF or World Bank. Each body forms a small team consisting of legal, law enforcement, and financial experts. The assessment team conducts two to three on-site visits and rates the level of technical compliance of a country’s legal and regulatory framework on each recommendation as compliant, largely compliant, partially compliant, or noncompliant. It also rates the effectiveness of the AML/CFT system on 11 “immediate outcomes” as low, moderate, substantial, or high. “The evaluation process is lengthy, often taking more than a year” (Morse, 2019, p.10).20 The mutual evaluation reports are discussed and adopted in plenary meetings of the relevant assessor body. FATF mutual evaluations are adopted at tri-annual meetings (currently in February, June, and October). “During these meetings, evaluated countries may argue against portions of the draft report or advocate rating changes” (Morse, 2019, p.10).21

There are several ways a country can become subject to review by the International Cooperation Review Group (ICRG): it does not allow timely publication of the mutual evaluation report or does not participate in as a member in the FATF’s Global Network, the result of its assessment shows a significant number of key deficiencies, or it is nominated for the ICRG review by one or more other countries for specified and substantiated reasons.22 The country that enters the ICRG review process as a result of poor performance on its mutual evaluation has one year to address its key deficiencies to avoid listing. If deficiencies are not sufficiently addressed, the FATF works with the country to develop an action plan. Once the plan is adopted by the FATF and committed to by the country, the FATF includes the country in the gray list. If no action plan is agreed with the FATF or high-level political commitment is not given, the FATF includes the country in the black list. Countries on the gray list that fail to make sufficient progress on their action plans within one year are also subject to blacklisting.

The decisions of FATF or outcomes of the plenary meetings involve a complex case-by-case assessment and do not follow an automatic timetable. Each assessed country can argue or dispute the findings of the mutual evaluation report and its ratings. Moreover, the FATF can extend the previously agreed deadlines for taking actions, further prolonging the gray-listing process.

Table A1 provides a list of gray-listed countries with dates of announcements.

Table A1.

Gray-listed countries, 2000–17

article image
article image

Listed under the NCCT (non-co-operative countries and territories process) (2000–2006) before it was replaced by the ICRG process (since 2007). See FATF website for detail.

B Estimation Sample: 2000q1–2017q4

article image
article image

At any time during the sample period 2000–2017.

Not in the sample for lasso.

Not in the sample in some specification for lasso.

C Gray-Listing and Net Errors and Omissions

Figure 3:
Figure 3:

Evolution of Capital Flows and Net Errors and Omissions Around Gray-Listing (Percent of GDP)

Citation: IMF Working Papers 2021, 153; 10.5089/9781513582436.001.A001

Source: IFS, FFA, and authors’ calculations. Based on estimation of equation (1) using quarterly data of total capital inflow to GDP and net errors and omissions to GDP between 2000q1 and 2017q4. Each panel reports the estimates of conditional means of each variable relative to “tranquil times” around the time of gray-listing. See Section 2 for detail.

D Variables and Data Sources

Table D1.

Variables and Data Sources

article image
article image
Note: CBOE = Chicago Board of Options Exchange; FRED = Federal Reserve Economic Data; IFS = International Financial Statistics; WEO = World Economic Outlook.
3

Based on the FATF announcement in February 2021.

4

Unavailability of quarterly data on GDP constrains the sample size. In the FFA, the data on quarterly capital flows are available for 121 EMDCs but the data on quarterly GDP are available only for 81, of which 34 have been gray-listed. We therefore supplement the quarterly GDP data for 8 more gray-listed countries using data from the IFS and Haver: for Tajikistan using the available quarterly GDP data from Haver; for Angola, Bangladesh, Nepal, and Trinidad and Tobago using estimated quarterly GDP based on the quarterly industrial production data from the IFS; and for Pakistan, Panama, and Venezuela using estimated quarterly GDP based on the quarterly industrial production data from Haver. The estimation of quarterly GDP follows the Chow-Lin method (see Chow and Lin, 1971).

5

Given their size or their economic structure, capital flows in these countries can be influenced to a greater degree by similar forces as those in EMDCs.

6

Detection of extreme values are based on the fixed-effect (within) transformation, y¨it=(yity¯i,t), where y¯i,t=1TΣt=1Tyit, and defined as those that lie outside the whiskers of the generalized box plot that adjusts for asymmetry and skewness in the distribution of ÿi,t. See Bruffaerts et al (2014).

7

For example, Gourinchas and Obstfeld (2012) define the timing of banking, currency, and financial crisis using a window, where k is allowed to vary between 1 to 3 years; Choi and Hashimoto (2017) define the timing of data transparency policy reform using an event window in which k is allowed to vary between 1 to 4 quarters.

8

Such offsetting behavior is well-documented in the literature. See for example, Broner et al. (2013) and Forbes and Warnock (2012).

9

A substantial literature has documented such information differences between domestic and foreign agents. See, for example, Tille and van Wincoop (2014) and IMF (2013). Broner et al. (2013) also report similar empirical evidence showing that gross capital outflows start rising ahead of a crisis when the crisis originates domestically.

10

Developed by Frank and Friedman (1993) and Tibshirani (1996). While lasso originally stood for least absolute shrinkage and selection operator, today it is considered a word and not an acronym and encompasses other machine learning algorithms based on similar principles, such as ridge and elastic-net.

11

This is because lasso, by selecting the covariates based on their ability to predict the outcome subject to constraints on model complexity as described above, may not select the “true” model (i.e., the data-generating process or DGP). And there is a possibility that the regularization excludes some variables that belong in the DGP and a correlation between them and variables included in the model can create bias in the estimated coefficients (i.e., omitted variable bias).

12

How does lasso’s feature reduction compare with successively reducing the number of insignificant controls using OLS (i.e., the general-to-specific approach)? Ahrens, Hansen, and Schaffer (2018) note that the latter approach tends to be more vulnerable to “pre-test bias”, “researcher degrees of freedom”, and “p-hacking”. Pre-test bias refers to nonin-dependence of successive test statistics in sequential hypothesis testing in econometric modeling (Giles and Giles, 1993); researcher degrees of freedom refers to many decisions available to researchers in model selection (e.g., which variables to include or exclude, whether to combine or transform variables, which alternative specifications to be compared and tested, etc.); and p-hacking refers to researchers exploring various analytical alternatives to search for a combination that yields significant results and to report only what worked (Simmons, Nelson, and Simonsohn, 2011). Ahrens, Hansen, and Schaffer (2018) also compare model-selection and performance of lasso against a “kitchen sink” OLS (includes all regressors) and a “step-wise” OLS (begin with general model and drop if p-value below 0.05). They found that the kitchen-sink OLS suffered from “classical” symptoms of overfitting, with lowest in-sample RMSE (root-mean-square errors) and worst out-of-sample prediction; and the step-wise OLS suffered from biased (over-sized) coefficients, inflated R-squared, and invalid p-values; while lasso-based models, in contrast, had generally superior in-sample and out-of-sample performance.

13

Following exposition comes from Ahrens, Hansen, and Schaffer (2019).

15

This is because lasso, in general, does not produce unbiased estimates of coefficients and standard errors (see Section 3.1). The double-selection lasso eliminates the source of the bias for estimating the variable of interest (Section 3.2). But estimates of other controls remain open to the potential bias.

16

The impact of -7.6 percent of GDP seems large, although not unseen during previous crises (see e.g., Broner et al., 2013). To put it into perspective, using the FFA data, a decline of this magnitude corresponds to about the 25th percentile of the distribution of quarterly declines in gross capital inflow for EMDCs during the GFC (which means that 25 percent of EMDCs in the sample had a larger quarterly decline), or corresponds to about the 45th percentile of the distribution of quarterly declines for countries affected by the Asian Crisis in 1997–98.

17

The technique is originally developed by Chawla et al. (2002), and since has become de facto standard in the machine learning literature for dealing with imbalanced data (Fernandez et al., 2018). In this literature, SMOTE is often used for resampling the minority class observations based on outcome (dependent) variables. Here, we apply SMOTE for resampling the minority class based on the predictor (independent) variable (i.e., gray-listing), as done in, for example, Torgo et al. (2013) and Tiffin (2019).

18

Random undersampling, although simple and effective, has a limitation that samples are removed “without any concern for how useful or important they might be in determining the decision boundary between the classes. This means it is possible, or even likely, that useful information will be deleted” (Brownlee, 2020).

19

In the early 2000s, the FATF mainly assessed countries for technical compliance, whereas the FATF revised its standards and assessment methodology in 2012–13 to focus on evaluating the effectiveness of AML/CFT systems. This meant that countries removed from the gray list on account of legal reforms may have ended up back on the list for lack of effective implementation.

21

More specifically, during the meetings, FATF members, observers, assessors, and the assessed countries consider issues where there is significant debate over the interpretation of a FATF Standard or the application of the methodology. These issues are raised by delegations (including the assessed country) that have an analytical concern. This vetting process may result in changes in ratings, analysis, conclusions, or recommendations.

22

Entries to the ICRG review process through the last mechanism are rare, however, and require the consensus of the FATF membership. There are also mechanisms in the ICRGs procedures to ensure that the FATF does not review small countries with a potentially insignificant impact on the financial system.

  • Collapse
  • Expand
The Impact of Gray-Listing on Capital Flows: An Analysis Using Machine Learning
Author:
Mizuho Kida
and
Simon Paetzold
  • Figure 1:

    Evolution of Capital Flows Around Gray-Listing (Percent of GDP)

  • Figure 2:

    Evolution of Components of Other Flows Around Gray-Listing (Percent of GDP)

  • Figure 3:

    Evolution of Capital Flows and Net Errors and Omissions Around Gray-Listing (Percent of GDP)