Rules of Thumb for Bank Solvency Stress Testing

Rules of thumb can be useful in undertaking quick, robust, and readily interpretable bank stress tests. Such rules of thumb are proposed for the behavior of banks’ capital ratios and key drivers thereof—primarily credit losses, income, credit growth, and risk weights—in advanced and emerging economies, under more or less severe stress conditions. The proposed rules imply disproportionate responses to large shocks, and can be used to quantify the cyclical behaviour of capital ratios under various regulatory approaches.

Abstract

Rules of thumb can be useful in undertaking quick, robust, and readily interpretable bank stress tests. Such rules of thumb are proposed for the behavior of banks’ capital ratios and key drivers thereof—primarily credit losses, income, credit growth, and risk weights—in advanced and emerging economies, under more or less severe stress conditions. The proposed rules imply disproportionate responses to large shocks, and can be used to quantify the cyclical behaviour of capital ratios under various regulatory approaches.

I. Introduction

Financial sector stress testing has become a widespread, important, and prominent activity. Stress testing is used to identify financial sector vulnerabilities,2 to inform policy decisions affecting the financial system and individual institutions,3 and to guide companies’ own risk management. Yet, its practical application is often demanding, and there remain questions about its reliability.

Obtaining stress test results and establishing their robustness would be facilitated by the availability of “rules of thumb,” that is, rough guides to typical behavioral relationships, magnitudes of shocks, and the impact of shocks on banks, based on a wide range of experience. This paper attempts identifying such rules of thumb that apply to bank solvency, that is, capital ratios. The rules of thumb elaborated in this study are meant to serve mainly as general guidelines, particularly to understand worst-case scenarios, and to complement rather than substitute for detailed analysis of a country’s banks and its institutional, financial and conjunctural circumstances.

Rules of thumb would be useful at various stages of stress testing:

  • One challenge is to design tests for a country with limited relevant experience and data, either because of structural breaks in time series or because the county did not suffer a severe banking crisis in recent decades. Then, on the one hand, rules of thumb derived from experience in various countries with underlying similarities can be used to construct and calibrate relevant stress tests for the country concerned. Rules of thumb based on long, world-wide experience provide indicators of the great variety of scenarios that countries have suffered. On the other hand, given a scenario, rules of thumb for behavioral relationships can be used to make projections when reliable stress testing methods are unavailable locally. For example, when national authorities or bank management are unable to estimate behavioral relationships robustly based on the data available, it may be wise to “import” a rule of thumb.

  • Even when national experience and data allow the construction of a relatively complex model that captures well past behavior, it could be less relevant in the future (for example, because certain asset classes are more or less relevant than they were in the past), and thus give a false sense of accuracy; “model uncertainty” is an important consideration in stress testing and risk management more generally, though it is easy to overlook. Unless one knows with some precision the behavioral relationships that are relevant going forward, a simple rule of thumb may provide the most reliable estimates and basis for action (Haldane, 2012 and 2013). Rules of thumb can be used also to check the plausibility of estimates of behavior derived from national experience.

  • It is often important to assess, and if possible quantify, the potential impact of nonlinearities; stress tests are much more informative and credible if one can say how sensitive are results to changes in assumptions (Taleb et al, 2012). To this end, the ability to conduct multiple “runs” at low marginal cost using rules of thumb, rather than re-analyzing data in fine detail, is valuable to supervisors and managers.

  • Rules of thumb can be used to assess the stability implications of various prudential rules, such as minimum capital requirements, including on a crosscountry basis. With rules of thumb, one can generate rough projections of the magnitude of shocks that banks typically face and what they could withstand depending on their capitalization and other characteristics. Such projections would not replace detailed impact studies, but would provide a plausibility and robustness check.

  • Stress test frameworks (methods and assumptions alike) and outcomes have to be readily accessible for senior managers and policy-makers if action is to be triggered. A very complex model may be difficult to interpret and to link to policy instruments, and therefore it may distract from an informed debate on what actions should be taken; debate over the model may obscure debate over policy. To this end, decision-makers would be helped by the availability of readily understandable benchmarks regarding the potential outcome of stress tests and some of the main behavioral relationships underlying them.

Motivated by on these considerations, this paper concentrates on the formulation of rules of thumb for key factors affecting bank solvency, namely credit losses, pre-impairment income and credit growth during crises, and illustrates their use in the simulation of the evolution of capital ratios under stress.4 We thereby seek to provide answers to the following common questions in stress testing:

  • How much do credit losses usually increase in case of a moderate, medium and severe macroeconomic downturn and/or financial stress event, e.g., if cumulative real GDP growth turns out to be, say, 4 or 8 percentage points below potential (or average or previous years’) growth?

  • How typically do other major factors that affect capital ratios, such as profitability, credit growth, and risk-weighted assets (RWA), react under these circumstances?

  • Taking these considerations together, how does moderate, medium, or severe macro-financial stress translate into (a decrease in) bank capital, and thus, how much capital do banks need to cope with different levels of stress?

In answering these sorts of questions, a useful set of rules of thumb for stress testing should embody several properties:

  • Coverage of the major factors contributing to banks’ vulnerabilities (in terms of solvency).

  • Wide applicability, but with criteria to determine where inapplicable. A rule of thumb should be useful in many circumstances and many countries, but it should be clear where it should not be used.

  • Robustness, implying that the rule is supported by a variety of evidence and not subject to excessive model risk.

  • Intuitiveness, so that it can be used to interpret results and inform decision-making.

As a corollary of these properties, a desirable rule of thumb should be relatively simple. Simplicity is likely to enhance wide applicability, robustness, and intuitiveness. Rules will be developed along these principles.

To find common patterns, the study investigated various pieces of empirical evidence, including descriptive statistics, which may capture stress that does not necessarily originate from measured macroeconomic factors. Our analysis includes data from various previous crises, but focuses on the crises during the last 15 years (including the Russian/Asian crisis, crises experienced by the transition countries in Central and Eastern Europe in the late 1990s, the burst of the internet bubble in the early 2000s, the global financial crisis, and country, as well as bank-specific crises).

The evidence justifies distinguishing between emerging market economies (EMs) and advanced economies (ACs), because their typical behavior differs importantly.5 These differences can plausibly be attributed to differences in macroeconomic performance—for example, EMs tend to have relatively large cyclical fluctuations—and structural factors, such as the effectiveness of loan workout mechanisms. Because the rules of thumb differ across ACs and EMs, as do the typical magnitudes of shocks, different levels of capitalization are needed to achieve a given level of resilience against potential shocks. Evidence from low income countries (LICs) is used whenever possible, but less is available, and the functioning of LIC economies may differ from that of both ACs and EMs, for example, because of greater dependence on export of commodities and a much lower level of bank intermediation.

The evidence suggests also that the behavior of relevant variables (loss rates, income, credit growth, RWAs, capital ratios) is highly non-linear around crises. Effects on bank capitalization and loan quality under a severe crisis are disproportionately great. In case of credit losses, for example, severe loss levels are many times higher than in normal times, and in such circumstances banks typically exhibit substantially lower pre-impairment income that can be used as a buffer against losses. In response to stresses, they restrain from paying dividend and/or deleverage, the latter being a powerful way to restore bank solvency but a macroeconomically costly alternative if it comes along with constrained credit supply.

It is found also that the interpretation of capital levels should take into account differences in the measurement of regulatory capital: the measured risk-based capitalization of a bank using the standardized approach (StA) under the Basel II standard will be less sensitive to both positive and negative shocks than that of a bank using the Internal Ratings-Based approach (IRB). On this basis, one may suggest that the StA may be slow to reveal emerging vulnerabilities, while the IRB approach more quickly reflects deterioration in a bank’s situation (and rebounds faster when conditions improve), provided that changes in risk are reflected in IRB risk weights on a timely basis.6 By the same token, a given level of risk-based capitalization during benign times may be less of a buffer for a bank using the IRB approach than for a bank using the StA.

The structure of the paper and the main variables investigated are illustrated in the following chart. Section II explains the main elements of the approach and the evidence available. Section III contains the proposed descriptive rules of thumb based on stylized facts of financial crises, where the rules are conditional on the evolution of credit losses. Section IV contains rules of thumb versions of satellite models that link the key drivers of bank solvency—notably credit losses, pre-impairment income, credit growth, and RWAs—to the evolution of GDP. Application of these rules to stylized banks based in an AC or an EM serves to illustrate their use (Section V). Section VI concludes.

II. Methodology and Sources

The main metric used in bank solvency stress tests is the capital ratio, and especially the risked-based capital ratio defined as a bank’s capital divided by its risk-weighted assets (RWA).7,8 The tests are meant to yield projections of capital ratios after stress, over the relevant time horizon. Hence, the rules of thumb described here are those most relevant to this objective. Post-shock risk-based capitalization can be decomposed as follows:

ProjectedCapitalRatiot+1=InitialCapitalt+ProjectedRetainedNetProfitt+1InitialRWAt+ProjectedChangeinRWAt+1

where the projected retained net profit is negative if net income is negative, and otherwise depends on the assumption made on dividend payouts.9 The main factors affecting net income that are relevant for solvency tests include:10

  • Loan loss provisions (and write-offs, from an ex-post perspective), which in turn depend on probabilities of default (PDs) and loss given default (LGD) rates, or on some other rule for categorizing nonperforming loans (NPLs) and making provisions on them.

  • Pre-impairment income, including net interest income (including funding costs); commission and fee income; trading income; other operating income; and operating expenses.

  • Dividend payouts and taxation.

Both the numerator and the denominator of capital ratios matter; especially if the projection period for a stress test is extended beyond a year, RWA might evolve in ways that strongly affect the need for capital and a bank’s ability to meet regulatory requirements. The change in RWA depends on two main factors: (i) volume, i.e., the projected net growth in the balance sheet and specifically that of loans; and (ii) risk, i.e., changes in risk weights due to the changes of the risk profile of the banks’ assets, especially for those banks using an IRB approach.11

Other stability metrics are available and often useful—the unweighted leverage ratio, for example, is widely regarded as a robust, complementary indicator. The post-stress leverage ratio can be decomposed as follows:

ProjectedCapitalRatiot+1=InitialCapitalt+ProjectedRetainedNetProfitt+1InitialAssetst+ProjectedChangeinAssetst+1

where “assets” take account of on- and off-balance sheet items, such as credit lines, commitments, and guarantees. The denominator of this ratio can be affected in a stress scenario if allowance is made for on- and off-balance sheet growth, including through the write-off of losses. Factors affecting the numerator are the same as those for the risk-based capital ratio.

Return on capital (ROC) and return on assets are indicative of a bank’s ability to recover from a capital loss. Indeed, profitability is the first line of defense of any bank against credit and other risk. A sufficiently profitable bank can earn enough to restore its capitalization even in the face of substantial stress, either by attracting new capital with the promise of dividends or by retaining earnings. A bank with low profitability will be less able to recover from even a brief negative shock. The proposed rules of thumb are useful for projecting these metrics as well.

The rules of thumb are derived from the literature on banking crises, and statistical evidence on the evolution of bank loan quality and quantity obtained from two datasets:

  • Long-sample evidence is provided by data on default rates over a period of 90 years (1920-2012) for ACs, and indeed mainly U.S. corporates (Moody’s 2013).12,13 Specifically, Moody’s reports annual exposure-weighted historical default rates. This long sample includes five periods of substantial stress.

  • Recent cross-country evidence is provided by data on the sample of banks available in Bankscope. The time dimension is limited to the period from 1996 to 2011, with the number of banks increasing in the later part of the sample. However, this period does cover various episodes of stress in the countries covered. Evidence is obtained on bank performance in ACs, EMs and some LICs. The data covers more than 16,000 banks in 200 countries and jurisdictions, but the majority of banks (more than 13,000) are based in ACs (almost half in the United States), about 3,000 in are EMs, and only 550 are in LICs. There is some selection bias toward larger banks, especially in the LICs, which renders the evidence for these countries less robust. For the establishment of the rules of thumb, outliers were removed from the sample to avoid misleading results, and other robustness checks were performed, leaving a sample of more than 10,000 banks from almost 170 countries.14 Summary statistics are provided in Appendix 1.

Such evidence drawn from a wide range of times and countries no doubt suffers from large differences in definitions of relevant variables, and the analysis of the evidence needs to recognize and cope with this challenge. Even today among countries using accounts based on International Financial Reporting Standards, criteria for classifying assets or defining capital, for example, can differ widely. Loss recognition and provisioning practices can differ from bank to bank within the same jurisdiction. Yet, for the purposes of this study, using a wide range of evidence is essential: only diverse data sources can yield evidence on the effects of extreme but plausible shocks, including shocks which force banks to reveal losses that are hidden in normal times. An attempt is made to limit the influence of data problems by using relatively robust techniques, and also by looking at indicators of both location and spread.

III. Typical Banking Crises and Descriptive Rules of Thumb

A simple but possibly robust approach to defining and calibrating a stress test for a banking crisis is to look at historical episodes, not just in one country—which may have limited experience—but in a range of comparable countries. Past crises episodes and severe recessions provide highly relevant information on the impact of stress on bank solvency through various channels. In what follows we seek to come up with descriptive rules of thumb, after some suggestive conclusions on the link between macroeconomic conditions and banks’ performance.

A. Literature on Banking Crises

There is a large literature on banking crises, mostly looking at macroeconomic precursors and effects, and at the effectiveness of the strategies adopted on different occasions.15 A subset of studies provides quantitative evidence that is of direct relevance to stress testing. One example thereof is Čihák and Schaeck (2007), who look at 51 episodes of banking crisis during 1994-2004, in a sample that covers countries from every region. Figure 1 illustrates the typical behavior of default rates through a crisis, showing the evolution of the stock and (a proxy for) the inflow of NPLs relative to total loans three years on either side of a crises peak. Table 1 provides more detailed estimates, broken down by region.

Figure 1.
Figure 1.

Čihák and Schaeck Evidence on Typical Evolution of NPL Ratios Around a Crisis

Citation: IMF Working Papers 2013, 232; 10.5089/9781475518115.001.A001

Source: Čihák and Schaeck (2007, p.15); and authors’ calculations.
Table 1.

Čihák and Schaeck Evidence on Typical Evolution of NPL Stock Ratios Around a Crisis

article image
Source: Čihák and Schaeck (2007).

Percent of loans overdue 90 days or more.

Country sample “Asia” includes Indonesia, Korea, Thailand, and Philippines; “Emerging Europe” includes Czech Republic, Russia, Slovak Republic, and Turkey. “Latin America” includes Argentina, Brazil, Mexico, and Uruguay.

It can be seen that NPL stock ratios (in many countries still the most commonly available credit risk indicator) increase substantially during crises, and typically peak one year after the materialization of a crisis.16 The temporal shift reflects the fact that some loans default with some time lag after the materialization of macroeconomic stress, and that many banks have tended to recognize NPLs with delay, i.e., do not provision fully all potential losses after the first year(s) of a crisis.17,18 The NPL stock ratios rise by about 10 percentage points from the typical level one year before the crisis, in “average” crises, and almost 25 percentage points in severe crises. The stock is persistent: even after three years, the NPL ratio is at about the same level when the crisis materializes.

Using NPL flow ratios as a proxy for default rates (Box 1), we find that they peak at about 10 percent in “average” crises, and at about 18 percent in severe crises, up from 3 percent in “normal” times. The evidence here suggests that default rates come down to pre-crisis levels after three years, which makes their pattern roughly symmetric with respect to the crisis. The time needed to resolve problem loans implies that reduction in the stock tends to take longer.19

One immediate implication of this evidence by type of country is that the NPL ratio and the NPL flow ratio in EMs tend to peak at higher levels, and are generally more variable than those in ACs. Also, real GDP growth is much more variable in EMs. A useful set of rules of thumb need to accommodate these major differences; different rules may be needed for ACs and EMs. Similarly large differences in typical behavior across types of country will be documented in the analysis that follows.

Proxies for Credit Loss Rates

For analytic purposes it is often most useful to consider probabilities of default (PD) and rates of loss given default (LGD), the product of which equals the rate of credit losses per time period. However, information on PDs and LGD is often unavailable, and so proxies have to be used. Some of these proxies capture stocks rather than flows, and so ancillary assumptions are needed to estimate rates.

Banks’ profit and loss accounts provide evidence on (past) credit loss rates. Generally, accounts are meant to recognize potential losses when there is firm evidence of impending default, and value them depending on expected losses given default. Accounting practices differ in the extent to which they are forward looking and the discretion banks have in recording the timing and magnitude of losses.

Many countries report data on NPL stocks. The PD at a certain time t, in a forward-looking context, can be approximated by PDt = NPLt+1 - NPLt +α* NPLt-1, where α is the portion of the loans that are written-off

in period t-1. Here, for the years before the crisis, α is set to 0.5, which means that NPLs are fully written off after about two years. For the period after the peak of NPLs, it is assumed that loans are written off at a slightly slower pace, namely after three years, which is equivalent to an α of 0.33. This approximation is affected by differences across countries in the definition of NPLs, and by shifts in the severity of impairment of NPLs, which would normally be reflected in provisioning rates.

B. Historical Evidence on Banking Crises

The long-sample evidence suggests broadly similar patterns: The default rates observed in the Moody’s data for corporates during the last nine decades peak on five occasions, with the highest peak (unsurprisingly) occurring during the Great Depression of the 1930s (Figure 2).20

Figure 2.
Figure 2.

Historical Annual Default Rate for All Rating Grades

Citation: IMF Working Papers 2013, 232; 10.5089/9781475518115.001.A001

Source: Default Rates: Moody’s (2013); Real GDP Growth: Federal Reserve Bank (2011).

According to Giesecke et al. (2011), default rates before 1900 twice peaked at levels even higher than that seen during the 1930s (Figure 3). This finding may reflect the fact that the U.S. economy before 1900 was more like that of an emerging market country, being relatively highly dependent on commodity production and prices and prone to major infrastructure booms and busts (such as in railroad building). Also, arguably, economic policy-making and implementation were worse before the establishment of the Federal Reserve or major automatic stabilizers. Hence, the historical United States corroborates the hypothesis that default rates are higher and more variable in EMs than in ACs. The loss levels observed during the 1930s are kept as a reference point of an extreme AC crisis, with the caveat that conditions have changed substantially since then.

Figure 3.
Figure 3.

Historical Corporate Bond Default Rates (1866–2008)

Citation: IMF Working Papers 2013, 232; 10.5089/9781475518115.001.A001

Source: Giesecke et al. (2011).Note: The figures are based on value-weighted default rates while the data from Moody’s (2013) focuses on the percentage of issuers that default.

An implication of the evidence presented so far is that one should distinguish crises by their severity. There may be relatively mild crises, that occur relatively frequently, and at greater intervals there are major crises, with more profound effects.21 A useful set of rules of thumb need to accommodate also these major differences; different rules may be needed for moderate, moderate, severe, and extreme stress situations. Specifically, we distinguish between:

  • normal conditions (in statistical terms, the median of credit loss rates);

  • moderate stress (the 80th percentile of credit loss rates);22

  • medium stress (the 90th percentile of credit loss rates);

  • severe stress (the 97.5th percentile of credit loss rates); and

  • extreme stress scenarios (the historical maximum—typically the worst year in a century).

Applied to the long-term evidence on default rates from Moody’s, one of the five crises would be classified as “moderate/medium,” two as “medium,” the current crisis as a borderline case between “medium” and “severe,” and the crisis in the 1930s as a “severe/extreme” crisis (Figure 2).

C. Descriptive Rules of Thumb

Credit loss rates

The recent cross-sectional evidence (from the 15 years 1996-2011) on the effects of crises on banks’ loan quality corroborates the long-term evidence. Table 2 shows the annual credit loss rates (flow of loan loss provisions from profit and loss accounts relative to the total loan stock) banks should expect under various levels of stress, distinguishing between ACs, EMs, and LICs. These parameters have been determined based on the median annual loss rates per year; the historical percentiles are computed for each of the three country types corresponding to the respective stress level, with a view to allowing for a wide comparison.

Table 2.

Typical Credit Loss Levels under Different Levels of Shocks

(Medians except where indicated; percent of credit outstanding)

article image
Source: Authors, based on Bankscope data.

This evidence confirms the previous observation that credit loss levels have been typically substantially lower in ACs than in EMs and LICs, while loss levels are found to be quite similar for EMs and LICs. For AC banks, credit loss rates peak roughly at 0.8 percent under moderate stress, 1.1 percent for medium stress, and 2.4 percent under severe stress conditions, while levels of more than 3 percent are historical maximum levels for country aggregates. Across stress levels, losses are higher in EMs by a factor of about three compared to those in ACs. Yet, Figure 4 shows that this is a general pattern, and that the results for the three country groups overlap, i.e., that there are EM countries where banks have experienced lower median loss rates during the past 15 years than some of the AC countries, for example. The same also holds true for moderate, medium, severe and extreme stress levels - an important reason for this result being that, during the sample period of 15 years, only some of the countries experienced a banking crisis.

Figure 4.
Figure 4.

Median Loss Rates by Country (1996-2011)

(Percent of credit outstanding)

Citation: IMF Working Papers 2013, 232; 10.5089/9781475518115.001.A001

Source: Authors, based on Bankscope data.

Comparable results are obtained from bank-by-bank data. The maximum credit loss level for each bank was assigned to the respective level of severity.23 To assess the full pattern of solvency parameters around crises, seven consecutive observations are used: three before the peak of the crisis, the peak year, and three afterwards. Given the rather limited length of the time series of the Bankscope dataset (Appendix Figure 1), just one severity level (the maximum) per bank was included, which filter also precludes the multiple use of data points.

Banks with very low loss levels or extremely high loss levels (loss levels above 60 percent, which applies to banks with unusual business models, such as public banks involved in credit guarantee business) were excluded from the analysis to avoid distortion. As a means to test robustness, estimates were recalculated using a sample including only those banks with at least five consecutive observations during the stress years (from –2 to 2 years); results were qualitatively similar.

Credit loss rates are found to peak sharply during one single year, and the credit loss rate pattern is symmetrical with respect to the crisis (Figure 5 and Appendix Table 4).24 This finding is consistent with other results (Figures 1 and 13 below), but comes out more clearly based on the large sample of banks, and suggests that credit loss rates from the profit and loss accounts are indeed a useful proxy for banks’ default rates and LGDs.

Figure 5.
Figure 5.

Typical Evolution of Credit Loss Rates under Stress

(median loss rate by stress severity, percent of credit outstanding)

Citation: IMF Working Papers 2013, 232; 10.5089/9781475518115.001.A001

Source: Authors, based on Bankscope data.

Loss given default (LGD)

There is empirical evidence that LGDs too fluctuate with the business cycle (Figure 6; the circled stress years are the same in Figures 2). Descriptive evidence for workout LGDs (i.e., LGDs for bank loans) from Moody’s (2013) is used to determine changes of LGDs under stress for ACs. The time series by Moody’s for loans, which relate to industry averages rather than bank-by-bank results, spans the period from 1990 to 2012.25,26

Figure 6.
Figure 6.

Evolution of LGDs through the Cycle

(Percent of the face value of affected credits)

Citation: IMF Working Papers 2013, 232; 10.5089/9781475518115.001.A001

Source: LGDs: Araten et al. (2004); Moody’s (2013); Real GDP Growth: Federal Reserve Bank (2013).

Figure 7 shows that LGD and default rates are positively correlated (the pairwise correlation is 0.47), but LGD rates tend to fluctuate proportionately less than do default rates and are less cyclical.27 The relationship allows one to project LGD under stress scenarios of various degrees of severity, where the severity of the scenario is captured by the respective default rate (Table 3). LGDs typically double in case of a severe/extreme stress, whereas default rates increase many times over, albeit starting from very low levels in normal times. However, because LGDs variations have a linear impact on RWAs, their impact on bank solvency is still considerable. Moreover, in absolute terms LGDs greatly amplify stress under severe conditions.

Figure 7.
Figure 7.

Link Between LGDs and Default Rates

(Percent)

Citation: IMF Working Papers 2013, 232; 10.5089/9781475518115.001.A001

Source: Authors, based on Moody’s (2013) data.
Table 3.

Stress Levels of Default Rates and LGDs for ACs

(Percent)

article image
Source: Authors based on Moody’s (2013) data.

Based on Moody’s default rates shown in Figure 2.

LGDs for EMs and LICs are not covered due to lack of data. The historical average LGD rates for EMs and LICs are about 59 percent and 62 percent, respectively. These LGDs have been derived from the World Bank (Doing Business) based on Djankov et al. (2007) and are assumed to correspond to long-term average LGDs.28 A practical approach for stress testing purposes would be to use the long-term average level in a specific country and, for a stress scenarios of given intensity, add the same absolute increase as seen in ACs.

Credit losses (should) account for fluctuations in both PDs and LGDs (Box 1). One may therefore wish to compare the default rates in Table 3 with the loss rates in Table 2 for the ACs by dividing the loss rate by the respective LGD for the stress level. The median long-term default rate observed in Moody’s data (0.7 percent) translates into a loss rate of about 0.29 percent (using the LGD for the US, as determined by Schmieder and Schmieder (2011), of 0.42). This estimated loss rate is comparable to the 0.24 percent median loss rate for the U.S. banks in the Bankscope data (for the period covered, i.e., 1996-2011)29 More generally, if one divides the credit loss rates in Table 3 by the LGDs in Table 4, the implied default rates are 1.3 percent (normal), 2.7 percent (moderate stress), 3.3 percent (medium), 5.7 percent (severe) and 8 percent (extreme). On the lower end, the implied default rates are higher than the actual default rates, unless one replaces the projected LGDs by some average LGD for advanced countries (35 percent). Overall, this result suggests that credit loss rates from the profit and loss accounts, divided by LGD rates, provide reasonable approximations to PDs, at least for ACs. The same applies to the implied default rates for EMs and LICs using an LGD of 60 percent.

Table 4.

Rules of Thumb for the GDP sensitivity of Key Bank Solvency Variables

article image
Source: Authors’ calculations based on studies and sources mentioned in the text.

Pre-impairment income

The next question is how pre-impairment income evolves when stress occurs, i.e., conditional on credit losses.30 The key components of pre-impairment operational income are (a) net interest income, which tends to be relatively stable insofar as interest rates changes are passed on to loans and deposits when they mature; (b) net fee and commission income; c) trading income; d) other operating income; and (e) operating expenses. The remainder is other non-operating income, which is normally neglected because it represents one-off items that do not affect the sustainability of a bank’s business model. In order to normalize the measure of income despite large cross-country differences in banks’ leverage, we focus on the ROC rather than return on assets.

Using bank-by-bank data in the Bankscope database, median net pre-impairment ROC is found to be highest in LICs (25.0 percent), followed by EMs (18.9 percent) and then ACs (11.9 percent), in line with expectations and accounting for the rank-order in terms of the risk of doing business as captured by the volatility of loan losses.

Pre-impairment income conditional on credit loss rates is found to be affected in stress scenarios, but on average to remain positive and thus a buffer against credit losses (Figure 8 and Appendix Table 4). The behavior of income with respect to the crisis trough tends to be less symmetric than that of credit loss rates; income often remains low for some years after the trough (for AC banks) or drops significantly at the time of the trough (for EM banks).31

Figure 8.
Figure 8.

Typical Evolution of Pre-impairment Income Under Stress

(Median pre-impairment ROC by stress severity; percent)

Citation: IMF Working Papers 2013, 232; 10.5089/9781475518115.001.A001

Source: Authors, based on Bankscope data.

However, there is substantial variation to this finding across banks. If one looks at the AC banks that are more adversely affected, i.e., the worst performing 25th and 10th percentiles, income becomes negative for at least a quarter of banks under severe stress (Figure 9); 10 percent of bank encounter ROCs below –20 percent at the time of the trough under severe stress. A similar pattern can be detected for EM banks, but the limited quantity of available data precludes firm calibration.

Figure 9.
Figure 9.

Evolution of Pre-impairment Income for Worst Performing Banks under Stress

(Pre-impairment ROC, percent)

Citation: IMF Working Papers 2013, 232; 10.5089/9781475518115.001.A001

Source: Authors, based on Bankscope data.

Certain earning sources seem especially vulnerable to stress conditions, as illustrated in Figure 10 (with detailed data in Table 5A), which shows the standard deviations across crisis periods of the median ratio to capital of various income and expense account components. For AC banks, net commission and fee income is consistently relatively volatile. The median AC bank’s operating expenses become very volatile in a severe crisis, perhaps because a severe crisis will force a bank to bear restructuring costs, and because capital is reduced. For the median EC bank, the change of net income under stress is largely driven by net interest income due to foregone interest on credit losses and, possibly, interest rate behavior during crises.32 EC banks manage to reduce operating expenses at times of stress, unlike AC banks.

Table 5.

Historical Evidence on Typical Evolution of Default Rates Around a Crisis

article image
Source: Authors’ calculations based on Moody’s and Federal Reserve Bank data.

The trough of the crisis is set by the year with the lowest GDP growth.

Figure 10.
Figure 10.

Standard Deviation Across Crisis Periods of Median Income and Expense Components

(Percent of capital)

Citation: IMF Working Papers 2013, 232; 10.5089/9781475518115.001.A001

Source: Authors, based on Bankscope data.

A common perception that trading income is only mildly correlated with the cycle is generally true except in severe crises, when trading income can become highly negative (Figure 11).33 The likelihood of sizable trading losses is discussed further in Box 2, highlighting that “Black Swan” events need to be captured by stress tests for banks with meaningful trading operations.34

Dividend-payout is found to drop to zero under severe stress, both for AC banks and EC/LIC banks (Appendix Table 4). Tax payments are also reduced, reflecting lower net income.

Figure 11.
Figure 11.

Trading Income under Stress, by Quantile

(Percent of capital)

Citation: IMF Working Papers 2013, 232; 10.5089/9781475518115.001.A001

Source: Authors, based on Bankscope data.

How Likely is it that Large Trading Losses Coincide with Large Credit Losses?

In many stress testing exercises in the past, no explicit link could be established between credit losses and trading income. And there is a good reason for that, as shown in Figure 13 (upper left hand panel): for a median bank with some trading operations, trading income is, on average, slightly positive under most stress conditions. However, this is not necessarily the case if one moves further into the “tail,” and in particular if one focuses on the experience of banks with sizable trading books under severe conditions. Looking at the severe scenarios as defined by credit risk losses, a bank at the worst performing decile (in terms of the ratio of trading income losses to capital) could lose about 5 percent of capital. The worst performing 1 percent of banks might lose a third of capital—which would likely be more than a bank could suffer and still survive combined with other losses and reduction of income banks would likely face under such conditions. The severe scenario may occur with low probability, but the impact could be large, especially because trading activity is concentrated in larger banks: the worst performing 1 percent of banks could hold a substantial share of aggregate assets. The non-linearity and concentration of trading losses was apparent during the recent global crisis, when a few large European banks lost within one year more than one third of their capital due to trading losses alone, while many smaller banks had insignificant trading losses.

Trading income is likely to follow close to a random walk pattern, with little serial correlation, so a series of very bad years are unlikely to occur, but the simulations outlined above suggest that one bad year of trading could enough to bring down a large bank.

Credit growth

Banking crises affect not only the quality of a given stock of loans, but also asset and loan growth rates; nominal credit growth of banks’ customer loans net of credit losses tends to slow sharply, even if it does not become negative (Figure 12 and Appendix Table 4).35 Conditional on stress (as measured by credit losses), deleveraging occurs only in case of severe crises in ACs, while medium intensity crises tend to end in three years of (close to) zero year-on-year credit growth for ACs. Credit growth remains fairly sizable for EM banks under stress, except for severe stress, when credit growth becomes zero. Again, the pattern for the LIC banks is less clear-cut, owing to limited data. Asset growth behaves very similarly to credit growth. Thus, both the risk-weighted capital ratios and the unweighted leverage ratio may be affected by this effect in the denominator.

Figure 12.
Figure 12.

Typical Evolution of Credit Growth for ACs and EMs under Stress

(Percentage change, net of credit losses)

Citation: IMF Working Papers 2013, 232; 10.5089/9781475518115.001.A001

Source: Authors, based on Bankscope data.

IV. Rules of Thumb for Satellite Models

An alternative to basing stress tests on generic crisis scenarios (referred to as descriptive rules of thumb) is to project the impact of some major shocks to macroeconomic variables on banks’ solvency variables. This more complex but more interpretable approach relies on so-called satellite models that describe macro-financial linkages. Various types of satellite models have been used in the literature to establish such models, including time series analysis, linear and non-linear regression models (such OLS regression, logistic regression, panel analysis) and structural models (see Foglia, 2008, and Drehmann, 2009, for example).

For the purpose of this paper, the attention focuses on fairly simple relationships, albeit with allowance for nonlinear responses that depend on the severity and duration of strain as measured by a shock to real GDP growth. More complex relationships might be estimated where a stress tester has available a rich dataset, covering periods (but without structural breaks) with a variety of shocks for a variety of banks. Yet even in such circumstances, transparent, understandable rules of thumb may be useful in checking the robustness of results.

It should be noted that satellite models are best suited to capturing the effects of exogenous macroeconomic shocks, rather than shocks that originate within the financial system, for example due to asset bubbles or large-scale malfeasance (Alfaro and Drehmann, 2009).36 In the latter situation, the macroeconomic deterioration tends to be a consequence of, and to follow the financial sector disturbance. Hence, the time series properties of the satellite models should be sensitive to the origins of the disturbance.

A. Explanatory Variables and Estimation Approach

The most important single determinant of bank solvency is the overall conjunctural conditions prevailing in the economy, which will affect all solvency factors (credit losses, income, credit growth, and so forth). Strong economic activity should generally allow firms to generate the revenue to repay loans, households to earn steady income to meet debt service obligations, and collateral to retain its value. Weak economic activity will have corresponding negative effects on a bank’s clients and thus on the bank itself.

Real GDP is normally the most relevant and most readily available measure of aggregate activity. Policy-makers and others make frequent real GDP forecasts, and its relationship to other macroeconomic variables is well-studied. Hence, its behavior can be readily interpreted, and it can usually be forecast in both baseline and stress scenarios.37 The design of scenarios is one of the key challenges for stress tests, and running a number of potential scenarios allows studying sensitivities (Taleb et al, 2012)—a key advantage of using rules of thumb is that they facilitate working through numerous scenarios. In what follows, we will establish rules of thumb relating variables related to bank solvency to GDP growth.

Economic intuition and evidence presented above suggests that, if economic conditions are broadly stable and as anticipated, then only a (very) small proportion of loans will go bad (Figure 2, Table 2). However, non-performance may increase rapidly if conditions are unexpectedly adverse, which often happens only after a long period of benign times, and makes such shocks all the more challenging; it is these negative “surprises” that cause borrowers to be unable to repay and collateral to be reduced in value. Yet, as displayed above, what counts as an exceptionally large shock in, say, the United States or a Western European country, may be well within recent historical experience for many EMs. Moreover, economic intuition and some empirical evidence suggest that prolonged periods of low or negative growth will have a more pronounced effect on bank profitability than a brief recession followed by recession

With this in mind, we computed the average (i) changes in real GDP growth at time t=0 (the year with the lowest real GDP growth) relative to year t–4; and (ii) the cumulative deviation of real GDP growth rates from trend from t–4 to t = 0, using World Economic Outlook (WEO) data for both ACs and ECs.38 The cumulative deviation from trend is likely to be more telling (given that it contains information on the duration of the shock), but depends on an estimate of trend growth, which in some cases (e.g., following an unsustainable burst of growth) may be difficult to obtain. A practical approach to determine trend growth is to use the average GDP growth observed in the past over at least one complete cycle (say, the last 10 years) or baseline forecasts (e.g., data from the WEO). For this study, the average real GDP growth rate for 1980 to 2011 has been used as a benchmark.

These GDP growth shocks (for the respective country type—AC or EC) are compared to the behavior of the main variables related to bank solvency. The comparison is carried out by various means: simple ratios and correlations are calculated, as are regressions. The main estimates are based on bank-level and country-level evidence. Given that the sample of bank-level data is dominated by observations from a few countries (notably the United States), which results in limited variation for the GDP trajectories, country-level data was used to come up with the rules.39 However, because then the evidence is limited to some 20 observations per country type and stress level, some parameters had to be smoothed, with a view to ensure consistency across estimates: the level of the macroeconomic shock (in terms of GDP growth) multiplied by the sensitivity of the solvency parameters (computed based on bank- and country-level data) added to the pre-shock level should result in broadly the same stress levels as observed under the descriptive rules discussed above.40 While the unadjusted results achieved such consistency in qualitative terms, the median rules were adjusted slightly to align with the respective descriptive rules of thumb.

It must be recognized that a given macroeconomic shock will have diverse effects across banks (and countries): some will survive quite well, and others may be devastated. Some banks may suffer large losses in calm periods. From a stability perspective, policymakers are likely to be as concerned about the “tail” of weak banks as they are about the mean or median bank; a systemic and macroeconomic problem can be created by severe losses in just one quarter or even one tenth of the banking system. Hence, the emphasis here is not on the median bank, but on the distribution of results across banks and in particular the weaker banks. It turned out that the tail sensitivities, taken from the computed country-level GDP sensitivities, deliver solvency parameters that align well with the descriptive data for the corresponding confidence level, and represent worst case crisis elasticities.41

Possible time lags need to be considered in making these comparisons. For both credit loss rates and credit growth, their trajectories are found to be fairly symmetric with respect to the crisis (see above), and the highest credit losses and lowest credit growth levels tend to concur with the year of the lowest real GDP growth. Hence, the results reported refer to coincident effects. Because pre-impairment income is less symmetric with respect to financial stress (conditional on credit losses; Figures 9 and 11), changes of real GDP growth rates are compared with the subsequent changes of pre-impairment income, with a view to err on the conservative side. For ACs, the change in income is computed as the minimum income level observed between t= 0 and t= 3, minus the initial income level at t=–4. For ECs, income tends to remain high until stress materializes; hence, the average income in t= –3 to t= 0 is compared with the minimum income level after t= 0 (in Appendix Table 4).

While the data available for LICs were sufficient to come up with some meaningful descriptions of the typical behavior of aggregates in financial crises, comparatively little data were available to investigate macro-financial linkages. Hence, macro-financial satellite rules of thumb are not reported for LICs.

B. Rules of Thumb for Satellite Models

Table 4 summarizes the estimated real GDP growth sensitivities of credit losses, pre-impairment income and credit growth, distinguishing, as before, among moderate, medium and severe stress (as measured by the real GDP growth paths), and between ACs and EMs.

The table provides two different sets of rules, based either on cumulative changes in growth rates or on the change in the annual growth rates from pre-crisis to crisis trough.42 Each approach might be useful for stress tests in certain circumstances. For example, one first establishes a scenario in terms of cumulative deviation of real GDP growth rates from trend during the years up to the trough of the stress period (such as 5.9 percentage points for the moderate stress in an AC), which deviation is multiplied by the corresponding sensitivity parameter—in this case leading to an increase in median bank credit losses of −5.9x−0.1 = 0.59 percentage points in the worst year. Projections based on changes in annual growth rates work similarly: for an AC, for example, one could simulate the impact of a drop of real GDP growth rate from before the recession (perhaps 2.4 percent, the average real GDP growth rate in ACs during 1980-2011), to 0.0 percent (the moderate scenario for ACs), to -1.9 percent (medium stress), or to -5.0 percent (severe stress) at the trough. The median AC bank’s credit losses would increase by −2.4x−0.2 = 0.48 percentage points, −4.3 x−0.2 = 0.86 percentage points, and −7.4x−0.4 = 2.96 percentage points, respectively.

Projections along the path from pre-crisis to peak crisis can be calculated. For the rule based on changes of annual growth rates from pre-crisis levels and assuming moderate stress, for example, one would establish the level of solvency parameters as follows: for t = -3, the assumption of an initial drop of GDP by 0.3 percent, say, would yield loss rates of 0.36 (0.3 percent plus -0.3x-0.2). For the next year (t = -2), the same sensitivity (0.2) would be multiplied with the change in the growth rate and added to previous year’s level, and so on.

For stress testing when the system is already under strain, one would first have to decide, based on expert judgment, which stress level might occur (using the benchmark figures in Table 2), and which year (t-3 to t+3) reflects the current situation. For example, if one assumes that a severe scenario will occur, and that one is already at t = -2, one would use the sensitivities of the severe scenario to simulate the trajectories going forward. As in the previous case, one would obtain an estimate by adding the change of growth rates multiplied by the sensitivity of the respective stress level to the actual level of the solvency variable.

The table reveals that in many cases the sensitivity increases with stress. For example, the sensitivity of credit loss rates to GDP declines is roughly the same in moderate and medium stress situations, but at least twice as high under severe stress.43 Hence, the rules are tied to the categorized GDP trajectories. Thus, it is misleading to take a parameter from a moderate GDP shock and use it to project the effects of a severe shock, for example. If one seeks to simulate a drop in GDP growth by 3 percentage points, say, one would likely use the parameters for the moderate scenario (as 3 percent is close to 2.4 percent), but could also pick the parameters for the medium shock to compute an upper bound. However, not all relationships exhibit pronounced non-linearity, as witnessed by the linear relationship between the size of the GDP shock and the change in credit growth rates.

The sensitivity of GDP shocks is much higher for a “tail” of weak banks than for the median bank. Across variables of interest, the coefficient for the worst performing 10th percentile of banks is normally at least double, and sometimes many times that of the median bank. The difference for ROC is especially large.

Credit quality and credit loss rates

For credit losses, a consistent observation is that large output falls can cause at least a one-for-one increase in default rates, which relationship is evidenced both by the findings reported earlier (Figures 1 and 2 and Table 1) and by further analysis of data from Moody’s (2013) (Figure 13 and table 5). In the period covered by the Moody’s data, all five peaks of default rates during the last 90 years were accompanied by a sharp economic downturn, suggesting that financial cycles and business cycles are usually closely linked in terms of their timing.44 In simple terms, based on Table 5, the absolute sensitivity of default rates to GDP growth is about 0.5 (increase of default rates by 3.5 percentage points divided by a decrease in GDP growth levels of 6.4 percentage points); without the observations during the 1930s (where the financial cycle was lagging behind the business cycle), the absolute sensitivity is about 0.6 (= 3.1/4.8). The evidence collected for the rules of thumb established in Table 4 suggests that there is typically (in 60 percent of the cases) no lag between the GDP trajectory and the credit loss trajectory; in about 20 of the cases the GDP trajectory leads by one year, and in the remaining 20 percent of the cases the GDP trajectory lags by one year. As shown in Figures 2 and 13, and in Tables 1 and 5, the bulk of credit losses typically occur at the time of the trough of the associated macroeconomic crisis; the time of the lowest GDP growth coincides with the year when loss rates peak sharply—an important finding when it comes to specifying rules of thumb satellite models, and in line with most publicly available satellite models. Default rates come back to pre-crisis levels (i.e., the long-term average of 1.1 percent) after three years. However, in the case of the 1930s Great Depression, losses remained elevated for several years after the trough in GDP growth, and did not come down to the average until after 6 years.

Figure 13.
Figure 13.

Historical Evidence on Typical Evolution of Default Rates Around a Crisis

(Percent)

Citation: IMF Working Papers 2013, 232; 10.5089/9781475518115.001.A001

Source: Authors, based on Moody’s (2013) data.

This is in line with GDP elasticities found based on the sample data from Bankscope (Table 4), where one can clearly observe the non-linear (convex) pattern of credit loss rates, with the sensitivities increasing sharply as one moves from moderate to severe stress. The table also reveals that the GDP sensitivity of loss rates in EMs is considerably higher than that found in ACs, and becomes close to one-for-one or more under severe conditions.45

For default rates, Moody’s (2013) data are used to compute GDP sensitivities for ACs (Table 6), which turn out to be qualitatively similar to loss rates computed based on bank-by-bank data discussed above. Again, the parameters in the table are meant to be used together with the GDP trajectories for the ACs from Table 4. For projections based on the cumulative deviation of GDP growth from trend, the parameters should be divided by two.

Table 6.

Rules of Thumb for the GDP Growth Sensitivity of Credit Risk Parameters

article image
Source: Authors, based on studies and sources mentioned in the text.

LGD

The correlation between LGDs for loans from Moody’s (2013) with GDP real growth rates is -0.44, indicating that LGDs are lower during periods of higher output (see also Figure 6). The evidence on hand, however, suggests that the sensitivity of LGD rates to a real GDP slowdown is usually moderate in ACs, given that LGD even in normal times is typically in the 30 to 60 percent range; the sensitivity (in relative terms) is lower than that of PDs (Table 6). For ECs, there is not enough evidence to come up with separate parameters, but the parameters for ACs could be used as a starting point.

However, these rules of thumb for LGD (which are especially low for moderate and medium stress scenarios), are unlikely to apply when the source of the shock is related to the bursting of an asset price bubble: the end of an asset price bubble and in particular a debt-financed real estate bubble entails that much of the collateral backing the lending has suffered a drastic reduction in value, and the market for it has become much less liquid. Hence, LGD rates should be expected to be exceptionally high in such situations. A practical approach could be to use the coefficient from the severe scenario even when the macroeconomic shock is moderate or medium.

Income

As it was done for credit losses, Table 4 compares the changes in GDP to changes in pre-impairment income. The median GDP sensitivities of income are relative modest, but the relationship can become highly non-linear in unfavorable circumstances, in line with Figure 10, and especially for a substantial portion of poorly-performing banks.

Retained earnings

Profits translate into capital directly through the retention of earnings. A bank that makes losses would normally be expected to “retain” all these losses in the form of a reduction in its capital ratio. A bank that makes profits but has low capitalization would be expected to retain all or almost all profits.46 But a bank that is both profitable and well-capitalized would normally pay out a noteworthy portion of its net income (or extend its balance sheet, when perhaps macro-prudential policy is warranted; see IMF, 2011b), without substantially running-down its capital to do so. Such behavior can easily be incorporated into a rule of thumb.47

The expected levels of profit retention by banks under stress are shown in Appendix Table 4. AC banks are found to pay-out about 35 percent in “normal” times and moderate stress times, 20 percent under medium stress, and zero under severe stress. For the EM and AC banks, the levels are similar, at around 30 (EM) and 45 (LIC) in normal times and moderate stress, 17 (EM) and 40 (LIC) under medium stress, and 0 under severe stress.

Credit growth

For credit growth, the rule foresees similar GDP sensitivities for AC banks across shock levels. The sensitivity in ECs tends to decrease with stress intensity, which result may be influenced by the comparatively high credit growth rates, especially in EC countries during the last 15 year. Going forward, the trend might change as EM financial systems mature, which is why these rules should be used carefully.

In applying these rules of thumb, it will again be important to make allowance for the origins of the simulated financial crisis: a crisis provoked by an external macroeconomic shock is likely to be associated with relatively stable credit aggregates or only a more moderate slowdown. A crisis provoked by the bursting of a credit-financed asset price bubble or a lending boom is likely to be followed by a more contractionary path for credit aggregates.

Asset correlations

Asset correlations are one of the solvency parameters that have been studied less intensely, despite their important impact on capital ratios. Besides the scarcity of long-term data, one reason for the relative neglect may be that IRB asset correlations are determined conditional on the level of PDs, based on a cross-sectional rule that foresees that lower PDs are associated with higher asset correlations.48 Using either IRB asset correlations (or fixed asset correlations) may be appropriate for supervisory purposes because the approach avoids possible double-counting of potential stress when losses have already materialized (and affect the numerator of capital ratios), while the denominator (RWAs) (still) foresees the potential for elevated potential losses. However, if one takes a multi-period, stress testing view, this relationship could be misleading, as illustrated below.

To determine the fluctuations of asset correlations under macroeconomic stress, we refer to a study by Duellmann and others (2008) based on a sample of data on large, predominantly Western European corporate.49 During a period of six years (1997–2003), asset correlations fluctuated strongly, ranging from 4 percent to 16 percent, with a mean at 10.5 percent (Appendix Figure 2).50 We find a high GDP growth sensitivity of -2.7

(Appendix Figure 3), which implies that a fall of GDP growth by 1 percentage points leads to an increase of asset correlations by 2.7 percentage points.

These parameter estimates are meant to offer a simple rule, but further evidence, including for the latest years of the financial crisis, would be helpful. In particular, more evidence on the behavior of asset correlations in more extreme stress situations is needed in order to identify any non-linearities (see also Box 3).

How do IRB Correlations Compare with Empirical Correlations?

As shown in Figure 14a, IRB asset correlations have been calibrated based on conservative rules for “normal” macro-financial conditions. However, due to the nature of the calibration (i.e., based on cross-sectional considerations; see main text), correlations decrease with an increase in default risk (i.e., PDs), which will lead to underestimation of RWAs under stress. While there are valid reasons to reduce the pro-cyclicality of regulatory asset correlations, stress testing warrants point-in-time measurement of risks. For medium and severe stress situations, higher PIT correlation level would reflect elevated economy-wide risks, which is often the main concern of stress testers.

Figure 14a.
Figure 14a.

Comparison Between IRB Asset Correlation and Empirical Asset Correlations for Corporate Debt

(Percent)

Citation: IMF Working Papers 2013, 232; 10.5089/9781475518115.001.A001

Source: Authors, based on IRB formula and Duellmann et al. (2008).Note: The analysis herein (i.e. for both graphs) uses the point-in-time PDs and LGDs displayed in Table 4 and an effective maturity of 2.5 years.

Using the benchmarks for the drop in real GDP growth over a three year horizon (i.e., from t=–4 to t=0) in the run-up of crises (Table 4) and the empirical relationship between asset correlations and GDP growth (Appendix Figure 3), levels of asset correlation for moderate, medium and severe stress can be estimated. The estimated asset correlations increase substantially with stress, to levels of about 30 percent from 10 percent under “normal” (TTC) conditions. However, these estimates are based on extrapolations beyond 16 percent, which warrants a strong caveat. The impact of using difference asset correlations on RWAs is shown in Figure 14b: the resulting risk weights are similar for moderate stress levels, but become more differentiated under severe conditions, and can reach levels many times greater than normal.

Figure 14b.
Figure 14b.

Resulting Risk Weights

Citation: IMF Working Papers 2013, 232; 10.5089/9781475518115.001.A001

Source: Authors.

The use of point-in-time asset correlations for stress testing purposes should not result in double-counting. However, once losses have materialized the potential for additional losses decreases. Refined calibrations for asset correlations should account for this fact. As an alternative, fixed IRB correlations based on low PDs could be used, as suggested by Schmieder, Puhr and Hasan (2011).

V. Worked Examples

To illustrate the use of the descriptive rules of thumb, we simulate the impact of stress on representative stylized banks based in ACs and EMs, representing weighted average bank characteristics for the sample of banks in Bankscope.51,52

A. Bank Characteristics

The illustrative AC bank is assumed to have total on-balance sheet assets of US$100 billion (Table 7); customer loans (excluding loans to banks) of US$47 billion; US$46 billion of securities subject to credit risk; US$21 billion of off-balance sheet assets subject to credit exposure;53 US$7 billion in other assets such as fixed assets that are not subject to credit risk; and capital of US$6.0 billion. For the StA, it is assumed that variable external ratings apply to 10 percent of assets that are subject to credit risk (as for non-sovereign bonds, for example), and that their risk weights react accordingly in the stress scenarios. The IRB RWAs are computed under the assumption that RWAs for credit risk amounts to 80 percent of total RWAs, and that the risk-weights for non-loan assets subject to credit risk are one third of those applied to loans (due to a typically lower default risk, e.g., for sovereign exposure, and shorter maturities).54 The computed ratio of IRB RWA divided by total assets (the so-called RWA density) is in line with the observed average RWA density for large international IRB banks.

Table 7.

Features of Banks Used in the Worked Examples

(US$ billions)

article image
Source: Authors, based on Bankscope data.

AC: weighted average for banks with total assets less than US$100 billion. EC: weighted average of all banks.

Capital divided by respective RWAs.

Estimated based on the sensitivities presented in this study, assuming that 20 percent of RWAs are accounted for by other risk types (such as market risk and operational risk).

The corresponding bank based in an EM is assumed to have the same total assets, of which 54 percent constitutes customer loans, with capital amounting to US$8.7 billion. The elevated credit risk (i.e., higher PDs and LGDs) of EM banks compared to that of

AC banks result in relatively high capital weights under IRB, and therefore a lower capital ratio.55, 56

Simulations

The evolution of capital ratios for the AC and EC illustrative banks during multi-year periods of stress is shown in Figure 15, using the descriptive solvency stress parameters from Appendix Table 4, except for credit growth, for which a baseline level of 5 percent is used for ACs and 7 percent for ECs.57 For these levels of credit growth, the capital ratios remain roughly unchanged under normal conditions. Year t–4 shows the initial capital level for banks under both the IRB and StA.

Figure 15.
Figure 15.

Evolution of Capital Ratios during Stress Periods

(Percent)

Citation: IMF Working Papers 2013, 232; 10.5089/9781475518115.001.A001

Source: Authors, based on Bankscope data.

For AC banks under the IRB, severe stress will reduce capital levels to 3 percent, i.e., below the regulatory minimum of 8 percent, and medium level stress also reduces capital levels substantially, to about 7 percent from an initial level of 14.7 percent. Moderate stress can be digested by banks.

Measured bank capitalization under the StA is generally affected less by stress than that under the IRB approach, largely due to the more substantial changes in RWAs under the IRB, especially under severe stress (Table 8, which shows RWA as a proportion of total assets so as to normalize for the effects of balance sheet growth). Variations in RWAs based on PIT credit risk parameters (PD, LGD) are very sizable in the absence of mitigation through behavioral adjustments (e.g., re-balancing of assets toward highly-rated securities).

Table 8.

Simulated Evolution of RWAs Relative to Total Assets During Stress Periods

(Percent)

article image
Source: Authors.

The leverage ratio (regulatory capital to on- and off-balance sheet assets) will be hit if income becomes negative (see below), and will rise in the case of a credit contraction. According to the worked examples, severe stress in ACs leads to a drop of the leverage ratio by 1.9 percentage points (in the worst year), i.e., from 5 percent to 3.1 percent (Figure 16). For the EC banks, medium and severe stress for the ECs result in a drop of the leverage ratio by 1.4 and 7.9 percentage points, respectively, from an initial level of 7 percent. Hence, “average” banks (as simulated in this case study) are, with the exception of EC banks under severe stress, in a solid position to maintain an adequate leverage ratio.58

Figure 16.
Figure 16.

Evolution of Leverage Ratios during Stress Periods

(Percent)

Citation: IMF Working Papers 2013, 232; 10.5089/9781475518115.001.A001

Source: Authors, based on Bankscope data.

Both credit losses and declines in pre-impairment income contribute to weakening banks. The typical bank’s net income after loan losses (and therefore retained income) becomes negative under severe stress for the ACs (–25 percent ROC), and under medium (–0.3 percent ROC) and severe stress (–95 percent ROC) for the ECs (Table 9). For AC countries, net interest income contributes 59 percent of the decline in earnings, followed by net fees and commissions at 21 percent, and other income at 20 percent (trading and fair value contributes half of this). For EC banks, the corresponding contributions are 74 percent, 18 percent, and 9 percent. In order to simulate the income of a specific bank under stress, the sensitivities given in Table 5 could be used together with the relative contribution of the earning sources for that specific bank to adjust the trajectories of net overall income.

Table 9.

Simulated Evolution of Net Income during Stress Periods

(Percent ROC)

article image
Source: Authors.

Undefined due to negative capital.

When compared to evidence from the Bankscope dataset, the predicted evolution of capitalization of an AC bank under severe stress based on the StA approach roughly matches the evolution of the capital ratio of the 25-th percentile of those banks that experienced severe stress conditions during the last 15 years (Figure 17, which shows the evolution of actual, historical capitalization for various quantiles, and the levels predicted by the descriptive rules of thumb).59 The capitalization prediction based on the IRB approach is close to the 10-th percentile of actual results. The comparability of outcomes is reassuring, suggesting that the calibration is appropriate.60

Figure 17.
Figure 17.

Evolution of Capital Ratios: Actual vs. Predicted for an AC Bank

(Percent)

Citation: IMF Working Papers 2013, 232; 10.5089/9781475518115.001.A001

Source: Authors, based on Bankscope data (2,050 observations).

As another means of comparison, the rules for the satellite models were applied in the context of recent stress tests run for EU and U.S. banks. Again the results were reassuring (Box 4).

Rules of Thumb Applied to Recent Stress Tests

The satellite model rules of thumb are applied to the scenarios used in recent stress tests run by EBA (2011) and the U.S. authorities (Federal Reserve Board, 2012). The macroeconomic scenario for the EU simulated a cumulative deviation from baseline growth by about 4 percentage points for a horizon of two years (2011–2012). The scenario for the United States simulated a cumulative deviation during three years (2012–14) by 5 percentage points from the baseline, including a dip by 6 percentage points within one year (as observed in 2009), with a subsequent recovery. The European stress tests included 90 large banks, and the United States tested the largest 18 banking holding companies. For the computations below, the worked example with the average AC bank will be assumed to be representative for the average of both European and U.S. banks.

The outcome of the EBA stress test—an average drop of capital ratios by around 2 percentage points (without considering capital increase) —can be approximated both by the “cumulative deviation from GDP trend” rule and by the “change in GDP growth” rule (Table 4). The latter rule is probably more useful for the scenario at hand, which did not simulate a sharp drop for one year, but rather a sustained lower level of real GDP growth of around zero (compared to 2 percent under the baseline). Using the average AC bank from Table 7, the IRB rule of thumb (applicable because most large European banks use the IRB) linking GDP growth to changes in capital ratios suggests a drop of capital ratios by 3 to 4 percentage points based on the GDP change rule (2 percentage points drop in real GDP growth times sensitivity of 1.5-2 percentage points per unit of GDP), while the cumulative deviation rule would suggest a drop of capital ratios by about 3.2 percentage points (4 percentage points drop times sensitivity of about 0.8 percentage points per unit of GDP, see text below). Both scenarios would fall into the “moderate” category (Table 4). This outcome is reasonable, while reflecting that the rules of thumb are intentionally conservative.

For the test run for the U.S. banks, the average drop of capital ratios during the dip year (in which GDP growth was simulated to drop from +2 percent to −4 percent) was estimated at around 4.3 percentage points. The StA rule of thumb (applicable because U.S. banks currently apply Basel I) for the change rule (corresponding to a medium/severe stress level) would suggest a drop by about 3.9 percentage points (6 percentage points drop times sensitivity of about 0.65 percentage points per unit of GDP), which is again relatively close to the actual estimate.

Rules of thumb, the regulatory regime, and pro-cyclicality

The examples illustrate that a capital ratio projected under the StA tends to react less quickly (in either direction) than one projected using the IRB approach, and therefore need to be interpreted differently. Even a relatively modest change in StA-measured capitalization may be economically significant. By the same token, IRB capital ratios require ample buffers to withstand a given level of stress (especially if based on point-in-time PDs and LGDs)—as high IRB capital ratios in good times can give a false sense of security, while they are conservative in bad times. Hence, in order to interpret a solvency measure and in particular the results of a stress test simulation, one needs to understand at which stress level a bank currently finds itself, and the regulatory regime.

The stylized example allows one to compute the impact of a drop of GDP growth by 1 percentage points on capital ratios under different stress levels, using the historical average changes of GDP growth levels corresponding to these stress levels (as displayed in Table 4 and Figures 15 and 17):

  • For the AC bank, a 1 percentage point drop of GDP growth would lead to a reduction in capitalization by 0.4 to 0.5 percentage points for a StA bank, and about 1.6 to 2.3 percentage points for an IRB bank, depending on the severity of the shock. For the cumulative deviation from the GDP growth trend, the coefficients should to be divided by two. While this is the general trend based on median rules, the effect could be non-linear and not strictly monotonic because several factors affect the capital ratio: a stress that results in a modest rise in credit loss but a sharp deceleration in credit growth can leave the capital ratio more or less unchanged, for example.

  • For the EM bank, the corresponding loss of capitalization would be about 0.5 to 1.2 under the StA per unit drop of real GDP growth, and 0.6 to 0.7 percentage points under the IRB approach. (For the cumulative deviation from the GDP growth trend, the coefficients should be divided by three.) The coefficient under the IRB approach for EC banks is relatively low because EC banks exhibit higher initial risk weights than AC banks, and therefore the sensitivity of RWA to cyclical fluctuations is lower.

A common concern is that the regulatory regime may impose capital requirements that are excessively pro-cyclical; capital requirements would increase sharply in difficult times if they impose use of PIT parameters. Such pro-cyclicality can be costly and even destabilizing if it provokes a credit crunch that further weakens economic performance. One could replace PIT-based RWAs by RWAs based on a moving average of, say, for a five year period, which can be considered a TTC estimate for regulatory purposes. The counter-argument is that, for purposes of stability analysis, PIT estimates are what matter, as the TTC estimates are not relevant to assessing whether a bank will survive through the strains to which it is subject at the worst phases of the cycle.61

The rules of thumb allow one to compare the effects of the two approaches (Figure 18). A TTC approach results in a slower decline in calculated capital ratios than does a PIT approach over the two years leading to a crisis. Under TTC, a substantial drop comes at the time of the crisis or thereafter. Hence, differences in capital ratios end close to the starting level (as one would expect, assuming that neither approaches is biased across the cycles as a whole). The PIT approach gives an earlier warning of strain, and the TTC approach perhaps gives more “breathing space” during which capital can be bolstered, and maturing assets could be replaced by assets with lower risk weights.

Figure 18.
Figure 18.

Capital Ratios with Point-in-Time vs. Through-The-Cycle RWAs

(Percent)

Citation: IMF Working Papers 2013, 232; 10.5089/9781475518115.001.A001

Source: Authors.

Policy-makers might well ask what level of the capital ratio is needed, such that a (typical) bank stays above the regulatory minimum (8 percent total capital ratio) under stress. To answer this question, the required capital levels are simulated by changing the capital ratios (via the numerator) to determine the initial level that would result in a ratio of about 8 percent at time t=0 when strain is greatest. An average AC bank under the IRB would need to achieve a capital ratio of around 30 percent in normal times in order to remain capitalized at or above 8 percent at time zero under severe stress and assuming a PIT capital regime; it would need about 17 percent to be prepared for medium stress. Using TTC risk weights, the corresponding capital ratios would be 20 percent for severe stress and 14 percent for the median scenario. For the average AC bank under the StA, a capital ratio of 10 percent would be needed to cope with medium stress without falling below the regulatory floor, and 12.5 percent to cope with extreme stress.

For an EC bank under the StA, a capital level of 12 percent would be sufficient to cope easily with medium level shocks, and a capital level of around 22 percent would be sufficient to cope with severe stress.

Banks would need more capital if they have relatively volatile pre-impairment earnings. For example, an AC bank from the worst decile of pre-impairment earnings would need a capital ratio of 22 percent using the IRB approach to cope with medium stress (Figure 9). An EC bank from the worst decile, using StA, would need a capital ratio of 17 percent to deal easily with medium stress, and 25 percent to cope with severe stress.

It should be noted that the policy considerations warrant several caveats:

  • the examples simulate average behavior of banks in terms of credit growth (that banks do not deleverage more as suggested by the rules of thumb), assuming that banks do not change the structure of their balance sheet (e.g., by replacing assets with higher risk weights by assets with lower risk weights), and neglecting asset sales (e.g., by sale of assets or disposals of legal entities) as well as capital raising beyond an endogenous amount of retained earnings. For multi-year periods, these are strong assumptions, but with the advantage of yielding a clear-cut benchmark what would happen without major changes.; and

  • the simulation uses median risk parameters for banks that experienced severe stress. While median credit loss rates under severe stress are close to 3 percent for AC banks, they were much higher for the below-average bank (Table 4 and Figure 16). The same applies to income, where banks with more volatile sources of income can experience income levels that are substantially more unfavorable than those of the median bank (Figure 9). The same applies to the other severity levels, but in those cases the range of parameters is lower (given that they are bound by the severe case).

VI. Conclusion

A variety of evidence is presented on the “average” pattern of behavior of financial aggregates relevant to solvency stress testing banks based in EMs and ACs, and, with some limitations, also for larger LIC banks. Table 10 provides an overview of some main results.

Table 10.

Overview of Main Rules of Thumb

(Percent)

article image
Source: Authors, and evidence in paper.

Typical levels of credit loss rates, pre-impairment income, and credit growth were estimated under moderate stress (a one-in-10/15-years shock), medium stress (worst-in-20-year), severe stress (a 1-in-40-years shock), and extreme stress (1-in-100 years). All three variables react in non-linear fashion to the severity of stress, which means that effects under severe conditions is manifold the effects under moderate conditions. Also, a substantial “tail” of poorly performing banks is likely to be much more affected than the median bank.

Comparing ACs on the one hand and EMs/LICs on the other, loss levels are found to be substantially higher in the latter, compensated for by higher returns. It was found that 1-in-20 year stress loss levels usually lead banks to report some net losses, especially in ECs, and thereby lose some capitalization (1 to 3 percentage points if they are under Basel I or the Basel II standardized approach), but only a macroeconomic crisis approaching severe intensity would normally bring down typical well-capitalized banks (unless there are other issues related to confidence and financial sector-generated sources of strain).

Further evidence is presented on macro-financial linkages, and specifically on defining rules of thumb of how a change in GDP growth triggers credit losses, income, and credit growth effects under different levels of stress. While such rough satellite models are more complex than the descriptive solvency rules, they allow the development of scenarios based on an explicit story. As such, the rules make allowance for national circumstances, such as the expected severity of shocks.

While the study has found general patterns, country-specific and/or bank-specific circumstances may differ widely from the average. Hence, the rules of thumb elaborated in this study serve as broad guidelines, particularly to understand benchmarks for worst-case scenarios, but do not fully substitute for detailed analysis when that is possible. The rules of thumb with explicit focus on macro-financial linkages cover only some of the main macroeconomic risk factors that may affect a banking system, namely those captured by GDP. It would be worthwhile to investigate whether analogous simple rules can be formulated that link specific elements of banks’ balance sheets and profitability to such other sources of vulnerability.62

The rules of thumb can be used to compute minimum levels of capitalization needed to withstand shocks of different severities—even those far from a country’s historical experience. Also, the regulatory approach used by banks matters: whether a bank adopts an IRB approach to estimating risk-weighted assets or relies on a standardized approach is shown to make a substantial difference to the magnitude and also the timing of when the effects of shocks are recognized, provided that banks’ risk models reflect changes in risk on a timely basis. Thus, the results are relevant to recent policy discussions centered on the robustness of regulatory capital ratios, especially on the computation of RWAs (e.g., BIS 2013, BCBS 2013, Haldane 2012, 2013) and the design of (countercyclical) capital buffers (e.g., Drehmann and other 2009). The results echo the call for (much) longer samples to be used in the calibration of models used for RWA computation and the “right” choice of the regulatory capital ratio (e.g., BCBS 2013, BIS 2013).

Appendix 1. Data Summary

Appendix Table 1.

Overview of Bankscope Data

article image
Source: Authors, based on Bankscope data.
Appendix Figure 1.
Appendix Figure 1.

Overview of Raw Bankscope Sample Size by Year

(Number of banks)

Citation: IMF Working Papers 2013, 232; 10.5089/9781475518115.001.A001

Source: Authors, based on Bankscope data.
Appendix Table 2.

Overview of Top 10 Countries by Category for Cleaned Bankscope Data

article image
Source: Authors, based on Bankscope data.
Appendix Table 3.

Overview of Bankscope Data, by Stress Level

article image
Source: Authors, based on Bankscope data.

Appendix 2. Supplementary Evidence

Appendix Table 4.

Evolution of Solvency Parameters around Crisis Dates

article image
article image
Source: Authors, based on Bankscope data.
Appendix Table 5.

Components of Pre-impairment Income and Expenses

(Percent of capital)

article image
article image
Source: Authors, based on Bankscope data. Note that the figures are medians, and therefore they do not necessarily add up.