- 1 AN INTRODUCTION TO CONSUMER PRICE INDEX METHODOLOGY
- 2 USES OF CONSUMER PRICE INDICES
- 3 CONCEPTS AND SCOPE
- 4 EXPENDITURE WEIGHTS AND THEIR SOURCES
- 5 SAMPLING
- 6 PRICE COLLECTION
- 7 ADJUSTING FOR QUALITY CHANGE
- 8 ITEM SUBSTITUTION, SAMPLE SPACE AND NEW PRODUCTS
- 9 CALCULATING CONSUMER PRICE INDICES IN PRACTICE
- 10 SOME SPECIAL CASES
- 11 ERRORS AND BIAS
- 12 ORGANIZATION AND MANAGEMENT
- 13 PUBLICATION, DISSEMINATION AND USER RELATIONS
- 14 THE SYSTEM OF PRICE STATISTICS
- 15 BASIC INDEX NUMBER THEORY
- 16 THE AXIOMATIC AND STOCHASTIC APPROACHES TO INDEX NUMBER THEORY
- 17 THE ECONOMIC APPROACH TO INDEX NUMBER THEORY: THE SINGLE-HOUSEHOLD CASE
- 18 THE ECONOMIC APPROACH TO INDEX NUMBER THEORY: THE MANY-HOUSEHOLD CASE
- 19 PRICE INDICES USING AN ARTIFICIAL DATA SET
- 20 ELEMENTARY INDICES
- 21 QUALITY CHANGE AND HEDONICS
- 22 THE TREATMENT OF SEASONAL PRODUCTS
- 23 DURABLES AND USER COSTS

# 18 THE ECONOMIC APPROACH TO INDEX NUMBER THEORY: THE MANY-HOUSEHOLD CASE

- International Monetary Fund
- Published Date:
- August 2004

## Introduction

**18.1** In the previous chapter on the economic approach to index numbers, it was implicitly assumed that the economy behaved as if there were a single representative consumer. In the present chapter, the economic approach is extended to an economy with many household groups or many regions. In the algebra below, an arbitrary number of households, *H* say, is considered. In principle, each household in the economy under consideration could have its own consumer price index. In practice, however, it will be necessary to group households into various classes. Within each class, it will be necessary to assume that the group of households in the class behaves as if it were a single household in order to apply the economic approach to index number theory. The partition of the economy into *H* household classes can also be given a regional interpretation: each household class could be interpreted as a group of households within a region of the country under consideration.

**18.2** The concepts of a *plutocratic index* and a *conditional index* are introduced in paragraphs 18.3 to 18.13. Using the plutocratic concept, each household in the economy is given a weight in the national index that is proportional to the household’s expenditures on commodities for the two periods under consideration. A conditional index is an index that depends on environmental variables that might affect household expenditures on commodities. One example of an environmental variable is the weather: if the weather is cold, then households will spend more on heating fuel. In paragraphs 18.14 to 18.22 it is shown how a national Fisher price index can approximate a plutocratic cost of living index. Finally, paragraphs 18.23 to 18.35 consider an alternative conceptual framework for a national index, the *democratic index*. Using this index concept, each household in the economy is given an equal weight in the national index (as opposed to the plutocratic concept where households that spend more get a higher weight in the national index).

## Plutocratic cost of living indices and observable bounds

**18.3** In this section, an economic approach to the consumer price index (CPI) is considered that is based on the *plutocratic cost of living index* that was originally defined by Prais (1959). This concept was further refined by Pollak (1980. p. 276; 1981, p. 328), who defined his *Scitovsky-Laspeyres cost of living index* as the ratio of total expenditure required to enable each household in the economy under consideration to attain its base period indifference surface at period 1 prices to the corresponding expenditure required to attain the same standard of living using period 0 prices. In the following paragraph, this concept will be explained more fully.

**18.4** Suppose that there are *H* households (or regions) in the economy and suppose further that there are *n* commodities in the economy in periods 0 and 1 that households consume *and* that we wish to include in our definition of the cost of living. Denote an *n*-dimensional vector of commodity consumption in a given period by *q* ≡ (*q*_{1},*q*_{2},…, *q _{n}*) as usual. Denote the vector of period

*t*market prices faced by household

*h*by

*t*= 0,1. Note that it is

*not*assumed that each household faces the same vector of commodity prices. In addition to the market commodities that are in the vector

*q*, it is assumed that each household is affected by an

*M*-dimensional vector of

*environmental*

^{1}

*or demographic*

^{2}

*variables or public goods*,

*e*≡(

*e*

_{1},

*e*

_{2},…,

*e*). It is supposed that there are

_{M}*H*households (or regions) in the economy during periods 0 and 1, and the preferences of household

*h*over different combinations of market commodities

*q*and environmental variables

*e*can be represented by the continuous utility function

*f*(

^{h}*q,e*) for

*h*= 1,2,…,

*H*.

^{3}For periods

*t*= 0, l and for households

*h*= 1,2,…,

*H*, it is assumed that the observed household

*h*consumption vector

*h*expenditure minimization problem:

where *h* in period *t*, *h* during period *t* and *C ^{h}* is the cost or expenditure function that is dual to the utility function

*f*.

^{h}^{4}Basically, these assumptions mean that each household has stable preferences over the same list of commodities during the two periods under consideration, the same households appear in each period and each household chooses its consumption bundle in the most cost-efficient way during each period, conditional on the environmental vector that it faces during each period. Note again that the household (or regional) prices are in general different across households (or regions).

**18.5** With the above assumptions in mind, the example of Pollak (1980;1981) and Diewert (1983a, p. 190)^{5} is followed. The class of *conditional plutocratic cost of living indices*, *P*^{*}(*p*^{0},*p*^{1},*u*,*e*_{1},*e*_{2},…,*e _{H}*), pertaining to periods 0 and 1 for the arbitrary utility vector of household utilities

*u*≡ (

*u*

_{1},

*u*

_{2},…,

*u*) and for the arbitrary vectors of household environmental variables

_{H}*e*, for

_{h}*h*= 1,2,…,

*H*is defined as follows:

The numerator on the right-hand side of equation (18.2) is the sum over households of the minimum cost, *h* to achieve the arbitrary utility level *u _{h}*, given that the household

*h*faces the arbitrary vector of household

*h*environmental variables

*e*and also faces the period 1 vector of prices

_{h}*h*to achieve the

*same*arbitrary utility level

*u*, given that the household faces the

_{h}*same*arbitrary vector of household

*h*environmental variables

*e*and also faces the period 0 vector of prices

_{h}**18.6** The general definition (18.2) is now specialized by replacing the general utility vector *u* by either the period 0 vector of household utilities *e*_{1},*e*_{2},…*e _{H}*) ≡

*e*by either the period 0 vector of household environmental variables

*Laspeyres conditional plutocratic cost of living index*,

^{6}The choice of the period 1 vector of utility levels and period 1 environmental variables leads to the

*Paasche conditional plutocratic cost of living index*,

**18.7** Using definition (18.2), the Laspeyres conditional plutocratic cost of living index,

since *h* = 1,2,…, *H*

≡ *P _{PL}*

where *P _{PL}* is defined to be the observable (in principle)

*plutocratic Laspeyres price index*,

^{7}

**18.8** If prices are equal across households (or regions), so that

then the plutocratic (or disaggregated) Laspeyres price index, *P _{PL}*, collapses down to the usual aggregate Laspeyres index,

*P*; i.e., then

_{L}*P*becomes

_{PL}where the total quantity vector in period *t* is defined as

**18.9** The inequality (18.3) says that the theoretical Laspeyres plutocratic conditional cost of living index, *P _{PL}*. The special case of inequality (18.3) when the equal prices assumption (18.4) holds

^{8}was first obtained by Pollak (1989, p. 182) for the case of one household with environmental variables and by Pollak (1980, p. 276) for the many-household case, but where the environmental variables are absent from the household utility and cost functions.

**18.10** In a similar manner, specializing definition (18.2), the Paasche conditional plutocratic cost of living index,

where *P _{PP}* is defined to be the plutocratic or disaggregated (over households) Paasche price index,

**18.11** If prices are equal across households (or regions), so that assumptions (18.4) hold, then the disaggregated Paasche price index *P _{PP}* collapses down to the usual aggregate Paasche index,

*P*; i.e., then

_{P}*P*becomes

_{PP}**18.12** Returning to the inequality (18.7), it can be seen that the theoretical Paasche conditional plutocratic cost of living index, *P _{PP}*. Diewert (1983a, p. 191) first obtained the inequality (18.7) for the case where the environmental variables are absent from the household utility and cost functions, and prices are equal across households. The general case is attributable to Diewert (2001, p. 223).

**18.13** In the following section, it will be shown how to obtain a theoretical plutocratic cost of living index that is bounded from above and below rather than the theoretical indices in inequalities (18.3) and (18.7) that just have the one-sided bounds.

## The Fisher plutocratic price index

**18.14** Using the inequalities (18.3) and (18.7) and the continuity properties of the conditional plutocratic cost of living ^{9}

There exists a reference utility vector

${u}^{*}\equiv ({u}_{1}^{*},{u}_{2}^{*},\dots ,{u}_{H}^{*})$ such that the household*h*reference utility level${u}_{h}^{*}$ lies between the household*h*period 0 and 1 utility levels,${u}_{h}^{0}$ and${u}_{h}^{1}$ respectively for*h*= 1,…,*H*, and there exist household environmental vectors${e}_{h}^{*}\equiv ({e}_{h1}^{*},{e}_{h2}^{*},\dots ,{e}_{hM}^{*})$ such that the household*h*reference*m*th environmental variable${e}_{hm}^{*}$ lies between the household*h*period 0 and 1 levels for the*m*th environmental variable,${e}_{hm}^{0}$ and${e}_{hm}^{1}$ respectively for*m*= l,2,…,*M*and*h*= 1,…,*H*, and the conditional plutocratic cost of living index${P}^{*}({p}_{1}^{0},\dots ,\phantom{\rule[-0.0ex]{0.3em}{0.0ex}}{p}_{H}^{0},{p}_{1}^{1},\dots ,{p}_{H}^{1},{u}^{*},{e}^{*})$ evaluated at this intermediate reference utility vector*u*^{*}and the intermediate reference vector of household environmental variables${e}^{*}\equiv ({e}_{1}^{*},{e}_{2}^{*},\dots ,{e}_{H}^{*})$ lies between the observable (in principle) plutocratic Laspeyres and Paasche price indices,*P*and_{PL}*P*, defined above by the last equalities in (18.3) and (18.7)._{PP}

**18.15** The above result says that the *theoretical national plutocratic conditional consumer price index**P _{PL}* and the plutocratic or disaggregated Paasche index

*P*. Hence if

_{PP}*P*and

_{PL}*P*are not too different, a good point approximation to the theoretical national plutocratic consumer price index will be the

_{PP}*plutocratic or disaggregated Fisher index P*defined as:

_{PF}The plutocratic Fisher price index *P _{PF}* is computed just like the usual Fisher price index, except that each commodity in each region (or for each household) is regarded as a separate commodity. Of course, this index will satisfy the time reversal test.

**18.16** Since statistical agencies do not calculate Laspeyres, Paasche and Fisher price indices by taking inner products of price and quantity vectors, as was done in equation (18.9) and the previous definitions, it will be useful to obtain formulae for the Laspeyres and Paasche indices that depend only on price relatives and expenditure shares. In order to do this, it is necessary to introduce some notation. Define the expenditure share of household *h* on commodity *i* in period *t* as

Define the expenditure share of household *h* in total period *t* consumption as:

Finally, define the national expenditure share of commodity *i* in period *t* as:

The Laspeyres price index for region *h* (or household *h*) is defined as:

**18.17** Referring back to equation (18.3), the plutocratic national Laspeyres price index, *P _{PL}*, can be rewritten as follows:

Equation (18.15) shows that the plutocratic national Laspeyres price index is equal to a (period 0) regional expenditure share-weighted average of the regional Laspeyres price indices. Equation (18.16) shows that the national Laspeyres price index is equal to a (period 0) expenditure share-weighted average of the regional price relatives, *i* in region *h*.

**18.18** The Paasche price index for region *h* (or household *h*) is defined as:

**18.19** Referring back to equation (18.7), the plutocratic national Paasche price index, *P _{PP}*, can be rewritten as follows:

Equation (18.19) shows that the national plutocratic Paasche price index is equal to a (period 1) regional expenditure share-weighted harmonic mean of the regional Paasche price indices. Equation (18.20) shows that the national Paasche price index is equal to a (period 1) expenditure share-weighted harmonic average of the regional price relatives, *i* in region *h*.

**18.20** Of course, the share formulae for the plutocratic Paasche and Laspeyres indices, *P _{PP}* and

*P*given by equations (18.20) and (18.16), can now be used to calculate the plutocratic Fisher index,

_{PL}*P*≡ [

_{PF}*P*]

_{PP}P_{PL}^{1/2}.

**18.21** If prices are equal across regions, the formulae (18.16) and (18.20) simplify. The formula for the plutocratic Laspeyres index becomes:

where *P _{L}* is the usual aggregate Laspeyres price index based on the assumption that each household faces the same vector of commodity prices; see equation (18.5) for the definition of

*P*. Under the equal prices across households assumption, the formula for the plutocratic Paasche index becomes:

_{L}where *P _{P}* is the usual aggregate Paasche price index based on the assumption that each household faces the same vector of commodity prices; see equation (18.8) for the definition of

*P*.

_{P}**18.22** Thus with the assumption that commodity prices are the same across regions, in order to calculate national Laspeyres and Paasche indices, only “national” price relatives and national commodity expenditure shares are required for the two periods under consideration. If there is regional variation in prices, however, then the simplified formulae (18.21) and (18.22) are not valid and it is necessary to use the earlier formulae (18.16) and (18.20), which require the use of regional price relatives and regional expenditure shares.

## Democratic versus plutocratic cost of living indices

**18.23** The plutocratic indices considered above weight each household in the economy according to the size of its expenditures in the two periods under consideration. Instead of weighting in this way, it is possible to define theoretical indices (and “practical” approximations to them) that give each household or household group in the economy an equal weight. Following Prais (1959), such an index will be called a *democratic index*. In this section, the plutocratic index number theory developed in paragraphs 18.3 to 18.22 will be reworked into the democratic framework.

**18.24** Making the same assumptions as in paragraph 18.4, define the class of *conditional democratic cost of living indices*, *u* ≡ (*u*_{1},*u*_{2},…,*u _{H}*) and for the arbitrary vectors of household environmental variables

*e*for

_{h}*h*= 1,2,…,

*H*as follows:

Thus *e _{h}*, is not present in the cost function of household

*h*, then the conditional index

**18.25** Now specialize the general definition (18.23) by replacing the general utility vector *u* by either the period 0 vector of household utilities *e*_{1},*e*_{2},…*e _{H}*) ≡

*e*by either the period 0 vector of household environmental variables

*Laspeyres conditional democratic cost of living index*,

*Paasche conditional democratic cost of living index*,

**18.26** Specializing definition (18.23), the Laspeyres conditional democratic cost of living index,

since *h* = 1,2,…, *H*

≡ *P _{DL}*

where *P _{DL}* is defined to be the observable (in principle) democratic Laspeyres price index,

**18.27** In a similar manner, specializing definition (18.23), the Paasche conditional democratic cost of living index,

where *P _{DP}* is defined to be the

*democratic Paasche price index*,

*h*quantities for period 1,

*h*in the summation of individual household Paasche indices. Thus, it can be seen that the theoretical Paasche conditional democratic cost of living index,

*P*. Diewert (1983a, p. 191) first obtained the inequality (18.25) for the case where the environmental variables are absent from the household utility and cost functions, and prices are equal across households.

_{DP}**18.28** It is now shown how to obtain a theoretical democratic cost of living index that is bounded from above and below by observable indices. Using the inequalities (18.24) and (18.25) and the continuity properties of the conditional democratic cost of living

There exists a reference utility vector

${u}^{*}\equiv ({u}_{1}^{*},{u}_{2}^{*},\dots ,{u}_{H}^{*})$ such that the household*h*reference utility level${u}_{h}^{*}$ lies between the household*h*period 0 and I utility levels,${u}_{h}^{0}$ and${u}_{h}^{1}$ respectively for*h*= 1,…,*H*. Also, there exist household environmental vectors${e}_{h}^{*}\equiv ({e}_{h1}^{*},{e}_{h2}^{*},\dots ,{e}_{hM}^{*})$ such that the household*h*reference*m*th environmental variable${e}_{hm}^{*}$ lies between the household*h*period 0 and 1 levels for the mth environmental variable,${e}_{hm}^{0}$ and${e}_{hm}^{1}$ respectively for*m*= 1,2,…,*M*and*h*= 1,…,*H*. The conditional democratic cost of living index${P}_{D}^{*}({p}_{1}^{0},\dots ,\phantom{\rule[-0.0ex]{0.3em}{0.0ex}}{p}_{H}^{0},{p}_{1}^{1},\dots ,{p}_{H}^{1},{u}^{*},{e}^{*})$ , evaluated at this intermediate reference utility vector*u*^{*}and the intermediate reference vector of household environmental variables${e}^{*}\equiv ({e}_{1}^{*},{e}_{2}^{*},\dots ,{e}_{H}^{*})$ , lies between the observable (in principle) democratic Laspeyres and Paasche price indices,*P*and_{DL}*P*, defined above by the last equalities in (18.24) and (18.25)._{DP}

**18.29** The above result says that the *theoretical national democratic conditional consumer price index**P _{DL}* and the democratic Paasche index

*P*. Hence if

_{DP}*P*and

_{DL}*P*are not too different, a good point approximation to the theoretical national democratic consumer price index will be the

_{DP}*democratic Fisher index P*, defined as:

_{DF}The democratic Fisher price index, *P _{DF}*, will satisfy the time reversal test.

**18.30** Again, it will be useful to obtain formulae for the democratic Laspeyres and Paasche indices that depend only on price relatives and expenditure shares. Using definition (18.10) for the household *h* expenditure share on commodity *i* during period *t*, *h* can be written in share form as follows:

Substituting equation (18.27) into the definition of the democratic Laspeyres index, *P _{DL}*, leads to the following share type formula:

^{10}

Similarly, substituting equation (18.28) into the definition of the democratic Paasche index, *P _{DP}*, leads to the following share type formula:

**18.31** The formula for the democratic Laspeyres index in the previous paragraph simplifies if it can be assumed that each household faces the same vector of prices in each of the two periods under consideration. Under this condition, equation (18.28) can be rewritten as follows:

where the period 0 democratic expenditure share for commodity *i*,

Thus *i* during period 0. The formula for the democratic Paasche index does not simplify in the same way, under the assumption that households face the same prices in each period, because of the harmonic form of averaging in equation (18.30).

**18.32** The conclusion at this point is that democratic and plutocratic Laspeyres, Paasche and Fisher indices can be constructed by a statistical agency provided that information on household-specific price relatives,

**18.33** It is now necessary to discuss a practical problem that statistical agencies face: namely, that existing household consumer expenditure surveys, which are used in order to form estimates of household expenditure shares, are not very accurate. Thus the detailed commodity by region expenditure shares,

**18.34** It should be noted that the conditional index framework used above can be used to model situations where household preferences change (continuously) from the base period to the current period: simply choose the environmental variable to be time *t*. The theoretical results in paragraphs 18.14 and 18.28 imply the existence of cost of living indices that lie between observable Laspeyres and Paasche bounds, where the preference functions for the households are taken to be some preferences that are intermediate between the preferences pertaining to the two periods under consideration. As usual, if the observable bounds are not too far apart, taking the geometric average of the bounds leads to an adequate approximation to these theoretical cost of living indices.^{11}

**18.35** For criticisms and some limitations of the economic approach to index number theory, see Turvey (2000) and Diewert (2001).^{12}

This is the terminology used by Pollak (1989, p. 181) in his model of the conditional cost of living concept.

Caves, Christensen and Diewert (1982a, p. 1409) used the terms demographic variables or public goods to describe the vector of conditioning variables *e* in their generalized model of the Konüs price index or cost of living index, while Diewert (2001) used the term environmental variables.

It is assumed that each *f ^{h}*(

*q, e*) is continuous and increasing in the components of

*q*and

*e*, and is concave in the components of

*q*.

In order to simplify notation, in this section the notation *p* and *q* is used rather than the usual summation notation.

These authors provided generalizations of the plutocratic cost of living index attributable to Prais (1959). Pollak and Diewert did not include the environmental variables in their definitions of a group cost of living index.

This is the concept of a cost of living index that Triplett (2001) found most useful for measuring inflation: “One might want to produce a COL *conditional* on the base period’s weather experience…. In this case, the unusually cold winter does not affect the *conditional* COL subindex that holds the environment constant….the COL subindex that holds the environment constant is probably the COL concept that is most useful for an anti-inflation policy.” Hill (1999, p. 4) endorsed this point of view.

Thus the plutocratic Laspeyres index can be regarded as an ordinary Laspeyres index except that each commodity consumed by each household (or in each region) is regarded as a separate commodity.

The general case was obtained by Diewert (2001, p. 222).

See Diewert (2001. p. 223). Note that the household cost functions must be continuous in the environmental variables; this is a real restriction on the types of environmental variables which can be accommodated by the result.

Comparing the formula for the democratic Laspeyres index, *P _{DL}*, with the previous formula (18.16) for the plutocratic Laspeyres index,

*P*, it can be seen that the plutocratic weight for the

_{PL}*i*th price relative for household

*h*is

For a more extensive treatment of cost of living theory in the context of taste change, see Balk (1989a).

For a vigorous defence of the economic approach, see Triplett (2001).