Uploaded by Varvara Orlova

Macro Subject Guide

advertisement
Undergraduate study in Economics,
Management, Finance and the Social Sciences
Macroeconomics
K. Sheedy
EC2065
2022
Macroeconomics
K. Sheedy
EC2065
2022
Undergraduate study in
Economics, Management,
Finance and the Social Sciences
This subject guide is for a 200 course offered as part of the University of London
undergraduate study in Economics, Management, Finance and the Social Sciences.
This is equivalent to Level 5 within the Framework for Higher Education Qualifications
in England, Wales and Northern Ireland (FHEQ).
For more information, see: london.ac.uk
This guide was prepared for the University of London by:
Dr Kevin Sheedy, Assistant Professor of Economics, Department of Economics, The
London School of Economics and Political Science.
This is one of a series of subject guides published by the University. We regret that
due to pressure of work the authors are unable to enter into any correspondence
relating to, or arising from, the guide. If you have any comments on this subject
guide, favourable or unfavourable, please use the form at the back of this guide.
University of London
Publications Office
Stewart House
32 Russell Square
London WC1B 5DN
United Kingdom
london.ac.uk
Published by: University of London
© University of London 2022
The University of London asserts copyright over all material in this subject guide
except where otherwise indicated. All rights reserved. No part of this work may
be reproduced in any form, or by any means, without permission in writing from
the publisher. We make every effort to respect copyright. If you think we have
inadvertently used your copyright material, please let us know.
EC2065 Macroeconomics
Contents
Chapter 1: The supply side of the economy ........................................................................... 8
1.1 Production functions and factors of production........................................................... 8
1.1.1 Factors of production............................................................................................ 8
1.1.2 The production function ....................................................................................... 9
1.1.3 Neoclassical production functions....................................................................... 10
1.1.4 The Cobb-Douglas production function ............................................................... 12
1.1.5 The per worker production function ................................................................... 12
1.2 Factor markets and the distribution of income .......................................................... 13
1.2.1 The Cobb-Douglas production function ............................................................... 15
Box 1.1: Understanding inequality in wages ................................................................ 15
1.3 Population growth according to Malthus ................................................................... 17
1.3.1 Demographics ..................................................................................................... 17
1.3.2 An agricultural economy ..................................................................................... 18
1.3.4 Dynamics of a Malthusian economy.................................................................... 19
1.3.5 What does (or does not) help? ............................................................................ 20
Box 1.2: A 14th-century pandemic ............................................................................... 21
1.4 Hours of work and the supply of labour ..................................................................... 22
1.5 The effects of wages on labour supply ....................................................................... 26
1.5.1 Effects on those already participating in the labour market ................................ 26
1.5.2 Effect on the labour-market participation decision ............................................. 27
1.5.3 The labour supply curve ...................................................................................... 28
1.5.4 Do higher tax rates raise more revenue? ............................................................ 30
1.6 Equilibrium and efficiency ......................................................................................... 31
1.6.1 A static macroeconomic model ........................................................................... 31
1.6.2 Equilibrium in labour and goods markets ............................................................ 31
1.6.3 Economic efficiency ............................................................................................ 32
Box 1.4: Should wages or rents be taxed to pay for public expenditure? ..................... 34
1.6.4 Taxing wages but not rents ................................................................................. 35
1.6.5 Taxing rents but not wages ................................................................................. 35
1.7 Capital accumulation ................................................................................................. 36
i
EC2065 Macroeconomics
1.8 The Solow model ....................................................................................................... 38
1.8.1 The Solow diagram ............................................................................................. 39
Box 1.5: The ‘Asian tiger’ economies ........................................................................... 41
Box 1.6: Interest rates in the long run .......................................................................... 44
Chapter 2: Economic growth ............................................................................................... 46
2.1 Evidence on economic growth ................................................................................... 46
2.1.1 Measuring economic growth .............................................................................. 47
2.1.2 Why growth matters ........................................................................................... 47
2.1.3 Economic growth in historical perspective .......................................................... 48
2.1.4 The distribution of income across countries........................................................ 50
2.1.5 Convergence ....................................................................................................... 50
2.2 Income and growth rates across countries ................................................................ 52
Box 2.1: Can the Solow model explain large income differences across countries? ...... 54
Box 2.2: How long does convergence to the steady state take in the Solow model? .... 56
2.3 Technological progress .............................................................................................. 57
2.3.1 A constant growth rate of technology ................................................................. 58
2.4 International flows of investment .............................................................................. 60
Box 2.3: Institutions and income differences across countries ..................................... 61
2.5 The golden rule ......................................................................................................... 62
2.5.1 Finding the golden rule ....................................................................................... 63
2.5.2 Transitional paths to the golden rule .................................................................. 65
2.5.3 Testing for dynamic inefficiency.......................................................................... 66
Box 2.4: Climate change and the economy .................................................................. 67
2.6 The AK model ............................................................................................................ 68
2.6.1 The AK production function ................................................................................ 68
2.6.2 Endogenous growth ............................................................................................ 70
Box 2.5: Endogenous growth and divergence between countries ................................ 70
2.7 Learning by doing ...................................................................................................... 72
2.8 Human capital ........................................................................................................... 73
2.9 Research and development ....................................................................................... 75
2.9.1 Non-rivalrous but excludable technologies ......................................................... 76
2.9.2 A production function for ideas .......................................................................... 77
2.9.3 Endogenous growth ............................................................................................ 77
ii
EC2065 Macroeconomics
2.9.4 Constant returns or diminishing returns to R&D? ............................................... 78
2.10 International technology transfer ............................................................................ 78
Box 2.6: How strong should intellectual property rights be? ........................................ 80
Chapter 3: Aggregate demand............................................................................................. 82
3.1 Detrending macroeconomic data .............................................................................. 82
3.1.1 Business cycles.................................................................................................... 82
3.1.2 Detrending.......................................................................................................... 83
3.1.3 Business-cycle stylised facts ................................................................................ 84
3.2 Consumption ............................................................................................................. 86
3.3 A two-period consumption model ............................................................................. 88
3.3.1 Preferences ........................................................................................................ 88
3.3.2 Budget constraint ............................................................................................... 89
3.3.3 Choice of an optimal consumption plan .............................................................. 92
3.4 Bonds, yields, and interest rates ................................................................................ 96
3.5 Interest rates and consumption ................................................................................. 96
Box 3.2: Durables, non-durables, and services ........................................................... 100
3.6 Consumption smoothing in the aggregate ............................................................... 101
Box 3.3: Supply disruptions and real interest rates .................................................... 104
3.8 A two-period model of investment .......................................................................... 106
3.8.1 The production function ................................................................................... 106
3.8.2 Capital accumulation ........................................................................................ 106
3.8.3 Firms’ profits .................................................................................................... 106
3.8.4 Options for financing investment ...................................................................... 107
3.8.5 The optimal investment decision ...................................................................... 108
3.8.6 Does the source of financing matter? ............................................................... 109
3.8.7 The investment demand curve.......................................................................... 109
3.9 The stock market ..................................................................................................... 110
Box 3.4: Stock prices and firms’ investment decisions ............................................... 111
Box 3.5: Should capital be taxed? .............................................................................. 112
3.10 Labour supply over time ........................................................................................ 114
3.11 A dynamic macroeconomic model ......................................................................... 116
3.11.1 A representative household ............................................................................ 116
3.11.2 Firms............................................................................................................... 117
iii
EC2065 Macroeconomics
3.11.3 Government ................................................................................................... 118
3.12 General equilibrium ............................................................................................... 118
Box 3.6: Growth slowdowns and real interest rates ................................................... 121
Chapter 4: Fiscal policy and credit-market imperfections .................................................. 123
4.1 Taxes and the government’s budget constraint ....................................................... 123
4.2 Ricardian equivalence .............................................................................................. 126
Box 4.2: The effects of a fiscal stimulus ..................................................................... 131
4.3 Credit-market imperfections ................................................................................... 134
Box 4.3: Bequests and intergenerational redistribution ............................................. 136
4.4 Interest-rate spreads ............................................................................................... 139
4.5 Asymmetric information .......................................................................................... 142
Box 4.4: Does the current profitability of firms matter for investment?..................... 143
Box 4.5: Financial crises ............................................................................................. 144
4.6 Limited commitment ............................................................................................... 146
Box 4.6: Interest rates and the value of housing collateral......................................... 149
4.7 Overlapping generations ......................................................................................... 151
Box 4.7: Pay-as-you-go pension systems.................................................................... 153
Box 4.8: Should pensions be fully funded? ................................................................. 155
Box 4.9: Declining population growth rates and pay-as-you-go pensions ................... 157
Box 4.10: Bubbles in financial markets....................................................................... 159
Box 4.11: Does the government have a budget constraint when interest rates are low?
.................................................................................................................................. 161
Chapter 5: Unemployment ................................................................................................ 163
5.1 Introduction to unemployment ............................................................................... 163
5.2 Efficiency wages ...................................................................................................... 166
Box 5.1: Changes in firms’ ability to monitor workers ................................................ 170
5.3 Search and matching in the labour market .............................................................. 171
5.4 A model of job search .............................................................................................. 172
5.5 Stocks and flows in the labour market ..................................................................... 175
5.51 Stock-flow accounting........................................................................................ 175
5.52 The equilibrium unemployment rate ................................................................. 176
5.53 Unemployment in the search-and-matching model ........................................... 177
Box 5.2: The generosity of the welfare state.............................................................. 178
Box 5.3: Wage dispersion and incentives to search for jobs ....................................... 179
iv
EC2065 Macroeconomics
5.6 Vacancies and unemployment ................................................................................. 180
5.6.1 An equilibrium search model of unemployment ............................................... 181
5.6.2 The matching function ...................................................................................... 182
5.6.3 Explaining the Beveridge curve ......................................................................... 183
5.6.4 Market tightness............................................................................................... 184
5.7 Wage bargaining...................................................................................................... 184
5.8 Job creation ............................................................................................................. 186
Box 5.4: Mismatch ..................................................................................................... 188
Box 5.6: ‘Furlough’ policies in the COVID pandemic ................................................... 189
Box 5.5: The bargaining power of workers ................................................................. 189
Chapter 6: Money ............................................................................................................. 191
6.1 Why does money matter? ....................................................................................... 191
6.1.1 Medium of exchange ........................................................................................ 191
6.1.2 Unit of account ................................................................................................. 192
6.1.3 Different objects that serve as money .............................................................. 192
6.2 A search-theory perspective on money ................................................................... 192
6.2.1 A simple search model of money ...................................................................... 193
6.2.1 Commodity money ........................................................................................... 193
6.2.2 Credit money .................................................................................................... 194
6.2.3 Money and credit ............................................................................................. 195
6.2.4 Fiat money ........................................................................................................ 196
Box 6.1: Cryptocurrencies .......................................................................................... 198
6.3 Money and assets as stores of value ........................................................................ 199
6.3.1 Inflation ............................................................................................................ 199
6.3.2 The Fisher equation .......................................................................................... 200
6.3.3 Ex-ante and ex-post interest rates .................................................................... 201
6.3.4 The opportunity cost of holding money ............................................................ 201
6.3.5 Real and nominal interest rates ........................................................................ 202
6.4 The demand for money ........................................................................................... 203
6.4.1 Economising on holding money ........................................................................ 203
6.4.2 Alternatives to money ...................................................................................... 205
6.4.3 The money demand function ............................................................................ 205
6.5 Money and economic activity .................................................................................. 207
v
EC2065 Macroeconomics
6.6 The supply of money ............................................................................................... 208
6.7 Money and prices .................................................................................................... 210
Box 6.2: The instability of money demand ................................................................. 211
6.8 Money and public finance ....................................................................................... 213
6.9 Does monetary policy matter?................................................................................. 216
Box 6.3: Money supply increases that the central bank announces are temporary .... 218
6.10 Optimal monetary policy and the costs of inflation................................................ 220
Box 6.4: Hyperinflations ............................................................................................ 222
Box 6.5: Cash and tax evasion .................................................................................... 223
6.11 Conducting monetary policy by setting interest rates ............................................ 224
6.12 Taylor rules and the Taylor principle ...................................................................... 226
6.13 The liquidity trap and the zero lower bound .......................................................... 227
Box 6.6: A deflation trap ............................................................................................ 228
6.14 Negative nominal interest rates............................................................................. 229
Chapter 7: Banking and finance ......................................................................................... 232
7.1 Fractional reserve banking....................................................................................... 232
7.2 The tools of monetary policy ................................................................................... 234
7.2.1 Open-market operations .................................................................................. 234
7.2.2 Standing facilities.............................................................................................. 235
7.2.3 Reserve requirements....................................................................................... 235
7.3 The interbank market .............................................................................................. 236
7.3.1 The demand for reserves .................................................................................. 236
7.3.2 Equilibrium in the interbank market ................................................................. 237
Box 7.1: The ‘channel’ system of monetary policy ..................................................... 239
Box 7.2: The ‘floor’ system of monetary policy .......................................................... 240
7.4 The supply of bank deposits .................................................................................... 241
7.4.1 Costs of maintaining adequate reserves ........................................................... 242
7.4.2 Bank capital requirements ................................................................................ 243
7.5 Equilibrium in the banking market ........................................................................... 243
Box 7.3: Should central banks pay interest on reserves?............................................ 244
Seigniorage revenue and the profitability of the central bank ................................... 246
7.6 Bond maturity and the yield curve .......................................................................... 246
7.7 The expectations theory of long-term interest rates ................................................ 248
vi
EC2065 Macroeconomics
Box 7.4: Forecasting from the shape of the yield curve.............................................. 249
7.8 Risk and portfolio choice ......................................................................................... 250
Box 7.5: The typical shape of the yield curve ............................................................. 253
7.9 The functions of banks............................................................................................. 254
7.9.1 The Diamond-Dybvig model .............................................................................. 255
7.9.2 An economy with no financial intermediaries ................................................... 256
7.10 Banking as maturity transformation ...................................................................... 256
7.10.1 Bank deposits ................................................................................................. 256
7.10.2 Competition between banks ........................................................................... 257
7.10.3 The equilibrium deposit contract offered by banks ......................................... 258
7.11 Bank runs .............................................................................................................. 260
7.11.1 Strategic demands for withdrawals and bank failures ..................................... 260
7.11.2 Multiple equilibria and the possibility of bank runs ......................................... 261
7.11.3 The 2007 Northern Rock bank run .................................................................. 261
7.11.4 The ‘shadow’ banking system and the 2008 financial crisis ............................. 261
7.12 Deposit insurance and bank regulation .................................................................. 261
7.12.1 Deposit insurance ........................................................................................... 262
7.12.2 The central bank as ‘lender of last resort’ ....................................................... 262
7.12.3 Bank capital requirements .............................................................................. 262
7.12.4 Reserve requirements ..................................................................................... 263
Box 7.6: The 100 per cent reserve requirements ....................................................... 263
Box 7.7: Central-bank digital currency ....................................................................... 265
Chapter 8: Business cycles................................................................................................. 266
8.1 Nominal rigidity ....................................................................................................... 266
8.2 The new Keynesian model ....................................................................................... 268
8.3 The real effects of monetary policy.......................................................................... 271
Box 8.1: The Volcker disinflation................................................................................ 272
8.4 Business cycles due to demand shocks .................................................................... 273
Box 8.2: Can the new Keynesian model match the business-cycle stylised facts?....... 274
Box 8.3: Labour hoarding ........................................................................................... 280
8.5 The natural rate of interest...................................................................................... 281
8.5.1 Imperfect competition and the output supply curve ......................................... 281
8.5.2 Market clearing in the absence of nominal rigidities ......................................... 282
vii
EC2065 Macroeconomics
8.5.3 The long run and the short run ......................................................................... 283
8.6 Optimal stabilisation policy...................................................................................... 283
Box 8.4: Modelling monetary policy using Taylor rules and LM curves ....................... 285
8.6.1 Money supply targets, the LM curve and the IS-LM model................................ 286
8.6.2 The Taylor rule .................................................................................................. 287
8.7 Real business cycle theory ....................................................................................... 288
8.8 Business cycles due to supply shocks ....................................................................... 289
8.8.1 Supply shocks ................................................................................................... 289
8.8.2 The predictions of the RBC model ..................................................................... 290
8.8.3 Stabilisation policy? .......................................................................................... 291
Box 8.5: Sources of supply shocks in the RBC model .................................................. 291
8.9 Coordination failure model ...................................................................................... 293
8.9.1 Labour productivity spillover across firms ......................................................... 293
8.9.2 Implications for the output supply curve .......................................................... 295
8.10 Multiple equilibria and business cycles .................................................................. 296
Box 8.6: The strength of strategic complementarities................................................ 299
Chapter 9: Inflation, expectations and macroeconomic policy........................................... 300
9.1 Inflation and the Phillips curve ................................................................................ 300
9.1.1 Firms’ incentives to adjust prices ...................................................................... 300
9.1.2 Price changes and economic activity ................................................................. 301
9.1.3 Expectations ..................................................................................................... 302
9.1.4 The Phillips curve .............................................................................................. 302
9.1.5 Inflation and unemployment ............................................................................ 303
9.2 Expectations and aggregate demand ....................................................................... 303
9.2.1 Inflation expectations and real interest rates .................................................... 303
9.2.2 Expectations of the economy’s future GDP ....................................................... 304
9.3 Aggregate demand with market imperfections ........................................................ 304
9.3.1 Consumption and aggregate demand ............................................................... 305
9.3.2 Consumption and aggregate demand with credit-market imperfections........... 306
Box 9.1: Multiplier and crowding-out effects of fiscal policy ...................................... 307
Box 9.2: Asset prices and the financial accelerator .................................................... 308
Box 9.3: The 2008 financial crisis ............................................................................... 309
9.4 Inflation, aggregate demand and monetary policy................................................... 310
viii
EC2065 Macroeconomics
9.5 The costs of inflation ............................................................................................... 311
9.5.1 Money being a poor store of value ................................................................... 311
9.5.2 Menu costs and relative-price distortions ......................................................... 311
9.5.3 Inflation and redistribution ............................................................................... 311
Box 9.4: Inflation targeting ........................................................................................ 312
9.6 Time inconsistency .................................................................................................. 314
Box 9.5: The inflation bias problem ........................................................................... 314
9.7 Unconventional monetary policy at the interest-rate lower bound ......................... 316
9.7.1 Quantitative easing........................................................................................... 316
9.7.2 Forward guidance ............................................................................................. 318
Box 9.6: Inflation targeting and the interest-rate lower bound problem .................... 319
Box 9.7: Forward guidance and confidence ............................................................... 320
Box 9.8: Negative interest rate policies...................................................................... 321
Chapter 10: Open-economy macroeconomics ................................................................... 322
10.1 International trade in goods and assets ................................................................. 322
10.2 Gains from trade in assets ..................................................................................... 324
Box 10.1: International risk sharing ........................................................................... 328
Box 10.2: The ‘twin deficits’....................................................................................... 330
10.3 Sovereign default................................................................................................... 331
10.4 Open-economy real dynamic model ...................................................................... 334
10.4.1 Balance-of-payments equilibrium, capital flows and net exports .................... 334
10.4.2 Examples ........................................................................................................ 336
Box 10.3: Global imbalances ...................................................................................... 337
Box 10.4: Capital controls .......................................................................................... 339
10.5 The terms of trade ................................................................................................. 340
10.6 Exchange rates ...................................................................................................... 341
10.7 Exchange-rate regimes .......................................................................................... 343
10.7.1 Intervention in the foreign-exchange market .................................................. 343
10.7.2 A shock to foreign prices ................................................................................. 344
10.7.3 Monetary policy autonomy with a flexible exchange rate ............................... 345
10.8 Open-economy sticky-price model ........................................................................ 346
10.8.1 Competitiveness and output demand ............................................................. 346
10.8.2 Balance-of-payments equilibrium and uncovered interest parity .................... 347
ix
EC2065 Macroeconomics
10.8.3 Monetary policy and the exchange-rate regime .............................................. 348
Box 10.5: The trilemma ............................................................................................. 350
Box 10.6: Currency crises ........................................................................................... 352
x
EC2065 Macroeconomics | General information
General information
Module level: 5
Credit: 30
Notional study time: 300 hours
Summary
This course will cover the fundamental principles of macroeconomics at an
intermediate level. Topics include the supply side of the economy and economic
growth, the demand side of the economy, consumption, investment, fiscal policy, the
labour market and unemployment, financial markets, banking, money and monetary
policy, business cycles, inflation, and international macroeconomics.
Conditions
Prerequisite: If taken as part of a BSc degree, the following course(s) must be
passed before this course may be attempted:
EC1002 Introduction to economics and
Either MT105A Mathematics 1 (half course) or MT1174 Calculus or MT1186
Mathematical methods).
Aims and objectives
This course aims to bring you up to date with modern developments in
macroeconomics and to help you analyse the macroeconomic issues of the day.
Learning outcomes
At the end of the course and having completed the essential reading and activities,
you should be able to think about and give answers to key macroeconomic
questions, for example:
•
What are the forces that drive long-term prosperity?
•
Is a growth slowdown in emerging economies inevitable?
•
Why are real interest rates so low?
•
What causes bubbles in financial markets?
•
Does the government have a budget constraint?
•
How does the labour market respond to structural change and shifting
employment patterns?
•
What is the role of banks and why are they inherently fragile?
•
Is it a good idea for central banks to set up new digital currencies?
1
EC2065 Macroeconomics | General information
•
Why does economic activity fluctuate?
•
Can and should policymakers seek to ameliorate business cycles?
•
What options do central banks have when nominal interest rates fall to zero?
•
What are the causes of global imbalances?
The approach of the course is to discuss the salient features of the data and then go
on to present macroeconomic models to study these questions.
Essential reading
The textbook for the course is:
Williamson, S.D. Macroeconomics. (London: Pearson, 2018) 6th edition
[ISBN 9780134472119].
Detailed reading references in this subject guide refer to the editions of the set
textbooks listed above. New editions of one or more of these textbooks may have
been published by the time you study this course. You can use a more recent edition
of any of the books; use the detailed chapter and section headings and the index to
identify relevant readings. Also check the virtual learning environment (VLE)
regularly for updated guidance on readings.
Online study resources
In addition to the subject guide and the Essential reading, it is crucial that you take
advantage of the study resources that are available online for this course, including
the VLE and the Online Library.
You can access the VLE, the Online Library and your University of London email
account via the Student Portal.
You should have received your login details for the Student Portal with your official
offer, which was emailed to the address that you gave on your application form. You
have probably already logged in to the Student Portal in order to register! As soon as
you registered, you will automatically have been granted access to the VLE, Online
Library and your fully functional University of London email account.
If you have forgotten these login details, please click on the ‘Forgotten your
password’ link on the login page.
The VLE
The VLE, which complements this subject guide, has been designed to enhance
your learning experience, providing additional support and a sense of community. It
forms an important part of your study experience with the University of London and
you should access it regularly.
The VLE provides a range of resources for EMFSS courses:
•
Course materials: Subject guides and other course materials available for
download. In some courses, the content of the subject guide is transferred
2
EC2065 Macroeconomics | General information
•
•
•
•
•
•
•
•
•
into the VLE and additional resources and activities are integrated with the
text.
Readings: Direct links, wherever possible, to essential readings in the Online
Library, including journal articles and ebooks.
Video content: Including introductions to courses and topics within courses,
interviews, lessons and debates.
Screencasts: Videos of PowerPoint presentations, animated podcasts and
on-screen worked examples.
External material: Links out to carefully selected third-party resources.
Self-test activities: Multiple-choice, numerical and algebraic quizzes to
check your understanding.
Collaborative activities: Work with fellow students to build a body of
knowledge.
Discussion forums: A space where you can share your thoughts and
questions with fellow students. Many forums will be supported by a ‘course
moderator’, a subject expert employed by LSE to facilitate the discussion and
clarify difficult topics.
Past examination papers: We provide up to three years of past
examinations alongside Examiners’ commentaries that provide guidance on
how to approach the questions.
Study skills: Expert advice on getting started with your studies, preparing for
examinations and developing your digital literacy skills.
Note: Students registered for Laws courses also receive access to the dedicated
Laws VLE.
Some of these resources are available for certain courses only, but we are
expanding our provision all the time and you should check the VLE regularly for
updates.
Making use of the Online Library
The Online Library contains a huge array of journal articles and other resources to
help you read widely and extensively.
To access the majority of resources via the Online Library you will either need to use
your University of London Student Portal login details, or you will be required to
register and use an Athens login.
The easiest way to locate relevant content and journal articles in the Online Library is
to use the Summon search engine.
If you are having trouble finding an article listed in a reading list, try removing any
punctuation from the title, such as single quotation marks, question marks and
colons.
For further advice, please use the online help pages or contact the Online Library
team.
3
EC2065 Macroeconomics | General information
Syllabus
Topic 1: The supply side of the economy
Contents: Factors of production (labour, land, and capital), the production function
and the supply side of the economy, the distribution of income between factors
(wages, rents, and interest), population growth, labour-market participation and
labour supply, taxation, competitive equilibrium and efficiency, the Solow model.
Questions to address
•
Why was economic growth so different before and after the Industrial
Revolution?
•
Why have hours worked declined in advanced economies even though wages
are so much higher than in the past?
•
Why has wage inequality increased in recent decades?
•
Should land-value taxes be used?
•
Is a growth slowdown in emerging economies inevitable?
Topic 2: Economic growth in the long run
Contents: Evidence on economic growth and the income distribution across
countries, convergence, saving rates and the Golden rule, technological progress,
international investment flows, institutions and misallocation, endogenous growth
theory, learning by doing, human capital, research and development, diffusion of
knowledge between countries.
Questions to address
•
What are the forces that drive long-run prosperity?
•
Are we saving enough for the future?
•
Why is the gap between rich and poor countries so large?
•
How strong should intellectual property rights be?
•
What are the implications of climate change for the economy?
Topic 3: The demand side of the economy
Contents: Evidence on macroeconomic fluctuations using detrended data,
consumption, the relationship between consumption and income, durable and nondurable goods and services, interest rates and saving, bond yields, determinants of
real interest rates, investment, the stock market, a dynamic macroeconomic model.
Questions to address
•
Why are purchases of capital goods by firms very volatile while purchases of
services by households are more stable?
•
Why are real interest rates so low?
4
EC2065 Macroeconomics | General information
•
Do low interest rates discourage saving and stimulate consumption?
•
Should capital and wealth be taxed to fund public expenditure instead of
wages?
•
Is there a link between the stock market and the amounts invested by firms?
Topic 4: Credit-market imperfections and fiscal policy
Contents: The government budget constraint, Ricardian equivalence, taxes on
consumption, borrowing constraints, interest-rate spreads, asymmetric information
between borrowers and lenders, limited commitment and collateral, overlapping
generations, pension systems.
Questions to address
•
Can tax cuts stimulate demand?
•
Why do house prices affect the economy?
•
How does a financial crisis affect households’ and firms’ spending decisions?
•
Should the government be involved in providing pensions or leave it to the
market?
•
How do demographics affect the pension system?
•
What causes bubbles in financial markets?
•
Does the government have a budget constraint?
Topic 5: Unemployment, vacancies, and wages
Contents: Unemployment and wage rigidity, efficiency wages, the process of search
and matching, wage dispersion and the reservation wage, labour-market flows,
vacancies and the Beveridge curve, wage bargaining, job creation, and labourmarket tightness.
Questions to address
•
Why does unemployment occur, even when job vacancies are unfilled?
•
How does the labour market respond to structural change and shifting
employment patterns?
•
What are the effects of labour-market institutions such as unemployment
insurance?
•
What was the role of job-support schemes during the Covid pandemic?
Topic 6: Money and monetary policy
Contents: The nature and functions of money, money’s role as a medium of
exchange with search and matching, inflation and interest rates, the demand for
money and credit as a substitute for money, the fiat monetary system, the effects of
5
EC2065 Macroeconomics | General information
monetary policy on inflation and economic activity, controlling inflation using Taylor
rules, the lower bound on nominal interest rates.
Questions to address
•
Why has the link between money supply growth and inflation been unstable?
•
What advantages do governments derive from being able to create money?
•
What are hyperinflations and why are they so damaging?
•
How should central banks conduct monetary policy?
•
Is it better to have a monetary system without physical cash?
•
Can and should nominal interest rates ever be negative?
•
How do cryptocurrencies differ from existing forms of money?
Topic 7: Banking, finance, and the money markets
Contents: Fractional reserve banking, the tools of monetary policy and how policy
decisions are implemented, the interbank market, reserves and deposit creation by
commercial banks, bonds and the yield curve, the expectations theory of long-term
interest rates, risk and portfolio choice, banking and maturity transformation, bank
runs.
Questions to address
•
How do central banks control interest rates?
•
What information can we learn from the shape of the yield curve?
•
What is the role of banks and why are they inherently fragile?
•
How should the banking system be regulated?
•
Would the monetary system work better if commercial banks were prevented
from creating money by imposing 100 per cent reserve requirements?
•
Is it a good idea for central banks to set up new digital currencies?
Topic 8: Nominal rigidities and business cycles
Contents: Sticky prices, the New Keynesian model, the real effects of monetary
policy, business cycles and the role of stabilization policy, demand and supply
shocks, real business cycle theory, evidence on business-cycle fluctuations,
coordination failure and multiple equilibria.
Questions to address
•
Why does economic activity fluctuate?
•
What are the shocks that cause booms and recessions?
•
Can and should policymakers seek to ameliorate business cycles?
6
EC2065 Macroeconomics | General information
•
Can changes in confidence be the driving force of business cycles?
Topic 9: Inflation, expectations, and macroeconomic policy
Contents: Inflation and the Phillips curve, the role of expectations, costs of inflation,
time inconsistency and the inflation bias, inflation targeting, aggregate demand
multipliers and the effectiveness of fiscal policy, unconventional monetary policy at
the interest-rate lower bound, quantitative easing and forward guidance.
Questions to address
•
Can a fiscal stimulus raise GDP by more than the extra government
spending?
•
Should central banks prioritise controlling inflation or focus on trying to
stabilise fluctuations in real GDP?
•
What options do central banks have when nominal interest rates fall to zero?
•
What is forward guidance, and how effective is it as a monetary policy tool?
•
Should inflation targeting be reformed, or abandoned and replaced by targets
for the price level or nominal GDP?
Topic 10: International trade in goods and assets
Contents: The balance of payments, gains from trade in assets, sovereign default
and limits on international lending, determinants of the current account, exchange
rates and exchange-rate regimes, purchasing power parity, the terms of trade,
uncovered interest parity, capital mobility and capital controls, the trilemma.
Questions to address
•
What are the causes of ‘global imbalances’?
•
Do government budget deficits cause current-account deficits?
•
Are monetary and fiscal policy still effective in an open economy where capital
can flow freely?
•
Should exchange rates be fixed or left to float?
•
What are the causes of currency crises and the collapse of fixed exchangerate systems?
7
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Chapter 1: The supply side of the economy
We start by studying the supply side of the economy. This means the economy’s capacity to
produce goods and services. We will defer consideration of demand-side issues until
Chapter 3. In this chapter, we will introduce a basic supply-side theory of gross domestic
product (GDP). GDP is a measure both of production and of income, so our model is a
starting point in understanding how much an economy is able to produce and how much
people within the economy are able to earn.
Using the supply-side model, we will look at what explains the level of GDP and how the
economy’s total income is distributed. This model serves as the foundation for our study of
economic growth in Chapter 2, where we will explore why there is growth in GDP per
person over time and why the level of GDP per person differs so much across countries.
Essential reading
•
Williamson, Chapters 4, 5 and 7.
1.1 Production functions and factors of production
The supply-side model of an economy’s GDP has two ingredients: first, the quantities of
factors of production available and second, the production function.
1.1.1 Factors of production
Factors of production are basic inputs into the production process for goods and services.
Here, we consider three factors:
1. Labour
2. Land
3. Capital.
Capital refers to goods used to produce other goods in the future, for example, machinery,
tools, buildings, computers, vehicles. In our model, suppose that capital goods are
homogeneous and the capital stock, denoted 𝐾𝐾, is the quantity of units of capital available
to use for production. The quantity of capital can be increased by producing new capital
goods and this process of capital accumulation is described in Section 1.7. For now, the basic
supply-side model takes as given the available supplies of each of the factors of production,
including capital.
Land refers to physical space needed to produce goods and services, for example, farmland,
or a city-centre site for a shopping mall. An essential characteristic of land is that it is in fixed
supply. Even when we allow for the supplies of labour and capital to change over time in
later models, the quantity of land remains fixed. Treating units of land as homogeneous for
8
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
simplicity, we denote the fixed quantity by 𝐿𝐿. Natural resources can also be included as part
of land, broadly interpreted.
Labour as a factor of production refers to work done by people to produce goods and
services. The supply of labour to the economy depends on how many workers there are and
the number of hours worked by each person. To begin with, we will treat labour as
homogeneous and denote the quantity by 𝑁𝑁 that, depending on the context, could be
measured as a number of people or a number of hours worked. More broadly, we could also
consider the skills, training and experience of the workforce as part of what determines the
supply of labour, although these are often counted as a separate factor of production
known as ‘human capital’, considered in Chapter 2.
1.1.2 The production function
The production function describes how the factors of production are combined to produce
the output of final goods and services. The production function is a high-level summary of
the production process, which may involve many stages and many intermediate inputs that
are not considered explicitly. The production function is the link between the basic inputs of
the factors of production and the output of final goods and services.
The economy’s real GDP π‘Œπ‘Œ measures output of final goods and services. In practice, this
comprises a large number of different products that are aggregated using their relative
prices. However, for much of the time we will simplify matters by assuming there is just a
single homogeneous good, or a stable basket of goods, produced in the economy.
The production function describes how output π‘Œπ‘Œ is produced:
π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝐿𝐿, 𝑁𝑁)
Assuming factors are fully employed, factor supplies 𝐾𝐾, 𝐿𝐿, and 𝑁𝑁 are inputs to production
function 𝐹𝐹 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁). In addition to the factor supplies, output also depends on the overall
productivity of the factors. This is the notion of total factor productivity (TFP), which is
represented in the production function by the coefficient 𝑧𝑧. The value of 𝑧𝑧 can represent the
level of technology, or how efficiently factors of production are allocated to their best uses.
The production function, together with the factor supplies and TFP, determine real GDP π‘Œπ‘Œ.
We typically make some conventional assumptions about the function 𝐹𝐹(𝐾𝐾, 𝐿𝐿, 𝑁𝑁).
Constant returns to scale
The first assumption is that the production function has constant returns to scale. If the
economy were able to double its supplies of all factors of production then it should be able
to double its output. More generally, scaling all inputs of factors of production scales output
in same way. The usual justification for assuming constant returns to scale is the ‘replication
principle’. By using the same production techniques and technologies with the additional
factors organised in the same way as the original factors, it should be possible to make
double the original amount of output if all factors have been accounted for.
9
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Positive, but diminishing, marginal products
The second assumption is that increasing the supply of one factor of production without
changing the amounts of the other factors available raises output but less than
proportionately. The extra output produced by an extra unit of one factor is the marginal
product of that factor. Mathematically, a factor’s marginal product is the partial derivative
of the production function with respect to that factor. For example, the marginal product of
capital, denoted by 𝑀𝑀𝑃𝑃𝐾𝐾 , is:
𝑀𝑀𝑃𝑃𝐾𝐾 =
πœ•πœ•πœ•πœ•
= 𝑧𝑧𝐹𝐹𝐾𝐾 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁)
πœ•πœ•πœ•πœ•
Stated in terms of marginal products, the second assumption is that each factor’s marginal
product is positive but diminishes as the supply of that factor increases. Mathematically,
𝑀𝑀𝑃𝑃𝐾𝐾 is positive but declines as 𝐾𝐾 increases. The same assumptions are made for the
marginal products of land 𝑀𝑀𝑃𝑃𝐿𝐿 = πœ•πœ•πœ•πœ•⁄πœ•πœ•πœ•πœ• and labour 𝑀𝑀𝑃𝑃𝑁𝑁 = πœ•πœ•πœ•πœ•⁄πœ•πœ•πœ•πœ•.
Why are factors’ marginal products decreasing? Consider, for example, the use of capital in
an office job. Without a computer, it is very difficult to perform many tasks. Giving an office
worker a computer has a large effect on that worker’s output compared to no computer, so
marginal product of capital initially high. And while a more powerful computer may allow
the worker to produce more, given the nature of the task performed by the worker, extra
computing power is unlikely to raise the worker’s output proportionately. This means that
the marginal product of capital declines.
It is important to note that this argument holds fixed the number of workers, their skills and
state of technology. If the extra computing power were allocated to an additional worker, or
used by workers with higher skills to perform more advanced tasks then there is no
presumption that output will not rise proportionately.
Inada conditions
The third assumption on the production function is known as the ‘Inada conditions’. These
are essentially a stronger version of the assumption of diminishing marginal products.
Rather than just requiring the marginal product of a factor declines as the use of that factor
increases, the Inada conditions require that the marginal product declines all the way to
zero. Similarly, the Inada conditions require that the marginal product of a factor is initial
very high (mathematically, it is said to approach infinity) when the usage of the factor is
close to zero. One consequence of the Inada conditions is that some positive amount of
each factor of production is essential to produce any output.
1.1.3 Neoclassical production functions
In summary, the usual assumptions we make about production functions are:
•
Constant returns to scale:
𝐹𝐹 (𝑠𝑠𝑠𝑠, 𝑠𝑠𝑠𝑠, 𝑠𝑠𝑠𝑠) = 𝑠𝑠𝑠𝑠(𝐾𝐾, 𝐿𝐿, 𝑁𝑁)
A scaling of inputs of all factors of production by 𝑠𝑠 (for example, 𝑠𝑠 = 2 is doubling
inputs) scales output by 𝑠𝑠.
10
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
•
Positive but diminishing marginal products of factors:
πœ•πœ•πœ•πœ•
𝐹𝐹𝐾𝐾 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) =
>0
πœ•πœ•πœ•πœ•
πœ•πœ• 2 π‘Œπ‘Œ
𝐹𝐹𝐾𝐾𝐾𝐾 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) =
<0
πœ•πœ•πΎπΎ 2
,
These are for capital. The same assumptions hold for other factors. For land, 𝐹𝐹𝐿𝐿 > 0 and
𝐹𝐹𝐿𝐿𝐿𝐿 < 0, and for labour, 𝐹𝐹𝑁𝑁 > 0 and 𝐹𝐹𝑁𝑁𝑁𝑁 < 0.
•
Inada conditions:
lim 𝐹𝐹𝐾𝐾 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) = ∞
,
𝐾𝐾→0
lim 𝐹𝐹𝐾𝐾 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) = 0
𝐾𝐾→∞
These are for capital. The same assumptions hold in terms of 𝐹𝐹𝐿𝐿 and 𝐹𝐹𝑁𝑁 for land and
labour.
A production function that satisfies all three assumptions is called a ‘neoclassical production
function’. Figure 1.1 plots the relationship between output π‘Œπ‘Œ and capital 𝐾𝐾 for a neoclassical
production function π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝐿𝐿, 𝑁𝑁). This is not a plot of the entire production function
because land and labour are held constant at 𝐿𝐿0 and 𝑁𝑁0 but similar diagrams can be used to
show the relationship between π‘Œπ‘Œ and 𝐿𝐿, and π‘Œπ‘Œ and 𝑁𝑁, holding the other two factors fixed.
However, the constant returns to scale assumption cannot be illustrated in the diagram
because that would require changing all the factor inputs at the same time.
Figure 1.1: A neoclassical production function
The production function is upward sloping because its gradient is the marginal product of
capital, which is positive. The gradient declines as 𝐾𝐾 increases because the marginal product
of capital is diminishing. The Inada conditions imply the production function is extremely
steep for 𝐾𝐾 close to zero and flattens out as 𝐾𝐾 becomes very large. The production function
must also pass through the origin because some capital is essential for production given the
neoclassical assumptions.
11
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
1.1.4 The Cobb-Douglas production function
The most commonly used example of a neoclassical production function is the CobbDouglas functional form:
π‘Œπ‘Œ = 𝑧𝑧𝐾𝐾 𝛼𝛼 𝐿𝐿𝛽𝛽 𝑁𝑁 1−𝛼𝛼−𝛽𝛽
The parameters 𝛼𝛼 and 𝛽𝛽 each lie between 0 and 1, and the sum 𝛼𝛼 + 𝛽𝛽 is less than 1. The
Cobb-Douglas production function satisfies the three neoclassical assumptions. First, it has
constant returns to scale because:
𝑧𝑧(𝑠𝑠𝑠𝑠 )𝛼𝛼 (𝑠𝑠𝐿𝐿)𝛽𝛽 (𝑠𝑠𝑁𝑁)1−𝛼𝛼−𝛽𝛽 = 𝑠𝑠𝑠𝑠𝐾𝐾 𝛼𝛼 𝐿𝐿𝛽𝛽 𝑁𝑁 1−𝛼𝛼−𝛽𝛽
The marginal product of capital is:
𝑀𝑀𝑃𝑃𝐾𝐾 = 𝑧𝑧𝑧𝑧𝐾𝐾 𝛼𝛼−1 𝐿𝐿𝛽𝛽 𝑁𝑁 1−𝛼𝛼−𝛽𝛽
This is positive because 𝛼𝛼 > 0. It declines as 𝐾𝐾 increases because 𝛼𝛼 < 1, so the exponent of
capital in the expression for 𝑀𝑀𝑃𝑃𝐾𝐾 is negative. It follows that the marginal product of capital
is positive but diminishing. Using the partial derivatives with respect to land and labour, the
same is true of the marginal products of land and labour.
The expression for the marginal product of capital approaches infinity as 𝐾𝐾 → 0, and
approaches zero as 𝐾𝐾 → ∞. The same can be shown for the other factors. This confirms the
Inada conditions hold for the Cobb-Douglas production function.
1.1.5 The per worker production function
In many contexts, we are interested in how much output is produced per worker, rather
than total production. This is relevant if we want to calculate living standards in an
economy, which are connected to how much is produced per person.
Here, we focus on just two factors of production, capital 𝐾𝐾 and labour 𝑁𝑁. We assume a
neoclassical production function π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧 (𝐾𝐾, 𝑁𝑁). Output per worker, denoted by 𝑦𝑦 = π‘Œπ‘Œ/𝑁𝑁,
can be explained in terms of capital per worker π‘˜π‘˜ = 𝐾𝐾/𝑁𝑁 and TFP 𝑧𝑧.
𝑦𝑦 =
π‘Œπ‘Œ 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁)
𝐾𝐾 𝑁𝑁
=
= 𝑧𝑧𝑧𝑧 οΏ½ , οΏ½ = 𝑧𝑧𝑧𝑧(π‘˜π‘˜, 1) = 𝑧𝑧𝑧𝑧(π‘˜π‘˜)
𝑁𝑁
𝑁𝑁
𝑁𝑁 𝑁𝑁
This equation is derived using the constant returns to scale property of a neoclassical
production function which implies a scaling of all factors of production (here by 1/𝑁𝑁) is
equivalent to scaling output in the same proportion. In the above, the function 𝑓𝑓(π‘˜π‘˜) is
simply used as a shorthand for 𝐹𝐹(π‘˜π‘˜, 1).
Taking the Cobb-Douglas production function π‘Œπ‘Œ = 𝑧𝑧𝐾𝐾 𝛼𝛼 𝑁𝑁 1−𝛼𝛼 for example:
π‘Œπ‘Œ 𝑧𝑧𝐾𝐾 𝛼𝛼 𝑁𝑁 1−𝛼𝛼
𝐾𝐾 𝛼𝛼
𝛼𝛼 −𝛼𝛼
𝑦𝑦 = =
= 𝑧𝑧𝐾𝐾 𝑁𝑁 = 𝑧𝑧 οΏ½ οΏ½ = π‘§π‘§π‘˜π‘˜ 𝛼𝛼
𝑁𝑁
𝑁𝑁
𝑁𝑁
This confirms that 𝑦𝑦 = 𝑧𝑧𝑧𝑧(π‘˜π‘˜) with 𝑓𝑓(π‘˜π‘˜) = π‘˜π‘˜ 𝛼𝛼 in this case.
If the production function π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁) is neoclassical, the per worker production function
has an increasing and concave shape. Observe that aggregate output is π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧𝑧𝑧(𝐾𝐾⁄𝑁𝑁), so
12
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
the marginal product of capital is 𝑀𝑀𝑃𝑃𝐾𝐾 = πœ•πœ•πœ•πœ•⁄πœ•πœ•πœ•πœ• = 𝑧𝑧𝑓𝑓 ′ (𝐾𝐾⁄𝑁𝑁) = 𝑧𝑧𝑧𝑧′(π‘˜π‘˜). The neoclassical
assumptions state that 𝑀𝑀𝑃𝑃𝐾𝐾 is positive but diminishing in capital 𝐾𝐾, which must also hold for
capital per worker π‘˜π‘˜. This implies 𝑓𝑓′(π‘˜π‘˜) is an increasing and concave function of π‘˜π‘˜.
1.2 Factor markets and the distribution of income
The previous section shows a simple model of how the real GDP of an economy is
determined by its production technologies and its supplies of factors of production. As well
as being a measure of production, GDP is a measure of total income in an economy, so the
model also explains aggregate income. The next step is to ask how that aggregate income is
distributed among the factors of production.
Up to this point, we have not considered markets in our model of the economy, although
markets were implicit in how the factors of production were organised and allocated among
different uses. Here, markets are introduced to explain the distribution of income. We use
an analysis with firms in perfectly competitive markets for factors of production. An
example of a factor market is a market for labour, a market where firms can hire the
services of workers for a time. There are also markets for renting land and capital.
For each of the factor markets, there is a factor price. In the labour market, this is a wage 𝑀𝑀
per hour of labour, or per person working for a given amount of time. For land, there is a
rent π‘₯π‘₯ and for capital there is a rental price 𝑅𝑅. We do not consider markets for outright
purchases or sales of factors of production at this stage. In the factor markets, households
supply factors of production that they own, which are hired or rented by firms. Households
supply their own labour, together with land and capital goods. In this analysis, firms do not
own factors of production themselves. As in the earlier model of production, all supplies of
factors of production are taken as given, so the supply curves are price inelastic.
On the demand side, firms hire factors of production 𝐾𝐾, 𝐿𝐿 and 𝑁𝑁 to produce output π‘Œπ‘Œ,
taking prices and factor prices as given. The production function is π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝐿𝐿, 𝑁𝑁), which
has the same properties assumed earlier. Firms aim to maximise profits πœ‹πœ‹:
πœ‹πœ‹ = π‘Œπ‘Œ − 𝑅𝑅𝑅𝑅 − π‘₯π‘₯π‘₯π‘₯ − 𝑀𝑀𝑀𝑀
These profits are measured in real terms, making the price of a unit of output equal to 1.
The revenue of the firm is simply the quantity π‘Œπ‘Œ of output it produces and sells. Its costs are
its spending on hiring factors of production, which are given by the factor prices multiplied
by the quantities hired of each factor.
The first-order conditions for profit maximisation are πœ•πœ•πœ•πœ•⁄πœ•πœ•πœ•πœ• = 0, πœ•πœ•πœ•πœ•⁄πœ•πœ•πœ•πœ• = 0 and
πœ•πœ•πœ•πœ•⁄πœ•πœ•πœ•πœ• = 0, which are equivalent to the following:
𝑧𝑧𝐹𝐹𝐾𝐾 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) = 𝑅𝑅 , 𝑧𝑧𝐹𝐹𝐿𝐿 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) = π‘₯π‘₯ , 𝑧𝑧𝐹𝐹𝑁𝑁 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) = 𝑀𝑀
The terms 𝑧𝑧𝑧𝑧𝐾𝐾 , 𝑧𝑧𝑧𝑧𝐿𝐿 and 𝑧𝑧𝑧𝑧𝑁𝑁 denote the marginal products of capital, land and labour
respectively, where 𝐹𝐹𝐾𝐾 , 𝐹𝐹𝐿𝐿 , and 𝐹𝐹𝑁𝑁 are the partial derivatives of the function 𝐹𝐹(𝐾𝐾, 𝐿𝐿, 𝑁𝑁) with
respect to 𝐾𝐾, 𝐿𝐿, and 𝑁𝑁. These marginal products are diminishing in the quantity hired of
each factor, holding the quantities of the other factors constant. Hence, in each factor
market diagram with the factor price on the vertical axis and the quantity hired on the
13
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
horizontal axis, the demand curves of firms are the downward-sloping marginal product
curves. The supply curves are vertical, representing given price-inelastic supplies of each
factor.
In competitive markets, the factor prices are determined by market clearing. The
equilibrium rental prices of capital and land 𝑅𝑅∗ and π‘₯π‘₯ ∗ and the equilibrium wage 𝑀𝑀 ∗ are
found where the factor demand curves intersect the supply curves. This is illustrated in the
rental market for capital in Figure 1.2. The diagram shows how the amount of real capital
income 𝑅𝑅 ∗ per unit of capital owned is determined. The total amount of capital income is
then 𝑅𝑅 ∗ 𝐾𝐾, where 𝐾𝐾 is the supply of capital. Note that this is gross capital income – there is
no allowance made here for depreciation of capital, consistent with how GDP is a measure
of gross income. Similar diagrams can be used to see how rents of land and wages are
determined.
Figure 1.2: Factor market equilibrium
While firms are maximising profits when they choose their factor demands, perfect
competition and a constant-returns-to-scale production function imply that profits πœ‹πœ‹ are
zero in equilibrium. It is a general mathematical property that if 𝑧𝑧𝑧𝑧 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) has constant
returns to scale then:
𝑧𝑧𝑧𝑧 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) = 𝑧𝑧𝑧𝑧𝐹𝐹𝐾𝐾 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) + 𝑧𝑧𝑧𝑧𝐹𝐹𝐿𝐿 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) + 𝑧𝑧𝑧𝑧𝐹𝐹𝑁𝑁 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁)
Intuitively, constant returns to scale implies that a 1% increase in each of the factors of
production 𝐾𝐾, 𝐿𝐿, and 𝑁𝑁 adds 1 per cent to existing output π‘Œπ‘Œ, that is, this raises output by
0.01 × π‘§π‘§π‘§π‘§(𝐾𝐾, 𝐿𝐿, 𝑁𝑁). Adding 1 per cent to capital 𝐾𝐾 raises output by 0.01𝐾𝐾 × π‘§π‘§π‘§π‘§πΎπΎ , where 𝑧𝑧𝑧𝑧𝐾𝐾
is the marginal product of capital. Adding 1% to labour 𝐿𝐿 would raise output by
0.01𝑁𝑁 × π‘§π‘§π‘§π‘§π‘π‘ , and so on for all factors. Summing over all factors then confirms the equation
above.
Since profit-maximisation by firms in perfectly competitive markets equalises marginal
products and factor prices for each factor, it follows that
π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) = 𝑅𝑅𝑅𝑅 + π‘₯π‘₯π‘₯π‘₯ + 𝑀𝑀𝑀𝑀
14
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
This means that payments to factors of production will use up all a firm’s revenues, so
profits πœ‹πœ‹ are zero. Strictly speaking, this shows that economic profits are zero. Where firms
own factors of production themselves rather than renting them, which is often the case with
land and capital, some of the payments to factors described above effectively go the owner
of the firm. In that case, accounting profits would not be zero. However, these would not be
true economic profits but would instead represent the implicit rental of the factors of
production owned by the firm.
1.2.1 The Cobb-Douglas production function
As an example, consider the distribution of income when the production function has the
Cobb-Douglas functional form π‘Œπ‘Œ = 𝑧𝑧𝐾𝐾 𝛼𝛼 𝐿𝐿𝛽𝛽 𝑁𝑁 1−𝛼𝛼−𝛽𝛽 . As seen earlier, the marginal products of
capital, land and labour are 𝑀𝑀𝑃𝑃𝐾𝐾 = 𝛼𝛼𝛼𝛼⁄𝐾𝐾, 𝑀𝑀𝑃𝑃𝐿𝐿 = 𝛽𝛽𝛽𝛽 ⁄𝐿𝐿, and 𝑀𝑀𝑃𝑃𝑁𝑁 = (1 − 𝛼𝛼 − 𝛽𝛽)π‘Œπ‘Œ ⁄𝑁𝑁.
With these being equated to the factor prices 𝑅𝑅, π‘₯π‘₯, and 𝑀𝑀, each factor of production
receives a constant share of GDP as income:
𝑅𝑅𝑅𝑅
= 𝛼𝛼
π‘Œπ‘Œ
,
π‘₯π‘₯π‘₯π‘₯
= 𝛽𝛽
π‘Œπ‘Œ
,
𝑀𝑀𝑀𝑀
= 1 − 𝛼𝛼 − 𝛽𝛽
π‘Œπ‘Œ
The income shares are given by the exponents of each factor of production in the CobbDouglas formula. For example, the exponent of capital 𝐾𝐾 is 𝛼𝛼, a parameter between 0 and 1.
Total capital income 𝑅𝑅𝑅𝑅 as a fraction of GDP π‘Œπ‘Œ is equal to the parameter 𝛼𝛼.
Box 1.1: Understanding inequality in wages
The last few decades have seen rising income inequality within many countries. What
might explain why this has occurred? Here, we focus on inequality in wages rather than
income inequality more broadly (which would also consider capital income), or on
wealth inequality.
One important dimension of the rise in wage inequality is the increase in the relative
wages of highly skilled workers compared to those with more basic skills. While a
university or college education is not the only measure of having skills, the ‘collegewage premium’ in the USA and elsewhere has received much attention. The size of the
wage premium from attending university is crucial to the debate on the returns to
higher education. A large premium means the returns might be high even if the cost of
education has risen.
In the USA, prior to 1980, an average college-educated worker earned less than 60 per
cent extra compared to an average worker without a college education. By the 2010s,
this college-wage premium had risen to close to 100 per cent. At first glance, this is
puzzling because there has also been a substantial increase in the fraction of collegeeducated workers during that period, which rose from 20 per cent to 50 per cent. With
diminishing returns to individual factors of production, an increase in supply should
push down the factor payment, all else being equal. These observations suggest
something else must have changed after 1980.
15
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
One hypothesis we will explore is a shift in the relative demand for workers with
different levels of skill. We separate the supply of labour into highly skilled workers 𝐻𝐻
and unskilled workers 𝑁𝑁. The supply of 𝐻𝐻 is related to the concept of ‘human capital’
studied in Section 2.8. We consider the following example of a production function π‘Œπ‘Œ =
𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁, 𝐻𝐻):
π‘Œπ‘Œ = 𝑧𝑧(𝐾𝐾 𝛼𝛼 𝑁𝑁 1−𝛼𝛼 + 𝐡𝐡𝐾𝐾 𝛼𝛼 𝐻𝐻1−𝛼𝛼 )
As with a Cobb-Douglas production function, the parameter 𝛼𝛼, a number between 0 and
1, indicates the importance of physical capital in producing goods and services. The
variable 𝑧𝑧 is total factor productivity and a change in 𝑧𝑧 affects the marginal products of
all factors of production. What is new in the production function above is 𝐡𝐡, an
exogenous variable that represents what is known as ‘skill-biased technology’. A change
in 𝐡𝐡 affects the marginal product of skilled labour 𝐻𝐻 but not the marginal product of
unskilled labour 𝑁𝑁.
The production function above resembles two Cobb-Douglas production functions that
are added together. However, it is not possible to use a standard Cobb-Douglas
production function such as π‘Œπ‘Œ = 𝑧𝑧𝐾𝐾 𝛼𝛼 𝐻𝐻𝛽𝛽 𝑁𝑁 1−𝛼𝛼−𝛽𝛽 in this exercise. If 𝐻𝐻 were multiplied
by a coefficient 𝐡𝐡 then this would be algebraically equivalent to a change in total factor
productivity 𝑧𝑧. It is not possible to build in skill-biased technological change with a basic
Cobb-Douglas production function.
We now apply our analysis of the distribution of income. Competitive markets imply
wages 𝑀𝑀𝐻𝐻 and 𝑀𝑀𝑁𝑁 for skilled and unskilled labour that are equal to their marginal
products:
𝑀𝑀𝑁𝑁 = 𝑀𝑀𝑀𝑀𝑁𝑁 =
πœ•πœ•πœ•πœ•
= (1 − 𝛼𝛼 )𝑧𝑧𝐾𝐾 𝛼𝛼 𝑁𝑁 −𝛼𝛼
πœ•πœ•πœ•πœ•
,
𝑀𝑀𝐻𝐻 = 𝑀𝑀𝑃𝑃𝐻𝐻 =
πœ•πœ•πœ•πœ•
= (1 − 𝛼𝛼 )𝑧𝑧𝑧𝑧𝐾𝐾 𝛼𝛼 𝐻𝐻 −𝛼𝛼
πœ•πœ•πœ•πœ•
The implications for the relative wage 𝑀𝑀𝐻𝐻 /𝑀𝑀𝑁𝑁 can be deduced from these equations:
𝑀𝑀𝐻𝐻 (1 − 𝛼𝛼 )𝑧𝑧𝑧𝑧𝐾𝐾 𝛼𝛼 𝐻𝐻 −𝛼𝛼
𝐻𝐻 −𝛼𝛼
=
=
𝐡𝐡
οΏ½
οΏ½
(1 − 𝛼𝛼 )𝑧𝑧𝐾𝐾 𝛼𝛼 𝑁𝑁 −𝛼𝛼
𝑀𝑀𝑁𝑁
𝑁𝑁
The relative wage declines with the relative supply of high-skilled labour 𝐻𝐻/𝑁𝑁 but it
increases with skill-biased technological change, that is, higher 𝐡𝐡.
This logic suggests one explanation for the rising skill premium (the relative wage
𝑀𝑀𝐻𝐻 /𝑀𝑀𝑁𝑁 increasing) alongside an increase in the relative supply of skilled labour 𝐻𝐻/𝑁𝑁 is
skill-biased technological change. Skill-biased technological change is improvements in
technology that disproportionately boost the productivity of skilled workers compared
to unskilled workers. For example, advances in computing, telecommunications, data
science and e-commerce may increase demand for highly skilled workers but not
unskilled workers. These changes can be represented by an increase in 𝐡𝐡 rather than an
increase in total factor productivity 𝑧𝑧. Earlier technological progress that may have been
more uniform in its effects is represented by higher TFP 𝑧𝑧. As we have seen, higher 𝐡𝐡
can raise 𝑀𝑀𝐻𝐻 /𝑀𝑀𝑁𝑁 even though there is an increase in the relative supply 𝐻𝐻/𝑁𝑁 of skilled
workers. Changes in TFP do not affect relative wages 𝑀𝑀𝐻𝐻 /𝑀𝑀𝑁𝑁 .
16
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Skill-biased technological progress is not the only explanation for the rising skill
premium. Globalisation owing to lower barriers to international trade is another
explanation. Even if labour is not mobile internationally, free trade in goods tends to
equalise the skill premium across countries through equalising the relative prices of
goods that are produced more or less intensively using skilled labour. The skill premium
is then determined by the relative supply of skilled labour at world level, where there
are relatively fewer skilled workers than within advanced economies.
1.3 Population growth according to Malthus
We now extend the basic model of production and distribution so that the supplies of
factors of production can change over time. This allows us to consider the dynamics of
aggregate GDP and individual incomes. We begin by considering the supply of labour,
interpreted as the number of workers. One reason the supply of labour adjusts over time is
because the population rises or falls. Here, we study the theory of population growth put
forward by Malthus in the 18th century and its implications for economic growth.
A key prediction of the theory is that population growth holds down living standards when
production of goods depends on land that is in fixed supply. We will see that a Malthusian
model of the economy can explain the stagnation in per-capita incomes seen in most of the
world prior to the 19th century. Technological advances lead to population growth, but not
rising living standards.
1.3.1 Demographics
Malthus in his Essay on the principle of population argued that per capita income and
consumption affect population growth. Lower consumption per person leads to worse
nutrition and health, hence higher death rates and infant mortality, and lower or negative
population growth rate. Furthermore, lower income per person induces families to have
fewer children they would struggle to support, hence lower birth rates and a lower
population growth rate. Higher income and consumption have the opposite effects and
raise the population growth rate. These effects are larger when people are close to
subsistence.
In the model, assume that all individuals are workers. The current population and number of
workers is denoted by 𝑁𝑁. The future population is denoted 𝑁𝑁′, where the notation ′ refers
to a value of a variable in the next time period (the future). The population growth rate
between the current and future time periods is (𝑁𝑁 ′ − 𝑁𝑁)/𝑁𝑁. If 𝐢𝐢 is aggregate consumption,
𝑐𝑐 = 𝐢𝐢/𝑁𝑁 measures average consumption per person, which is a measure of average living
standards. A mathematical representation of the demographics assumed by Malthus is:
𝑁𝑁 ′
= 𝑔𝑔(𝑐𝑐 )
𝑁𝑁
17
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
The demographic function 𝑔𝑔(𝑐𝑐) is increasing in 𝑐𝑐, an example of which is depicted in Figure
1.3. The population growth rate is 𝑔𝑔(𝑐𝑐 ) − 1, so for levels of 𝑐𝑐 where 𝑔𝑔(𝑐𝑐 ) > 1 the
population is rising , and for levels of 𝑐𝑐 where 𝑔𝑔(𝑐𝑐 ) < 1 the population is falling.
Figure 1.3: Demographics and living standards
1.3.2 An agricultural economy
Malthus’s demographic assumption is particularly relevant for the predominantly
agricultural economies of the past when land 𝐿𝐿 and labour 𝑁𝑁 were the key factors of
production. We assume a neoclassical production function π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧(𝐿𝐿, 𝑁𝑁). The crucial
feature of land is that the supply 𝐿𝐿 remains constant over time. There is no capital, so no
investment and we ignore government and international trade in the model, hence average
consumption per person 𝑐𝑐 is equal to income per worker 𝑦𝑦 = π‘Œπ‘Œ/𝑁𝑁.
For the same amount of land available, more workers can produce more output but there
are diminishing returns to labour. Think of additional workers needing to use lower quality
land relative to that already being farmed, or instead work more intensively on land already
in use. These do not increase output of crops as much as if additional workers had access to
unlimited land of the same quality as that used by existing workers.
The per-worker production function in the Malthusian model is:
𝑦𝑦 =
π‘Œπ‘Œ 𝑧𝑧𝑧𝑧(𝐿𝐿, 𝑁𝑁)
𝐿𝐿 𝑁𝑁
=
= 𝑧𝑧𝑧𝑧 οΏ½ , οΏ½ = 𝑧𝑧𝑧𝑧 (𝑙𝑙, 1) = 𝑧𝑧𝑧𝑧 (𝑙𝑙 )
𝑁𝑁
𝑁𝑁
𝑁𝑁 𝑁𝑁
Here, 𝑙𝑙 = 𝐿𝐿/𝑁𝑁 denotes the amount of land available per worker and 𝑓𝑓(𝑙𝑙 ) is simply a
shorthand for 𝐹𝐹(𝑙𝑙, 1). This per-worker production function is illustrated in Figure 1.4, where
output per worker is increasing in land per worker 𝑙𝑙. Hence, as the population 𝑁𝑁 rises,
available land per worker 𝑙𝑙 declines, which reduces average output produced per worker.
This is a reflection of diminishing returns to labour when land is in fixed supply.
18
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Figure 1.4: Per-worker production function
1.3.4 Dynamics of a Malthusian economy
How do the population and living standards change over time in a Malthusian economy?
Using 𝑁𝑁′⁄𝑁𝑁 = 𝑔𝑔(𝑐𝑐) and 𝑐𝑐 = 𝑦𝑦 = 𝑧𝑧𝑧𝑧 (𝑙𝑙 ) = 𝑧𝑧𝑧𝑧(𝐿𝐿⁄𝑁𝑁), the change in the population over time
is determined by the equation:
𝐿𝐿
𝑁𝑁 ′
= 𝑔𝑔 �𝑧𝑧𝑧𝑧 οΏ½ οΏ½οΏ½
𝑁𝑁
𝑁𝑁
For a low current population 𝑁𝑁, 𝑓𝑓(𝐿𝐿⁄𝑁𝑁) is high and g(z𝑓𝑓(𝐿𝐿⁄𝑁𝑁)) is greater than 1, so 𝑁𝑁 ′ >
𝑁𝑁, which means the population is increasing over time. For a high current population 𝑁𝑁,
𝑓𝑓(𝐿𝐿⁄𝑁𝑁) is low and g(z𝑓𝑓(𝐿𝐿⁄𝑁𝑁)) is less than 1, so 𝑁𝑁 ′ < 𝑁𝑁, which means the population is
falling over time. This tells us that for given parameters of this model, such as the level of
technology 𝑧𝑧, the population converges to a steady state 𝑁𝑁 ∗ .
A steady state is a value of a variable such that once the economy reaches that level of the
variable, there is no further change in that variable over time. The steady state of the
Malthusian model is depicted in Figure 1.5. Since the population converges to a steady
state, per capita income and consumption also reach a steady state 𝑐𝑐 ∗ . For this to result in
zero population growth, it must be the solution of equation 𝑔𝑔(𝑐𝑐 ∗ ) = 1. This solution is
shown using the demographic function in the left panel of the figure. Intuitively, if 𝑐𝑐 > 𝑐𝑐 ∗
then the population rises, pushing down 𝑙𝑙 and 𝑦𝑦 = 𝑐𝑐.
19
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Figure 1.5: Steady state of the Malthusian model
Given the steady state 𝑦𝑦 ∗ = 𝑐𝑐 ∗ for output and consumption per worker, the steady state for
land per worker 𝑙𝑙 ∗ is the solution of 𝑦𝑦 ∗ = 𝑧𝑧𝑧𝑧(𝑙𝑙 ∗ ). This is found using the per worker
production function in the right panel of the figure. Finally, given 𝑙𝑙 ∗ , the population in
steady state is simply 𝑁𝑁 ∗ = 𝐿𝐿⁄𝑙𝑙 ∗ .
As we will see in Section 2.1, the Malthusian model’s prediction of stagnation in living
standards 𝑐𝑐 ∗ is consistent with historical evidence prior to the 19th century.
1.3.5 What does (or does not) help?
The conclusion that living standards 𝑐𝑐 stagnate in the long run continues to hold even if
technology 𝑧𝑧 improves. This can be seen from Figure 1.5 observing that steady-state 𝑐𝑐 ∗ is
independent of 𝑧𝑧. Better technology 𝑧𝑧 ultimately leads only to a larger population 𝑁𝑁 ∗
because an upward shift of the per worker production function with the same 𝑐𝑐 ∗ = 𝑦𝑦 ∗
results in lower 𝑙𝑙 ∗ . There would be higher living standards during the period until population
converges to its new higher steady state but not in the long run.
The discovery of new land 𝐿𝐿 does not help either in the long run. This would simply cause
the population to rise (higher 𝑁𝑁 ∗ ) with no change in 𝑙𝑙 ∗ or 𝑐𝑐 ∗ in the long run, although there
would be temporarily higher living standards before the population reaches its new steady
state.
Weakening the link between the birth rate and living standards would help. This
demographic transition shifts down the 𝑔𝑔(𝑐𝑐 ) line and leads to a higher steady-state for
living standards 𝑐𝑐 ∗ .
Structural transformation of economy of the economy is another way the Malthusian trap
can be escaped. Industrialisation of the economy reduces the dependence of production on
land in fixed supply. Crucially, since capital can be accumulated, this helps to avoid the
problem of diminishing returns to labour.
20
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Box 1.2: A 14th-century pandemic
Around 1350, Europe, North Africa and Western Asia were struck by a bubonic plague
pandemic (known as the ‘Black Death’). This pandemic is believed to have killed more than
a third of the population of the affected areas. The main economic effect of the pandemic
came from the shortages of labour it created.
In the Malthusian model, a pandemic causes a temporary downward shift of the
demographic function 𝑔𝑔(𝑐𝑐). The population growth rate is 𝑔𝑔(𝑐𝑐 ) − 1, so more deaths can be
represented by a lower 𝑔𝑔(𝑐𝑐) for each value of 𝑐𝑐. Starting from a steady state, lower 𝑔𝑔(𝑐𝑐)
means a falling population. With a fixed supply of land 𝐿𝐿, land per worker 𝑙𝑙 = 𝐿𝐿/𝑁𝑁 rises, so
output and consumption per worker 𝑐𝑐 = 𝑦𝑦 = 𝑧𝑧𝑧𝑧(𝑙𝑙) are higher. This is illustrated in Figure
1.6.
For the survivors, the pandemic leads to higher output per worker because land was
previously more scarce. This is true even though total GDP π‘Œπ‘Œ declines when the population
falls. Once pandemic is over, 𝑔𝑔(𝑐𝑐) returns to normal and population and living standards
ultimately go back to their former steady state unless something else changes.
Figure 1.6: Population growth and output per worker in a pandemic
The pandemic also has significant distributional effects. Let us consider how total output π‘Œπ‘Œ
is distributed among workers and owners of land in the Malthusian model. Workers do not
receive all of 𝑦𝑦 as income unless they own the land they use to produce. If competition
determines factor payments, wages 𝑀𝑀 and rents π‘₯π‘₯ are equal to the marginal products of
labour 𝑀𝑀𝑃𝑃𝑁𝑁 and land 𝑀𝑀𝑃𝑃𝐿𝐿 .
With a neoclassical production function, the marginal product of labour 𝑀𝑀𝑃𝑃𝑁𝑁 is diminishing
in the population 𝑁𝑁. Given a fixed supply of land 𝐿𝐿, a lower population 𝑁𝑁 means the
marginal product of labour is higher, so wages 𝑀𝑀 = 𝑀𝑀𝑃𝑃𝑁𝑁 rise. What about rents π‘₯π‘₯ = 𝑀𝑀𝑃𝑃𝐿𝐿 ?
The per worker production function implies π‘Œπ‘Œ = 𝑧𝑧𝑁𝑁𝑓𝑓 (𝐿𝐿⁄𝑁𝑁), which can be used to obtain
expressions for the marginal products of land and labour using the chain rule.
21
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
𝑀𝑀𝑃𝑃𝑁𝑁 =
𝑀𝑀𝑃𝑃𝐿𝐿 =
πœ•πœ•πœ•πœ•
𝑁𝑁
𝐿𝐿
= 𝑧𝑧 𝑓𝑓 ′ οΏ½ οΏ½ = 𝑧𝑧𝑧𝑧′(𝑙𝑙)
πœ•πœ•πœ•πœ•
𝑁𝑁
𝑁𝑁
πœ•πœ•πœ•πœ•
𝐿𝐿
𝐿𝐿
𝐿𝐿
= 𝑧𝑧𝑧𝑧 οΏ½ οΏ½ − 𝑧𝑧𝑧𝑧 2 𝑓𝑓 ′ οΏ½ οΏ½ = 𝑧𝑧(𝑓𝑓(𝑙𝑙 ) − 𝑙𝑙𝑙𝑙′(𝑙𝑙))
πœ•πœ•πœ•πœ•
𝑁𝑁
𝑁𝑁
𝑁𝑁
These equations imply 𝑀𝑀 = 𝑦𝑦 − π‘₯π‘₯π‘₯π‘₯, which says that wages are equal to output per worker
minus rent times land used per worker.
Both marginal products and hence wages and rents depend on the relative supply of land to
labour as measured by land-per-worker 𝑙𝑙 = 𝐿𝐿/𝑁𝑁. It can be seen that 𝑀𝑀𝑃𝑃𝐿𝐿 is diminishing in 𝑙𝑙
because 𝑓𝑓 (𝑙𝑙 ) = 𝐹𝐹(𝑙𝑙, 1), so 𝑓𝑓 ′′ (𝑙𝑙 ) < 0, whereas 𝑀𝑀𝑃𝑃𝑁𝑁 is increasing in 𝑙𝑙 because derivative of
𝑓𝑓(𝑙𝑙 ) − 𝑙𝑙𝑙𝑙′(𝑙𝑙) with respect to 𝑙𝑙 is 𝑓𝑓 ′ (𝑙𝑙 ) − 𝑓𝑓 ′ (𝑙𝑙 ) − 𝑙𝑙𝑓𝑓 ′′ (𝑙𝑙 ) = −𝑙𝑙𝑓𝑓 ′′ (𝑙𝑙 ) > 0. These relationships
are depicted in Figure 1.7. As the pandemic increases land per worker 𝑙𝑙, it causes a rise in
𝑀𝑀𝑃𝑃𝑁𝑁 and hence higher wages, but a fall in 𝑀𝑀𝑃𝑃𝐿𝐿 and hence lower rents.
Figure 1.7: Wages and rents with a lower population
1.4 Hours of work and the supply of labour
As well as changes in the population, the supply of labour also depends on how many hours
people work, which is related to decisions such as:
•
•
•
Full-time versus part-time work?
Participate or not in the labour market?
Early retirement or continue working?
Here, we take as given the number of workers, also the skills, education and training of
workers, returning later to the issue of human-capital accumulation.
In our analysis of labour supply, the key trade-off is that more hours of work lead to more
income and hence a greater ability to purchase goods and services but also less time for
other things such as leisure.
22
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
We use a simple static model (meaning there is no saving or borrowing) to study the choice
of labour supply by households. The supply of labour by a household (in units of time, e.g.
hours) is denoted by 𝑁𝑁. Time the household enjoys as leisure is denoted by 𝑙𝑙. ‘Leisure’ is a
catch-all term for anything other than time spent earning wages, so as well as leisure in the
usual sense, it also includes cooking, cleaning and childcare, activities known as ‘home
production’. While home production includes a component of work, the household benefits
from it in the sense that otherwise the household would have to pay for equivalent services
in the market, for example, eating out in restaurants, hiring a cleaner or a childminder.
A crucial constraint is that the household has a fixed amount of time β„Ž available in a given
day, week, or year. Time used for work cannot also be used for leisure:
𝑁𝑁 + 𝑙𝑙 = β„Ž
By working (not counting home production), the household is paid a wage 𝑀𝑀 per unit of
time, e.g. an hourly wage. This wage is specified in real terms, as with all other variables in
this chapter. If a household works for 𝑁𝑁 hours then total wage income is 𝑀𝑀𝑀𝑀. The ultimate
purpose of work is to use the income to buy goods and services. Consumption of goods and
services (not counting home production) is denoted by 𝐢𝐢.
Some households may also be able to use non-wage income such as dividend income from
owning shares to buy goods and services. The amount of non-wage income is denoted by πœ‹πœ‹.
More broadly, considering the household as a family, πœ‹πœ‹ could also be interpreted as the
income of an individual’s partner separate from the amount the individual earns direct.
Our analysis must also consider taxes, which influence how much households are able to
spend of their pre-tax incomes. For now, taxes are assumed to take a ‘lump sum’ form: an
amount of tax 𝑇𝑇 that must be paid irrespective of how much a household earns or
consumes, for example, a poll tax. While this simplifies matters, most taxes are not like this
and we will see what difference it makes by considering income taxes and consumption
taxes later. Note that we allow 𝑇𝑇 to be negative, indicating the household receives a
transfer payment from the government rather than being a taxpayer.
Given wage income 𝑀𝑀𝑀𝑀, non-wage income πœ‹πœ‹, and taxes 𝑇𝑇, the maximum amount of
consumption affordable to a household is:
𝐢𝐢 = 𝑀𝑀𝑀𝑀 + πœ‹πœ‹ − 𝑇𝑇
As this is a static model, there is no role for saving for the future in this budget constraint, so
households will consume their income. Instead, the purpose of the model is to analyse how
much labour households will supply, which affects their income and, hence, their
consumption.
Our analysis proceeds by combining the two constraints on time and spending power. Since
𝑁𝑁 = β„Ž − 𝑙𝑙, we can write a combined constraint in terms of the consumption 𝐢𝐢 and leisure 𝑙𝑙
that the household ultimately values:
𝐢𝐢 + 𝑀𝑀𝑀𝑀 = π‘€π‘€β„Ž + πœ‹πœ‹ − 𝑇𝑇
23
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
This budget constraint is plotted in Figure 1.8 with leisure 𝑙𝑙 on the horizontal and
consumption 𝐢𝐢 on the vertical axis. Given the time physically available, it is not possible to
have more than β„Ž hours of leisure. If a household chose the maximum amount of leisure 𝑙𝑙 =
β„Ž this would mean supplying no labour and thus the ability to consume goods would depend
solely on non-wage income after tax πœ‹πœ‹ − 𝑇𝑇. As leisure falls below β„Ž, hours of labour supplied
increase and each extra hour of labour adds 𝑀𝑀 to wage income, increasing the ability to
consume by 𝑀𝑀. Therefore, the budget constraint is a downward-sloping straight line with
gradient −𝑀𝑀 that passes through the point (β„Ž, πœ‹πœ‹ − 𝑇𝑇).
Figure 1.8: Constraint on consumption and leisure
Households like both more consumption 𝐢𝐢 and more leisure 𝑙𝑙 but the constraints imply
there is a trade-off between them. To study the optimal choice of leisure and, hence, the
supply of labour, we need to say more about preferences.
We describe the household’s preferences over 𝐢𝐢 and 𝑙𝑙 using indifference curves added to
the diagram with the budget constraint. Indifference curves are downward sloping because
less of one thing the household likes requires more of the other to compensate. Indifference
curves are also assumed to be convex to the origin, as depicted in Figure 1.9. This shape
reflects a dislike of extremes where the household has very little consumption or leisure but
much more of the other.
24
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Figure 1.9: Indifference curves over consumption and leisure
The shape of the indifference curves can be described in terms of a diminishing marginal
rate of substitution 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 between consumption 𝐢𝐢 and leisure 𝑙𝑙. The marginal rate of
substitution is how much extra of one good a household needs to be given to compensate
for the loss of one unit of another. The gradient of an indifference curve is −𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 , with
𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 representing how much extra consumption a household needs to receive to be no
worse off after giving up a unit of leisure by supplying more labour.
Consumption and leisure are also assumed to be normal goods. This means that when
households are better off and able to reach a higher indifference curve, they choose to have
more consumption and more leisure, holding constant the hourly wage 𝑀𝑀. In the diagram,
this means the line joining points on different indifference curves where the tangent lines
have the same gradient is upward sloping.
A household wants to reach the highest indifference curve subject to the constraints. As
shown in Figure 1.10, there are two general cases to consider. First, where it is optimal to
participate in the labour market (𝑙𝑙 < β„Ž), in which case the optimal consumption-leisure
choice is where an indifference curve is tangent to the constraint, mathematically, 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 =
𝑀𝑀. The second case is where the household does not find it optimal to participate in the
labour market (𝑙𝑙 = β„Ž) and the optimal choice of 𝐢𝐢 and 𝑙𝑙 is at the corner of constraint.
25
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Figure 1.10: Labour-market participation decision
Participation in the labour market is optimal if the marginal rate of substitution 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 at
zero labour supply (𝑙𝑙 = β„Ž) is less than the hourly real wage 𝑀𝑀, i.e. the indifference curve
passing through the corner of the constraint is less steep than the budget constraint to the
left of this point.
This logic indicates participation in the labour market is more likely when wages 𝑀𝑀 are high,
which makes the budget constraint steeper. High taxes 𝑇𝑇 or low transfer payments (the
negative of 𝑇𝑇) increase the likelihood of participation, moving the corner (β„Ž, πœ‹πœ‹ − 𝑇𝑇)
downwards. Similarly, a low level of other income πœ‹πœ‹ increases participation, with πœ‹πœ‹ being
low because the household has little wealth or an individual’s partner does not have a high
income. Finally, preferences can also matter, with a strong preference for consumption over
leisure (a low marginal rate of substitution 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 ) making participation more likely.
For those choosing to participate, we can also analyse how the number of hours worked
depends on these considerations.
1.5 The effects of wages on labour supply
This section studies how wages affect the supply of labour. We will use our analysis here to
derive a supply curve for labour. There are two aspects of the labour supply response to
wages. First, how do hours worked change for those participating in the labour market?
Second, how do wages affect the decision to participate or not in the labour market?
1.5.1 Effects on those already participating in the labour market
Let us first consider participants in the labour market. An increase in the real wage 𝑀𝑀 pivots
the budget constraint upwards, making a household better off all else being equal. A careful
study of how a household reacts to the wage change requires breaking down the response
into income and substitution effects.
Intuitively, the substitution effect captures the effect of wages on incentives. A higher wage
increases the price of leisure (more consumption is forgone by taking leisure), so a
household substitutes away from leisure towards consumption, which means choosing to
work more. The income effect captures the impact of wages on how well off households
26
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
are. A higher wage makes a household better off and, since consumption and leisure are
both normal goods, there is desire to enjoy more leisure by choosing to work less.
Income and substitution effects are analysed in Figure 1.11. A higher wage makes the
budget constraint steeper, pivoting it around the corner. Formally, the substitution effect
(SE) is found by considering the effects of the steeper budget constraint gradient,
controlling for whether the household is made better off or worse off. Since the higher
wage makes the household better off, the substitution effect can be isolated by also making
a parallel shift downwards of the budget constraint so that it is tangent to the original
indifference curve. This results in a new tangency point north-west of the original tangency
because this is where the indifference curve is steeper. Leisure falls (labour supply
increases) and consumption rises.
Figure 1.11: Income and substitution effects on labour supply
The income effect (IE) is isolated by removing the hypothetical parallel shift of the budget
constraint used to derive the substitution effect. Hence, the income effect results from a
parallel upward shift of the budget constraint in this case, causing a movement on to a
higher indifference curve in a north-east direction. Leisure and consumption both rise, so
labour supply falls.
Overall, combining the substitution effect and income effect to obtain the combined effect,
consumption must rise but leisure may rise or fall. Thus, the effect of wages on labour
supply is ambiguous. In the diagram, income and substitution effects exactly cancel out for
leisure and labour supply but this is a special case. In general, either the substitution effect
or the income effect could dominate. If the substitution effect dominates, leisure falls and
labour supply rises, while if the income effect dominates, leisure rises and labour supply
falls.
1.5.2 Effect on the labour-market participation decision
What about those not already participating in the labour market? A higher wage 𝑀𝑀 makes
the budget constraint steeper, pivoting it around the point of non-participation. Since the
27
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
indifference curve passing through that point needs to be steeper than the budget
constraint for non-participation to be optimal, a sufficiently high wage would cause an
individual to supply some labour. This is shown in Figure 1.12.
Figure 1.12: The effects of wages on labour-market participation
Taking πœ‹πœ‹ as given, a higher wage 𝑀𝑀 has no offsetting income effect on the participation
decision because a higher wage does not make non-participants better off if they are not
earning any labour income. However, if πœ‹πœ‹ is interpreted more broadly as including family
income from a partner who works, in this case higher wages have an income effect on the
household’s labour supply.
1.5.3 The labour supply curve
The labour supply curve shows the optimal choice of 𝑁𝑁 𝑠𝑠 = β„Ž − 𝑙𝑙 for each level of real wages
𝑀𝑀. An example is shown in Figure 1.13 with the real wage 𝑀𝑀 on the vertical axis and the
quantity of labour on the horizontal axis.
Figure 1.13: The labour supply curve
28
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
A labour supply curve can be drawn for an individual or for all households together. For
those participating in the labour market, hours worked increase with higher wages if the
substitution effect is larger than the income effect. For such households, the 𝑁𝑁 𝑠𝑠 curve is
upward sloping if the substitution effect dominates the income effect. For those not
participating initially, high wages make participation more likely and there is no offsetting
income effect until some labour is supplied by a household. An upward-sloping labour
supply curve drawn for all households can also represent more participation at higher wages
as well as those already working choosing to supply more hours.
If all households are participating in the labour market, we can think of the labour supply
curve as representing the optimality condition 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 = 𝑀𝑀 with the marginal rate of
substitution between leisure and consumption rising as more labour is supplied because
leisure falls.
The labour supply curve shifts if there is a change in any variable that matters for optimal
labour supply other than the real wage. Lower non-wage income πœ‹πœ‹, or higher taxes 𝑇𝑇, would
make a household worse off, reducing demand for leisure as a normal good and increasing
the supply of labour. This would cause 𝑁𝑁 𝑠𝑠 to shift to the right.
A very long-run perspective on labour supply is provided by the 160 years of data for the UK
shown in Figure 1.14. This graph shows time series of real wages, average hours worked per
week for those who have jobs and the fraction of the whole population who have jobs. Over
the 160 years, UK real wages rise by a factor of 20. Hours per worker, though, fall by around
half over this period. A broad measure of labour-market participation, the ratio of workers
to the total population (not adjusting for those of ‘working age’) does not display any clear
trend. One interpretation of this evidence points to the importance of income effects as UK
workers became significantly better off over this period and chose to work fewer hours.
Figure 1.14: UK wages and labour supply in the long run
29
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
1.5.4 Do higher tax rates raise more revenue?
In the household budget constraint, taxes 𝑇𝑇 were assumed to have a ‘lump sum’ form
because the amount 𝑇𝑇 paid to the government does not depend on the household’s
choices. In practice, the amount of tax paid depends on individual behaviour. For example, a
proportional labour income tax has households pay a fixed percentage of labour income as
tax. If the household chooses to work more, more tax will be paid. Different from earlier,
this means that taxes also have effects on incentives.
Assume there is a proportional labour income tax rate of 𝜏𝜏, for example, 𝜏𝜏 = 0.2 if wages
are taxed at a 20 per cent rate. The pre-tax wage is denoted by 𝑀𝑀, and labour supply by 𝑁𝑁 𝑠𝑠 .
The amount of tax revenue collected by the government is 𝑇𝑇 = πœπœπœπœπ‘π‘ 𝑠𝑠 in this case. The aftertax real wage is (1 − 𝜏𝜏)𝑀𝑀, so the households now supply labour up to the point where:
𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 = (1 − 𝜏𝜏)𝑀𝑀
A higher tax rate 𝜏𝜏 reduces the after-tax wage (1 − 𝜏𝜏)𝑀𝑀, so the effects of tax are similar to
those of lower wages.
Since the tax rate 𝜏𝜏 affects tax revenue 𝑇𝑇 = πœπœπœπœπ‘π‘ 𝑠𝑠 indirectly through its impact on
behaviour 𝑁𝑁 𝑠𝑠 as well as directly, the relationship between 𝜏𝜏 and 𝑇𝑇 is not always positive.
This leads to the ‘Laffer curve’ relationship between tax rates and revenue shown in Figure
1.15. To understand the Laffer curve, note that a 0 per cent tax rate obviously generates no
revenue. On the other hand, a 100 per cent tax rate implies no incentive to supply labour
because the after-tax wage is zero, so no tax revenue would be obtained in this case as
𝑁𝑁 𝑠𝑠 = 0. This basic logic indicates there is a tax rate somewhere between 0 per cent and 100
per cent where the Laffer curve peaks and tax revenue is maximised. After this point, higher
tax rates would reduce revenue.
Figure 1.15: A Laffer curve
Although the Laffer curve implies that ever higher tax rates eventually result in lower
revenue, it does not give specific guidance at which tax rate 𝜏𝜏 revenue will start to fall as 𝜏𝜏
rises. This is an empirical question.
30
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
1.6 Equilibrium and efficiency
Having studied firms’ demand for labour in the context of the distribution of income in
Section 1.2 and now households’ supply of labour in Section 1.4, we can put the two
together to set up our first simple macroeconomic model. At this stage, the model is static
with only a single period, which misses out many important macroeconomic issues we will
address later.
1.6.1 A static macroeconomic model
This first macroeconomic model looks at the implications of optimising behaviour of
households and firms in the markets for labour and goods, where those markets will ‘clear’,
i.e. reach an equilibrium between demand and supply. The model will also have a
government that chooses a fiscal policy setting tax and public expenditure.
In the model, assume that all households share the same preferences (same indifference
curves over consumption and leisure) and all have equal claims on non-wage income (which
arises from ownership of capital or land). In this case, there is said to be a ‘representative
household’: the economy is comprised of many households, each of which is small relative
to the size of the economy but all will optimally choose to behave the same way because
they have the same preferences and face the same constraints.
As we have seen, optimisation by households implies a labour supply curve 𝑁𝑁 𝑠𝑠 . Since all
households are the same and will have to participate in the labour market in equilibrium, we
can represent the labour supply curve by the optimality condition 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 = 𝑀𝑀, where
𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 is the marginal rate of substitution between leisure 𝑙𝑙 = β„Ž − 𝑁𝑁 𝑠𝑠 and consumption 𝐢𝐢,
and 𝑀𝑀 is the real wage.
We have also seen that profit maximisation by firms implies a labour demand curve. Firms
hire labour up to the point where 𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑀𝑀, where 𝑀𝑀𝑃𝑃𝑁𝑁 is the marginal product of labour
and 𝑁𝑁 is employment. Firms face a neoclassical production function where labour and
capital, and/or land are used to produce goods.
Other factors of production apart from labour, such as land or capital, are assumed to be in
fixed supply. This means we do not consider changes in the capital stock through investment
and there is no depreciation of existing capital. Factors of production are equally owned by
all households.
The government’s fiscal policy sets the level of public expenditure 𝐺𝐺, interpreted as
government purchases of privately produced goods and services. This expenditure is
financed by a lump-sum tax 𝑇𝑇 and every household faces the same tax 𝑇𝑇, so there is no
redistribution. Since the model is static, there is no scope here for government budget
deficits and debt, so the government’s budget constraint is 𝑇𝑇 = 𝐺𝐺.
1.6.2 Equilibrium in labour and goods markets
The labour market of the model is shown in Figure 1.16. Firms’ demand for labour 𝑁𝑁 𝑑𝑑 is
determined by 𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑀𝑀, and households’ supply of labour is determined by 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 = 𝑀𝑀.
The model is based on the real wage 𝑀𝑀 adjusting to clear the labour market, i.e. achieve
31
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
𝑁𝑁 𝑑𝑑 = 𝑁𝑁 𝑠𝑠 , so the desired supply of labour is equal to the amount firms want to demand. This
occurs if the real wage 𝑀𝑀 adjusts to 𝑀𝑀 ∗ where the demand and supply curves cross.
Figure 1.16: Labour market equilibrium
Once equilibrium employment 𝑁𝑁 ∗ is known, the amount of goods and services π‘Œπ‘Œ produced
by firms is determined because all other factors (land, capital) are in fixed supply. This is the
amount firms supply to the goods market. On the demand side, households buy goods for
consumption 𝐢𝐢 and the government to provide public services 𝐺𝐺, so aggregate demand is
𝐢𝐢 + 𝐺𝐺. The government’s fiscal policy sets 𝐺𝐺 and households choose 𝐢𝐢 subject to their
budget constraint when making the consumption-leisure trade-off that underlies labour
supply. Equilibrium of the goods market requires 𝐢𝐢 + 𝐺𝐺 = π‘Œπ‘Œ.
It turns out that another diagram for the goods market is not necessary given that labourmarket equilibrium has already been found. The amount of non-wage income received by
households is πœ‹πœ‹ = π‘Œπ‘Œ − 𝑀𝑀𝑁𝑁 𝑑𝑑 . This is true irrespective of whether households or firms own
factors of production such as land and capital as households will ultimately receive these
factor payments as the owners of firms.
The household budget constraint is 𝐢𝐢 = 𝑀𝑀𝑁𝑁 𝑠𝑠 + πœ‹πœ‹ − 𝑇𝑇, and substituting the government
budget constraint 𝐺𝐺 = 𝑇𝑇 and the equation for πœ‹πœ‹, it follows that 𝐢𝐢 = π‘Œπ‘Œ + 𝑀𝑀(𝑁𝑁 𝑠𝑠 − 𝑁𝑁 𝑑𝑑 ) − 𝐺𝐺.
Writing this as π‘Œπ‘Œ − (𝐢𝐢 + 𝐺𝐺 ) = 𝑀𝑀(𝑁𝑁 𝑑𝑑 − 𝑁𝑁 𝑠𝑠 ), labour-market equilibrium 𝑁𝑁 𝑑𝑑 = 𝑁𝑁 𝑠𝑠 implies
𝐢𝐢 + 𝐺𝐺 = π‘Œπ‘Œ, so the goods market must also be in equilibrium.
1.6.3 Economic efficiency
One important implication of equilibrium, at least for the simple model studied here, is that
the outcomes for employment, output and consumption are economically efficient,
conditional on the government’s choice of public expenditure 𝐺𝐺. This is because 𝑁𝑁 𝑑𝑑 = 𝑁𝑁 𝑠𝑠
means that 𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑀𝑀 ∗ = 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 , so the market-clearing real wage is equal to the marginal
product of labour and the marginal rate of substitution between leisure and consumption.
Intuitively, once the economy reaches equilibrium, the marginal value (measured in goods)
that households put on a unit of their time is equal to what amount of goods firms can
32
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
produce with that unit of time. A basic function of an economy is to allow households to
turn their time as work into income that can be spent on goods and services. Here, the
market economy performs this task as well as possible and we say that the market
equilibrium is economically efficient, or Pareto efficient.
Note that we say nothing here about whether the government’s choice of public
expenditure 𝐺𝐺 is optimal and, hence, whether the allocation of resources between private
spending 𝐢𝐢 and public spending 𝐺𝐺 is optimal. The desirability of a particular amount of
public expenditure is taken as given here.
A more careful way to reach the efficiency result is to imagine a hypothetical world where a
government can control all economic decisions (consumption, employment, etc.) without
the need for markets. The government is said to be the ‘social planner’ in this case. Assume
the government acts benevolently with the aim of making the representative household as
well off as possible.
The economy’s ability to produce goods and services is still limited by the production
function π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁), where 𝐾𝐾 is denotes a factor of production other than labour, for
example, capital. The supply of this factor is fixed here. Assume that a particular level of
public expenditure 𝐺𝐺 is desirable. The government faces a resource constraint 𝐢𝐢 + 𝐺𝐺 = π‘Œπ‘Œ
and a constraint 𝑙𝑙 + 𝑁𝑁 = β„Ž on households’ time. Combining these and the production
function leads to a single constraint 𝐢𝐢 = 𝑧𝑧𝑧𝑧 (𝐾𝐾, β„Ž − 𝑙𝑙 ) − 𝐺𝐺 linking consumption 𝐢𝐢 and
leisure 𝑙𝑙, the two things households ultimately care about. The constraint and the
representative household’s indifference curves are illustrated in Figure 1.17.
The ability to raise consumption by reducing leisure and setting households to work longer
is found by differentiating the constraint with respect to leisure 𝑙𝑙:
πœ•πœ•πœ•πœ•
= −𝑧𝑧𝐹𝐹𝑁𝑁 (𝐾𝐾, β„Ž − 𝑙𝑙 ) = −𝑀𝑀𝑃𝑃𝑁𝑁
πœ•πœ•πœ•πœ•
The gradient is the negative of the marginal product of labour, so the constraint is
downward sloping and becomes steeper as 𝑙𝑙 rises (because 𝑁𝑁 falls, increasing 𝑀𝑀𝑃𝑃𝑁𝑁 ). To
make representative-household utility as high as possible subject to the constraint, the
social planner would choose a combination (𝑙𝑙, 𝐢𝐢) such that 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 = 𝑀𝑀𝑃𝑃𝑁𝑁 , where an
indifference curve is tangent to the constraint.
33
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Figure 1.17: The social planner allocation
Since the market equilibrium features 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 = 𝑀𝑀𝑃𝑃𝑁𝑁 and satisfies the same constraints
faced by the government, the market equilibrium and the benevolent planner’s choice
coincide. It is not possible to make the representative household better off than in the
market equilibrium, so the equilibrium is efficient.
More generally, without a representative household, the market equilibrium is said to be
Pareto-efficient if no one can be made better off without making someone else worse off. It
is a general result (the first welfare theorem) that an equilibrium is Pareto efficient if:
•
•
•
Markets are perfectly competitive
There are no externalities or tax distortions
There are no missing markets or restrictions on trade
We will see many examples later where an equilibrium is not efficient. In these cases, the
economy is failing in its basic function to allow households to convert their time into work
and enjoy the fruits of their labours.
Box 1.4: Should wages or rents be taxed to pay for public expenditure?
This application addresses the question of how a government should best pay for a given
amount of public services 𝐺𝐺 it needs to provide. Here we assume that lump-sum taxes are
not available. Instead, the government is restricted to taxing different types of income.
Assume firms produce output of goods and services using land 𝐿𝐿 and labour 𝑁𝑁. The
production function is π‘Œπ‘Œ = π‘Žπ‘Žπ‘Žπ‘Ž + 𝑏𝑏𝑏𝑏, which is linear in both 𝐿𝐿 and 𝑁𝑁. This is not a
neoclassical production function but it is useful for illustration and the arguments
developed here apply more generally. The marginal products of labour and land are
𝑀𝑀𝑃𝑃𝑁𝑁 = π‘Žπ‘Ž and 𝑀𝑀𝑃𝑃𝐿𝐿 = 𝑏𝑏, where π‘Žπ‘Ž and 𝑏𝑏 are positive constants. With firms hiring labour
and renting land in competitive factor markets, the pre-tax wage 𝑀𝑀 and rent π‘₯π‘₯ must be
equal to these constant marginal products:
𝑀𝑀 = 𝑀𝑀𝑃𝑃𝑁𝑁 = π‘Žπ‘Ž
,
34
π‘₯π‘₯ = 𝑀𝑀𝑃𝑃𝐿𝐿 = 𝑏𝑏
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
The economy has a fixed supply of land 𝐿𝐿. The supply of labour 𝑁𝑁 is chosen by a
representative household that owns an equal share of the economy’s land. The
government can only levy proportional taxes on incomes, setting a tax rate πœπœπ‘€π‘€ on wage
income and a tax rate 𝜏𝜏π‘₯π‘₯ on rental income. The total amount of tax revenue raised is
πœπœπ‘€π‘€ 𝑀𝑀𝑀𝑀 + 𝜏𝜏π‘₯π‘₯ π‘₯π‘₯π‘₯π‘₯. With 𝑀𝑀 and π‘₯π‘₯ equal to the constants π‘Žπ‘Ž and 𝑏𝑏, the budget constraint of
the government is:
π‘Žπ‘Žπœπœπ‘€π‘€ 𝑁𝑁 + π‘π‘πœπœπ‘₯π‘₯ 𝐿𝐿 = 𝐺𝐺
The total amount of income (wages and rents) the representative household receives
after tax is (1 − πœπœπ‘€π‘€ )𝑀𝑀𝑀𝑀 + (1 − 𝜏𝜏π‘₯π‘₯ )π‘₯π‘₯π‘₯π‘₯ and the budget constraint is 𝐢𝐢 = π‘Žπ‘Ž(1 − πœπœπ‘€π‘€ )𝑁𝑁 +
𝑏𝑏(1 − 𝜏𝜏π‘₯π‘₯ )𝐿𝐿. With a given amount of time β„Ž available, labour supply is equal to 𝑁𝑁 = β„Ž −
𝑙𝑙, where 𝑙𝑙 is the choice of leisure. The combined constraint faced by the household is:
𝐢𝐢 + π‘Žπ‘Ž(1 − πœπœπ‘€π‘€ )𝑙𝑙 = π‘Žπ‘Ž(1 − πœπœπ‘€π‘€ )β„Ž + 𝑏𝑏(1 − 𝜏𝜏π‘₯π‘₯ )𝐿𝐿
1.6.4 Taxing wages but not rents
Suppose initially that only wage income is taxed, so 𝜏𝜏π‘₯π‘₯ = 0. The government’s budget
constraint simplifies to π‘Žπ‘Žπ‘Žπ‘Žπ‘€π‘€ 𝑁𝑁 = 𝐺𝐺 and the household’s budget constraint is 𝐢𝐢 +
π‘Žπ‘Ž(1 − πœπœπ‘€π‘€ )𝑙𝑙 = π‘Žπ‘Ž(1 − πœπœπ‘€π‘€ )β„Ž + 𝑏𝑏𝑏𝑏. The household chooses 𝐢𝐢 and 𝑙𝑙 at the tangency of the
budget constraint and an indifference curve, i.e. where 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 = π‘Žπ‘Ž (1 − πœπœπ‘€π‘€ ). The
gradient of the budget constraint is the after-tax wage π‘Žπ‘Ž(1 − πœπœπ‘€π‘€ ). Figure 1.18 depicts
this tangency point (𝑙𝑙1∗ , 𝐢𝐢1∗ ) as the economy’s initial equilibrium.
Figure 1.18: Taxes on rents instead of taxes on wages
1.6.5 Taxing rents but not wages
Now suppose the government switches completely to taxing rents instead of wages, so
πœπœπ‘€π‘€ = 0. The government’s budget constraint is now π‘π‘πœπœπ‘₯π‘₯ 𝐿𝐿 = 𝐺𝐺. For this alternative tax
system to be feasible it is necessary that 𝑏𝑏𝑏𝑏 > 𝐺𝐺, which says that there is enough rental
income to tax given the need for public expenditure (the government budget constraint
can hold for a tax rate less than 100 per cent). This is an important limitation on this
analysis that should be borne in mind.
35
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
The household budget constraint in this case is 𝐢𝐢 + π‘Žπ‘Žπ‘Žπ‘Ž = π‘Žπ‘Žβ„Ž + 𝑏𝑏(1 − 𝜏𝜏π‘₯π‘₯ )𝐿𝐿. This has
gradient −π‘Žπ‘Ž instead of −π‘Žπ‘Ž(1 − πœπœπ‘€π‘€ ), so the budget constraint becomes steeper as the aftertax wage rises. The budget constraint also shifts down at 𝑙𝑙 = β„Ž, reflecting the reduction in
non-wage income after tax.
A crucial observation is that the choice of 𝐢𝐢1∗ and 𝑙𝑙1∗ under the previous tax system remains
affordable under the new one. This is because 𝐢𝐢1∗ and 𝑙𝑙1∗ satisfy π‘Žπ‘Žπœπœπ‘€π‘€ (β„Ž − 𝑙𝑙1∗ ) = π‘Žπ‘Žπœπœπ‘€π‘€ 𝑁𝑁1∗ =
𝐺𝐺 = π‘π‘πœπœπ‘₯π‘₯ 𝐿𝐿 and 𝐢𝐢1∗ + π‘Žπ‘Ž(1 − πœπœπ‘€π‘€ )𝑙𝑙1∗ = π‘Žπ‘Ž(1 − πœπœπ‘€π‘€ )β„Ž + 𝑏𝑏𝑏𝑏, so they also consistent with the new
budget constraint 𝐢𝐢1∗ + π‘Žπ‘Žπ‘™π‘™1∗ = π‘Žπ‘Žβ„Ž + 𝑏𝑏(1 − 𝜏𝜏π‘₯π‘₯ )𝐿𝐿. Therefore, the original choice of leisure
and consumption remains on the new budget constraint. Since the budget constraint is now
steeper, it must cut the indifference curve at this point because there is a tangency with a
less steep budget line there.
It is possible to reach a higher indifference curve under the new tax system by choosing less
leisure, a higher labour supply, and more consumption. The analysis indicates a switch from
taxing wages to taxing rents makes the representative household better off. This is because
households’ labour supply choice is distorted by a proportional income tax that
disincentivises work. On the other hand, the supply of land is inelastic and does not respond
to tax. The removal of this distortion to labour supply and production allows the
representative household to reach a higher indifference curve.
As noted, in practice, rental income may not be high enough to shift tax burden completely
away from wages (which would require total pre-tax rents exceed total public expenditure,
𝑏𝑏𝑏𝑏 > 𝐺𝐺). By having a representative household, the analysis also ignores the distributional
consequences of such shifts in the tax system.
1.7 Capital accumulation
Modern industrial or service-based economies produce output mainly using capital and
labour rather than land. Capital is defined as goods used for the production of other goods
and services in the future (in other words, capital is not an intermediate input that is
immediately used up in current production). Capital includes such things as machinery,
buildings, computers, and aeroplanes. An important question we will address is whether
economic growth can be explained through a process of accumulating capital. In Section
2.30, we will also look at whether different levels of capital accumulation across countries
can explain differences in countries’ income levels.
Capital 𝐾𝐾 used for production is a stock variable, not a flow. Adding new capital to the
capital stock is a flow variable known as investment 𝐼𝐼. While capital is not immediately used
up in producing other goods, it does not last forever, in other words, there is depreciation.
Depreciation is the loss of capital from wear and tear or obsolescence, or maintenance costs
incurred to avoid this loss. We assume depreciation of capital takes place at a constant rate
𝑑𝑑 over time. The following equation describes the process of capital accumulation:
𝐾𝐾 ′ = (1 − 𝑑𝑑 )𝐾𝐾 + 𝐼𝐼
36
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Next year’s capital stock 𝐾𝐾′ is equal to capital left over (1 − 𝑑𝑑 )𝐾𝐾 after depreciation from the
current time plus investment 𝐼𝐼.
Focusing on capital and labour as the relevant factors of production and ignoring land, the
production function is:
π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧 (𝐾𝐾, 𝑁𝑁)
GDP is π‘Œπ‘Œ, the labour force is 𝑁𝑁 (the number of workers), the capital stock is 𝐾𝐾, and total
factor productivity (TFP) is 𝑧𝑧. The production function 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁) is assumed to be
neoclassical, and the most important of the neoclassical assumptions here is the diminishing
marginal product of capital. The Cobb-Douglas production function π‘Œπ‘Œ = 𝑧𝑧𝐾𝐾 𝛼𝛼 𝑁𝑁 1−𝛼𝛼 is one
such example of a neoclassical production function. The value of the parameter 𝛼𝛼 could be
estimated using the capital share of income.
How is the capital stock 𝐾𝐾 measured? There are two approaches. First, the perpetual
inventory method. The change in the capital stock from one year to the next can be
calculated using 𝐾𝐾 ′ − 𝐾𝐾 = 𝐼𝐼 − 𝑑𝑑𝑑𝑑. Hence, given an estimate of 𝐾𝐾, the capital stock 𝐾𝐾′ can
be estimated by adding investment 𝐼𝐼 from the national accounts and subtracting an
estimate of the depreciation rate 𝑑𝑑 multiplied by 𝐾𝐾. Starting from some conjectured initial
value, this method can be applied iteratively to construct a time series of capital-stock
estimates.
A second method is based on imputation from capital income. Suppose we have an estimate
of the gross percentage return on capital 𝑅𝑅. Then given a measure of GDP π‘Œπ‘Œ and the capital
share of GDP 𝛼𝛼, the implied capital stock is 𝐾𝐾 = 𝛼𝛼𝛼𝛼⁄𝑅𝑅 .
In studying capital accumulation, we will mainly be concerned with the output per worker
𝑦𝑦 = π‘Œπ‘Œ/𝑁𝑁, not total output π‘Œπ‘Œ. Since the production function π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁) has constant
returns to scale, output per worker is given by:
𝑦𝑦 =
𝐾𝐾 𝑁𝑁
𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁)
= 𝑧𝑧𝑧𝑧 οΏ½ , οΏ½ = 𝑧𝑧𝑧𝑧 (π‘˜π‘˜, 1) = 𝑧𝑧𝑧𝑧(π‘˜π‘˜)
𝑁𝑁
𝑁𝑁 𝑁𝑁
This shows that 𝑦𝑦 depends on capital per worker π‘˜π‘˜ = 𝐾𝐾/𝑁𝑁 and TFP 𝑧𝑧. The equation 𝑦𝑦 =
𝑧𝑧𝑧𝑧(π‘˜π‘˜) is called the ‘per worker production function’, where the function 𝑓𝑓 (π‘˜π‘˜) is defined by
𝑓𝑓 (π‘˜π‘˜) = 𝐹𝐹(π‘˜π‘˜, 1). It is an increasing and concave function as shown in Figure 1.19 because
𝑧𝑧𝑓𝑓′(π‘˜π‘˜) is the marginal product of capital.
37
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Figure 1.19: Output per worker and capital per worker
1.8 The Solow model
The Solow model explains the level of capital accumulation in an economy with a
neoclassical production function π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁) and its implications for the level and growth
rate of real GDP. To begin with, we will assume that TFP 𝑧𝑧 is constant over time. Later in 0,
we consider a version of the Solow model where 𝑧𝑧 is increasing over time owing to
technological progress.
The Solow model focuses on capital accumulation, so the supply of labour 𝑁𝑁, which is both
the labour force and the population, is taken to be exogenous. Assume that 𝑁𝑁 grows at rate
𝑛𝑛 over time. The Solow model assumes a closed economy with no government sector, which
implies investment 𝐼𝐼 is equal to household saving 𝑆𝑆 in equilibrium. This follows from the
definition 𝑆𝑆 = π‘Œπ‘Œ − 𝐢𝐢 with no taxes, and the goods-market equilibrium condition π‘Œπ‘Œ = 𝐢𝐢 + 𝐼𝐼
with no public expenditure or international trade. Households’ saving behaviour is
exogenous – specifically, households save a given fraction 𝑠𝑠 of income π‘Œπ‘Œ.
Mathematically, the assumptions of the Solow model are the following equations:
•
•
•
•
•
The neoclassical production function: π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁)
The capital accumulation equation: 𝐾𝐾 ′ = (1 − 𝑑𝑑 )𝐾𝐾 + 𝐼𝐼
Investment equals saving condition: 𝐼𝐼 = 𝑆𝑆
Households’ saving behaviour: 𝑆𝑆 = 𝑠𝑠𝑠𝑠
The labour force grows at a constant rate: 𝑁𝑁 ′ = (1 + 𝑛𝑛)𝑁𝑁
Putting together the production function, the requirement that investment equals saving
and the fixed saving rate, the implications for next year’s capital stock 𝐾𝐾′ are:
𝐾𝐾 ′ = (1 − 𝑑𝑑 )𝐾𝐾 + 𝑠𝑠𝑠𝑠𝑠𝑠(𝐾𝐾, 𝑁𝑁)
Together with the demographic assumption of constant population growth, we have
equations for 𝐾𝐾′ and 𝑁𝑁′ in terms of 𝐾𝐾 and 𝑁𝑁, so we can calculate how the supplies of factors
of production evolve over time in the economy.
38
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
The per worker production function 𝑦𝑦 = 𝑧𝑧𝑧𝑧(π‘˜π‘˜) implies output per worker depends on the
ratio of 𝐾𝐾 to 𝑁𝑁. As 𝑦𝑦 is the ultimate variable of interest, we only need to keep track of the
ratio of capital per worker π‘˜π‘˜ = 𝐾𝐾/𝑁𝑁 over time. By combining the equations for 𝐾𝐾 ′ and 𝑁𝑁 ′ :
π‘˜π‘˜ ′ =
𝐾𝐾 ′ (1 − 𝑑𝑑 )𝐾𝐾 + 𝑠𝑠𝑠𝑠 (1 − 𝑑𝑑 )π‘˜π‘˜ + 𝑠𝑠𝑠𝑠
=
=
𝑁𝑁′
(1 + 𝑛𝑛)𝑁𝑁
1 + 𝑛𝑛
Using 𝑦𝑦 = 𝑧𝑧𝑧𝑧(π‘˜π‘˜), the right-hand side of the equation depends on capital per worker π‘˜π‘˜ only:
π‘˜π‘˜ ′ =
(1 − 𝑑𝑑 )π‘˜π‘˜ + 𝑠𝑠𝑠𝑠𝑠𝑠(π‘˜π‘˜)
1 + 𝑛𝑛
Subtracting π‘˜π‘˜ from both sides leads to an equation for the change in π‘˜π‘˜ over time:
π‘˜π‘˜ ′ − π‘˜π‘˜ =
𝑠𝑠𝑠𝑠𝑠𝑠 (π‘˜π‘˜) − (𝑑𝑑 + 𝑛𝑛)π‘˜π‘˜
1 + 𝑛𝑛
We conclude from this equation that changes in the amount of capital accumulated per
worker are explained by the difference between two terms. First, 𝑠𝑠𝑠𝑠𝑠𝑠 (π‘˜π‘˜), the amount of
saving and hence of investment per worker, which is the saving rate 𝑠𝑠 multiplied by the perworker production function 𝑦𝑦 = 𝑧𝑧𝑧𝑧(π‘˜π‘˜). Second, (𝑑𝑑 + 𝑛𝑛)π‘˜π‘˜, the amount of investment per
worker needed to sustain the same level of capital per worker next year. This interpretation
comes from a fraction 𝑑𝑑 of all capital depreciating, so an amount of capital per worker 𝑑𝑑𝑑𝑑
must be replaced to keep capital per worker unchanged. Furthermore, the number of
workers increases by a percentage 𝑛𝑛 each year, so if existing workers use capital π‘˜π‘˜ each,
there needs to be investment 𝑛𝑛𝑛𝑛 per existing worker to give future workers the same capital
π‘˜π‘˜ each as current workers.
1.8.1 The Solow diagram
We can use a diagram to study the evolution over time of capital per worker π‘˜π‘˜ and its
implications for output per worker 𝑦𝑦. Figure 1.20 is this key diagram of the Solow model
that plots:
•
•
•
The per worker production function 𝑦𝑦 = 𝑧𝑧𝑧𝑧(π‘˜π‘˜), which is an increasing and concave
function of π‘˜π‘˜.
The ‘saving line’ 𝑠𝑠𝑠𝑠𝑠𝑠(π‘˜π‘˜), a scaled-down version of 𝑧𝑧𝑧𝑧(π‘˜π‘˜) because of 0 < 𝑠𝑠 < 1 and,
hence, an increasing and concave function of π‘˜π‘˜.
The ‘effective depreciation line’ (𝑑𝑑 + 𝑛𝑛)π‘˜π‘˜, an upward-sloping straight line with
gradient given by the effective depreciation rate of capital per worker: the sum of
the depreciation rate plus the growth rate of the labour force.
Starting from a low level of capital per worker, the saving line is above the effective
depreciation line because the production function and saving line are initially very steep,
reflecting a high marginal product of capital. This means π‘˜π‘˜ ′ − π‘˜π‘˜ > 0, so capital per worker π‘˜π‘˜
is increasing over time. This leads to higher output per worker 𝑦𝑦 since 𝑦𝑦 = 𝑧𝑧𝑧𝑧(π‘˜π‘˜), with the
economy moving along the per worker production function over time. Starting from this
position, by adding more capital per worker, workers’ productivity can be increased and the
economy experiences growth in output per worker coming from capital accumulation.
39
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Figure 1.20: The Solow model diagram
The steady state of the Solow model
It is an important implication of the Solow model that such growth in output per worker
cannot continue indefinitely. We can see this from the Solow model diagram because there
is a steady state where the saving line crosses the effective depreciation line. If the economy
reaches this level of capital per worker π‘˜π‘˜, it will remain at that level of π‘˜π‘˜ unless something
changes. Mathematically, if 𝑠𝑠𝑠𝑠𝑠𝑠 (π‘˜π‘˜) = (𝑑𝑑 + 𝑛𝑛)π‘˜π‘˜ then π‘˜π‘˜ ′ = π‘˜π‘˜.
With no further change in π‘˜π‘˜, there is no further growth in output per worker 𝑦𝑦 unless
something else changes because 𝑦𝑦 = 𝑧𝑧𝑧𝑧(π‘˜π‘˜). Notice that both GDP π‘Œπ‘Œ and the total capital
stock 𝐾𝐾 are still growing in line with the labour force 𝑁𝑁 at rate 𝑛𝑛 but the more important
variable is how much is produced and earned per worker.
Does the basic Solow model always have such a steady state where growth in per worker
incomes grinds to a halt? If the production function is neoclassical, the gradient of 𝑓𝑓(π‘˜π‘˜) is
extremely large for π‘˜π‘˜ close to zero but declines as π‘˜π‘˜ increases and approaches zero as π‘˜π‘˜
becomes large. The saving line 𝑠𝑠𝑠𝑠𝑠𝑠(π‘˜π‘˜), as a multiple of 𝑓𝑓(π‘˜π‘˜), shares the same properties.
On the other hand, the gradient of the effective depreciation line is constant. Consequently,
there exists a positive steady state π‘˜π‘˜ ∗ (and only one) where the saving and effective
depreciation lines cross.
Moreover, the saving line is above the effective depreciation line for π‘˜π‘˜ below π‘˜π‘˜ ∗ , which
means that π‘˜π‘˜ rises over time when it is below π‘˜π‘˜ ∗ (and would fall over time if above π‘˜π‘˜ ∗ ). The
economy thus converges to π‘˜π‘˜ ∗ in the long run, so this point of stagnation is eventually
reached.
The Solow model also always has a steady state with zero capital per worker because some
capital is essential for production with a neoclassical production function. However, the
economy would diverge from this steady state no matter how close it gets, so this does not
need to be taken seriously and is ignored in what follows.
In summary, while the Solow model can explain how a country can become richer starting
from a low level of capital accumulation, it cannot explain long-run growth. Intuitively,
returns to capital are high when capital is initially scarce, so investment leads to large
40
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
increases in income. After that, diminishing returns to capital means that further investment
has lower returns and, with less extra output generated per unit of capital while the
maintenance cost of capital increases proportionately, a point is reached where the capital
stock cannot rise any further.
Box 1.5: The ‘Asian tiger’ economies
The so-called ‘Asian tiger’ economies (Singapore, Taiwan, Hong Kong, South Korea)
experienced very fast economic growth in the period from 1960 to 1990 but their growth
rates subsequently declined. The process of development in these economies provides a
good example of the mechanisms at work in the Solow model.
Let us consider Singapore for illustration. Figure 1.21 shows that Singapore has had a very
high national saving rate, which was above 40 per cent since the 1980s and sometimes
even above 50 per cent. While not all of this saving was channelled into domestic
investment, the share of investment in GDP was also very high in Singapore. Consequently,
Singapore’s economy experienced rapid capital accumulation, although it started from a
low base in the 1960s. The time series of the average amount of capital per worker in
Singapore is plotted in Figure 1.22. The amount of capital per worker increases by
approximately 400 per cent (in real terms) over the 40-year period after 1970.
Figure 1.21: Singapore national saving rate
This process of rapid capital accumulation is what the Solow model predicts for an
economy with a high saving rate and a low initial level of capital per worker. The model
predicts this would lead to a significant increase in income per worker and income per
person as the economy moves along the per worker production function. Data on real
income per person are shown in Figure 1.23 and we see that there is a dramatic
improvement in prosperity.
41
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Figure 1.22: Singapore capital per worker
Another prediction of the Solow model is that the rate of progress, specifically, the growth
rate of income per person would gradually slow down over time, even if the economy
maintained very high rates of saving. It is not easy to read growth rates from a graph of
income plotted on an ordinary scale against time. By taking logarithms of the data, or
using a logarithmic scale, the gradient of the time series is informative about the growth
rate.
Figure 1.23: Singapore real GDP per person
Figure 1.24 plots the natural logarithm of real income per person against time. Here we
see that the gradient of the graph tends to decrease over time, indicating that economic
growth is slowing down in Singapore. This happens even though the saving rate does not
fall over this period but actually rises. Such a growth slowdown as capital per worker rises
is in line with the Solow model’s prediction, which is a consequence of diminishing returns
to capital.
42
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Figure 1.24: Singapore log real GDP per person
The growth slowdown can be seen more directly in Figure 1.25, which plots growth rates of
real income per person averaged over each decade.
Figure 1.25: Singapore average growth rates by decade
While these predictions are consistent with the basic Solow model, we do not necessarily see
evidence that growth rates of income per person are falling all the way to zero. Even in
developed economies that have been experiencing growth for more than a century, we still
typically see growth being positive. In contrast, the basic Solow model predicts that the longrun growth rate of income per person is zero. The inability to explain long-run growth is one
of the major weaknesses of the Solow model and we will return to this issue in Chapter 2.
43
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Box 1.6: Interest rates in the long run
Capital accumulation in the Solow model is financed by households’ saving and,
moreover, in a closed economy with no government debt that investment is the only
outlet for those savings. What then are the Solow model’s implications for the return
received by savers?
The real interest rate π‘Ÿπ‘Ÿ is real return earned from owning capital. In an economy with
competitive markets, owners of capital are able to rent it to firms at price 𝑅𝑅 = 𝑀𝑀𝑃𝑃𝐾𝐾 , the
marginal product of capital. The real return π‘Ÿπ‘Ÿ on capital is equal to the rental price 𝑅𝑅
minus the cost of replacing capital lost through depreciation, which is 𝑑𝑑 per unit of
capital.
π‘Ÿπ‘Ÿ = 𝑅𝑅 − 𝑑𝑑
Firms are producing output according to the production function π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁). In per
worker terms, this production function is 𝑦𝑦 = 𝑧𝑧𝑧𝑧(π‘˜π‘˜), where 𝑓𝑓 (π‘˜π‘˜) = 𝐹𝐹(π‘˜π‘˜, 1). Since
aggregate output can be written as π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧𝑧𝑧(𝐾𝐾⁄𝑁𝑁), the marginal product of capital is
𝑀𝑀𝑃𝑃𝐾𝐾 = πœ•πœ•πœ•πœ•⁄πœ•πœ•πœ•πœ• = 𝑧𝑧𝑧𝑧′(𝐾𝐾⁄𝑁𝑁) = 𝑧𝑧𝑧𝑧′(π‘˜π‘˜). Therefore, the real return on capital is
π‘Ÿπ‘Ÿ = 𝑧𝑧𝑓𝑓 ′ (π‘˜π‘˜) − 𝑑𝑑
The Solow model predicts that capital per worker π‘˜π‘˜ converges to a steady state in the
long run. Starting from a low initial level, the stock of capital rises over time relative to
the number of workers as the economy converges to its steady state. With a diminishing
marginal product of capital, 𝑓𝑓′(π‘˜π‘˜) is decreasing in π‘˜π‘˜, so this means π‘Ÿπ‘Ÿ declines as capital
per worker increases. The Solow model predicts the real return π‘Ÿπ‘Ÿ received by savers falls
over time, ultimately converging to a steady state. This argument is illustrated
graphically in Figure 1.26. As we will see, this steady state for π‘Ÿπ‘Ÿ could be positive or
negative.
Figure 1.26: Real interest rates over time in the Solow model
What can we say about real returns empirically over time? It is not easy to measure π‘Ÿπ‘Ÿ in
a consistent way over very long periods. Figure 1.27 plots a time series of real interest
rates on UK government bonds over a 300-year period. This real bond yield is taken as a
proxy for the real return earned by savers (even though government bonds are
44
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
not included in the Solow model, if they were, savers would have a choice of holding
bonds or capital, so the returns on the two assets would be linked).
Real interest rates in the UK have fluctuated over a wide range during those three
centuries, sometimes being as high as 8 per cent and sometimes turning negative.
Looking at the picture as a whole, it appears there is a moderate downward trend in
real interest rates over time.
Figure 1.27: UK real interest rates in the long run
45
EC2065 Macroeconomics | Chapter 2: Economic growth
Chapter 2: Economic growth
This chapter delves deeper into the study of economic growth that was introduced in
Chapter 1. Economic growth refers to increases over time in the level of real GDP per
person, which are a crucial component of living standards in an economy. In Chapter 1, we
have seen several models that fail to explain sustained economic growth. The Malthusian
model predicts living standards can stagnate at extremely low levels of income close to
subsistence. The Solow model, while it can explain economic growth through capital
accumulation, for example, when an economy industrialises, it fails to explain why economic
growth will continue in the long run.
This chapter explores alternative models that help to understand long-run growth. These
models focus on human capital and technological progress rather than physical capital
accumulation. We will also look at the implications of growth models for levels of income
per person across countries.
Essential reading
•
Williamson, Chapters 7 and 8.
2.1 Evidence on economic growth
We will use real GDP per person, or sometimes per worker, as a measure of living standards.
This variable is defined by 𝑦𝑦𝑑𝑑 = GDP / Population and the growth rate of GDP per person is
𝑔𝑔𝑑𝑑 = (𝑦𝑦𝑑𝑑 − 𝑦𝑦𝑑𝑑−1 )/𝑦𝑦𝑑𝑑−1 . Figure 2.1 plots a time series of real GDP per person in the USA
between 1870 and 2018, which is measured in units of 2011 dollars per person. We see that
per person real income has risen by more than 10 times over this period of almost 150
years.
Figure 2.1: Economic growth in the USA
46
EC2065 Macroeconomics | Chapter 2: Economic growth
2.1.1 Measuring economic growth
While we can see from the graph that incomes have increased over time in the USA, the
growth rate of real income per person is not simply the gradient of this plot of 𝑦𝑦𝑑𝑑 against
time 𝑑𝑑. Instead, the gradient of a plot of the natural logarithm of real GDP per person is
approximately equal to 𝑔𝑔𝑑𝑑 . The gradient of ln 𝑦𝑦𝑑𝑑 plotted against time 𝑑𝑑 is
(ln 𝑦𝑦𝑑𝑑 − ln 𝑦𝑦𝑑𝑑−1 )⁄(𝑑𝑑 − (𝑑𝑑 − 1)) = ln 𝑦𝑦𝑑𝑑 − ln 𝑦𝑦𝑑𝑑−1 . By using ln(1 + 𝑔𝑔) ≈ 𝑔𝑔 for small 𝑔𝑔, this
gradient is ln 𝑦𝑦𝑑𝑑 − ln 𝑦𝑦𝑑𝑑−1 = ln 𝑦𝑦𝑑𝑑 ⁄𝑦𝑦𝑑𝑑−1 = ln(1 + 𝑔𝑔𝑑𝑑 ) ≈ 𝑔𝑔𝑑𝑑 . Hence, when studying long-run
economic growth, it makes sense to plot time series data in logarithms. This is done for US
real GDP per person in Figure 2.2.
Figure 2.2: US real GDP per person in logarithms
2.1.2 Why growth matters
Taking the period from 1870 to 2000 in the USA, real GDP per person reached 𝑦𝑦2000 =
$36,000 in the year 2000 (measured in year-2000 dollars). The growth rate over this 130year period was 𝑔𝑔 =1.75% per year on average. Small differences in 𝑔𝑔, if they had been
sustained over this long period of time 1870-2000, would have made a huge difference to
the resulting level of 𝑦𝑦. If US growth rates had been a percentage point lower on average at
𝑔𝑔 = 0.75% then we would have had 𝑦𝑦2000 = $10,000, leaving it as merely a middle-income
country. On the other hand, if 𝑔𝑔 = 2.75% instead then we would have had 𝑦𝑦2000 =
$120,000, a level of income per person not yet attained in any country.
These numerical examples demonstrate why long-run economic growth is so important. A
small difference in growth rates compounded over a long period of time leads to
dramatically different economic outcomes.
One way to appreciate the magnitude of the difference between growth rates is known as
the ‘rule of 70’. It states that if economic growth occurs at 𝑔𝑔 per cent then it takes
approximately 70/𝑔𝑔 years for income to double. For example, the 2 per cent growth seen as
normal in a developed country implies a doubling of income in 35 years, with each
generation enjoying living standards at twice the level of their parents’ generation.
47
EC2065 Macroeconomics | Chapter 2: Economic growth
Some countries have experienced much faster growth. In the ‘Asian tiger’ economies,
growth rates were often above 5 per cent in the 1960–90 period. This implies income
doubles in less than 15 years. Other countries have experienced even faster growth rates.
China grows at a rate of 6.3 per cent on average between 1980 and 2000, and even more
rapidly in the following decade. Figure 2.3 shows this led China’s income per person, which
in 1980 was only 5 per cent of the level in the USA, to rise to more than 20 per cent of the
US level by 2010. Income per person in India was also close to 5 per cent of the US level in
1980 but, owing to a lower growth rate than China’s, it rises only to 10 per cent of the USA
by 2016.
Figure 2.3: Why growth matters
2.1.3 Economic growth in historical perspective
Although sustained economic growth has come to be taken for granted in advanced
economies, it is a relatively recent phenomenon from a historical perspective. For much of
history, most countries experienced stagnation in income per person. This stagnation is not
because there was no growth in total GDP but because increases in population offset any
rise in GDP. However, starting in the 18th and 19th centuries, some countries began to
experience ongoing growth in income per person.
Economic historians have estimated levels of real GDP per person over the last 2000 years
using historical sources. Figure 2.4 shows these estimates for a number of countries (strictly
speaking, the geographical areas spanned by the modern countries).
48
EC2065 Macroeconomics | Chapter 2: Economic growth
Figure 2.4: Stagnation for much of history
For much of the last 2000 years, the graphs of real GDP per person over time are close to
flat lines as economies stagnated. Only in the last 300 years do we see sustained economic
growth in some countries – modern economic growth – with others only beginning to see
significant growth in the last 40–50 years.
Although GDP per person stagnates until relatively recently, there is sustained population
growth in some parts of the world over a much longer period of time. Figure 2.5 shows plots
of estimated populations over time for a number of countries. This evidence is in line with a
version of the Malthusian model from Section 1.3 where there are improvements in
technology. This technological progress does not lead to sustained rises in living standards
but does increase the population.
Figure 2.5: Rising world population
49
EC2065 Macroeconomics | Chapter 2: Economic growth
2.1.4 The distribution of income across countries
Prior to the 19th century, differences in income across the countries of the world were
much smaller than today. Research suggests the more successful countries had incomes that
were only around twice those of the less successful countries. As countries began to
experience modern economic growth at different times, they diverged in the 19th century.
Continuing through the 20th century, huge differences in income per person have opened
up across countries. It is not unusual to have difference of 10–20 times between average
incomes per person in rich and poor countries.
Figure 2.6 shows the world income distribution across countries in 2017. Average incomes
per person are expressed in comparable units across countries by adjusting for purchasing
power parity and converting into a US dollar equivalent. Ordering countries by their income
levels, real GDP per person is plotted against the percentiles of the world population
(ignoring any income differences within countries). We see that much of the world
population lives in countries that are far below the income levels achieved in advanced
economies.
Figure 2.6: World income distribution across countries
2.1.5 Convergence
Given the large income differences that have arisen between countries, an important
question is whether poor countries will remain much poorer than rich countries in the
future, or whether they will catch up. For poorer countries to close the gap with richer
countries, the poor must grow faster than the rich. If this happens, we say that those
countries are converging.
We distinguish between two notions of convergence. First, absolute convergence, which
means simply that poor countries grow faster than the rich. Second, conditional
convergence, which means that among a group of countries with similar fundamentals,
poorer ones grow faster than richer ones. Data from the post-Second-World-War period
50
EC2065 Macroeconomics | Chapter 2: Economic growth
suggest convergence among some groups of countries but certainly not worldwide. This
means there is evidence of conditional convergence, but not absolute convergence.
Figure 2.7 is a scatterplot of the annual average growth rates of real GDP per person
achieved by a large number of countries over the period 1960–2017 against their initial
levels of real GDP per person in 1960 (measured in comparable 2017 US dollars). If there
were absolute convergence, we would expect to see a clear negative relationship between
initial income levels and subsequent growth rates as poorer countries grow faster than
richer countries. However, this is not the case. While some poorer countries grow faster,
there are many examples of poorer countries that grow at slower rates than richer
countries. Overall, the correlation between growth rates and initial incomes is close to zero.
Focusing on a narrower set of countries, Figure 2.8 restricts the sample to OECD (mainly
Europe, North America and Australasia) and East Asian countries. Now, there is a clear
negative relationship between growth rates and initial income levels, providing evidence of
convergence within this group. To the extent that these countries can be seen as having
similar fundamentals, this indicates that conditional convergence is taking place. Observe
that the growth rates in East Asian countries are even higher than the relationship between
growth and initial incomes for the OECD would suggest. Restricting attention to a narrower
group of countries would provide even clearer evidence of conditional convergence within
that group.
Figure 2.7: Limited evidence of convergence worldwide
51
EC2065 Macroeconomics | Chapter 2: Economic growth
Figure 2.8: Convergence (OECD and East Asia)
One characteristic of some groups of countries is that they save and invest a greater share
of their GDP than the global average. For example, high levels of saving and investment are
a feature of many East Asian countries. Figure 2.9 is a scatterplot of countries’ investment
shares of GDP and their levels of real GDP per person. While the relationship is far from
perfect, overall, we can say that investment rates and levels of real GDP per person are
positively correlated. Differences in the amount of resources countries devote to capital
accumulation is thus one fundamental reason why they might not converge to the same
level of income per person in the long run.
Figure 2.9: Investment rates and income
2.2 Income and growth rates across countries
In Section 1.8, we saw that the Solow model can explain growth for a period in an economy
that begins with a low level of capital per worker but also that the model fails to explain
growth in the long run. Before turning to alternative models of long-run economic growth,
52
EC2065 Macroeconomics | Chapter 2: Economic growth
we will first explore the implications of the Solow model for the cross-section of countries.
In particular, does the Solow model help in understanding the large differences in growth
rates and levels of income per person across countries?
In the cross-section of countries, the Solow model makes two key predictions. First, if
countries share the same fundamentals such as technologies and saving rates, then they
should converge to the same level of output per worker in the long run, even if they start
from different initial conditions. This means that controlling for a group of countries with
the same fundamentals, the initially poorer countries in the group should grow faster than
the initially richer countries. The Solow model therefore predicts conditional convergence.
The second key prediction is that if saving rates differ across countries, then there will be
differences in output per worker even in the long run. This means that the Solow model
does not predict absolute convergence (except in the unrealistic case where every country
in the world shares the same fundamentals).
We now consider the logic for the first prediction. If countries share the same fundamentals,
then they have the same lines in the Solow model diagram introduced in Section 1.8. A
version of this diagram is reproduced in Figure 2.10. There is a per worker production
function 𝑧𝑧𝑧𝑧(π‘˜π‘˜), a saving line 𝑠𝑠𝑠𝑠𝑠𝑠(π‘˜π‘˜), and an effective depreciation line (𝑑𝑑 + 𝑛𝑛)π‘˜π‘˜. The
fundamentals are the saving rate 𝑠𝑠, the population growth rate 𝑛𝑛, the level of total factor
productivity (TFP) 𝑧𝑧 and the per worker production function 𝑓𝑓(π‘˜π‘˜) itself.
As the fundamentals are the same for the countries we are considering, all have the same
per worker production function, saving line and effective depreciation line. This implies they
share the same steady state for π‘˜π‘˜ and 𝑦𝑦, which is found where the saving line intersects the
depreciation line.
Figure 2.10: Conditional convergence
With the fundamentals being the same, the only differences between countries are the
amounts of capital per worker they have accumulated owing to different histories.
Specifically, those countries that have accumulated less capital begin poorer because a
lower level of π‘˜π‘˜ implies less 𝑦𝑦 = 𝑧𝑧𝑧𝑧(π‘˜π‘˜) with 𝑧𝑧 being the same. The more π‘˜π‘˜ is below the
53
EC2065 Macroeconomics | Chapter 2: Economic growth
steady state π‘˜π‘˜ ∗ , the larger the gap between the saving and effective depreciation lines
relative to the level of π‘˜π‘˜. Since this gap determines the change in capital per worker over
time and, hence, output per worker, poorer countries grow faster than richer countries. All
end up converging to the same steady-state income level in the long run.
Now suppose countries are identical in all fundamentals except for their saving rates 𝑠𝑠.
Different levels of 𝑠𝑠 imply different saving lines 𝑠𝑠𝑠𝑠𝑠𝑠(π‘˜π‘˜) and different saving lines result in
different steady-state levels of capital per worker, hence, output per worker. This is
illustrated in Figure 2.11. A higher saving rate allows a higher stock of capital per worker to
be sustained for the same production technology. With more capital, output per worker is
higher, although not necessarily consumption (see the analysis of the golden rule in Section
2.5). Long-run growth is always zero and is not affected by the saving rate.
Figure 2.11: Differences in saving rates
The logic of the second prediction of the Solow model is that if countries are converging to
different steady states, there is no presumption that starting from a lower level of income
per worker leads to faster subsequent growth. A poor country with a low saving rate might
be closer to its steady state than a richer one with a high saving rate, so the poorer country
grows more slowly. The countries might thus move further apart, indicating a failure of
absolute convergence. However, given the difference in saving rates, the ratio of incomes
across the two countries must stop widening further in the long run.
Box 2.1: Can the Solow model explain large income differences
across countries?
In principle, differences in saving rates and the resulting levels of capital
accumulation in the Solow model could explain why some countries are rich and
others are poor. How large are the differences in income levels predicted by the
Solow model for realistic differences in saving rates across countries? We will focus
on the income differences that will prevail in the long run, i.e. where economies
have reached their steady-state levels of capital per worker.
54
EC2065 Macroeconomics | Chapter 2: Economic growth
Performing this exercise quantitatively requires some specific assumptions. We will
work with a Cobb-Douglas production function π‘Œπ‘Œ = 𝑧𝑧𝐾𝐾 𝛼𝛼 𝑁𝑁 1−𝛼𝛼 , which implies the
per worker production function 𝑦𝑦 = π‘§π‘§π‘˜π‘˜ 𝛼𝛼 . With competitive markets, the parameter
𝛼𝛼 is equal to the capital share of income, so we set 𝛼𝛼 = 1/3 as a reasonable value.
We will consider differences in saving rates 𝑠𝑠 across countries, but 𝑧𝑧 and 𝛼𝛼, and the
depreciation and population growth rates 𝑑𝑑 and 𝑛𝑛, are assumed to be the same
everywhere.
The Solow model’s long-run prediction for levels of income per person across
countries is found by solving for steady-state capital per worker π‘˜π‘˜ ∗ . This is
determined by the equation:
π‘ π‘ π‘ π‘ π‘˜π‘˜ ∗𝛼𝛼 = (𝑑𝑑 + 𝑛𝑛)π‘˜π‘˜ ∗
This can be rearranged to obtain π‘˜π‘˜ ∗1−𝛼𝛼 = 𝑠𝑠𝑠𝑠/(𝑑𝑑 + 𝑛𝑛) and hence the following
solution:
1
𝑠𝑠𝑠𝑠 1−𝛼𝛼
π‘˜π‘˜ ∗ = οΏ½
οΏ½
𝑑𝑑 + 𝑛𝑛
Substituting this into the per-worker production function 𝑦𝑦 ∗ = π‘§π‘§π‘˜π‘˜ ∗𝛼𝛼 yields the
solution:
∗
𝑦𝑦 =
1
𝑧𝑧 1−𝛼𝛼
𝛼𝛼
𝑠𝑠 1−𝛼𝛼
οΏ½
οΏ½
𝑑𝑑 + 𝑛𝑛
If we consider two countries 𝐴𝐴 and 𝐡𝐡 where all parameters above are the same
except for the saving rates 𝑠𝑠𝐴𝐴 and 𝑠𝑠𝐡𝐡 , then dividing 𝑦𝑦𝐴𝐴∗ by 𝑦𝑦𝐡𝐡∗ results in:
𝛼𝛼
𝑦𝑦𝐴𝐴∗
𝑠𝑠𝐴𝐴 1−𝛼𝛼
∗ =οΏ½ οΏ½
𝑦𝑦𝐡𝐡
𝑠𝑠𝐡𝐡
In a closed economy with no government sector, the saving rate 𝑠𝑠 is equal to the
ratio of investment to GDP. Looking at the evidence in Figure 2.9, investment
shares of GDP in countries with very high investment ratios are approximately five
times higher than those with very low investment ratios (for example, 35 per cent
in high-investment countries to 7 per cent in low-investment countries). Hence, we
suppose that country A saves a fraction of income five times higher than in country
B, that is, 𝑠𝑠𝐴𝐴 ⁄𝑠𝑠𝐡𝐡 = 5. Noting 𝛼𝛼 = 1/3 implies 𝛼𝛼⁄(1 − 𝛼𝛼 ) = 1/2, it follows that
𝑦𝑦𝐴𝐴∗ ⁄𝑦𝑦𝐡𝐡∗ = 51/2 ≈ 2.2. A saving rate five times higher thus implies income per person
only slightly more than double. Intuitively, although country 𝐴𝐴 accumulates more
capital than country 𝐡𝐡, diminishing returns to capital imply that the resulting
income difference is much smaller than the difference in saving rates.
With this reasonable parameterisation of the Solow model, very large difference in
rates of saving and investment lead only to modest difference in income levels. This
means the Solow model has a very limited capacity to explain the income
55
EC2065 Macroeconomics | Chapter 2: Economic growth
differences of an order of magnitude of 10–20 between rich and poor countries seen in
Figure 2.6. There must be some other cause of these large income differences, perhaps
total factor productivity 𝑧𝑧.
Box 2.2: How long does convergence to the steady state take in the
Solow model?
A key prediction of the basic Solow model is that there is no growth in income per
person in the long run, i.e. when the economy has converged to its steady-state value of
capital per worker. The relevance of this prediction depends on how long it takes to get
close to the steady state – if this were to take hundreds of years then the basic Solow
model would have the ability to explain economic growth over quite long periods of
time. This exercise presents an example to illustrate the likely timeframe for
convergence to the steady state in the Solow model.
We assume the production function is π‘Œπ‘Œ = 𝑧𝑧√𝐾𝐾 √𝑁𝑁, which is a Cobb-Douglas form with
𝛼𝛼 = 1/2, implying that the capital and labour income shares are 50 per cent each if
markets are competitive. This is larger than the typical value of 𝛼𝛼 = 1/3, and the use of
a larger 𝛼𝛼 turns out to slow down convergence to the steady state. The per-worker
production function in this case is 𝑦𝑦 = �𝑧𝑧√𝐾𝐾 √𝑁𝑁�⁄𝑁𝑁 = 𝑧𝑧�𝐾𝐾⁄𝑁𝑁 = 𝑧𝑧√π‘˜π‘˜. The marginal
product of capital can be obtained by differentiating 𝑦𝑦 with respect to π‘˜π‘˜, thus 𝑀𝑀𝑃𝑃𝐾𝐾 =
πœ•πœ•πœ•πœ•⁄πœ•πœ•πœ•πœ• = 𝑧𝑧⁄οΏ½2√π‘˜π‘˜οΏ½ = 𝑦𝑦⁄(2π‘˜π‘˜). The steady state for capital per worker π‘˜π‘˜ ∗ is the solution
of the equation 𝑠𝑠𝑦𝑦 ∗ = (𝑑𝑑 + 𝑛𝑛)π‘˜π‘˜ ∗ :
𝑠𝑠𝑠𝑠√π‘˜π‘˜ ∗ = (𝑑𝑑 + 𝑛𝑛)π‘˜π‘˜∗ ⇒ √π‘˜π‘˜ ∗ =
𝑠𝑠𝑠𝑠
𝑑𝑑 + 𝑛𝑛
⇒
π‘˜π‘˜ ∗ = οΏ½
𝑠𝑠𝑠𝑠 2
οΏ½
𝑑𝑑 + 𝑛𝑛
The steady state for output per worker 𝑦𝑦 ∗ is then found using 𝑦𝑦 ∗ = 𝑧𝑧√π‘˜π‘˜ ∗ :
𝑦𝑦 ∗ = 𝑧𝑧 οΏ½
𝑠𝑠𝑠𝑠
𝑠𝑠𝑧𝑧 2
οΏ½=
𝑑𝑑 + 𝑛𝑛
𝑑𝑑 + 𝑛𝑛
Since the marginal product of capital 𝑀𝑀𝑃𝑃𝐾𝐾 is the effect on output per worker of a
marginal increase in capital per worker, the change in output 𝑦𝑦 ′ − 𝑦𝑦 is approximately
equal to 𝑀𝑀𝑃𝑃𝐾𝐾 multiplied by π‘˜π‘˜ ′ − π‘˜π‘˜. Hence, using π‘˜π‘˜ ′ − π‘˜π‘˜ = (𝑠𝑠𝑠𝑠 − (𝑑𝑑 + 𝑛𝑛)π‘˜π‘˜)⁄(1 + 𝑛𝑛):
𝑦𝑦 ′
𝑦𝑦 𝑠𝑠𝑠𝑠 − (𝑑𝑑 + 𝑛𝑛)π‘˜π‘˜
𝑠𝑠𝑦𝑦 2 ⁄π‘˜π‘˜ − (𝑑𝑑 + 𝑛𝑛)𝑦𝑦
(π‘˜π‘˜ − π‘˜π‘˜) =
𝑦𝑦 − 𝑦𝑦 ≅
οΏ½
οΏ½=
1 + 𝑛𝑛
2(1 + 𝑛𝑛)
2π‘˜π‘˜
2π‘˜π‘˜
′
The per-worker production function implies 𝑦𝑦 2 ⁄π‘˜π‘˜ = 𝑧𝑧 2 , so 𝑦𝑦 ′ − 𝑦𝑦 can be expressed as:
𝑦𝑦 ′ − 𝑦𝑦 ≅
(𝑑𝑑 + 𝑛𝑛) 𝑠𝑠𝑧𝑧 2
οΏ½
− 𝑦𝑦�
2(1 + 𝑛𝑛) 𝑑𝑑 + 𝑛𝑛
56
EC2065 Macroeconomics | Chapter 2: Economic growth
Recalling that the steady state is 𝑦𝑦 ∗ = 𝑠𝑠𝑧𝑧 2⁄(𝑑𝑑 + 𝑛𝑛), the change in income per
person is 𝑦𝑦 ′ − 𝑦𝑦 ≅ ((𝑑𝑑 + 𝑛𝑛)⁄(2(1 + 𝑛𝑛)))(𝑦𝑦 ∗ − 𝑦𝑦). The gap between 𝑦𝑦 and its
steady-state value 𝑦𝑦 ∗ is 𝑦𝑦 ∗ − 𝑦𝑦, and the change in this gap over time is (𝑦𝑦 ∗ − 𝑦𝑦 ′ ) −
(𝑦𝑦 ∗ − 𝑦𝑦) = 𝑦𝑦 − 𝑦𝑦 ′ , so:
(𝑦𝑦 ∗ − 𝑦𝑦 ′ ) − (𝑦𝑦 ∗ − 𝑦𝑦) ≅ −
(𝑑𝑑 + 𝑛𝑛) ∗
(𝑦𝑦 − 𝑦𝑦)
2(1 + 𝑛𝑛)
Therefore, the dynamics of the gap to steady state 𝑦𝑦 ∗ − 𝑦𝑦 are determined by:
(𝑦𝑦 ∗ − 𝑦𝑦 ′ ) ≅ οΏ½1 −
(𝑑𝑑 + 𝑛𝑛)
οΏ½ (𝑦𝑦 ∗ − 𝑦𝑦)
2(1 + 𝑛𝑛)
Taking 𝑑𝑑 ≈ 9 per cent and 𝑛𝑛 ≈ 1 per cent as reasonable values of the depreciation
rate of capital and the population growth rate results in (𝑦𝑦 ∗ − 𝑦𝑦 ′ ) ≈ 0.95(𝑦𝑦 ∗ − 𝑦𝑦).
This means 5 per cent of the gap between 𝑦𝑦 and its steady state 𝑦𝑦 ∗ is closed each
year. The rule-of-70 then implies approximately half of the gap closed in 70⁄5 = 14
years and three quarters of the gap in approximately 28 years. With only a small gap
remaining after four decades, the zero steady-state growth prediction of the Solow
model is reached after an economy has been accumulating capital for a relatively
short space of time.
2.3 Technological progress
With the basic Solow model being unable to generate long-run growth and the long run of
the model being reached in just a few decades, it is necessary to turn to other explanations
of why advanced economies have continued to experience sustained economic growth for
more than a century. We will now explore the possibility of growth in total factor
productivity (TFP) 𝑧𝑧.
Technological progress is one reason why total factor productivity 𝑧𝑧 can increase over time.
New ideas provide new uses for capital and labour, or lead to efficiency gains in producing
existing goods. In the Solow model diagram, a one-off increase in 𝑧𝑧 shifts up the per worker
production function 𝑧𝑧𝑧𝑧(π‘˜π‘˜) and saving line 𝑠𝑠𝑠𝑠𝑠𝑠(π‘˜π‘˜) as shown in Figure 2.12.
57
EC2065 Macroeconomics | Chapter 2: Economic growth
Figure 2.12: Technological progress
The improvement in technology results in higher steady-state values of π‘˜π‘˜ and 𝑦𝑦. It is
possible to sustain a higher level of capital and output per worker with the same saving rate
𝑠𝑠 because technological progress raises the marginal product of capital. However, a one-off
improvement in technology ultimately leads only to a one-off improvement in output per
worker. This means explaining long-run growth requires ongoing technological progress.
2.3.1 A constant growth rate of technology
In studying ongoing technological progress, we assume the following production function:
π‘Œπ‘Œ = 𝐹𝐹(𝐾𝐾, 𝐴𝐴𝐴𝐴)
Instead of TFP 𝑧𝑧, a variable 𝐴𝐴 represents labour-augmenting technology, labour-augmenting
in that 𝐴𝐴 scales up labour input 𝑁𝑁. With labour-augmenting technology, effective labour
input into production is not 𝑁𝑁 but 𝐴𝐴𝐴𝐴. The function 𝐹𝐹(𝐾𝐾, 𝑁𝑁) has the usual neoclassical
properties. Technological progress is represented by 𝐴𝐴 growing at a constant rate 𝑔𝑔:
𝐴𝐴′ = (1 + 𝑔𝑔)𝐴𝐴
In analysing this version of the Solow model, it is convenient to define variables π‘˜π‘˜π‘’π‘’ and 𝑦𝑦𝑒𝑒
representing capital and output per effective labour input 𝐴𝐴𝐴𝐴:
π‘˜π‘˜π‘’π‘’ =
𝐾𝐾
𝐴𝐴𝐴𝐴
and
𝑦𝑦𝑒𝑒 =
π‘Œπ‘Œ
𝐴𝐴𝐴𝐴
A per-effective-worker production function is derived in the same way as the usual perworker production function, with 𝑓𝑓(π‘˜π‘˜π‘’π‘’ ) used as a shorthand for 𝐹𝐹(π‘˜π‘˜π‘’π‘’ , 1):
𝑦𝑦𝑒𝑒 =
𝐹𝐹(𝐾𝐾, 𝐴𝐴𝐴𝐴)
𝐾𝐾 𝐴𝐴𝐴𝐴
= 𝐹𝐹 �
,
οΏ½ = 𝐹𝐹 (π‘˜π‘˜π‘’π‘’ , 1) = 𝑓𝑓(π‘˜π‘˜π‘’π‘’ )
𝐴𝐴𝐴𝐴
𝐴𝐴𝐴𝐴 𝐴𝐴𝐴𝐴
Understanding the dynamics of π‘˜π‘˜π‘’π‘’ is central to the analysis. By using the equations for 𝐾𝐾′,
𝑁𝑁′, and 𝐴𝐴′ and 𝑦𝑦𝑒𝑒 = 𝑓𝑓(π‘˜π‘˜π‘’π‘’ ), the future-period value of π‘˜π‘˜π‘’π‘’ is:
58
EC2065 Macroeconomics | Chapter 2: Economic growth
π‘˜π‘˜π‘’π‘’′ =
(1 − 𝑑𝑑 )𝐾𝐾 + 𝑠𝑠𝑠𝑠
(1 − 𝑑𝑑 )π‘˜π‘˜π‘’π‘’ + 𝑠𝑠𝑠𝑠(π‘˜π‘˜π‘’π‘’ )
𝐾𝐾 ′
=
=
𝐴𝐴′ 𝑁𝑁′ (1 + 𝑔𝑔)(1 + 𝑛𝑛)𝐴𝐴𝐴𝐴
(1 + 𝑔𝑔)(1 + 𝑛𝑛)
Subtracting π‘˜π‘˜π‘’π‘’ yields an equation for the change in capital per effective worker:
π‘˜π‘˜π‘’π‘’′ − π‘˜π‘˜π‘’π‘’ =
𝑠𝑠𝑠𝑠 (π‘˜π‘˜π‘’π‘’ ) − (𝑑𝑑 + 𝑛𝑛 + 𝑔𝑔 + 𝑛𝑛𝑛𝑛)π‘˜π‘˜π‘’π‘’ 𝑠𝑠𝑠𝑠(π‘˜π‘˜π‘’π‘’ ) − (𝑑𝑑 + 𝑛𝑛 + 𝑔𝑔)π‘˜π‘˜π‘’π‘’
≈
(1 + 𝑔𝑔)(1 + 𝑛𝑛)
(1 + 𝑔𝑔)(1 + 𝑛𝑛)
The approximation in this equation uses that 𝑛𝑛𝑛𝑛 is highly likely to be small relative to 𝑛𝑛 and
𝑔𝑔 themselves. The resulting equation has the same form as that for π‘˜π‘˜ ′ − π‘˜π‘˜ in the basic
Solow model: the numerator is the difference between a ‘saving line’ 𝑠𝑠𝑠𝑠(π‘˜π‘˜π‘’π‘’ ) and an
‘effective depreciation line’ (𝑑𝑑 + 𝑛𝑛 + 𝑔𝑔)π‘˜π‘˜π‘’π‘’ . These lines have the same shapes as in the basic
Solow model because 𝑓𝑓(π‘˜π‘˜π‘’π‘’ ) is also a concave function here, and (𝑑𝑑 + 𝑛𝑛 + 𝑔𝑔)π‘˜π‘˜π‘’π‘’ is linear.
Capital per effective worker π‘˜π‘˜π‘’π‘’ is rising when the saving line is above the depreciation line,
and is at a steady state where the two lines cross. Once π‘˜π‘˜π‘’π‘’ is known, output per effective
worker is found using the per-effective-worker production function 𝑦𝑦𝑒𝑒 = 𝑓𝑓(π‘˜π‘˜π‘’π‘’ ). The
equivalent of the usual Solow model diagram here is shown in Figure 2.13.
Figure 2.13: Modified Solow diagram
The figure reveals there is a steady state for capital and output per effective worker π‘˜π‘˜π‘’π‘’ and
𝑦𝑦𝑒𝑒 , and the economy will converge to this steady state in the long run using the same logic
as in the basic Solow model. But this does not mean that there is no economic growth in the
long run. Output per worker is by definition 𝑦𝑦 = π‘Œπ‘Œ/𝑁𝑁 = 𝐴𝐴𝑦𝑦𝑒𝑒 . Since 𝑦𝑦𝑒𝑒 converges to a steady
state while 𝐴𝐴 grows at a constant rate 𝑔𝑔, the long-run growth rate of 𝑦𝑦 is 𝑔𝑔. Similarly, capital
per worker π‘˜π‘˜ has long-run growth rate 𝑔𝑔.
Intuitively, this Solow model with constant growth in labour-augmenting technology is able
to generate long-run economic growth because growth in the capital stock in line with
labour-augmenting technology is able to scale up inputs of both capital and labour per
person in the economy. Hence, by constant returns to scale, output per person grows at the
rate of labour-augmenting technological progress. These improvements in technology are
effectively raising the return to capital and offsetting the diminishing returns to capital that
limit long-run growth in the basic version of the Solow model. The economy has a balanced
59
EC2065 Macroeconomics | Chapter 2: Economic growth
growth path where all per-worker variables grow in line with technological progress. On this
balanced growth path, returns to capital remain constant over time.
This version of the Solow model is consistent with the occurrence of long-run growth.
However, that growth comes only from technological progress, which is exogenous in the
model. The Solow model here is not actually explaining the source of long-run economic
growth – it is simply assuming it. In Section 2.6, we look at theories that make long-run
economic growth endogenous.
2.4 International flows of investment
In the Solow model we have so far assumed a closed economy. The consequence of this
assumption is that investment must be financed by domestic saving (𝐼𝐼 = 𝑆𝑆). But if crosscountry income differences are explained by differences in capital accumulation, then
returns to capital have diminished more in some countries than others. This implies
differences in returns to capital across countries. To the extent that economies are actually
open to trade, there would be an incentive for savings to flow from rich countries with
abundant capital to poor countries with scarce capital where returns are higher. We present
here an exercise to calculate how large are the incentives for capital flows implied by the
Solow model given the observed differences in income levels across countries.
Suppose all countries have a Cobb-Douglas production function π‘Œπ‘Œ = 𝑧𝑧𝐾𝐾 𝛼𝛼 𝑁𝑁 1−𝛼𝛼 , which in
per-worker terms is 𝑦𝑦 = π‘§π‘§π‘˜π‘˜ 𝛼𝛼 . Further, assume the same TFP 𝑧𝑧 across countries and the
same parameter 𝛼𝛼. If markets are competitive and absent distortions in the economy, the
gross return 𝑅𝑅 received by owners of capital is equal to the marginal product of capital 𝑀𝑀𝑃𝑃𝐾𝐾 ,
and 𝛼𝛼 is the share of capital income in total income. We will set 𝛼𝛼 = 1/3 to match a
reasonable value of the capital share.
Taking as given an observed value of income per person 𝑦𝑦, the required level of capital per
worker can be found conditional on TFP 𝑧𝑧 by rearranging the per worker production
function 𝑦𝑦 = π‘§π‘§π‘˜π‘˜ 𝛼𝛼 to give π‘˜π‘˜ = (𝑦𝑦⁄𝑧𝑧)1⁄𝛼𝛼 . Differentiating the per worker production, the
marginal product of capital is 𝑀𝑀𝑃𝑃𝐾𝐾 = π›Όπ›Όπ›Όπ›Όπ‘˜π‘˜ 𝛼𝛼−1 and substituting for π‘˜π‘˜ in terms of 𝑦𝑦 and 𝑧𝑧, the
implied capital return 𝑅𝑅 as a function of income per person 𝑦𝑦 is:
𝑦𝑦
𝑅𝑅 = 𝛼𝛼𝛼𝛼 οΏ½ οΏ½
𝑧𝑧
𝛼𝛼−1
𝛼𝛼
1
1−𝛼𝛼
οΏ½
𝛼𝛼
= 𝛼𝛼𝑧𝑧 𝛼𝛼 𝑦𝑦 −οΏ½
We now compare implied returns to capital in two countries 𝐴𝐴 and 𝐡𝐡, supposing 𝐴𝐴 has an
income per person 10 times higher than in 𝐡𝐡, that is, 𝑦𝑦𝐴𝐴 ⁄𝑦𝑦𝐡𝐡 = 10. The relative (gross)
return on capital between two countries with the same 𝑧𝑧 and 𝛼𝛼 is:
1
1−𝛼𝛼
οΏ½
−οΏ½
𝑅𝑅𝐡𝐡 𝛼𝛼𝑧𝑧 𝛼𝛼 𝑦𝑦𝐡𝐡 𝛼𝛼
𝑦𝑦𝐴𝐴
=
=οΏ½ οΏ½
1−𝛼𝛼
1 −οΏ½
𝑅𝑅𝐴𝐴
𝑦𝑦𝐡𝐡
οΏ½
𝛼𝛼𝑧𝑧 𝛼𝛼 𝑦𝑦𝐴𝐴 𝛼𝛼
1−𝛼𝛼
𝛼𝛼
Using parameter 𝛼𝛼 = 1/3, note that (1 − 𝛼𝛼)⁄𝛼𝛼 = 2, so the implied capital returns across
two countries with 𝑦𝑦𝐴𝐴 ⁄𝑦𝑦𝐡𝐡 = 10 are 𝑅𝑅𝐡𝐡 ⁄𝑅𝑅𝐴𝐴 = 102 = 100. This numerical exercise suggests
the gross return to capital would be 100 times higher in the poorer country compared to
60
EC2065 Macroeconomics | Chapter 2: Economic growth
the rich country. Returns after depreciation π‘Ÿπ‘Ÿ = 𝑅𝑅 − 𝑑𝑑 would also differ by a large amount,
with π‘Ÿπ‘Ÿπ΅π΅ = 𝑅𝑅𝐡𝐡 − 𝑑𝑑 = 100𝑅𝑅𝐴𝐴 − 𝑑𝑑 = 100π‘Ÿπ‘Ÿπ΄π΄ + 99𝑑𝑑.
However, if the difference in capital returns suggested by this exercise for realistic income
differences were really true then there would be huge incentives for rich countries to lend
to poor countries for investment. But international capital flows have not occurred to the
extent that would be expected if the calculation were correct. We conclude that some
assumptions made in this exercise must be incorrect. There are a number of possibilities:
•
•
•
Some other factor of production that differs across countries has not been
accounted for, e.g. human capital. If human capital is higher in some countries, then
that raises their marginal product of physical capital 𝑀𝑀𝑃𝑃𝐾𝐾 .
TFP 𝑧𝑧 differs across countries and differences in TFP imply differences in the
marginal product of capital 𝑀𝑀𝑃𝑃𝐾𝐾 . We explore in Box 2.3 what a difference in TFP
across countries might capture.
Property rights are weaker in some countries, implying that investors’ actual return
is less than 𝑀𝑀𝑃𝑃𝐾𝐾 .
Box 2.3: Institutions and income differences across countries
Differences in total factor productivity (TFP) 𝑧𝑧 across countries help to resolve a number of
difficulties in reconciling the Solow model with the observed distribution of income levels
around the world. We have seen in Box 2.1 that differences in saving rates and capital
accumulation can explain only a small proportion of cross-country income differences.
Moreover, there would be very large gaps in the implied rate of return to capital across
countries if income differences arise from different stocks of capital per worker.
If richer countries benefit from higher TFP then this can account for some of the income
differences across countries without having to rely on differences in saving rates.
Moreover, if richer countries have higher TFP than poorer countries, it does not follow
that they necessarily have a much lower return on capital and making it less of a puzzle
that capital does not flow from rich to poor countries.
However, if TFP represents technology, which is what we assumed when explaining longrun growth through technological progress, how can very large differences in TFP exist
when knowledge can be copied? While there may be some costs of imitation, which we
consider later in Section 2.10, it is nonetheless a challenge to explain the size of the TFP
differences across countries needed to account for income levels purely in terms of
differences in technology.
Equating TFP with technology is, though, too narrow an interpretation. In a production
function π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁), a change in TFP 𝑧𝑧 simply represents any change in π‘Œπ‘Œ that cannot
be explained by changes in the quantities of capital 𝐾𝐾 or labour 𝑁𝑁. The efficiency with
which an economy’s stock of capital and labour are allocated to the best uses to produce
aggregate output π‘Œπ‘Œ would also be part of TFP 𝑧𝑧. It is more plausible that this broader
notion of TFP differs across countries, perhaps owing to differences in institutions.
61
EC2065 Macroeconomics | Chapter 2: Economic growth
Institutions
Institutional differences across countries are one hypothesis for why there are large
differences in TFP. The ‘institutions’ of a country mean the set of rules that govern the
process of decision making and resource allocation in a country. These include the country’s
political and legal systems and economic institutions such as its markets, financial system,
and taxes and regulations. Institutions can affect both incentives to accumulate factors of
production and the efficiency with which those factors of production are allocated to
different uses.
A country with weak property rights, severe corruption and a poorly designed tax system
does not provide strong incentives to save, invest, or start a new business. For example, the
risk of expropriation reduces the expected return on any long-term investments, lowering
capital accumulation. This problem would also make it harder to attract foreign direct
investment, helping to explain the puzzle of why capital does not flow from rich to poor
countries.
Misallocation
Institutions can also affect the efficiency with which factors of production are allocated. For
example, in making loans, corruption might lead banks to favour those with political
connections over entrepreneurs with the best business ideas. Consequently, the economy’s
supply of capital 𝐾𝐾 and labour 𝑁𝑁 is not allocated to the businesses that can add the most to
GDP π‘Œπ‘Œ. This shows up as a lower level of TFP 𝑧𝑧 in a country.
More generally, institutions affect incentives to pursue ‘rent-seeking’ rather than productive
activities. Productive activities are those that increase the total size of the ‘pie’, that is,
investments in physical and human capital, new businesses, and new technologies. Rentseeking refers to activities that give someone a larger share of the pie but do not increase its
size. For example, someone might seek a position of political power to benefit from
corruption. Institutions that reward rent-seeking activities thus divert effort from growing
the pie, which leads to lower TFP.
2.5 The golden rule
We have seen in the Solow model that a higher saving rate allows the economy to sustain
more capital per worker in steady state and thus obtain a permanently higher level of
income per worker. Does this mean that more saving is always a good thing? Although
income is higher, since a greater fraction of it is saved, consumption per worker might end
up being lower. Underlying this is the need to maintain in steady state the capital
accumulated through more saving and these resources used for investment cannot also be
used for consumption.
This logic suggests we should look for a saving rate 𝑠𝑠 that maximises consumption per
worker 𝑐𝑐 = 𝐢𝐢/𝑁𝑁. Since 𝐢𝐢 = π‘Œπ‘Œ − 𝑆𝑆 and 𝑆𝑆 = 𝑠𝑠𝑠𝑠, consumption is 𝐢𝐢 = (1 − 𝑠𝑠)π‘Œπ‘Œ and the
equation for consumption per worker is 𝑐𝑐 = (1 − 𝑠𝑠)𝑦𝑦. Taking 𝑦𝑦 as given in the short run, 𝑐𝑐
62
EC2065 Macroeconomics | Chapter 2: Economic growth
is maximised by saving nothing. However, that level of income 𝑦𝑦 cannot be sustained with
zero saving because the capital stock needed to produce output would eventually be
depleted.
Instead, we ask what is the highest sustainable level of 𝑐𝑐, and which saving rate is needed to
attain it? A sustainable value of 𝑐𝑐 is one that is a steady state for a particular saving rate 𝑠𝑠.
For extreme values of 𝑠𝑠, we can say immediately what the steady state for 𝑐𝑐 results. Having
𝑠𝑠 = 1 results in 𝑐𝑐 = 0, even though steady-state income 𝑦𝑦 might be very high. Having 𝑠𝑠 = 0
results in 𝑐𝑐 = 0 because the steady-state value of π‘˜π‘˜ with zero saving is zero and this implies
that 𝑦𝑦 = 0 because some capital is essential for production. With the extreme values of 𝑠𝑠
leading to the worse possible outcomes for 𝑐𝑐 in steady state, it follows that an intermediate
saving rate with 0 < 𝑠𝑠 < 1 is optimal.
2.5.1 Finding the golden rule
To find which saving rate is best, we first ask what saving rate 𝑠𝑠 is necessary to sustain a
particular level of capital per worker π‘˜π‘˜ in steady state. The required saving rate must satisfy
𝑠𝑠𝑠𝑠𝑠𝑠(π‘˜π‘˜) = (𝑑𝑑 + 𝑛𝑛)π‘˜π‘˜. Given π‘˜π‘˜, output per worker is 𝑦𝑦 = 𝑧𝑧𝑧𝑧(π‘˜π‘˜), which implies consumption
per worker is 𝑐𝑐 = (1 − 𝑠𝑠)𝑦𝑦 = 𝑧𝑧𝑧𝑧(π‘˜π‘˜) − 𝑠𝑠𝑠𝑠𝑠𝑠(π‘˜π‘˜). By substituting the equation with the saving
rate 𝑠𝑠, steady-state consumption per worker 𝑐𝑐 is only a function of the steady-state π‘˜π‘˜:
𝑐𝑐 = 𝑧𝑧𝑧𝑧(π‘˜π‘˜) − (𝑑𝑑 + 𝑛𝑛)π‘˜π‘˜
This equation can be interpreted as saying steady-state consumption can be found
geometrically in the Solow model as the difference in height between the per-worker
production function and the effective depreciation line. The logic is that consumption is the
difference between the production function and the saving line by definition but, in steadystate, the effective depreciation line has the same height as the saving line.
Figure 2.14 illustrates this argument, showing the steady-state levels of capital per worker
and consumption per worker associated with low, medium and high saving rates. Note that
it is not necessary to know the saving rate associated with a steady-state for π‘˜π‘˜ to calculate
the implied steady state for 𝑐𝑐.
63
EC2065 Macroeconomics | Chapter 2: Economic growth
Figure 2.14: Steady states for different saving rates
The golden rule level of capital per worker is the steady state where consumption per
worker is maximised. Mathematically, this can be found by differentiating the expression for
steady-state 𝑐𝑐 with respect to steady-state π‘˜π‘˜, and the first-order condition is πœ•πœ•πœ•πœ•⁄πœ•πœ•πœ•πœ• = 0:
𝑧𝑧𝑓𝑓 ′ (π‘˜π‘˜) = 𝑑𝑑 + 𝑛𝑛
Therefore, the golden rule calls for capital to be accumulated up to the point where the
marginal product of capital 𝑀𝑀𝑃𝑃𝐾𝐾 = 𝑧𝑧𝑓𝑓 ′ (π‘˜π‘˜) is equal to the effective depreciation rate 𝑑𝑑 + 𝑛𝑛.
This first-order condition is illustrated geometrically in Figure 2.15. The golden rule is the
steady state for π‘˜π‘˜ where the tangent to the per worker production function has the same
gradient as the effective depreciation line. This maximises the difference between the
heights of the production function and the effective depreciation line, with this difference
being the steady-state consumption per worker.
Figure 2.15: Golden rule diagram
Intuitively, the golden rule equation reflects a comparison of the marginal benefits and costs
of a higher level of capital per worker in steady state. The marginal benefit of maintaining an
64
EC2065 Macroeconomics | Chapter 2: Economic growth
extra unit of capital per worker is the additional output per worker 𝑀𝑀𝑃𝑃𝐾𝐾 each year. The
marginal cost of maintaining a stock of capital per worker permanently one unit higher is
the depreciation cost 𝑑𝑑 and the new capital 𝑛𝑛 required for new workers each year. The
difference between the benefit and the cost is extra output that can sustainably be
consumed, so the highest steady state for consumption per worker is where the marginal
benefit equals the marginal cost.
Once the golden rule level of π‘˜π‘˜οΏ½ is found, the saving rate 𝑠𝑠̂ that brings the economy to that
steady state is found from the equation 𝑠𝑠𝑠𝑠𝑠𝑠 (π‘˜π‘˜) = (𝑑𝑑 + 𝑛𝑛)π‘˜π‘˜:
𝑠𝑠̂ =
(𝑑𝑑 + 𝑛𝑛)π‘˜π‘˜οΏ½
𝑧𝑧𝑧𝑧(π‘˜π‘˜οΏ½)
2.5.2 Transitional paths to the golden rule
While the golden rule maximises steady-state consumption per worker, it does not
necessarily follow that households should choose that saving rate because there may be a
cost in terms of lower consumption during a transitional period to reach the golden rule.
Starting from a steady state with a saving rate below the golden rule, the required increase
in saving initially lowers consumption because the higher long-run capital and output that
results from more saving does not become available instantaneously – there is always a
transitional period in moving from one steady state to another. This case of a short-term
sacrifice of consumption for a long-term gain is illustrated in the left panel of Figure 2.16.
Figure 2.16: Transitional paths to the golden rule
However, starting from a steady state with a saving rate above the golden rule, the required
reduction in saving to reach the golden rule raises consumption during the transitional
period as well as the long run. In the short run, consumption per worker is actually higher
than in the long run because capital and output per worker are falling as the economy
approaches the golden rule steady state. This case is depicted in the right panel of Figure
2.16.
An economy with more capital in steady state than the golden rule level is said to be
dynamically inefficient because extra consumption in the long run can be obtained without
any short-run sacrifice. It is possible to raise consumption per worker in all subsequent years
65
EC2065 Macroeconomics | Chapter 2: Economic growth
by saving less. Where this is not true, so that some sacrifice now is necessary for a gain in
the future, the economy is said to be dynamically efficient.
If all households in the economy were alike (a representative household), then it is irrational
to continue to save more than the golden rule because everyone could be better off by
saving less. However, if saving is done by a young generation to accumulate assets for
retirement in old age, then it is possible to have a dynamically inefficient economy where
savers do not want to save less. We discuss dynamic inefficiency further in the ‘overlapping
generations’ economy introduced in Section 4.7.
2.5.3 Testing for dynamic inefficiency
How is it possible to know whether or not an economy is dynamically inefficient? Here, we
briefly discuss two possible tests. The first test compares the real interest rate to the growth
rate of the economy. The second test compares the capital share of income to the share of
investment expenditure in GDP.
In an economy with competitive markets, the gross return on capital is equal to the
marginal product of capital, that is, 𝑅𝑅 = 𝑀𝑀𝑃𝑃𝐾𝐾 . The real interest rate received by savers
would be equal to the real rate of return on capital after allowing for depreciation costs,
which means that π‘Ÿπ‘Ÿ = 𝑅𝑅 − 𝑑𝑑. In a steady state for capital per worker, π‘˜π‘˜ and 𝑦𝑦 are constant
over time, so total GDP π‘Œπ‘Œ = 𝑁𝑁𝑁𝑁 grows at the same rate 𝑛𝑛 as the population 𝑁𝑁. If the
economy’s real GDP growth rate 𝑛𝑛 exceeds the real interest rate π‘Ÿπ‘Ÿ then:
𝑛𝑛 > 𝑀𝑀𝑃𝑃𝐾𝐾 − 𝑑𝑑
This implies 𝑀𝑀𝑃𝑃𝐾𝐾 < 𝑑𝑑 + 𝑛𝑛, and therefore the economy is dynamically inefficient because it
has more capital per worker in steady state than the golden rule level that satisfies 𝑀𝑀𝑃𝑃𝐾𝐾 =
𝑑𝑑 + 𝑛𝑛 (recalling that 𝑀𝑀𝑃𝑃𝐾𝐾 diminishes with π‘˜π‘˜).
The second test for dynamic inefficiency compares the capital share of income to the
investment share of expenditure. In a competitive economy, gross capital income is equal to
𝑅𝑅𝑅𝑅 = 𝑀𝑀𝑃𝑃𝐾𝐾 𝐾𝐾, so the capital share of all gross income is 𝑀𝑀𝑃𝑃𝐾𝐾 𝐾𝐾/π‘Œπ‘Œ. By definition of π‘˜π‘˜ = 𝐾𝐾/𝑁𝑁
and 𝑦𝑦 = π‘Œπ‘Œ/𝑁𝑁, the capital-output ratio can also be written as 𝐾𝐾⁄π‘Œπ‘Œ = π‘˜π‘˜⁄𝑦𝑦. Using 𝑦𝑦 = 𝑧𝑧𝑧𝑧(π‘˜π‘˜)
and 𝑀𝑀𝑃𝑃𝐾𝐾 = 𝑧𝑧𝑧𝑧′(π‘˜π‘˜), it follows that the capital share of income is 𝑧𝑧𝑓𝑓 ′ (π‘˜π‘˜)π‘˜π‘˜⁄(𝑧𝑧𝑧𝑧(π‘˜π‘˜)).
The investment share of total expenditure is 𝐼𝐼/π‘Œπ‘Œ, which is equal to 𝑆𝑆/π‘Œπ‘Œ and to the saving
rate 𝑠𝑠 in the basic Solow model. The steady-state level of capital per worker π‘˜π‘˜ satisfies the
equation 𝑠𝑠𝑠𝑠𝑠𝑠 (π‘˜π‘˜) = (𝑑𝑑 + 𝑛𝑛)π‘˜π‘˜, so the investment share of GDP is 𝑠𝑠 = (𝑑𝑑 + 𝑛𝑛)π‘˜π‘˜/(𝑧𝑧𝑧𝑧 (π‘˜π‘˜)). If
the investment share is greater than the capital income share:
(𝑑𝑑 + 𝑛𝑛)π‘˜π‘˜ 𝑧𝑧𝑓𝑓 ′ (π‘˜π‘˜)π‘˜π‘˜
>
𝑧𝑧𝑧𝑧(π‘˜π‘˜)
𝑧𝑧𝑧𝑧(π‘˜π‘˜)
By cancelling terms in π‘˜π‘˜ and 𝑧𝑧𝑧𝑧(π‘˜π‘˜) from both sides, we that this holds if 𝑧𝑧𝑓𝑓 ′ (π‘˜π‘˜) < 𝑑𝑑 + 𝑛𝑛,
which is the condition for dynamical inefficiency.
In conclusion, dynamically inefficient economies have low real interest rates relative to real
GDP growth, or high levels of investment expenditure relative to capital income.
66
EC2065 Macroeconomics | Chapter 2: Economic growth
Box 2.4: Climate change and the economy
Climate change has become an increasing concern for the welfare of future
generations. How might climate change affect long-run economic outcomes
according to the Solow model?
One way is through disrupting agriculture and generally raising the costs of economic
activity. This means less output is produced from a given amount of factor inputs,
which is equivalent to lower total factor productivity 𝑧𝑧. We can analyse effects of this
kind as the opposite of the better technology studied earlier in Section 2.3.
Another way climate change matters is that more frequent natural disasters such as
flooding raise the costs of maintaining the capital stock, for example, repairing
damage and rebuilding. Furthermore, existing capital becomes obsolete faster when
greener technologies are adopted, most notably creating a need to replace capital
used in the transportation and energy sectors of the economy. Both of these
considerations point to a higher depreciation rate 𝑑𝑑 of capital.
The effects of a higher depreciation rate 𝑑𝑑 on long-run economic outcomes are
studied in Figure 2.17. The effective depreciation line becomes steeper and pivots to
the left. This implies the economy’s new steady state has less capital per worker π‘˜π‘˜ ∗ .
Intuitively, if capital is more costly to maintain due to climate change, or needs to be
replaced by greener capital more frequently, a given saving rate sustains less capital
per worker. With less capital per worker, output per worker is reduced. This negative
effect on 𝑦𝑦 is in addition to any direct negative effects through lower 𝑧𝑧.
Figure 2.17: Higher depreciation rate of capital
How might economies mitigate these negative economic effects? In particular, should
economies save more to compensate? One benchmark for an appropriate level of the
saving rate 𝑠𝑠 is the golden rule, which says that the goal of the highest sustainable
level of consumption per person is at steady state for capital per worker with 𝑀𝑀𝑃𝑃𝐾𝐾 =
𝑑𝑑 + 𝑛𝑛. A higher depreciation rate 𝑑𝑑 thus reduces the golden rule capital stock. With
67
EC2065 Macroeconomics | Chapter 2: Economic growth
no change in the saving rate, we have already seen that steady-state capital per
worker will decline with higher 𝑑𝑑, so it is not clear whether the golden rule calls for a
higher or lower saving rate 𝑠𝑠.
A higher depreciation rate 𝑑𝑑 reduces steady-state output per worker 𝑦𝑦 ∗ but how large
is this effect? Consider an example we used earlier in Box 2.1 to study the effect of
differences in saving rates across countries. The production function is the CobbDouglas form π‘Œπ‘Œ = 𝑧𝑧𝐾𝐾 𝛼𝛼 𝑁𝑁 1−𝛼𝛼 with 𝛼𝛼 = 1/3. With a Cobb-Douglas production function,
steady-state output per worker is:
∗
𝑦𝑦 =
1
𝑧𝑧 1−𝛼𝛼
𝛼𝛼
𝑠𝑠 1−𝛼𝛼
οΏ½
οΏ½
𝑑𝑑 + 𝑛𝑛
We can find the effect of going from depreciation rate 𝑑𝑑1 to 𝑑𝑑2 by taking the ratio of
the resulting steady-state levels of 𝑦𝑦2∗ and 𝑦𝑦1∗ holding 𝑧𝑧 fixed:
1 2
𝑦𝑦2∗
𝑑𝑑1 + 𝑛𝑛 3οΏ½3
𝑑𝑑1 + 𝑛𝑛
=
οΏ½
οΏ½
=οΏ½
∗
𝑦𝑦1
𝑑𝑑2 + 𝑛𝑛
𝑑𝑑2 + 𝑛𝑛
Assuming a population growth rate 𝑛𝑛 = 1%, and a depreciation rate that rises just
one percentage point from 𝑑𝑑1 = 9% to 𝑑𝑑2 = 10%, we obtain:
0.1
𝑦𝑦2∗
οΏ½
≈ √0.91 ≈ 0.95
∗ =
𝑦𝑦1
0.11
Therefore, when capital becomes obsolete or is destroyed 11% faster than before
(going from 9% to 10%), output per worker is 5% lower in the long run.
2.6 The AK model
The diminishing marginal product of capital lies behind many of the troubling predictions of
the Solow model that we have seen earlier. First, its failure to generate long-run economic
growth. Second, the modest differences in income levels it predicts even when there are
large differences in saving rates. Third, the extremely large differences in the implied return
to capital that result from the observed distribution of income across countries.
To try to overcome these failures of the Solow model, we now look at an alternative model
with constant returns to capital. As there are now no diminishing returns to capital, the
production function does not satisfy the neoclassical assumptions. All other assumptions of
the Solow model are maintained for comparison.
2.6.1 The AK production function
We assume that 𝐹𝐹 (𝐾𝐾, 𝑁𝑁) = 𝐾𝐾, so the production function is π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧. This is often written
with 𝐴𝐴 denoting TFP 𝑧𝑧:
68
EC2065 Macroeconomics | Chapter 2: Economic growth
π‘Œπ‘Œ = 𝐴𝐴𝐴𝐴
The resulting model is known as the ‘AK’ model after the equation for its production
function. In this model, TFP 𝐴𝐴 is assumed to be constant over time, so there is no exogenous
source of long-run growth.
Although the AK production function is not neoclassical, it still has constant returns to scale,
so it can be represented by a per-worker production function as in the Solow model. With
𝑓𝑓(π‘˜π‘˜) = 𝐹𝐹(π‘˜π‘˜, 1) = π‘˜π‘˜, it follows that the per worker production function is 𝑦𝑦 = 𝐴𝐴𝐴𝐴, which can
also be seen directly by dividing π‘Œπ‘Œ = 𝐴𝐴𝐴𝐴 by 𝑁𝑁.
The marginal product of capital in the AK model is 𝑀𝑀𝑃𝑃𝐾𝐾 = 𝐴𝐴𝑓𝑓 ′ (π‘˜π‘˜) = 𝐴𝐴, which is a constant.
This does not change with the amount of capital accumulated because there are no
diminishing returns to capital. It is also constant over time because there is no growth in TFP
𝐴𝐴 in this model.
As all of the other assumptions of the Solow model are maintained, the same equation for
the change over time in capital per worker is applicable here, the only difference being that
the per-worker production function 𝑦𝑦 = 𝐴𝐴𝐴𝐴 is used in place of 𝑦𝑦 = 𝑧𝑧𝑧𝑧(π‘˜π‘˜):
π‘˜π‘˜ ′ − π‘˜π‘˜ =
𝑠𝑠𝑠𝑠𝑠𝑠 − (𝑑𝑑 + 𝑛𝑛)π‘˜π‘˜
(1 + 𝑛𝑛)
The dynamics of capital per worker and output per worker can thus be analysed using a
diagram similar to the one in the Solow model. Crucially, since the per worker production
function is linear, it and the saving line in the equivalent of the Solow diagram are now
straight lines. The AK model diagram is shown in Figure 2.18.
Figure 2.18: The AK model
The key difference compared to the Solow model diagram is that there is no intersection
between the saving and effective depreciation lines and, hence, no steady state (apart from
the uninteresting steady state at zero). This is because the saving line is a straight line,
69
EC2065 Macroeconomics | Chapter 2: Economic growth
reflecting the absence of diminishing returns to capital, unlike its concave shape in the
Solow model.
If 𝑠𝑠𝑠𝑠 > 𝑑𝑑 + 𝑛𝑛 then the saving line is steeper than the effective depreciation line and
therefore the change in capital per worker π‘˜π‘˜ ′ − π‘˜π‘˜ is always positive. This implies there is
perpetual growth with no convergence to a steady state. Moreover, the production function
𝑦𝑦 = 𝐴𝐴𝐴𝐴 implies output per worker 𝑦𝑦 grows at same rate as capital per worker π‘˜π‘˜:
𝑦𝑦 ′ − 𝑦𝑦 π‘˜π‘˜ ′ − π‘˜π‘˜ 𝑠𝑠𝑠𝑠 − (𝑑𝑑 + 𝑛𝑛)
=
=
𝑦𝑦
π‘˜π‘˜
1 + 𝑛𝑛
2.6.2 Endogenous growth
The AK model is able to generate long-run growth endogenously through capital
accumulation if the saving rate is high enough. Intuitively, more capital allows more output
to be produced, some of which is saved, which is ploughed back into capital accumulation,
leading to more output. Unlike the Solow model, there are no diminishing returns to capital
that weaken this feedback loop.
Moreover, the economy’s rate of growth depends on the saving rate 𝑠𝑠. In contrast to the
Solow model, changing the saving rate has a permanent effect on growth rate, not only on
level of income. An economy can enjoy a faster long-run growth rate by choosing to save a
higher fraction of income.
Although the AK model makes some interesting predictions, the AK production function
itself is problematic. As it completely excludes labour input 𝑁𝑁, the implied capital share of
income is 100 per cent – the AK production function is a special case of a Cobb-Douglas with
𝛼𝛼 = 1. However, including labour input 𝑁𝑁 in the production function while preserving
linearity in capital input 𝐾𝐾 would result in a production function with increasing returns to
scale. Nonetheless, there are alternative models of endogenous growth with similar
predictions to the AK model that make more plausible assumptions about the production
function.
Box 2.5: Endogenous growth and divergence between countries
Theories that generate endogenous growth through physical capital accumulation
such as the AK model imply that the saving rate has a permanent effect on the
economy’s long-run growth rate of output per worker. Higher saving rates lead to
faster long-run growth. This means that a higher saving rate always leads to higher
consumption per worker eventually in the long run. Figure 2.19 shows the trajectories
of (log) income and consumption per worker for an economy where the saving rate
rises. Recall that a constant growth rate corresponds to a straight-line trajectory when
variables are plotted as logarithms.
70
EC2065 Macroeconomics | Chapter 2: Economic growth
Figure 2.19: Effects of a higher saving rate
In contrast, the Solow model predicts that a higher saving rate only raises the long-run
steady-state level of income per worker, without any effect on the long-run growth
rate. This means saving too much has a negative effect on long-run consumption and
there is a golden rule saving rate that maximises long-run consumption. In the AK
model, saving more (as long as the saving rate is less than 100 per cent) always
increases consumption in the long run.
Another important feature of endogenous growth models is that they do not generate
convergence among economies. For countries with the same TFP level, same
population growth and depreciation rates and the same saving rate, the AK model
implies the growth rate of income per worker is the same irrespective of whether a
country starts with more or less capital per worker than others. This is illustrated in
the left panel of Figure 2.20.
Figure 2.20: No convergence, or divergence, across countries
71
EC2065 Macroeconomics | Chapter 2: Economic growth
Moreover, endogenous growth also means that small but persistent differences in
saving rates can explain large gaps in income per worker across countries. This is
because different saving rates imply different growth rates, which when compounded
over time can open up very large gaps in income, as shown in the right panel of Figure
2.20. We have seen that the Solow model could not explain large income differences
even with large differences in saving rates.
Finally, observe that the AK model predicts there is no difference between the return on
capital in poor and rich countries. If countries share the same level of TFP, the marginal
product of capital is the same everywhere. This would resolve the puzzle of why capital
does not flow from rich to poor countries.
2.7 Learning by doing
We now consider some alternative routes to generating endogenous economic growth
beyond the direct assumption of an AK production function. We do this because the AK
production function itself does not seem very plausible. The first of these alternative routes
to endogenous growth is known as ‘learning by doing’.
Learning-by-doing is when skills or knowledge are accumulated as a by-product of the
production process. We will look at a model with the feature that when new capital is
installed, workers using the capital discover through use how to deploy it most effectively.
Learning-by-doing models claim that capital accumulation leads to a positive ‘spillover’ or
externality that increases the economy’s stock of ideas and knowledge. As knowledge and
ideas are non-rival, all firms can benefit from greater productivity in using capital. This
supposes firms cannot have intellectual property rights over any knowledge discovered
through learning by doing. We return to that issue in Section 2.9 when discussing research
and development.
Suppose production is done by perfectly competitive firms using production function π‘Œπ‘Œπ‘–π‘– =
𝐹𝐹(𝐾𝐾𝑖𝑖 , 𝐴𝐴𝑁𝑁𝑖𝑖 ). Firm 𝑖𝑖 uses capital 𝐾𝐾𝑖𝑖 and hires labour 𝑁𝑁𝑖𝑖 to produce output π‘Œπ‘Œπ‘–π‘– . There is labouraugmenting technology 𝐴𝐴 that is common to all firms because knowledge is a public good
(non-rival and non-excludable). Unlike the AK model, here the production function
𝐹𝐹(𝐾𝐾𝑖𝑖 , 𝐴𝐴𝑁𝑁𝑖𝑖 ) at the level of individual firms is neoclassical, so there are diminishing returns to
capital.
Each firm 𝑖𝑖 hires labour 𝑁𝑁𝑖𝑖 up to the point where the wage 𝑀𝑀 equals the marginal product of
labour 𝑀𝑀𝑃𝑃𝑁𝑁𝑖𝑖 . As all competitive firms face the same wage, the marginal products of labour
and capital end up being the same across all firms. These marginal products then determine
the distribution of income between labour and capital in the usual way. Given knowledge 𝐴𝐴,
the production function for whole economy π‘Œπ‘Œ = 𝐹𝐹(𝐾𝐾, 𝐴𝐴𝐴𝐴) is the same as the one for an
individual firm but with π‘Œπ‘Œ being GDP, and 𝐾𝐾 and 𝑁𝑁 being the total capital stock and labour
force. Assume the aggregate supply of labour 𝑁𝑁 is constant because there is no population
growth (𝑛𝑛 = 0).
72
EC2065 Macroeconomics | Chapter 2: Economic growth
The key assumption of the learning-by-doing model is that economy-wide knowledge 𝐴𝐴 rises
with the economy’s capital stock 𝐾𝐾. Specifically, we assume the two variables are
proportional with πœ†πœ† > 0 being a constant:
𝐴𝐴 = πœ†πœ†πœ†πœ†
By substituting 𝐴𝐴 = πœ†πœ†πœ†πœ† into π‘Œπ‘Œ = 𝐹𝐹(𝐾𝐾, 𝐴𝐴𝐴𝐴), the aggregate production function becomes
π‘Œπ‘Œ = 𝐹𝐹(𝐾𝐾, πœ†πœ†πœ†πœ†πœ†πœ†). The constant-returns-to-scale property of the production function then
implies π‘Œπ‘Œ = 𝐹𝐹(1, πœ†πœ†πœ†πœ†)𝐾𝐾. As 𝐹𝐹(1, πœ†πœ†πœ†πœ†) is a constant, this has the same form as an ‘AK’
production function π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧, where the constant 𝑧𝑧 is 𝑧𝑧 = 𝐹𝐹(1, πœ†πœ†πœ†πœ†).
Intuitively, the spill over effect from the aggregate capital stock to knowledge offsets the
usual diminishing returns to capital. If labour-augmenting technology 𝐴𝐴 is proportional to
capital 𝐾𝐾, the spill over cancels out diminishing returns and implies constant returns to
capital overall. This makes it equivalent to an ‘AK’ production function in the aggregate,
even though individual production functions at the level of firms are neoclassical. Therefore,
it is possible to generate endogenous growth without having directly to assume the unusual
features of the AK production function itself.
However, even if there is a spill over from capital accumulation to knowledge, the model’s
implications can change substantially if knowledge is not proportional to capital. If following
a 1 per cent increase in 𝐾𝐾, knowledge 𝐴𝐴 rises by less than 1 per cent then there are
diminishing returns to aggregate capital and it is not possible to generate endogenous longrun growth. On the other hand, if 𝐴𝐴 rises more than 1 per cent following the 1 per cent rise
in 𝐾𝐾 then there are increasing returns to aggregate capital. This would imply endogenous
growth but, unrealistically, the economy’s growth rate would increase over time.
Unfortunately, the version of the model with stable endogenous growth resembles a special
case for which there is no clear support.
Even in the case where 𝐴𝐴 and 𝐾𝐾 are proportional, the learning-by-doing model implies what
are known as ‘scale effects’. This means that the economy’s growth rate of income per
person depends positively on the size of the population, a prediction that has a lack of
empirical support. This feature of the learning-by-doing model results from a larger
economy with more people producing more non-rival knowledge from which everyone
benefits. Mathematically, it can be seen from the aggregate production function displaying
increasing returns to capital 𝐾𝐾 and labour 𝑁𝑁 together.
2.8 Human capital
In the models we have seen so far, labour input 𝑁𝑁 is simply the number of workers, or
sometimes the number of hours worked, as in the model of the choice of labour supply. But
in practice, the amount of effective labour input that goes into producing goods and
services depends on ‘human capital’. This concept refers to the education, skills and training
of workers that affects their productivity. Crucially, like physical capital, human capital is
something that can be accumulated. By devoting time and resources to improve education
or training, an economy can increase its stock of human capital.
73
EC2065 Macroeconomics | Chapter 2: Economic growth
How might an economy’s stock of human capital be measured? It is possible to look at data
on average years of schooling and the fraction of college graduates in the population.
Combined with research on wages that estimates the effect of, for example, an extra year of
education on workers’ pay, a measure of human capital based on the wage premiums
associated with education/training can be constructed.
Although we have seen models earlier where effective labour input depended on
technology, it is important to note that human capital is different from the notions of ideas
and technology. Human capital is embodied in a worker who has learned a particular skill –
it is not the abstract concept or discovery of the knowledge for the first time.
We now consider a model of human capital accumulation as a potential source of
endogenous long-run growth. The economy’s stock of human capital is denoted by 𝐻𝐻.
Assume the labour force 𝑁𝑁 has a constant size, so a change in 𝐻𝐻 reflects a change in average
human capital per person. Output of final goods and services π‘Œπ‘Œ is produced with physical
capital 𝐾𝐾 and human capital according to the neoclassical production function:
π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑒𝑒𝑒𝑒)
In the production function, 𝑒𝑒 denotes fraction of human capital 𝐻𝐻 used to produce goods.
As we will see, some existing human capital is dedicated to producing future human capital,
not goods and services in the present. We assume TFP 𝑧𝑧 is constant over time, and continue
to maintain all the other assumptions about capital and goods from the Solow model.
Human capital itself needs to be produced using existing human capital. Given 𝑒𝑒, the
amount of existing 𝐻𝐻 producing new human capital is (1 − 𝑒𝑒)𝐻𝐻. We assume a production
function 𝐺𝐺(βˆ™) for producing new human capital. The only input is (1 − 𝑒𝑒)𝐻𝐻, although it is
possible to generalise this making production of human capital depend on both physical and
human capital.
The equation for next time period’s human capital 𝐻𝐻′ is:
𝐻𝐻′ = (1 − 𝑑𝑑𝐻𝐻 )𝐻𝐻 + 𝐺𝐺((1 − 𝑒𝑒)𝐻𝐻)
The first term represents undepreciated current human capital that remains usable in the
future, assuming the rate at which human capital depreciates is a constant 𝑑𝑑𝐻𝐻 . The
dynamics of human capital are therefore determined by:
𝐻𝐻′ − 𝐻𝐻 = 𝐺𝐺�(1 − 𝑒𝑒)𝐻𝐻� − 𝑑𝑑𝐻𝐻 𝐻𝐻
The difference between the terms on the right-hand side is the difference between a ‘saving
line’ 𝐺𝐺�(1 − 𝑒𝑒)𝐻𝐻�, the shape of which depends on the function 𝐺𝐺(βˆ™), and a depreciation line
𝑑𝑑𝐻𝐻 𝐻𝐻, which is a straight line. A diagram for human capital accumulation analogous to the
Solow model diagram is displayed in Figure 2.21.
74
EC2065 Macroeconomics | Chapter 2: Economic growth
Figure 2.21: Human capital accumulation
The predictions of the human capital model depend on whether the returns to producing
human capital are diminishing or constant. The case of diminishing returns is shown in the
right panel of the figure. The human capital production function 𝐺𝐺(βˆ™) is concave, which
implies it intersects the depreciation line and there exists a steady state 𝐻𝐻 ∗ for the stock of
human capital. As the 𝐻𝐻 entering the goods production function does not grow in the long
run, diminishing returns to physical capital in producing goods imply that there is no longrun economic growth by the usual Solow model logic.
But what happens if there are constant returns to producing human capital? In this case, the
function 𝐺𝐺 (βˆ™) is linear. There is no intersection between the saving and depreciation lines
for human capital (except at zero), so it is possible to have a constant positive growth rate of
𝐻𝐻 in the long run. The implications for the production of goods and services is then exactly
equivalent to the Solow model with labour-augmenting technological progress studied in
Section 2.3. Mathematically, constant growth in workers’ human capital 𝐻𝐻 plays exactly the
same role as constant growth of labour-augmenting technology 𝐴𝐴.
An increase in the amount of resources devoted to education and training can be
represented by a decline in 𝑒𝑒 and an increase in 1 − 𝑒𝑒. If the human capital model
generates positive long-run growth, then higher 1 − 𝑒𝑒 increases the economy’s long-run
growth rate. Although there is certain to be a benefit in the long run, this is not a free lunch
because there is a cost during a transitional period with lower 𝑒𝑒 implying lower output π‘Œπ‘Œ
starting from the initial level of 𝐻𝐻.
If the model does not generate long-run growth, then reducing 𝑒𝑒 has two conflicting effects
on the long-run level of output. First, it leads to a higher steady-state stock of human
capital, which boosts output. Second, it reduces the share of human capital used to produce
goods, which lowers output of goods and services. There is an optimal choice of 𝑒𝑒 to
maximise long-run output level analogous to the Golden Rule for capital accumulation.
2.9 Research and development
Advances in ideas, technologies, and production techniques are central to explaining longrun economic growth in several of the models we have seen so far. The Solow model with
75
EC2065 Macroeconomics | Chapter 2: Economic growth
exogenous improvements in labour-augmenting technology from Section 2.3 showed how
technological progress can drive long-run growth but the improvements in technology there
were treated as exogenous. The learning-by-doing model from Section 2.7 featured
endogenous discovery of new ideas and techniques as a by-product of installing new capital.
capital accumulation. However, since ideas are non-excludable public goods in that model,
firms did not have any incentive consciously to try to make new discoveries.
But we see that firms and governments devote considerable resources to research and
development (R&D). R&D activity directly aims to discover new ideas and technologies, with
countries at the frontiers of research often spending 2–3 per cent of GDP on R&D. The R&D
model introduced in this section studies growth through the deliberate accumulation of
ideas and technologies rather than of physical or human capital.
2.9.1 Non-rivalrous but excludable technologies
For private firms to undertake R&D activity, there needs to be some protection of
intellectual property rights such as patents. Some excludability of newly discovered ideas
and technologies is necessary to earn profits from R&D otherwise other firms would copy
any innovations and compete away the profits. This is what happens in the learning-bydoing model, where technology only improves because some ideas arise as a by-product of
using new capital.
Physical and human capital, along with most consumption goods, are rivalrous and
excludable private goods. Rivalrous means the same capital or good cannot be enjoyed by
multiple users simultaneously. Excludable means that private ownership of the capital or
good is enforced. Ideas are by their nature are non-rivalrous. One firm or person using a
new idea or technology does not prevent others benefitting from using the same idea.
However, ideas may be excludable to some extent if there are legal restriction on using
others’ discoveries, for example, patent protection.
The stock of ideas and knowledge in the economy is represented by labour-augmenting
technology 𝐴𝐴. The production function for final goods and services is:
π‘Œπ‘Œ = 𝐹𝐹(𝐾𝐾, 𝑒𝑒𝑒𝑒𝑒𝑒)
The fraction of the labour force 𝑁𝑁 who produce goods and services is 𝑒𝑒, so 𝑒𝑒𝑒𝑒 is labour
input in the production function above, augmented by technology 𝐴𝐴. The labour force 𝑁𝑁 is
assumed to have a constant size (no population growth, 𝑛𝑛 = 0).
The production function has the neoclassical properties, including constant returns to scale
with respect to inputs of capital 𝐾𝐾 and effective labour 𝐴𝐴𝐴𝐴. Note that this implies constant
returns to scale with respect to 𝐾𝐾 and 𝑁𝑁 but not with respect to 𝐾𝐾, 𝑁𝑁 and 𝐴𝐴 together. Since
technology 𝐴𝐴 is non-rivalrous, a doubling of the rivalrous inputs of capital 𝐾𝐾 and labour 𝑁𝑁 is
sufficient to double output. Ideas do not need to be discovered again for additional capital
and labour to use them in producing more output.
76
EC2065 Macroeconomics | Chapter 2: Economic growth
2.9.2 A production function for ideas
New ideas are produced in the R&D sector, which adds to the stock of knowledge 𝐴𝐴. We will
write down a production function for ideas, analogous to the production functions we have
used for goods and human capital. For simplicity, only labour input is used in the R&D
sector, though generalising this to have capital used as well to produce ideas would not
fundamentally change this model.
The R&D sector employs (1 − 𝑒𝑒)𝑁𝑁 workers, a fraction 1 − 𝑒𝑒 of the labour force. We assume
the production function for new knowledge 𝐴𝐴′ − 𝐴𝐴 has the following form:
𝐴𝐴′ − 𝐴𝐴 =
(1 − 𝑒𝑒)𝐴𝐴𝐴𝐴
πœ‡πœ‡
The parameter πœ‡πœ‡ measures the cost of innovation, in that labour input (1 − 𝑒𝑒)𝑁𝑁 must be
0.01πœ‡πœ‡ to get 𝐴𝐴 to grow by 1 per cent. It is inversely related to the productivity of
researchers in growing the stock of knowledge. An important contrast to earlier equations
for the dynamics of physical capital 𝐾𝐾 ′ − 𝐾𝐾 or human capital 𝐻𝐻′ − 𝐻𝐻 is that there is no
depreciation of ideas. Knowledge, once discovered, is never lost.
2.9.3 Endogenous growth
The production function for new ideas implies that future technology 𝐴𝐴′ is:
𝐴𝐴′ = οΏ½1 +
(1 − 𝑒𝑒)𝑁𝑁
� 𝐴𝐴
πœ‡πœ‡
This means the growth rate (𝐴𝐴′ − 𝐴𝐴)/𝐴𝐴 of the stock of ideas 𝐴𝐴 is (1 − 𝑒𝑒)𝑁𝑁⁄πœ‡πœ‡. The linearity
of the production function for new ideas implies constant returns to R&D activity, i.e. the
same workforce (1 − 𝑒𝑒)𝑁𝑁 in the R&D sector can produce new ideas at a constant rate.
Given 𝑒𝑒, πœ‡πœ‡, and 𝑁𝑁, the R&D model predicts a constant positive growth rate of 𝐴𝐴. The
implications for real income per worker can then be analysed using the framework from
Section 2.3 where there was exogenous labour-augmenting technological progress. With a
constant growth rate of 𝐴𝐴, there is positive long-run growth in output π‘Œπ‘Œ (and, hence, output
per worker here) at the same rate (1 − 𝑒𝑒)𝑁𝑁⁄πœ‡πœ‡, so the R&D model generates growth
endogenously. Moreover, shifting resources to the R&D sector, i.e. lowering 𝑒𝑒 and raising
1 − 𝑒𝑒, raises the economy’s long-run growth rate. This comes with the short-run cost of
lower output of goods when workers are diverted from the goods-producing sector to the
R&D sector but before they have produced any new knowledge.
We have not analysed the determinants of the fraction 1 − 𝑒𝑒 of labour allocated to R&D
using the model but we can think about this as resulting from a comparison of benefits and
costs to firms engaged in R&D. The cost to a firm of having more R&D workers is the wages
they need to be paid. The benefit of having them depends on the value to the firm itself of
the knowledge the R&D workers discover, for example, the value of new patents registered.
This value is affected by the strength of patent protection. Stronger protection of
intellectual property rights raises the private benefit of R&D and should lead to a lower
value of 𝑒𝑒.
77
EC2065 Macroeconomics | Chapter 2: Economic growth
2.9.4 Constant returns or diminishing returns to R&D?
The conclusion that the R&D model is able to generate endogenous growth rests on the
assumption of constant returns in the R&D sector. However, there are reasons to be
sceptical of this. We see that the fraction of resources allocated to R&D in advanced
economies has risen substantially over the course of the 20th century. The model suggests
this should lead to faster rates of economic growth but that has not been observed.
We can allow for increasing or decreasing returns in R&D by making the cost of discoveries
πœ‡πœ‡ depend on the stock of existing knowledge 𝐴𝐴. Increasing returns would mean πœ‡πœ‡(𝐴𝐴) is a
decreasing function of 𝐴𝐴. One justification for this assumption might be that past discoveries
provide the tools and foundation for future discoveries – ‘standing on the shoulders of
giants’. On the other hand, decreasing returns in R&D can be represented by πœ‡πœ‡(𝐴𝐴) being an
increasing function of 𝐴𝐴. This could be because more advanced ideas are inherently harder
to discover – the ‘low-hanging fruit have been picked’.
The standard version of the model with πœ‡πœ‡ being a constant can be seen as assuming these
two forces are approximately in balance. However, the case of πœ‡πœ‡(𝐴𝐴) being an increasing
function of 𝐴𝐴 is more consistent with the observation that 𝑒𝑒 has declined over time in
advanced countries while growth rates (1 − 𝑒𝑒)𝑁𝑁/πœ‡πœ‡(𝐴𝐴) have not increased. This case with
decreasing returns in R&D means that if πœ‡πœ‡(𝐴𝐴) keeps rising, rates of long-run economic
growth will fall to zero.
2.10 International technology transfer
If endogenous growth is coming from the discovery of new ideas, such as with learning-bydoing or as in the R&D model, can those models be applied to understand differences in
income levels across countries? These models do not imply convergence to a steady state,
which seems to suggest they might be consistent with large income differences around the
world through the logic developed in Box 2.5.
However, a key feature of models that generate endogenous growth through the discovery
of knowledge is that ideas are non-rivalrous and this non-rivalrous nature of ideas should
also apply across countries, not only across firms within a country. Therefore, countries
should be able to copy the ideas and knowledge of those at or closer to the frontier of
knowledge. If ideas can be copied at no cost, this would severely limit the ability of
endogenous growth models based on knowledge to explain large cross-country income
differences.
This section explores whether this logic also applies when there is a cost of imitating ideas
discovered in other countries. Take two countries labelled 1 and 2, and assume both have
the same labour force 𝑁𝑁. Suppose country 1 is at the frontier of knowledge, while country 2
lags behind. Using the notation from the R&D model in Section 2.9, we have 𝐴𝐴1 > 𝐴𝐴2 , which
is due to 𝑒𝑒1 < 𝑒𝑒2 since there are no other differences between them. That is, country 1
allocates a greater fraction of its labour force to R&D than country 2. Country 1 innovates as
described by the R&D model with constant returns. Its growth rate of 𝐴𝐴1 and its GDP is
(1 − 𝑒𝑒1 )𝑁𝑁/πœ‡πœ‡π‘–π‘– , where πœ‡πœ‡π‘–π‘– is the cost parameter for innovation in country 1’s R&D sector.
78
EC2065 Macroeconomics | Chapter 2: Economic growth
Country 2 can copy ideas from country 1 at a cost. Some labour input is required to
understand and apply the discoveries of country 1 but less than is needed to ‘reinvent the
wheel’ through independent R&D. We write down a ‘production function’ for ideas newly
imitated in country 2 that is also of the form 𝐴𝐴′2 − 𝐴𝐴2 = (1 − 𝑒𝑒2 )𝑁𝑁/πœ‡πœ‡π‘π‘ , but with πœ‡πœ‡π‘π‘ being
the cost of copying ideas rather than innovating. This cost πœ‡πœ‡π‘π‘ is lower than the cost of
innovation πœ‡πœ‡π‘–π‘– .
While πœ‡πœ‡π‘–π‘– is a constant parameter, reflecting the assumption of constant returns in R&D for
the frontier country, it is reasonable that the cost of imitation πœ‡πœ‡π‘π‘ should depend on the size
of the knowledge gap between countries 1 and 2. Hence, πœ‡πœ‡π‘π‘ is assumed to be a function
πœ‡πœ‡π‘π‘ = 𝑐𝑐(𝐴𝐴1 ⁄𝐴𝐴2 ) of the knowledge ratio 𝐴𝐴1 /𝐴𝐴2 . The cost function is decreasing in the size of
the knowledge gap 𝐴𝐴1 /𝐴𝐴2 because it is easier to copy more basic ideas discovered by
country 1 further in the past. The cost is close to zero when 𝐴𝐴1 /𝐴𝐴2 is extremely large
because when country 2 starts far behind, picking up some simple knowledge from country
1 should be almost free. Finally, the cost approaches the cost of innovation πœ‡πœ‡π‘–π‘– when the
knowledge gap becomes small (𝐴𝐴1 /𝐴𝐴2 is close to 1). This says that cost of quickly imitating
the most recent discoveries of country 1 is close to the cost of doing innovation itself. A cost
function with these features is depicted in Figure 2.22.
Figure 2.22: Imitation cost function
The growth rate of knowledge 𝐴𝐴1 and GDP in country 1 is (1 − 𝑒𝑒1 )𝑁𝑁/πœ‡πœ‡π‘–π‘– , and the growth
rate of knowledge 𝐴𝐴2 and GDP in country 2 is (1 − 𝑒𝑒2 )𝑁𝑁/𝑐𝑐(𝐴𝐴1 ⁄𝐴𝐴2 ). The two growth rates
are plotted in Figure 2.23 as a function of the knowledge ratio 𝐴𝐴1 /𝐴𝐴2 . For country 1, its
growth rate is independent of how far country 2 is behind. For country 2, its growth rate is
faster when it is further behind because it becomes cheaper to imitate, i.e. 𝑐𝑐(𝐴𝐴1 ⁄𝐴𝐴2 ) is
lower. Over time, the ratio 𝐴𝐴1 /𝐴𝐴2 declines to the extent that the growth rate of 𝐴𝐴2 exceeds
the growth rate of 𝐴𝐴1 . We can see from the diagram that there is convergence to a steady
state for the knowledge ratio 𝐴𝐴1 /𝐴𝐴2 where the growth rate lines intersect. Mathematically,
this occurs where:
πœ‡πœ‡π‘π‘ = 𝑐𝑐 οΏ½
(1 − 𝑒𝑒2 )
𝐴𝐴1
οΏ½=
πœ‡πœ‡ < πœ‡πœ‡π‘–π‘–
𝐴𝐴2
(1 − 𝑒𝑒1 ) 𝑖𝑖
79
EC2065 Macroeconomics | Chapter 2: Economic growth
Since 𝐴𝐴1 /𝐴𝐴2 converges to a steady state in the long run, the follower country’s long-run
growth rate equals the growth rate of the frontier country. This means that even if 𝑒𝑒1 and
𝑒𝑒2 are permanently different, the gap between the countries eventually stabilises. The
international technology transfer model thus suggests it is difficult to explain very large
income differences across countries through knowledge gaps because those countries that
have not yet adopted much knowledge from the rest of the world can improve at a
relatively low cost. Note that this does not mean that knowledge gaps will shrink to zero in
the long run. With fewer resources allocated to obtaining knowledge, country 2 cannot
imitate the newest technologies of country 1.
Figure 2.23: The steady-state knowledge gap across countries
If country 2 were to lower 𝑒𝑒2 , allocating more workers to imitating country 1’s ideas, this
will shrink the knowledge gap between the two in the long run but it would not change
country 2’s long-run growth rate (assuming country 2 remains the follower). This reflects
the diminishing returns to imitation because the stock of existing knowledge to copy from
country 1 is necessarily finite.
Box 2.6: How strong should intellectual property rights be?
The strength of intellectual property rights such a patent protection affects incentives
to undertake research and development. If owners of patents can benefit from their
innovations for longer then there is a greater incentive to carry out R&D. However,
while excludability encourages the discovery of new ideas, ideas are fundamentally
non-rivalrous, so putting obstacles in the way of those who can benefit from using
them has a welfare cost. Here we study the international dimension of this issue using
the model of international technology transfer.
Suppose there is greater protection of intellectual property rights across countries’
borders. In the international technology transfer model, the follower country (2) now
faces higher costs of adopting new technologies developed by the innovating country
(1).
80
EC2065 Macroeconomics | Chapter 2: Economic growth
The imitation cost 𝑐𝑐(𝐴𝐴1 ⁄𝐴𝐴2 ) rises for each level of the knowledge ratio 𝐴𝐴1 /𝐴𝐴2 . This
implies the growth rate (1 − 𝑒𝑒2 )𝑁𝑁/𝑐𝑐(𝐴𝐴1 ⁄𝐴𝐴2 ) of 𝐴𝐴2 is lower for each value of 𝐴𝐴1 ⁄𝐴𝐴2 ,
shifting down the growth rate function for country 2 as shown in the left panel of
Figure 2.24.
Firms in the innovating country (1) now gain more from discovering new technologies
and have a greater incentive to do R&D. This increases the fraction of workers
employed in the R&D sector, i.e. 𝑒𝑒1 is lower. The growth rate (1 − 𝑒𝑒1 )𝑁𝑁⁄πœ‡πœ‡π‘–π‘– of 𝐴𝐴1 is
now higher, which shifts up the growth rate line for country 1 in the figure (it is a
horizontal line because the growth rate is independent of the levels of 𝐴𝐴1 and 𝐴𝐴2 ).
Figure 2.24: Effects of stronger intellectual property rights
The right panel of the figure depicts the time paths of knowledge (in logarithms) for the
two countries. The gradient of the path for country 1 becomes steeper as the growth
rate of knowledge 𝐴𝐴1 increases. From the perspective of the follower country 2,
stronger intellectual property rights slow down the adoption of new technologies from
the frontier country, which means 𝐴𝐴2 is below the path it would have followed for
some time after the strengthening of intellectual property rights. This implies its output
per worker grows more slowly for some time.
However, since there is a steady state for relative technology 𝐴𝐴1 /𝐴𝐴2 , albeit at a higher
level than before, eventually 𝐴𝐴2 will rise above the path it would have followed
because of the faster growth in 𝐴𝐴1 now country 1 has a greater incentive to innovate. It
follows that country 2 gains from greater protection of intellectual property rights in
the long run, although loses out during a transitional period (which could last for a
considerable amount of time).
81
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Chapter 3: Aggregate demand
This chapter focuses on the demand side of the economy. We look at the determinants of
expenditure on goods and services by households and firms. These components of
aggregate expenditure, consumption and investment, are the largest components of GDP
broken down by expenditure. An analysis of the effects of the government’s fiscal policies
on demand is deferred until Chapter 4. The links between international trade and aggregate
expenditure are analysed in Chapter 10 but, for now, we continue to consider a closed
economy.
The focus on the demand side means we will give more emphasis to fluctuations in the
economy compared to the long-run trends that were the subject of the previous two
chapters. We will look at how macroeconomic data can be detrended to zoom in on the
business-cycle, i.e. the fluctuations of macroeconomic variables around their long-run
trends.
After having studied the determinants of consumption and investment, we will see how to
integrate our analysis of the demand side of the economy with the supply side from earlier
chapters. This entails setting up a dynamic macroeconomic model where demand and
supply factors both matter in understanding changes in GDP.
Essential reading
•
Williamson, Chapters 9 and 11.
3.1 Detrending macroeconomic data
Many macroeconomic variables are rising over time for reasons discussed in the analysis of
economic growth. These long-run trends dominate the data but sometimes we would like to
emphasise the fluctuations seen in data. This is done by detrending time series data.
First, a trend line is estimated. These data points are typically plotted as logarithms so that a
constant growth rate would correspond to the straight line plotted against time.
The estimated trend is then removed from the data by subtracting the height of the trend
line from each data point. This leaves a time series of deviations from trend. If the data were
in logarithms, these deviations from trend can be interpreted as percentage deviations. The
deviations from trend are then taken as a measure of the business-cycle component of a
variable. A stylised representation of this way of measuring business cycles is shown in
Figure 3.1.
3.1.1 Business cycles
Applying this methodology to real GDP, we identify business cycles with fluctuations around
the trend in real GDP. Persistent positive deviations from trend are referred to as ‘booms’
82
EC2065 Macroeconomics | Chapter 3: Aggregate demand
and persistent negative deviations from trend are ‘recessions’. The turning points of the
business cycle are the peaks and troughs in the deviations of real GDP from its trend. Note
that the definition of a recession here differs from the conventional two consecutive
quarters of decline in real GDP and that an economy can still be in recession if it is still
below its trend while growing.
Figure 3.1: Stylised representation of business cycles
Some terms are widely used to describe the patterns seen in diagrams like Figure 3.1. The
amplitude of the fluctuations in a variable refers to how far above or below trend a variable
goes. The persistence of the fluctuations in a variable refers to how long it typically takes for
the variable to return to its trend line. The frequency of fluctuations refers to how often the
variable switches from being above trend to below trend, or below to above.
3.1.2 Detrending
How is the detrending done to construct a version of the figure with real data? It is not
necessary to know the technical details in this course but it essentially involves putting a line
of best fit through the data points plotted as a time series. If the long-run percentage
growth rate is stable, the trend line for data plotted in logarithms can be estimated as the
best-fitting straight line. In practice, long-run growth rates are not entirely stable, so the
best trend line is not a completely straight line. Researchers often use the Hodrick-Prescott
(HP) filter, which is essentially a trend line that can change ‘smoothly’ over time.
Figure 3.2 shows the HP-filter trend line for the logarithm of US real GDP, plotted as a
quarterly time series from 1947 to 2021. The estimated HP-filter trend line is not completely
straight. For instance, the gradient is steeper in the 1960s and 1990s than the 1970s,
reflecting the ‘productivity slowdown’ experienced by the US economy in the 1970s. We see
that the trend dominates the data, so that even events such the recessions after the 2008
financial crisis and during the COVID pandemic appear as relatively small dips compared to
the secular progress in the US economy over many decades.
83
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Figure 3.3 shows the percentage deviations from US real GDP from its HP-filter trend line,
and it is calculated as the difference between actual and trend log real GDP multiplied by
100. This is our measure of the business cycle in the USA. The business cycle is typically
ranges from +4% to −4% of trend real GDP, although the recession owing to COVID is
much larger. The period from the mid-1980s up to the 2008 financial crisis is known as the
‘great moderation’ owing to the small ±2% range of fluctuations in real GDP.
Figure 3.2: HP filter applied to US real GDP
Figure 3.3: US business cycles as measured by detrended real GDP
3.1.3 Business-cycle stylised facts
After detrending real GDP and other macroeconomic variables, we document some typical
patterns seen in the fluctuations. While each business-cycle episode has unique features,
nonetheless, we observe some relationships across different variables that generally hold
true in different booms and recessions. We refer to these patterns as ‘stylised facts’.
For a particular macroeconomic variable π‘₯π‘₯, we document:
84
EC2065 Macroeconomics | Chapter 3: Aggregate demand
•
•
•
Co-movement: the correlation of π‘₯π‘₯ with real GDP
Relative volatility: the size of fluctuations in π‘₯π‘₯ relative to fluctuations in real GDP
Leads or lags: the dynamics of π‘₯π‘₯ relative to real GDP.
The pattern of co-movement between a variable π‘₯π‘₯ and real GDP can be determined either
by plotting time series of percentage deviations from trend of both on the same axis, as is
done in Figure 3.4, or using a scatterplot of the deviations from trend in Figure 3.5. The
variable π‘₯π‘₯ is said to be ‘procyclical’ if π‘₯π‘₯ and real GDP have peaks and troughs at similar
times, or if the scatterplot reveals a positive relationship. π‘₯π‘₯ is said to be ‘countercyclical’ if
peaks of π‘₯π‘₯ are associated with troughs of real GDP and vice versa, or if the scatterplot
relationship is negative. A variable is said to be ‘acyclical’ if it displays neither a procyclical
nor countercyclical pattern.
Figure 3.4: Co-movement in data plotted over time
Figure 3.5: Co-movement shown in scatterplots
The relative volatility of a variable π‘₯π‘₯ can be judged by plotting a time series of its percentage
deviations from trend against those of real GDP. If the amplitude of the fluctuations in π‘₯π‘₯ is
typically larger than those of real GDP then π‘₯π‘₯ has a higher relative volatility. If π‘₯π‘₯ has smaller85
EC2065 Macroeconomics | Chapter 3: Aggregate demand
amplitude fluctuations than real GDP, as shown for example in Figure 3.6, then π‘₯π‘₯ has a
lower relative volatility.
Finally, the dynamic relationship between π‘₯π‘₯ and real GDP can be described in terms of leads
or lags. If the fluctuations in π‘₯π‘₯ anticipate those of real GDP that follow then π‘₯π‘₯ is said to be a
‘leading’ variable. On the other hand, if fluctuations in π‘₯π‘₯ follow those of real GDP with some
delay than π‘₯π‘₯ is said to be ‘lagging’. These two cases are illustrated in Figure 3.7. If π‘₯π‘₯ neither
leads nor lags real GDP then it is said to be ‘coincident’.
Figure 3.6: Relative volatility
Figure 3.7: Leading and lagging variables
3.2 Consumption
Consumption refers to purchases of final goods and services by households. Consumption is
the ultimate purpose of all economic activity and a key determinant of households’ welfare.
We begin by documenting some simple facts about consumption. In advanced economies,
consumption expenditure is typically around two-thirds of GDP. This makes it the largest
component of aggregate expenditure. While it makes up a large fraction of GDP,
86
EC2065 Macroeconomics | Chapter 3: Aggregate demand
consumption is generally less volatile than GDP. This observation is important in motivating
the theories of consumption we will study.
Figure 3.8 shows detrended real consumption expenditure in the United States alongside
detrended real GDP for the period 1947–2021. The peaks and troughs of consumption
closely align with those of real GDP, so consumption is said to be a procyclical and
coincident variable because no leads or lags can be discerned relative to GDP. The most
notable observation is that the absolute value of the peaks and troughs of consumption are
almost always smaller than those of GDP. Consumption therefore has a lower relative
volatility compared to GDP.
Figure 3.8: Empirical evidence on consumption
We now turn to explaining the determinants of households’ consumption decisions. While it
is obvious that a household’s income puts limits on its consumption, note that households
can use saving or borrowing to transfer purchasing power from one point in time to
another. Hence, it is not necessarily a household’s current income that constrains its current
consumption but a notion of total income over time. Income from the past that has been
saved can be used to finance current consumption, as can borrowing that could be repaid
using future income the household anticipates receiving.
Moreover, a key idea in studying consumption is that households are forward-looking and
care about consumption in the future. The central theory of consumption we will study is
based on the idea that households have a desire for ‘consumption smoothing’, i.e. avoiding
fluctuations in how much goods and services they purchase. To do this, the theory supposes
households make use of saving or borrowing to smooth out income fluctuations.
However, as we will see, it may not always be possible to achieve this consumption
smoothing. In particular, borrowing against the expectation of receiving income in the
future might be very costly or difficult. This is a particular example of what we will refer to
as ‘credit-market imperfections’ and we will see how they place limits on the extent to
which households are able to smooth consumption even if they want to.
87
EC2065 Macroeconomics | Chapter 3: Aggregate demand
3.3 A two-period consumption model
The basic principles underlying what are known as dynamic theories of consumption can be
illustrated using a model with two periods 1 and 2. Period 1 refers to the present and period
2 to the future. A more realistic version of this framework might have a separate period for
each year but ideas can be illustrated with just two periods.
The key features of the model are that households make plans for the future with a
preference for smoothing out fluctuations in consumption expenditure. In doing so, they
can use saving or borrowing to transfer purchasing power across time subject to an
intertemporal budget constraint.
3.3.1 Preferences
In the model, households choose consumption plans (𝑐𝑐, 𝑐𝑐 ′ ), where 𝑐𝑐 is current consumption
and 𝑐𝑐′ is future consumption. Preferences over consumption plans can be represented in a
diagram with 𝑐𝑐 and 𝑐𝑐′ on the axes using indifference curves as shown in Figure 3.9.
Indifference curves are downward sloping because more consumption is preferred to less at
each date. Importantly, indifference curves are drawn as convex to origin. This indicates a
dislike of extreme consumption plans, for example, the household does not like to have high
𝑐𝑐 but 𝑐𝑐′ being very low and would prefer a greater balance between 𝑐𝑐 and 𝑐𝑐′. Similarly, the
household would prefer 𝑐𝑐 and 𝑐𝑐′ closer together than very low 𝑐𝑐 but high 𝑐𝑐′. This is how we
represent a preference for consumption smoothing.
The absolute value of the gradient of the indifference curve at a point is known as the
marginal rate of substitution between 𝑐𝑐 and 𝑐𝑐′, denoted by 𝑀𝑀𝑀𝑀𝑆𝑆𝑐𝑐,𝑐𝑐 ′ . Since the indifference
curve gradient indicates how much extra future consumption 𝑐𝑐′ the household would
require to be as well off after losing a unit of current consumption 𝑐𝑐, the marginal rate of
substitution indicates the relative value households put on a unit of current consumption
compared to future consumption. The convex shape of the indifference curves is equivalent
to assuming the marginal rate of substitution is diminishing as current consumption 𝑐𝑐 rises.
Another assumption on preferences is that both 𝑐𝑐 and 𝑐𝑐′ are normal goods. A normal good
is a good the household wants more of when better off. Geometrically, this assumption
means that if points on different indifference curves with the same marginal rate of
substitution were joined up then they would trace out an upward sloping line. This
assumption captures the idea that if more consumption is affordable, the household would
like to spread out the extra expenditure over time.
88
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Figure 3.9: Household indifference curves
3.3.2 Budget constraint
The dynamic consumption model works by putting together the preferences described
above with a budget constraint that spans the two periods. The link between the two
periods comes from saving or borrowing decisions.
Suppose the household receives real income 𝑦𝑦 in the current period and 𝑦𝑦′ in the future
period. These incomes are not explained by the model: consumption is endogenous but
income is exogenous. In Section 3.11, we will see how a larger model can explain
consumption and aggregate income simultaneously. Income here should be interpreted as
non-financial income. Wages are included but not any interest income, which will be
accounted for separately.
The incomes here are before tax. The government collects tax revenue 𝑑𝑑 and 𝑑𝑑′ from the
household in the two periods, so disposable incomes in the two periods are 𝑦𝑦 − 𝑑𝑑 and 𝑦𝑦 ′ −
𝑑𝑑′. Here, these taxes are assumed to have a ‘lump sum’ form: the amount paid to the
government does not depend on any decisions the household makes, for example, a poll
tax. More realistically, the government typically sets tax rates on earned income and
consumption expenditure, and we will see examples of what difference this makes. For now,
maintain the assumption of lump-sum taxes.
Saving 𝑠𝑠 done by the household refers to current disposable income that is not currently
consumed, that is, 𝑠𝑠 = 𝑦𝑦 − 𝑑𝑑 − 𝑐𝑐. Saving can put in a bank account to earn interest or used
to purchase financial assets. For now, suppose savings are held in a bank account or as
bonds that pay a known real rate of interest π‘Ÿπ‘Ÿ. This real interest rate adjusts the payment of
interest in terms of money for inflation that is expected to occur between the current and
future periods. The distinction between different types of interest rate is analysed further in
Chapter 4. It is possible for real interest rates to be negative, so it is not necessarily assumed
that π‘Ÿπ‘Ÿ > 0.
Saving, as defined, can be either positive or negative. Negative saving 𝑠𝑠 < 0 represents
borrowing, where consumption expenditure exceeds disposable income in the current
89
EC2065 Macroeconomics | Chapter 3: Aggregate demand
period. In cases where the household is a borrower, interest has to be paid on debts.
Assume for now that the household can borrow at the same interest rate π‘Ÿπ‘Ÿ that savers
receive. Later financial-market imperfections will explain why borrowers face higher interest
rates than savers in practice.
In the simple two-period model, we suppose that the household begins from a blank slate in
terms of not having any initial assets or debts at the beginning of the current period. Given
saving 𝑠𝑠, the household has financial wealth (1 + π‘Ÿπ‘Ÿ)𝑠𝑠 in the future period, which includes
the saving 𝑠𝑠 plus real interest income π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ. Note that interest income is counted separately
from future non-financial income 𝑦𝑦′.
In the future period, the household has disposable income 𝑦𝑦 ′ − 𝑑𝑑′ and net financial wealth
(1 + π‘Ÿπ‘Ÿ)𝑠𝑠. Net wealth is negative for borrowers. Taking disposable income and financial
wealth together, the household can afford future consumption:
𝑐𝑐 ′ = 𝑦𝑦 ′ − 𝑑𝑑 ′ + (1 + π‘Ÿπ‘Ÿ)𝑠𝑠
Given the nature of the two-period model, the future period comprises the whole future the
household is planning for, so there is no reason to save for anything after the future period.
Hence, 𝑐𝑐’ is given by the equation above conditional on the choice of current saving 𝑠𝑠. As
mentioned, saving 𝑠𝑠 can be negative and the only limit imposed on borrowing here is that
debts including interest do not exceed the ability to repay as measured by future disposable
income 𝑦𝑦 ′ − 𝑑𝑑 ′ . It may be reasonable to consider stricter limits on borrowing with financialmarket imperfections and examples of these are considered in Chapter 4.
Dividing both sides of the equation for 𝑐𝑐′ by 1 + π‘Ÿπ‘Ÿ implies
𝑦𝑦 ′ − 𝑑𝑑′
𝑐𝑐 ′
=
+ 𝑠𝑠
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
and substituting the definition of saving 𝑠𝑠 = 𝑦𝑦 − 𝑑𝑑 − 𝑐𝑐 and grouping consumption and
disposable income terms together:
𝑐𝑐 ′
𝑦𝑦 ′ − 𝑑𝑑′
𝑐𝑐 +
= 𝑦𝑦 − 𝑑𝑑 +
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
Consumption plans (𝑐𝑐, 𝑐𝑐 ′ ) must satisfy this equation to be affordable to the household.
Geometrically, the budget constraint is a straight line with gradient −(1 + π‘Ÿπ‘Ÿ) passing
through the ‘endowment’ point (𝑦𝑦 − 𝑑𝑑, 𝑦𝑦 ′ − 𝑑𝑑 ′ ) as illustrated in Figure 3.10.
90
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Figure 3.10: Lifetime budget constraint
The household can always choose neither to be a saver nor a borrower, meaning that 𝑐𝑐 =
𝑦𝑦 − 𝑑𝑑 and 𝑐𝑐 ′ = 𝑦𝑦 ′ − 𝑑𝑑′, so disposable income is simply consumed in both time periods. This
means choosing a consumption plan at the endowment point in the diagram (labelled E).
Choosing a consumption plan to the left of the endowment point means choosing to be a
saver (𝑠𝑠 > 0), while choosing a plan to the right of the endowment point means choosing to
be a borrower (𝑠𝑠 < 0).
For each unit less of income consumed in the current period, an extra unit can be saved,
which means an extra amount 1 + π‘Ÿπ‘Ÿ of financial wealth in the future and thus an extra
amount 1 + π‘Ÿπ‘Ÿ of future consumption is affordable. This is why the budget constraint has
gradient −(1 + π‘Ÿπ‘Ÿ). In the absence of financial-market imperfections, the budget line is a
straight line (it has the same gradient in the saving and borrowing regions) and extends all
the way down to the horizontal axis.
The budget constraint is described as a ‘lifetime budget constraint’ because it includes
consumption and income in both periods. However, 𝑐𝑐 and 𝑐𝑐′ and 𝑦𝑦 − 𝑑𝑑 and 𝑦𝑦 ′ − 𝑑𝑑′ cannot
simply be added together because they occur in different periods. Instead, present
discounted values (PDVs) of future consumption 𝑐𝑐 ′ /(1 + π‘Ÿπ‘Ÿ) and future income (𝑦𝑦 ′ −
𝑑𝑑 ′ )/(1 + π‘Ÿπ‘Ÿ) are added to current 𝑐𝑐 and 𝑦𝑦 − 𝑑𝑑. The present value of a future amount 1 is
equal to 1/(1 + π‘Ÿπ‘Ÿ) because if 1/(1 + π‘Ÿπ‘Ÿ) were saved in the current period, after including
interest, it would be worth (1 + π‘Ÿπ‘Ÿ)/(1 + π‘Ÿπ‘Ÿ) = 1 in the future period. The lifetime budget
constraint shows that the present discounted value of all consumption across the two
periods is limited by the present value of all current and future disposable income.
The present discounted value of all income after tax can be interpreted as ‘lifetime’ or
‘human’ wealth, which is denoted by β„Ž:
𝑦𝑦 ′ − 𝑑𝑑 ′
β„Ž = 𝑦𝑦 − 𝑑𝑑 +
1 + π‘Ÿπ‘Ÿ
This is what an asset making payments equal to disposable income in the two periods would
be worth to the household. If the household had any initial financial assets in the current
period, the value of these would be added to β„Ž to obtain total wealth in all forms.
91
EC2065 Macroeconomics | Chapter 3: Aggregate demand
3.3.3 Choice of an optimal consumption plan
Having described households’ preferences and constraints, we now turn to analysing the
optimal choice of consumption plan (𝑐𝑐, 𝑐𝑐 ′ ). Analogous to similar constrained maximisation
problems in microeconomics, the optimal consumption plan is found on the highest
indifference curve that can be reached from the lifetime budget constraint.
The optimal plan is at the tangency point between an indifference curve and the lifetime
budget constraint, i.e. where an indifference curve has the same gradient as that of the
budget constraint, which is −(1 + π‘Ÿπ‘Ÿ) all along the budget line. The indifference curve
gradient is −𝑀𝑀𝑀𝑀𝑆𝑆𝑐𝑐,𝑐𝑐′ , where 𝑀𝑀𝑀𝑀𝑆𝑆𝑐𝑐,𝑐𝑐′ is the marginal rate of substitution between 𝑐𝑐 and 𝑐𝑐′.
Hence, the optimal plan is the point on the budget constraint where 𝑀𝑀𝑀𝑀𝑆𝑆𝑐𝑐,𝑐𝑐′ = 1 + π‘Ÿπ‘Ÿ as
shown in Figure 3.11.
Figure 3.11: Optimal consumption plan
Intuitively, the budget constraint gradient 1 + π‘Ÿπ‘Ÿ (in absolute value) represents the market
price of current consumption in terms of future consumption, while the indifference curve
gradient represents how much extra 𝑐𝑐′ is needed to compensate the household for the loss
of one unit of 𝑐𝑐. The tangency point is thus where the subjective value the household puts
on current consumption relative to future consumption is equal to the relative cost of
current consumption given the market interest rate π‘Ÿπ‘Ÿ.
92
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Box 3.1: The consumption puzzle
How does the empirical evidence on consumption and income compare to the predictions
of the model? We can look at two types of data:
•
•
Cross-sectional data: observations of consumption and income for many individual
households in a particular year
Time-series data: observations of aggregate consumption and income for a
number of years in a particular economy.
Interestingly, the relationship between consumption and income seen in these two types
of data is not the same. This apparent contradiction is known as the ‘consumption puzzle’.
Figure 3.12 shows the typical shape of a line of best fit drawn through data on
consumption and income in a cross-section of households. There is evidence of
consumption smoothing in that different levels of income are typically associated with
smaller differences in consumption. In other words, there are high levels of saving for
those households with high incomes, and low or negative saving levels for those with low
incomes.
Figure 3.12: Consumption and income in cross-sectional data
A relationship between consumption and current income consistent with the crosssectional empirical evidence is sometimes directly assumed in macroeconomics. This is
the Keynesian consumption function relating aggregate consumption 𝐢𝐢 to aggregate
disposable income π‘Œπ‘Œ − 𝑇𝑇, where 𝑇𝑇 denotes taxes:
𝐢𝐢 = π‘Žπ‘Ž + 𝑏𝑏(π‘Œπ‘Œ − 𝑇𝑇)
The terms a and b are parameters with a>0 and 0<b<1. The marginal propensity to
consume ∂C/∂Y is the constant b. The average propensity to consume C/Y declines as
income rises because a is positive, that is to say, a higher fraction of income is saved when
income is high. Although this equation is consistent with the cross-sectional evidence, it
actually posits that the same relationship holds between aggregate consumption and
income over time. As we will see, this is not the case. Moreover, the equation is not
derived from a model of rational behaviour by households.
93
EC2065 Macroeconomics | Chapter 3: Aggregate demand
To understand the consumption-income relationship seen in the cross-sectional data using
the dynamic consumption model, let us think first about the sources of the variation seen
in income levels across households. Some, but not all, of this variation is due to:
•
•
Stage of life cycle: the young and old have lower incomes on average compared to
the middle-aged
Temporary shocks: some people experience periods of unemployment, others earn
large bonuses/overtime payments.
First, there are large predictable differences between current income and future income
because households are of different ages. However, the preference for consumption
smoothing means that the optimal consumption plan has 𝑐𝑐 and 𝑐𝑐′ closer together than are
𝑦𝑦 − 𝑑𝑑 and 𝑦𝑦 ′ − 𝑑𝑑′. This is consistent with the cross-sectional empirical evidence.
The left panel of Figure 3.13 illustrates the consumption choice diagram for a young person
with a low current income but expectations of higher income in the future, so the
endowment point 𝐸𝐸 is at a relatively extreme north-west position in the diagram. The right
panel shows a middle-aged person with a high current income but expectations of lower
income in the future owing to retirement, so the endowment point is at a relatively
extreme south-east position. Households with quite different levels of current income
could choose very similar levels of consumption.
Figure 3.13: Life-cycle differences in income
If the life cycle were the only reason for income differences across households then the
predicted consumption-income relationship would be even weaker than found in the data.
However, there are plenty of income differences that do not average out over the life
cycle, for example, people having different careers.
A second source of income variation comes from temporary shocks, such as receiving a
bonus, or a period of unemployment. We can show the predicted effects of such shocks in
the dynamic consumption model by shifting the endowment point 𝐸𝐸 with the change in
income. A temporary shock affects only current income 𝑦𝑦, leaving 𝑦𝑦 ′ unchanged, which
moves the endowment point horizontally. A movement of 𝐸𝐸 causes a parallel shift of the
lifetime budget constraint. Taking the case of a temporary rise in income 𝑦𝑦, Figure 3.14
shows the endowment point move to the right from 𝐸𝐸0 to 𝐸𝐸1 . The budget constraint shifts
to the right, and both 𝑐𝑐 and 𝑐𝑐′ rise in response because they are normal goods. The desire
for consumption smoothing reflected in the rise94of both 𝑐𝑐 and 𝑐𝑐′ means that 𝑐𝑐 rises by less
than 𝑦𝑦. This is also consistent with cross-sectional data on consumption and income.
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Figure 3.14: Temporary and permanent shocks to income
Our analysis is different for income changes expected to be permanent. With a
permanent rise in income, both 𝑦𝑦 and 𝑦𝑦 ′ increase, and the endowment point moves
horizontally and vertically. In the figure, this is shown as 𝐸𝐸0 to 𝐸𝐸2 . Unlike with a
temporary income shock, 𝑐𝑐 and 𝑦𝑦 now adjust by similar amounts, which could be
exactly the same.
Turning now to time series data on aggregate consumption and income, Figure 3.15
shows the typical shape of a line of best fit drawn through the time series data in
comparison to the cross-sectional data relationship already discussed. In contrast to
the cross-sectional data, time-series data suggest that aggregate consumption is
approximately proportional to aggregate income. Over long periods, the fractions of
income consumed and saved are fairly stable.
The relationship between consumption and income in time-series data is quite
different from what is found in cross-sectional data. The apparent inconsistency
between these findings is known as the ‘Kuznets consumption puzzle’. Simply
assuming a Keynesian consumption function that matches the cross-sectional data for
a model of aggregate consumption and income is therefore inconsistent with what we
know from time series data. However, the puzzle can be resolved using the dynamic
model of consumption.
The crucial difference between time series and cross-sectional data is the source of the
variation in income levels. In time series data, most of the variation comes from longrun trends in income that are explained through the analysis of long-run economic
growth in 0. Increases in income over time are expected to be permanent because
long-run growth is not expected to be reversed. In contrast, cross-sectional data
includes many changes in income that are only temporary.
95
EC2065 Macroeconomics | Chapter 3: Aggregate demand
3.4 Bonds, yields, and interest rates
For households to save, they must hold an asset that transfers purchasing power to the
future. For example, deposits in bank account, bonds, or shares. The real return on an asset
is how much purchasing power over goods grows by holding that asset over time.
The simplest case is where savings are held as bank deposits in an account that pays
interest. The real return on such savings deposits is the interest rate adjusted for inflation.
Households can also save by holding bonds. Bonds are a particular type of financial asset
that promise a sequence of fixed future payments. For now, assume these payments are
fixed in real terms, rather than in units of money. This would correspond to what is called an
inflation-indexed bond.
The yield or interest rate on a bond (these terms are interchangeable) is the discount rate
that makes the present value of all the bond’s future fixed payments equal to the actual
market price it currently trades at. Since the bond payments are fixed, high bond prices are
equivalent to low yields/interest rates and low bond prices are equivalent to high
yields/interest rates.
The simplest form of bond is a discount bond that makes only one payment (its face value)
at the maturity date of the bond. If a bond is held to maturity, the real return received is
equal to the bond’s real yield.
Consider a discount bond maturing in the next period and assume the face value is worth
one unit of goods. If the bond’s price currently is worth 𝑝𝑝 units of goods, then the real yield
or real interest rate π‘Ÿπ‘Ÿ on the bond is the discount rate that makes the present value of 1
equal to the bond price 𝑝𝑝:
𝑝𝑝 =
1
1 + π‘Ÿπ‘Ÿ
If the bond has no risk of default, an investment of 𝑝𝑝 gives a payoff of 1 in the future. An
investor therefore makes a profit of 1 − 𝑝𝑝 on an investment of 𝑝𝑝, so the percentage real
return is (1 − 𝑝𝑝)/𝑝𝑝 = π‘Ÿπ‘Ÿ.
3.5 Interest rates and consumption
The real interest rate π‘Ÿπ‘Ÿ matters for households’ consumption plans because it determines
the market price of current consumption relative to future consumption. If a household
buys 1 unit more of 𝑐𝑐, saving 𝑠𝑠 is reduced by 1, which means future financial wealth
(1 + π‘Ÿπ‘Ÿ)𝑠𝑠 is lower by 1 + π‘Ÿπ‘Ÿ, and so future consumption must fall by 1 + π‘Ÿπ‘Ÿ, all else equal.
Hence, the price of current consumption in terms of future consumption is 1 + π‘Ÿπ‘Ÿ. This
relative price appears in the consumption choice diagram as the gradient of the lifetime
budget constraint in absolute value.
Changes in the real interest rate π‘Ÿπ‘Ÿ affect the gradient of the lifetime budget constraint. For
example, higher π‘Ÿπ‘Ÿ makes the budget constraint steeper as shown in Figure 3.16. When its
96
EC2065 Macroeconomics | Chapter 3: Aggregate demand
gradient changes, the lifetime budget constraint pivots around the endowment point 𝐸𝐸,
which is always affordable.
To understand how the household’s optimal consumption plan (𝑐𝑐 ∗ , 𝑐𝑐 ′∗ ) adjusts, we break
down the response into income and substitution effects. The substitution effect is found by
looking at the response to the change in the budget constraint gradient with a hypothetical
income transfer that shifts the position of the budget constraint so that the original
indifference curve can be reached. This leaves the household no better or worse off. The
substitution effect obtain in this way captures the pure incentive effect of changes in
interest rates, controlling for whether this makes the household better off or worse off.
The income effect is the response to removing the hypothetical income transfer used to
derive the substitution effect. This holds the gradient of the budget constraint constant, so
it is a response to a parallel shift of the budget constraint. The income effect thus captures
the household’s response to being made better off or worse off by the interest rate change.
Figure 3.15: The lifetime budget constraint with a higher interest rate
Figure 3.17 analyses the effects of higher interest rates on a saver. A saver is a household
that initially chooses a consumption plan to the left of the endowment point 𝐸𝐸. The increase
in π‘Ÿπ‘Ÿ pivots the budget constraint around 𝐸𝐸, making it steeper. The household can now reach
a higher indifference curve, so savers are made better off by higher interest rates because
they earn a higher return on their financial wealth. The substitution effect is found by
making a parallel downward shift of the true budget constraint to where it is tangent to the
original indifference curve. This leaves the household no better off. Even so, the household
still has an incentive to shift towards less consumption in the current period and more in the
future because the relative price of current consumption has risen. This substitution effect is
labelled 𝑆𝑆𝑆𝑆 in the diagram. It is movement north-west to where the indifference curve is
steeper.
97
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Figure 3.16: Effects of higher interest rates on a saver
The income effect is found by removing the parallel downward shift of the true budget
constraint used to derive the substitution effect. Thus, the income effect is the response to
a parallel upward shift of the budget constraint. Since both current and future consumption
are normal goods, the income effect leads the saver to increase both 𝑐𝑐 and 𝑐𝑐′. Intuitively,
the household wants to spread out over time its gains from higher interest rates. The
income effect is labelled 𝐼𝐼𝐼𝐼 in the diagram and is a movement in a north-east direction.
Figure 3.18 performs the same exercise for a borrower. This is a household that initially
chooses a consumption plan to the right of the endowment point. When π‘Ÿπ‘Ÿ rises, the budget
constraint becomes steeper, pivoting around 𝐸𝐸. The initial consumption plan is no longer
affordable, so the borrower is made worse off when access to credit becomes more
expensive. While the hypothetical income transfer used for the substitution effect (𝑆𝑆𝑆𝑆) now
shifts the true budget constraint upwards, the analysis is qualitatively the same as for the
saver. The 𝑆𝑆𝑆𝑆 is a north-west movement in the diagram with lower 𝑐𝑐 and higher 𝑐𝑐′. On the
other hand, the income effect (𝐼𝐼𝐼𝐼) is the response to a parallel downward shift of the
budget constraint. The 𝐼𝐼𝐼𝐼 is therefore a south-west movement that reduces both 𝑐𝑐 and 𝑐𝑐′.
98
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Figure 3.17: Effects of higher interest rates on a borrower
For savers, the income and substitution effects of π‘Ÿπ‘Ÿ on current consumption 𝑐𝑐 go in opposite
directions so the overall response is ambiguous. The diagram shows the special case where
𝑆𝑆𝑆𝑆 and 𝐼𝐼𝐼𝐼 exactly cancel out. For future consumption 𝑐𝑐′, 𝑆𝑆𝑆𝑆 and 𝐼𝐼𝐼𝐼 are reinforcing, so 𝑐𝑐′
rises unambiguously. For borrowers, the 𝑆𝑆𝑆𝑆 and 𝐼𝐼𝐼𝐼 are reinforcing for 𝑐𝑐, which
unambiguously declines, while 𝑆𝑆𝑆𝑆 and 𝐼𝐼𝐼𝐼 are conflicting for 𝑐𝑐′ and the overall response is
ambiguous. The diagram shows the special case where 𝑆𝑆𝑆𝑆 and 𝐼𝐼𝐼𝐼 exactly cancel out for 𝑐𝑐′.
99
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Box 3.2: Durables, non-durables, and services
The business-cycle properties of consumption documented in Section 3.2 are obtained
for overall consumption expenditure. However, consumption can be broken down into
durables, non-durables and services components. Durable goods are products that can
be used for a long period (more than three years) after being purchased, for example,
a car or a refrigerator. Non-durables are physical goods that are used up or worn out
sooner after being purchased and hence, not classified as durable, for example, food
or clothing. Services are intangibles and the benefit of purchasing them is typically
obtained in a short period after a purchase, for example, a haircut.
Figure 3.19 shows the percentage deviations from trend of consumption of durables
alongside those of real GDP. An important aspect of the behaviour of aggregate
consumption is that it is less volatile than aggregate income, which motivated our
emphasis on consumption smoothing in the dynamic consumption choice model.
However, durables consumption is much more volatile than income, so we do not
observe smoothing of households’ expenditure on durable goods. This is not
surprising, though, because durable goods do not need to be purchased all the time to
enjoy the benefit of using them.
Figure 3.18: Consumption of durable goods
In justifying the desire of households to smooth consumption, we have implicitly
assumed that the benefit of current consumption 𝑐𝑐 comes only in the current period,
which explained why households do not want to have too great an imbalance between
𝑐𝑐 and 𝑐𝑐′. The assumptions of the consumption choice model therefore make more
sense for consumption of non-durables and services. Figures 3.20 and 3.21 show the
percentage deviations from trend of non-durables and services alongside those of real
GDP. Here, we see that these categories of consumption are indeed significantly less
volatile than real GDP. Durables consumption is better understood as a form of
investment, which we will analyse in Section 3.8.
100
EC2065 Macroeconomics | Chapter 3: Aggregate demand
3.6 Consumption smoothing in the aggregate
The desire for consumption smoothing is a key feature of the dynamic model of
consumption in Section 3.3. We have seen that it helps to explain patterns seen in crosssectional data and resolve the ‘consumption puzzle’. But in time series data, consumption
and income are approximately proportional over long periods of time, so it appears at first
glance that consumption smoothing does not have much relevance when analysing
aggregate consumption. However, by removing trends from the data to focus on
fluctuations, we have seen in Figure 3.8 that consumption is less volatile than income, a
pattern that is more pronounced when focusing on non-durables and services.
Figure 3.19: Consumption of non-durables
Figure 3.20: Consumption of services
However, while aggregate consumption is less volatile than income, it is not much less
volatile. Although we cannot investigate this issue quantitatively in the simple two-period
model, for very short-term fluctuations in income relative to the whole future over which
101
EC2065 Macroeconomics | Chapter 3: Aggregate demand
households are making plans, we would expect households to smooth consumption to a
much greater extent.
Why then is there not more smoothing of aggregate consumption? We will investigate two
explanations. In Chapter 4, we will look at credit-market imperfections such as borrowing
constraints. If income were to fall, some households might want to borrow against future
income to smooth consumption. But if they do not have access to credit, their current
consumption will fall by more than desired. Hence, it may be that households would like to
smooth consumption to a greater extent, but are prevented from doing so by credit-market
imperfections.
Another explanation is that we know changes in interest rates create an incentive to deviate
from a plan where current and future consumption are close together. This is because the
substitution effect of interest rates moves current and future consumption in different
directions. This matters for the relationship between aggregate consumption and income in
time series data as interest rates are also changing over time. However, in cross-sectional
data where all households are observed at the same point in time, interest rates are not
changing and so are of less concern in understanding the relationship between consumption
and income.
Not only can interest rates influence aggregate consumption, there are reasons to believe
that interest rate changes might be systematically linked to changes in aggregate income.
This is because the equilibrium level of income must be consistent with consumption
choices and other demands for goods in aggregate.
To understand this point, consider the following example. The economy is comprised of 𝑁𝑁
representative households each with exogenous income 𝑦𝑦. Since income is treated as
exogenous, this implicitly assumes no adjustment of households’ supply of labour.
Furthermore, we ignore the use of capital goods in production, which also means
investment is zero (𝐼𝐼 = 0). Real GDP in the economy is π‘Œπ‘Œ = 𝑁𝑁𝑁𝑁, which is exogenous here.
The government’s fiscal policy is an exogenous level of expenditure is 𝐺𝐺.
Households choose a consumption plan (𝑐𝑐, 𝑐𝑐 ′ ) as in the two-period model. Given 𝑐𝑐,
aggregate consumption is 𝐢𝐢 = 𝑁𝑁𝑁𝑁 because all 𝑁𝑁 households make the same choices given
that they have the same preferences and face the same income and interest rates.
Suppose the economy is closed, so market clearing in the goods market requires π‘Œπ‘Œ = 𝐢𝐢 + 𝐺𝐺.
The aggregate supply of goods, as given by the economy’s real GDP, must equal aggregate
expenditure 𝐢𝐢 + 𝐺𝐺 (there is no investment or net exports). With π‘Œπ‘Œ and 𝐺𝐺 being exogenous,
equilibrium is reached through adjustment of the real interest rate π‘Ÿπ‘Ÿ. The equilibrium real
interest rate π‘Ÿπ‘Ÿ ∗ ensures that the consumption plan chosen by the representative household
is 𝑐𝑐 ∗ = (π‘Œπ‘Œ − 𝐺𝐺)⁄𝑁𝑁 and 𝑐𝑐 ′∗ = (π‘Œπ‘Œ ′ − 𝐺𝐺 ′ )⁄𝑁𝑁, which are equivalent to goods-market clearing
in the current and future time periods.
Figure 3.22 below shows how π‘Ÿπ‘Ÿ ∗ is determined graphically. At an arbitrary real interest rate
π‘Ÿπ‘Ÿ, the lifetime budget constraint of the representative household has gradient −(1 + π‘Ÿπ‘Ÿ) and
passes through the point ((π‘Œπ‘Œ − 𝐺𝐺)⁄𝑁𝑁 , (π‘Œπ‘Œ ′ − 𝐺𝐺 ′ )⁄𝑁𝑁), labelled 𝐸𝐸. A detailed justification of
102
EC2065 Macroeconomics | Chapter 3: Aggregate demand
the second of these claims is found in Section 4.1 but, for now, just note that the
government could pay for its expenditure using lump-sum taxes 𝑑𝑑 = 𝐺𝐺/𝑁𝑁 and 𝑑𝑑 ′ = 𝐺𝐺 ′ /𝑁𝑁
levied on each household.
The optimal consumption plan (𝑐𝑐 ∗ , 𝑐𝑐 ′∗ ) for a given π‘Ÿπ‘Ÿ will generally not be at
((π‘Œπ‘Œ − 𝐺𝐺 )⁄𝑁𝑁 , (π‘Œπ‘Œ ′ − 𝐺𝐺 ′ )⁄𝑁𝑁), which means that interest rate is not the equilibrium interest
rate. There is a particular interest rate π‘Ÿπ‘Ÿ ∗ at which each household willingly chooses
((π‘Œπ‘Œ − 𝐺𝐺)⁄𝑁𝑁 , (π‘Œπ‘Œ ′ − 𝐺𝐺 ′ )⁄𝑁𝑁). Since the optimal consumption plan is at a tangency point
between an indifference curve and the lifetime budget constraint, the value of 1 + π‘Ÿπ‘Ÿ ∗ is
pinned down by the gradient of the indifference curve passing through the point
((π‘Œπ‘Œ − 𝐺𝐺 )⁄𝑁𝑁 , (π‘Œπ‘Œ ′ − 𝐺𝐺 ′ )⁄𝑁𝑁).
Figure 3.21: Market-clearing real interest rate
It is also possible to understand the determination of π‘Ÿπ‘Ÿ ∗ in terms of balancing saving and
investment. There is no investment in the example and suppose the government runs a
balanced budget, so 𝑑𝑑 = 𝐺𝐺/𝑁𝑁. Aggregate saving by households is 𝑁𝑁(𝑦𝑦 − 𝑑𝑑 − 𝑐𝑐 ) = π‘Œπ‘Œ − 𝐺𝐺 −
𝑁𝑁𝑁𝑁 and saving equals investment is equivalent to zero saving here, which occurs when 𝑐𝑐 =
(π‘Œπ‘Œ − 𝐺𝐺)/𝑁𝑁.
In this example, if there were a change in income π‘Œπ‘Œ, the equilibrium real interest rate π‘Ÿπ‘Ÿ ∗
would adjust, all else being equal. The example is an extreme one because, in equilibrium,
consumption must adjust one-for-one with any change in income. Although households
would want to smooth consumption through saving or borrowing, this is not possible in
equilibrium. The mechanism by which individual households are dissuaded from trying to
smooth consumption is adjustment of the real interest rate.
In less extreme cases considered later in this chapter we can ask how the economy as a
whole can adjust its level of saving or borrowing. There are two ways this is possible. First,
through adjustment of investment. Second, in an open economy, through adjustment of the
current account, which we will study in Chapter 10.
103
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Box 3.3: Supply disruptions and real interest rates
To illustrate how equilibrium real interest rates are affected by macroeconomic shocks,
consider an economy hit by a supply shock that disrupts its ability to produce goods.
Assume here that real GDP is exogenously determined by the economy’s capacity to
produce goods and services. We look at a closed economy with a representative
household and zero investment and we suppose that fiscal policy remains unchanged
after the shock.
The supply shock reduces the economy’s current GDP π‘Œπ‘Œ. The shock is assumed to be
temporary, so that expected future GDP π‘Œπ‘Œ′ is unaffected. The direct consequence of the
shock is to reduces all households’ incomes temporarily. Faced with this temporary
decline in income, households would aim to smooth consumption, spreading out
adjustment of consumption between the present and future time periods. But with zero
investment and international trade, it is not possible for all households to save less or
borrow more simultaneously. Therefore, the real interest rate must adjust for the
economy to reach equilibrium.
The representative household’s lifetime budget constraint passes through the point
((π‘Œπ‘Œ − 𝐺𝐺 )⁄𝑁𝑁 , (π‘Œπ‘Œ ′ − 𝐺𝐺 ′ )⁄𝑁𝑁) and has gradient −(1 + π‘Ÿπ‘Ÿ). Equilibrium in the goods market
requires 𝐢𝐢 + 𝐺𝐺 = π‘Œπ‘Œ, so the real interest rate π‘Ÿπ‘Ÿ must adjust until the consumption plan
𝑐𝑐 = (π‘Œπ‘Œ − 𝐺𝐺)⁄𝑁𝑁 and 𝑐𝑐 ′ = (π‘Œπ‘Œ ′ − 𝐺𝐺 ′ )⁄𝑁𝑁 is willingly chosen. This is illustrated in Figure 3.23
below. Starting from the initial real interest rate, the decline in π‘Œπ‘Œ causes a parallel
leftward shift of the lifetime budget constraint. With no change in π‘Ÿπ‘Ÿ, households would
choose a consumption plan where both 𝑐𝑐 and 𝑐𝑐′ decline and 𝑐𝑐 falls by less than (π‘Œπ‘Œ −
𝐺𝐺)/𝑁𝑁 does.
Figure 3.22: Temporary reduction in GDP and equilibrium real interest rate
This is not consistent with equilibrium. In order for the chosen consumption plan to move
horizontally to the left as much as the lifetime budget constraint shifts, the real interest
rate must rise so that the budget constraint is tangent to the indifference curve passing
through that point. The indifference curve is steeper here than at the initial consumption
plan because of the diminishing marginal rate of substitution. Therefore, the equilibrium
real interest rate will need to rise to achieve equilibrium in the goods market.
104
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Intuitively, after the negative supply shock, at an unchanged real interest rate,
households would smooth consumption by borrowing against unchanged future
income. However, all households are hit by the shock in this example, so all want to
increase borrowing and none wants to save more. The interest rate must rise to
dissuade households from borrowing. More generally, as seen later in economies
where investment or net exports can adjust, the shock causes investment or net
exports to fall in addition to, or in place of, the rise in the equilibrium real interest rate.
3.7 Investment
In the national accounts, investment comprises:
•
•
•
Business fixed investment: firms purchasing new capital for use over time in
producing goods and services
Residential investment: purchases of new housing units
Inventory investment: changes in value of stock of finished goods.
In the USA, investment is typically 15–20 per cent of GDP. Of this total amount of
investment, business fixed investment is around two thirds, residential investment is around
one third, and inventory investment is small but volatile.
Figure 3.24 shows a time series of detrended investment in the USA alongside detrended
real GDP. We see that investment is procyclical, approximately coincident with GDP but is
much more volatile. The percentage deviations of investment from its trend are around 3–4
times larger than those of GDP. This is very different from the pattern of consumption
expenditure (except durables, which behaves more like investment) being less volatile than
GDP.
Figure 3.23: US investment over the business cycle
105
EC2065 Macroeconomics | Chapter 3: Aggregate demand
3.8 A two-period model of investment
We now consider model of investment decisions with two periods, as we did for our
dynamic model of consumption. Although investment expenditure also includes residential
investment and inventories, here, we will focus on purchases of new capital by firms, i.e.
business fixed investment, the largest component of total investment.
3.8.1 The production function
Our two-period model of business fixed investment assumes firms face a neoclassical
production function 𝐹𝐹(𝐾𝐾, 𝑁𝑁). Firms produce output using capital 𝐾𝐾 and labour 𝑁𝑁. The
amounts of output produced in the current and future time periods are:
π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁)
and
π‘Œπ‘Œ ′ = 𝑧𝑧 ′ 𝐹𝐹(𝐾𝐾 ′ , 𝑁𝑁 ′ )
Total factor productivity (TFP) in the current and future periods is 𝑧𝑧 and 𝑧𝑧′, respectively,
where 𝑧𝑧 and 𝑧𝑧′ may differ, for example, technological progress makes 𝑧𝑧′ larger than 𝑧𝑧. The
level of TFP in each time period is exogenous. Labour input is obtained by firms hiring labour
in a competitive market, as was studied in Section 1.2. Unlike the Chapter 1 analysis, firms
are assumed to own the capital 𝐾𝐾 that they use for production. There is no rental market for
capital, though this assumption does not matter greatly here.
3.8.2 Capital accumulation
Firms can change their stock of capital through purchases of new capital. Current
investment 𝐼𝐼 adds to the future stock of capital 𝐾𝐾′ that is used to produce output in the
future. This means it takes time for new capital to be produced and installed and be ready
for use in production (this assumption is known as ‘time-to-build’). The capital accumulation
equation is the following, which is the same one seen in the Solow model from Section 1.8:
𝐾𝐾 ′ = 𝐼𝐼 + (1 − 𝑑𝑑 )𝐾𝐾
The future capital stock is the sum of undepreciated current capital (1 − 𝑑𝑑 )𝐾𝐾 and
investment 𝐼𝐼. Investment is purchases of new capital goods, and we treat capital goods as
equivalent to any other goods for simplicity. This means a new unit of capital costs one unit
of goods (its price is 1), and in principle, undepreciated capital goods can be sold off and
used in the same way as newly produced goods.
As in the Solow model, capital depreciates at a constant rate 𝑑𝑑 over time. Note also that the
current capital stock 𝐾𝐾 is determined by decisions made in the past and is not affected by
current investment. Since the model has only two time periods, it is assumed firms sell off
any undepreciated capital (1 − 𝑑𝑑 )𝐾𝐾 ′ that remains after production in the future time
period.
3.8.3 Firms’ profits
The firm’s profits are the difference between its revenues and its costs of production, which
here are simply payments to factors of production. The firm owns its capital, so factor
payments are just the wage bill. Real wages are 𝑀𝑀 and 𝑀𝑀′ in the current and future periods,
106
EC2065 Macroeconomics | Chapter 3: Aggregate demand
hence, current and future real costs are 𝑀𝑀𝑀𝑀 and 𝑀𝑀′𝑁𝑁′. Real revenues are the same as
production π‘Œπ‘Œ and π‘Œπ‘Œ′ because all goods are homogeneous. Real profits are thus given by:
πœ‹πœ‹ = π‘Œπ‘Œ − 𝑀𝑀𝑀𝑀
and
πœ‹πœ‹ ′ = π‘Œπ‘Œ ′ − 𝑀𝑀 ′ 𝑁𝑁′
Note that these profits implicitly include the return on capital owned by the firm, so the
argument in Section 1.2 that perfect competition and a constant-returns-to-scale
production function push profits to zero does not apply here. Observe that the cost of any
investment 𝐼𝐼 is not deducted from profits because purchases of capital mean acquisition of
an asset that is not immediately used in current production. As profits are gross profits,
there is also no deduction of any cost of depreciation. However, both the cost of purchasing
capital and depreciation will be relevant when analysing the optimal level of investment.
3.8.4 Options for financing investment
Suppose a firm decides to purchase an amount of new capital 𝐼𝐼, which entails real
investment expenditure of the same amount. What are the options for financing this
expenditure?
The first possibility is borrowing. Suppose the firm can take out a loan of size 𝐿𝐿 = 𝐼𝐼 to pay
for the whole cost of purchasing the capital. If this loan has interest rate π‘Ÿπ‘Ÿπ‘™π‘™ , the firm must
repay (1 + π‘Ÿπ‘Ÿπ‘™π‘™ )𝐿𝐿 in the future period. We allow for π‘Ÿπ‘Ÿπ‘™π‘™ to differ from the interest rate π‘Ÿπ‘Ÿ on
government bonds received by savers because of credit-market imperfections discussed
further in Chapter 4. As the loan is sufficient to pay the whole cost, the firm can pay out all
its current profits πœ‹πœ‹ as dividends 𝑣𝑣 = πœ‹πœ‹. However, in the future, the debt repayment must
be made out of future profits before these can be distributed to shareholders. The future
dividend 𝑣𝑣′ is:
𝑣𝑣 ′ = πœ‹πœ‹ ′ + (1 − 𝑑𝑑 )𝐾𝐾′ − (1 + π‘Ÿπ‘Ÿπ‘™π‘™ )𝐼𝐼
Note that firms can also distribute the proceeds (1 − 𝑑𝑑 )𝐾𝐾′ from selling off undepreciated
capital as future dividends.
Another possible option is to use retained earnings to pay for the investment. Retained
earnings refers to profits the firm has made but has not distributed to shareholders. In this
case, the firm would pay a current dividend of 𝑣𝑣 = πœ‹πœ‹ − 𝐼𝐼, distributing only profits πœ‹πœ‹ in
excess of investment expenditure 𝐼𝐼. The future dividend would be 𝑣𝑣 ′ = πœ‹πœ‹ ′ + (1 − 𝑑𝑑 )𝐾𝐾′,
which is future profits plus the proceeds of selling off undepreciated capital. There are no
future debts to repay in this case.
The use of retained earnings to pay for investment requires current profits to be sufficiently
large, that is, πœ‹πœ‹ ≥ 𝐼𝐼. A third option is the issuance of new equity, which we will treat as
being equivalent to a negative dividend 𝑣𝑣 (the firm receives new funds from shareholders,
rather than a distribution of funds to shareholders). In this case, the current dividend is 𝑣𝑣 =
πœ‹πœ‹ − 𝐼𝐼 < 0 and the future dividend is 𝑣𝑣 ′ = πœ‹πœ‹ ′ + (1 − 𝑑𝑑 )𝐾𝐾′, which is mathematically
equivalent to using retained earnings, the only difference being that 𝑣𝑣 is negative. Although
algebraically the same, financial-market imperfections may limit firms’ ability to issue new
equity when current profitability is low.
107
EC2065 Macroeconomics | Chapter 3: Aggregate demand
3.8.5 The optimal investment decision
We assume that the managers of the firm making investment decisions act in interests of
the firm’s shareholders. If those shareholders are not credit-constrained, we have seen that
they care only about their lifetime wealth. This means the owners of the firm would want
managers to maximise the present value of dividends discounted using the interest rate π‘Ÿπ‘Ÿ:
𝑉𝑉 = 𝑣𝑣 +
𝑣𝑣′
1 + π‘Ÿπ‘Ÿ
The managers of the firm choose employment levels 𝑁𝑁 and 𝑁𝑁′ and investment 𝐼𝐼 to
maximise 𝑉𝑉. There is no dynamic dimension of the employment decisions: these are
equivalent to choosing 𝑁𝑁 and 𝑁𝑁′ to maximise πœ‹πœ‹ and πœ‹πœ‹ ′ respectively. The equations for
labour demand are 𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑀𝑀 and 𝑀𝑀𝑃𝑃𝑁𝑁′ = 𝑀𝑀′ in the two periods. At all times, firms hire
labour up to the point where the marginal product of labour is equal to the real wage. For
the choice of investment 𝐼𝐼, note that the capital accumulation equation implies this is
equivalent to choosing the future capital stock 𝐾𝐾′, with investment then given by 𝐼𝐼 = 𝐾𝐾 ′ −
(1 − 𝑑𝑑 )𝐾𝐾, where 𝐾𝐾 is the initial capital stock.
First, suppose a firm finances investment using borrowing. The present discounted value of
the dividends it distributes to shareholders in this case is
𝑧𝑧 ′ 𝐹𝐹 (𝐾𝐾 ′ , 𝑁𝑁 ′ ) − 𝑀𝑀 ′ 𝑁𝑁 ′ + (1 − 𝑑𝑑 )𝐾𝐾 ′ − (1 + π‘Ÿπ‘Ÿπ‘™π‘™ )(𝐾𝐾 ′ − (1 − 𝑑𝑑 )𝐾𝐾)
𝑉𝑉 = 𝑧𝑧𝑧𝑧 (𝐾𝐾, 𝑁𝑁) − 𝑀𝑀𝑀𝑀 +
1 + π‘Ÿπ‘Ÿ
The first-order condition to maximise 𝑉𝑉 with respect to 𝐾𝐾 ′ is:
𝑀𝑀𝑃𝑃𝐾𝐾′ + (1 − 𝑑𝑑 ) − (1 + π‘Ÿπ‘Ÿπ‘™π‘™ )
=0
1 + π‘Ÿπ‘Ÿ
This can be rearranged to obtain the following condition for the optimal level of investment:
𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 = π‘Ÿπ‘Ÿπ‘™π‘™
Alternatively, suppose that retained earnings are available and used to finance investment
instead. The present discounted value of dividends is:
𝑉𝑉 = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁) − 𝑀𝑀𝑀𝑀 − (𝐾𝐾 ′ − (1 − 𝑑𝑑 )𝐾𝐾 ) +
𝑧𝑧 ′ 𝐹𝐹 (𝐾𝐾 ′ , 𝑁𝑁 ′ ) − 𝑀𝑀 ′ 𝑁𝑁 ′ + (1 − 𝑑𝑑 )𝐾𝐾 ′
1 + π‘Ÿπ‘Ÿ
The first-order condition to maximise 𝑉𝑉 with respect to 𝐾𝐾′ is:
−1 +
𝑀𝑀𝑃𝑃𝐾𝐾′ + (1 − 𝑑𝑑 )
=0
1 + π‘Ÿπ‘Ÿ
This yields an equation for the optimal investment decision:
𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 = π‘Ÿπ‘Ÿ
In all cases, the optimal investment decision equates the marginal benefit and marginal cost
of investment. More investment allows the firm to hold an extra unit of capital. The
marginal benefit of this is the extra production of future output it makes possible, which is
worth 𝑀𝑀𝑃𝑃𝐾𝐾′ , the marginal product of capital in the future.
108
EC2065 Macroeconomics | Chapter 3: Aggregate demand
The marginal cost of holding an extra unit of capital includes the loss of a fraction 𝑑𝑑 of the
capital through depreciation. There is also the financing cost of holding more capital. If the
funds to purchase it are borrowed, the financing cost is the interest rate π‘Ÿπ‘Ÿπ‘™π‘™ on the loan. If
the funds come from retained earnings, the financing cost is an opportunity cost to the
firm’s shareholders. Funds are returned to shareholders in the future period rather than in
the current period and these funds could have earned shareholders a return π‘Ÿπ‘Ÿ if received
and saved.
3.8.6 Does the source of financing matter?
If there are no credit-market imperfections then firms can borrow at interest rate π‘Ÿπ‘Ÿ, the
same interest rate as received by savers. With π‘Ÿπ‘Ÿπ‘™π‘™ = π‘Ÿπ‘Ÿ, the direct cost of borrowing equals
the opportunity cost to shareholders of delaying distribution of profits if retained earnings
are used to pay for investment. This means the condition for optimal investment is 𝑀𝑀𝑃𝑃𝐾𝐾′ −
𝑑𝑑 = π‘Ÿπ‘Ÿ in both cases and hence, the source of financing does not matter for investment. As
we will see in Chapter 4, this may not be true once we allow for credit-market
imperfections.
3.8.7 The investment demand curve
The equation 𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 = π‘Ÿπ‘Ÿ implies a downward-sloping investment demand curve. A
neoclassical production function has diminishing returns to capital, so 𝑀𝑀𝑃𝑃𝐾𝐾′ decreases with
𝐾𝐾′. Given initial capital 𝐾𝐾, the equation 𝐾𝐾 ′ = 𝐼𝐼 + (1 − 𝑑𝑑 )𝐾𝐾 shows that 𝐾𝐾′ moves one-forone with investment 𝐼𝐼. The left-hand side of the optimal investment equation thus falls with
𝐼𝐼. The right-hand side is the real interest rate π‘Ÿπ‘Ÿ. Hence, the investment demand curve is
depicted in Figure 3.25 with π‘Ÿπ‘Ÿ on the vertical axis. The demand curve is given by plotting the
marginal product of capital as a function of 𝐼𝐼 shifted down by the depreciation rate 𝑑𝑑.
Figure 3.24: The investment demand curve
The Inada conditions for a neoclassical production function imply that 𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 is negative
for very high levels of investment, so 𝐼𝐼 𝑑𝑑 cuts the horizontal axis and falls as low as π‘Ÿπ‘Ÿ = −𝑑𝑑.
Observe also that 𝑀𝑀𝑃𝑃𝐾𝐾′ is finite if investment were zero, even though the Inada conditions
hold. This is because there is existing capital, so 𝐾𝐾′ is not zero even if there were no
109
EC2065 Macroeconomics | Chapter 3: Aggregate demand
investment currently. In principle, investment 𝐼𝐼 could be negative, which would mean selling
off capital goods for other uses. However, much investment may be irreversible in practice,
hence, this possibility is ignored when drawing the investment demand curve.
3.9 The stock market
The analysis of investment assumes firms are owned by shareholders. Thus, savers are
holding shares in firms in addition to other assets such as bonds. Buying shares in firms is
often described as ‘investment’ but such financial investment is logically distinct from
investments in physical capital made by firms themselves.
Each share in a firm is an equal claim on the profits of the firms after other creditors, for
example, bondholders, have been paid. Shareholders are the residual claimants on the firm.
If shares can be traded in a stock market, what determines the price of those shares? Is
financial investment and the stock-market value of firms related to firms’ decisions to invest
in physical capital?
Assume shareholders expect to be paid dividends 𝑣𝑣 and 𝑣𝑣′ in current and future time
periods but none after that in our two-period model. Suppose these shares trade at a price
𝑝𝑝 at the beginning of the current period, with any purchaser obtaining the right to both
dividends 𝑣𝑣 and 𝑣𝑣′. Taking the dividend 𝑣𝑣 and reinvesting in financial assets results in a
future payoff (1 + π‘Ÿπ‘Ÿ)𝑣𝑣. Assume the investor expects to be able to sell shares at price 𝑝𝑝′ in
the future before the dividend 𝑣𝑣′ is paid. The rate of return for an investor who buys shares
at current price 𝑝𝑝 is the sum of the dividend yield and any capital gains or losses as a
percentage of 𝑝𝑝:
(1 + π‘Ÿπ‘Ÿ)𝑣𝑣 𝑝𝑝′ − 𝑝𝑝
+
𝑝𝑝
𝑝𝑝
An investor who does not care about risk (said to be ‘risk-neutral’) would choose to hold the
asset with the highest expected return. Bonds have real interest rate π‘Ÿπ‘Ÿ, which is their real
return. For both bonds and shares to be willingly held by savers, their expected returns must
be the same:
π‘Ÿπ‘Ÿ =
(1 + π‘Ÿπ‘Ÿ)𝑣𝑣 𝑝𝑝′ − 𝑝𝑝
+
𝑝𝑝
𝑝𝑝
Solving this equation for 𝑝𝑝 shows that the share price must be 𝑝𝑝 = 𝑣𝑣 + 𝑝𝑝′ ⁄(1 + π‘Ÿπ‘Ÿ). Owning
the share in the future simply gives a claim to the final dividend 𝑣𝑣′, hence, the anticipated
future share price must be 𝑝𝑝′ = 𝑣𝑣 ′ . Therefore, the equilibrium share price 𝑝𝑝 and stockmarket value of the firm is equal to the present discounted value of dividends 𝑉𝑉:
𝑝𝑝 = 𝑉𝑉 = 𝑣𝑣 +
𝑣𝑣′
1 + π‘Ÿπ‘Ÿ
With equilibrium share price found here, firms choosing investment to maximise the
present value of dividends is the same as aiming to maximise their stock-market value.
110
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Box 3.4: Stock prices and firms’ investment decisions
We have seen that optimal investment by firms can be understood in terms of maximising
their stock-market value. There is also a more specific prediction of the theory of
investment about the relationship between stock prices and investment in terms of a
variable known as Tobin’s Q.
For a firm that uses both debt and equity financing, Tobin’s Q is defined as follows:
Tobin's Q =
Market value of firm
Replacement cost of capital
The market value of the firm in the numerator of Tobin’s Q is the sum of the value of
bonds issued and the value of its shares, the latter known as the firm’s stock-market
capitalisation. The replacement cost of capital in the denominator is what the physical
capital owned by the firm would cost to buy as new.
Working in the context of the two-period model of investment from Section 3.8, we will
calculate Tobin’s Q in the first period after firms have paid their current dividend 𝑣𝑣. Let 𝑆𝑆
denote a firm’s stock-market value at this point where its shares are ex dividend
(purchasing them gives a claim to future dividends only). With only one period remaining
in the future, the expected future value of the shares is 𝑝𝑝′ = 𝑣𝑣 ′ , where 𝑣𝑣′ is the one
remaining future dividend. If π‘Ÿπ‘Ÿ is the rate of interest and buyers of the shares must
receive the same expected return, then π‘Ÿπ‘Ÿ = (𝑝𝑝′ − 𝑆𝑆)/𝑆𝑆, which implies the current exdividend value of the shares is:
𝑆𝑆 =
𝑝𝑝′
𝑣𝑣′
=
1 + π‘Ÿπ‘Ÿ 1 + π‘Ÿπ‘Ÿ
If the firm borrows an amount 𝐿𝐿, for example by issuing bonds, with (1 + π‘Ÿπ‘Ÿ)𝐿𝐿 repayable
in the future period (assuming it can borrow at rate π‘Ÿπ‘Ÿ absent financial-market
imperfections) then the future dividend it can pay is 𝑣𝑣 ′ = πœ‹πœ‹ ′ + (1 − 𝑑𝑑 )𝐾𝐾 ′ − (1 + π‘Ÿπ‘Ÿ)𝐿𝐿,
conditional on the capital 𝐾𝐾′ it will have available. The firm’s future profits are πœ‹πœ‹ ′ = π‘Œπ‘Œ ′ −
𝑀𝑀 ′ 𝑁𝑁 ′ = π‘Œπ‘Œ ′ − 𝑀𝑀𝑃𝑃𝑁𝑁′ 𝑁𝑁 ′ because a profit-maximising choice of future employment implies
𝑀𝑀𝑃𝑃𝑁𝑁′ = 𝑀𝑀′. With constant returns to scale, π‘Œπ‘Œ ′ = 𝑀𝑀𝑃𝑃𝐾𝐾′ 𝐾𝐾 ′ + 𝑀𝑀𝑃𝑃𝑁𝑁′ 𝑁𝑁 ′ , so future profits
are πœ‹πœ‹ ′ = 𝑀𝑀𝑃𝑃𝐾𝐾′ 𝐾𝐾 ′ , from which the stock-market value of the firm can be deduced:
𝑆𝑆 =
𝑀𝑀𝑃𝑃𝐾𝐾′ 𝐾𝐾 ′ + (1 − 𝑑𝑑 )𝐾𝐾′
− 𝐿𝐿
1 + π‘Ÿπ‘Ÿ
If a firm issues debt 𝐿𝐿 and has stock-market capitalisation 𝑆𝑆 then its total market value is
𝑆𝑆 + 𝐿𝐿, which is given by:
𝑀𝑀𝑃𝑃𝐾𝐾′ 𝐾𝐾 ′ + (1 − 𝑑𝑑 )𝐾𝐾 ′ (𝑀𝑀𝑃𝑃𝐾𝐾′ + 1 − 𝑑𝑑 )𝐾𝐾 ′
𝑆𝑆 + 𝐿𝐿 =
=
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
If the firm chooses a level of investment that gives it capital stock 𝐾𝐾′ then the
replacement cost of the capital stock 𝐾𝐾 ′ is simply 𝐾𝐾 ′ because each unit of physical capital
costs one unit of goods. Tobin’s Q can therefore be calculated as the ratio of 𝑆𝑆 + 𝐿𝐿 to 𝐾𝐾′:
𝑆𝑆 + 𝐿𝐿 (𝑀𝑀𝑃𝑃𝐾𝐾′ + 1 − 𝑑𝑑 )𝐾𝐾 ′ 𝑀𝑀𝑃𝑃𝐾𝐾′ + 1 − 𝑑𝑑
𝑄𝑄 =
=
=
(1 + π‘Ÿπ‘Ÿ)𝐾𝐾 ′
𝐾𝐾 ′
1 + π‘Ÿπ‘Ÿ
111
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Observe that Tobin’s Q is greater than 1 when 𝑀𝑀𝑃𝑃𝐾𝐾′ + 1 − 𝑑𝑑 > 1 + π‘Ÿπ‘Ÿ, which is
equivalent to 𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 > π‘Ÿπ‘Ÿ and would mean investment is too low according to the
theory developed in Section 3.8. If firms were to invest up to the point where 𝑀𝑀𝑃𝑃𝐾𝐾′ −
𝑑𝑑 = π‘Ÿπ‘Ÿ, this would mean the same as investing until Tobin’s Q has declined to 1. Thus, the
theory of investment developed earlier is equivalent to the prediction that investment is
higher the further Tobin’s Q is above 1.
In principle, Tobin’s Q is measurable and can be compared to data on investment. In
practice, for many modern firms, intangible capital such as patents and brand reputation
has become more important than physical capital. The replacement cost of intangible
capital is harder to quantify, and makes measurement of the denominator of Tobin’s Q
challenging.
Nonetheless, if we look at just the numerator of Tobin’s Q and focus on stock-market
capitalisation, we see that stock market values are often correlated with investment.
Figure 3.26 plots the detrended S&P500 index and investment in the USA, between which
there is a strong positive correlation.
Figure 3.25: S&P 500 and investment over the business cycle (USA)
Box 3.5: Should capital be taxed?
This section looks at the implications of taxes on capital or capital income for the level of
investment in an economy and investigates how large the distortions from such taxes
might be. To simplify the analysis, we focus on the implications of capital taxes for the
steady-state capital stock 𝐾𝐾 ′ = 𝐾𝐾.
Suppose that net capital income (𝑅𝑅 − 𝑑𝑑 )𝐾𝐾 is taxed at a proportional rate 𝜏𝜏, where 𝑅𝑅 =
𝑀𝑀𝑃𝑃𝐾𝐾 is the marginal product of capital and 𝑑𝑑 is the depreciation rate. For example,
suppose a firm with profits πœ‹πœ‹ = 𝑀𝑀𝑃𝑃𝐾𝐾 𝐾𝐾 distributes dividends 𝑣𝑣 = πœ‹πœ‹ − 𝑑𝑑𝑑𝑑 after using
retained earnings to finance investment 𝐼𝐼 = 𝑑𝑑𝑑𝑑 to maintain the steady-state capital stock
𝐾𝐾. These dividends might then be subject to an income tax rate 𝜏𝜏.
112
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Let π‘Ÿπ‘Ÿ denote the equilibrium after-tax return that investors receive, which we will take as
given here. In the absence of tax, investment would be determined by 𝑅𝑅 − 𝑑𝑑 = π‘Ÿπ‘Ÿ. When
the tax on capital income is 𝜏𝜏, firms invest in capital up to the point where:
(1 − 𝜏𝜏)(𝑅𝑅 − 𝑑𝑑 ) = π‘Ÿπ‘Ÿ
Here, the cost of depreciation reduces the tax liability, although we can also do the
analysis under the assumption that gross capital income is taxed. Given the required
after-tax real rate of return π‘Ÿπ‘Ÿ on capital, investment and the stock of capital are
determined by:
𝑀𝑀𝑃𝑃𝐾𝐾 − 𝑑𝑑 =
π‘Ÿπ‘Ÿ
1 − 𝜏𝜏
Assume the production function is π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁), which is 𝑦𝑦 = 𝑧𝑧𝑧𝑧(π‘˜π‘˜) in per worker
terms. The marginal product of capital is 𝑀𝑀𝑃𝑃𝐾𝐾 = 𝑧𝑧𝑧𝑧′(π‘˜π‘˜). In a competitive economy, the
real wage received by workers is 𝑀𝑀 = 𝑀𝑀𝑃𝑃𝑁𝑁 . A neoclassical production function has the
feature that π‘Œπ‘Œ = 𝑀𝑀𝑃𝑃𝐾𝐾 𝐾𝐾 + 𝑀𝑀𝑃𝑃𝑁𝑁 𝑁𝑁, which implies 𝑀𝑀 = 𝑦𝑦 − 𝑀𝑀𝑃𝑃𝐾𝐾 π‘˜π‘˜ = 𝑧𝑧�𝑓𝑓(π‘˜π‘˜) − π‘˜π‘˜π‘“π‘“ ′ (π‘˜π‘˜)οΏ½.
Taking a given positive value of π‘Ÿπ‘Ÿ, a higher tax rate 𝜏𝜏 raises π‘Ÿπ‘Ÿ⁄(1 − 𝜏𝜏) and thus 𝑀𝑀𝑃𝑃𝐾𝐾 =
𝑧𝑧𝑧𝑧′(π‘˜π‘˜) must be higher in equilibrium. Since 𝑓𝑓′(π‘˜π‘˜) is decreasing in π‘˜π‘˜, this means there will
be less investment and lower steady-state π‘˜π‘˜. The tax thus reduces output per worker 𝑦𝑦 =
𝑧𝑧𝑧𝑧(π‘˜π‘˜) and also lowers wages 𝑀𝑀 = 𝑀𝑀𝑃𝑃𝑁𝑁 because 𝑀𝑀𝑃𝑃𝑁𝑁 is lower when there is less capital
per worker π‘˜π‘˜. This can be shown by noting 𝑀𝑀 = 𝑧𝑧(𝑓𝑓(π‘˜π‘˜) − π‘˜π‘˜π‘“π‘“ ′ (π‘˜π‘˜)) is increasing in π‘˜π‘˜ since
𝑓𝑓 ′′ (π‘˜π‘˜) < 0.
Suppose capital income is taxed and used to subsidise the wage income of workers. More
generally, we could ask whether capital income or wages should be taxed to finance
public expenditure. The amount of tax raised per worker is 𝜏𝜏(𝑅𝑅 − 𝑑𝑑 )𝐾𝐾⁄𝑁𝑁 =
𝜏𝜏(𝑧𝑧𝑓𝑓 ′ (π‘˜π‘˜) − 𝑑𝑑 )π‘˜π‘˜, so the wage including the subsidy is 𝑀𝑀 + 𝜏𝜏(𝑧𝑧𝑓𝑓 ′ (π‘˜π‘˜) − 𝑑𝑑 )π‘˜π‘˜. Using 𝑀𝑀 =
𝑧𝑧(𝑓𝑓 (π‘˜π‘˜) − π‘˜π‘˜π‘“π‘“ ′ (π‘˜π‘˜)) the wage plus the subsidy is:
𝑧𝑧𝑧𝑧(π‘˜π‘˜) − π‘˜π‘˜π‘˜π‘˜π‘“π‘“ ′ (π‘˜π‘˜) + 𝜏𝜏(𝑧𝑧𝑓𝑓 ′ (π‘˜π‘˜) − 𝑑𝑑 )π‘˜π‘˜ = 𝑧𝑧𝑧𝑧 (π‘˜π‘˜) − π‘˜π‘˜ (1 − 𝜏𝜏)𝑧𝑧𝑓𝑓 ′ (π‘˜π‘˜) − 𝜏𝜏𝜏𝜏𝜏𝜏
With π‘˜π‘˜ determined by 𝑧𝑧𝑧𝑧′(π‘˜π‘˜) = 𝑑𝑑 + π‘Ÿπ‘Ÿ⁄(1 − 𝜏𝜏), the subsidised wage is given by the
formula 𝑧𝑧𝑧𝑧(π‘˜π‘˜) − (1 − 𝜏𝜏)𝑑𝑑𝑑𝑑 − π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ − 𝜏𝜏𝜏𝜏𝜏𝜏 = 𝑧𝑧𝑧𝑧(π‘˜π‘˜) − (π‘Ÿπ‘Ÿ + 𝑑𝑑 )π‘˜π‘˜.
Taking account of the capital tax’s effect on investment, each worker receives 𝑧𝑧𝑧𝑧(π‘˜π‘˜) −
(π‘Ÿπ‘Ÿ + 𝑑𝑑 )π‘˜π‘˜, where π‘˜π‘˜ depends on 𝜏𝜏. This is maximised over π‘˜π‘˜ where 𝑀𝑀𝑃𝑃𝐾𝐾 = 𝑧𝑧𝑓𝑓 ′ (π‘˜π‘˜) = π‘Ÿπ‘Ÿ +
𝑑𝑑. But since π‘˜π‘˜ must satisfy 𝑀𝑀𝑃𝑃𝐾𝐾 − 𝑑𝑑 = π‘Ÿπ‘Ÿ/(1 − 𝜏𝜏) given 𝜏𝜏, the value of 𝜏𝜏 that is in the
interests of workers is 𝜏𝜏 = 0. Even though the revenue from taxing capital is directly
redistributed to workers, workers prefer no capital tax. This is because the negative effect
of the capital tax on investment and wages is too large.
One important assumption in this analysis is that there is a perfectly elastic supply of
funds for investment at a constant required rate of return π‘Ÿπ‘Ÿ, which makes the economy’s
supply of capital perfectly elastic. This makes capital the exact opposite of land, which is
perfectly inelastic in supply.
113
EC2065 Macroeconomics | Chapter 3: Aggregate demand
3.10 Labour supply over time
In the remainder of the chapter, we will see how to put together the various aspects of
macroeconomics that we have analysed separately: labour demand; labour supply;
consumption; saving and investment. Doing this requires considering one other aspect of
households’ plans, namely, the dynamics of households’ labour supply.
Our analysis of households’ plans has so far considered labour supply at a given time with
reference to the trade-off between leisure and consumption. This was studied in the static
macroeconomic model from Section 1.4. We have also explored the decision to save or
borrow in this chapter through the trade-off between consumption now or future
consumption in the two-period model. This analysis of saving was done taking as given the
income the household expected to earn in the current and future periods. However, income
is affected by the decision to supply labour. This means that saving, defined as disposable
income minus consumption, can be increased by raising income as well as by reducing
consumption. Hence, there is an interaction between the decision to save and the decision
to supply labour that we should take account of.
This section studies the labour supply decision or choice of leisure over time. In a twoperiod model, a household can choose current and future consumption 𝑐𝑐 and 𝑐𝑐′ and current
and future leisure 𝑙𝑙 and 𝑙𝑙′. The household’s preferences are assumed to feature diminishing
marginal rates of substitution between any pair of 𝑐𝑐,𝑐𝑐′,𝑙𝑙,𝑙𝑙′, and all are assumed to be normal
goods.
With β„Ž hours available in each period, the choice of 𝑙𝑙 and 𝑙𝑙′ determines the current and
future labour supplies 𝑁𝑁 𝑠𝑠 = β„Ž − 𝑙𝑙 and 𝑁𝑁 𝑠𝑠 ′ = β„Ž − 𝑙𝑙′. Current and future real wages are 𝑀𝑀
and 𝑀𝑀′, and the household also receives dividends 𝑣𝑣 and 𝑣𝑣′ from firms. As in the earlier
analysis of consumption, the household faces a lifetime budget constraint:
𝑀𝑀 ′ (β„Ž − 𝑙𝑙 ′ ) + 𝑣𝑣 ′ − 𝑑𝑑′
𝑐𝑐′
= 𝑀𝑀 (β„Ž − 𝑙𝑙 ) + 𝜈𝜈 − 𝑑𝑑 +
𝑐𝑐 +
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
Here, income 𝑦𝑦 has been replaced by the sum of wage income 𝑀𝑀𝑁𝑁 𝑠𝑠 and dividends 𝑣𝑣.
Income is no longer exogenous because the household can change 𝑦𝑦 through its choice of
labour supply 𝑁𝑁 𝑠𝑠 .
The household’s optimal plans equate marginal rates of substitution to relative prices
implied by the lifetime budget constraint for all the possible pairs of 𝑐𝑐, 𝑐𝑐′, 𝑙𝑙, and 𝑙𝑙′. Most of
these have already been analysed. First, considering current leisure 𝑙𝑙 and current
consumption 𝑐𝑐, holding fixed future 𝑐𝑐 ′ and 𝑙𝑙 ′ , the relative price is the current real wage 𝑀𝑀
and the optimality condition is 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝑐𝑐 = 𝑀𝑀, as seen in 0. This is gives the optimal labour
supply from a static point of view. The same logic applies in the future period as well,
considering 𝑙𝑙′ and 𝑐𝑐′ and holding 𝑐𝑐 and 𝑙𝑙 fixed. The relative price is the future real wage 𝑀𝑀′
and the optimality condition is 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙′ ,𝑐𝑐 ′ = 𝑀𝑀 ′ .
We have also studied the trade-off between 𝑐𝑐 and 𝑐𝑐′, holding 𝑙𝑙 and 𝑙𝑙′ fixed (which is
equivalent to fixing incomes 𝑦𝑦 and 𝑦𝑦′). The relative price is 1 + π‘Ÿπ‘Ÿ and the optimality
condition is 𝑀𝑀𝑀𝑀𝑆𝑆𝑐𝑐,𝑐𝑐 ′ = 1 + π‘Ÿπ‘Ÿ. So far, the optimality conditions have been derived and
114
EC2065 Macroeconomics | Chapter 3: Aggregate demand
explained earlier. The new condition we will look at comes from the trade-off between
current leisure 𝑙𝑙 and future leisure 𝑙𝑙′.
Sacrificing leisure in the current period (reducing 𝑙𝑙) increases labour supply (higher 𝑁𝑁 𝑠𝑠 ) and
raises current income by the wage 𝑀𝑀 for each hour of leisure given up. This income could be
spent now on consumption but it could also be saved. By saving the extra wage 𝑀𝑀, future
financial assets are increased by (1 + π‘Ÿπ‘Ÿ)𝑀𝑀. This extra wealth could be spent on consumption
but it could also be used to reduce labour supply in future by replacing future wage income.
For example, earlier retirement with no loss of future consumption becomes possible if the
household has more financial assets.
Mathematically, holding 𝑐𝑐 and 𝑐𝑐′ constant, a reduction of 𝑙𝑙 by one unit means that 𝑙𝑙′ can
increase by (1 + π‘Ÿπ‘Ÿ)𝑀𝑀/𝑀𝑀′. Hence, (1 + π‘Ÿπ‘Ÿ)𝑀𝑀/𝑀𝑀′ is the relative price of the household’s
current time in terms of its future time and this relative price matters for how much it is
optimal to work now compared to working in the future period. The relative price increases
when the current wage 𝑀𝑀 rises relative to the future wage 𝑀𝑀′, or when the real interest rate
π‘Ÿπ‘Ÿ increases. The household’s optimal plan must feature:
𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝑙𝑙′ = (1 + π‘Ÿπ‘Ÿ)𝑀𝑀/𝑀𝑀′
Figure 3.27 draws indifference curves and a lifetime budget constraint in a diagram with
current leisure 𝑙𝑙 on the horizontal axis and future leisure 𝑙𝑙′ on the vertical axis. The range is
limited to leisure between 0 and the maximum available time β„Ž where the household would
not participate in the labour market. All other variables 𝑐𝑐 and 𝑐𝑐′ are held fixed. The budget
constraint is downward sloping and has gradient −(1 + π‘Ÿπ‘Ÿ)𝑀𝑀/𝑀𝑀′. The diagram shows the
case where it is feasible to have one of, but not both, 𝑙𝑙 and 𝑙𝑙′ equal to β„Ž, so the household
could choose not to participate in the labour market in one of the periods. However, given
the indifference curves as shown, the household chooses to participate in both periods to
some extent.
Figure 3.26: Optimal timing of leisure/labour supply
115
EC2065 Macroeconomics | Chapter 3: Aggregate demand
The optimality condition 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝑙𝑙′ = (1 + π‘Ÿπ‘Ÿ)𝑀𝑀/𝑀𝑀′ implies the real interest rate π‘Ÿπ‘Ÿ changes
optimal labour supply in the same direction as the real wage 𝑀𝑀. A higher π‘Ÿπ‘Ÿ raises the
incentive for work because the extra income obtained can be saved to earn a higher return.
Or in other words, a higher π‘Ÿπ‘Ÿ increases the incentive to save and one way to do this is to
earn more and not to spend it. Since π‘Ÿπ‘Ÿ matters for the labour supply decision, it follows that
the labour supply curve 𝑁𝑁 𝑠𝑠 shifts when π‘Ÿπ‘Ÿ changes. Where labour supply increases with the
real wage 𝑀𝑀, the upward-sloping 𝑁𝑁 𝑠𝑠 curve shifts to the right when π‘Ÿπ‘Ÿ rises.
3.11 A dynamic macroeconomic model
Here we take all that we have learned about household and firm behaviour and put
together the pieces:
•
•
Labour demand and supply
Consumption and investment.
We will set up a dynamic macroeconomic model and see how to analyse the economy in
general equilibrium. General equilibrium means there is equilibrium in all markets
simultaneously. The three markets we consider are:
•
•
•
Goods market
Labour market
Bond market.
For now, we will work with the case of a closed economy. Section 9.1 shows how to adapt
our dynamic macroeconomic model to an open economy.
3.11.1 A representative household
We simplify matters by assuming a representative household. This means that all
households in the economy have the same preferences and the same level of wealth.
Specifically, all households are paid the same amount for work per hour, face the same
taxes and all own an equal number of shares in firms. Since the economy is closed, firms
must be owned by domestic residents.
The consequence of the representative household assumption is that all households will
ultimately make the same choices, even if they act individually. With 𝑁𝑁 households in the
economy, aggregate consumption is 𝐢𝐢 = 𝑁𝑁𝑁𝑁, where 𝑐𝑐 is an individual household’s
consumption choice. The same is true for other aggregates, which are simply scaled-up
versions of each individual’s choices and we will often ignore the distinction between
individual and aggregate variables. While we cannot use the representative-household
framework to study how inequality affects the economy, it is nonetheless a useful starting
point for many other issues.
One simplification that comes from there being a representative household in a closed
economy is that income effects from changes in the wage 𝑀𝑀 or interest rate π‘Ÿπ‘Ÿ can be
ignored. All else being equal, a higher wage makes households better off as workers but
worse off as owners of firms and these two effects cancel out. All else being equal, a higher
116
EC2065 Macroeconomics | Chapter 3: Aggregate demand
interest rate makes households better off as savers but worse off as owners of firms with
higher borrowing costs and worse off as taxpayers if the government has higher borrowing
costs.
With no income effects coming from 𝑀𝑀 or π‘Ÿπ‘Ÿ, there are only substitution effects to consider.
A higher wage 𝑀𝑀 implies an incentive to supply more labour, so the labour supply curve 𝑁𝑁 𝑠𝑠
is upward sloping. A higher interest rate π‘Ÿπ‘Ÿ increases saving, which lowers the consumption
demand 𝐢𝐢 𝑑𝑑 . The left panel of Figure 3.28 depicts a downward-sloping consumption demand
curve with π‘Ÿπ‘Ÿ on the vertical axis and 𝐢𝐢 on the horizontal axis. Higher π‘Ÿπ‘Ÿ also increases labour
supply as discussed in the analysis of the dynamics of labour supply. This means the labour
supply curve 𝑁𝑁 𝑠𝑠 shifts to the right when π‘Ÿπ‘Ÿ increases as shown in the right panel of the
figure.
Figure 3.27: Consumption demand and labour supply
It is important to note this logic does not mean all income effects can be neglected. Often,
the exogenous cause of a change in 𝑀𝑀 or π‘Ÿπ‘Ÿ will make households better off or worse off
overall. For example, higher productivity (TFP 𝑧𝑧 or 𝑧𝑧′) makes households better off, raising
either wages, profits, or some combination of the two. The argument is that conditional on
the level of TFP, households are not made better off overall by higher wages. Another
example is when the tax burden falls, which makes households better off, all else equal. It is
not always possible to infer from the direction of change in GDP whether households are
better off or not, as we will see in later examples.
Since consumption and leisure are both normal goods, anything that makes households
better off overall leads to a ‘wealth effect’ that increases consumption demand and reduces
labour supply. These wealth effects shift the 𝐢𝐢 𝑑𝑑 curve to the right and the 𝑁𝑁 𝑠𝑠 curve to the
left. If households are made worse off, the wealth effects are lower consumption demand
𝐢𝐢 𝑑𝑑 and higher labour supply 𝑁𝑁 𝑠𝑠 .
3.11.2 Firms
We assume a representative firm. All firms share the same neoclassical production function
with the same total factor productivity. Firms own the capital they use and all have the
same capital stock, so there is no need to consider a rental market for capital. Investment is
117
EC2065 Macroeconomics | Chapter 3: Aggregate demand
financed by issuing bonds. Firms are owned by the representative household. There is no
need for trade in shares, so we do not consider the stock market in the dynamic
macroeconomic model.
3.11.3 Government
We will analyse the government’s fiscal policies further in Chapter 4. At this stage we simply
note that the government chooses current and future public expenditure 𝐺𝐺 and 𝐺𝐺′, which is
part of the economy’s demand for goods and services along with private expenditure 𝐢𝐢 𝑑𝑑
and 𝐼𝐼 𝑑𝑑 on consumption and investment. The government levies the same lump-sum taxes 𝑑𝑑
and 𝑑𝑑′ on all households, and finances any budget deficit by issuing bonds.
3.12 General equilibrium
We now investigate how the economy reaches general equilibrium in goods, labour and
bond markets.
The analysis starts in the labour market. There, firms’ labour demand 𝑁𝑁 𝑑𝑑 is determined by
the equation 𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑀𝑀. Households’ labour supply 𝑁𝑁 𝑠𝑠 (π‘Ÿπ‘Ÿ) is determined by the static and
dynamic optimality conditions 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 = 𝑀𝑀 and 𝑀𝑀𝑅𝑅𝑆𝑆𝑙𝑙,𝑙𝑙′ = (1 + π‘Ÿπ‘Ÿ)𝑀𝑀/𝑀𝑀′. At this stage, we
take the real interest rate π‘Ÿπ‘Ÿ as given, which influences the position of the labour supply
curve. The left panel of Figure 3.29 depicts the labour market diagram. The real wage
adjusts to 𝑀𝑀 ∗ to match demand and supply and clear the labour market. Intuitively, wages
affect the cost of hiring labour for firms and the incentive to work for households.
Figure 3.28: Labour-market equilibrium
The equilibrium level of employment 𝑁𝑁 ∗ in the labour market has implications for the goods
market through the amount of goods and services firms produce. The production function is
π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁). Given exogenous TFP 𝑧𝑧 and a predetermined current capital stock 𝐾𝐾, firms’
output of goods π‘Œπ‘Œ rises or falls with employment. The right panel of the figure depicts the
production function and shows how the supply of goods π‘Œπ‘Œ ∗ is determined by equilibrium
employment 𝑁𝑁 ∗ .
In the goods market, firms’ supply curve for goods is derived from labour-market
equilibrium. The supply of output comes from the production function π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁 ∗ ) given
118
EC2065 Macroeconomics | Chapter 3: Aggregate demand
the equilibrium level of employment 𝑁𝑁 ∗ in the labour market. The derivation of the output
supply curve π‘Œπ‘Œ 𝑠𝑠 is shown in Figure 3.30. The right panel is the goods market diagram where
the quantity of output (real GDP π‘Œπ‘Œ) is on the horizontal axis and the real interest rate π‘Ÿπ‘Ÿ is on
the vertical axis. The interest rate is relevant because 1 + π‘Ÿπ‘Ÿ is relative price of current goods
in terms of future goods.
A higher real interest rate π‘Ÿπ‘Ÿ shifts 𝑁𝑁 𝑠𝑠 (π‘Ÿπ‘Ÿ) to the right as shown in the left panel. This
increases employment 𝑁𝑁 ∗ leading to a movement up the production function in the middle
panel, which raises the supply of output π‘Œπ‘Œ 𝑠𝑠 . This explains the upward-sloping π‘Œπ‘Œ 𝑠𝑠 curve seen
in the right panel.
Figure 3.29: Derivation of the output supply curve
The analysis of the goods market is completed by deriving the demand curve. This sums up:
•
•
•
Consumption demand 𝐢𝐢 𝑑𝑑
Investment demand 𝐼𝐼 𝑑𝑑
Government expenditure 𝐺𝐺.
The equation describing the aggregate demand π‘Œπ‘Œ 𝑑𝑑 for goods and services is:
π‘Œπ‘Œ 𝑑𝑑 = 𝐢𝐢 𝑑𝑑 + 𝐼𝐼 𝑑𝑑 + 𝐺𝐺
Consumption demand and investment demand depend negatively on the real interest rate
π‘Ÿπ‘Ÿ. Summing up the components of aggregate demand implies a downward-sloping demand
curve for goods as shown in Figure 3.31.
119
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Figure 3.30: Derivation of the output demand curve
With upward-sloping π‘Œπ‘Œ 𝑠𝑠 and downward-sloping π‘Œπ‘Œ 𝑑𝑑 curves, the goods market clears through
adjustment of the real interest rate π‘Ÿπ‘Ÿ. A higher real interest rate discourages spending on
consumption and investment, which reduces aggregate demand for goods. A higher real
interest rate also encourages greater labour supply so households can save more, which
raises employment and production. Figure 3.32 shows the equilibrium real interest rate π‘Ÿπ‘Ÿ ∗
where the π‘Œπ‘Œ 𝑑𝑑 and π‘Œπ‘Œ 𝑠𝑠 curves intersect. In this model, notice that the real interest rate is
determined by competitive markets, not by the central bank’s monetary policy. This is
different in the sticky-price New Keynesian model we will study in Chapter 8.
Figure 3.31: Goods market equilibrium
The intersection of the π‘Œπ‘Œ 𝑑𝑑 and π‘Œπ‘Œ 𝑠𝑠 curves is not only a point of goods-market equilibrium. It
is also a labour-market equilibrium because every point on π‘Œπ‘Œ 𝑠𝑠 represents equilibrium in the
labour market by construction. That covers two of the three markets in the dynamic
macroeconomic model. What about the remaining market, the bond market?
It might be expected the interest rate π‘Ÿπ‘Ÿ would adjust to achieve bond-market equilibrium
but it must be at π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿ ∗ to clear the goods market. However, it turns out that the bond
market is in equilibrium once the labour and goods markets are both in equilibrium. This
120
EC2065 Macroeconomics | Chapter 3: Aggregate demand
follows from the logic of budget constraints. Spending (demand) in markets other than the
bond market is balanced by earnings (supply) in those markets, so budget constraints then
imply a balance between spending and earning in final market, which is equivalent to
demand equals supply in that market. This result is known as Walras’ law and consequently,
we do not need to consider the bond market separately.
The complete dynamic macroeconomic model can then be summarised by the labourmarket and goods-market diagrams shown in Figure 3.33.
Box 3.6: Growth slowdowns and real interest rates
Recent decades have seen persistently low real interest rates in advanced economies.
Why might this have happened? One hypothesis is that there is greater pessimism about
the economy’s future growth potential. We will explore the connection between
expectations of the economy’s prospects and the equilibrium level of real interest rates
using the dynamic macroeconomic model.
We represent greater pessimism about future growth by lowering expectations of future
TFP 𝑧𝑧′ while leaving current TFP 𝑧𝑧 unchanged. Lower 𝑧𝑧′ than otherwise reduces the future
marginal product of capital 𝑀𝑀𝑃𝑃𝐾𝐾′ , so investment demand 𝐼𝐼 𝑑𝑑 falls for each interest rate π‘Ÿπ‘Ÿ.
This shifts the π‘Œπ‘Œ 𝑑𝑑 curve to the left. Lower 𝑧𝑧 ′ also reduces households’ expectations of
income in the future, which makes them worse off and means a negative wealth effect
and less demand for all normal goods. The reduction in 𝐢𝐢 𝑑𝑑 shifts the π‘Œπ‘Œ 𝑑𝑑 curve further to
the left. The decrease in demand for leisure means an increase in labour supply 𝑁𝑁 𝑠𝑠 . This
causes rightward shift of the π‘Œπ‘Œ 𝑠𝑠 curve. These effects are shown in Figure 3.34.
Figure 3.32: Diagrams of dynamic macroeconomic model
With a leftward shift of π‘Œπ‘Œ 𝑑𝑑 and a rightward shift of π‘Œπ‘Œ 𝑠𝑠 , the equilibrium real interest rate
declines unambiguously. The impact on the other variables π‘Œπ‘Œ, 𝑁𝑁, and 𝑀𝑀 is ambiguous but
this analysis helps to explain the observation of persistently low real interest rates.
121
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Figure 3.33: Effects of productivity growth slowdown
122
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
Chapter 4: Fiscal policy and credit-market
imperfections
In Chapter 3 we studied the determinants of consumption and investment expenditure,
covering the two components of private expenditure in a closed economy. The main
omission was fiscal policy, the subject of the first part of this chapter. We will explore to
what extent public expenditure and changes in taxes can be used by governments as a tool
of demand management.
Another limitation of what we covered so far in Chapter 3 is that consumption and
investment are analysed in an economy with perfect financial markets. There were no limits
on access to credit other than the ability to repay and everyone faced the same interest
rate. The second part of this chapter introduces what are known as ‘credit-market
imperfections’ and explores how these change our understanding of consumption and
investment. We will see that there are important limitations on the power of fiscal policy in
an economy with perfect financial markets but that fiscal policy becomes both more
powerful and more useful when there are credit-market imperfections.
Essential reading
•
Williamson, Chapter 10.
4.1 Taxes and the government’s budget constraint
How do the government’s tax policies affect aggregate expenditure? In particular, can tax
cuts be used as a tool of demand management to stimulate demand when needed? This is
the first aspect of fiscal policy we study, taking as given for now the government’s own
direct public expenditure.
A key point in the analysis of taxation is that changes in taxes – if not matched by changes in
government expenditure – affect the government’s budget deficit and, in turn, the level of
government debt. The need to repay government debt in the future if a default is to be
avoided has implications for the level of future taxes. These considerations lead us to the
idea that the government itself has a budget constraint.
We will begin by analysing taxes and government debt using the two-period model from
Section 3.3 that was central to our study of households’ consumption decisions. That model
has a current period and a future period.
Government expenditure, or public expenditure, refers to purchases of goods and services
by the government, irrespective of whether these are produced by private firms or are the
result of the government’s own production. This does not include transfer payments, for
example, those associated with the welfare state. Government expenditure in the current
123
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
period is 𝐺𝐺 and 𝐺𝐺′ in the future period. These variables are in real terms, as are all variables
in this chapter.
Suppose the government collects tax revenue 𝑇𝑇 in the current period and expects to collect
𝑇𝑇 ′ in the future. Any transfer payments where the government makes payments to others
rather than receiving tax revenue are counted as negative values of tax revenue.
The government does not have to match current expenditure 𝐺𝐺 and taxes 𝑇𝑇: it can run a
budget deficit or surplus. The government’s budget deficit is defined as 𝐡𝐡 = 𝐺𝐺 − 𝑇𝑇, with a
negative value of 𝐡𝐡 denoting a surplus. If the government runs a deficit it borrows by issuing
bonds with real interest rate π‘Ÿπ‘Ÿ. We assume the government begins from a blank slate at the
beginning of the current period with no initial debt outstanding. This means the total stock
of government debt is 𝐡𝐡 and an amount (1 + π‘Ÿπ‘Ÿ)𝐡𝐡 including interest falls due for repayment
in the future.
In the future period, the government must repay (1 + π‘Ÿπ‘Ÿ)𝐡𝐡 otherwise it defaults on its debt.
The analysis here assumes the government debt is in the form of inflation-indexed bonds
paying a known real interest rate π‘Ÿπ‘Ÿ. Later in 0 where money is studied, we will see how
governments can print money and how a ‘soft’ default can occur through inflation. Here, we
can still make our equations consistent with the possibility of a ‘hard’ default by considering
a default as a special form of wealth tax applying to holdings of government bonds.
The budget constraint on the government in the future period is therefore that the budget
surplus 𝑇𝑇 ′ − 𝐺𝐺′ is sufficient to cover the repayment (1 + π‘Ÿπ‘Ÿ)𝐡𝐡, because in a two-period
model there is no possibility of rolling over the debt any longer. In accounting for the
government’s interest payments π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ, it is possible to distinguish between the primary
budget surplus 𝑇𝑇 ′ − 𝐺𝐺′ that excludes the interest cost, and the budget surplus 𝑇𝑇 − 𝐺𝐺 ′ − π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ
that deducts the interest cost.
Dividing both sides of the budget constraint 𝑇𝑇′ − 𝐺𝐺′ = (1 + π‘Ÿπ‘Ÿ)𝐡𝐡 by 1 + π‘Ÿπ‘Ÿ implies a
constraint on the amount of bonds the government can issue in the current period:
𝐡𝐡 =
𝑇𝑇 ′ − 𝐺𝐺 ′
1 + π‘Ÿπ‘Ÿ
Substituting the budget deficit definition 𝐡𝐡 = 𝐺𝐺 − 𝑇𝑇 yields a present-value constraint:
𝑇𝑇 +
𝑇𝑇′
𝐺𝐺 ′
= 𝐺𝐺 +
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
This equation states that the government must raise tax revenue across the two periods of a
present value sufficient to cover the present value of all government expenditure across the
two periods.
We now want to consider how taxes and government debt affects private expenditure in
the economy. We start by supposing all taxes are lump-sum taxes levied on households. A
lump-sum tax is one where the amount of tax paid by a household does not depend on the
household’s behaviour. Most taxes do not have this form, for example, income taxes or
sales taxes. We consider later taxes where revenue raised depends on choices.
124
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
We also make an assumption that the government is not redistributing when levying taxes
on households. This means that everyone shares the tax burden equally. Assume there are
𝑁𝑁 households in the economy and these households remain unchanged over the two
periods (later we will consider what happens if new generations are born). If 𝑑𝑑 and 𝑑𝑑′ are the
lump-sum amounts of tax collected from a household in the current and future periods then
an equal distribution of taxes means 𝑑𝑑 = 𝑇𝑇/𝑁𝑁 and 𝑑𝑑 ′ = 𝑇𝑇 ′ /𝑁𝑁.
What are the implications of the government’s tax policy for household budget constraints
under the assumptions made so far? We have seen that a household’s present value of
consumption must be equal to its lifetime wealth β„Ž or present value of income after tax:
β„Ž = 𝑦𝑦 − 𝑑𝑑 +
𝑦𝑦′
𝑑𝑑 ′
𝑦𝑦 ′ − 𝑑𝑑 ′
= �𝑦𝑦 +
οΏ½ − �𝑑𝑑 +
οΏ½
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
The second expression for β„Ž above breaks down lifetime wealth into the present value of
pre-tax income and the present value of the taxes a household will face. Using government
budget constraint and the equal distribution of taxes 𝑑𝑑 = 𝑇𝑇/𝑁𝑁 and 𝑑𝑑 ′ = 𝑇𝑇 ′ /𝑁𝑁:
𝑑𝑑 +
1
𝑇𝑇 ′
1
𝐺𝐺 ′
𝑑𝑑′
= �𝑇𝑇 +
� = �𝐺𝐺 +
οΏ½
1 + π‘Ÿπ‘Ÿ
𝑁𝑁
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ 𝑁𝑁
It follows that lifetime wealth can be expressed as the present value of pre-tax income net
of a 1/𝑁𝑁th share of the present value of all government expenditure.
𝐺𝐺 ′
1
𝐺𝐺
𝐺𝐺
𝑦𝑦′
𝑁𝑁
οΏ½ − �𝐺𝐺 +
οΏ½ = 𝑦𝑦 − +
β„Ž = �𝑦𝑦 +
1 + π‘Ÿπ‘Ÿ
𝑁𝑁
𝑁𝑁
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
′
𝑦𝑦 ′ −
Note that the exact values of taxes 𝑑𝑑 and 𝑑𝑑′ drop out of the equation. It is therefore
sufficient to know the government’s plans for public expenditure because these determine
the present value of taxes given the government’s budget constraint. The exact combination
of 𝑑𝑑 and 𝑑𝑑′ is not relevant in calculating β„Ž.
Since the position of a household’s lifetime budget constraint can be determined by
knowing β„Ž and π‘Ÿπ‘Ÿ, a corollary is that the position of the budget constraint can known simply
with reference to the present value of pre-tax income and the government’s expenditure
plans. Another way to deduce this is to note that the household’s budget constraint must
always pass through the point (𝑦𝑦 − 𝐺𝐺 ⁄𝑁𝑁 , 𝑦𝑦 ′ − 𝐺𝐺 ′ ⁄𝑁𝑁). Figure 4.1 illustrates the combined
household and government budget constraint.
125
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
Figure 4.1: The combined household and government budget constraint
4.2 Ricardian equivalence
This section outlines a famous argument claiming the government’s tax policy has no power
to affect aggregate demand. The argument is known as ‘Ricardian equivalence’ and, if valid,
it says that if the government increases households’ current disposable income by cutting
taxes, households will not spend the extra income. The reason is that a tax cut is not a free
lunch. It increases the budget deficit and raises government debt, which leads to higher
future taxes. The government’s present-value budget constraint is central to this claim.
The government budget constraint implies a tax cut does not reduce the present value of all
taxes unless government expenditure is also reduced. Given 𝐺𝐺 and 𝐺𝐺′, reducing 𝑑𝑑 raises 𝑑𝑑′ by
an equal amount in present value. This means that changes in 𝑑𝑑 and 𝑑𝑑 ′ /(1 + π‘Ÿπ‘Ÿ) cancel out
when calculating the impact on the present value of taxes and, consequently, households’
lifetime wealth β„Ž is not affected by the tax cut.
In the consumption choice diagram in Figure 4.2, the endowment point on the household
budget constraint moves from 𝐸𝐸1 to 𝐸𝐸2 along an unchanged household budget constraint.
We have seen that the position of this budget constraint depends on pre-tax income and
the government’s spending plans but it does not depend on the exact timing of taxes. If
households can choose any point on the standard lifetime budget constraint then there is
no change to the optimal consumption plan. The tax cut does not increase consumption
demand but instead increases the desire to save. Intuitively, the tax cut now means higher
taxes in the future and households must save more now to sustain their optimal choice of
consumption in the future.
126
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
Figure 4.2: Tax cut financed by higher budget deficit
We have shown that the tax cut does not affect households’ lifetime wealth β„Ž. But might it
affect the real interest rate π‘Ÿπ‘Ÿ, which determines the gradient of the budget constraint? In
the basic consumption choice model, the real interest rate is exogenous. But we have seen
in Chapter 3 how equilibrium interest rates can be found given exogenous incomes and
optimal consumption choices. Would that analysis of interest rates predict that issuing more
government bonds causes the bond price to fall and the interest rate to rise?
First note that the extra supply of bonds is equal to the size of tax cut. Our analysis above
shows that at any arbitrary real interest rate π‘Ÿπ‘Ÿ, the optimal consumption plan (𝑐𝑐 ∗ , 𝑐𝑐 ′∗ ) does
not change following the tax cut, so private saving 𝑠𝑠 = 𝑦𝑦 − 𝑑𝑑 − 𝑐𝑐 also increases by an
amount equal to the tax cut. All else being equal, greater private saving means a larger
demand for bonds. With the supply and demand for bonds increasing by the same amount,
no change in π‘Ÿπ‘Ÿ is required for the extra issuance of bonds to be absorbed by the market.
An equivalent argument points out that with no change in 𝑐𝑐 for an arbitrary π‘Ÿπ‘Ÿ following the
tax cut and no change in 𝑦𝑦 or 𝐺𝐺, goods-market equilibrium 𝑐𝑐𝑐𝑐 + 𝐺𝐺 = 𝑦𝑦𝑦𝑦 still prevails at the
original interest rate π‘Ÿπ‘Ÿ. Therefore, neither the position nor the gradient of households’
budget constraint changes after the tax cut.
Although the Ricardian equivalence argument is a useful benchmark for the analysis of fiscal
policy, the argument requires strong assumptions. These include the lump-sum nature of
taxes. When taxes are not lump sum, they can have incentive effects, even if the timing of
taxes does not change the present value of all tax revenue that needs to be collected. Lumpsum taxes matter only through an income effect but this channel is neutralised by the
government’s budget constraint. Other types of taxes also have substitution effects.
Another assumption is that the tax burden is shared equally among households, so taxes do
not redistribute between different households. We will see examples of how taxes with
distributional effects that affect households’ consumption choices. Different from the
analysis above, there are winners and losers whose consumption adjusts. Depending on the
exact behaviour of these groups, aggregate consumption can depend on tax policy. One
127
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
particularly relevant case is where intergenerational distribution occurs. The current
generation benefits from a tax cut, but the higher taxes to repay the extra government debt
will be borne by future generations.
Finally, it is crucial to the Ricardian equivalence argument that there are no credit-market
imperfections. We will see examples where Ricardian equivalence breaks down when there
are limits on households’ borrowing or where borrowers face different interest rates from
savers.
Box 4.1: Taxes on consumption
Taxes are usually not lump-sum amounts. What difference does this make to our
analysis of fiscal policy? We will consider an example where there are proportional
taxes on consumption expenditure, for example, a sales tax or a value-added tax
(VAT).
Assume that households buying consumption goods 𝑐𝑐 in the current period face a
tax rate 𝜏𝜏, which means this consumption costs (1 + 𝜏𝜏)𝑐𝑐 inclusive of tax. Similarly,
in the future period, the cost of consumption 𝑐𝑐′ is (1 + 𝜏𝜏 ′ )𝑐𝑐′, where 𝜏𝜏 ′ is the future
tax rate.
To simplify matters in this example, we assume a representative household. This
means that all 𝑁𝑁 households have same incomes 𝑦𝑦 and 𝑦𝑦 ′ and same preferences.
Consequently, all households choose the same consumption plan (𝑐𝑐, 𝑐𝑐 ′ ) and the
government raises tax revenues 𝑇𝑇 = 𝜏𝜏𝜏𝜏𝜏𝜏 and 𝑇𝑇 ′ = 𝜏𝜏 ′ 𝑐𝑐 ′ 𝑁𝑁 in total.
In this setting, the lifetime budget constraint of a representative household is:
(1 + 𝜏𝜏)𝑐𝑐 +
(1 + 𝜏𝜏 ′ )𝑐𝑐 ′
𝑦𝑦′
= 𝑦𝑦 +
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
Instead of subtracting lump-sum amounts of tax revenue from the right-hand side,
tax rates multiplying consumption choices are added to the left-hand side. Observe
that the horizontal intercept of this budget constraint is 𝑐𝑐 =
(𝑦𝑦 + 𝑦𝑦 ′ /(1 + π‘Ÿπ‘Ÿ))⁄(1 + 𝜏𝜏) and the vertical intercept is 𝑐𝑐 ′ =
((1 + π‘Ÿπ‘Ÿ)𝑦𝑦 + 𝑦𝑦 ′ )⁄(1 + 𝜏𝜏 ′ ). The gradient of the budget constraint is −(1 + π‘Ÿπ‘Ÿ)(1 +
𝜏𝜏)/(1 + 𝜏𝜏 ′ ). Notice that the tax rates 𝜏𝜏 and 𝜏𝜏 ′ affect the gradient. This is because
they can have implications for the relative cost of consumption goods at different
dates.
The government’s present-value budget constraint with public expenditure (𝐺𝐺, 𝐺𝐺 ′ )
is 𝑇𝑇 + 𝑇𝑇′⁄(1 + π‘Ÿπ‘Ÿ) = 𝐺𝐺 + 𝐺𝐺′⁄(1 + π‘Ÿπ‘Ÿ), or in terms of tax rates:
𝜏𝜏 ′ 𝑐𝑐 ′
1
𝐺𝐺 ′
𝜏𝜏𝜏𝜏 +
= �𝐺𝐺 +
οΏ½
1 + π‘Ÿπ‘Ÿ 𝑁𝑁
1 + π‘Ÿπ‘Ÿ
128
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
How does this type of tax system affect the Ricardian equivalence argument seen
earlier? Consider a deficit-financed cut of current tax rate 𝜏𝜏 and hold constant the
public expenditure 𝐺𝐺 and 𝐺𝐺 ′ planned in the current and future time periods. Just as
before, the government budget constraint implies the future tax rate 𝜏𝜏 ′ must
adjust.
Reducing the current tax rate 𝜏𝜏 implies the household budget constraint’s
horizontal intercept (𝑦𝑦 + 𝑦𝑦 ′ /(1 + π‘Ÿπ‘Ÿ))/(1 + 𝜏𝜏) rises and its gradient −(1 + π‘Ÿπ‘Ÿ)(1 +
𝜏𝜏)/(1 + 𝜏𝜏 ′ ) is lower. This is illustrated in Figure 4.2. The reduction in the gradient is
similar to what happens when the interest rate π‘Ÿπ‘Ÿ declines and, in both cases,
current consumption becomes relatively cheaper compared to future consumption.
Figure 4.3: Failure of Ricardian equivalence
An immediate consequence of this is because the gradient of household budget
constraint has changed, it cannot be tangent to an indifference curve at the original
consumption plan. The household’s choice of consumption plan is affected by the
tax cut and so Ricardian equivalence fails. Intuitively, the tax cut has a substitution
effect working through the change in the budget constraint’s gradient that is absent
in our earlier analysis of lump-sum taxes.
Although Ricardian equivalence does not hold, accounting for the government
budget constraint is still important and has some interesting implications. Moving
the tax terms to the right-hand side, the representative household’s budget
constraint is:
𝑐𝑐 +
𝑐𝑐 ′
𝑦𝑦 ′
𝜏𝜏 ′ 𝑐𝑐 ′
= 𝑦𝑦 +
− �𝜏𝜏𝜏𝜏 +
οΏ½
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
129
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
Notice that the representative household’s choice of consumption plan (𝑐𝑐, 𝑐𝑐 ′ ) also
appears in the government budget constraint because it affects how much tax
revenue is collected from given tax rates. Since all households make the same
consumption choices here, the household and government budget constraints can
be combined:
𝐺𝐺 ′
′
𝑦𝑦
−
𝐺𝐺
𝑐𝑐
𝑁𝑁
𝑐𝑐 +
= 𝑦𝑦 − +
𝑁𝑁
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
′
The tax rates 𝜏𝜏 and 𝜏𝜏 ′ have been eliminated and the equation has exactly the same
form as what was obtained with lump-sum taxes. Graphically, the equation is a
straight line with gradient −(1 + π‘Ÿπ‘Ÿ) passing through the point (𝑦𝑦 − 𝐺𝐺/𝑁𝑁, 𝑦𝑦 ′ −
𝐺𝐺 ′ /𝑁𝑁). This is not the budget constraint faced by any individual household acting
independently of others but implies that after tax rates have been adjusted to
satisfy the government’s budget constraint, the consumption plan chosen by
households must lie on the same line as it must with a lump-sum tax system.
Considering the deficit-financed cut of current tax rate 𝜏𝜏, the budget constraint
faced by individual households becomes flatter. After taking account of the
adjustment of households’ choices and the future tax rate 𝜏𝜏 ′ needed to satisfy the
government’s budget constraint, the consumption plan must lie on combined
budget constraint and, without any changes to public expenditure 𝐺𝐺 and 𝐺𝐺 ′ , this
line does not shift. Hence, overall, the new consumption plan must be tangent to a
flatter individual budget constraint while also lying on the same downward-sloping
combined household-and-government budget constraint. Therefore, the new
consumption plan must feature higher 𝑐𝑐 and lower 𝑐𝑐 ′ as shown in Figure 4.4.
These unambiguous effects can be thought of as the substitution effect of lower
relative price of 𝑐𝑐 compared to 𝑐𝑐 ′ brought about by the tax rate changes, analogous
to the substitution effect of a lower interest rate. However, the income effects of
the tax changes are weakened by accounting for the government budget
constraint, as is the case with lump-sum taxes.
Although changing tax rates is seen to have an impact on consumption, the analysis
so far does not provide a reason for governments to vary tax rates over time. To
understand why, note that if 𝜏𝜏 = 𝜏𝜏 ′ , i.e. tax rates remain the same, the household
budget constraint gradient becomes −(1 + π‘Ÿπ‘Ÿ). This allows households to reach the
highest indifference curve conditional on the government satisfying its budget
constraint, because the combined household-and-government budget constraint
has gradient −(1 + π‘Ÿπ‘Ÿ). Any choice of 𝜏𝜏 ≠ 𝜏𝜏 ′ would mean the representative
household ends up on a lower indifference curve.
130
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
Figure 4.4: Effects of timing of taxes
Intuitively, having 𝜏𝜏 = 𝜏𝜏 ′ avoids the government distorting relative prices of
consumption at different points in time. This is an argument for what is called ‘tax
smoothing’: minimising changes in tax rates. Note that the case for tax smoothing
applies more broadly than just this example and holds even when the tax system
cannot avoid all distortions. But an important caveat is that credit-market
imperfections, another reason for the failure of Ricardian equivalence, will give rise
to an argument for activist fiscal policy.
Box 4.2: The effects of a fiscal stimulus
If Ricardian equivalence holds, we have seen that governments cannot use changes
in (lump-sum) taxes to influence private expenditure. But can a government directly
boost real GDP by increasing public expenditure 𝐺𝐺? If so, by how much?
We will consider here a temporary fiscal stimulus, i.e. higher 𝐺𝐺 with no change in 𝐺𝐺 ′
planned. It is important to note that the analysis of a permanent change in 𝐺𝐺 is not
the same. The direct effect of higher government expenditure 𝐺𝐺 is to shift the π‘Œπ‘Œ 𝑑𝑑
curve to the right because 𝐺𝐺 is one of the components of aggregate demand. But it
also raises the present value of taxes 𝑇𝑇 + 𝑇𝑇′⁄(1 + π‘Ÿπ‘Ÿ) owing to the government’s
budget constraint.
The increase in the present value of taxes is the same amount as the increase in 𝐺𝐺
no matter whether 𝑇𝑇 or 𝑇𝑇′ rises. Consequently, we do not need to specify exactly
how the government pays for higher 𝐺𝐺. This is Ricardian equivalence in this context
– the timing of taxes does not matter – but it does not mean that 𝐺𝐺 has no effects,
only that any effect is the same whether current taxes or the budget deficit rise.
131
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
The higher burden of taxes after the increase in 𝐺𝐺 has a negative wealth effect on
households because they are less able to afford a given amount of private
consumption and leisure. This leads them to reduce their demand for consumption
𝐢𝐢 𝑑𝑑 as it is a normal good, which has the effect of shifting the π‘Œπ‘Œ 𝑑𝑑 curve to the left.
However, the desire for consumption smoothing means that 𝐢𝐢 𝑑𝑑 falls by less than
the tax burden and 𝐺𝐺 rise. Overall, the output demand curve π‘Œπ‘Œ 𝑑𝑑 shifts to the right.
A way of avoiding some of the reduction in consumption is for households to work
more and earn more income. This is what the wealth effect of taxes on labour
supply represents – corresponding to a lower demand for leisure as a normal good.
Note that we are assuming taxes have a lump-sum form, so there are no
disincentive effects here. The labour supply curve 𝑁𝑁 𝑠𝑠 thus shifts to the right as
shown in Figure 4.5. Labour-market equilibrium now occurs at a higher level of
employment and the movement along the production function implies that firms
increase output of goods at the same real interest rate, shifting the output supply
curve π‘Œπ‘Œ 𝑠𝑠 to the right. Since future consumption and leisure are normal goods,
households do not want to raise π‘Œπ‘Œ 𝑠𝑠 by more than increase in tax burden (equal to
the increase in 𝐺𝐺) minus the reduction in consumption expenditure 𝐢𝐢 𝑑𝑑 .
Figure 4.5: Wealth effect on labour supply and output supply
With both the π‘Œπ‘Œ 𝑑𝑑 and π‘Œπ‘Œ 𝑠𝑠 curves shifting rightwards, GDP π‘Œπ‘Œ rises unambiguously as
shown in Figure 4.6. The overall shift of π‘Œπ‘Œ 𝑑𝑑 is larger than the shift of π‘Œπ‘Œ 𝑠𝑠 because
smoothing of consumption and leisure makes the combined wealth effect on 𝐢𝐢 𝑑𝑑
and 𝑁𝑁 𝑠𝑠 (and hence, π‘Œπ‘Œ 𝑠𝑠 ) smaller than direct effect of 𝐺𝐺 on π‘Œπ‘Œ 𝑑𝑑 . A larger shift of π‘Œπ‘Œ 𝑑𝑑
than π‘Œπ‘Œ 𝑠𝑠 implies the real interest rate π‘Ÿπ‘Ÿ rises unambiguously.
There are two shifts of 𝑁𝑁 𝑠𝑠 to right in the labour market diagram from Figure 4.6.
First, the wealth effect of the higher tax burden leading to a shift from 𝑁𝑁1𝑠𝑠 (π‘Ÿπ‘Ÿ1 ) to
𝑁𝑁2𝑠𝑠 (π‘Ÿπ‘Ÿ1 ). Second, the saving incentive effect of the higher real interest rate π‘Ÿπ‘Ÿ
producing the shift from 𝑁𝑁2𝑠𝑠 (π‘Ÿπ‘Ÿ1 ) to 𝑁𝑁2𝑠𝑠 (π‘Ÿπ‘Ÿ2 ). Hence, we can conclude unambiguously
that 𝑁𝑁 rises and 𝑀𝑀 falls. Note that there is no effect on 𝑁𝑁 𝑑𝑑 – the increase in 𝐺𝐺 does
not boost productivity here, so our analysis might reach different conclusions if we
considered expenditure on infrastructure that increased total factor productivity
and hence, labour demand.
132
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
Although we conclude that GDP π‘Œπ‘Œ rises overall, this analysis shows it increases by
less than 𝐺𝐺 does. This is due to π‘Œπ‘Œ = 𝐢𝐢 + 𝐼𝐼 + 𝐺𝐺 and both 𝐢𝐢 𝑑𝑑 and 𝐼𝐼 𝑑𝑑 falling.
Consumption falls because of the negative wealth effect and the negative
substitution effect of a higher real interest rate π‘Ÿπ‘Ÿ. Investment falls because the
higher real interest rate raises the borrowing cost or opportunity cost of
investment. These displacement effect on private expenditure of higher public
expenditure 𝐺𝐺 are known as ‘crowding out’ effects.
Figure 4.6: Effects of temporary fiscal stimulus
In summary, our model predicts a temporary fiscal stimulus would boost real GDP
but with no ‘multiplier’ effect whereby π‘Œπ‘Œ rises more than 𝐺𝐺. What is missing? First,
we could imagine that some types of public expenditure are complementary with
production or consumption. Infrastructure expenditure might raise the return to
capital and boost investment by firms. It might also cause households to spend
more on certain types of goods (for example, cars or air travel) that are
complementary to the public expenditure (although it is also possible to envisage
goods and services that are substitutes for public expenditure).
More importantly, our analysis is missing the idea that deficit-financed public
expenditure raises households’ current disposable income through higher GDP. The
argument is that this encourages some households to spend more on consumption
because previously they were unable to pay for this consumption by borrowing
against future income. This supposes that there are credit-market imperfections of
the type we will study in the remainder of the chapter. We return to the analysis of
a fiscal stimulus with credit-market imperfections when studying macroeconomic
policy in Chapter 9.
133
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
4.3 Credit-market imperfections
The two-period consumption model has households’ consumption plans limited only by a
single lifetime budget constraint:
𝑐𝑐 ′
𝑦𝑦 ′ − 𝑑𝑑′
𝑐𝑐 +
= β„Ž = 𝑦𝑦 − 𝑑𝑑 +
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
This assumes households can borrow as much as they like at the same interest rate that
savers receive. This is subject only to it being possible to repay debts with future income. In
reality, however, households are subject to much tighter constraints on borrowing and face
higher interest rates when they do borrow. These additional restrictions arise from what are
known as ‘credit-market imperfections’.
We will start by considering a simple borrowing constraint. Households cannot borrow more
than some amount 𝐿𝐿. Example of borrowing constraints could be a credit limit on a credit
card or an overdraft limit. To begin with, we treat the maximum loan size 𝐿𝐿 as exogenous.
Also for now, assume that borrowers face the same interest rate π‘Ÿπ‘Ÿ as savers receive.
We now add the borrowing constraint to the two-period consumption choice model. With
no initial assets, borrowing means a negative level of saving 𝑠𝑠 = 𝑦𝑦 − 𝑑𝑑 − 𝑐𝑐. The borrowing
constraint is thus:
−𝑠𝑠 ≤ 𝐿𝐿
Using the definition of saving 𝑠𝑠, this constraint is equivalent to 𝑐𝑐 ≤ 𝑦𝑦 − 𝑑𝑑 + 𝐿𝐿, which places
an upper limit on current consumption 𝑐𝑐. This additional constraint truncates the lifetime
budget constraint to the right of 𝑐𝑐 = 𝑦𝑦 − 𝑑𝑑 + 𝐿𝐿 as shown in Figure 4.7. Depending on
income and preferences, the constraint may or may not be binding.
Figure 4.7: The borrowing constraint may or may not be binding
The borrowing constraint is not binding when the tangency point between an indifference
curve and the full lifetime budget constraint without the borrowing constraint is to the left
134
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
of where the borrowing constraint is relevant, i.e. where the household wants to borrow
less than the borrowing constraint, or wants to save. The borrowing constraint is binding
when this tangency point lies to the right of where the budget constraint is truncated and,
hence, the choice of that level of borrowing is not feasible. In this case, the next best
consumption plan is the one at the ‘kink’ of the budget constraint where it is truncated by
the borrowing constraint. The household chooses to borrow the maximum amount allowed
by the constraint. In what follows, we shall assume the borrowing constraint is binding,
otherwise it is irrelevant and our analysis of consumption proceeds as it did in Chapter 3.
Binding borrowing constraints cause households’ consumption behaviour to behave in a
very different way from what we saw in Chapter 3 and Section 4.2. First, there is no
consumption smoothing following a temporary increase or decrease in disposable income.
This is illustrated in Figure 4.8. A decline in current income shifts the endowment point
horizontally to the left from 𝐸𝐸1 to 𝐸𝐸2 . Since the budget constraint is truncated at 𝑐𝑐 = 𝑦𝑦 −
𝑑𝑑 + 𝐿𝐿 because of the limit on borrowing, this truncation point also moves to the left by the
same amount.
Figure 4.8: No consumption smoothing
Starting from the borrowing constraint being binding, the household chooses the
consumption plan at the kink of the new budget constraint, which means that current
consumption falls by as much as current income. If current income were to increase, current
consumption would rise by the same amount if the borrowing constraint remains binding
(which it would as long as current income does not rise too much). Therefore, changes in
current disposable income have a one-for-one effect on current consumption if the
borrowing constraint is binding. In contrast, without a binding borrowing constraint,
households would choose to smooth consumption in response to temporary income shocks,
with 𝑐𝑐 moving less than one-for-one with a change in 𝑦𝑦.
A second difference compared to the earlier model of consumption is the failure of
Ricardian equivalence. A deficit-financed tax cut now increases current consumption
spending for those with a binding borrowing constraint as shown in Figure 4.9. The
135
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
endowment point 𝐸𝐸 moves along the downward-sloping budget constraint line because of
the government’s own present-value budget constraint for the same reason explained in
Section 4.2. However, the limit on borrowing truncates the budget constraint horizontally at
disposable income plus maximum debt, so a tax cut that directly affects disposable income
moves the truncation point. Lower taxes today raise current disposable income, moving the
truncation point to the right along the lifetime budget constraint.
A household with a binding borrowing constraint would like to spend more today by
borrowing. If such a household receives more disposable income then they will spend all or
some of it on more current consumption, so Ricardian equivalence fails. If the borrowing
constraint remains binding after the tax cut, the household will spend all the extra
disposable income immediately.
Figure 4.9: Failure of Ricardian equivalence
Box 4.3: Bequests and intergenerational redistribution
The argument for Ricardian equivalence is that tax cuts are saved by households
because they know they will face higher taxes in future. But what if those higher
taxes are paid by future generations? Should the current generation benefiting
from a tax cut spend it and leave future generations to deal with the extra
government debt?
One reason to think this would not happen is that we observe people leaving
bequests – transfers of wealth to the next generation – which suggests there is
altruism across the generations. Hence, bequests might play the role of saving in
the standard Ricardian equivalence argument, with tax cuts being saved by the
current generation to leave larger bequests.
We can study this point further by reinterpreting the two-period consumption
choice model to consider bequests and intergenerational altruism. For simplicity,
suppose there are only two generations, a current generation of parents and a
136
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
future generation comprised of their children. Each generation consumes in just
one period. Hence, 𝑐𝑐 denotes consumption of the current generation and 𝑦𝑦 − 𝑑𝑑
their income after tax, and 𝑐𝑐 ′ denotes consumption of the future generation and
𝑦𝑦 ′ − 𝑑𝑑 ′ their income after tax. Saving 𝑠𝑠 = 𝑦𝑦 − 𝑑𝑑 − 𝑐𝑐 now represents bequests, the
wealth of the current generation that is passed on to the future generation.
The current generation is altruistic in that they care about the next generation’s 𝑐𝑐′
as well as their own 𝑐𝑐 to some extent and indifference curves can be used to show
the current generation’s preferences over (𝑐𝑐, 𝑐𝑐 ′ ) as in the standard version of the
consumption choice model. The budget constraint 𝑐𝑐 ′ = 𝑦𝑦 ′ − 𝑑𝑑 ′ + (1 + π‘Ÿπ‘Ÿ)𝑠𝑠 of the
future generation and 𝑠𝑠 = 𝑦𝑦 − 𝑑𝑑 − 𝑐𝑐 imply a present-value budget constraint on
the consumption levels of the two generations that is mathematically identical to
the standard lifetime budget constraint:
𝑐𝑐 +
𝑐𝑐 ′
𝑦𝑦 ′ − 𝑑𝑑 ′
= 𝑦𝑦 − 𝑑𝑑 +
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
The current generation chooses their own consumption 𝑐𝑐, which determines
bequests 𝑠𝑠, to reach their highest indifference curve subject to this present-value
budget constraint.
As in Section 4.2, the present-value budget constraint is not affected by the timing
of taxes after accounting for government’s own budget constraint. This means that
a deficit-financed tax cut (lower 𝑑𝑑, higher 𝑑𝑑′) does not affect the position of the
constraint. The current generation’s choice of consumption 𝑐𝑐 therefore remains
unchanged as shown in Figure 4.10. Since the endowment point moves to the right,
the implied bequest 𝑠𝑠 rises.
Figure 4.10: Ricardian equivalence with bequests
The present-value budget constraint does not rule out the current generation
choosing a level of its own consumption with 𝑐𝑐 > 𝑦𝑦 − 𝑑𝑑. But having 𝑐𝑐 > 𝑦𝑦 − 𝑑𝑑
means that 𝑠𝑠 < 0, a negative bequest. However, it is usually not possible
137
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
to bequeath debt. This leads us to impose a non-negative bequest constraint 𝑠𝑠 ≥ 0,
which is equivalent to 𝑐𝑐 ≤ 𝑦𝑦 − 𝑑𝑑. Mathematically and geometrically, this has exactly
the same form as a borrowing constraint with the effect of truncating the presentvalue budget constraint at 𝑐𝑐 = 𝑦𝑦 − 𝑑𝑑.
As with a borrowing constraint, the non-negative bequest constraint may or may
not bind. All else being equal, it is more likely to bind if the current generation’s
altruism is weak, which corresponds to the current generation having steep
indifference curves where 𝑐𝑐′ is relatively unimportant compared to 𝑐𝑐. When the
constraint binds, bequests are exactly zero. A binding constraint causes Ricardian
equivalence to fail in essentially the same way as a binding borrowing constraint did
in Section 4.3. This case is illustrated in Figure 4.11.
The logic so far suggests that Ricardian equivalence should still hold when the
government practises intergenerational redistribution by cutting taxes and leaving
debt to future generations as long as people are observed to make positive
bequests. But there may be other motives for bequests beyond altruism. The
current generation might use bequests strategically to influence their children’s
behaviour with the threat of removal of a bequest – zero being the smallest
amount – providing incentives.
The key point here is that a larger planned bequest 𝑠𝑠 gives parents greater control
over their children. As an example, suppose we start from sufficiently large 𝑠𝑠 where
parents fully achieve their goal of influencing children’s behaviour, albeit at the cost
of making a bequest that is too large from a purely altruistic perspective. This case
is depicted in Figure 4.12. The indifference curves take account only of 𝑐𝑐 and
altruism over 𝑐𝑐′, not parents’ desire to influence their children’s behaviour. Hence,
the initial point does not feature a point of tangency between the indifference
curves and the present-value budget constraint because parents are choosing 𝑐𝑐 too
low and 𝑠𝑠 too high from a purely altruistic perspective.
Figure 4.11: Failure of Ricardian equivalence when bequests are zero
138
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
Following a deficit-financed tax cut, the endowment point moves along the budget
constraint as usual. If the current generation maintained the same 𝑐𝑐 and increased
bequests 𝑠𝑠 then they would have even greater control over their children but we
started from a point where parents were already achieving their goals in that
respect. Hence, parents would respond by raising 𝑐𝑐 and leaving 𝑠𝑠 unchanged,
resulting in a failure of Ricardian equivalence even though bequests are positive.
Figure 4.12: Strategic bequests and a failure of Ricardian equivalence
4.4 Interest-rate spreads
Another type of credit-market imperfection is that borrowers face higher interest rates than
savers even when they have the ability and intention to repay a loan. Unlike earlier, there
are no additional restrictions on the quantity of borrowing here. Suppose savers receive an
interest rate π‘Ÿπ‘Ÿ, while borrowers face an interest rate π‘Ÿπ‘Ÿπ‘™π‘™ on loans with π‘Ÿπ‘Ÿπ‘™π‘™ > π‘Ÿπ‘Ÿ. The difference
between two interest rates is known as an interest rate spread.
Conditional on a non-negative amount of saving 𝑠𝑠 ≥ 0, the lifetime budget constraint can be
derived in exactly the same way as before in terms of savers’ interest rate π‘Ÿπ‘Ÿ:
𝑐𝑐 +
𝑐𝑐 ′
𝑦𝑦 ′ − 𝑑𝑑′
= β„Ž = 𝑦𝑦 − 𝑑𝑑 +
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
If the household is a borrower, so saving is negative, 𝑠𝑠 < 0, the lifetime budget constraint
can be derived in usual way, but with borrowers’ interest rate π‘Ÿπ‘Ÿπ‘™π‘™ replacing π‘Ÿπ‘Ÿ:
𝑐𝑐 ′
𝑦𝑦 ′ − 𝑑𝑑′
𝑐𝑐 +
= β„Žπ‘™π‘™ = 𝑦𝑦 − 𝑑𝑑 +
1 + π‘Ÿπ‘Ÿπ‘™π‘™
1 + π‘Ÿπ‘Ÿπ‘™π‘™
These two versions of the lifetime budget constraint are straight lines with gradients −(1 +
π‘Ÿπ‘Ÿ) and −(1 + π‘Ÿπ‘Ÿπ‘™π‘™ ) respectively. However, the first applies only to the left of the endowment
point and the second applies only to the right of 𝐸𝐸. Therefore, the gradient of the lifetime
139
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
budget constraint increases (in absolute value) when a household switches from saving to
borrowing. This is depicted in Figure 4.13, where the lifetime budget constraint has a ‘kink’
at the endowment point.
Figure 4.13: Lifetime budget constraint with different interest rates for borrowers and savers
Compared to the case of no credit-market imperfections, someone who would have chosen
to borrow if π‘Ÿπ‘Ÿπ‘™π‘™ = π‘Ÿπ‘Ÿ now chooses to borrow less or not to borrow at all if π‘Ÿπ‘Ÿπ‘™π‘™ > π‘Ÿπ‘Ÿ. The case
where a household decides not to borrow as a result of π‘Ÿπ‘Ÿπ‘™π‘™ > π‘Ÿπ‘Ÿ is shown in Figure 4.14. If the
interest-rate spread is large enough to dissuade households from borrowing then they
choose a consumption plan at the kink of the lifetime budget constraint. Households in this
position will not smooth consumption after a temporary shock to income. The logic is similar
to the case with a quantitative limit on borrowing. This is easiest to see in the special case of
an extremely high interest-rate spread. The borrowing segment of the budget constraint
becomes so steep it is equivalent to a borrowing limit 𝐿𝐿 of zero.
Figure 4.14: An interest-rate spread causes a household to stop borrowing
140
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
The presence of an interest-rate spread for borrower households also leads to a failure of
Ricardian equivalence. Although households who borrow face an interest rate π‘Ÿπ‘Ÿπ‘™π‘™ > π‘Ÿπ‘Ÿ, the
government is able to borrow at savers’ interest rate π‘Ÿπ‘Ÿ (savers may indeed be receiving π‘Ÿπ‘Ÿ as
a result of holding government bonds). The government present-value budget constraint is:
𝑑𝑑 +
1
𝐺𝐺 ′
𝑑𝑑 ′
= �𝐺𝐺 +
οΏ½
1 + π‘Ÿπ‘Ÿ 𝑁𝑁
1 + π‘Ÿπ‘Ÿ
By the logic of Section 4.1, the lifetime wealth β„Ž of savers who receive interest rate π‘Ÿπ‘Ÿ is
unaffected by the timing of taxes 𝑑𝑑 and 𝑑𝑑′ conditional on the levels of public expenditure 𝐺𝐺
and 𝐺𝐺′. On the other hand, the timing of taxes affects the lifetime wealth β„Žπ‘™π‘™ of borrowers,
defined as the present value of all disposable income discounted at rate π‘Ÿπ‘Ÿπ‘™π‘™ :
𝑦𝑦 ′
1
𝐺𝐺 ′
(π‘Ÿπ‘Ÿπ‘™π‘™ − π‘Ÿπ‘Ÿ)𝑑𝑑 ′
β„Žπ‘™π‘™ = 𝑦𝑦 +
− �𝐺𝐺 +
οΏ½+
1 + π‘Ÿπ‘Ÿπ‘™π‘™ 𝑁𝑁
1 + π‘Ÿπ‘Ÿ
(1 + π‘Ÿπ‘Ÿ)(1 + π‘Ÿπ‘Ÿπ‘™π‘™ )
A deficit-financed tax cut lowers 𝑑𝑑 but raises 𝑑𝑑′, which increases β„Žπ‘™π‘™ when π‘Ÿπ‘Ÿπ‘™π‘™ > π‘Ÿπ‘Ÿ. Intuitively,
private borrowers effectively discount the future at a higher rate π‘Ÿπ‘Ÿπ‘™π‘™ than the interest rate π‘Ÿπ‘Ÿ
at which the government is able to borrow. Hence, shifting taxes away from the present to
the future actually increases the present value of lifetime income for borrower households.
For unchanged 𝐺𝐺 and 𝐺𝐺′, the tax cut shifts the kink point on households’ lifetime budget
constraint to the right as shown in Figure 4.15. This has no effect on the set of feasible
consumption plans for savers but expands the feasible set for borrowers. Consequently,
credit-constrained households are better off and choose to consume more. If a household is
dissuaded from borrowing by π‘Ÿπ‘Ÿπ‘™π‘™ > π‘Ÿπ‘Ÿ then the household spends all of increase in disposable
income on higher current consumption 𝑐𝑐. If the household was still borrowing initially, then
the tax cut leads to an increase in both 𝑐𝑐 and 𝑐𝑐′. Either way, there is a failure of Ricardian
equivalence.
Figure 4.15: Failure of Ricardian equivalence with an interest-rate spread
141
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
4.5 Asymmetric information
While the two credit market imperfections, namely borrowing constraints and interest rate
spreads, are realistic, we would like to understand why they arise. Moreover, what
determines the size of borrowing limits and interest rate spreads, and what might cause
them to change?
In this section and the next, we will look at two theories:
•
•
Asymmetric information, to explain interest rate spreads
Limited commitment, to explain borrowing constraints.
Asymmetric information is where one party to a transaction, a buyer or a seller, is better
informed than the other. We will apply this idea to study the market for loans with
asymmetric information. The asymmetric information is that borrowers know their
probability repayment probability better than lenders.
We assume just two types of borrowers for simplicity. ‘Good types’ are those who always
repay loans and ‘bad types’ are those who always fully default. Borrowers know their type
but lenders cannot directly distinguish between them.
In reality, lenders go to considerable lengths in screening and credit scoring applicants for
loans to resolve this informational asymmetry. However, these efforts are imperfect in that
some bad types will still slip through, in which case, the asymmetric information is the
residual uncertainty that remains about an individual’s type after screening. Moreover,
screening and credit scoring entail costs, and accounting for those costs has similar
implications for interest-rate spreads as the original informational asymmetry itself. For this
reason, we will not account for screening and credit scoring explicitly in our analysis.
Good types make up a fraction 1 − 𝑑𝑑 of all borrowers and bad types are a fraction 𝑑𝑑. The
value of 𝑑𝑑 (0 < 𝑑𝑑 < 1) is common knowledge to everyone including lenders.
While lenders cannot directly observe an individual’s type, bad types do not want to reveal
their type indirectly by asking for a different loan size compared to good types. Suppose
good types want to borrow 𝐿𝐿 when the interest rate on loans is π‘Ÿπ‘Ÿπ‘™π‘™ . Bad types mimic them, so
all borrowers end up requesting the same loan size 𝐿𝐿.
Loans are provided by financial intermediaries such as banks who fund them exclusively by
taking safe deposits. There is no bank capital or reserves in this analysis – such
considerations are deferred until Chapter 7. A bank needs to take deposits 𝐿𝐿 per loan made
and depositors must be paid an interest rate π‘Ÿπ‘Ÿ, the same as the interest rate on government
bonds. We suppose there is a competitive market for loans with free entry of lenders.
For each loan 𝐿𝐿, lenders must repay (1 + π‘Ÿπ‘Ÿ)𝐿𝐿 to depositors. Since both good and bad types
obtain loans, with many borrowers, lenders can predict with confidence that a fraction 𝑑𝑑 will
default, repaying nothing. The repayment from the fraction 1 − 𝑑𝑑 of good types is (1 + π‘Ÿπ‘Ÿπ‘™π‘™ )𝐿𝐿,
which implies that lender profits per loan are:
πœ‹πœ‹ = (1 − 𝑑𝑑)(1 + π‘Ÿπ‘Ÿπ‘™π‘™ )𝐿𝐿 − (1 + π‘Ÿπ‘Ÿ)𝐿𝐿
142
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
Competition between lenders with free entry pushes down lending rates π‘Ÿπ‘Ÿπ‘™π‘™ until profits are
zero. The equation for profits can be written as πœ‹πœ‹ = οΏ½(1 − 𝑑𝑑)(1 + π‘Ÿπ‘Ÿπ‘™π‘™ ) − (1 + π‘Ÿπ‘Ÿ)�𝐿𝐿, so πœ‹πœ‹ = 0
implies that (1 − 𝑑𝑑)(1 + π‘Ÿπ‘Ÿπ‘™π‘™ ) = 1 + π‘Ÿπ‘Ÿ. Solving this equation for the equilibrium loan interest
rate π‘Ÿπ‘Ÿπ‘™π‘™ implies for any 𝐿𝐿:
π‘Ÿπ‘Ÿπ‘™π‘™ = π‘Ÿπ‘Ÿ +
𝑑𝑑(1 + π‘Ÿπ‘Ÿ)
1 − 𝑑𝑑
This equation implies π‘Ÿπ‘Ÿπ‘™π‘™ > π‘Ÿπ‘Ÿ because 𝑑𝑑 > 0. Asymmetric information therefore provides a
rationale for a positive interest-rate spread between borrowers’ and savers’ interest rates.
Intuitively, to pay savers an interest rate π‘Ÿπ‘Ÿ, all borrowers, including the good types can only
receive loans with a higher interest rate than π‘Ÿπ‘Ÿ to compensate for the bad types defaulting.
The interest-rate spread π‘Ÿπ‘Ÿπ‘™π‘™ − π‘Ÿπ‘Ÿ = 𝑑𝑑(1 + π‘Ÿπ‘Ÿ)⁄(1 − 𝑑𝑑) increases with the fraction 𝑑𝑑 of bad
types.
If information were symmetric, good and bad types would face different offers from
lenders. We can think of there being two separate markets with different interest rates π‘Ÿπ‘Ÿπ‘™π‘™ in
which 𝑑𝑑 is known to be either 0 (only good types) or 1 (only bad types). With 𝑑𝑑 = 0, good
types would be offered π‘Ÿπ‘Ÿπ‘™π‘™ = π‘Ÿπ‘Ÿ, the same interest rate as savers. This corresponds to the
implicit assumption of perfect financial markets in Chapter 3. With 𝑑𝑑 = 1, bad types would
face π‘Ÿπ‘Ÿπ‘™π‘™ = ∞, meaning that financial intermediaries refuse to lend to them.
Box 4.4: Does the current profitability of firms matter for
investment?
Without credit-market imperfections, a firm’s investment demand is determined
by:
𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 = π‘Ÿπ‘Ÿ
All that matters is the cost π‘Ÿπ‘Ÿ (or opportunity cost) of funding investment and the
future marginal product of capital 𝑀𝑀𝑃𝑃𝐾𝐾′ , which is related to a firm’s future profits.
However, empirical evidence suggests that firms’ current profitability has a positive
effect on investment decisions. We can understand this observation by introducing
credit-market imperfections into our analysis of investment, as we have done for
consumption.
Suppose that because of asymmetric information, firms can only borrow at interest
rate π‘Ÿπ‘Ÿπ‘™π‘™ > π‘Ÿπ‘Ÿ to fund investment. Think of the ‘good types’ in the asymmetric
information model as firms with profitable investment opportunities who
undertake investment and will repay loans. The ‘bad types’ are unprofitable firms
that are borrowing to keep operating, in that these firms make no investments and
loans are simply used to pay managers and workers until the firms ultimately
default.
For good types, if they have sufficient current profits to fund investment from
retained earnings, the opportunity cost of investment is π‘Ÿπ‘Ÿ, the cost of
143
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
internal funds. But if they do not have sufficient profits, the cost of borrowing for
investment is π‘Ÿπ‘Ÿπ‘™π‘™ > π‘Ÿπ‘Ÿ, the cost of external funds. For firms with sufficient internal
funds, they prefer to finance investment at the lower cost π‘Ÿπ‘Ÿ < π‘Ÿπ‘Ÿπ‘™π‘™ using internal
funds, in which case their level of investment is determined by the equation
𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 = π‘Ÿπ‘Ÿ as in the case of perfect financial markets.
If a firm must use external funds then investment is determined by 𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 = π‘Ÿπ‘Ÿπ‘™π‘™
instead. Let π‘₯π‘₯ = π‘Ÿπ‘Ÿπ‘™π‘™ − π‘Ÿπ‘Ÿ denote the spread between cost of external funds
(borrowing) and internal funds (retained earnings). The equation that determines
investment is equivalent to:
𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 − π‘₯π‘₯ = π‘Ÿπ‘Ÿ
For a firm that was initially sufficiently profitable to finance investment from
internal funds, a decline in profits to the point where the firm must borrow to
invest has the effect of pushing up the funding cost of investment from π‘Ÿπ‘Ÿ to π‘Ÿπ‘Ÿπ‘™π‘™ . This
is equivalent to going from π‘₯π‘₯ = 0 to π‘₯π‘₯ > 0 in the equation above (π‘₯π‘₯ = 0
corresponds to 𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 = π‘Ÿπ‘Ÿ), implying a downward shift of the investment
demand curve. Hence, investment demand is also increasing in current profits for
some firms, all else equal.
Box 4.5: Financial crises
Our analysis of credit-market imperfections helps us understand why economic
activity falls sharply in a financial crisis.
A financial crisis features an increase in defaults on loans among other problems.
With asymmetric information, lenders might reassess the fraction 𝑑𝑑 of ‘bad types’
among borrowers. With higher 𝑑𝑑, the interest-rate spread π‘₯π‘₯ = 𝑑𝑑(1 + π‘Ÿπ‘Ÿ)/(1 − 𝑑𝑑)
between the interest rate π‘Ÿπ‘Ÿπ‘™π‘™ paid by borrowers and the interest rate π‘Ÿπ‘Ÿ received by
savers rises. Hence, given π‘Ÿπ‘Ÿ, borrowers face a higher loan interest rate π‘Ÿπ‘Ÿπ‘™π‘™ = π‘Ÿπ‘Ÿ + π‘₯π‘₯.
Firms that lack sufficient current profits (internal funds) and need to borrow have
their level of investment demand determined by 𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 − π‘₯π‘₯ = π‘Ÿπ‘Ÿ. A higher
interest-rate spread π‘₯π‘₯ shifts investment demand curve downwards as shown in
Figure 4.16. Even with no change in the safe interest rate π‘Ÿπ‘Ÿ received by savers,
investment will be lower in a crisis.
144
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
Figure 4.16: Effect of higher interest-rate spread on firms depending on external
funds
Some empirical support for the prediction of a negative relationship between the
interest-rate spread π‘₯π‘₯ and investment 𝐼𝐼 is presented in Figure 4.17. The graph
shows time series of US investment and the spread between the interest rates on
BAA-rated and AAA-rated corporate bonds. The logic here is that lending to firms
with a AAA rating is less subject to problems of asymmetric information (‘bad types’
being included in the pool of borrowers) than lending to firms with a lower BAA
rating. The negative relationship between the spread and investment is particularly
striking during the 2008 financial crisis.
Figure 4.17: Corporate bond spreads and investment (USA)
To the extent that corporate profits decline in a financial crisis, there is also an
additional amplification effect. With lower profits, fewer firms can rely on internal
funds to finance investment, which means more firms have investment determined
by 𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 − π‘₯π‘₯ = π‘Ÿπ‘Ÿ rather than 𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 = π‘Ÿπ‘Ÿ, further shifting the aggregate
investment demand curve to the left.
145
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
Increases in interest-rate spreads also affect households by steepening the budget
constraint in the region to the right of the endowment point where a household is a
borrower. This is shown in Figure 4.18. Those who were borrowing initially reduce
their borrowing and current consumption because income and substitution effects
of a higher interest rate π‘Ÿπ‘Ÿπ‘™π‘™ are reinforcing for borrowers.
Figure 4.18: Effect of higher interest-rate spread on borrower household
As we will see, financial crises also have important consequences through the effect
of falling asset prices. This reduces the value of collateral and makes it harder to
access credit.
4.6 Limited commitment
A loan contract is an intertemporal exchange. A borrower receives funds now, agreeing to a
future repayment. But as the benefit to the borrower of this exchange is received up front
and only a cost remains in future, this means the borrower may decide to default on the
loan repayment. For lenders to agree to make a loan in first place, it is therefore important
to provide incentives to borrowers not to default.
The standard lifetime budget constraint assumes a household can borrow up to a maximum
amount (𝑦𝑦 ′ − 𝑑𝑑 ′ )/(1 + π‘Ÿπ‘Ÿ). At this point, all future income would be needed to repay debt.
While it is feasible to repay, a household cannot make a binding and credible commitment
always to do so. Moreover, a household has a strong incentive to default if most future
income is simply used for debt service. Lenders cannot easily seize this labour income, or
force people to work to their full potential to repay debts. But if borrowers know lenders
cannot take their labour income, what stops them from defaulting?
We will see this problem is often resolved by the use of collateral, an asset pledged by a
borrower as security for a loan. For example, house is collateral for a mortgage loan and a
car for a car loan. Collateral assets can be more easily seized by lenders than income. Thus,
146
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
the threat of losing collateral on default can provide discipline for borrowers to encourage
them to repay loans. However, the size of the threat depends on what the collateral is
worth. This value will set a limit on the maximum loan sizes lenders are willing to agree to.
We now add a housing asset to the two-period consumption model. Assume a household
initially owns a house. Housing is illiquid, though, and it is not possible to sell a fraction of
the house in the current period (and selling the whole house is inconvenient). We allow for
the house to be sold in future at an anticipated price = 𝑝𝑝′. Suppose the household has some
amount of existing debt 𝐷𝐷 (including principal and interest) owed at the beginning of the
current period, for example, an existing mortgage. The lifetime budget constraint is now:
𝑐𝑐 +
𝑦𝑦 ′ − 𝑑𝑑 ′ + 𝑝𝑝′
𝑐𝑐 ′
= 𝑦𝑦 − 𝑑𝑑 − 𝐷𝐷 +
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
This adds the future sale value of the house 𝑝𝑝′ to future income, which is equivalent to
adding the present value of the housing asset 𝑝𝑝′ /(1 + π‘Ÿπ‘Ÿ) to lifetime wealth. Existing debt 𝐷𝐷
is subtracted from current income. This assumes that all of this debt is due for repayment or
refinancing in the current period. Long-term debt with interest rate π‘Ÿπ‘ŸΜ… fixed in the past would
enter the budget constraint with (1 + π‘Ÿπ‘ŸΜ… )𝐷𝐷 subtracted from future income 𝑦𝑦 ′ − 𝑑𝑑 ′ instead.
Suppose the household borrows a total amount 𝐿𝐿 (including rolling over or refinancing
existing debt 𝐷𝐷 due for repayment) between the current and future time periods. By
definition, 𝐿𝐿 = 𝐷𝐷 − 𝑠𝑠 = 𝑐𝑐 − (𝑦𝑦 − 𝑑𝑑 − 𝐷𝐷). If the interest rate is π‘Ÿπ‘Ÿ then the future repayment
due is (1 + π‘Ÿπ‘Ÿ)𝐿𝐿. The housing asset is collateral for this loan. If the household defaults in the
future, this avoids the debt repayment (1 + π‘Ÿπ‘Ÿ)𝐿𝐿, but leads to the loss of the house of value
𝑝𝑝′. Therefore, the household gains from default if 𝑝𝑝′ < (1 + π‘Ÿπ‘Ÿ)𝐿𝐿, so to avoid borrowers
defaulting, lenders must ensure that the following collateral constraint holds:
𝑝𝑝′
𝐿𝐿 ≤
1 + π‘Ÿπ‘Ÿ
The maximum loan size is thus limited by value of the collateral asset 𝑝𝑝′ /(1 + π‘Ÿπ‘Ÿ). This
provides a rationale for a borrowing constraint of the form assumed in Section 4.3, with
current consumption limited by 𝑐𝑐 ≤ 𝑦𝑦 − 𝑑𝑑 − 𝐷𝐷 + 𝑝𝑝′ /(1 + π‘Ÿπ‘Ÿ), which truncates the lifetime
budget constraint. Note that as long as the collateral constraint holds, borrowers are not
expected to default, so lenders anticipate being repaid in full. Therefore, there is no
interest-rate spread as seen in Section 4.5 and loans are available savers’ interest rate π‘Ÿπ‘Ÿ.
In the absence of any collateral and faced with an inability to seize borrowers’ incomes on
default, the limited commitment model implies there would be no lending in equilibrium, so
the household would face the tightest credit constraint 𝑐𝑐 ≤ 𝑦𝑦 − 𝑑𝑑. The presence of suitable
collateral benefits households by giving access to low-cost borrowing up to some limit. This
helps to smooth consumption as shown in Figure 4.19.
147
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
Figure 4.19: Advantage of having collateral
The major drawback of lending depending on collateral is that fluctuations in the value of
the collateral cause fluctuations in the availability of credit. As Figure 4.20 shows, a fall in
house prices leads to a tightening of the borrowing constraint and a drop in current
consumption if that constraint is binding (there is also a wealth effect on existing
homeowners). This is problematic because of the volatility of asset prices. Figure 4.21 shows
the sizeable fluctuations in US house prices over time, which can contribute to fluctuations
in consumption demand by tightening or loosening borrowing constraints.
Figure 4.20: Effects of a fall in house prices
148
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
Figure 4.21: Fluctuations in house prices
Box 4.6: Interest rates and the value of housing collateral
Our earlier analysis of the effects of interest rates π‘Ÿπ‘Ÿ on consumption in Section 3.5
focused on income and substitution effects and we did not consider credit-market
imperfections there. How do borrowing constraints affect the sensitivity of
consumption to interest rates? Do they magnify or dampen the response of
consumption to changes in interest rates?
With an exogenous borrowing limit 𝐿𝐿, which implies 𝑐𝑐 ≤ 𝑦𝑦 − 𝑑𝑑 + 𝐿𝐿, current
consumption 𝑐𝑐 would not respond to changes in interest rates π‘Ÿπ‘Ÿ unless the
borrowing constraint is not binding or ceases to bind for higher π‘Ÿπ‘Ÿ. However, the
limited commitment approach to understanding borrowing constraints suggests
that the maximum loan size 𝐿𝐿 is not exogenous but instead depends on the value of
collateral. Since asset prices are affected by interest rates π‘Ÿπ‘Ÿ, hence, the collateral
constraint is too.
Consider mortgage loans secured by value of an illiquid housing asset with
anticipated future value 𝑝𝑝′. The collateral constraint on the maximum loan size is
𝐿𝐿 ≤ 𝑝𝑝′ /(1 + π‘Ÿπ‘Ÿ), where the value of the house as collateral is 𝑝𝑝′ /(1 + π‘Ÿπ‘Ÿ). Given
expectations of 𝑝𝑝′, a higher interest rate π‘Ÿπ‘Ÿ reduces the collateral value of house.
This is because interest payments rise relative to repayment of capital, and the sum
of principal and interest owed is what matters for a borrower’s incentive to default.
More generally, asset prices fall as interest rates rise because present values of
future payoffs are smaller.
The left panel of Figure 4.22 shows the effects of higher π‘Ÿπ‘Ÿ with a binding collateral
constraint. The higher interest rate means the collateral constraint cuts off the
149
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
budget constraint at a point further to the left, which unambiguously reduces
current consumption 𝑐𝑐. If the collateral constraint remains binding, future
consumption 𝑐𝑐′ does not change.
Figure 4.22: Effect of higher interest rate with and without a collateral constraint
The right panel of the figure shows the case without a collateral constraint (or
equivalently, where it is not binding). We know from Section 3.5 that a higher
interest rate also unambiguously reduces current 𝑐𝑐 because income and
substitution effects are reinforcing. For future consumption 𝑐𝑐′, the two effects are
opposing, and 𝑐𝑐′ remains unchanged in the special case where income and
substitution effects are of exactly the same size. A dominant income effect and
weak substitution effect imply that 𝑐𝑐′ falls, with current 𝑐𝑐 declining by less than
when the two effects were balanced. A dominant income effect means that the
concern for consumption smoothing is more important than responding to
incentives for intertemporal substitution provided by the interest rate.
Therefore, comparing the left and right panels of the figure, in the case where
income effects are dominant and substitution effects weak, a binding collateral
constraint is likely to magnify the effect of π‘Ÿπ‘Ÿ on 𝑐𝑐. This is because the binding
collateral constraint blocks the attempt to smooth consumption that the household
prefers when income effects are strong.
150
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
4.7 Overlapping generations
In Chapter 3, our general equilibrium analysis in the dynamic macroeconomic model was
restricted to an economy with a representative household. We would also like to think
about economies with differences among the population. One alternative approach focuses
on differences between different generations of households, referred to as ‘overlapping
generations’ (OLG).
Using an OLG approach to macroeconomics is useful to study policies where there is an
aspect of intergenerational redistribution, for example, a public pension system. Members
of different generations treated as distinct households, rather than implicitly one large
family. The government, however, spans the generations.
We will study a simple two-period lives OLG model of the economy. Each generation lives
for two periods, ‘youth’ and ‘old age’, and we can analyse the consumption choices of any
particular generation using our existing two-period model.
A crucial feature of the OLG model is that generations overlap. One generation is old at the
same time as subsequent generation is young. At a point in time, suppose the number of old
people in the population is 𝑁𝑁 and the number of young people is 𝑁𝑁′ (this notation is chosen
because the young of today 𝑁𝑁′ will be the old of the future). We allow for demographic
change by allowing the sizes of the generations to differ. Assume that 𝑁𝑁 ′ = (1 + 𝑛𝑛)𝑁𝑁,
where 𝑛𝑛 is the population growth rate between the generations.
We take incomes here as exogenous unlike the full dynamic macroeconomic model from
Chapter 3. The young and old each earn non-financial incomes 𝑦𝑦 and 𝑦𝑦 ′ that remain the
same between generations. It follows that GDP is π‘Œπ‘Œ = 𝑁𝑁 ′ 𝑦𝑦 + 𝑁𝑁𝑁𝑁′, which grows at rate 𝑛𝑛
from one generation to the next. Note that growth in total GDP here comes entirely from
population growth: income per person remains the same over time. If we think of the young
as ‘workers’ and the old as ‘retirees’ then we can assume 𝑦𝑦′ is zero, or at least very small
relative to 𝑦𝑦.
The main application of the OLG framework is to questions of fiscal policy. Nonetheless, as a
simplification, assume no public expenditure (𝐺𝐺 = 0). There can still be taxes and transfers,
and government debt, even without public expenditure.
If income 𝑦𝑦 ′ when old is low relative to income 𝑦𝑦 when young then households have an
incentive to save to smooth consumption as we have seen earlier. The economy is assumed
to have a bond market but no other types of financial markets. However, it turns out that
there is no scope for different generations to trade with each other. The young would like to
save income and lend to others but the only other generation alive at the same time is the
old. The old cannot borrow from the current young because they are not alive in the future
to repay. As generations only overlap once, there is no mutually beneficial trade in the bond
market.
The two-period lives OLG model has the simple feature that the young can only save by
holding bonds issued by the government. Suppose there is a quantity 𝑏𝑏 of government
bonds issued per young person. These bonds will mature when the current young are old. In
151
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
equilibrium, the real interest rate π‘Ÿπ‘Ÿ on bonds adjusts so that saving of young 𝑠𝑠 = 𝑦𝑦 − 𝑑𝑑 − 𝑐𝑐
is equal to 𝑏𝑏.
To see how this works, start from the even simpler case of no government intervention.
There are no taxes and transfers (𝑑𝑑 = 0, 𝑑𝑑 ′ = 0) and no debt is issued (𝑏𝑏 = 0). In
equilibrium, saving must be zero, 𝑠𝑠 = 0, which implies the young must consume their
income, 𝑐𝑐 = 𝑦𝑦. As they do not acquire any assets because there are no government bonds,
the same is true when old, 𝑐𝑐 ′ = 𝑦𝑦′. In this no-intervention economy, the real interest rate π‘Ÿπ‘Ÿ
adjusts so that 𝑐𝑐 = 𝑦𝑦 and 𝑐𝑐 ′ = 𝑦𝑦′. Diagrammatically, the gradient of the lifetime budget
constraint 𝑐𝑐 + 𝑐𝑐 ′ /(1 + π‘Ÿπ‘Ÿ) = 𝑦𝑦 + 𝑦𝑦 ′ /(1 + π‘Ÿπ‘Ÿ) adjusts until 𝑐𝑐 = 𝑦𝑦 and 𝑐𝑐 ′ = 𝑦𝑦′ is chosen by the
young generation of households as shown in Figure 4.23. The resulting real interest rate is
given by 1 + π‘Ÿπ‘Ÿ = 𝑀𝑀𝑀𝑀𝑆𝑆𝑐𝑐,𝑐𝑐 ′ at the endowment point.
Figure 4.23: No trade between the generations
Is the market equilibrium of this economy Pareto efficient? In other words, would
government intervention making one person better off require making someone else worse
off? There is none of the usual externalities or market failures that would suggest the
market equilibrium is suboptimal. However, as we will see, it may be possible for
government intervention to make all generations better off.
Suppose there is a social planner who can directly choose consumption for each generation
subject to the total supply of goods available. Can this social planner do better than the
market? Assume the social planner picks the same consumption plan (𝑐𝑐, 𝑐𝑐 ′ ) for youth and
old age of each generation. Given the numbers of young and old people, this plan entails
aggregate consumption 𝐢𝐢 = 𝑁𝑁 ′ 𝑐𝑐 + 𝑁𝑁𝑁𝑁′. The plan is feasible if 𝐢𝐢 = π‘Œπ‘Œ:
𝑁𝑁 ′ 𝑐𝑐 + 𝑁𝑁𝑐𝑐 ′ = 𝑁𝑁 ′ 𝑦𝑦 + 𝑁𝑁𝑁𝑁′
Dividing both sides by 𝑁𝑁′ and using the demographic equation 𝑁𝑁⁄𝑁𝑁′ = 1⁄(1 + 𝑛𝑛):
𝑐𝑐 +
𝑦𝑦′
𝑐𝑐′
= 𝑦𝑦 +
1 + 𝑛𝑛
1 + 𝑛𝑛
152
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
In the diagram with consumption plans (𝑐𝑐, 𝑐𝑐 ′ ), this is a straight line passing through the
endowment point (𝑦𝑦, 𝑦𝑦 ′ ) with gradient −(1 + 𝑛𝑛) as plotted in Figure 4.24. This line
resembles a budget constraint, except that the population growth rate, which is also the
economy’s GDP growth rate, determines the gradient of the constraint instead of an
interest rate.
Figure 4.24: Feasible consumption plans for social planner
When the no-government-intervention market equilibrium has π‘Ÿπ‘Ÿ < 𝑛𝑛, the social planner can
choose a combination of 𝑐𝑐 and 𝑐𝑐′ where 𝑐𝑐′ is higher than the market equilibrium and a
higher indifference curve is reached as illustrated in the figure. As 𝑐𝑐′ is higher, moving to this
plan would benefit the generation of old alive when it is implemented because they only
care about 𝑐𝑐′. But as a higher indifference curve is reached, all current and future
generations of young are also better off. This constitutes a Pareto improvement on the
market equilibrium.
In light of this analysis, we say that the economy is dynamically inefficient when π‘Ÿπ‘Ÿ < 𝑛𝑛, that
is, the real interest rate is below the population growth rate (which is also the GDP growth
rate here). However, when π‘Ÿπ‘Ÿ > 𝑛𝑛, this Pareto improvement is not available. Moving to a
feasible consumption plan on a higher indifference curve would mean reducing 𝑐𝑐′, which
would hurt the initial generation of old. The economy is said to be dynamically efficient in
the case π‘Ÿπ‘Ÿ > 𝑛𝑛.
Box 4.7: Pay-as-you-go pension systems
Many governments have established what are known as ‘pay-as-you-go’ pension
systems. These are public pensions that are paid not from an accumulated fund of
assets but from the current contributions or taxes of younger people. They are an
important example of a government policy that redistributes between different
generations. Here, we look at why governments use this type of pension system
153
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
and whether it is a defect that the system is not fully funded. Unlike a typical
private pension, the system is not backed by any assets.
Suppose a government sets up a pay-as-you-go pension system. The young are
obliged to make contributions 𝑑𝑑 (a tax). The old are paid a pension 𝑝𝑝 (a transfer,
𝑑𝑑 ′ = −𝑝𝑝). The system has no assets. The payments to the old come from the
contributions collected at the same time from the young. Assuming the system
balances contributions from the 𝑁𝑁′ young and payments to the 𝑁𝑁 old, the system’s
budget constraint is:
𝑁𝑁𝑁𝑁 = 𝑁𝑁 ′ 𝑑𝑑
Dividing both sides by 𝑁𝑁′ and using the demographic equation 𝑁𝑁⁄𝑁𝑁′ = 1⁄(1 + 𝑛𝑛),
the contributions 𝑑𝑑 and pensions 𝑝𝑝 must be related as follows:
𝑑𝑑 =
𝑝𝑝
1 + 𝑛𝑛
The first generation of the old at the time the system is established necessarily gain
from it because they receive the benefit 𝑝𝑝 without having to pay the contributions
𝑑𝑑. Their consumption increases as they spend their additional pension income.
Given a real interest rate π‘Ÿπ‘Ÿ, the lifetime budget constraint of current and
subsequent generations of the young is:
𝑐𝑐 +
𝑐𝑐′
𝑦𝑦 ′ + 𝑝𝑝
= 𝑦𝑦 − 𝑑𝑑 +
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
By substituting 𝑑𝑑 = 𝑝𝑝⁄(1 + 𝑛𝑛) from the pension-system budget constraint:
(𝑛𝑛 − π‘Ÿπ‘Ÿ)𝑝𝑝
𝑐𝑐′
𝑝𝑝
𝑦𝑦 ′ + 𝑝𝑝
𝑦𝑦 ′
𝑐𝑐 +
= 𝑦𝑦 −
+
= 𝑦𝑦 +
+
1 + π‘Ÿπ‘Ÿ (1 + π‘Ÿπ‘Ÿ)(1 + 𝑛𝑛)
1 + π‘Ÿπ‘Ÿ
1 + 𝑛𝑛 1 + π‘Ÿπ‘Ÿ
The final term is positive and the budget constraint shifts out if 𝑛𝑛 > π‘Ÿπ‘Ÿ holds. This
case is depicted in Figure 4.25.
Figure 4.25: Effect of introducing a pay-as-you-go pension system
154
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
In equilibrium, the real interest rate π‘Ÿπ‘Ÿ rises as consumption falls when young and
rises when old, which means the gradient of the indifference curve is steeper. This
is why the gradient of the budget constraint increases as well as its position shifting
upwards in the diagram.
When 𝑛𝑛 > π‘Ÿπ‘Ÿ, current and subsequent generations of young are better off as a
result of the pay-as-you-go pension system being established, in addition to the
current old. This means everyone gains from the introduction of the system. We
know this is possible because the economy’s equilibrium is dynamically inefficient if
π‘Ÿπ‘Ÿ < 𝑛𝑛. Resources can be reallocated to make everyone better off, and the pay-asyou-go pension system is just one way of achieving this.
Intuitively, the market failure and case for government intervention is due to the
difficulty of trade between the generations. There is no pair of generations that can
make a mutually beneficial deal. Observe that the young lose by paying a tax that
goes to the old – they only gain because they in turn get a pension paid for by the
next generation. And that generation only gains because its pension is paid for by
the generation after, and so on. This pattern of trade between the generations is
difficult to arrange privately without government intervention.
If the economy is dynamically efficient (π‘Ÿπ‘Ÿ > 𝑛𝑛) then introducing the pay-as-you-go
pension system benefits the currently old generation but makes the young and all
subsequent generations worse off. In the model, 𝑛𝑛 is the GDP growth rate, which
comes only from population growth and π‘Ÿπ‘Ÿ is the real interest rate on bonds. For
everyone to gain from pay-as-you-go, we need real interest rates that are lower
than the economy’s real growth rate. Importantly, this must be true not just
currently but at all future times as well. Pay-as-you-go pension systems were often
established during ‘baby booms’ when GDP growth was high but such favourable
demographics have not been maintained in many countries.
Box 4.8: Should pensions be fully funded?
A pay-as-you-go pension system pays pensions using the current contributions to
the system, not from assets held by the system. An alternative is a fully funded
pensions system. This system pays current pensions using assets acquired by
investing past pension contributions. Such a system could be private, arranged by
households or firms without any government intervention – pensions are then just
a special case of private saving. Or it could be run or regulated by the government,
with individuals being obliged to make pension contributions from their income.
This is an example of ‘forced saving’.
We study fully funded pensions using the two-period model of consumption where
the first period is ‘youth’ and the second period is ‘old age’. Suppose the
155
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
government requires the young to pay an amount 𝑑𝑑 into their pension. These funds
are invested in assets with real return π‘Ÿπ‘Ÿ. These could be managed privately or by
the government but we assume that the return π‘Ÿπ‘Ÿ is the same irrespective of who
invests the funds. Assets are worth (1 + π‘Ÿπ‘Ÿ)𝑑𝑑 by old age and are used to pay out a
pension 𝑝𝑝 given by the constraint:
𝑝𝑝 = (1 + π‘Ÿπ‘Ÿ)𝑑𝑑
This is the budget constraint of the pension system, assuming there are no
administrative costs or intermediary profits taken out. A household’s lifetime
budget constraint is:
𝑐𝑐 +
𝑐𝑐 ′
𝑦𝑦 ′ + 𝑝𝑝
= 𝑦𝑦 − 𝑑𝑑 +
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
By substituting the pension budget constraint 𝑝𝑝 = (1 + π‘Ÿπ‘Ÿ)𝑑𝑑:
𝑐𝑐 ′
𝑦𝑦 ′
𝑦𝑦 ′
𝑐𝑐 +
= 𝑦𝑦 − 𝑑𝑑 +
+ 𝑑𝑑 = 𝑦𝑦 +
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
Notice that the pension 𝑝𝑝 and the contributions 𝑑𝑑 cancel out because they are of
the same present value. The argument here is analogous to Ricardian equivalence.
Absent credit-market imperfections, setting up a fully funded pension system has
no effect on households. This is because without credit-market imperfections, the
lifetime budget constraint is the only constraint faced by households, and we have
seen that a fully funded pension system has no impact on this constraint. As shown
in Figure 4.26, the endowment point would move along an unchanged lifetime
budget constraint and there would be no change in households’ consumption plans
or welfare. For the consumption plan to remain the same, households reduce their
own private saving (or borrow) when forced to make pension contributions. Total
savings including the pension contributions are unaffected.
Figure 4.26: Establishing a fully-funded pension system
156
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
However, households may not find it possible to borrow against future pension
income, which means they may not be able to cut private saving to compensate for
being obliged to contribute to the new pension system. Suppose we impose a
borrowing constraint that limits current consumption to 𝑐𝑐 ≤ 𝑦𝑦 − 𝑑𝑑. Without creditmarket imperfections, a fully funded pension system has no consequences for
welfare. With a binding borrowing constraint, a fully funded pension system
reduces current consumption if individuals are forced to save more than they
would have chosen to. In that case, individuals are made worse off by introducing
the pension system, as shown in Figure 4.26.
The analysis so far has failed to demonstrate a case for government intervention in
the pension system. However, if households were myopic and failed to make
adequate provision for the future then paternalistic government intervention might
improve welfare. It is also possible that when there are transaction costs in making
investments, the large scale of the government lowers costs relative to everyone
investing privately.
A further possible justification for imposing a fully funded pension system is that if
the government did not force people to save, political pressure might compel it to
provide public pensions to old people who have not saved enough. If such public
pensions are financed by taxes levied on the young, this would create a de facto
pay-as-you-go pension system. If π‘Ÿπ‘Ÿ > 𝑛𝑛 then the economy is dynamically efficient
and we know pay-as-you-go pensions make the young and subsequent generations
worse off. There is a need to impose a fully funded pension system in this case to
avoid political pressure to establish an inefficient pay-as-you-go system.
Box 4.9: Declining population growth rates and pay-as-you-go
pensions
Pay-as-you-go pensions were often established at times of high population growth
rates (‘baby booms’), population growth being one reason for growth in total GDP.
But fast population growth has not been sustained in many countries and this
demographic change affects what pay-as-you-go pension payments and
contributions are sustainable.
We know that the balanced budget constrained of a pay-as-you-go pension system
is 𝑑𝑑 = 𝑝𝑝⁄(1 + 𝑛𝑛), where 𝑑𝑑 is contributions made by the young, 𝑝𝑝 is pensions paid to
the old, and 𝑛𝑛 is the population growth rate. After a decline in population growth 𝑛𝑛,
the condition for dynamic efficiency π‘Ÿπ‘Ÿ > 𝑛𝑛 may now hold, undermining the case for
pay-as-you-go pensions. If this happens, should pay-as-you-go pensions be
reformed or abolished?
A pay-as-you-go pension system is one way that a government can affect the
allocation of consumption between young (𝑐𝑐) and old (𝑐𝑐′) generations. Absent
157
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
other government intervention, in equilibrium we know that 𝑐𝑐 = 𝑦𝑦 − 𝑑𝑑 and 𝑐𝑐 ′ =
𝑦𝑦 ′ + 𝑝𝑝. Substituting these equations into the pension system budget constraint 𝑑𝑑 =
𝑝𝑝⁄(1 + 𝑛𝑛) leads to the fundamental resource constraint on feasible allocations of 𝑐𝑐
and 𝑐𝑐′:
𝑐𝑐 +
𝑐𝑐 ′
𝑦𝑦 ′
= 𝑦𝑦 +
1 + 𝑛𝑛
1 + 𝑛𝑛
Lower population growth 𝑛𝑛 reduces gradient of this constraint, pivoting it around
the point (𝑦𝑦, 𝑦𝑦 ′ ) as shown in Figure 4.27. The diagram is set up so that starting from
the population growth rate 𝑛𝑛1 , the economy would be dynamically inefficient in the
absence of a pay-as-you-go pension system (π‘Ÿπ‘Ÿ < 𝑛𝑛1 if 𝑐𝑐 = 𝑦𝑦 and 𝑐𝑐 ′ = 𝑦𝑦′). Suppose
pension contributions 𝑑𝑑 have been raised by the maximum amount agreeable to all
generations in the past. This results in π‘Ÿπ‘Ÿ = 𝑛𝑛1 (the economy is just dynamically
efficient) at 𝑐𝑐 = 𝑦𝑦 − 𝑑𝑑1 and 𝑐𝑐 ′ = 𝑦𝑦 ′ + 𝑝𝑝1 . The indifference curve of each new young
generation is tangent to the government’s resource constraint at this point.
Figure 4.27: Lower population growth and pay-as-you-go pensions
The lower population growth rate 𝑛𝑛2 < 𝑛𝑛1 pivots the government’s constraint to
the left around the point (𝑦𝑦, 𝑦𝑦 ′ ). The original pension system 𝑑𝑑1 and 𝑝𝑝1 no longer
satisfies the constraint 𝑑𝑑 = 𝑝𝑝⁄(1 + 𝑛𝑛2 ). This means that either the pension 𝑝𝑝 is cut,
or the contributions 𝑑𝑑 are raised, or some combination of both.
Reducing pensions 𝑝𝑝 makes the current old generation worse off. The current old
clearly prefer contributions 𝑑𝑑 are raised to preserve their pension. However, if 𝑝𝑝 is
not cut then it is now the case that π‘Ÿπ‘Ÿ > 𝑛𝑛2 (the economy started from π‘Ÿπ‘Ÿ = 𝑛𝑛1 ). The
young are worse off if the pension 𝑝𝑝 is maintained at its former level. As the
economy has become dynamically efficient, current and future generations of
young prefer to cut pensions 𝑝𝑝 to some extent. We conclude that, unfortunately,
there is no reform that does not make some generation worse off.
158
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
Note, however, that the young would generally not favour total abolition of the
pay-as-you-go pension system. Reducing contributions 𝑑𝑑 all the way to zero may
imply that π‘Ÿπ‘Ÿ falls below 𝑛𝑛2 , making the economy dynamically inefficient, from
which point a higher 𝑑𝑑 and 𝑝𝑝 would make both young and old better off. Therefore,
the young prefer pensions 𝑝𝑝 are cut until π‘Ÿπ‘Ÿ = 𝑛𝑛2 .
Box 4.10: Bubbles in financial markets
Our earlier analysis of financial markets in Chapter 3 explained asset prices in terms
of fundamentals. For example, share prices in Section 3.9 are equal to the present
discounted value of the dividends paid by firms. This approach to analysing financial
markets suggests there is no scope for ‘bubbles’ to arise.
A bubble occurs where an asset’s value exceeds the present value of the payments
received by owning the asset. This means some or all of asset’s current value comes
from expectations of its future value, not from expectations of the fundamentals
related to the payments it will make. Bubbles were not considered earlier in
Chapter 3 because rational investors would not want to pay more than the
fundamental worth of an asset. However, we will see that bubbles are possible in
our economy with overlapping generations in some circumstances.
Consider the following extreme example. In an overlapping generations economy,
suppose an old generation sets up companies and sells shares to young. If shares of
value 𝑉𝑉 are sold then each of the 𝑁𝑁 ′ young individuals must have paid an amount
𝑝𝑝 = 𝑉𝑉/𝑁𝑁′. Each of the 𝑁𝑁 old individuals receives 𝑉𝑉 ⁄𝑁𝑁 = 𝑁𝑁 ′ 𝑝𝑝⁄𝑁𝑁 = (1 + 𝑛𝑛)𝑝𝑝 using
the demographic equation 𝑁𝑁 ′ = (1 + 𝑛𝑛)𝑁𝑁. Assume these companies undertake no
investment and hence, cannot ever pay any dividends. Their shares have no
fundamental value, so there is a bubble if 𝑉𝑉 and 𝑝𝑝 have a positive equilibrium value.
Why would the young ever want to hold these shares? Since the shares never pay a
dividend, any expected return π‘Ÿπ‘Ÿ depends solely on capital gains:
𝑉𝑉 ′ = (1 + π‘Ÿπ‘Ÿ)𝑉𝑉
However, we will see that capital gains sufficient to persuade the young to hold the
shares might be sustainable. Suppose the young generation pays 𝑝𝑝 each for shares.
When old, they can sell to next generation of young and receive (1 + 𝑛𝑛)𝑝𝑝′ each,
where 𝑝𝑝′ = 𝑉𝑉 ′ /𝑁𝑁 ′′ is the anticipated amount to be paid by the next young
generation. The capital gain is ((1 + 𝑛𝑛)𝑝𝑝′ − 𝑝𝑝)⁄𝑝𝑝, which is equal to (𝑉𝑉′⁄𝑉𝑉 ) − 1 and
hence, π‘Ÿπ‘Ÿ using 𝑝𝑝 = 𝑉𝑉/𝑁𝑁′. Any saving done by the young earns π‘Ÿπ‘Ÿ, so their lifetime
budget constraint is the usual 𝑐𝑐 + 𝑐𝑐′⁄(1 + π‘Ÿπ‘Ÿ) = 𝑦𝑦 + 𝑦𝑦′⁄(1 + π‘Ÿπ‘Ÿ).
Here we ignore government debt and any taxes and transfers such as pay-as-you-go
pensions. To have the young paying 𝑝𝑝 each for the bubble asset, the per-person
159
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
saving 𝑠𝑠 = 𝑦𝑦 − 𝑐𝑐 of the young must equal 𝑝𝑝. The level of 𝑠𝑠 must also maximise
lifetime utility given the expected asset return π‘Ÿπ‘Ÿ. Conditional on 𝑝𝑝 and π‘Ÿπ‘Ÿ, since π‘Ÿπ‘Ÿ is
solely a capital gain, the amount 𝑝𝑝′ paid by the next generation is:
𝑝𝑝′ = οΏ½
1 + π‘Ÿπ‘Ÿ
οΏ½ 𝑝𝑝
1 + 𝑛𝑛
These conditions must continue to hold for all subsequent generations as well.
Figure 4.28 shows that in dynamically inefficient economies with π‘Ÿπ‘Ÿ < 𝑛𝑛, a bubble is
possible with the young paying 𝑝𝑝 each for bubble asset. As long as 𝑝𝑝 is not too
large, the increase in π‘Ÿπ‘Ÿ needed to give the young an incentive to hold the asset
leaves π‘Ÿπ‘Ÿ no more than 𝑛𝑛. The bubble asset is sold on to next generation of young,
with each paying 𝑝𝑝′ but 𝑝𝑝′ is no more than 𝑝𝑝 because π‘Ÿπ‘Ÿ ≤ 𝑛𝑛. The bubble asset
remains affordable to future generations of young because the amount paid per
person does not grow. The total bubble asset value is 𝑉𝑉 = (1 + 𝑛𝑛)𝑝𝑝𝑝𝑝, thus 𝑉𝑉′⁄𝑉𝑉 =
(1 + 𝑛𝑛)(𝑝𝑝′ ⁄𝑝𝑝) ≤ 1 + 𝑛𝑛, so the total value grows no faster than the size of the
economy. The largest possible bubble size pushes π‘Ÿπ‘Ÿ up to 𝑛𝑛 exactly, in which case 𝑝𝑝
is constant and 𝑉𝑉 grows at same rate 𝑛𝑛 as the economy.
In the dynamically efficient economy (π‘Ÿπ‘Ÿ ≥ 𝑛𝑛) depicted in Figure 4.29, a bubble
would require π‘Ÿπ‘Ÿ to rise above 𝑛𝑛, or even further above 𝑛𝑛. Since π‘Ÿπ‘Ÿ > 𝑛𝑛, the amount
𝑝𝑝′ that must be paid per person for the bubble asset by the next generation of
young is greater than the 𝑝𝑝 that the current generation of young pays. This perperson amount paid needs to keep rising, which is eventually impossible given the
incomes of the young. Such a bubble would ‘pop’ and the anticipation of this
prevents it forming if investors behave rationally. Therefore, no bubbles form in a
dynamically efficient economy.
Figure 4.28: A bubble in a dynamically inefficient economy
160
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
Figure 4.29: No bubbles in dynamically efficient economies
Box 4.11: Does the government have a budget constraint when
interest rates are low?
In the two-period model, the government must adjust taxes or public expenditure
to satisfy its present value budget constraint if it is to avoid a hard default on its
debt. This government budget constraint is crucial to the argument for Ricardian
equivalence because it implies a tax cut now entails higher taxes in the future.
What difference does it make if there is no final period, as with the overlapping
generations economy? Can the government repay existing bonds by issuing new
bonds – rolling over its debts – and keep on doing this? Will investors be willing to
hold government bonds in this case?
Consider the following example to illustrate the argument. Suppose initially that no
government debt is outstanding, and there are no taxes, transfers, or public
expenditure plans and no public pension system is in place. The government then
makes a one-off transfer payment to the current old generation funded by issuing
bonds. However, the government does not ever plan to raise taxes to repay those
bonds. Instead, it simply keeps rolling over its debts as bonds mature in each time
period.
Suppose bonds of value 𝑏𝑏 per young person are issued. Each old person receives a
transfer 𝑁𝑁 ′ 𝑏𝑏⁄𝑁𝑁 = (1 + 𝑛𝑛)𝑏𝑏. Is this fiscal policy feasible? In other words, is there a
bond price (or equivalently, a yield π‘Ÿπ‘Ÿ) at which the young, the only possible buyers
of the bonds, are willing to hold them?
Suppose 𝑁𝑁 ′ 𝑏𝑏 bonds are issued at yield π‘Ÿπ‘Ÿ. This means that (1 + π‘Ÿπ‘Ÿ)𝑁𝑁 ′ 𝑏𝑏 is due for
repayment when these bonds mature. There are no future taxes or transfer
161
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
payments planned. With 𝑁𝑁 ′′ = (1 + 𝑛𝑛)𝑁𝑁 ′ young people in the future period, if new
bonds are issued to repay the existing debt then the following bond issuance 𝑏𝑏′ per
future young person is required:
𝑏𝑏′ =
(1 + π‘Ÿπ‘Ÿ)𝑁𝑁 ′ 𝑏𝑏
1 + π‘Ÿπ‘Ÿ
=οΏ½
οΏ½ 𝑏𝑏
′′
𝑁𝑁
1 + 𝑛𝑛
Given bond issuance 𝑏𝑏, an equilibrium requires there is an interest rate π‘Ÿπ‘Ÿ at which
the saving 𝑠𝑠 = 𝑦𝑦 − 𝑐𝑐 of the young is equal to 𝑏𝑏. Given this π‘Ÿπ‘Ÿ, next period’s bond
issuance is given by the equation above for 𝑏𝑏′ . There must then be a π‘Ÿπ‘Ÿ′ such that
𝑠𝑠 ′ = 𝑏𝑏′, and so on for all future generations.
This analysis is mathematically identical to that for ‘bubbles’ in Box 4.10 by simply
replacing the per young person purchases 𝑝𝑝 of the bubble asset with per young
person government bond issuance 𝑏𝑏.
If the economy is dynamically efficient (π‘Ÿπ‘Ÿ ≥ 𝑛𝑛), issuance of government bonds
requires π‘Ÿπ‘Ÿ > 𝑛𝑛 but then the amount of bonds outstanding per young person keeps
on growing until incomes are not high enough to keep rolling over the debt. There
would be a hard default and anticipation of this means that bonds not bought by
the young. In this case, it is not possible to run a budget deficit without raising
future taxes.
If the economy is dynamically inefficient (π‘Ÿπ‘Ÿ < 𝑛𝑛) then it is possible to run a deficit
and never raise taxes in the future. The usual government budget constraint does
not hold in this case. But this conclusion is subject to the deficit not being so large
as to require π‘Ÿπ‘Ÿ > 𝑛𝑛, so there are ultimately still some constraints on fiscal policy.
162
EC2065 Macroeconomics | Chapter 5: Unemployment
Chapter 5: Unemployment
The basic function of an economy is to allow people to turn their time as work into
production and incomes that allow them to consume goods and services. The occurrence of
unemployment, meaning those who do not have a job but who are actively seeking work,
suggests economies might be failing in this central task.
Our analysis of the labour market back in Chapter 1 had no place for unemployment. There,
the labour market cleared with the real wage adjusting to bring the demand for labour into
line with the supply of labour. Note that unemployment is defined as people who want to
work at the prevailing wage but who have not yet found a job. This is different from the
earlier notion of non-participation in the labour market, which refers to those who do not
want to work at the prevailing wage.
In this chapter, we will explore why unemployment occurs. Furthermore, we would like to
understand why the extent of unemployment varies over time and why unemployment can
differ significantly between countries. By understanding unemployment better, we will try
to answer the question of whether unemployment is a failure of the economy that can be
corrected with an appropriate policy response.
Essential reading
•
Williamson, Chapters 7 and 8.
5.1 Introduction to unemployment
Figure 5.1 below shows a time series of the unemployment rate in the USA. We see that the
unemployment has been around 5 per cent on average, although rising close to or above 10
per cent in severe recessions. As well as varying over time, average unemployment rates
differ considerably across countries as can be seen in Figure 5.2. Figure 5.3 plots cyclical
fluctuations in US unemployment and compares them to cyclical fluctuations in real GDP.
The unemployment rate is strongly countercyclical.
163
EC2065 Macroeconomics | Chapter 5: Unemployment
Figure 5.1: The unemployment rate in the USA
Figure 5.2: Unemployment rates for various countries
164
EC2065 Macroeconomics | Chapter 5: Unemployment
Figure 5.3: The unemployment rate over the business cycle in the USA
How can we understand the occurrence of unemployment? In our earlier supply and
demand analysis of the labour market, there was no unemployment because the real wage
𝑀𝑀 adjusts to a point where labour demand is equal to labour supply. This is depicted in
Figure 5.4, with market clearing at 𝑀𝑀 ∗∗ . At this point, not everyone need be participating the
labour market and working the maximum amount but there is no unemployment in the
sense of someone wanting to work more at wage 𝑀𝑀 ∗∗ but not being able to find extra
employment.
One approach to thinking about unemployment is to suppose there is some impediment to
wage adjustment. Assume that wages are ‘sticky’ downwards, so the labour market cannot
clear. With the real wage stuck at some level 𝑀𝑀 ∗ above 𝑀𝑀 ∗∗ , desired labour supply exceeds
desired labour demand, and unemployment π‘ˆπ‘ˆ ∗ is the gap between the two.
Figure 5.4: Sticky wages and unemployment
If wages are sticky, there can be a shortage of demand for labour and, hence, some who
want to work cannot find jobs. But in this case, why do wages not fall to clear the labour
165
EC2065 Macroeconomics | Chapter 5: Unemployment
market in a way that a price would in any other market if there were excess supply? We
could identify wage contracts as creating some rigidity in wage adjustment, along the lines
of the nominal rigidities we will consider in Chapter 8. However, contracts can eventually be
renegotiated, so this in itself does not seem a promising explanation for why there is
positive unemployment even in the long run. We could also think of regulations such as
minimum wages as creating wage rigidity. However, unemployment exists even in countries
with no minimum wages, so this cannot be an important part of the story.
5.2 Efficiency wages
We will now analyse a possible reason why wages are sticky, namely, that firms would not
want to cut wages even if they could recruit staff at lower wages than they are currently
paying. This incentive for firms to pay high wages, even when there is unemployment, is
known as the theory of ‘efficiency wages’.
A crucial idea in understanding efficiency wages is that labour market does not operate like
a standard goods market where products with a known specification and quality can be
purchased. Hiring workers is not like this. Firms do not know in advance the exact abilities of
the workers they hire, nor how much effort they will exert on the job. In our earlier analysis
of the labour market, firms could hire 𝑁𝑁 workers with known skills and effort levels at wage
𝑀𝑀. Now, effective labour input 𝐸𝐸 depends on the wage paid because it affects incentives to
apply for jobs and put in effort on the job.
Assume each worker contributes effective labour input 𝑒𝑒(𝑀𝑀), which is a function of the real
wage 𝑀𝑀. The dependence of effective labour input on the wage paid creates an incentive for
firms to pay ‘efficiency wages’. Total effective labour input from 𝑁𝑁 workers is 𝐸𝐸 = 𝑒𝑒(𝑀𝑀)𝑁𝑁,
and the production function π‘Œπ‘Œ = 𝐹𝐹(𝐸𝐸) depends on effective labour input 𝐸𝐸 rather than
simply the number of workers employed. The production function 𝐹𝐹(𝐸𝐸) is increasing and
concave. The function 𝑒𝑒(𝑀𝑀) for effective labour input is assumed to be increasing in 𝑀𝑀.
One justification for this is based on a moral hazard problem. The firm needs to pay high
wages to discourage workers from shirking (being lazy) on the job. The argument is that
firms cannot directly control worker effort, although some imperfect monitoring of workers
is possible. Workers do not like putting in effort but fear being caught shirking and being
dismissed. For workers, the cost of losing a job depends on how high the current wage is
compared to wages in other jobs that a dismissed worker could get. Hence, a higher wage
provides a greater incentive not to shirk.
The implicit assumptions in this argument are that dismissal is the worst punishment
available if a worker caught shirking. Furthermore, workers cannot commit to put in effort
when hired irrespective of whether they have an incentive to honour their promise. Finally,
workers have no reputational concerns, which might arise from needing a reference from
their current employer to obtain another job. While some of these assumptions may not
hold, particularly the latter, in practice, this would only weaken the moral hazard problem,
not eliminate it completely.
166
EC2065 Macroeconomics | Chapter 5: Unemployment
Another argument for having effective labour input as a function of wages is based on an
asymmetric information problem. Job applicants know more about their skills and abilities,
hence, potential earnings, than firms looking to recruit them. This means that good workers
will not want to apply for low-paying jobs. Even though firms cannot perfectly identify in
advance who will be a good worker, they know the wage they offer will affect the
composition of the group of applicants. Firms then need to offer high wages to attract
applications from good workers. A related argument makes the case that paying high wages
is important for retaining good workers.
As well as effective labour input 𝑒𝑒(𝑀𝑀) being increasing in wages 𝑀𝑀, it also makes sense to
assume 𝑒𝑒(𝑀𝑀) is a convex function for low 𝑀𝑀, which becomes concave for high 𝑀𝑀. Convexity
for low wages justified by there being some minimum level of wages required for people to
apply for jobs and exert effort on the job, so starting from very low 𝑀𝑀, the gradient of 𝑒𝑒(𝑀𝑀)
is initially increasing. Concavity for high wages justified by there being some maximum level
of effective labour input workers can physically put in, so the gradient of 𝑒𝑒(𝑀𝑀) ultimately
flattens out. An 𝑒𝑒(𝑀𝑀) function with these properties is shown in Figure 5.5.
Figure 5.5: Wages and effective labour input
There are other factors that affect workers’ effective labour input beyond just the wage paid
by their employers. This should also depend on market conditions. Higher wages 𝑀𝑀 ∗ in other
jobs weaken the incentives provided by the firm’s own wage 𝑀𝑀, all else being equal.
Workers would be less inclined to apply for a job paying 𝑀𝑀 or try to keep their current job if
wages 𝑀𝑀 ∗ at other firms are high. On the other hand, a higher unemployment rate 𝑒𝑒
strengthens incentives, making workers more inclined to apply for jobs or to try to keep
their current job. In general, we could write down a function 𝑒𝑒(𝑀𝑀, 𝑀𝑀 ∗ , 𝑒𝑒) that is decreasing
in 𝑀𝑀 ∗ and increasing in 𝑒𝑒, although we will often simplify matters by assuming that the
function depends on 𝑀𝑀 only.
How are wages determined when there is not a competitive market for workers who
provide a fixed and known amount of effective labour input? Firms are able to choose
wages 𝑀𝑀 to maximise profits, taking into account the relationship between effective labour
input and wages. A firm maximises profits πœ‹πœ‹ = 𝐹𝐹 (𝐸𝐸 ) − 𝑀𝑀𝑀𝑀 with respect to both
167
EC2065 Macroeconomics | Chapter 5: Unemployment
employment 𝑁𝑁 and wages 𝑀𝑀 subject to 𝐸𝐸 = 𝑒𝑒(𝑀𝑀)𝑁𝑁. Since 𝑁𝑁 = 𝐸𝐸/𝑒𝑒(𝑀𝑀) profits can be
written as:
πœ‹πœ‹ = 𝐹𝐹 (𝐸𝐸 ) −
𝑀𝑀
𝐸𝐸
𝑒𝑒(𝑀𝑀)
The interpretation of this equation is that given wage 𝑀𝑀, the cost of one unit of effective
labour input 𝐸𝐸 is 𝑀𝑀
οΏ½ = 𝑀𝑀/𝑒𝑒(𝑀𝑀). Taking this as given for now, by adjusting size of the
workforce 𝑁𝑁, firms demand 𝐸𝐸 up to the point where:
πœ•πœ•πœ•πœ•
= 𝐹𝐹 ′ (𝐸𝐸 ) − 𝑀𝑀
οΏ½ =0
πœ•πœ•πœ•πœ•
This gives rise to the equivalent of a labour demand curve, 𝐹𝐹 ′ (𝐸𝐸 ) = 𝑀𝑀
οΏ½, which says that firms
expand employment to where the marginal product of effective labour input is equal to the
cost per unit of effective labour input. Next, given demand for effective labour 𝐸𝐸, firms
would like to minimise the cost of eliciting this labour input from workers. This is because
profits πœ‹πœ‹ = 𝐹𝐹 (𝐸𝐸 ) − 𝑀𝑀
�𝐸𝐸 are decreasing in 𝑀𝑀
οΏ½. The profit-maximising choice of 𝑀𝑀 equivalent to
minimising 𝑀𝑀
οΏ½, and the first-order condition for minimising 𝑀𝑀
οΏ½ = 𝑀𝑀/𝑒𝑒(𝑀𝑀) is:
1
𝑀𝑀𝑒𝑒 ′ (𝑀𝑀)
πœ•πœ•π‘€π‘€
οΏ½
=
−
=0
𝑒𝑒(𝑀𝑀)2
πœ•πœ•πœ•πœ• 𝑒𝑒(𝑀𝑀)
This is equivalent to 𝑒𝑒 ′ (𝑀𝑀) = 𝑒𝑒(𝑀𝑀)⁄𝑀𝑀, that is, firms should raise wages to where the
marginal effect on 𝑒𝑒(𝑀𝑀) equals the average amount of labour input 𝑒𝑒(𝑀𝑀) per unit of wage
𝑀𝑀. Intuitively, firms have an incentive to set 𝑀𝑀 at the level 𝑀𝑀 ∗ where 𝑒𝑒(𝑀𝑀)/𝑀𝑀 is maximised,
which is equivalent to minimising 𝑀𝑀
οΏ½. Geometrically, this corresponds to where a ray from
the origin is tangent to the effective labour input function 𝑒𝑒(𝑀𝑀), as depicted in Figure 5.6.
Figure 5.6: Profit-maximising efficiency wage
The profit-maximising efficiency wage 𝑀𝑀 ∗ is where this tangency occurs, noting that the
initial convexity and subsequent concavity of 𝑒𝑒(𝑀𝑀) imply such a point exists. The inverse of
the gradient of the tangent and ray from the origin gives the effective cost 𝑀𝑀
οΏ½ of a unit of
effective labour input.
168
EC2065 Macroeconomics | Chapter 5: Unemployment
Since firms demand labour input 𝐸𝐸 up to the point where 𝐹𝐹 ′ (𝐸𝐸 ) = 𝑀𝑀
οΏ½, the implied demand
∗
for workers is 𝑁𝑁 = 𝐸𝐸/𝑒𝑒(𝑀𝑀 ). This is equivalent to finding 𝑁𝑁 where 𝑒𝑒(𝑀𝑀 ∗ )𝐹𝐹 ′ (𝑒𝑒(𝑀𝑀 ∗ )𝑁𝑁) = 𝑀𝑀 ∗ ,
which can be plotted as a downward-sloping labour demand curve with 𝑁𝑁 on the horizontal
axis. With wages set at 𝑀𝑀 ∗ , labour demand is 𝑁𝑁 ∗ , as seen in Figure 5.7. The supply of labour
𝑁𝑁 𝑠𝑠 depends on wages in usual way, but firms have no incentive to adjust the wage 𝑀𝑀 ∗ even
if there are unemployed workers when labour demand is 𝑁𝑁 ∗ . It is not in the interests of a
firm to hire at lower wages because the cost 𝑀𝑀
οΏ½ of effective labour input would actually be
higher.
Figure 5.7: Efficiency wage and unemployment
Is there any reason to expect the efficiency wage 𝑀𝑀 ∗ to be above the market-clearing real
wage 𝑀𝑀 ∗∗? If not, there would be no unemployment and, since all firms would pay the same
wage 𝑀𝑀 ∗ , all else being equal, no worker would have an incentive to exert effort. This is
because workers could obtain the sae pay in another job with no risk of unemployment risk.
Taking the moral-hazard argument for efficiency wages, there needs to be unemployment in
equilibrium to provide incentives. Note that we argued earlier that 𝑒𝑒(𝑀𝑀, 𝑀𝑀 ∗ , 𝑒𝑒) should be
increasing in unemployment 𝑒𝑒.
With an efficiency wage 𝑀𝑀 ∗ above the market-clearing wage 𝑀𝑀 ∗∗ , the theory explains the
persistence of some amount of unemployment. Conditional on the level of the efficiency
wage, unemployment fluctuates with shifts of labour demand. Figure 5.8 shows the effect of
a decline in labour demand. This leads to a drop in employment but does not change wages
and, consequently, unemployment is higher. A change in one of the determinants of firms’
optimal efficiency wage would also have implications for unemployment, with a higher
efficiency wage causing an increase in unemployment, all else being equal.
169
EC2065 Macroeconomics | Chapter 5: Unemployment
Figure 5.8: Decrease in labour demand
Box 5.1: Changes in firms’ ability to monitor workers
The moral-hazard argument for efficiency wages assumes that managers cannot
perfectly monitor workers on the job. But technological innovations may allow better
tracking of workers, which makes workers’ risk of being caught shirking greater. This
allows firms to elicit greater effort 𝑒𝑒 from workers for a given wage 𝑀𝑀, an upward
shift of the effective labour input function 𝑒𝑒(𝑀𝑀). Here we explore the labour-market
implications of this better monitoring technology.
For illustration, consider the following effort function:
𝑒𝑒(𝑀𝑀) = log
𝑀𝑀
π‘Žπ‘Ž
In this formula, π‘Žπ‘Ž is a positive parameter. It specifies the wage that must be exceeded
for workers to be motivated to exert effort on the job, that is, for 𝑒𝑒(𝑀𝑀) to be positive.
For 𝑀𝑀 ≤ π‘Žπ‘Ž, the level of effort is treated as zero. For wages greater than π‘Žπ‘Ž, the
function 𝑒𝑒(𝑀𝑀) is increasing and concave in 𝑀𝑀. Since no effort is exerted for 𝑀𝑀 below
π‘Žπ‘Ž, the function also satisfies the initial convexity requirement. A greater ease of
managers in monitoring workers can be represented by a lower value of π‘Žπ‘Ž because
this implies an upward shift of the effort function 𝑒𝑒(𝑀𝑀) = log 𝑀𝑀 − log π‘Žπ‘Ž.
We know that the profit-maximising efficiency wage 𝑀𝑀 ∗ is the solution of the
equation 𝑒𝑒 ′ (𝑀𝑀 ∗ ) = 𝑒𝑒(𝑀𝑀 ∗ )/𝑀𝑀 ∗ . Geometrically, this is where a ray from the origin is
tangent to the effort function, as depicted in Figure 5.9. For the function considered
here, the marginal effect of the wage on effort is 𝑒𝑒 ′ (𝑀𝑀) = 1/𝑀𝑀. Hence, the efficiency
wage 𝑀𝑀 ∗ is found where 𝑒𝑒(𝑀𝑀 ∗ ) = 1. In this case, it is optimal to elicit a fixed amount
of effort from each worker. Specifically, the equation 𝑒𝑒(𝑀𝑀 ∗ ) = 1 implies 𝑀𝑀 ∗ =
π‘Žπ‘Ž × exp(1). It can be seen immediately that 𝑀𝑀 ∗ and 𝑀𝑀
οΏ½ = 𝑀𝑀 ∗ /𝑒𝑒(𝑀𝑀 ∗ ) fall with π‘Žπ‘Ž as the
monitoring technology improves. In the figure, there is a parallel upward shift of the
effort function and the optimal efficiency wage falls from 𝑀𝑀1∗ to 𝑀𝑀2∗ .
170
EC2065 Macroeconomics | Chapter 5: Unemployment
Figure 5.9: Increase in firms' ability to monitor workers
A decline in 𝑀𝑀 ∗ implies a downward shift in the efficiency wage line in the labour-market
diagram in Figure 5.10. Since 𝑒𝑒(𝑀𝑀 ∗ ) = 1 for the particular functional form considered here,
the change in the monitoring technology does not shift the labour demand curve 𝑁𝑁 𝑑𝑑 . Since
the only change is the decline in 𝑀𝑀 ∗ , unemployment declines from π‘ˆπ‘ˆ1∗ to π‘ˆπ‘ˆ2∗ .
Figure 5.10: Labour-market implications of better monitoring technology
5.3 Search and matching in the labour market
A labour market that fails to clear through real wage adjustment is not essential to explain
unemployment. The search-and-matching approach to analysing the labour market offers
another way of understanding unemployment. In this approach, the labour market should
not be viewed as a centralised market that coordinates hiring of workers by firms. Instead,
171
EC2065 Macroeconomics | Chapter 5: Unemployment
there are individual workers and individual firms who have to search to form mutually
agreeable matches.
This view of the labour market emphasises heterogeneity among potential workers and
among jobs. Workers have different skills and jobs have different requirements. This means
it is not straightforward to match up jobs with suitable workers.
Search theories of the labour market also draw a distinction between stocks and flows.
Stocks refer to the number of people with particular labour-market statuses, such as those
who currently have jobs, or those who are currently unemployed. Flows refer to transitions
between different labour-market statuses, such as those who find a new job, or those who
lose or leave a job and become unemployed. The analysis of stocks and flows is absent from
the usual supply-and-demand approach to the labour market where it is implicitly assumed
flows are so rapid that only stocks (employment and unemployment) need to be
considered.
However, flows are interesting in their own right. Even with no change in overall
employment or unemployment, there is continual ‘churn’ in the labour market with
individuals making transitions between employment and unemployment, and back again.
Owing to the differences among potential workers and jobs, it takes time for firms to find an
‘acceptable’ worker to fill a particular job and it takes time for someone searching for work
to find an ‘acceptable’ job. In other words, the search process takes time because of limited
information. Furthermore, it is often not in interests of firms to accept the first potential
employee they see, or workers the first job they are offered.
Search theory studies this process of search and matching, the rates at which the flows
occur and the unemployment that results from them. The theory explains why both
unemployment and job vacancies coexist, i.e. unfilled jobs alongside people who want to
find jobs.
5.4 A model of job search
We now look at a model that describes the process of job search. Consider an unemployed
person looking for a job. The key idea is that not all jobs are the same. The person is not
suitable for all jobs, so not all job applications will result in job offers. Moreover, of the jobs
for which the person is acceptable to employers, the job offers are not all equally attractive.
For a particular job, the wage 𝑀𝑀 is a measure of how attractive it is but the ‘wage’ can be
given a broader interpretation and can include other features of the job that affect how
happy a person would be to have that job.
The person searches for jobs and sends applications but there is uncertainty about which
job offers will be received. Assume this uncertainty is represented by a probability
distribution of 𝑀𝑀 with cumulative distribution function 𝐹𝐹(𝑀𝑀). If a job offer is received, the
function 𝐹𝐹(𝑀𝑀) gives the probability that the offer has a wage less than or equal to 𝑀𝑀.
When someone has found and accepted a job, the person receives wage 𝑀𝑀 for as long as the
job lasts. Assume that people leave jobs at an exogenous rate 𝑠𝑠 per unit of time. This implies
that jobs last for 1/𝑠𝑠 units of time on average and creates the need for some ‘churn’ in the
172
EC2065 Macroeconomics | Chapter 5: Unemployment
labour market. The job separation rate 𝑠𝑠 represents dismissals, redundancies and workers
quitting jobs.
The value of currently having a job with wage 𝑀𝑀 is specified by the value function of
employment 𝑉𝑉𝑒𝑒 (𝑀𝑀). Value refers to the present value of all future payoffs starting from
having a job with wage 𝑀𝑀. The future payments in this present discounted sum are
discounted at some rate π‘Ÿπ‘Ÿ. The value function 𝑉𝑉𝑒𝑒 (𝑀𝑀) is increasing in the wage 𝑀𝑀 because it
is better to start from a job with a higher wage, all else equal. An example value function is
sketched in Figure 5.11.
Figure 5.11: The value function of employment
While unemployed, a person receives 𝑏𝑏 instead of a wage. This includes unemployment
benefits paid by the government and the value of the time not spent working beyond what
is needed to search for jobs. While unemployed and searching for jobs, a person obtains job
offers at rate 𝑝𝑝 per unit of time, each of which is an independent draw from the probability
distribution of 𝑀𝑀. It takes 1/𝑝𝑝 units of time on average to get a job offer. The difficult and
time-consuming process of searching for vacancies, submitting applications and going
through the recruitment process can be represented by a low value of 𝑝𝑝.
Once a job offer with wage 𝑀𝑀 is received, the person has to decide whether to accept. If it is
accepted, the person becomes employed and receives a payoff with present value 𝑉𝑉𝑒𝑒 (𝑀𝑀).
The person could decide not to accept the offer and remain unemployed. In that case, let 𝑉𝑉𝑒𝑒
denote the present value of all expected future payoffs starting from being unemployed. It
is rational to accept a job offer 𝑀𝑀 if 𝑉𝑉𝑒𝑒 (𝑀𝑀) ≥ 𝑉𝑉𝑒𝑒 . The range of job offers that are acceptable
can be found by comparing 𝑉𝑉𝑒𝑒 (𝑀𝑀) and 𝑉𝑉𝑒𝑒 in Figure 5.12. The value 𝑉𝑉𝑒𝑒 is independent of any
particular wage 𝑀𝑀 now because it is based on the expected value of job offers that could be
obtained in the future by continuing to search.
173
EC2065 Macroeconomics | Chapter 5: Unemployment
Figure 5.12: The reservation wage
The decision to accept or reject a job offer can be stated in terms of a reservation wage 𝑀𝑀 ∗
where 𝑉𝑉𝑒𝑒 (𝑀𝑀 ∗ ) = 𝑉𝑉𝑒𝑒 . In the diagram, the reservation wage is found where the value function
of employment 𝑉𝑉𝑒𝑒 (𝑀𝑀) intersects the value of unemployment 𝑉𝑉𝑒𝑒 . A job offer with wage 𝑀𝑀
should be accepted if 𝑀𝑀 ≥ 𝑀𝑀 ∗ and rejected if 𝑀𝑀 < 𝑀𝑀 ∗ because these ranges of 𝑀𝑀
correspond to where 𝑉𝑉𝑒𝑒 (𝑀𝑀) is respectively above and below 𝑉𝑉𝑒𝑒 .
Intuitively, a reservation wage captures the idea that people do not want to accept just any
job when better ones might be found by continuing to search. On the other hand, people do
not want to be so picky that they spend forever searching for a perfect job that they might
never find, forgoing the opportunity to earn a wage in an acceptable but not perfect job.
It is important to understand that the value of unemployment 𝑉𝑉𝑒𝑒 includes not only receiving
unemployment benefits 𝑏𝑏 but also the chance of receiving job offers while searching for
jobs, i.e. the value of search. This means 𝑉𝑉𝑒𝑒 > 𝑉𝑉𝑒𝑒 (𝑏𝑏), so the value of unemployment is more
than simply having a job that pays a wage equal to unemployment benefits 𝑏𝑏. It follows that
𝑀𝑀 ∗ > 𝑏𝑏, so the reservation wage lies above the level of unemployment benefits.
We now consider the implications of rational search behaviour for how quickly on average
people make the transition from unemployment to employment. Recall that job offers are
obtained at rate 𝑝𝑝 over time. The probability of a wage offer 𝑀𝑀 being less than the
reservation wage 𝑀𝑀 ∗ is 𝐹𝐹 (𝑀𝑀 ∗ ) because the cumulative distribution function 𝐹𝐹(𝑀𝑀 ∗ ) gives the
probability of 𝑀𝑀 ≤ 𝑀𝑀 ∗ . Therefore, the probability that an offer is accepted is 1 − 𝐹𝐹(𝑀𝑀 ∗ ). If 𝑓𝑓
denotes the average rate at people find jobs per unit of time then:
𝑓𝑓 = 𝑝𝑝(1 − 𝐹𝐹(𝑀𝑀 ∗ ))
The relationship between the job-finding rate 𝑓𝑓 and the reservation wage 𝑀𝑀 ∗ is illustrated in
Figure 5.13. The job-finding rate is a decreasing function of the reservation wage because a
higher reservation wage means people are more picky when searching for jobs and thus are
less likely to receive an acceptable offer. Mathematically, 𝐹𝐹(𝑀𝑀 ∗ ) increases with 𝑀𝑀 ∗ , so 1 −
𝐹𝐹(𝑀𝑀 ∗ ) declines with 𝑀𝑀 ∗ .
174
EC2065 Macroeconomics | Chapter 5: Unemployment
Figure 5.13: The job-finding rate
The expected time taken to find and accept a job is 1/𝑓𝑓. This expected time is longer when
offers are harder to get (low 𝑝𝑝), or when the reservation wage 𝑀𝑀 ∗ is high, meaning that
people are more picky (𝐹𝐹(𝑀𝑀 ∗ ) is low).
5.5 Stocks and flows in the labour market
The model of job search is designed to explain the rate at which people make the transition
from unemployment to employment on average. This describes one of the crucial ‘flows’
that underlies the search-and-matching approach to the labour market. Here, we analyse
the relationship between the stocks and the flows in the labour market and use this
determine equilibrium unemployment.
The two flows we will focus on are the transitions from unemployment to unemployment
and from employment to unemployment. The rate at which the first flow occurs is the jobfinding rate 𝑓𝑓 and the rate at which the second flow occurs is the job-separation rate 𝑠𝑠.
Taking as given these flow rates (using, for example, the model of job search to understand
the job-finding rate 𝑓𝑓), what are the implications for stocks in the labour market? By this we
mean the number of people with jobs, the number unemployed and the unemployment
rate. We will apply a method known as stock-flow accounting to determine equilibrium
stocks given the flow rates 𝑠𝑠 and 𝑓𝑓.
5.51 Stock-flow accounting
At a point in time, suppose the number of people currently unemployed is π‘ˆπ‘ˆ. Assume the
size of the labour force is 𝐿𝐿, which comprises those who are employed plus the unemployed
who are searching for jobs. Here, we assume the labour force is constant over time. This
ignores any changes in labour-market participation over time. The decision to participate in
the labour market was studied earlier in 0. Given 𝐿𝐿 and π‘ˆπ‘ˆ, by definition, the number of
people currently in jobs is 𝐿𝐿 − π‘ˆπ‘ˆ.
Given the job-separation rate 𝑠𝑠, over time there are inflows from employment to
unemployment of 𝑠𝑠(𝐿𝐿 − π‘ˆπ‘ˆ). This is because inflows from the group of size 𝐿𝐿 − π‘ˆπ‘ˆ occur at
rate 𝑠𝑠 per unit of time. Similarly, given the job-finding rate 𝑓𝑓, there are outflows from
175
EC2065 Macroeconomics | Chapter 5: Unemployment
unemployment to employment of 𝑓𝑓𝑓𝑓. Outflows from the group of size π‘ˆπ‘ˆ occur at rate 𝑓𝑓
over time.
Between two periods, stock-flow accounting says that the change in the number of people
unemployed is equal to inflows minus outflows. Letting π‘ˆπ‘ˆ′ denote the number of people
unemployed in the next period, the stock-flow accounting identity is:
π‘ˆπ‘ˆ ′ − π‘ˆπ‘ˆ = 𝑠𝑠 (𝐿𝐿 − π‘ˆπ‘ˆ) − 𝑓𝑓𝑓𝑓
Generally, we are more interested in the unemployment rate 𝑒𝑒 = π‘ˆπ‘ˆ/𝐿𝐿 rather than the
number of people unemployed. The stock-flow accounting identity can be transformed in
terms of unemployment rates by dividing both sides by 𝐿𝐿:
π‘ˆπ‘ˆ ′ − π‘ˆπ‘ˆ 𝑠𝑠 (𝐿𝐿 − π‘ˆπ‘ˆ) π‘“π‘“π‘ˆπ‘ˆ
=
−
𝐿𝐿
𝐿𝐿
𝐿𝐿
Since 𝐿𝐿′ = 𝐿𝐿, the unemployment rate is 𝑒𝑒′ = π‘ˆπ‘ˆ ′ /𝐿𝐿 next period, and thus:
𝑒𝑒′ − 𝑒𝑒 = 𝑠𝑠(1 − 𝑒𝑒) − 𝑓𝑓𝑓𝑓
This equation states that the inflow to unemployment increases the unemployment rate by
𝑠𝑠(1 − 𝑒𝑒) and the outflow decreases the unemployment rate by 𝑓𝑓𝑓𝑓. The relationship
between the two flows (relative to the size of the labour force) and the unemployment rate
is depicted in Figure 5.14. The outflow 𝑓𝑓𝑓𝑓 is increasing in the unemployment rate because a
greater number of people will leave unemployment for the same 𝑓𝑓 when there are initially
more people unemployed. The inflow 𝑠𝑠(1 − 𝑒𝑒) is decreasing in 𝑒𝑒 because higher 𝑒𝑒 means
fewer people have jobs, so fewer will leave for a given job-separation rate 𝑠𝑠.
Figure 5.14: Inflows and outflows to and from unemployment
5.52 The equilibrium unemployment rate
When the outflow from unemployment matches the inflow, the unemployment rate
remains constant over time (𝑒𝑒′ = 𝑒𝑒). This is a steady state for the unemployment rate,
which we will denote by 𝑒𝑒∗ . A steady state can be found by writing down an equation for
inflows equal to outflows:
176
EC2065 Macroeconomics | Chapter 5: Unemployment
𝑠𝑠(1 − 𝑒𝑒∗ ) = 𝑓𝑓𝑒𝑒∗
This is a linear equation that can be rearranged to deduce 𝑠𝑠 = (𝑠𝑠 + 𝑓𝑓)𝑒𝑒∗ . Hence, there is a
unique steady state for the unemployment rate:
𝑒𝑒∗ =
𝑠𝑠
1
=
𝑠𝑠 + 𝑓𝑓 1 + 𝑓𝑓⁄𝑠𝑠
In Figure 5.14, this steady state corresponds to where the outflow and inflow lines intersect.
The outflow line passes through the origin and the inflow line intersects the horizontal axis
at 𝑒𝑒 = 1, so the steady state 𝑒𝑒∗ lies between 0 and 1. The inflows line is above outflows if
𝑒𝑒 < 𝑒𝑒∗ and outflows are above inflows if 𝑒𝑒 > 𝑒𝑒∗ . It follows that there is convergence over
time to the steady state 𝑒𝑒∗ . In this sense, we will describe the steady state 𝑒𝑒∗ as the
equilibrium unemployment rate implied by the search model. The time taken to converge to
the steady state is usually short enough (unlike the Solow model) that we can take the
steady-state unemployment rate to be the model’s prediction for unemployment.
To give an example, assume one unit of time is a month. Suppose that it takes unemployed
people two months on average to find and accept a job (1⁄𝑓𝑓 = 2), which means a jobfinding rate of 𝑓𝑓 = 1/2. Suppose that workers remain in jobs for four years (48 months) on
average (1⁄𝑠𝑠 = 48), which means a job-separation rate of 𝑠𝑠 = 1/48. Together, the ratio of
the flow rates is 𝑓𝑓⁄𝑠𝑠 = 24, and this yields the following steady-state unemployment rate:
𝑒𝑒∗ =
1
1
1
=
=
= 0.04 = 4%
1 + 𝑓𝑓⁄𝑠𝑠 1 + 24 25
5.53 Unemployment in the search-and-matching model
In summary, the search-and-matching model has an equilibrium (steady-state)
unemployment rate 𝑒𝑒∗ that depends on the inflow and outflow rates to and from
unemployment. A higher job-separation rate 𝑠𝑠 or a lower job-finding rate 𝑓𝑓 increase
unemployment.
While the job-separation rate 𝑠𝑠 is exogenous here, the job-finding rate 𝑓𝑓 was analysed in
the earlier job-search model, where 𝑓𝑓 = 𝑝𝑝(1 − 𝐹𝐹 (𝑀𝑀 ∗ )). The formula shows that a higher
reservation wage 𝑀𝑀 ∗ increases unemployment. Thus, the model can explain what is
sometimes called ‘voluntary’ unemployment that arises from unemployed people not being
willing to accept the first job they can find. As we have discussed, some degree of ‘pickiness’
is rational given the uncertainties in the search process. The formula for 𝑓𝑓 also implies a
lower rate 𝑝𝑝 of obtaining job offers increases unemployment. The model is also consistent
with some ‘involuntary’ unemployment that arises from frictions in matching potential
workers to jobs.
177
EC2065 Macroeconomics | Chapter 5: Unemployment
Box 5.2: The generosity of the welfare state
Governments can establish labour-market institutions that provide insurance to workers
who lose a job. In our earlier job-search model, we assumed that benefits 𝑏𝑏 are paid while a
person remains unemployed. The size of the benefits 𝑏𝑏 but also duration of time for which
they can be claimed, varies considerably across countries. Here, we will focus on the
implications of differences in the size of 𝑏𝑏.
A higher value of unemployment benefits 𝑏𝑏 increases the value of unemployment 𝑉𝑉𝑒𝑒 relative
to the value function of employment 𝑉𝑉𝑒𝑒 (𝑀𝑀). Ignoring the cost to taxpayers of providing 𝑏𝑏,
which affects the value of wages after tax and ignoring the effect on 𝑉𝑉𝑒𝑒 (𝑀𝑀) that comes from
workers considering the risk of unemployment in the future, the main effect is higher 𝑉𝑉𝑒𝑒
because receiving 𝑏𝑏 is the one of the components of the value 𝑉𝑉𝑒𝑒 .
As shown in Figure 5.15, the upward shift of the 𝑉𝑉𝑒𝑒 line raises the reservation wage 𝑀𝑀 ∗ . This
means people become more picky during their job search. Intuitively, there is less incentive
quickly to take a low-paying job offer. The higher reservation wage lowers the job-finding
rate 𝑓𝑓, which reduces the outflow from unemployment and implies a higher unemployment
rate.
Figure 5.15: Increase in benefits paid to unemployed
Since more generous unemployment insurance raises the equilibrium unemployment rate,
why do governments set up welfare states that provide unemployment insurance? While
the diagram above shows that 𝑉𝑉𝑒𝑒 and the average value of 𝑉𝑉𝑒𝑒 (𝑀𝑀) above 𝑀𝑀 ∗ are higher, this
analysis does not account for cost of the insurance, i.e. the higher taxes on wages needed to
pay the unemployment benefits, which must be set against the gains.
Aside from distributional considerations, the main reason for governments to provide
unemployment insurance is because there is a missing market for people to obtain private
insurance against unemployment risk. Such insurance is usually not available because of the
problem of moral hazard. While this analysis lies outside the domain of our search model,
publicly provided unemployment insurance can be good because it substitutes for the
missing market.
Does the search model offer any guidance on the design of unemployment insurance
beyond its role in substituting for a missing market? At first glance, the model points to a
cost of unemployment insurance in making178
people excessively picky when searching for
EC2065 Macroeconomics | Chapter 5: Unemployment
jobs. The argument is that people would already optimally trade off higher wages against
longer search in the absence of insurance, so the insurance is distorting search decisions by
shifting the cost of delaying taking a job on to the taxpayers who fund the unemployment
benefits.
However, this argument may not be valid when the unemployed have little financial wealth
and there are credit-market imperfections that limit private insurance and the ability to
borrow against future income. In this case, people obtain extremely low utility while
unemployed compared to receiving even a very low wage. This leads them to accept lower
paid jobs that are worse matches compared to what they would with perfect credit markets.
Unemployment insurance might then help produce better quality matches between jobs
and workers by giving people the ability to search more thoroughly.
Box 5.3: Wage dispersion and incentives to search for jobs
In the search-and-matching approach to understanding unemployment, differences
between jobs and differences between potential workers are crucial. This is what explains
why time is needed to find suitable matches and why accepting the first job offer, or hiring
the first applicant, is not generally the best strategy for unemployed people and firms.
One reason why some jobs are more attractive than others is the dispersion of wages across
jobs. We will analyse here how the extent of wage dispersion affects job search behaviour
and the equilibrium unemployment rate. Recall that the value 𝑉𝑉𝑒𝑒 is the present value of all
current and future payoffs received conditional on being initially unemployed. This includes
receiving unemployment benefits 𝑏𝑏 in the current period but also the value of search
through expectations of the future wages received once a job is found.
In the job-search model, those searching for jobs receive an offer with probability 𝑝𝑝. An
offer is a wage 𝑀𝑀 drawn from a probability distribution with cumulative distribution function
𝐹𝐹(𝑀𝑀). It is this probability distribution that reflects the dispersion of wages across jobs. With
𝑀𝑀 ∗ denoting the reservation wage, job offers are accepted if 𝑀𝑀 ≥ 𝑀𝑀 ∗ , which has probability
1 − 𝐹𝐹(𝑀𝑀 ∗ ), and the job-finding rate that results is 𝑓𝑓 = 𝑝𝑝(1 − 𝐹𝐹 (𝑀𝑀 ∗ )). The value of being
unemployed 𝑉𝑉𝑒𝑒 includes the expected present value of 𝑉𝑉𝑒𝑒 (𝑀𝑀) conditional on 𝑀𝑀 ≥ 𝑀𝑀 ∗ with
probability 𝑓𝑓 in the next period and the present value of 𝑉𝑉𝑒𝑒 with probability 1 − 𝑓𝑓.
We now consider a more dispersed distribution of wages. Suppose the average wage that is
offered remains the same but low-wage offers are now further below the average and highwage offers are further above the average. As a simple example, suppose that job-offers
have one of three possible wages:
•
•
•
Low wage 𝑀𝑀𝐿𝐿
Medium wage 𝑀𝑀𝑀𝑀 (the average job-offer wage)
High wage 𝑀𝑀𝐻𝐻 .
There is more dispersion if 𝑀𝑀𝐿𝐿 falls and 𝑀𝑀𝐻𝐻 rises by same amount.
179
EC2065 Macroeconomics | Chapter 5: Unemployment
Suppose the reservation wage is initially 𝑀𝑀𝑀𝑀 (𝑀𝑀 ∗ = 𝑀𝑀𝑀𝑀 ), so low-wage offers are rejected
and medium- and high-wage offers are accepted. Since job offers with 𝑀𝑀 = 𝑀𝑀𝐿𝐿 are not
accepted anyway, a lower value of 𝑀𝑀𝐿𝐿 does not worsen the expected payoff of the
unemployed. But as job offers with 𝑀𝑀 = 𝑀𝑀𝐻𝐻 are accepted, an increase in 𝑀𝑀𝐻𝐻 means 𝑉𝑉𝑒𝑒 (𝑀𝑀𝐻𝐻 )
is higher, so the expected payoff of those searching for jobs increases. Figure 5.16 illustrates
how the expected payoff from job search increases when wages are more dispersed. Note
that by drawing the value function 𝑉𝑉𝑒𝑒 (𝑀𝑀) as a straight line, we are assuming that people do
not inherently dislike uncertainty about what job offers they will receive.
Figure 5.16: More wage dispersion increases the value of search
The increase in the value of searching for jobs implies that 𝑉𝑉𝑒𝑒 rises. As seen earlier, a higher
value of 𝑉𝑉𝑒𝑒 leads to a higher reservation wage 𝑀𝑀 ∗ and that reduces the job-finding rate 𝑓𝑓.
Consequently, greater wage dispersion leads to more unemployment. This example thus
illustrates the important role of differences across jobs in explaining unemployment using
the search-and-matching approach.
5.6 Vacancies and unemployment
So far, we have studied the search problem in the labour market only from the perspective
of workers. That is, taking as given the job vacancies available, we looked at how workers
should search optimally and the unemployment rate that results from the process of search.
However, firms face a similar search problem when trying to fill vacancies, namely, the
challenge of finding a suitable person to fill a position. We need to understand the search
and matching problem from firms’ perspective to analyse job creation and wages.
Studying the search-and-matching problem for firms will also allow us to understand data
on vacancies and the vacancy rate. A vacancy is an unfilled job and the vacancy rate is the
ratio of vacancies to the sum of all filled and unfilled jobs (employment plus vacancies).
Figure 5.17 displays time series of the vacancy rate alongside the unemployment rate in the
180
EC2065 Macroeconomics | Chapter 5: Unemployment
USA from 2000 to mid-2021. The vacancy rate is usually lower than the unemployment rate,
being typically around 2–3 per cent. Nonetheless, significant numbers of vacancies co-exist
with significant numbers of people searching for jobs. Although vacancies and
unemployment co-exist, it is also apparent there is almost always a clear negative
relationship between the two.
Figure 5.17: The vacancy rate and the unemployment rate
The negative relationship between the vacancy and unemployment rates is shown as a
scatterplot in Figure 5.18. Most of the time, the points trace out a downward-sloping curve
known as a ‘Beveridge curve’. However, this relationship displays occasional shifts, for
example, in 2009. An even bigger shift occurs with the COVID crisis in 2020 and, as of
writing, it remains to be seen whether subsequent data points will return to the original
Beveridge curve, or a Beveridge curve in a new position.
Figure 5.18: The Beveridge curve
5.6.1 An equilibrium search model of unemployment
To explain the Beveridge curve and to understand which point on the Beveridge curve the
economy will reach, we will now consider an equilibrium search model of unemployment.
This is called an equilibrium search model because it determines the number of jobs firms
181
EC2065 Macroeconomics | Chapter 5: Unemployment
create and the level of wages, rather than taking these as given as we did earlier in the jobsearch model.
The equilibrium search model has three elements:
1. A ‘matching function’: a representation of the frictions in the search and matching
problem of workers and firms.
2. A wage-bargaining problem: there is no competitive labour market to determine
wages, so we must think of wages as coming from bargaining between workers and
firms.
3. A job-creation decision of firms: given the frictions in hiring and the wages that arise
from bargaining, how many jobs do firms want to create?
5.6.2 The matching function
The first element of the equilibrium search model is the matching function. This summarises
the process by which firms find suitable workers to fill vacancies and unemployed people
find acceptable jobs. Condensing all the details of the search done by individuals into a
single aggregate-level function is analogous to using a production function as an aggregatelevel summary of the economy’s production processes.
Mathematically, the matching function relates the ‘output’ of successful matches π‘šπ‘š per unit
of time to the ‘inputs’ of unemployed workers 𝑒𝑒 and vacant jobs 𝑣𝑣:
π‘šπ‘š = πœ‡πœ‡πœ‡πœ‡(𝑒𝑒, 𝑣𝑣)
For simplicity, we do not distinguish between the number of unemployed and the
unemployment rate, or the number of vacancies and the vacancy rate. In the expression
above, πœ‡πœ‡ is a parameter representing matching efficiency. This is analogous to total factor
productivity in a production function because higher πœ‡πœ‡ means a greater output of matches
for the same inputs of unemployed workers and firms’ vacant jobs. There are frictions in the
matching process so new matches do not generally occur so fast as to provide jobs for all
unemployed people instantaneously (π‘šπ‘š < 𝑒𝑒), or fill all vacant positions (π‘šπ‘š < 𝑣𝑣).
The matching function 𝑀𝑀(𝑒𝑒, 𝑣𝑣) is assumed to have the same properties as a neoclassical
production function:
1. Constant returns to scale: 𝑀𝑀(πœ†πœ†πœ†πœ†, πœ†πœ†πœ†πœ†) = πœ†πœ†πœ†πœ†(𝑒𝑒, 𝑣𝑣)
2. Positive but diminishing marginal products πœ•πœ•πœ•πœ•⁄πœ•πœ•πœ•πœ• and πœ•πœ•πœ•πœ•⁄πœ•πœ•πœ•πœ•
3. Inada conditions on marginal products.
An example is the Cobb-Douglas matching function with parameter 0 < πœ‚πœ‚ < 1:
π‘šπ‘š = πœ‡πœ‡π‘’π‘’πœ‚πœ‚ 𝑣𝑣1−πœ‚πœ‚
In the earlier job-search model, the job-finding rate 𝑓𝑓 depends on both the frictions in
obtaining job offers and the ‘pickiness’ of workers in not accepting all offers. Here, to avoid
making the analysis too complicated, the matching function is used to represent both the
difficulty of finding jobs and the fact that some jobs are better than others. A ‘match’ means
182
EC2065 Macroeconomics | Chapter 5: Unemployment
both a person receiving a job offer, so the worker is acceptable to the firm, and the offer
being acceptable to the worker. Therefore, the average job-finding rate 𝑓𝑓 is the number of
matches π‘šπ‘š divided by number of unemployed 𝑒𝑒:
𝑓𝑓 =
π‘šπ‘š πœ‡πœ‡πœ‡πœ‡(𝑒𝑒, 𝑣𝑣)
=
𝑒𝑒
𝑒𝑒
5.6.3 Explaining the Beveridge curve
To explain the empirical regularity of a negative relationship between vacancies and
unemployment, we combine our earlier stock-flow accounting exercise and its implications
for the steady-state unemployment rate with the newly introduced matching function.
Taking as given the job-separation rate 𝑠𝑠 and the job-finding rate 𝑓𝑓, the steady-state
unemployment rate is:
𝑒𝑒 =
𝑠𝑠
𝑠𝑠 + 𝑓𝑓
We do not distinguish between the actual and steady-state unemployment rates 𝑒𝑒 and 𝑒𝑒∗
because we suppose convergence to the steady state is sufficiently rapid that the difference
can be ignored. We assume the job-separation rate 𝑠𝑠 is exogenous here, as we did earlier in
the job-search model. The job-finding rate is 𝑓𝑓 = π‘šπ‘š/𝑒𝑒, with new matches given by the
matching function π‘šπ‘š = πœ‡πœ‡πœ‡πœ‡(𝑒𝑒, 𝑣𝑣).
Multiplying both sides of the equation by 𝑠𝑠 + 𝑓𝑓 and noting 𝑓𝑓𝑓𝑓 = π‘šπ‘š, it follows that vacancies
𝑣𝑣 and unemployment 𝑒𝑒 satisfy the equation 𝑠𝑠𝑠𝑠 + πœ‡πœ‡πœ‡πœ‡(𝑒𝑒, 𝑣𝑣) = 𝑠𝑠. The left-hand side is
increasing in both 𝑒𝑒 and 𝑣𝑣 and the right-hand side is a fixed parameter, hence, we deduce
that high vacancies 𝑣𝑣 must be associated with low unemployment 𝑒𝑒. A matching function
π‘šπ‘š = πœ‡πœ‡πœ‡πœ‡(𝑒𝑒, 𝑣𝑣) that is increasing in 𝑒𝑒 and 𝑣𝑣 and the labour-market stock-flow accounting are
sufficient to explain the Beveridge curve. This theoretical Beveridge curve relationship is
depicted in Figure 5.19. A change in the matching function π‘šπ‘š = πœ‡πœ‡πœ‡πœ‡(𝑒𝑒, 𝑣𝑣) or the parameter
𝑠𝑠 will modify this relationship and cause the Beveridge curve to shift.
Figure 5.19: The model-implied Beveridge curve
The equilibrium search model is consistent with the Beveridge curve (BC). It is able to
explain why vacancies and unemployment co-exist, and why there is usually a negative
183
EC2065 Macroeconomics | Chapter 5: Unemployment
relationship between the two. We now go further and analyse which point on the BC the
economy will reach. This is done by introducing the notion of ‘market tightness’, the ratio of
vacancies to unemployment.
5.6.4 Market tightness
While vacancies and unemployment co-exist, it is interesting to study the ratio between the
two. Market tightness πœƒπœƒ is defined as the ratio of vacancies 𝑣𝑣 to unemployment 𝑒𝑒:
πœƒπœƒ =
𝑣𝑣
𝑒𝑒
The labour market is ‘tight’ when firms have opened many vacancies relative to the number
of unemployed people, in other words, when there are many ‘buyers’ compared to ‘sellers’
of labour.
In the Beveridge curve diagram with 𝑣𝑣 and 𝑒𝑒 on the vertical and horizontal axes, market
tightness πœƒπœƒ is equal to the gradient of the ray from the origin to the point on the Beveridge
curve the labour market has reached. Using the constant-returns-to-scale property of the
matching function, market tightness determines the job-finding rate 𝑓𝑓:
𝑓𝑓(πœƒπœƒ) =
𝑒𝑒 𝑣𝑣
πœ‡πœ‡πœ‡πœ‡(𝑒𝑒, 𝑣𝑣)
= πœ‡πœ‡πœ‡πœ‡ οΏ½ , οΏ½ = πœ‡πœ‡πœ‡πœ‡(1, πœƒπœƒ)
𝑒𝑒
𝑒𝑒 𝑒𝑒
This follows because scaling the inputs to the matching function by 1/𝑒𝑒 is equivalent to
scaling the output of matches by 1/𝑒𝑒. Since the matching function is increasing in both
inputs, it follows that the job-finding rate 𝑓𝑓 is an increasing function of market tightness πœƒπœƒ.
All else equal, unemployed people will find acceptable jobs faster in a tight labour market.
In the equilibrium search model, market tightness and wages are simultaneously
determined by wage bargaining and firms’ job-creation decisions.
5.7 Wage bargaining
When a firm finds a suitable worker to fill a vacancy, the firm and worker need to agree a
wage. Similarly, workers already matched to jobs can negotiate over wages with their
employers. However, there are multiple wages consistent with the firm being willing to
employ the worker and the worker being willing to do the job. Setting the wage determines
how much each party receives of the gains from a deal between a firm and a worker. Note
that there is no competitive market to pin down wages using the requirement that demand
equals supply. In the search model, not all ‘demands’ or ‘supplies’ of firms and workers are
satisfied.
We will analyse wages by thinking about bargaining between firms and workers. We identify
the surpluses of the two parties, i.e. how much they stand to gain from a deal (the worker
accepting or remaining in a job and the firm employing or continuing to employ the worker)
relative to no deal (the worker looking for another job and the firm looking for another
worker).
184
EC2065 Macroeconomics | Chapter 5: Unemployment
Suppose a wage 𝑀𝑀 is agreed. A person who would remain or become unemployed if there
were no deal gains 𝑀𝑀 − 𝑏𝑏 per period from a deal, where 𝑏𝑏 is the level of unemployment
benefits. A worker thus has a surplus of 𝑀𝑀 − 𝑏𝑏 from employment. Assume each worker
produces goods of value 𝑦𝑦 per period once employed, hence, a firm gains 𝑦𝑦 − 𝑀𝑀 per period
by employing an extra person. Assume that each period of time a job vacancy is unfilled, a
firm must incur a recruitment cost 𝑐𝑐 to try to fill the position. Firms thus stand to gain an
amount 𝑦𝑦 − 𝑀𝑀 + 𝑐𝑐 𝑣𝑣 ⁄𝑒𝑒 per extra worker they hire, so a firm’s surplus is 𝑦𝑦 − 𝑀𝑀 + 𝑐𝑐𝑐𝑐.
Both surpluses must be positive for a deal to be in the interests of both parties. The total
surplus, the sum of the gains to both parties, is:
(𝑀𝑀 − 𝑏𝑏) + (𝑦𝑦 − 𝑀𝑀 + 𝑐𝑐𝑐𝑐) = 𝑦𝑦 − 𝑏𝑏 + 𝑐𝑐𝑐𝑐
Observe that the wage 𝑀𝑀 cancels out from this expression because it is a transfer between
the two parties. If the total surplus is positive then there are wages 𝑀𝑀 where both parties
would gain from a deal. The total surplus is always positive under the weak requirement
that 𝑦𝑦 > 𝑏𝑏, meaning a worker’s output exceeds the value of unemployment benefits. We
assume this in what follows.
When the total surplus is positive, there are many possible wages that give both parties a
positive surplus. The split of the surplus is resolved by assuming a specific form of bargaining
known as Nash bargaining. This has the bargaining powers 𝛾𝛾 and 1 − 𝛾𝛾 of workers and firms
given by an exogenous parameter 𝛾𝛾 between 0 and 1. The bargaining powers are the shares
of the total surplus received by each party.
The wage 𝑀𝑀 that achieves a split of the total surplus giving workers a share 𝛾𝛾 is given by:
𝑀𝑀 − 𝑏𝑏 = 𝛾𝛾 (𝑦𝑦 − 𝑏𝑏 + 𝑐𝑐𝑐𝑐)
The wage resulting from Nash bargaining is thus:
𝑀𝑀 = (1 − 𝛾𝛾)𝑏𝑏 + 𝛾𝛾𝛾𝛾 + 𝛾𝛾𝛾𝛾𝛾𝛾
This equation is referred to as the wage curve (WC). It implies a positive relationship
between market tightness πœƒπœƒ and the bargained wage 𝑀𝑀. In Figure 5.20, the upward-sloping
wage curve is drawn in a diagram with πœƒπœƒ on the horizontal axis and 𝑀𝑀 on the vertical axis.
Intuitively, a tighter labour market makes the cost to a firm of finding a substitute or
replacement for a worker more expensive because it will take longer to fill an open vacancy.
This puts workers in a stronger bargaining position.
It can be seen from the wage curve equation that WC shifts upwards if 𝑏𝑏 increases.
Intuitively, higher unemployment benefits raises the outside option of workers when
bargaining. An increase in productivity 𝑦𝑦 shifts WC upwards because this increases the gains
to a firm from striking a deal.
185
EC2065 Macroeconomics | Chapter 5: Unemployment
Figure 5.20: The wage curve
5.8 Job creation
The wage curve shows that bargaining between workers and firms implies a relationship
between wages and labour-market tightness. To determine which point on wage curve is
reached, it is necessary to analyse how many job vacancies firms want to create given the
wages that will arise from bargaining.
After filling a vacancy, a worker subsequently produces 𝑦𝑦 and is paid 𝑀𝑀 and the employment
relationship comes to an end at rate 𝑠𝑠 over time, where 𝑠𝑠 is the job-separation rate. If
future profits are discounted at rate π‘Ÿπ‘Ÿ, the expected present value of the profits once a
vacancy is filled is (𝑦𝑦 − 𝑀𝑀)/(π‘Ÿπ‘Ÿ + 𝑠𝑠).
This expected gain from filling a vacancy needs to be compared to the expected cost.
Suppose vacancies are filled at rate π‘žπ‘ž over time, which means that the expected time taken
to fill a vacancy is 1/π‘žπ‘ž. The firm faces a recruitment cost 𝑐𝑐 each period the vacancy remains
unfilled, so the expected cost of filling the vacancy is 𝑐𝑐/π‘žπ‘ž.
The matching function π‘šπ‘š = πœ‡πœ‡πœ‡πœ‡(𝑒𝑒, 𝑣𝑣) implies that the vacancy-filling rate π‘žπ‘ž is a function of
market tightness πœƒπœƒ using the constant-returns-to-scale property:
π‘žπ‘ž(πœƒπœƒ) =
π‘šπ‘š πœ‡πœ‡πœ‡πœ‡(𝑒𝑒, 𝑣𝑣)
𝑒𝑒 𝑣𝑣
=
= πœ‡πœ‡πœ‡πœ‡ οΏ½ , οΏ½ = πœ‡πœ‡πœ‡πœ‡(πœƒπœƒ −1 , 1)
𝑣𝑣
𝑣𝑣
𝑣𝑣 𝑣𝑣
Since the matching function is increasing in both inputs, it follows that the vacancy-filling
rate π‘žπ‘ž is decreasing in market tightness πœƒπœƒ. Therefore, the expected cost 𝑐𝑐/π‘žπ‘ž of filling a
vacancy is higher in a tighter market because it is harder for firms to find suitable workers
quickly.
Firms gain from creating more jobs and thus opening up additional vacancies if the expected
gain (𝑦𝑦 − 𝑀𝑀)/(π‘Ÿπ‘Ÿ + 𝑠𝑠) exceeds the expected cost 𝑐𝑐/π‘žπ‘ž(πœƒπœƒ). If firms open up more vacancies
then market tightness πœƒπœƒ = 𝑣𝑣/𝑒𝑒 increases, which pushes up the expected cost 𝑐𝑐/π‘žπ‘ž(πœƒπœƒ). This
means jobs are created up to the point where:
𝑦𝑦 − 𝑀𝑀
𝑐𝑐
=
π‘Ÿπ‘Ÿ + 𝑠𝑠 π‘žπ‘ž(πœƒπœƒ)
186
EC2065 Macroeconomics | Chapter 5: Unemployment
This equation implies a negative relationship between wages 𝑀𝑀 and tightness πœƒπœƒ.
Geometrically, it is represented in Figure 5.21 by a downward-sloping job-creation (JC) curve
drawn on the same axes as the wage curve. Intuitively, lower wages lead to more job
creation and a tighter labour market, all else equal. The JC curve shifts upwards if
productivity 𝑦𝑦 increases, and downwards if recruiting costs 𝑐𝑐 rise.
Figure 5.21: The job-creation curve
The equilibrium search model is completed by putting together the upward-sloping wage
curve (WC) and the downward-sloping job-creation curve (JC) in a diagram with market
tightness πœƒπœƒ on the horizontal axis and wages 𝑀𝑀 on the vertical axis. This is shown in the left
panel of Figure 5.22. The intersection between WC and JC determines equilibrium wages 𝑀𝑀 ∗
and labour market tightness πœƒπœƒ ∗ . Once market tightness is known, this determines the
gradient of the ray from the origin to the Beveridge curve and, hence, the equilibrium levels
of vacancies and unemployment in the right panel of the figure.
Figure 5.22: Equilibrium wages and market tightness
187
EC2065 Macroeconomics | Chapter 5: Unemployment
Box 5.4: Mismatch
Economies sometime faces shocks that require large reallocations of labour between
different sectors. Examples of this include the end of the construction boom in the USA after
the 2008 financial crisis and, more recently, with shifting patterns of demand for goods and
services during the COVID pandemic causing changes in the relative demands for hiring
workers in different sectors. Given the existing skills and experience of workers, these shifts
lead to greater mismatch between jobs and the unemployed. In the search-and-matching
model, we can represent the increase in mismatch by a decline in the efficiency parameter πœ‡πœ‡
of the matching function π‘šπ‘š = πœ‡πœ‡πœ‡πœ‡(𝑒𝑒, 𝑣𝑣).
A decline in matching efficiency πœ‡πœ‡ shifts the Beveridge curve relationship between
unemployment 𝑒𝑒 and vacancies 𝑣𝑣. The job-finding rate is 𝑓𝑓(πœƒπœƒ) = πœ‡πœ‡πœ‡πœ‡(1, πœƒπœƒ) conditional on
market tightness πœƒπœƒ = 𝑣𝑣 ⁄𝑒𝑒. The unemployment rate is determined by 𝑒𝑒 = 𝑠𝑠⁄(𝑠𝑠 + 𝑓𝑓 (πœƒπœƒ)),
and since 𝑓𝑓(πœƒπœƒ) = πœ‡πœ‡πœ‡πœ‡(𝑒𝑒, 𝑣𝑣)/𝑒𝑒, this is equivalent to the equation πœ‡πœ‡πœ‡πœ‡(𝑒𝑒, 𝑣𝑣) = 𝑠𝑠(1 − 𝑒𝑒).
Therefore, a lower πœ‡πœ‡ implies a higher 𝑒𝑒 for any given level of vacancies 𝑣𝑣, which means the
Beveridge curve shifts to the right as shown in Figure 5.23. Increases in mismatch thus
provide an explanation of the occasional shifts of the empirical Beveridge curve that are
observed.
An increase in mismatch also affects the job-creation curve (JC). Lower matching efficiency πœ‡πœ‡
implies a lower vacancy-filling rate π‘žπ‘ž (πœƒπœƒ) = πœ‡πœ‡πœ‡πœ‡(𝑒𝑒, 𝑣𝑣)⁄𝑣𝑣 = πœ‡πœ‡πœ‡πœ‡ (πœƒπœƒ −1 , 1) conditional on
market tightness πœƒπœƒ. This causes a downward shift of the JC curve as shown in Figure 5.23.
Intuitively, firms are less willing to create jobs if it is more difficult to recruit workers.
In equilibrium, the diagram shows wages 𝑀𝑀 and market tightness πœƒπœƒ must fall. Lower πœƒπœƒ,
which reduces the gradient of the ray from the origin to the Beveridge curve, and the shift
of Beveridge curve to the right imply higher unemployment 𝑒𝑒, but have an ambiguous
overall effect on vacancies 𝑣𝑣.
Figure 5.23: Increase in mismatch
188
EC2065 Macroeconomics | Chapter 5: Unemployment
Box 5.5: The bargaining power of workers
The search-and-matching approach to the labour market has wages determined not by the
usual forces of demand and supply but through bargaining between workers and firms. This
means that changes in the bargaining powers of workers and firms have implications for
unemployment and the labour market.
The wage curve (WC), representing the outcome of the bargaining process, is 𝑀𝑀 =
(1 − 𝛾𝛾)𝑏𝑏 + 𝛾𝛾𝛾𝛾 + 𝛾𝛾𝛾𝛾𝛾𝛾. In this equation, the parameter 𝛾𝛾 represents the bargaining power of
workers (0 ≤ 𝛾𝛾 ≤ 1). The interpretation of the parameter is that 𝛾𝛾 and 1 − 𝛾𝛾 are the shares
of the surplus from a successful match received by the worker and the firm respectively. A
reduction in workers’ bargaining power, lower 𝛾𝛾, implies a downward shift and a flattening
of the wage curve as shown in Figure 5.24.
Figure 5.24: Lower bargaining power of workers
The shift down of the wage curve reduces wages 𝑀𝑀 but equilibrium market tightness πœƒπœƒ rises
moving along the job-creation curve (JC). Higher πœƒπœƒ increases the gradient of the ray from the
origin to the Beveridge curve (BC), so this results in a movement up the Beveridge curve
with firms creating more vacancies 𝑣𝑣 and the unemployment rate 𝑒𝑒 falling.
As well as changing the average values of 𝑀𝑀, πœƒπœƒ, 𝑣𝑣, and 𝑒𝑒, the flatter wage curve also makes
market tightness πœƒπœƒ more responsive to shifts of the job-creation curve. Movements along
the Beveridge curve would become larger, so unemployment 𝑒𝑒 becomes more volatile
whenever the job-creation curve shifts. This is because wages are tied more closely to
workers’ fixed outside option 𝑏𝑏, so are effectively ‘stickier’ and less responsive to shocks.
Box 5.6: ‘Furlough’ policies in the COVID pandemic
Following the outbreak of the coronavirus pandemic in 2020, some governments introduced
job-support (‘furlough’) schemes giving firms incentives to retain workers. Such policies
were used in UK and some other European countries. In the absence of a policy of this kind,
it was expected there would have been a huge increase in the unemployment rate, as was
seen in the USA.
189
EC2065 Macroeconomics | Chapter 5: Unemployment
In the search-and-matching model, we could think of the disruption from the
pandemic and the ‘lockdowns’ that were imposed as causing a large temporary rise in
job-separation rate 𝑠𝑠 in the absence of a job-support scheme. One justification for
having such schemes was to avoid the costs of the large-scale rehiring of workers that
would be needed once the pandemic is over. But does the search-and-matching
model support this argument?
Without a job-support scheme in place, suppose the effect of the pandemic is a
temporary shutdown of employment in some sectors of the economy and a large rise
of the unemployment rate 𝑒𝑒. We assume there are no long-run effects: all exogenous
variables return to their former values once the pandemic over. This analysis ignores
any structural changes or mismatch that might also result from the pandemic. After
the pandemic, the job-creation curve and the wage curve are in their original
positions and the economy would go back to the same levels of wages 𝑀𝑀 ∗ and market
tightness πœƒπœƒ ∗ . This position is depicted in Figure 5.25.
However, even if market tightness returns to the same equilibrium value πœƒπœƒ ∗ once the
shutdown is over, the unemployment rate 𝑒𝑒 does not immediately go back to the
steady state 𝑒𝑒∗ . Time is required for the many unemployed people to find jobs again
and for firms to hire staff again. For each person out of a job during the transition
back to 𝑒𝑒∗ , an amount of production 𝑦𝑦 is lost, implying a net loss 𝑦𝑦 − 𝑏𝑏 at each point
in time the person is unemployed. For each additional vacancy 𝑣𝑣 unfilled above the
steady state 𝑣𝑣 ∗ at a point in time, extra hiring costs of 𝑐𝑐 are incurred by firms. Hence,
at a point in time where the unemployment rate is 𝑒𝑒 and vacancies are 𝑣𝑣, the loss is
(𝑒𝑒 − 𝑒𝑒∗ ) + 𝑐𝑐(𝑣𝑣 − 𝑣𝑣 ∗ ).
Having a job-support scheme would avoid some or all of these losses, so we can
assess the gains from the scheme by adding up all the losses from its absence. Note
that because 𝑣𝑣 = πœƒπœƒ ∗ 𝑒𝑒 at all times during the recovery, the loss can be simplified to
(𝑦𝑦 − 𝑏𝑏 + π‘π‘πœƒπœƒ ∗ )(𝑒𝑒 − 𝑒𝑒∗ ) at a point when the unemployment rate is 𝑒𝑒.
The dynamics of unemployment rate are given by 𝑒𝑒′ − 𝑒𝑒 = 𝑠𝑠(1 − 𝑒𝑒) − 𝑓𝑓𝑓𝑓, and the
steady state is the solution of the equation 𝑠𝑠(1 − 𝑒𝑒∗ ) = 𝑓𝑓𝑒𝑒∗ . Hence, the dynamics of
the gap between unemployment 𝑒𝑒 and its steady-state value 𝑒𝑒∗ are:
𝑒𝑒′ − 𝑒𝑒∗ = (1 − (𝑠𝑠 + 𝑓𝑓))(𝑒𝑒 − 𝑒𝑒∗ )
This says that the gap between 𝑒𝑒 and 𝑒𝑒∗ is closed at rate 𝑠𝑠 + 𝑓𝑓 per unit of time,
implying the average duration of the deviation of 𝑒𝑒 from 𝑒𝑒∗ is 1/(𝑠𝑠 + 𝑓𝑓). The
dynamics of the adjustment back to steady state are illustrated in Figure 5.25.
If 𝑒𝑒1 is the unemployment rate at the point in time when the shutdown is over, the
total costs of recovery back to steady state are approximately
(𝑦𝑦 − 𝑏𝑏 + π‘π‘πœƒπœƒ ∗ )(𝑒𝑒1 − 𝑒𝑒∗ )⁄(𝑠𝑠 + 𝑓𝑓). This is what is gained by having the job-support
scheme that avoids the rise in 𝑒𝑒.
190
EC2065 Macroeconomics | Chapter 6: Money
Chapter 6: Money
Up to this point, our study of macroeconomics has focused only on real variables. In ignoring
any reference to money, we have implicitly assumed individuals do not suffer ‘money
illusion’ and that they are able to make decisions solely with reference to real values and
relative prices. But are there reasons why money matters that are missing from our earlier
analysis? In other words, why is money important for the functioning of markets?
Essential reading
•
Williamson, Chapters 12 and 18.
6.1 Why does money matter?
We will explore three reasons why money matters. The first is that money is used as a
means of payment. Trade between individuals and firms in the economy depends on having
an object such as money that serves as a medium of exchange. The second reason money
matters is known as ‘nominal rigidity’. Some market prices are quoted in units of money and
slow to adjust, or are set in contracts that are renegotiated only infrequently. A third is that
changes in the value of money affect the willingness of individuals to hold money. The
reasons that money matters in an economy are closely related to the three functions of
money: medium of exchange; unit of account; and store of value.
Money functions as a medium of exchange because money is accepted as payment for
goods and services. Direct barter exchange of different goods, or of labour for goods, is
inconvenient and difficult.
In addition, money functions as a unit of account. It is convenient to quote prices in terms of
money instead of relative prices among a huge range of different goods. Money is also the
conventional unit of account used to specify wages in employment contracts and
repayments in debt contracts.
Finally, money functions as a store of value. Money is an asset that can be used to transfer
purchasing power over time. However, all assets act as stores of value, so this function is
not particular to money. We will focus on the two special functions of money, medium of
exchange and unit of account, with most of this chapter devoted to money as a medium of
exchange.
6.1.1 Medium of exchange
The medium of exchange function of money arises from the problem of the absence of a
double coincidence of wants. Among two people, there is said to be only a single
coincidence of wants if person A wants a good person B has but person B does not want any
good that person A has. A double coincidence of wants occurs when both person A and
191
EC2065 Macroeconomics | Chapter 6: Money
person B have a good that the other person wants. In a specialised economy producing a
vast range of goods and services, a double coincidence of wants is rare.
For trade to take place in markets, it must be in interests of both parties. Since double
coincidences of wants are hard to find, barter exchange is difficult. Exchange is made easier
if one party is willing to accept money because then trade requires only a single coincidence
of wants.
6.1.2 Unit of account
The unit of account function of money results from there being too many relative prices to
quote directly among all the different goods and factors of production in an economy. It is
convenient to express all prices as amounts of money. Similarly, in contracts written to
govern long-term employment and creditor-debtor relations, it is convenient to specify
payments in terms of a conventional unit of account.
In Chapter 9, we will further argue that prices may sometimes remain fixed even when
conditions in a market or the economy change. This is because it is costly and inconvenient
to update prices continuously. The same applies to contracts that are costly to renegotiate
or to write with many contingencies in advance.
6.1.3 Different objects that serve as money
We have defined money in terms of its functions but what objects can serve as money? The
common forms of money in use today are notes and coins issued by governments or central
banks that do not have intrinsic value, i.e. they are not valued for the material from which
they are made. This type of money is known as ‘fiat’ currency. The other type of money
commonly in use today is deposits at commercial banks. These deposits are claims to fiat
currency, so this a type of ‘credit’ money. Commercial banks themselves hold fiat money as
vault cash or reserves at the central bank, which is another form of fiat money.
There are also some new or experimental forms of money that are yet not in general use
but may become more widespread in the next decade. These include cryptocurrencies,
which are a private and decentralised system of money. Central banks also have plans to set
up central-bank digital currencies, which are a centralised system of accounts where
individuals hold money directly at a central bank.
Historically, money took other forms that are now extremely rare. These include commodity
money, where coins are made from precious metals, or notes issued by the government
were redeemable for precious metals on demand. There were also private bank notes, a
form of credit money, which were claims to commodity money.
6.2 A search-theory perspective on money
To understand the medium of exchange function of money, it helps to step away from our
earlier models that assumed trade in the economy takes place in centralised markets. The
assumption was that everyone can buy or sell in markets subject only to a budget
constraint. The sequence of transactions was irrelevant – all that mattered was that each
person’s overall budget constraint was satisfied.
192
EC2065 Macroeconomics | Chapter 6: Money
However, unless there is a double coincidence of wants for all trade, this implicitly assumed
a very high degree of coordination, or that short-term credit is freely available and works
without any friction. The reason is that a budget constraint allows for purchases in a period
even though people may not have yet received payment for what they plan to sell. An
alternative approach is known as ‘search’ theory. In a search model, all trade is
decentralised and occurs in meetings between pairs of individuals rather than in centralised
markets.
6.2.1 A simple search model of money
We can illustrate the ideas of search theory in a simple model with three types of
individuals. Think of these individuals as having different occupations, so that they specialise
in producing different goods. Moreover, individuals have different needs and tastes, so their
preferences are not the same. We assume no double coincidences of wants to highlight the
usefulness of money. A specific example of three individuals is given in Figure 6.1.
In a search model there are no competitive, centralised markets. All trade must be bilateral,
meaning that it takes place between pairs of individuals. To keep the analysis simple,
assume only one indivisible unit of each of the goods and services can be produced. This
avoids the need to discuss prices at this stage because any trade that takes place must
involve one unit of goods being purchased or sold.
In the absence of any double coincidence of wants, no trade can take place between any
pairs of individuals. However, this is inefficient. If all three individuals could meet and
coordinate a three-way exchange centrally then all three could have their wants met by one
of the others.
Figure 6.1: The absence of a double coincidence of wants
6.2.1 Commodity money
In the example in Figure 6.1, everyone produces a service, which cannot be stored. Once we
allow for physical goods, it is possible that one or more such goods can become a
commodity money. A commodity money has intrinsic value because of what it is made of, so
it would have a value even if it were not used as money. But, crucially, a commodity money
is accepted for payments even by those who do not want to consume the commodity itself.
193
EC2065 Macroeconomics | Chapter 6: Money
Figure 6.2 below modifies the earlier example so that one person can produce a physical
good and one person wants to consume that good. However, there is still no double
coincidence of wants for direct barter exchange. But all trade is possible if everyone accepts
the physical good in exchange for what they produce even if they do not want to consume
it. This enables the physical good to serve as a commodity money.
For a good to serve as a commodity money, it should be:
•
•
•
•
Easily storable at low cost, potentially for long periods of time
Easily transportable
Straightforward to verify the quality of the good
Easily divisible, for when exchange is not one-for-one.
Figure 6.2: Trade with commodity money
In the past, precious metals were a common form of commodity money, which satisfy the
first two of these requirements well. The use of coinage and convertible notes or tokens
added extra convenience, helping to satisfy the third and fourth requirements.
The advantage of a system of commodity money is that the limited supply and intrinsic
value of the commodity should give confidence that money will be a good store of value, or
at least not too bad. The disadvantage is that the system ties up valuable goods as money,
which either cannot be used directly, or there must be extra production of the commodity,
which has a cost.
6.2.2 Credit money
An alternative to a system of commodity money is to use credit money. Credit money is
where privately issued IOUs circulate as money. An IOU is a debt i.e. a promise to deliver a
payment in the future.
The intended purpose of an IOU is a simple credit instrument, where, say, person A offers an
IOU to person B in exchange for something, which is accepted. Person A then later redeems
the IOU, giving person B what is owed.
194
EC2065 Macroeconomics | Chapter 6: Money
But IOUs could in principle become money if the initial holder uses it to make a payment to
someone else and then that person might pass it on to someone else as well. Consequently,
the IOU is held by a third party at redemption.
Figure 6.3 returns to the example with three individuals who produce only services. Since
services must usually be consumed at the point they are produced, this rules out the use of
commodity money. However, if everyone is willing to accept someone’s IOU, that IOU can
circulate as money. Through the use of this credit money, all three individuals are able to
purchase the services they desire.
Figure 6.3: Trade with private IOUs
6.2.3 Money and credit
The example of a private IOU circulating as money shows that there is often a connection
between money and credit. However, it would be a mistake to see money and credit as the
same thing. First, not all types of money are credit. For example, in the example with
commodity money, no-one owes anything to anyone else. As we will see, fiat money is also
not a debt that the government is obliged to repay.
Second, far from all credit ever becomes money. Very few individuals are sufficiently well
known that their IOUs could circulate as money. To use an IOU as credit money, everyone
who would subsequently hold the IOU needs to know and trust the issuer, in addition to the
first person to accept the IOU, which is all that would be required if the IOU were used as a
simple credit instrument. Only in the smallest communities are these requirements likely to
be met for circulation of individuals’ own IOUs.
Considering these difficulties, for credit to serve as money, the IOUs need to be issued by
large, well-known and trusted companies or organisations. In practice, this means banks.
Historically, bank IOUs took the form of their own issue of banknotes, which were promises
to repay deposits of commodity money. In the modern world, bank IOUs typically take the
form of deposits, which are claims to fiat money.
195
EC2065 Macroeconomics | Chapter 6: Money
If banks have created IOUs that are accepted for payments then it is easy to see how these
can be used to facilitate exchange among the three individuals in the earlier example. Thus,
trade can take place using credit money issued by banks even if the individuals in the
economy cannot persuade others to use their own IOUs as money.
A system of credit money has some important advantages. It is efficient system with a low
resource cost because it does not tie up goods with intrinsic value to be used as money.
Even if bank IOUs are claims to commodity money, banks would not need to hold 100 per
cent of deposits as vault cash with intrinsic value. Furthermore, as we will see, banks can
loan deposits to support long-term investment. Some of the return on these investments
can be paid to depositors as interest, making bank deposits a better store of value.
However, credit money also has disadvantages. The biggest of these is that default by banks
on their IOUs may cause a collapse of confidence in the monetary system. Bank runs and
bank failure disrupt trade and trigger financial crises. Defaults by banks may be due to losses
made on their loans, or even caused by a bank run itself. These problems also lead to
pressure for bailouts from the government, creating a problem of moral hazard (‘too big to
fail’).
As we have discussed, individuals’ own IOUs cannot circulate as money. However, it is
possible that some forms of credit can be substitutes for money in making payments, for
example, credit cards. An individual paying with a credit card does not need to hold money
at the time of making a purchase, hence, this payment method acts as a substitute for
money. Essentially, the financial intermediaries that issue credit cards endorse individuals’
IOUs so others can be assured these debts will be repaid. The ability to use credit in this way
reduces frictions in payment because there is less need to hold money. However, such a
system of credit-based payments has costs for financial intermediaries coming from the
need to track credit histories and collect debt repayments.
6.2.4 Fiat money
Another form of money in widespread use is fiat money. This is defined as governmentissued money of no intrinsic value. The term ‘fiat’ suggests this money has value by
government decree but that is misleading because ultimately the real value of fiat money is
determined in markets. Fiat money is not credit money because it is not a claim to anything
other than itself – it is not redeemable as an IOU is. Historically, government-issued notes
may have been claims to commodity money but this is no longer the case. In accounting
terms, fiat money is recorded as a liability of the government or central bank but it is
important to remember that is quite unlike private-sector liabilities such as bonds or loans.
The physical form of fiat money is cash, comprising notes and coins of non-precious metal.
Commercial banks hold some fiat money as vault cash but in modern monetary systems, the
reserves of commercial banks are usually held in accounts at the central bank. In this form,
fiat money is only an entry in a database recording how much each commercial bank has on
‘deposit’ in its reserve account at the central bank. While currently households and firms do
not hold reserves directly, commercial banks can convert reserves and cash one-for-one.
196
EC2065 Macroeconomics | Chapter 6: Money
Figure 6.4 below shows how trade can take place using fiat money in the earlier example
with three individuals. Initially, someone holds a unit of fiat money and everyone accepts it
for payments. The fiat money circulates among the individuals and everyone can consume
the service they desire. Note that the fiat money remains in circulation after all the
exchanges have taken place. This implicitly assumes the fiat money will go on being used for
future trade.
Fiat money shares the advantage of credit money in being an efficient, low-cost system
because the intrinsically worthless money that is used has a negligible resource cost (there
are still some costs of production for the notes and coins, and costs of handling cash for the
private sector). It is important that individuals can easily recognise units of fiat money as
genuine but this can be achieved to a sufficient degree of accuracy with appropriate antiforgery devices.
Figure 6.4: Trade with fiat money
The potential disadvantages of fiat monetary systems also stem from fiat money lacking any
intrinsic value. As currency has a much lower cost for the government to produce it than its
market value and, as there is no obligation to redeem it, there is a temptation to issue more
fiat currency to raise revenue. The abuse of this money-issuing power by governments
results in money being a poor store of value and, at worst, hyperinflation.
Furthermore, because fiat money is not redeemable for anything other than itself, its value
depends on the belief that others will continue to accept it for payments. Note that in Figure
6.4, the fiat money is never withdrawn from circulation. This means that those choosing to
accept money must always believe that others will continue to accept money in the future.
In principle, such beliefs could be subject to self-fulfilling shifts because the belief that
others will not accept fiat money justifies individuals choosing not to accept it.
However, in practice, the concern about self-fulfilling losses of confidence in fiat currency
may be mitigated by the government’s power of taxation. Governments can insist on
payment of taxes in their own currencies, which ensures there is always some demand for
197
EC2065 Macroeconomics | Chapter 6: Money
money. The payment of taxes in fiat money to the government could also be seen as a
mechanism through which fiat money is withdrawn from circulation.
Box 6.1: Cryptocurrencies
Recent years have seen a rise to prominence of cryptocurrencies, most famously
Bitcoin, but now many others too. Cryptocurrencies are a type of privately created
money in an electronic form. The operation of cryptocurrencies is a decentralised
system, unlike the centralised control of fiat money by governments.
Nonetheless, cryptocurrencies share some features of fiat money. They have no
intrinsic value, which makes them very different from commodity money.
Cryptocurrencies are also not IOUs in any sense, which means they are not credit
money.
Proponents of cryptocurrencies have put forward several advantages. First, the
blockchain technology they build on makes transactions very secure. Second,
because the supply of a particular cryptocurrency is limited by design, it is argued
there is less risk of the cryptocurrency losing value due to oversupply – in contrast
to fiat money where governments have discretion to create more – making
cryptocurrencies a better store of value. However, limited supply is necessary but
not sufficient to preserve value, which also depends on a stable or growing demand
for a currency.
Critics of cryptocurrencies argue there are serious disadvantages. First, the value of
a cryptocurrency depends on others’ beliefs about its future value, which risks
significant volatility (in theory, this is also a drawback of fiat money). Values of
cryptocurrencies have indeed been extremely volatile, making them far from a
traditional risk-free asset. Second, as with cash, the anonymity allowed by
cryptocurrencies may facilitate criminal activity, although this anonymity might also
be valuable in fostering civil liberties. Third, there is the cost of the computing
power used to maintain the distributed ledger, implying that cryptocurrencies may
entail significant resource costs, a disadvantage shared with commodity money.
How do cryptocurrencies fit into our analysis of money? In this chapter, we will
usually think of money as an asset that serves as a medium of exchange but which
is less good compared to other assets as a store of value. But cryptocurrencies are
currently little used as a traditional medium of exchange – not many purchases of
goods and services use cryptocurrency. Cryptocurrencies have had high (although
volatile) rates of return, unlike traditional forms of money. In light of these
observations, it may be better to think of people holding cryptocurrencies as a
financial asset rather than as money in the usual sense of the term. With no
dividends and all returns coming from capital gains, one approach to analysing
cryptocurrencies is as ‘bubbles’ in the overlapping generations model.
198
EC2065 Macroeconomics | Chapter 6: Money
6.3 Money and assets as stores of value
Storing value is a function of money but this function is not unique to money. All assets
must act serve as stores of value to some extent and many often do this better than money.
For example, bonds may offer a real return π‘Ÿπ‘Ÿ from interest payments, shares pay dividends,
property earns rents and shares and property may benefit from capital gains.
Considering money, let π‘–π‘–π‘šπ‘š denote the return on holding money for a period in terms of units
of money itself. For example, if money is interpreted as funds in a bank account, π‘–π‘–π‘šπ‘š is the
interest rate paid on deposits. If money is cash for which no interest is paid then π‘–π‘–π‘šπ‘š = 0.
Note that π‘–π‘–π‘šπ‘š is a nominal interest rate and a nominal return: it is the percentage increase in
the amount of money held simply by holding on it for some amount of time.
The terminology we will use throughout this chapter is that nominal refers to something
measured in units of money, while real refers to something measured in units of goods. If
the nominal return on money is π‘–π‘–π‘šπ‘š , what is the implied real return? The real return on
money, and nominal assets more broadly, depends on the inflation rate in the economy.
6.3.1 Inflation
Inflation is defined as a general rise in prices quoted in terms of money. Inflation affects
how good or bad money is as a real store of value.
In previous chapters, we have measured real variables in terms of a homogeneous good or
basket of goods. Let 𝑃𝑃 denote the price of this good, or basket of goods, in terms of units of
money. If the price level in the current period is 𝑃𝑃, the notation for the price level in the
next period is 𝑃𝑃′. The rate of inflation between these time periods is denoted by πœ‹πœ‹:
πœ‹πœ‹ =
𝑃𝑃′ − 𝑃𝑃
𝑃𝑃
Note that this definition of inflation refers to the percentage change in prices between the
current level and the level that will prevail in the future. It is also possible to measure
inflation between the past and current periods and where that inflation rate is relevant, the
notation will be adjusted to accommodate it. Note also that the future price level 𝑃𝑃′ and the
resulting inflation rate are not known in current period. Where the distinction between and
expected inflation is important, the notation πœ‹πœ‹ 𝑒𝑒 will be used to denote expected inflation.
Figure 6.5 shows data on inflation for the USA in the post-war period. Inflation is volatile at
the end of the 1940s but becomes very low and stable in the 1950s. The 1960s see an
increase in the inflation rate, which reaches double digits in the 1970s. Inflation is brought
under control in the 1980s and remains stable throughout the 1990s. This stability continues
into the 2000s except for the years around the 2007–8 financial crisis and its aftermath.
To calculate the real change in spending power when holding money, the value of money,
plus any interest π‘–π‘–π‘šπ‘š that accrues, is adjusted for changes in the money prices of goods and
services. Take the amount of money 𝑃𝑃 that currently buys one unit of goods. If held simply
as money then this becomes (1 + π‘–π‘–π‘šπ‘š )𝑃𝑃 units of money in the future period and with a price
level 𝑃𝑃′ , it would be possible to buy (1 + π‘–π‘–π‘šπ‘š )𝑃𝑃/𝑃𝑃′ units of goods in the future. The
199
EC2065 Macroeconomics | Chapter 6: Money
definition of the real return π‘Ÿπ‘Ÿπ‘šπ‘š on money is that holding an amount of money sufficient to
purchase a unit of goods now yields purchasing power over 1 + π‘Ÿπ‘Ÿπ‘šπ‘š units of future goods.
The percentage real return on money π‘Ÿπ‘Ÿπ‘šπ‘š is therefore calculated from the equation:
1 + π‘Ÿπ‘Ÿπ‘šπ‘š =
(1 + π‘–π‘–π‘šπ‘š )𝑃𝑃 1 + π‘–π‘–π‘šπ‘š 1 + π‘–π‘–π‘šπ‘š
=
=
𝑃𝑃′
𝑃𝑃′⁄𝑃𝑃
1 + πœ‹πœ‹
Observe that (1 + π‘Ÿπ‘Ÿπ‘šπ‘š )(1 + πœ‹πœ‹) = 1 + π‘–π‘–π‘šπ‘š implies 1 + π‘Ÿπ‘Ÿπ‘šπ‘š + πœ‹πœ‹ + π‘Ÿπ‘Ÿπ‘šπ‘š πœ‹πœ‹ = 1 + π‘–π‘–π‘šπ‘š . If π‘Ÿπ‘Ÿπ‘šπ‘š πœ‹πœ‹ is small
compared to π‘Ÿπ‘Ÿπ‘šπ‘š and πœ‹πœ‹, the real return on money is approximately given by π‘Ÿπ‘Ÿπ‘šπ‘š ≈ π‘–π‘–π‘šπ‘š − πœ‹πœ‹. In
the case where no interest is paid on money (π‘–π‘–π‘šπ‘š = 0), for example, when money is
interpreted as cash, then the real return is approximately π‘Ÿπ‘Ÿπ‘šπ‘š ≈ −πœ‹πœ‹. This says that the
inflation rate is approximately the percentage loss of purchasing power of money over time.
Figure 6.5: US inflation
6.3.2 The Fisher equation
The equivalent calculation of the real return on holding nominal bonds is known as the
Fisher equation. A nominal bond is one that specifies payments in terms of units of money.
It is natural for bonds to make payments in this form following on from our earlier
discussion of money’s role as a unit of account.
We consider somewhat more general nominal bonds in Chapter 7 but here suppose that a
nominal bond makes a single payment of interest in terms of money in the next period and
this payment is certain. The interest rate specified by the bond is 𝑖𝑖, the nominal interest rate
and nominal return on holding the bond. This is known when the bond is purchased. The
substantive assumption here is that the bond payment is not indexed to inflation.
The Fisher equation gives the implied real return, referred to as the real interest rate π‘Ÿπ‘Ÿ:
1 + π‘Ÿπ‘Ÿ =
(1 + 𝑖𝑖 )𝑃𝑃 1 + 𝑖𝑖
=
1 + πœ‹πœ‹
𝑃𝑃′
The justification for this equation is that one unit of goods costs 𝑃𝑃 units of money in the
current period. If this money is used to buy bonds that offer a nominal interest rate 𝑖𝑖, the
200
EC2065 Macroeconomics | Chapter 6: Money
amount of money returned when the bond matures in the next period is (1 + 𝑖𝑖 )𝑃𝑃. Dividing
this by the future price level 𝑃𝑃′ gives the amount of future goods that can be purchased. The
equation then follows by noting that π‘Ÿπ‘Ÿ is defined so that buying nominal bonds worth a unit
of goods today gives the ability to buy 1 + π‘Ÿπ‘Ÿ units of goods in the future.
Rearranging the equation gives 𝑖𝑖 = π‘Ÿπ‘Ÿ + πœ‹πœ‹ + π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ, so if π‘Ÿπ‘Ÿ and πœ‹πœ‹ are small, the term π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ is
negligible compared to the other terms, and it follows that π‘Ÿπ‘Ÿ ≈ 𝑖𝑖 − πœ‹πœ‹. This equation, which
can also be written as 𝑖𝑖 ≈ π‘Ÿπ‘Ÿ + πœ‹πœ‹, is the approximate version of the Fisher equation. It states
that the real return on bonds is the difference between the nominal interest rate and the
inflation rate. This is an approximation, and in contexts where the inflation rate can be very
high, the exact version 1 + 𝑖𝑖 = (1 + π‘Ÿπ‘Ÿ)(1 + πœ‹πœ‹) will be used.
6.3.3 Ex-ante and ex-post interest rates
We have seen that inflation affects the real return on nominal assets such as money and
nominal bonds. However, inflation πœ‹πœ‹ as defined earlier is the percentage change in the price
level between the current and future time periods, which is therefore not known in
advance. This means the Fisher equation can be used with either actual inflation πœ‹πœ‹ or
expected inflation πœ‹πœ‹ 𝑒𝑒 as appropriate.
Using the expected inflation rate πœ‹πœ‹ 𝑒𝑒 leads to an expected (ex-ante) real interest rate π‘Ÿπ‘Ÿ 𝑒𝑒 ≈
𝑖𝑖 − πœ‹πœ‹ 𝑒𝑒 , or by giving this equation in its exact form, 1 + π‘Ÿπ‘Ÿ 𝑒𝑒 = (1 + 𝑖𝑖)/(1 + πœ‹πœ‹ 𝑒𝑒 ). Using the
actual inflation leads to the actual (ex-post) real interest rate π‘Ÿπ‘Ÿ ≈ 𝑖𝑖 − πœ‹πœ‹, or in its exact form,
1 + π‘Ÿπ‘Ÿ = (1 + 𝑖𝑖)/(1 + πœ‹πœ‹).
Different from nominal bonds, inflation-indexed bonds have the same actual and expected
real returns. This type of bond is sometimes referred to as a real bond to distinguish it from
bonds that specify payments in terms of fixed amounts of money.
6.3.4 The opportunity cost of holding money
We have argued that money provides an important service in facilitating transactions as a
medium of exchange. Despite this advantage, money generally performs less well as a store
of value than other assets. To the extent that the return on holding money is below the
return on alternative assets, there is an opportunity cost of holding money that must be set
against its benefits in facilitating transactions. This opportunity cost is inversely related to
how well money performs as a store of value.
In what follows, we will take nominal bonds as the alternative asset to which money is
compared. The nominal interest rate on bonds is 𝑖𝑖, which is also the nominal return on
holding bonds. If money pays interest at nominal rate π‘–π‘–π‘šπ‘š then the relative return on bonds
compared to money is the difference between the interest rates 𝑖𝑖 − π‘–π‘–π‘šπ‘š . Note that this
relative return is the same if calculated as a comparison of real returns π‘Ÿπ‘Ÿ and π‘Ÿπ‘Ÿπ‘šπ‘š because the
same inflation rate πœ‹πœ‹ is subtracted from both:
π‘Ÿπ‘Ÿ − π‘Ÿπ‘Ÿπ‘šπ‘š = (𝑖𝑖 − πœ‹πœ‹) − (π‘–π‘–π‘šπ‘š − πœ‹πœ‹) = 𝑖𝑖 − π‘–π‘–π‘šπ‘š
The opportunity cost of holding money is therefore 𝑖𝑖 − π‘–π‘–π‘šπ‘š . As we will see, the opportunity
cost is usually positive, with holders of money forgoing a generally higher return on bonds.
201
EC2065 Macroeconomics | Chapter 6: Money
If no interest is paid on money (π‘–π‘–π‘šπ‘š = 0), for example where money is physical cash, then the
opportunity cost is simply 𝑖𝑖. In this case, the level of nominal interest rates on bonds is a
measure of the opportunity cost of holding money.
6.3.5 Real and nominal interest rates
Our earlier analyses of consumption, saving, and investment in Chapter 3 shows that it is the
(expected) real interest rate π‘Ÿπ‘Ÿ that is important for incentives. For example, the theory of
investment links the real interest rate to the marginal product of capital net of depreciation.
We have seen there are reasons to believe that real interest rates should be positive on
average if the productivity of capital is sufficiently high, or households are impatient. But
theory does not rule out times when real interest rates are negative.
A time series of US real interest rates is shown in Figure 6.6. This shows that real interest
rates are positive but low on average (around 2 per cent). The 1980s featured much higher
real interest rates, which peaked at close to 10 per cent. Real interest rates were positive
but lower in the 1960s and the 1990s. There are also times of negative real interest rates in
the late 1940s, 1970s and, more recently, from the aftermath of the 2008 financial crisis
through to 2021.
Figure 6.6: US real and nominal interest rates
As we will see in this chapter, the nominal interest rate also matters independently of the
level of real interest rates. This is because it affects the relative returns on money and bonds
in a world where at least some forms of money pay no interest, which influences how
households allocate wealth between different assets.
Empirically, US nominal interest rates have almost always been positive, though there have
been long spells where they have been close to zero, most notably after the 2008 financial
crisis. On average, nominal interest rates are higher than real interest rates, reflecting the
positive average rate of inflation. The broad pattern for US nominal interest rates is that
they were very low in the 1940s, increased over the subsequent decades to peak close to 15
per cent in the early 1980s and then declined in through to the time of writing (2021).
202
EC2065 Macroeconomics | Chapter 6: Money
Nominal interest rates are clearly positive on average, indicating a positive opportunity cost
of holding money, particularly cash. In some countries, nominal interest rates have
occasionally turned negative. We will return to that unusual case later in this chapter.
6.4 The demand for money
To understand prices, inflation and nominal interest rates, we now set up a framework for
analysing the demand for money. The basic trade-off is that money facilitates economic
activity by acting as a medium of exchange but may not be so good as a store of value
compared to alternative assets. So, while money is useful to households and firms, they
have incentives to economise on holding money, use alternatives to money, or carry out
fewer transactions.
We suppose that real GDP π‘Œπ‘Œ is an indicator of the number of transactions taking place in an
economy and each transaction requires using a means of payment such as money. Here, we
take the level of real GDP π‘Œπ‘Œ as given, returning later to the question of whether that is
affected by money and monetary policy. If the price level is 𝑃𝑃, the money value of all
transactions in the economy is 𝑃𝑃𝑃𝑃.
To represent the role of money as a medium of exchange, we impose the following
transaction constraint in addition to the budget constraints faced by agents in the economy:
𝑀𝑀 ≥ 𝑃𝑃(π‘Œπ‘Œ − 𝑋𝑋)
This constraint specifies a minimum level of money that households and firms must hold to
carry out transactions. Mathematically, it requires that the level of money holdings on
average during a period is sufficient to pay for transactions of real value π‘Œπ‘Œ − 𝑋𝑋, which
correspond to an amount of money 𝑃𝑃(π‘Œπ‘Œ − 𝑋𝑋). The variable 𝑋𝑋 has several possible
interpretations, including efforts to economise on holding money or use alternatives to
money in carrying out transactions.
We will assume in what follows that money pays no interest. Holding money balances 𝑀𝑀 on
average during a period means forgoing interest 𝑖𝑖𝑖𝑖 that could have been earned from
holding bonds instead. If money pays interest at rate π‘–π‘–π‘šπ‘š , all we need to do is replace the
opportunity cost 𝑖𝑖 with 𝑖𝑖 − π‘–π‘–π‘šπ‘š throughout this chapter.
Taking as given 𝑃𝑃 and π‘Œπ‘Œ, the minimum amount of money holdings consistent with the
transaction constraint can only be reduced and thus forgone interest saved, by increasing 𝑋𝑋.
As we now discuss, increasing 𝑋𝑋 has costs that we can compare to forgone interest to derive
households’ and firms’ demand for money.
6.4.1 Economising on holding money
The first interpretation of 𝑋𝑋 is efforts to economise on the amount of money held on
average. All 𝑃𝑃𝑃𝑃 transactions are carried out with money but agents make frequent
exchanges between bonds and money to keep their average holdings of money lower.
Consequently, as 𝑋𝑋 rises, average holdings of money 𝑀𝑀 fall further below 𝑃𝑃𝑃𝑃.
203
EC2065 Macroeconomics | Chapter 6: Money
The benefit of higher 𝑋𝑋 is a reduction in forgone interest but this uses up time or incurs
transaction costs. These costs in real terms are specified by the function 𝑍𝑍(𝑋𝑋), which is
increasing in 𝑋𝑋. We assume 𝑍𝑍(𝑋𝑋) has the properties 𝑍𝑍(0) = 0, 𝑍𝑍 ′ (𝑋𝑋) > 0, and 𝑍𝑍 ′′ (𝑋𝑋) > 0,
the third of these implying that the marginal cost 𝑍𝑍′(𝑋𝑋) is increasing in 𝑋𝑋.
Conditional on 𝑋𝑋, the lowest money holdings can be is 𝑀𝑀 = 𝑃𝑃(π‘Œπ‘Œ − 𝑋𝑋). An increase of 𝑋𝑋 by 1
reduces the need to hold real money balances 𝑀𝑀/𝑃𝑃 by 1, which reduces the real value of
forgone interest 𝑖𝑖𝑖𝑖/𝑃𝑃 on money holdings by 𝑖𝑖. The marginal benefit of higher 𝑋𝑋 is thus
equal to 𝑖𝑖. The marginal cost of higher 𝑋𝑋 is 𝑍𝑍 ′ (𝑋𝑋), which we will denote by π‘žπ‘ž in what
follows. The optimal choice of 𝑋𝑋 ∗ is where the marginal benefit equals the marginal cost:
𝑖𝑖 = 𝑍𝑍′(𝑋𝑋 ∗ )
The marginal cost function 𝑍𝑍′(𝑋𝑋) is shown as an upward-sloping line in Figure 6.7 with 𝑋𝑋 on
the horizontal axis and the marginal cost π‘žπ‘ž on the vertical axis. The optimal value of 𝑋𝑋 ∗ for a
particular nominal interest rate 𝑖𝑖 is derived by drawing a horizontal line at π‘žπ‘ž = 𝑖𝑖 and finding
where it intersects the marginal cost function. The figure shows that a higher nominal
interest rate 𝑖𝑖 leads to an increase in the optimal 𝑋𝑋 ∗ . Intuitively, if money is a worse store of
value, meaning that the opportunity cost 𝑖𝑖 is higher, it is rational to make more efforts to
avoid holding it.
Figure 6.7: Optimal reduction in money holdings
Having derived 𝑋𝑋 ∗ , agents’ demand for money 𝑀𝑀𝑑𝑑 is the minimum amount allowed by the
transaction constraint (assuming 𝑖𝑖 > 0, so there is a positive opportunity cost):
𝑀𝑀𝑑𝑑 = π‘ƒπ‘ƒοΏ½π‘Œπ‘Œ − 𝑋𝑋 ∗ (𝑖𝑖 )οΏ½
This a money demand function of the form 𝑀𝑀𝑑𝑑 = 𝑃𝑃𝑃𝑃(π‘Œπ‘Œ, 𝑖𝑖), where real money demand
𝐿𝐿(π‘Œπ‘Œ, 𝑖𝑖 ) = π‘Œπ‘Œ − 𝑋𝑋 ∗ (𝑖𝑖) is increasing in π‘Œπ‘Œ and decreasing in 𝑖𝑖.
The choice of how much money to hold is related to the following measure of the velocity of
money 𝑉𝑉, which is defined by 𝑀𝑀𝑉𝑉 = 𝑃𝑃𝑃𝑃. This is a measure of how fast a unit of money
circulates in a given period as it is used for multiple transactions. Since 𝑉𝑉 = 𝑃𝑃𝑃𝑃⁄𝑀𝑀𝑑𝑑 =
π‘Œπ‘Œ/(π‘Œπ‘Œ − 𝑋𝑋 ∗ (𝑖𝑖 )), velocity is inversely related to 𝑀𝑀𝑑𝑑 and increases with the opportunity cost 𝑖𝑖
204
EC2065 Macroeconomics | Chapter 6: Money
of holding money. Intuitively, money circulates faster with people holding it for shorter
periods when money is a poor store of value.
6.4.2 Alternatives to money
A second interpretation of 𝑋𝑋 in the transaction constraint 𝑀𝑀 + 𝑃𝑃𝑃𝑃 ≥ 𝑃𝑃𝑃𝑃 is using
alternatives to money as a means of payment. We discussed in Section 6.2 how credit might
be used as a substitute for money in some situations. Suppose banks offer credit facilities,
for example, credit cards and let 𝑋𝑋 now denote the real value of transactions paid for with
credit. Think of this as short-term credit, with borrowing during a period repaid at the end of
the period. A fee π‘žπ‘ž is charged for using credit as a fraction of the amount borrowed.
In offering credit, banks face costs of screening borrowers and collection of debts. Assume
that these costs are an increasing function 𝑍𝑍(𝑋𝑋) of 𝑋𝑋. In addition, the marginal cost 𝑍𝑍 ′ (𝑋𝑋) of
extending provision of credit is increasing in the amount of borrowing 𝑋𝑋. This represents the
idea that banks would face higher costs when they expand lending to a wider group of less
credit-worthy borrowers, or extend more credit to existing borrowers. Assuming the
banking system is competitive, banks offer credit 𝑋𝑋 𝑠𝑠 up to the point where the fee charged
π‘žπ‘ž is equal to the marginal cost 𝑍𝑍′(𝑋𝑋):
π‘žπ‘ž = 𝑍𝑍′(𝑋𝑋 𝑠𝑠 )
This yields an upward-sloping supply curve 𝑋𝑋 𝑠𝑠 (π‘žπ‘ž ) for credit facilities.
Now consider the demand for credit by households and firms as a means of payment.
Transactions can equally well be carried out using credit facilities or using money. The credit
facility fee is π‘žπ‘ž per unit of spending is the cost of using credit for payments. If money is used
instead then the cost is the opportunity cost 𝑖𝑖 of holding money. Since the two means of
payment are perfect substitutes, if π‘žπ‘ž < 𝑖𝑖 then payment with credit facilities is preferred, if
π‘žπ‘ž > 𝑖𝑖 then payment using money holdings is preferred, and if π‘žπ‘ž = 𝑖𝑖 then everyone is
indifferent between the two. It follows that the demand for credit facilities 𝑋𝑋 𝑑𝑑 (π‘žπ‘ž) is
perfectly elastic in the range 0 ≤ 𝑋𝑋 ≤ π‘Œπ‘Œ with respect to the fee at π‘žπ‘ž = 𝑖𝑖.
The demand function 𝑋𝑋 𝑑𝑑 (π‘žπ‘ž) is plotted alongside the supply function 𝑋𝑋 𝑠𝑠 (π‘žπ‘ž) in Figure 6.8.
The demand curve shifts vertically if the nominal interest rate 𝑖𝑖 changes, moving upwards if
𝑖𝑖 rises. The equilibrium of the market for credit facilities is at the intersection of the demand
and supply curves. Assuming 𝑋𝑋 ∗ < π‘Œπ‘Œ, so some amount of money is held to make payments,
the equilibrium features π‘žπ‘ž ∗ = 𝑖𝑖, so the credit fee is equal to the nominal interest rate on
bonds. An increase in 𝑖𝑖 shifts the demand function upwards, so the equilibrium 𝑋𝑋 ∗ (𝑖𝑖) rises
with 𝑖𝑖. As it does not make sense to hold more money than required to satisfy the
transaction constraint when 𝑖𝑖 > 0, the money demand function is 𝑀𝑀𝑑𝑑 = π‘ƒπ‘ƒοΏ½π‘Œπ‘Œ − 𝑋𝑋 ∗ (𝑖𝑖 )οΏ½,
which has the same form as seen earlier with the first interpretation of 𝑋𝑋.
6.4.3 The money demand function
Considering 𝑋𝑋 as either effort to economise on holding money, or substitution towards
alternatives to money, the resulting money demand function has the form:
𝑀𝑀𝑑𝑑 = 𝑃𝑃𝑃𝑃(π‘Œπ‘Œ, 𝑖𝑖)
205
EC2065 Macroeconomics | Chapter 6: Money
The function 𝐿𝐿(π‘Œπ‘Œ, 𝑖𝑖 ) = π‘Œπ‘Œ − 𝑋𝑋 ∗ (𝑖𝑖 ) for holdings of real money balances 𝑀𝑀𝑑𝑑 /𝑃𝑃 increases in π‘Œπ‘Œ
and decreases in 𝑖𝑖. Nominal money demand 𝑀𝑀𝑑𝑑 is proportional to the price level 𝑃𝑃 for given
real transactions and interest rates because higher prices scale up the need for units of
money to make payments. Money demand 𝑀𝑀𝑑𝑑 increases with π‘Œπ‘Œ as higher GDP means more
transactions. Money demand 𝑀𝑀𝑑𝑑 decreases with 𝑖𝑖 because a higher opportunity cost
increases incentives to reduce money holdings through various means.
Figure 6.8: The market for credit facilities
The money demand function is plotted against the price level 𝑃𝑃 in the left panel of Figure
6.9 for given values of real GDP π‘Œπ‘Œ and the nominal interest rate 𝑖𝑖. It is an upward-sloping
straight line because nominal money demand is proportional to the price level. The demand
function pivots to the right if π‘Œπ‘Œ increases or 𝑖𝑖 falls.
Figure 6.9: The demand for money
The relationship between the nominal interest rate 𝑖𝑖 and real money demand 𝑀𝑀𝑑𝑑 /𝑃𝑃 is
depicted in the right panel of Figure 6.9. The negative relationship reflects the incentive to
reduce money holdings when the opportunity cost 𝑖𝑖 is high. Mathematically, the demand
curve represents the optimality condition 𝑖𝑖 = 𝑍𝑍′(𝑋𝑋), where 𝑋𝑋 = π‘Œπ‘Œ − (𝑀𝑀⁄𝑃𝑃) using the
206
EC2065 Macroeconomics | Chapter 6: Money
binding transaction constraint. The demand curve shifts to the right if π‘Œπ‘Œ increases. In the
special case 𝑖𝑖 = 0, there is no forgone interest when holding money and the optimal value
of 𝑋𝑋 is 0. Moreover, there is no incentive to reduce money holdings until the transaction
constraint just holds. Hence, with a zero nominal interest rate, money demand is 𝑀𝑀𝑑𝑑 ≥
𝑃𝑃𝑃𝑃(π‘Œπ‘Œ, 0) = 𝑃𝑃𝑃𝑃, which corresponds to a horizontal line at 𝑖𝑖 = 0. Money demand thus
becomes perfectly interest elastic at 𝑖𝑖 = 0.
Finally, we note that when money itself pays interest at rate π‘–π‘–π‘šπ‘š , all references to 𝑖𝑖 above in
the money demand function should be replaced by the correct opportunity cost 𝑖𝑖 − π‘–π‘–π‘šπ‘š .
6.5 Money and economic activity
Our study of money demand revealed the ways in which it is affected by real GDP and
interest rates. But does money itself matter for real GDP? The analysis here will focus on
money’s medium of exchange function, which affects the efficiency with which markets
operate. Later in Chapter 8, money’s unit of account function becomes relevant in the
presence of nominal rigidities. In this chapter, we look at the implications of economic
activity depending on holding money for some period between selling one thing and buying
another. Money that is a poor store of value over this period acts as a tax on economic
activity, therefore discouraging production and exchange.
We illustrate this idea using the labour market as an example. Suppose the period is a
month and workers are paid a wage π‘Šπ‘Š per hour of labour only at the end of the month.
Wages arrive too late to be spent directly during the same month, and suppose it is not
possible for workers to barter labour for goods, or offer IOUs for payment when they buy
goods.
Households’ labour supply condition 𝑀𝑀𝑀𝑀𝑀𝑀𝑙𝑙,𝐢𝐢 = 𝑀𝑀 derived in Chapter 1 assumed an extra
hour of labour paid money wage π‘Šπ‘Š buys 𝑀𝑀 = π‘Šπ‘Š/𝑃𝑃 goods in the same period. But to work
more and spend more during the same month in the monetary economy described above, a
household must either forgo interest by holding on to more cash at the beginning of the
month, swap money and other assets more frequently during the month at some cost, or
pay for goods using credit as an alternative to money. All of these ways of spending more
during the month before actually receiving the wage at the end of the month entail some
cost.
We derive the amount of goods π‘€π‘€π‘šπ‘š that can be purchased in the same month when a
household supplies an additional hour of labour paid money wage π‘Šπ‘Š, holding constant the
household’s future plans for consumption and labour supply. The effective real purchasing
power of a household’s wages during the month is π‘€π‘€π‘šπ‘š , which will generally differ from the
real cost 𝑀𝑀 = π‘Šπ‘Š/𝑃𝑃 to firms when wages are paid at the end of the month.
Real purchases π‘€π‘€π‘šπ‘š cost π‘ƒπ‘ƒπ‘€π‘€π‘šπ‘š units of money. If the household holds extra money π‘ƒπ‘ƒπ‘€π‘€π‘šπ‘š
instead of bonds during the month to make the purchases then this reduces nominal wealth
by (1 + 𝑖𝑖 )π‘ƒπ‘ƒπ‘€π‘€π‘šπ‘š at beginning of next month. To leave future spending plans unchanged, this
needs to be replenished with the extra wages π‘Šπ‘Š received at the end of the month, hence,
(1 + 𝑖𝑖 )π‘ƒπ‘ƒπ‘€π‘€π‘šπ‘š = π‘Šπ‘Š. Dividing both sides by 𝑃𝑃 implies that π‘€π‘€π‘šπ‘š is related to 𝑀𝑀 as follows:
207
EC2065 Macroeconomics | Chapter 6: Money
π‘€π‘€π‘šπ‘š =
𝑀𝑀
1 + 𝑖𝑖
This equation says that the effective purchasing power of the wages households receive is
reduced by 𝑖𝑖 because spending more requires holding more money, which forgoes interest.
Alternatively, the household could maintain the same average money holdings during the
month and avoid forgoing interest. However, this requires swapping between money and
other assets more frequently (higher 𝑋𝑋) during the month to cover the additional spending.
But this entails transaction costs π‘žπ‘ž = 𝑍𝑍′(𝑋𝑋) per unit of extra spending. Deducting these
from the wage received implies π‘ƒπ‘ƒπ‘€π‘€π‘šπ‘š = π‘Šπ‘Š − π‘žπ‘žπ‘žπ‘žπ‘€π‘€π‘šπ‘š . It follows that π‘€π‘€π‘šπ‘š = 𝑀𝑀⁄(1 + π‘žπ‘ž).
Finally, credit could be used for the extra purchases made during the month. This requires
paying a fee π‘žπ‘žπ‘žπ‘žπ‘€π‘€π‘šπ‘š at the end of month. Deducting that from the wage implies π‘ƒπ‘ƒπ‘€π‘€π‘šπ‘š = π‘Šπ‘Š −
π‘žπ‘žπ‘žπ‘žπ‘€π‘€π‘šπ‘š and hence, π‘€π‘€π‘šπ‘š = 𝑀𝑀⁄(1 + π‘žπ‘ž). In Section 6.4, we saw that households’ optimal choice
of money holdings implies π‘žπ‘ž = 𝑍𝑍 ′ (𝑋𝑋) = 𝑖𝑖, so this means that π‘€π‘€π‘šπ‘š = 𝑀𝑀/(1 + 𝑖𝑖) whichever
way of paying for current consumption that households choose.
Households’ labour supply decision in a monetary economy with the timing restriction on
receiving and spending wages equates the marginal rate of substitution 𝑀𝑀𝑀𝑀𝑀𝑀𝑙𝑙,𝐢𝐢 between
leisure and current consumption to the effective current purchasing power of the wage π‘€π‘€π‘šπ‘š :
𝑀𝑀𝑀𝑀𝑀𝑀𝑙𝑙,𝐢𝐢 =
𝑀𝑀
1 + 𝑖𝑖
The right-hand side of the equation is lower when the opportunity cost 𝑖𝑖 rises, indicating
that money is worse as a store of value. Since working and consuming more depends on
holding money for some time, a positive opportunity cost 𝑖𝑖 works in a way similar to a
proportional tax 𝜏𝜏 on wages. We know from Section 1.5 that a proportional income tax on
wages means the households’ labour supply is determined by 𝑀𝑀𝑀𝑀𝑀𝑀𝑙𝑙,𝐢𝐢 = (1 − 𝜏𝜏)𝑀𝑀.
This logic points to one way that money matters for real GDP. If money is worse as a store of
value (a high opportunity cost 𝑖𝑖) then the implicit tax on economic activity rises. This leads
to a lower labour supply, shifting the 𝑁𝑁 𝑠𝑠 curve to the left. All else equal, there is less
employment and lower production, which causes a shift of the π‘Œπ‘Œ 𝑠𝑠 curve to the left.
Households are worse off, which reduces consumption demand and shifts the π‘Œπ‘Œ 𝑑𝑑 curve to
the left as well. Consequently, real GDP π‘Œπ‘Œ is lower. If π‘Œπ‘Œ 𝑠𝑠 and π‘Œπ‘Œ 𝑑𝑑 shift by the same amount,
then the real interest rate remains unchanged.
6.6 The supply of money
We now turn to thinking about the supply of money. We first consider the supply of fiat
money by a government or central bank, deferring discussion of ‘credit money’ created by
the banking system until Chapter 7. Assume all money is fiat money, for which the
government is the monopoly supplier. The quantity of money in circulation is denoted by
𝑀𝑀 𝑠𝑠 . For now, we make no distinction between cash and reserves.
208
EC2065 Macroeconomics | Chapter 6: Money
How does the central bank change the money supply? In other words, how does new money
enter circulation or existing money is removed from circulation? We will see that there are
two basic ways this can happen:
•
•
Open-market operations
Transfers.
Since fiat money is intrinsically worthless, the resource costs of creating new money are
negligible and we ignore them in our analysis.
6.6.1 Open-market operations
An open-market operation is where the central bank buys or sells assets. When the central
bank buys assets, it pays with newly created money, which increases the quantity of money
in circulation. When the central bank sells assets, it receives existing money as payment,
which is effectively removed from circulation.
The central bank can in principle buy any asset in an open market operation or sell any asset
it already holds. It usually transacts with the private sector through its dealings with
commercial banks (rather than buying bonds directly from the government). Traditionally,
open-market operations were in markets for short-term government bonds, or repos
(repurchase/resale agreements) of long-term government bonds.
These assets were chosen because they have low credit risk and a short maturity and thus
protect the central bank from capital losses that would make it harder to reverse an
expansionary open-market operation in the future. But since the 2008 financial crisis, many
central banks have also made outright purchases of long-term bonds or risky assets, for
example, quantitative easing (QE) purchases of mortgage-backed securities in the US.
6.6.2 Transfers
A transfer payment is where the central bank distributes money without acquiring any asset
in return, for example, the payment of central-bank profits to a country’s finance ministry.
These profits often arise as a normal outcome of the central bank’s operations and are
distributed to the finance ministry as the owner of the central bank.
However, in principle – putting aside legal rules – a central bank can create new money and
simply distribute it to the finance ministry or others. This could mean directly paying for
government expenditure, or giving the government money to compensate for lower tax
revenues. The case of a direct payment of new money to households is known as a
‘helicopter drop’ of money, though the same economic effect could be achieved by a
transfer to the government to fund a tax cut for households.
6.6.3 Monetary policy
The decisions the central bank makes that affect the supply of money are described as its
monetary policy. For now, we assume monetary policy is an exogenous supply of money
𝑀𝑀 𝑠𝑠 . This can be represented as a perfectly inelastic money supply curve. This supply curve
shifts if monetary policy changes. We will consider later what monetary policy should be
chosen to meet the objectives of a country’s government.
209
EC2065 Macroeconomics | Chapter 6: Money
6.7 Money and prices
This section combines the demand and supply of money to see how the level of prices, the
inflation rate and the nominal interest rate are determined. Here, we suppose that goods
prices in terms of money are fully flexible. This means that the real value of money adjusts
to be equal to the real amount of money that households and firms are willing to hold. In
Chapter 8 we consider how an economy functions differently if there are nominal rigidities,
for example ‘sticky prices’.
With flexible prices, the price level 𝑃𝑃 adjusts to ensure the money market clears. The
nominal money supply 𝑀𝑀 𝑠𝑠 is assumed to be an exogenous amount 𝑀𝑀 chosen by the central
bank. Rather than consider a completely general monetary policy, we will restrict attention
here to monetary policies where the money supply is expected to grow at some exogenous
rate πœ‡πœ‡ over time:
𝑀𝑀′ = (1 + πœ‡πœ‡)𝑀𝑀
The money demand function is 𝑀𝑀𝑑𝑑 = 𝑃𝑃𝑃𝑃(π‘Œπ‘Œ, 𝑖𝑖), and money-market equilibrium 𝑀𝑀𝑑𝑑 = 𝑀𝑀 𝑠𝑠
therefore requires 𝑀𝑀 = 𝑃𝑃𝑃𝑃(π‘Œπ‘Œ, 𝑖𝑖) in the current period and 𝑀𝑀′ = 𝑃𝑃′ 𝐿𝐿(π‘Œπ‘Œ ′ , 𝑖𝑖 ′ ) in the future.
Nominal and real interest rates are linked by the Fisher equation 𝑖𝑖 = π‘Ÿπ‘Ÿ + πœ‹πœ‹, where the
inflation rate is defined by πœ‹πœ‹ = (𝑃𝑃′ − 𝑃𝑃)/𝑃𝑃. The conditions for equilibrium in the money
market now and in the future are therefore 𝑀𝑀 = 𝑃𝑃𝑃𝑃(π‘Œπ‘Œ, π‘Ÿπ‘Ÿ + πœ‹πœ‹) and 𝑀𝑀′ = 𝑃𝑃′ 𝐿𝐿(π‘Œπ‘Œ ′ , π‘Ÿπ‘Ÿ ′ + πœ‹πœ‹ ′ ). A
graphical representation of the current period equilibrium is shown in Figure 6.10, where
the equilibrium price level 𝑃𝑃∗ is at the intersection of 𝑀𝑀𝑑𝑑 and 𝑀𝑀 𝑠𝑠 .
Figure 6.10: Money-market equilibrium
By dividing the future money-market equilibrium condition by the current money-market
equilibrium condition we obtain the equation:
𝑀𝑀′ 𝑃𝑃′ 𝐿𝐿(π‘Œπ‘Œ ′ , π‘Ÿπ‘Ÿ ′ + πœ‹πœ‹ ′ )
=
𝑀𝑀
𝑃𝑃𝑃𝑃(π‘Œπ‘Œ, π‘Ÿπ‘Ÿ + πœ‹πœ‹)
We suppose that the types of monetary policies considered here do not affect future real
GDP π‘Œπ‘Œ′ differently from how they affect current real GDP π‘Œπ‘Œ (that is, they do not change the
210
EC2065 Macroeconomics | Chapter 6: Money
future real GDP growth rate), or the current real interest rate π‘Ÿπ‘Ÿ relative to its future level π‘Ÿπ‘Ÿ′,
or πœ‹πœ‹ relative to πœ‹πœ‹′. All else being equal, this means π‘Œπ‘Œ = π‘Œπ‘Œ′, π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿ′, and πœ‹πœ‹ = πœ‹πœ‹′. Note we have
not ruled out that monetary policy affects the levels of π‘Œπ‘Œ, π‘Ÿπ‘Ÿ, or πœ‹πœ‹. The equation for moneymarket equilibrium then reduces to 𝑀𝑀′⁄𝑀𝑀 = 𝑃𝑃′⁄𝑃𝑃. With definitions 𝑀𝑀′⁄𝑀𝑀 = 1 + πœ‡πœ‡ and
𝑃𝑃′ /𝑃𝑃 = 1 + πœ‹πœ‹, money-market equilibrium therefore implies:
πœ‹πœ‹ = πœ‡πœ‡
The rate of inflation πœ‹πœ‹ is equal to the money-supply growth rate πœ‡πœ‡, which means that
inflation is determined by monetary policy through the choice of πœ‡πœ‡. Intuitively, increases in
the money supply shift the 𝑀𝑀 𝑠𝑠 curve to the right, which imply that the intersection with 𝑀𝑀𝑑𝑑
occurs at a higher price level 𝑃𝑃 to leave holdings of real money balances unchanged.
Given a real interest rate π‘Ÿπ‘Ÿ, the Fisher equation implies:
𝑖𝑖 = π‘Ÿπ‘Ÿ + πœ‡πœ‡
A higher money-supply growth rate thus raises the nominal interest rate 𝑖𝑖 if π‘Ÿπ‘Ÿ remains
unchanged. This is because a higher nominal interest rate is required to cancel out the
effect of inflation and leave the real return on bonds the same.
Box 6.2: The instability of money demand
The analysis of the equilibrium inflation rate might give the impression that only the
money supply growth rate matters, a form of ‘monetarism’. This is because we have
considered only shifts of the money supply curve for a completely stable money demand
curve. However, the equilibrium of the money market can also be affected by shifts of the
money demand function and this affects the equilibrium price level for a given supply of
money. If such demand shifts occur then this leads to fluctuations in inflation even if
monetary policy keeps the money supply or money growth constant.
The money demand curve 𝑀𝑀𝑑𝑑 ⁄𝑃𝑃 = 𝐿𝐿(π‘Œπ‘Œ, 𝑖𝑖) can shift because of changes in real GDP π‘Œπ‘Œ,
which affect the need to use money for transactions. But in addition to this, the function
𝐿𝐿(π‘Œπ‘Œ, 𝑖𝑖) itself might not be stable owing to financial innovation. New ideas or technologies
can change the costs of providing substitutes for money, for example, credit cards, or
change the costs of economising on the average amount of money held to carry out
transactions, for example, ATMs, debit cards and electronic payments. We can represent
the effects of these innovations in the model by shifts of the marginal cost function 𝑍𝑍′(𝑋𝑋).
Since the money demand is determined by the equation 𝑖𝑖 = 𝑍𝑍′(π‘Œπ‘Œ − 𝑀𝑀𝑑𝑑 ⁄𝑃𝑃), these
changes also shift the money demand function as shown in Figure 6.11. A reduction in the
marginal cost of providing substitutes for money or economising on money holdings
increases 𝑋𝑋 and reduces money demand, causing the price level to increase for a given
money supply 𝑀𝑀 𝑠𝑠 .
211
EC2065 Macroeconomics | Chapter 6: Money
Figure 6.11: Financial innovation shifts money demand
How serious an issue is the instability of money demand? The left panel of Figure 6.12
reports a time series of the quantity of money 𝑀𝑀 𝑠𝑠 in the USA relative to nominal GDP 𝑃𝑃𝑃𝑃.
Note that this uses the M1 measure of the money supply, which is broader than the
monetary base and that we study further in Chapter 7. The measure 𝑀𝑀 𝑠𝑠 /(𝑃𝑃𝑃𝑃) is the
inverse of the velocity of money 𝑉𝑉 = (𝑃𝑃𝑃𝑃)/𝑀𝑀. In equilibrium, it is also equal to
(𝑀𝑀𝑑𝑑 ⁄𝑃𝑃)/π‘Œπ‘Œ, which is real money demand relative to real GDP π‘Œπ‘Œ. The scaling by GDP is done
to control for changes in the demand for money owing to transactions rising with GDP.
We see that M1 as a fraction of GDP followed a stable trend prior to the 1980s but has
experienced various shifts in the 1980s, 1990s, and 2000s. The right panel of the figure is a
scatterplot of 𝑀𝑀 𝑠𝑠 /(𝑃𝑃𝑃𝑃) against the nominal interest rate 𝑖𝑖, which should show the
downward-sloping real money demand curve scaled by GDP. However, the plot indicates
this relationship has been unstable.
Figure 6.12: Demand for money according to the M1 measure in the USA
The exercise is repeated for the broader M2 measure of the US money supply in Figure
6.13. The time series in the left panel suggests the demand for M2 (relative to GDP) has
212
EC2065 Macroeconomics | Chapter 6: Money
been more stable than M1. The scatterplot in the right panel comes closer to tracing out
something that resembles a negative relationship between real money demand (scaled by
GDP) and the nominal interest rate 𝑖𝑖, though this relationship still appears to shift at some
points in time.
Overall, the evidence presented here suggests we cannot be confident that regulating the
money-supply growth rate will give tight control over inflation.
6.8 Money and public finance
Governments derive a fiscal advantage from being able to issue fiat money that is
demanded by the private sector. Unlike bonds, there is no obligation to ‘repay’ or redeem
fiat money. Furthermore, money may pay no interest (π‘–π‘–π‘šπ‘š = 0), or pay a lower rate of
interest than bonds (π‘–π‘–π‘šπ‘š < 𝑖𝑖). These fiscal gains from issuing money are often referred to as
the ‘seigniorage’ revenue of the government. They represent an implicit tax on holders of
money. This section looks at how to quantify the fiscal gains that arise from different
monetary policies.
Figure 6.13: Demand for money according to the M2 measure in the USA
6.8.1 Seigniorage: ‘printing money’
If the money supply is growing at a rate πœ‡πœ‡ then an amount of new money πœ‡πœ‡πœ‡πœ‡ is created
each time. If it were directly used to finance government expenditure, this seigniorage
revenue would be worth πœ‡πœ‡πœ‡πœ‡⁄𝑃𝑃 in real terms. Using πœ‹πœ‹ = πœ‡πœ‡ that results from money-market
equilibrium in Section 6.7, the real amount of seigniorage is πœ‹πœ‹πœ‹πœ‹⁄𝑃𝑃.
This is simplest and most direct measure of seigniorage as the fiscal advantage that comes
from ‘printing money’. But this calculation ignores the saving of regular interest payments
on past spending that has been financed in this way rather than by issuing bonds. Moreover,
most central banks are not in the business of directly financing government expenditure.
What if – the usual case – the central bank is buying assets with newly created money?
213
EC2065 Macroeconomics | Chapter 6: Money
6.8.2 Seigniorage: central-bank investment income
Assume money pays no interest and all money created by the central bank has been used to
buy nominal bonds. The central bank holds bonds of monetary value 𝑀𝑀 that matches exactly
the existing supply of money 𝑀𝑀. In this case, the central bank earns interest 𝑖𝑖𝑖𝑖 in each
period and, ignoring resource costs of creating money and any operating costs, these are
profits that can be paid out to the finance ministry. Real seigniorage revenues are 𝑖𝑖𝑖𝑖/𝑃𝑃,
which represents a flow of revenue received by the government in each period. Note that
this is different from the seigniorage measure based on the real value of the increase in the
money supply.
6.8.3 A general definition of seigniorage
Even if the central bank does not buy assets, the central-bank investment income definition
of seigniorage still accurately represents the fiscal advantage derived from steady growth in
the money supply. The government reduces the cost of financing public expenditure by
creating money rather than issuing interest-bearing bonds. The size of this advantage can be
calculated as the real quantity of money 𝑀𝑀/𝑃𝑃 in circulation multiplied by the difference in
the returns on bonds and money, which is the nominal interest rate 𝑖𝑖 when money does not
pay interest. Seigniorage then simply represents money being less good as a store of value
than bonds, which is an advantage from the perspective of the issuer of money.
With 𝑖𝑖 = π‘Ÿπ‘Ÿ + πœ‹πœ‹, and πœ‹πœ‹ = πœ‡πœ‡ in equilibrium with a constant money-supply growth rate πœ‡πœ‡, the
central-bank profits definition of seigniorage can be broken down into:
𝑖𝑖𝑖𝑖 π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ πœ‹πœ‹πœ‹πœ‹ π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ πœ‡πœ‡πœ‡πœ‡
=
+
=
+
𝑃𝑃
𝑃𝑃
𝑃𝑃
𝑃𝑃
𝑃𝑃
This is the saving of real interest payments on bonds otherwise issued plus the erosion of
existing money’s real value due to new money being created.
6.8.4 Limits on real seigniorage revenues
As seigniorage arises from money being less good a store of value than other assets, it is an
implicit tax on money. Seigniorage is closely related to the notion of forgone interest we
saw in the analysis of money demand in Section 6.4 and is essentially identical to the total
amount of forgone interest on money.
If money becomes a worse store of value because the nominal interest rate 𝑖𝑖 is higher then
real money demand 𝑀𝑀𝑑𝑑 ⁄𝑃𝑃 = 𝐿𝐿(π‘Œπ‘Œ, 𝑖𝑖) falls. Real seigniorage revenues are 𝑖𝑖𝑖𝑖⁄𝑃𝑃 = 𝑖𝑖𝑖𝑖(π‘Œπ‘Œ, 𝑖𝑖),
so there are two conflicting effects of higher 𝑖𝑖. First, the direct effect of money being worse
as a store of value. Second, the indirect effect of falling real money demand reducing the
real value of seigniorage. This means the relationship between real seigniorage revenues
and 𝑖𝑖 is not unambiguously positive. Observe that seigniorage is zero if 𝑖𝑖 = 0 and becomes
zero again for high 𝑖𝑖 if real money demand 𝐿𝐿(π‘Œπ‘Œ, 𝑖𝑖) falls towards zero sufficiently fast as 𝑖𝑖
increases. This gives rise to a Laffer curve for real seigniorage revenues as shown in Figure
6.14, indicating there are limits on the amount of real seigniorage government can obtain.
6.8.5 The inflation tax
If there is an unpredictable increase in the inflation rate πœ‹πœ‹, the nominal interest rate 𝑖𝑖 on
bonds cannot rise to leave the real return π‘Ÿπ‘Ÿ unchanged. An inflation surprise thus reduces
214
EC2065 Macroeconomics | Chapter 6: Money
both the real value of nominal government bonds as well as existing money. This is reflected
in the ex-post real interest rate π‘Ÿπ‘Ÿ being less than the ex-ante real interest rate π‘Ÿπ‘Ÿ 𝑒𝑒 .
The fiscal advantage derived from such surprise inflation is referred to here as an ‘inflation
tax’. A different term is used because the mechanism through which the inflation tax works
is distinct from the source of seigniorage revenue discussed earlier. With the definitions
adopted here, seigniorage revenues derive only from money not bonds and do not depend
on inflation being a surprise. In contrast, the inflation tax depends on inflation that was
unexpected when nominal bonds were first issued. This means that, ex post, the inflation
tax does not have any incentive effects on behaviour – like a lump-sum tax – because it is
completely unexpected.
There is no inflation tax on the real value of nominal bonds when the inflation is anticipated.
In this case, the real return is protected from expected inflation when the nominal interest
rate 𝑖𝑖 adjusts in advance. Inflation-indexed bonds are also protected against surprise
inflation and offer a guaranteed real return.
Figure 6.14: Seigniorage Laffer curve
6.8.6 The government budget constraint and Ricardian equivalence
In spite of the fiscal advantage that governments can derive from issuing money – a form of
‘soft default’ on government debt – a government ‘budget constraint’ still holds once
seigniorage and the inflation tax are counted alongside other more conventional sources of
tax revenue. Moreover, seigniorage and the inflation tax also show up in households’
budget constraints alongside explicit taxes because of they bear the losses from forgone
interest and the erosion of the real value of nominal bonds by surprise inflation.
If the economy has a representative household, it is possible to combine the household and
government budget constraints in the way seen in Section 4.1. The present value of
government expenditure ultimately determines the present value of tax revenue from all
sources, including seigniorage and the inflation tax. However, Ricardian equivalence fails
because seigniorage is effectively a tax that distorts incentives as seen in Section 6.5.
215
EC2065 Macroeconomics | Chapter 6: Money
6.9 Does monetary policy matter?
This section asks whether it matters what monetary policy is chosen by the central bank or
government. We know that monetary policy should affect nominal prices and inflation but
what effects are there, if any, on real variables such as GDP? Here, we answer this question
in the context of a model where the special feature of money is its role as a medium of
exchange. Money matters in different ways, in particular, through its unit of account
function, in the models with nominal rigidities we will see from Chapter 8.
We will consider two different changes to monetary policy:
•
•
A permanent change in the quantity of money in circulation
A permanent change in the growth rate of the supply of money in circulation.
6.9.1 A permanent change in the level of the money supply
Suppose there is an exogenous permanent change in money supply 𝑀𝑀 𝑠𝑠 = 𝑀𝑀. Since this is
exogenous, it is not a reaction to other events or shocks. We assume the change is
unexpected and that no repeat is expected in the future. This means the level of 𝑀𝑀 changes
but not its subsequent growth rate πœ‡πœ‡. As the future money supply 𝑀𝑀′ changes in the same
way as the current money supply 𝑀𝑀, a zero money-supply growth rate πœ‡πœ‡ = 0 is expected
subsequently because 𝑀𝑀′ = 𝑀𝑀. However, when the policy change is implemented, there is
still an unexpected change in the money supply 𝑀𝑀 relative to its past level.
Since the change in the money supply is the same in the present as in the future, the effects
on the equilibrium price levels 𝑃𝑃 and 𝑃𝑃′ are the same. This means that expected inflation
πœ‹πœ‹ = (𝑃𝑃′ − 𝑃𝑃)/𝑃𝑃 between now and the future period is πœ‹πœ‹ = πœ‡πœ‡ = 0. From the Fisher
equation we therefore conclude that 𝑖𝑖 = π‘Ÿπ‘Ÿ + πœ‹πœ‹ = π‘Ÿπ‘Ÿ. Although there is no further inflation
expected in the future, there can still be unexpected inflation/deflation of 𝑃𝑃 relative to the
past price level.
In what follows, we take the case of an increase in the money supply 𝑀𝑀 for illustration. We
will see that the model predicts this increase in 𝑀𝑀 has no real effects at all. This result is
shown in the supply-and-demand diagrams for the goods, labour and money markets
depicted in Figure 6.15. But what is the logic for this striking claim?
Figure 6.15: Permanent increase in money supply
216
EC2065 Macroeconomics | Chapter 6: Money
First, and most importantly, prices and wages expressed in units of money are fully flexible
here. With no impediments to price adjustment, the same real wage 𝑀𝑀 and real interest rate
π‘Ÿπ‘Ÿ can continue to ensure supply and demand are brought into equilibrium in the labour and
goods markets. Moreover, there is no money illusion – everyone’s decisions depend on
relative prices and real variables.
Second, the policy change does not affect perceptions of how good money is as a store of
value going forwards between the current and future time periods. Since 𝑖𝑖 = π‘Ÿπ‘Ÿ, there is no
change in the nominal interest rate 𝑖𝑖 unless π‘Ÿπ‘Ÿ changes. The nominal interest rate 𝑖𝑖 is a
measure of how bad money is as a store of value relative to other assets. This means no
greater tax on economic activity that depends on holding money is expected and, hence,
there is no reason for the labour and output supply curves to shift through the effect of 𝑖𝑖 on
the labour-supply condition 𝑀𝑀𝑀𝑀𝑀𝑀𝑙𝑙,𝐢𝐢 = 𝑀𝑀/(1 + 𝑖𝑖).
Third, while existing holdings of money and nominal government bonds are caught by a
surprise inflation tax that reduces the real value of households’ financial assets, the inflation
tax also allows the government to reduce other taxes and still pay for the same level of
public expenditure. These tax cuts offset the reduction in the value of financial assets and
there is no wealth effect overall on households – Ricardian equivalence holds after
accounting for the government budget constraint.
Thus, we conclude there are no reasons for any shifts of the 𝑁𝑁 𝑑𝑑 , 𝑁𝑁 𝑠𝑠 , π‘Œπ‘Œ 𝑑𝑑 , or π‘Œπ‘Œ 𝑠𝑠 curves.
Therefore, the equilibrium values of 𝑀𝑀 ∗ , π‘Ÿπ‘Ÿ ∗ , 𝑁𝑁 ∗ , and π‘Œπ‘Œ ∗ are unaffected. A permanent change
in the money supply has no real effects. No change in π‘Œπ‘Œ or 𝑖𝑖 means there is no shift of the
money demand curve 𝑀𝑀𝑑𝑑 . The rightward shift of the money supply curve 𝑀𝑀 𝑠𝑠 thus leads 𝑃𝑃 to
rise in proportion to 𝑀𝑀. Given these predictions, money is said to be ‘neutral’.
6.9.2 A permanent change in the growth rate of the money supply
Alternatively, suppose there is a permanent adjustment of the growth rate of the money
supply πœ‡πœ‡. This money-supply growth rate is defined by πœ‡πœ‡ = (𝑀𝑀′ − 𝑀𝑀)/𝑀𝑀, hence, the future
money supply is given by 𝑀𝑀′ = (1 + πœ‡πœ‡)𝑀𝑀. The change in πœ‡πœ‡ is exogenous, unexpected and no
further adjustments of πœ‡πœ‡ are expected. Note that there is no change in the initial money
supply 𝑀𝑀 here.
Since the policy change affects expectations of the future money supply, inflation
expectations πœ‹πœ‹ = (𝑃𝑃′ − 𝑃𝑃)/𝑃𝑃 adjust. As we have seen in Section 6.7, the effect on the
equilibrium current and future price levels 𝑃𝑃 and 𝑃𝑃′ is such that πœ‹πœ‹ = πœ‡πœ‡, so any changes in
money-supply growth are reflected one-for-one in changes in expected inflation.
Let us take the case of faster money growth for illustration. We will see that increasing the
money growth rate does have real effects. These are depicted in the supply-and-demand
diagrams in Figure 6.16.
217
EC2065 Macroeconomics | Chapter 6: Money
Figure 6.16: Permanent increase in money supply growth rate
The logic for the real effects is that higher money growth πœ‡πœ‡ raises expectations of future
inflation πœ‹πœ‹. The Fisher equation 𝑖𝑖 = π‘Ÿπ‘Ÿ + πœ‹πœ‹ then implies the nominal interest rate 𝑖𝑖 is higher
for each value of the real interest rate π‘Ÿπ‘Ÿ. From the labour-supply equation 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 =
𝑀𝑀/(1 + 𝑖𝑖), higher 𝑖𝑖 has a negative effect on labour supply. Intuitively, because money is a
worse store of value, the implicit tax on economic activity rises, which causes the supply of
labour to decline. Consequently, the output supply curve π‘Œπ‘Œ 𝑠𝑠 shifts to the left and, as this
change is permanent, 𝐢𝐢 𝑑𝑑 falls in line with income, leading to a leftward shift of π‘Œπ‘Œ 𝑑𝑑 of same
size as the shift of π‘Œπ‘Œ 𝑠𝑠 .
If a permanent change in the money supply growth were to have no effects on any real
variables then we would say that money is ‘superneutral’. The term ‘neutrality’ used earlier
refers to there being no real effects of a permanent change to the level of 𝑀𝑀. We see in
Figure 6.16 that the model predicts money is not superneutral. Permanently faster growth
of the money supply reduces real GDP and employment because money is less good as a
store of value. This inflationary policy has a negative real effect on the economy’s supply
side.
In the money market, there is no initial change in 𝑀𝑀, so no shift of the money supply curve
𝑀𝑀 𝑠𝑠 to begin with. The money demand curve 𝑀𝑀𝑑𝑑 pivots to the left as there are fewer
transactions due to lower GDP π‘Œπ‘Œ and more efforts to economise on holding money or make
use of money substitutes (higher 𝑋𝑋) because of higher 𝑖𝑖. This leads real money balances
𝑀𝑀/𝑃𝑃 to fall as 𝑀𝑀𝑑𝑑 /𝑃𝑃 is lower, which causes an immediate jump up in the level of prices 𝑃𝑃.
Box 6.3: Money supply increases that the central bank announces are
temporary
We have looked at the consequences for prices, inflation and real economic variables of
permanent changes to the quantity of money or the growth rate of the money supply. But
central banks might change the money supply temporarily in some circumstances.
For example, quantitative easing (QE) might increase the money supply but it is the central
bank’s stated intention to unwind the policy in the future. QE expansions of money
218
EC2065 Macroeconomics | Chapter 6: Money
supply have turned out to be persistent in most countries, although this may not have
been expected when they were first begun. There are cases where QE has been
temporary, such as the Bank of Japan QE policy from 2001, which was largely reversed in
2006. Another example of a temporary change is the ‘de-monetization’ experiment in
India in 2016, where there was a temporary decline of the money supply.
To see what difference it makes when a money-supply change is expected to be
temporary, suppose 𝑀𝑀 𝑠𝑠 = 𝑀𝑀 is expected to change for only one time period. Throughout,
we hold the expected future money supply 𝑀𝑀′ constant. Consequently, the equilibrium
future price level 𝑃𝑃′ does not change in this example. The Fisher equation 𝑖𝑖 = π‘Ÿπ‘Ÿ + πœ‹πœ‹ and
the definition of expected inflation πœ‹πœ‹ = (𝑃𝑃′ − 𝑃𝑃)/𝑃𝑃 imply that the nominal interest rate 𝑖𝑖
is:
𝑖𝑖 = π‘Ÿπ‘Ÿ +
𝑃𝑃′ − 𝑃𝑃
𝑃𝑃′
= π‘Ÿπ‘Ÿ + − 1
𝑃𝑃
𝑃𝑃
Money-market equilibrium is the equation 𝑀𝑀 = 𝑀𝑀 𝑠𝑠 = 𝑀𝑀𝑑𝑑 = 𝑃𝑃𝑃𝑃(π‘Œπ‘Œ, 𝑖𝑖), and, hence:
𝑀𝑀 = 𝑃𝑃𝑃𝑃 οΏ½π‘Œπ‘Œ, π‘Ÿπ‘Ÿ +
𝑃𝑃′
− 1οΏ½
𝑃𝑃
The key point to note is that a higher price level 𝑃𝑃 lowers the nominal interest rate 𝑖𝑖 here,
so the effect of 𝑃𝑃 on money demand is magnified. We ignore here any effect of 𝑖𝑖 on π‘Œπ‘Œ (but
accounting for that would further boost the impact of 𝑃𝑃 on 𝑀𝑀𝑑𝑑 ). In what follows, we
assume that nominal interest rate 𝑖𝑖 remains positive throughout. Figure 6.17 shows the
relationship between 𝑀𝑀𝑑𝑑 and 𝑃𝑃 in this case for given π‘Œπ‘Œ and π‘Ÿπ‘Ÿ and the relationship in the
case where changes in the money supply are permanent, in which case money demand
𝑀𝑀𝑑𝑑 = 𝑃𝑃𝑃𝑃(π‘Œπ‘Œ, π‘Ÿπ‘Ÿ) is proportional to 𝑃𝑃 and is thus represented by a straight line in the
diagram.
Figure 6.17: Temporary increase in money supply
219
EC2065 Macroeconomics | Chapter 6: Money
Following temporary increase in 𝑀𝑀, the money supply curve 𝑀𝑀 𝑠𝑠 shifts to the right as usual.
If the policy is expected to be reversed in future, any rise in the price level is also expected
to be reversed. Therefore, a higher price level 𝑃𝑃 would create expectations of future
deflation, reducing the nominal interest rate 𝑖𝑖 and boosting money demand. In the
diagram, 𝑀𝑀𝑑𝑑 is thus less steep than the usual 𝑃𝑃𝑃𝑃(π‘Œπ‘Œ, π‘Ÿπ‘Ÿ) money-demand function. It follows
that the price level rises by proportionately less than 𝑀𝑀 does, in contrast to the case of a
permanent change where 𝑃𝑃 rises in proportion to 𝑀𝑀.
This different prediction compared to the case of a permanent change in 𝑀𝑀 is likely to be
quantitatively significant. If there were a 25 per cent higher money supply temporarily and
𝑃𝑃 went up by 25 per cent initially then this would require 25 per cent expected deflation
subsequently. But that cannot be an equilibrium because 𝑖𝑖 ≥ 0 implies deflation cannot
exceed the much lower equilibrium value of the real interest rate π‘Ÿπ‘Ÿ. The price level 𝑃𝑃 must
therefore rise by far less than 25 per cent.
6.10 Optimal monetary policy and the costs of inflation
Focusing on money’s role as a medium of exchange, what should the central bank do if it
desires to make the economy run smoothly?
6.10.1 Economic efficiency
To answer this question, we need to know what the socially optimal level of economic
activity is. The marginal value of households’ time in terms of goods is the marginal rate of
substitution 𝑀𝑀𝑀𝑀𝑀𝑀𝑙𝑙,𝐢𝐢 between leisure and consumption. The economy’s ability to transform
households’ time into goods at margin, the marginal rate of transformation 𝑀𝑀𝑀𝑀𝑀𝑀𝑙𝑙,𝐢𝐢 , is given
by the marginal product of labour 𝑀𝑀𝑃𝑃𝑁𝑁 . Ignoring transaction costs, efficiency therefore
requires that 𝑀𝑀𝑀𝑀𝑁𝑁 = 𝑀𝑀𝑀𝑀𝑀𝑀𝑙𝑙,𝐢𝐢 .
In respect of transaction costs, we noted that there are costs of using substitutes for money,
or using time and effort to economise on holding money. Those resource costs are
represented by the function 𝑍𝑍(𝑋𝑋) and they constitute a social cost of carrying out
transactions. On the other hand, when holding money, the cost of forgone interest is not a
social cost because the government gains an equal amount of seigniorage revenues.
Forgone interest is simply a transfer from holders of money to issuers of money.
6.10.2 Monetary policy, efficiency, and the Friedman rule
We now consider how the choice of monetary policy affects the efficiency of the economy’s
equilibrium. The demand for labour 𝑁𝑁 𝑑𝑑 is given by 𝑀𝑀𝑀𝑀𝑁𝑁 = 𝑀𝑀 and labour supply 𝑁𝑁 𝑠𝑠 is
determined by 𝑀𝑀𝑀𝑀𝑀𝑀𝑙𝑙,𝐢𝐢 = 𝑀𝑀⁄(1 + 𝑖𝑖). This equation for 𝑁𝑁 𝑠𝑠 comes from wages being paid as
money that must be held for some period before it can be spent, as explained in Section 6.5.
When the labour market is in equilibrium (𝑁𝑁 𝑑𝑑 = 𝑁𝑁 𝑠𝑠 ), it follows that 𝑀𝑀𝑀𝑀𝑁𝑁 = 𝑀𝑀 =
(1 + 𝑖𝑖 )𝑀𝑀𝑀𝑀𝑀𝑀𝑙𝑙,𝐢𝐢 . A positive value of the nominal interest rate 𝑖𝑖 implies 𝑀𝑀𝑀𝑀𝑁𝑁 > 𝑀𝑀𝑀𝑀𝑀𝑀𝑙𝑙,𝐢𝐢 , which
corresponds to employment and output being inefficiently low relative to the optimal
outcome with 𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 .
220
EC2065 Macroeconomics | Chapter 6: Money
It is also possible to consider efficiency in respect of transaction costs. There is a demand
𝑋𝑋 𝑑𝑑 (𝑖𝑖 ) for substitutes for money that depends on the nominal interest rate 𝑖𝑖, which is the
opportunity cost of holding money. As explained in Section 6.4, the demand for 𝑋𝑋 is
increasing in the nominal interest rate 𝑖𝑖. The social cost of transactions is then 𝑍𝑍(𝑋𝑋 𝑑𝑑 (𝑖𝑖 )),
where forgone interest is not included directly because it is not a social cost. A positive
nominal interest rate 𝑖𝑖 implies 𝑋𝑋 𝑑𝑑 (𝑖𝑖 ) > 0 and 𝑍𝑍(𝑋𝑋 𝑑𝑑 (𝑖𝑖 )) > 0, which means that transaction
costs are inefficiently high.
Now suppose monetary policy is conducted so that 𝑖𝑖 = 0. This means there is no forgone
interest when holding money. Consequently, there is no incentive to find substitutes for
money, or incur costs in economising on holding money, so transaction costs are reduced to
zero, i.e. 𝑍𝑍(𝑋𝑋) = 0. Furthermore, there is no implicit tax on economic activity (work) that
depends on holding money because money is as good a store of value as other assets. This
means that 𝑀𝑀𝑀𝑀𝑁𝑁 = 𝑀𝑀𝑀𝑀𝑀𝑀𝑙𝑙,𝐢𝐢 , so employment and output are at their efficient levels. In
conclusion, the monetary policy 𝑖𝑖 = 0 ensures the economy’s equilibrium is efficient. This
policy of keeping the nominal interest rate at zero is known as the ‘Friedman rule’.
6.10.3 The optimal rate of inflation and the Friedman rule
To implement the Friedman rule, the central bank needs to set the money supply growth
rate πœ‡πœ‡ so that 𝑖𝑖 = 0 is achieved. The analysis in Section 6.7 shows that inflation and money
supply growth are equal, that is, πœ‹πœ‹ = πœ‡πœ‡. By using the Fisher equation 𝑖𝑖 = π‘Ÿπ‘Ÿ + πœ‹πœ‹, it follows
that πœ‡πœ‡ = −π‘Ÿπ‘Ÿ is required to implement the Friedman rule. A positive equilibrium real interest
rate π‘Ÿπ‘Ÿ (which is independent of monetary policy here) implies the required money growth
rate πœ‡πœ‡ is negative. Therefore, the central bank must reduce the nominal money supply over
time to implement the Friedman rule.
Since πœ‹πœ‹ = πœ‡πœ‡ = −π‘Ÿπ‘Ÿ < 0, the Friedman rule requires deflation. In other words, the inflation
rate needed for economic efficiency is negative. Higher rates of inflation πœ‹πœ‹, including zero or
positive rates, imply that 𝑖𝑖 is higher, which means the economy’s equilibrium is further away
from what is efficient. The Friedman rule thus provides a way to understand the costs of
inflation but also suggests that deflation is a good thing.
6.10.4 The fiscal implications of following the Friedman rule
Implementing the Friedman rule has fiscal implications because no seigniorage revenue is
received when 𝑖𝑖 = 0. Governments therefore need to find alternative sources of tax
revenue to continue to pursue their plans for public expenditure.
Another way to think of this is that the negative inflation rate required for the Friedman rule
implies there is a positive real return on non-interest-bearing money coming from its
purchasing power growing over time. This makes it as good a store of value as bonds with
real return π‘Ÿπ‘Ÿ. Since money offers the same return as bonds, money is effectively being
treated by the government as a debt liability that must be repaid. The deflation that
supports the Friedman rule is achieved by buying back money to reduce its supply, which
works like repaying a debt. Money can only be repurchased by the central bank selling its
assets or using tax revenue transferred from the government. Hence, other taxes must rise
if government spending is to remain unchanged.
221
EC2065 Macroeconomics | Chapter 6: Money
Note that if other taxes create distortions, for example, income tax, then it may not be
optimal to follow the Friedman rule because this would replace one distortion (positive 𝑖𝑖)
with another distortion (a higher income tax rate). We have implicitly been assuming that
the lost seigniorage revenue could be replaced by lump-sum taxes.
Box 6.4: Hyperinflations
Hyperinflation refers to an extremely high rate of inflation. The exact definition is
arbitrary, but a threshold of ≥ 50 per cent inflation per month is conventional. Such high
rates of inflation occur with extremely fast money growth rates, causing nominal interest
rates to be very high as well.
Our analysis of the non-superneutrality of money in Section 6.9 provides a reason why
such high rates of inflation have a damaging effect on the real economy. As explained in
Section 6.5, inflation works as a tax on economic activity that depends on using money.
This effect was incorporated into our model through the equation 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 = 𝑀𝑀/(1 + 𝑖𝑖) for
labour supply and the Fisher equation 𝑖𝑖 = π‘Ÿπ‘Ÿ + πœ‹πœ‹. Section 6.10 shows that these negative
supply-side effects of inflation give rise to inefficiencies and constitute a social cost of
inflation.
While such effects are theoretically present even at single-digit rates of inflation, they are
likely to be very small in that case. The implicit tax rate on economic activity coming from
inflation is approximately equal to the nominal interest rate 𝑖𝑖 and, to be precise, this is the
nominal interest rate over the period of time people cannot avoid holding on to cash they
receive, not the annual nominal interest rate. Taking that period to be no more than a
month, single-digit annual inflation rates would not generate an implicit tax rate of more
than 1 per cent, resulting in a generally small social cost. However, in a hyperinflation with
monthly inflation rates above 50 per cent, it is easy to see that the implicit tax rate can be
very high, even if people try to shorten the period over which they hold on to cash they
receive. Hence, the social cost of inflation through this mechanism is far larger in a
hyperinflation.
If hyperinflation has such serious negative effects on economy, why do governments
choose very fast money supply growth? We have seen in Section 6.8 that governments
derive a fiscal advantage from money creation (‘seigniorage’). If there is a sudden, large
change in public expenditure needs, for example, a war, seigniorage provides a quick
source of extra revenue for governments without having to adjust explicit tax rates.
However, as inflation and the nominal interest rate rise, the demand for real money
balances declines. This implies real seigniorage revenues are limited, as shown in the
seigniorage Laffer curve in Figure 6.14.
If the government’s fiscal needs in an emergency exceed the top of the seigniorage Laffer
curve then attempts to raise further seigniorage revenues may lead to explosive rates of
money growth and accelerating inflation as real seigniorage revenues fall short
222
EC2065 Macroeconomics | Chapter 6: Money
of the government’s needs. Thus, a hyperinflation can easily spiral out of control with
severe consequences for the economy unless the government can reduce its expenditure
or find alternative sources of tax revenue.
Box 6.5: Cash and tax evasion
An important feature of money in the form of physical cash is its anonymity. Unlike most
other assets, there is no register of ownership or necessarily any record of transactions in
cash. Thus, cash is sometimes described as a ‘bearer bond’, meaning the owner is deemed
to be whoever has physical possession of the asset. The anonymity of cash makes it ideal
to evade taxes, including taxes on purchases, income, or wealth. Part of the demand for
cash therefore comes from its tax-evasion advantages relative to other assets.
Our analysis of the demand for money in Section 6.4 was based on a comparison of
benefits and costs. The marginal cost of holding higher real money balances 𝑀𝑀𝑑𝑑 /𝑃𝑃 is 𝑖𝑖, the
forgone interest on bonds. The marginal benefit of higher 𝑀𝑀𝑑𝑑 /𝑃𝑃 was π‘žπ‘ž = 𝑍𝑍 ′ (𝑋𝑋)
previously. Now, a marginal value 𝑒𝑒 of the tax-evasion advantage of money is added to this
and the overall marginal benefit of higher 𝑀𝑀𝑑𝑑 /𝑃𝑃 is π‘žπ‘ž + 𝑒𝑒.
The demand for cash is found where the marginal benefit equals the marginal cost, i.e.
where 𝑖𝑖 = π‘žπ‘ž + 𝑒𝑒 = 𝑍𝑍 ′ (𝑋𝑋) + 𝑒𝑒. This is equivalent to the equation 𝑍𝑍 ′ (𝑋𝑋) = 𝑖𝑖 − 𝑒𝑒, so the
nominal interest rate 𝑖𝑖 in the usual money demand function 𝑀𝑀𝑑𝑑 /𝑃𝑃 = 𝐿𝐿(π‘Œπ‘Œ, 𝑖𝑖) is replaced by
𝑖𝑖 − 𝑒𝑒 when cash has a tax evasion advantage. The new money demand function 𝑀𝑀𝑑𝑑 /𝑃𝑃 =
𝐿𝐿(π‘Œπ‘Œ, 𝑖𝑖 − 𝑒𝑒) is higher at each interest rate 𝑖𝑖, so there is a rightward shift of 𝑀𝑀𝑑𝑑 plotted
against 𝑖𝑖 as shown in Figure 6.18.
While individuals might gain from the use of cash for tax evasion, the marginal private
benefit 𝑒𝑒 is not a social benefit of using money. When tax evasion occurs, other taxes need
to be higher to pay for public expenditure. Furthermore, cash also facilitates criminal
activity, imposing negative externalities on others. These considerations mean it is not
desirable for monetary policy to maximise individuals’ use of cash in the way that it would
if the Friedman rule were followed.
223
EC2065 Macroeconomics | Chapter 6: Money
Figure 6.18: Money demand with tax evasion
Since the only social benefit of money is in reducing transaction costs π‘žπ‘ž = 𝑍𝑍′(𝑋𝑋), optimal
monetary policy should aim to push π‘žπ‘ž to zero, ignoring both the private cost of money in
terms of forgone interest 𝑖𝑖 (not a social cost) and the private benefit 𝑒𝑒 when money is
used for tax evasion (not a social benefit). With 𝑖𝑖 = π‘žπ‘ž + 𝑒𝑒 in equilibrium, this suggests
monetary policy should aim for 𝑖𝑖 = 𝑒𝑒 > 0, a positive nominal interest rate 𝑖𝑖 and a higher
rate of inflation πœ‹πœ‹ (or less deflation) than what is implied by the Friedman rule. By
following this policy, the implicit tax on money through 𝑖𝑖 > 0 cancels out the tax evasion
advantage. Money being a worse store of value makes tax evasion harder and provides a
way to tax illegal activities.
Technological advances have made transactions by debit card and bank transfers much
cheaper and easier, even for small payments. As the payments system now enables bank
deposits to be used for almost all transactions, it has been suggested that physical cash
can and should be abolished. Since bank deposits lack the anonymity of cash, eliminating
cash would help to reduce tax evasion and criminal activity. But some argue that cash
might occasionally offer greater convenience and, more importantly, the anonymity
offered by cash could be a desirable feature in preserving individuals’ privacy and civil
liberties.
6.11 Conducting monetary policy by setting interest rates
Monetary policy has so far been described as an exogenous level, or growth rate, of the
money supply. The instrument of monetary policy was the quantity of money and the target
for monetary policy was a monetary target.
However, the ultimate objective of monetary policy is usually not the money supply itself
but control of interest rates, inflation, or other macroeconomic variables. Moreover, central
banks are often described as setting interest rates rather than setting the money supply.
And we know from Box 6.2 that a fixed money supply target does not achieve price stability
224
EC2065 Macroeconomics | Chapter 6: Money
if money demand is unstable, so a monetary target might not be desirable with these other
objectives in mind. This section explores how monetary policy works if the central bank uses
an interest rate as its operating target.
Rather than setting the money supply 𝑀𝑀 at some target level, the central bank now has a
target for the nominal interest rate 𝑖𝑖. However, market interest rates are not directly
controlled by the central bank, so it needs to vary some policy instrument under its control
to achieve its interest rate target. We assume this means the use of open-market
operations. The central bank must be willing to increase or decrease 𝑀𝑀 so that the money
market is in equilibrium at its target for the nominal interest rate 𝑖𝑖. This makes the supply of
money 𝑀𝑀 become an endogenous variable determined by the equation:
𝑀𝑀
= 𝐿𝐿(π‘Œπ‘Œ, 𝑖𝑖 )
𝑃𝑃
What are the implications of conducting monetary policy in this way? We will focus here on
the determination of prices and inflation, assuming that monetary policy does not affect
real GDP π‘Œπ‘Œ or the real interest rate π‘Ÿπ‘Ÿ.
The Fisher equation is 𝑖𝑖 = π‘Ÿπ‘Ÿ + πœ‹πœ‹ ′𝑒𝑒 , where πœ‹πœ‹ ′𝑒𝑒 = (𝑃𝑃′𝑒𝑒 − 𝑃𝑃)/𝑃𝑃 explicitly denotes expected
inflation between the current and future time periods, which might not be the same as the
realised inflation rate. In equilibrium, expected inflation must be
πœ‹πœ‹ ′𝑒𝑒 = 𝑖𝑖 − π‘Ÿπ‘Ÿ
Given an exogenous target for 𝑖𝑖 and an equilibrium for π‘Ÿπ‘Ÿ that is independent of monetary
policy, this equation determines a unique equilibrium for expected inflation πœ‹πœ‹ ′𝑒𝑒 . A low
nominal rate 𝑖𝑖 is associated with low expected inflation in equilibrium. But what about the
absolute level of prices 𝑃𝑃 and the inflation rate between the past and current period?
With an endogenous money supply 𝑀𝑀, there are many possible price levels 𝑃𝑃 consistent
with 𝑀𝑀⁄𝑃𝑃 = 𝐿𝐿(π‘Œπ‘Œ, 𝑖𝑖). This is because 𝑀𝑀 adjusts to ensure this equation holds at the central
bank’s target level of 𝑖𝑖. It follows that setting a target for interest rates does not by itself
give the economy a ‘nominal anchor’ – the level of prices 𝑃𝑃 in terms of money is
indeterminate. In practice, what this means is that a fixed 𝑖𝑖 policy does not rule out
unexpected fluctuations in inflation, even though expected inflation is determinate.
A traditional monetary target does provide a nominal anchor in the sense of there being a
unique equilibrium for the levels of prices and inflation. Given a demand for real money
balances, an exogenous nominal quantity of money means there is only one possible level of
prices in equilibrium. But we have seen that a fixed money supply not desirable if real
money demand fluctuates. Moving away from targeting the money supply is desirable but
simply setting a fixed target for the nominal interest rate has pitfalls. The way forward is to
consider an alternative approach to monetary policy, the use of an interest-rate feedback
rule, the most famous example of which being the Taylor rule.
225
EC2065 Macroeconomics | Chapter 6: Money
6.12 Taylor rules and the Taylor principle
The Taylor rule is an example of what is known as an interest-rate feedback rule. Instead of
an exogenous money supply or interest rate target, a feedback rule has the central bank
actively adjust the interest rate it sets to meet an objective. Assume here that an inflation
target πœ‹πœ‹ ∗ is the central bank’s only objective.
A simple version of the Taylor rule is given in the following equation:
𝑖𝑖 = π‘Ÿπ‘ŸΜ‚ + πœ‹πœ‹ ∗ + πœ™πœ™(πœ‹πœ‹ − πœ‹πœ‹ ∗ )
This describes how the central bank sets the nominal interest rate 𝑖𝑖. According to the Taylor
rule, the level of 𝑖𝑖 should depend on the actual rate of inflation πœ‹πœ‹ that occurs between the
past and current period. The coefficient πœ™πœ™ indicates how strongly the central bank reacts to
inflation πœ‹πœ‹ missing its target πœ‹πœ‹ ∗ . If πœ‹πœ‹ is one percentage point higher then 𝑖𝑖 is raised by πœ™πœ™
percentage points. The special case of πœ™πœ™ = 0 represents an exogenous interest rate target
that is not adjusted to the actual inflation rate. Finally, the term π‘Ÿπ‘ŸΜ‚ denotes the central
bank’s estimate of the equilibrium real interest rate π‘Ÿπ‘Ÿ.
It is argued that a sufficiently strong response to inflation, as measured by the parameter πœ™πœ™,
ensures the equilibrium inflation rate πœ‹πœ‹ is determinate (and will be on target πœ‹πœ‹ ∗ if the
estimate of π‘Ÿπ‘Ÿ is correct). To be precise, a sufficiently strong response means that πœ™πœ™ > 1, so
the nominal interest rate reacts more than one-for-one to inflation. This is known as the
‘Taylor principle’.
To see the argument, we combine the Fisher equation 𝑖𝑖 = π‘Ÿπ‘Ÿ + πœ‹πœ‹ ′𝑒𝑒 written explicitly in terms
of expected inflation πœ‹πœ‹ ′𝑒𝑒 = (𝑃𝑃′𝑒𝑒 − 𝑃𝑃)/𝑃𝑃 between the current and future periods and the
Taylor rule 𝑖𝑖 = π‘Ÿπ‘ŸΜ‚ + πœ‹πœ‹ ∗ + πœ™πœ™(πœ‹πœ‹ − πœ‹πœ‹ ∗ ). By eliminating 𝑖𝑖 from these equations:
π‘Ÿπ‘Ÿ + πœ‹πœ‹ ′𝑒𝑒 = π‘Ÿπ‘ŸΜ‚ + πœ‹πœ‹ ∗ + πœ™πœ™(πœ‹πœ‹ − πœ‹πœ‹ ∗ ) = π‘Ÿπ‘ŸΜ‚ + πœ‹πœ‹ + (πœ™πœ™ − 1)(πœ‹πœ‹ − πœ‹πœ‹ ∗ )
This can be rearranged to write an equation for the expected change in the inflation rate
over time:
πœ‹πœ‹ ′𝑒𝑒 − πœ‹πœ‹ = (πœ™πœ™ − 1)(πœ‹πœ‹ − πœ‹πœ‹ ∗ ) − (π‘Ÿπ‘Ÿ − π‘Ÿπ‘ŸΜ‚ )
The Taylor principle πœ™πœ™ > 1 implies the coefficient on πœ‹πœ‹ − πœ‹πœ‹ ∗, the extent to which inflation πœ‹πœ‹
misses its target πœ‹πœ‹ ∗ , is positive. Consequently, higher inflation now would mean that
inflation is expected to rise faster in the future and lower inflation now would mean that
subsequent inflation is expected to fall faster. It follows that there is only one value of
inflation πœ‹πœ‹ where subsequent inflation is not expected to keep rising or keep falling
(although this argument ignores the lower bound on 𝑖𝑖 as explained later in Box 6.6).
If πœ‹πœ‹ ′𝑒𝑒 = πœ‹πœ‹, so inflation is neither expected to rise or fall further in future, then:
πœ‹πœ‹ = πœ‹πœ‹ ∗ +
π‘Ÿπ‘Ÿ − π‘Ÿπ‘ŸΜ‚
πœ™πœ™ − 1
This is the unique stable equilibrium for inflation when πœ™πœ™ > 1. If the Taylor principle πœ™πœ™ > 1
is not satisfied then there are many stable paths of inflation over time that are equally
226
EC2065 Macroeconomics | Chapter 6: Money
consistent with the interest-rate rule and equilibrium in the economy. We see from the
equation that inflation is on target if the central bank’s estimate π‘Ÿπ‘ŸΜ‚ of the equilibrium real
interest rate π‘Ÿπ‘Ÿ is correct. Underestimating this (π‘Ÿπ‘ŸΜ‚ < π‘Ÿπ‘Ÿ) leads to inflation above the target
(πœ‹πœ‹ > πœ‹πœ‹ ∗ ).
Our analysis of the Taylor rule suggests that central banks wanting to meet an inflation
target have two key tasks. First, to make a strong reaction to any deviation of inflation from
its target so that the Taylor principle (πœ™πœ™ > 1) is satisfied. Second, to obtain an accurate
estimate of market-clearing real interest rate π‘Ÿπ‘Ÿ.
6.13 The liquidity trap and the zero lower bound
In this section we will study two important limitations on the power of monetary policy.
6.13.1 The zero lower bound
The zero lower bound is the claim that the nominal interest rate 𝑖𝑖 on bonds cannot be
negative, so the equilibrium of the economy always features 𝑖𝑖 ≥ 0. The logic for this comes
from the money demand trade-off analysed in Section 6.4. Holding more money has the
cost of forgoing interest when 𝑖𝑖 is positive. On the other hand, money is useful for making
payments and holding more of it avoids the costs of using substitutes for money, or of
managing to carry out transactions while holding only a small amount of money on average.
The marginal benefit of holding an extra unit of real money balances is represented by the
cost reduction 𝑍𝑍′(𝑋𝑋).
The marginal benefit 𝑍𝑍′(𝑋𝑋) cannot be negative but can be zero if 𝑋𝑋 has already reached 0. If
the nominal interest rate 𝑖𝑖 on bonds were negative, this would mean that cash is a better
store of value than bonds, while also having a non-negative benefit relative to bonds in
saving transaction costs. Hence, 𝑖𝑖 < 0 would imply money is always preferred to bonds,
which is not possible in equilibrium because then there would be no demand for bonds.
6.13.2 The liquidity trap
The liquidity trap is the idea that money and bonds become perfect substitutes at the
margin once the lower bound on nominal interest rates is reached. It implies increases in
the quantity of money might be passively absorbed with no impact on the economy.
At the zero lower bound 𝑖𝑖 = 0, no interest is forgone by holding more money. Furthermore,
once holdings of real money balances 𝑀𝑀𝑑𝑑 /𝑃𝑃 reach π‘Œπ‘Œ (the amount needed to make all
payments using money without incurring any transaction costs), the marginal benefit of
lowering transaction costs 𝑍𝑍(𝑋𝑋) by holding more money is zero. With a zero marginal cost
and a zero marginal benefit, households and firms are indifferent about whether they hold
higher 𝑀𝑀𝑑𝑑 /𝑃𝑃 or not. But 𝑀𝑀𝑑𝑑 /𝑃𝑃 can be larger than π‘Œπ‘Œ at 𝑖𝑖 = 0 because money has become as
good a store of value as bonds. Consequently, money demand 𝑀𝑀𝑑𝑑 is perfectly interest
elastic at the interest-rate lower bound 𝑖𝑖 = 0.
Using the same framework as in Box 6.3, we see that a temporary expansion of 𝑀𝑀 𝑠𝑠 has no
effect on either the nominal interest rate 𝑖𝑖 or the price level 𝑃𝑃 once the zero lower bound is
reached. In the left panel of Figure 6.19, the demand for real money balances is perfectly
interest elastic (horizonal) at 𝑖𝑖 = 0. The money market is still in equilibrium at 𝑖𝑖 = 0 after a
227
EC2065 Macroeconomics | Chapter 6: Money
shift of 𝑀𝑀 𝑠𝑠 /𝑃𝑃 to the right with higher 𝑀𝑀 𝑠𝑠 (assuming no change in 𝑃𝑃 for now). This additional
money is willingly held with no change in the interest rate. To confirm that the price level 𝑃𝑃
does not change either, the right panel of the figure shows the money demand function
𝑀𝑀𝑑𝑑 = 𝑃𝑃𝑃𝑃(π‘Œπ‘Œ, π‘Ÿπ‘Ÿ + (𝑃𝑃′ ⁄𝑃𝑃) − 1), which combines the Fisher equation 𝑖𝑖 = π‘Ÿπ‘Ÿ + πœ‹πœ‹ = π‘Ÿπ‘Ÿ + (𝑃𝑃′ −
𝑃𝑃)/𝑃𝑃 with 𝑀𝑀𝑑𝑑 ⁄𝑃𝑃 = 𝐿𝐿(π‘Œπ‘Œ, 𝑖𝑖). Taking the future price level 𝑃𝑃′ as given (because the change in
𝑀𝑀 𝑠𝑠 is only temporary), money demand is perfectly elastic with respect to the price level 𝑃𝑃
once 𝑖𝑖 = 0 is reached. The horizontal demand curve implies that the rightward of the supply
curve 𝑀𝑀 𝑠𝑠 does not change the equilibrium price level.
Figure 6.19: Money demand at zero nominal interest rate
It is important to note this argument does not apply to permanent expansions of the money
supply 𝑀𝑀 𝑠𝑠 where the central bank can convince people that it will never reverse the policy
change. This should affect expectations of future prices 𝑃𝑃′, which would shift 𝑀𝑀𝑑𝑑 =
𝑃𝑃𝑃𝑃(π‘Œπ‘Œ, π‘Ÿπ‘Ÿ + (𝑃𝑃′ ⁄𝑃𝑃) − 1), resulting in a change of 𝑃𝑃 or 𝑖𝑖 or both.
Box 6.6: A deflation trap
We have seen in Section 6.12 that if a central bank aims to control inflation by setting the
nominal interest rate 𝑖𝑖 then it is important to satisfy the ‘Taylor principle’, a more than
one-for-one adjustment of 𝑖𝑖 to current inflation πœ‹πœ‹. However, if 𝑖𝑖 is subject to the zero
lower bound, it may not be possible to cut 𝑖𝑖 sufficiently when inflation is significantly
below target. This means the Taylor principle cannot be satisfied for all rates of inflation πœ‹πœ‹.
Therefore, a monetary policy based on setting the nominal interest rate has the risk that
the economy falls into a ‘deflation trap’ where inflation is persistently negative.
The central bank is assumed to follow the Taylor rule where possible, setting 𝑖𝑖 = π‘Ÿπ‘Ÿ + πœ‹πœ‹ ∗ +
πœ™πœ™(πœ‹πœ‹ − πœ‹πœ‹ ∗ ) if this results in 𝑖𝑖 ≥ 0, or 𝑖𝑖 = 0 otherwise. We assume here that the equilibrium
real interest rate π‘Ÿπ‘Ÿ > 0 is known to central bank. We suppose the inflation target is πœ‹πœ‹ ∗ ≥
0, which means a target for price stability or a positive inflation rate. We have not yet seen
any reasons why it is desirable to have πœ‹πœ‹ ∗ ≥ 0, but that will be covered in Section 9.5. The
Taylor rule has πœ™πœ™ > 1, so the central bank aims to satisfy the Taylor principle.
228
EC2065 Macroeconomics | Chapter 6: Money
With πœ™πœ™ > 1, the Taylor rule equation implies 𝑖𝑖 = 0 is reached for some inflation rate πœ‹πœ‹
lying between −π‘Ÿπ‘Ÿ and πœ‹πœ‹ ∗ , which is where the zero lower bound becomes binding. We can
study what happens to inflation in equilibrium by combing the Fisher equation 𝑖𝑖 = π‘Ÿπ‘Ÿ + πœ‹πœ‹ ′𝑒𝑒
in terms of expected future inflation πœ‹πœ‹ ′𝑒𝑒 with the Taylor rule as was done in Section 6.12.
This leads to πœ‹πœ‹ ′𝑒𝑒 − πœ‹πœ‹ = (πœ™πœ™ − 1)(πœ‹πœ‹ − πœ‹πœ‹ ∗ ) if 𝑖𝑖 ≥ 0 as seen in the earlier analysis of Taylor
rules. But it leads to πœ‹πœ‹ ′𝑒𝑒 − πœ‹πœ‹ = −π‘Ÿπ‘Ÿ − πœ‹πœ‹ if 𝑖𝑖 = 0 because the Fisher equation implies πœ‹πœ‹ ′𝑒𝑒 =
−π‘Ÿπ‘Ÿ at the interest-rate lower bound.
The relationship between the expected change in inflation πœ‹πœ‹ ′𝑒𝑒 − πœ‹πœ‹ and the current
inflation rate πœ‹πœ‹ is plotted in Figure 6.20. The upward-sloping segment at the right of the
diagram where the Taylor principle is satisfied has already been discussed in Section 6.12
and the inflation target πœ‹πœ‹ ∗ is a steady state. The downward-sloping segment on the left is
where the lower bound binds and the Taylor principle cannot be satisfied.
Figure 6.20: Multiple equilibria with a deflation trap
The diagram shows that there are paths of inflation falling below the target πœ‹πœ‹ ∗ where πœ‹πœ‹
eventually becomes negative and converges to another steady state at πœ‹πœ‹ = −π‘Ÿπ‘Ÿ. This
second steady state is the ‘deflation trap’ that the Taylor rule fails to avoid.
6.14 Negative nominal interest rates
The models of money we have seen in this chapter predict that bonds cannot have a
negative nominal interest rate in equilibrium. But instances of negative nominal interest
rates have been observed, for example, in some eurozone countries from around 2016.
Does this mean we missing something important about money from our model?
6.14.1 Is there a lower bound on nominal interest rates?
The logic of our earlier argument for why 𝑖𝑖 < 0 should be impossible is that a negative
nominal return on bonds would lead wealth held in the form of bonds to be switched into
cash to receive the guaranteed zero nominal return (π‘–π‘–π‘šπ‘š = 0) on cash. Note that only cash by
its nature necessarily offers a zero nominal return; other forms of money in electronic
229
EC2065 Macroeconomics | Chapter 6: Money
accounts, such as reserves held at the central bank, could in principle have π‘–π‘–π‘šπ‘š < 0.
However, as long as physical cash exists as a form of money, its zero nominal return is
always available to investors.
We will argue the key point missing here is that switching large amounts of wealth into
physical cash would entail security costs of keeping it safe. Unlike bonds and most assets,
there no register of ownership of physical cash – this is why cash gives anonymity – and so
holders of cash must consider the cost of keeping this physical object secure. Assume that
holding cash entails a security or storage cost that is a proportion β„Ž of the amount of cash
held. This is a resource cost of holding cash, in addition to forgone interest 𝑖𝑖 (if any).
Consider again the money demand trade-off from Section 6.4. Raising 𝑋𝑋 to reduce the
amount of cash held and increase bond holdings leads to a gain of 𝑖𝑖 + β„Ž per unit increase in
𝑋𝑋, saving security costs plus any forgone interest. Raising 𝑋𝑋 by one unit has a marginal cost
π‘žπ‘ž = 𝑍𝑍 ′ (𝑋𝑋), so the optimal 𝑋𝑋 is found where 𝑖𝑖 + β„Ž = π‘žπ‘ž = 𝑍𝑍′(𝑋𝑋), not where 𝑖𝑖 = 𝑍𝑍′(𝑋𝑋) as
before. Since 𝑖𝑖 + β„Ž replaces 𝑖𝑖 but the equation is otherwise the same, it follows that the
usual money demand function 𝑀𝑀𝑑𝑑 /𝑃𝑃 = 𝐿𝐿(π‘Œπ‘Œ, 𝑖𝑖) becomes 𝑀𝑀𝑑𝑑 /𝑃𝑃 = 𝐿𝐿(π‘Œπ‘Œ, 𝑖𝑖 + β„Ž). As money
demand is decreasing in 𝑖𝑖, adding the positive value of β„Ž shifts 𝑀𝑀𝑑𝑑 downwards.
Geometrically, this is a parallel downward shift by an amount β„Ž of the real money demand
curve plotted against 𝑖𝑖, as shown in Figure 6.21.
Figure 6.21: Demand for cash with security cost
The diagram shows it is now possible to have an equilibrium with a negative nominal
interest rate 𝑖𝑖 < 0. Bonds with a negative nominal return are willingly held because the
security costs of switching to physical cash holdings are too large. Note that bonds
themselves are not subject to the same security problems of cash because there is a register
of ownership.
Although we can now understand why nominal interest rates can be negative, it turns out
that there is still a lower bound on 𝑖𝑖, only now a negative one. As can be seen from the
diagram, it is not possible to have 𝑖𝑖 be less than −β„Ž, that is, 𝑖𝑖 cannot be below the negative
of the security cost as percentage of the value of cash stored. The lower bound is therefore
230
EC2065 Macroeconomics | Chapter 6: Money
𝑖𝑖 ≥ −β„Ž. In practice, β„Ž should not be much more than 1 per cent for large amounts of
physical cash, which would suggest a lower bound on 𝑖𝑖 of approximately −1 per cent.
6.14.2 Costs of negative interest rates
Although our analysis has explained why a negative nominal interest rate 𝑖𝑖 is possible, it
does not explain why one would be desirable – and indeed the logic suggests that negative
nominal interest rates have social costs.
Consider what is optimal monetary policy here. There are resource costs of both holding
more cash (β„Ž) and using substitutes for cash or economising on cash holdings (𝑍𝑍′(𝑋𝑋)). All
means of payment therefore have a social cost. The marginal net social cost of lower 𝑋𝑋 (or
equivalently, higher 𝑀𝑀𝑑𝑑 /𝑃𝑃) is β„Ž − 𝑍𝑍 ′ (𝑋𝑋), so economic efficiency requires β„Ž = 𝑍𝑍′(𝑋𝑋). Since
𝑖𝑖 + β„Ž = 𝑍𝑍′(𝑋𝑋) in equilibrium, the Friedman rule 𝑖𝑖 = 0 therefore achieves efficiency. Note
that it is now efficient to have 𝑋𝑋 > 0, so cash is not used for all transactions. Having a
negative nominal interest rate 𝑖𝑖 < 0 has social costs because it leads to over-use of cash,
which wastes resources on security and storage costs, just as the earlier argument for the
Friedman rule pointed to the waste of resources in finding substitutes for cash when 𝑖𝑖 > 0.
6.14.3 Lowering the lower bound
The analysis suggests there is a negative lower bound on the nominal interest rate but one
not too far below zero in practice. What if governments want to lower interest rates further
below zero?
It is possible to envisage changes to the monetary system that would permit this. For
example, cash might have an ‘expiry date’, with an amount deducted if people need to
convert old, expired cash into new cash (what is sometimes known as a ‘Gesell tax’). If 𝜏𝜏 is
an explicit tax on holding cash, money demand 𝑀𝑀𝑑𝑑 is determined by the equation 𝑖𝑖 + 𝜏𝜏 =
π‘žπ‘ž = 𝑍𝑍′(𝑋𝑋). Analogous to having a security cost β„Ž, this implies the lower bound on 𝑖𝑖 is now
−𝜏𝜏, which can be lowered by raising 𝜏𝜏. Similar outcomes can be achieved by limits on
convertibility between physical cash and other forms of money such as reserves and bank
deposits. Finally, abolishing physical cash and moving to a system of purely electronic
money would remove the lower bound on nominal interest rates entirely.
None of this analysis explains why governments should do such things and points to
inefficiencies in making it harder or more costly to make use of money for transactions. In
Chapter 9, we see that there might be circumstances where macroeconomic stabilisation
policies work more effectively if the lower bound on nominal interest rates can be
circumvented.
231
EC2065 Macroeconomics | Chapter 7: Banking and finance
Chapter 7: Banking and finance
Our analysis of the supply of money in Chapter 6 assumed that all of the money supply was
fiat money created by a central bank. While a central bank is the monopoly supplier of its
currency and fiat money more broadly including reserves, this is not the only form of money
in use, nor even the most quantitatively important form of money in many countries.
This chapter investigates how the banking system affects the supply of money and how the
central bank interacts with commercial banks in setting monetary policy. The chapter also
considers the roles of banks and financial markets in the economy.
Essential reading
•
Williamson, Chapter 18.
7.1 Fractional reserve banking
In advanced economies, bank deposits are the type of money most prevalent in use for
transactions purposes. Bank deposits are a form of credit money. A deposit is a liability of
the commercial bank at which the account is held and this liability is a promise to repay fiat
money to the depositor on withdrawal. Unlike bonds or loans where there is a fixed date at
repayment falls due, many bank deposits can be transferred or withdrawn on demand. For
this reason, we refer to them as ‘demand deposits’ (held in ‘checking accounts’ or ‘current
accounts’, the terminology differing between countries). In what follows, we ignore savings
deposits (‘time deposits’) that are locked in for a definite period.
For households and firms to make use of demand deposits for payments, recipients must be
willing to treat such deposits as being as good as cash. This requires not only a right to
withdraw deposits and receive cash on demand but also the ability freely to transfer
deposits to payees by means of debit cards, bank transfers and cheques to other accounts,
including those held at other banks.
As demand deposits can be withdrawn or transferred, commercial banks themselves need
to hold fiat money. When there is a withdrawal, the bank must provide cash to the
depositor. When there is a payment made to an account at another bank, the payer’s bank
must provide an asset to payee’s bank for settlement of the transfer of funds. To avoid
credit risk, banks require settlement with a risk-free asset. The ultimate risk-free asset for
transfers of funds is fiat money itself because a unit of fiat money defines what asset a bank
deposit is a claim to.
Commercial banks hold fiat money as vault cash or reserves. Reserves are deposits at
central bank, essentially money in banks’ own accounts at the central bank. These reserves
are transferrable between banks. Moreover, the central bank is willing to exchange reserves
and cash one-for-one if requested by commercial banks.
232
EC2065 Macroeconomics | Chapter 7: Banking and finance
Although commercial banks hold fiat money as reserves (or vault cash), the amount is
normally smaller than the deposits households and firms hold with the commercial banks.
This is what is known as a fractional-reserve banking system, in that bank reserves are less
than 100 per cent of deposits. Not holding reserves equal to deposits is in the interests of
commercial banks because reserves typically offer an inferior return to other assets, for
example, loans made by the commercial banks or other financial assets.
Nonetheless, it is prudent for commercial banks to hold some reserves to cover transfers or
withdrawals of deposits. There may also be regulatory reserve requirements that impose
legal minimum levels of reserve holdings relative to deposits. Even where such reserve
requirements exist, banks usually hold some amount of excess reserves above the
minimum.
With the presence of deposits, cash and reserves, there are now several distinct measures
of the supply of money. The first is quantity of money usable by households or firms for
payments, referred to as the broad money supply.
Broad money supply = Bank deposits + Cash
This corresponds to M1 measure of money supply in the USA. A narrower notion of money
considers only the supply of fiat money, the narrow money supply.
Narrow money supply = Reserves + Cash
This measure is also known as the ‘monetary base’, and it corresponds to what is called M0
in some countries such as the UK. You may also encounter the terms ‘inside money’ and
‘outside money’. Inside money is deposits, which are created by private sector and ‘outside
money’ is fiat money, which is created by governments.
If transfers of funds deposited at commercial banks are treated as equivalent to fiat money
then households and firms can use either deposits or cash to make payments. One
difference between cash and deposits is that no interest is received by holding cash, while
deposits may pay interest at rate π‘–π‘–π‘šπ‘š , where π‘–π‘–π‘šπ‘š refers to the interest rate paid on ‘money’ as
distinct from bonds. Assuming no difference in convenience between cash and deposits
when making payments, there would be no demand for cash if π‘–π‘–π‘šπ‘š > 0. While cash remains
in use for payments to varying degrees around the world, making this assumption allows us
to focus on the demand for bank deposits, which is what is new in this chapter. The demand
for cash has already been studied in Section 6.4.
If the interest rate on bonds (or savings deposits that cannot be transferred to make
payments) is 𝑖𝑖 then interest 𝑖𝑖 − π‘–π‘–π‘šπ‘š is forgone when wealth is held as money in the form of
demand deposits. A theory of the demand for money can be developed along the same lines
as Section 6.4, with the interest-rate spread 𝑖𝑖 − π‘–π‘–π‘šπ‘š replacing the opportunity cost 𝑖𝑖 of
holding cash. The demand for deposits by households and firms is given by the equation:
𝑀𝑀𝑑𝑑
= 𝐿𝐿(π‘Œπ‘Œ, 𝑖𝑖 − π‘–π‘–π‘šπ‘š )
𝑃𝑃
233
EC2065 Macroeconomics | Chapter 7: Banking and finance
This equation specifies a demand for real money balances 𝑀𝑀/𝑃𝑃 that depends positively on
real income π‘Œπ‘Œ and negatively on 𝑖𝑖 − π‘–π‘–π‘šπ‘š . The downward-sloping demand curve is depicted in
Figure 7.1 for given π‘Œπ‘Œ, where the opportunity cost 𝑖𝑖 − π‘–π‘–π‘šπ‘š is the variable on the vertical axis.
Figure 7.1: The demand for deposits
7.2 The tools of monetary policy
In this chapter we will study in more detail the tools that a central bank can use to
implement monetary policy and affect the supply of money, including indirectly the supply
of deposits created by commercial banks. Our earlier analysis in Section 6.6 supposed that
all money was fiat money and the supply of this was controlled by open-market operations.
However, open-market operations are not the only tool of monetary policy.
In general, we can identify three types of monetary policy instrument:
1. Open-market operations
2. Standing facilities
3. Reserve requirements.
7.2.1 Open-market operations
Open-market operations are purchases or sales of financial assets by the central bank. These
affect the supply of fiat money because the central bank makes and receives payments in
fiat money. Let us simplify our analysis by ignoring cash, so the supply of fiat money, the
narrow money supply or monetary base, is just supply of reserves, denoted by 𝑅𝑅.
Central-bank asset purchases raise 𝑅𝑅 as payment is added to the reserve accounts of the
commercial banks from which the financial assets are purchased. Central-bank asset sales
lower 𝑅𝑅 as commercial banks’ reserve balances are reduced when they make payments to
the central bank.
In principle, open-market operations can be conducted in a wide range of financial markets.
These could be outright purchases of long-term government bonds paid for with newly
created reserves. If the central bank continues to hold these bonds then the effect on the
234
EC2065 Macroeconomics | Chapter 7: Banking and finance
supply of reserves persists until the bonds mature. However, the central bank could sell the
bonds before maturity to reverse the increase in 𝑅𝑅. An alternative to outright purchases or
sales of assets is repo or reverse-repo operations. A ‘repo’ is a sale-and-repurchase
agreement where an asset is sold but also agreed to be bought back at a prearranged price
(the percentage increase in the repurchase price relative to the sale price being the interest
rate on the repo). Open-market operations with repos change the supply of reserves 𝑅𝑅 only
temporarily.
It is important to note that, while individual commercial banks can change the quantity of
reserves they hold by buying or selling financial assets, or lending or borrowing, the central
bank is the monopoly supplier of reserves 𝑅𝑅. Transfers of reserves or loans of reserves
among commercial banks do not change the aggregate supply of reserves.
7.2.2 Standing facilities
A feature of open-market operations is that they can be conducted at the discretion of the
central bank, meaning the central bank can choose the size of the open-market operations it
carries out. An alternative monetary policy tool where the extent of its use depends on the
actions of commercial banks is known as a standing facility. An example of a standing facility
is the ‘discount window’, an arrangement whereby the central bank agrees to lend reserves
to commercial banks at a known interest rate against collateral. The interest rate charged
on such discount loans is known as the central bank’s discount rate.
While the extent to which standing facilities are used depends on the behaviour of
commercial banks, the central bank can vary the terms on which the facilities are available,
for example, by changing the discount rate. For the discount window, or ‘borrowing facility’
as it is described by some central banks, the central bank stands ready to lend reserves to
commercial banks against appropriate collateral at discount rate 𝑖𝑖𝑏𝑏 , which can be varied by
the central bank. Note that 𝑖𝑖𝑏𝑏 is not a market-determined interest rate; it is under the direct
administrative control of the central bank.
Another standing facility is the payment of interest on commercial banks’ reserve balances,
sometimes referred to as a ‘deposit facility’. The central bank can choose to vary the
interest rate π‘–π‘–π‘Ÿπ‘Ÿ paid on reserve balances.
7.2.3 Reserve requirements
The final monetary policy instrument we will consider is the imposition of reserve
requirements on commercial banks, which have a direct effect on the demand for fiat
money by banks. Suppose that a commercial bank with deposits 𝑀𝑀 must hold at least an
amount π‘žπ‘žπ‘žπ‘ž in reserves, where π‘žπ‘ž is the required reserve ratio. The ratio π‘žπ‘ž may be 0 if no
reserve requirements are present, or close to zero where reserve requirements are small.
The case π‘žπ‘ž = 1 represents a requirement to hold 100 per cent of demand deposits as
reserves, which while far from the reality of bank regulation today, has been suggested by
some as an alternative to the fractional-reserve banking system.
If a commercial bank fails to meet its reserve requirement then it is charged interest at a
penalty rate on the size of any shortfall, or it is obliged to borrow from the central bank’s
235
EC2065 Macroeconomics | Chapter 7: Banking and finance
discount window to make good the shortfall. Here, we suppose the penalty rate is the same
as the discount rate 𝑖𝑖𝑏𝑏 . In the presence of reserve requirements, the central bank can
choose to pay interest at a different rate π‘–π‘–π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ on required reserves, with π‘–π‘–π‘Ÿπ‘Ÿ being the interest
rate on reserves held in excess of reserve requirements.
As well as mandating reserve requirements, central banks could vary the required reserve
ratio π‘žπ‘ž but this is not often done in advanced economies.
7.3 The interbank market
Reserves are used by commercial banks to settle transfers of funds between them. Banks
can manage their holdings of reserves by borrowing or lending in the interbank markets.
Commercial banks can borrow or lend reserves among themselves at interest rate 𝑖𝑖. We will
treat this interest rate as being the same as the interest rate on nominal bonds. However,
note that interbank lending is typically of a very short maturity, for example, overnight.
Loans are often not secured by collateral, although the repo market allows for secured
lending of reserves. In practice, we will ignore these subtleties and suppose that lending in
such markets is equivalent to holding short-term bonds. The interbank and repo markets are
often referred to as the ‘money markets’ but, more precisely, they are markets for
borrowing money, specifically reserves.
7.3.1 The demand for reserves
What amount of reserves should a commercial bank hold and, hence, how much should it
borrow or lend in the interbank market to obtain its desired amount of reserves? The bank’s
depositors will make payments to accounts at other banks, which need to be settled by
transferring reserves. On the other hand, the bank will receive some reserves due to
payments from other banks going into its own depositors’ accounts. The required net
transfer of reserves resulting from the payments system is not completely predictable.
If a bank is left with more reserves relative to reserve requirements at a point when it
cannot go back to the interbank market to manage its reserve holdings then it keeps excess
reserves in its account at the central bank and receives interest π‘–π‘–π‘Ÿπ‘Ÿ . If it has insufficient
reserves relative to reserve requirements and borrowing from other banks is not feasible
then it must borrow the shortfall from the central bank at penalty rate 𝑖𝑖𝑏𝑏 . By holding more
reserves by borrowing from other banks when the interbank markets are open, the first of
these outcomes becomes more likely and the second less likely.
Suppose a bank borrows an additional unit of reserves. The cost of this borrowing is
interbank interest rate 𝑖𝑖. The benefit of holding more reserves depends on whether the
bank ends up with excess reserves or a shortfall of reserves. With excess reserves, the
benefit is the extra interest π‘–π‘–π‘Ÿπ‘Ÿ paid on its reserve balance. With a shortfall of reserves, the
benefit of an extra unit of reserves is saving the borrowing cost 𝑖𝑖𝑏𝑏 because the shortfall is
smaller. The overall expected benefit of is a weighted average of π‘–π‘–π‘Ÿπ‘Ÿ and 𝑖𝑖𝑏𝑏 depending on the
probabilities of the two events. When reserves are initially low, the probability of a shortfall
is high, which means more weight is put on the saving 𝑖𝑖𝑏𝑏 when calculating the expected
benefit of borrowing more in the interbank market.
236
EC2065 Macroeconomics | Chapter 7: Banking and finance
It is optimal for commercial banks to borrow reserves up to the point where the cost 𝑖𝑖
equals expected benefit. This implies the demand to borrow reserves decreases with the
cost 𝑖𝑖. With the interbank interest rate 𝑖𝑖 on the vertical axis and the quantity of reserve
holdings on the horizontal axis, there is a downward-sloping reserve demand curve as
shown in Figure 7.2.
Figure 7.2: The demand for reserves
The demand curve is bounded in a range of interbank interest rates between π‘–π‘–π‘Ÿπ‘Ÿ and 𝑖𝑖𝑏𝑏 . This
is because there would be an unlimited demand to borrow reserves if the cost were below
π‘–π‘–π‘Ÿπ‘Ÿ , with banks able to profit from arbitraging the difference between 𝑖𝑖 and π‘–π‘–π‘Ÿπ‘Ÿ . There would
be no demand to borrow reserves if the cost were above 𝑖𝑖𝑏𝑏 because commercial banks can
always borrow from central bank instead at interest rate 𝑖𝑖𝑏𝑏 . Since the height of the demand
curve represents the expected benefit of borrowing a unit of reserves, and the probabilities
of having an excess or shortfall of reserves depend on reserve holdings relative to deposits,
the appropriate horizontal-axis variable is the quantity of reserves 𝑅𝑅 relative to amount of
deposits 𝑀𝑀, the ratio 𝑅𝑅⁄𝑀𝑀.
The reserve demand curve shifts vertically if there are changes to the central bank’s
standing-facility interest rates π‘–π‘–π‘Ÿπ‘Ÿ or 𝑖𝑖𝑏𝑏 . Higher reserve requirements π‘žπ‘ž shift the demand
curve to the right. Note that the interest rate on required reserves π‘–π‘–π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ does not affect the
reserve demand curve here because the amount of interest paid on required reserves does
not depend on banks’ decisions that affect the extent of their holdings of excess reserves.
7.3.2 Equilibrium in the interbank market
The reserve demand curve can be combined with a reserve supply curve to understand the
determinants of the interbank interest rate and how it is affected by the various tools of
monetary policy.
The total supply of reserves available to commercial banks as a whole is determined by the
central bank. This means there is an inelastic supply of reserves 𝑅𝑅 and the position of the
reserve supply curve is affected by open-market operations. Central-bank asset purchases
237
EC2065 Macroeconomics | Chapter 7: Banking and finance
shift the reserve supply curve to the right and asset sales shift the supply curve to the left.
Taking deposits 𝑀𝑀 as given for now, the supply of 𝑅𝑅⁄𝑀𝑀 is also inelastic.
Figure 7.3 represents the interbank market equilibrium by putting together the demand and
supply curves for reserves. The equilibrium interbank interest rate 𝑖𝑖 ∗ is at the intersection of
the demand and supply curves 𝑅𝑅𝑑𝑑 and 𝑅𝑅 𝑠𝑠 .
Figure 7.3: Equilibrium in the interbank market
238
EC2065 Macroeconomics | Chapter 7: Banking and finance
Box 7.1: The ‘channel’ system of monetary policy
The traditional approach to implementing monetary policy is for the central bank to use
open-market operations. These purchases or sales of assets change the supply of
reserves, which affects equilibrium interest rates in money markets. An alternative
system works by varying the terms of the central bank’s standing facilities instead.
This system depends on there being a pair of standing facilities:
•
•
Interest π‘–π‘–π‘Ÿπ‘Ÿ paid on excess reserves held by commercial banks
Interest 𝑖𝑖𝑏𝑏 charged on loans of reserves to commercial banks.
It is known as the ‘channel’ or ‘corridor’ system because the interbank interest rate lies in
a channel between π‘–π‘–π‘Ÿπ‘Ÿ and 𝑖𝑖𝑏𝑏 as shown in Figure 7.3. The system is usually a target for the
interbank interest rate 𝑖𝑖 ∗ that lies at the centre of the channel between π‘–π‘–π‘Ÿπ‘Ÿ and 𝑖𝑖𝑏𝑏 .
Suppose the central bank wants to raise the interest rate 𝑖𝑖 using the channel system, for
example, increase 𝑖𝑖 by 25 basis points (0.25 percentage points). To do this, the central
bank simply raises its standing-facility interest rates π‘–π‘–π‘Ÿπ‘Ÿ and 𝑖𝑖𝑏𝑏 in parallel by the amount it
wants to increase 𝑖𝑖. In Figure 7.4, π‘–π‘–π‘Ÿπ‘Ÿ and 𝑖𝑖𝑏𝑏 are raised by 25 basis points, and this leads to
a parallel upwards shift of the reserve demand curve 𝑅𝑅𝑑𝑑 by exactly the size of the
increase in π‘–π‘–π‘Ÿπ‘Ÿ and 𝑖𝑖𝑏𝑏 . Intuitively, 𝑅𝑅𝑑𝑑 depends on 𝑖𝑖 relative to π‘–π‘–π‘Ÿπ‘Ÿ and 𝑖𝑖 relative to 𝑖𝑖𝑏𝑏 . With
no change in 𝑅𝑅 𝑠𝑠 because no open-market operation is conducted, the equilibrium 𝑖𝑖 ∗ rises
by same amount as π‘–π‘–π‘Ÿπ‘Ÿ and 𝑖𝑖𝑏𝑏 do.
We have seen that varying the terms of a pair of standing facilities in parallel allows very
precise adjustments of interest rates in the channel system. It is not necessary to know
anything about the exact position or shape of the reserve demand curve 𝑅𝑅𝑑𝑑 to achieve
this. Furthermore, while there is no guarantee that the equilibrium interest rate is
necessarily in the middle of the channel, a narrow corridor between π‘–π‘–π‘Ÿπ‘Ÿ and 𝑖𝑖𝑏𝑏 means 𝑖𝑖 ∗
cannot stray too far from its target. If the central bank wants to bring 𝑖𝑖 ∗ as close as
possible to the middle of the channel then it can use ‘fine-tuning’ open-market
operations to shift 𝑅𝑅 𝑠𝑠 so that it intersects 𝑅𝑅𝑑𝑑 in the middle of the channel.
Figure 7.4: Raising interest rates using the channel system
239
EC2065 Macroeconomics | Chapter 7: Banking and finance
In principle, the channel used in the channel system could be very narrow but that would
tend to reduce trade in the interbank market and result in frequent use by commercial
banks of central bank’s borrowing facility instead. To keep the interbank market active
and avoid potential credit risk exposure of the central bank, the channel needs to be
wide enough to encourage commercial banks to borrow or lend reserves among each
other in the interbank market.
Box 7.2: The ‘floor’ system of monetary policy
Another system of monetary policy implementation is known as the ‘floor’ system. Like
the channel system, this relies on there being a central-bank standing facility to pay
interest on excess reserves. But rather than use fine-tuning open-market operations to
keep the interbank interest rate near the centre of the channel, the floor system features
a permanently large supply of reserves.
Figure 7.5 is a representation of how the floor system operates. The reserve supply curve
𝑅𝑅 𝑠𝑠 has been shifted far to the right. The supply curve intersects the reserve demand
curve 𝑅𝑅𝑑𝑑 on the flat section of 𝑅𝑅𝑑𝑑 where 𝑖𝑖 ≈ π‘–π‘–π‘Ÿπ‘Ÿ , so the market interest rate is very close
to the interest rate π‘–π‘–π‘Ÿπ‘Ÿ paid on holdings of excess reserves. Note that the very large supply
of reserves reduces commercial banks’ incentive to trade in the interbank market
because the likelihood of being short of reserves and needing to borrow from the central
bank at rate 𝑖𝑖𝑏𝑏 is very low.
When the floor system is used, market interest rates are changed by varying the interest
rates π‘–π‘–π‘Ÿπ‘Ÿ and 𝑖𝑖𝑏𝑏 on the central bank’s standing facilities just as is done with the channel
system. Figure 7.6 shows the effects of increasing π‘–π‘–π‘Ÿπ‘Ÿ and 𝑖𝑖𝑏𝑏 by 25 basis points, which
causes a parallel upward shift of 𝑅𝑅𝑑𝑑 by the same amount and thus raises the market
interest rate 𝑖𝑖 by 25 basis points.
Figure 7.5: The floor system of monetary policy
Like the channel system, we see that the floor system allows interest rates to be
controlled very precisely. Moreover, it does so without the need for any ‘fine-tuning’
open-market operations to steer the market interest rate to the centre of a channel.
240
EC2065 Macroeconomics | Chapter 7: Banking and finance
Another advantage of the floor system is that it enables interest-rate decisions
implemented by changing π‘–π‘–π‘Ÿπ‘Ÿ to be separated from choices about the quantity of reserves
𝑅𝑅 𝑠𝑠 the central bank supplies – as long as 𝑅𝑅 𝑠𝑠 is above the amount where the reserve
demand curve flattens out at 𝑖𝑖 ≈ π‘–π‘–π‘Ÿπ‘Ÿ . This allows central banks to raise interest rates
without having to unwind their quantitative easing (QE) policies. Indeed, many central
banks found themselves using the floor system by default having chosen to expand the
supply of reserves massively with QE, but also to pay interest on reserves.
Figure 7.6: Raising interest rates using the floor system of monetary policy
As we will see in Box 7.3, the floor system has an efficiency advantage because it does
not depend on maintaining an artificial scarcity of reserves. Central banks face no
resource cost in increasing the supply of reserves but reserves have a social benefit
through their implications for commercial banks’ supply of deposits. In the floor system,
reserves are supplied up to the point where commercial banks are ‘satiated’, while in the
channel system, 𝑅𝑅 𝑠𝑠 needs to be limited to keep 𝑖𝑖 ∗ at the centre of channel. Although
decisions about the supply of reserves have no resource costs, they do have implications
for the profitability of central banks and, hence, for seigniorage revenues received by
governments.
7.4 The supply of bank deposits
The money supply 𝑀𝑀 𝑠𝑠 available to households and firms is the quantity of deposits created
by the banking system, ignoring cash here for simplicity. A bank deposit is created whenever
a commercial bank makes a loan and credits an account at the bank with the funds. By doing
this, a commercial bank earns interest at rate 𝑖𝑖 on the loan and pays interest at rate π‘–π‘–π‘šπ‘š on
the deposit. The commercial bank therefore profits from the interest margin, or interestrate spread 𝑖𝑖 − π‘–π‘–π‘šπ‘š . If this interest margin is positive, what stops a commercial bank from
making more loans and creating more deposits?
241
EC2065 Macroeconomics | Chapter 7: Banking and finance
7.4.1 Costs of maintaining adequate reserves
Even if a commercial bank could obtain a risk-free return 𝑖𝑖 by making a loan, creating a
larger quantity of deposits increases the risk of a shortfall of reserves when the household
or firm holding those deposits uses them to make payments. This means there are costs of
reserve management that need to be set against the interest margin 𝑖𝑖 − π‘–π‘–π‘šπ‘š .
A shortfall of reserves leaves a commercial bank having to borrow from the central bank at a
penalty interest rate 𝑖𝑖𝑏𝑏 , while excess reserves only earn a lower interest rate π‘–π‘–π‘Ÿπ‘Ÿ < 𝑖𝑖𝑏𝑏 .
Creating more deposits 𝑀𝑀 𝑠𝑠 relative to reserve holdings 𝑅𝑅 thus increases the expected cost
to the commercial bank of maintaining an adequate supply of reserves. Therefore, to be
willing to supply more deposits, a higher interest margin 𝑖𝑖 − π‘–π‘–π‘šπ‘š is needed. This implies the
ratio 𝑅𝑅/𝑀𝑀 𝑠𝑠 of reserves to deposits is negatively related to the interest-rate spread 𝑖𝑖 − π‘–π‘–π‘šπ‘š as
depicted in Figure 7.7.
Figure 7.7: Relationship between the reserve ratio and interest-rate spread
The real quantity of deposits 𝑀𝑀 𝑠𝑠 /𝑃𝑃 supplied by commercial banks can be broken down into:
𝑀𝑀 𝑠𝑠 𝑀𝑀 𝑠𝑠 𝑅𝑅
=
×
𝑃𝑃
𝑅𝑅
𝑃𝑃
The term 𝑅𝑅/𝑃𝑃 is the real supply of reserves, with the aggregate quantity of reserves 𝑅𝑅 held
by commercial banks being determined by the reserve supply 𝑅𝑅 𝑠𝑠 chosen by the central
bank. We have seen that the costs to commercial banks of maintaining adequate reserves
imply that the ratio 𝑀𝑀 𝑠𝑠 /𝑅𝑅 increases with 𝑖𝑖 − π‘–π‘–π‘šπ‘š . Hence, given 𝑅𝑅/𝑃𝑃, the real supply of
deposits 𝑀𝑀 𝑠𝑠 /𝑃𝑃 increases with the interest margin 𝑖𝑖 − π‘–π‘–π‘šπ‘š .
Reserve requirements imposed on commercial banks also limit deposit creation and these
impose additional costs when interest π‘–π‘–π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ paid on required reserves is below the market
interest rate 𝑖𝑖. Having π‘–π‘–π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ < 𝑖𝑖 means that a larger interest margin 𝑖𝑖 − π‘–π‘–π‘šπ‘š is required by
commercial banks when creating deposits, all else equal.
242
EC2065 Macroeconomics | Chapter 7: Banking and finance
7.4.2 Bank capital requirements
A further constraint on deposit creation comes from bank capital requirements. These
impose an upper limit on a commercial bank’s assets (for example, loans) relative to the
bank’s capital. The ‘capital’ of a commercial bank refers to the bank’s equity, which strictly
defined is funds contributed by shareholders plus undistributed bank profits. Bank capital
requirements are usually expressed as a minimum ratio of bank capital to assets and those
assets may be ‘risk weighted’, with safer assets given a lower weight.
When a commercial bank creates deposits by making loans, this adds both to the bank’s
assets and its liabilities, increasing the total size of its balance sheet. However, it does not
immediately have any impact on bank capital and thus deposit creation lowers the ratio of
capital to assets. Bank capital requirements can therefore impose limits on the quantity of
deposits created by commercial banks. Whether bank capital requirements are the binding
constraint on deposit creation or the need to maintain adequate reserves depends on the
regulatory rules in place and the system of monetary policy implementation.
The need for bank capital is due to bank assets being risky while deposits should be safe.
Capital can absorb losses up to a point without jeopardising banks’ ability to repay
depositors. As we will discuss in Section 7.12, bank capital requirements are imposed by
regulators to avoid bank failures but even in the absence of explicit rules, prudential
concerns ought to constrain banks’ own decisions about deposit creation.
7.5 Equilibrium in the banking market
The equilibrium money supply, the quantity of deposits created by the banking system, can
be found by combining the demand curve for deposits 𝑀𝑀𝑑𝑑 /𝑃𝑃 = 𝐿𝐿(π‘Œπ‘Œ, 𝑖𝑖 − π‘–π‘–π‘šπ‘š ) from Section
7.1 with the supply curve 𝑀𝑀 𝑠𝑠 /𝑃𝑃 = (𝑀𝑀 𝑠𝑠 ⁄𝑅𝑅) × (𝑅𝑅⁄𝑃𝑃) derived in Section 7.4. We have seen
that the supply of deposits 𝑀𝑀 𝑠𝑠 /𝑃𝑃 increases with the interest margin 𝑖𝑖 − π‘–π‘–π‘šπ‘š , all else equal.
This interest margin is also the opportunity cost to households and firms of holding money
in the form of deposits and deposit demand depends negatively on it. Therefore, given 𝑃𝑃
and π‘Œπ‘Œ, we can draw a downward-sloping deposit demand curve and an upward-sloping
deposit supply curve and find the equilibrium in the banking market. This equilibrium is
depicted in Figure 7.8.
243
EC2065 Macroeconomics | Chapter 7: Banking and finance
Figure 7.8: Equilibrium in the banking market
The banking market can be analysed in a similar way to the money market of Chapter 6, the
only difference being that the quantity of money (deposits here) is endogenous owing to the
upward-sloping supply curve. The demand curve behaves in the same way as in Chapter 6.
The supply curve shifts to the right if there is an increase in the real supply of reserves 𝑅𝑅/𝑃𝑃.
A higher interest-rate penalty 𝑖𝑖𝑏𝑏 − π‘–π‘–π‘Ÿπ‘Ÿ faced by banks that have a shortfall of reserves would
increase the costs of reserve management and lower 𝑀𝑀 𝑠𝑠 /𝑅𝑅, shifting the deposit supply
curve to the left. Similarly, increasing reserve requirements π‘žπ‘ž when the interest rate paid on
required reserves is low (such as if π‘–π‘–π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ = π‘–π‘–π‘Ÿπ‘Ÿ < 𝑖𝑖𝑏𝑏 ) shifts the deposit supply curve to the left.
An increase in binding bank capital requirements would also reduce the supply of deposits.
Box 7.3: Should central banks pay interest on reserves?
The supply of fiat money comprises cash and reserves held by commercial banks in
accounts at the central bank. By its nature, it is impractical to pay interest on physical
cash in the form of notes and coins. Traditionally, central banks did not pay interest on
reserves even though there is no technical barrier to doing so. However, in the last two
decades, more central banks have begun to pay interest on reserves, including the US
Federal Reserve.
Reserves held by commercial banks can be broken down into required reserves, i.e.
those held to satisfy regulatory reserve requirements (if any), and excess reserves,
those held beyond the minimum requirements. In principle, central banks can pay
interest at different rates on these two types of reserves, with π‘–π‘–π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ being the interest
rate on required reserves and π‘–π‘–π‘Ÿπ‘Ÿ the interest rate on excess reserves. What reasons are
there why central banks would want to have π‘–π‘–π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ and/or π‘–π‘–π‘Ÿπ‘Ÿ be greater than zero?
Monetary policy implementation
A first reason to pay a generally positive interest rate π‘–π‘–π‘Ÿπ‘Ÿ on excess reserves is if the
central bank wants to use either the ‘channel/corridor’ or ‘floor’ systems of monetary
policy implementation. We have seen in Box 7.1 and Box 7.2 that the market interest
rate 𝑖𝑖 is controlled by varying π‘–π‘–π‘Ÿπ‘Ÿ in those systems, so it is not possible to use them if
π‘–π‘–π‘Ÿπ‘Ÿ = 0.
244
EC2065 Macroeconomics | Chapter 7: Banking and finance
If the central bank chooses π‘–π‘–π‘Ÿπ‘Ÿ = 0 then this leaves it with the ‘traditional’ approach to
implementing monetary policy. In that traditional system, open-market operations are used
to shift the reserve supply curve 𝑅𝑅 𝑠𝑠 . This changes the point of intersection with the reserve
demand curve 𝑅𝑅𝑑𝑑 in the interbank market and hence, affects the equilibrium interest rate
𝑖𝑖 ∗ as seen in Figure 7.9.
Figure 7.9: The traditional system of monetary policy implementation
However, this traditional system is subject to greater practical difficulties to the extent that
the reserve demand curve 𝑅𝑅𝑑𝑑 has an uncertain shape and position, with shocks to banks’
demand for reserves shifting 𝑅𝑅𝑑𝑑 . This means it is not certain what size of open-market
operation is needed to achieve a given change in 𝑖𝑖 ∗ . Moreover, unlike the channel system
that bounds 𝑖𝑖 ∗ in a tight range between π‘–π‘–π‘Ÿπ‘Ÿ and 𝑖𝑖𝑏𝑏 , only the discount-window standing
facility acts to cap 𝑖𝑖 ∗ at the interest rate 𝑖𝑖𝑏𝑏 on loans of reserves from the central bank. And
often even that was not effective because of a ‘stigma’ attached to commercial banks that
were seen to borrow from the central bank through its discount window. The traditional
system can be seen as essentially a very wide channel between π‘–π‘–π‘Ÿπ‘Ÿ = 0 and 𝑖𝑖𝑏𝑏 + stigma.
The cost of banking and the efficiency of money as a medium of exchange
Where money is physical cash, we saw in Section 6.10 that the Friedman rule 𝑖𝑖 = 0 yields
efficiency in the use of money as medium of exchange. Note that real money demand
𝑀𝑀𝑑𝑑 /𝑃𝑃 = 𝐿𝐿(π‘Œπ‘Œ, 𝑖𝑖) is maximised at 𝑖𝑖 = 0 because this means the opportunity cost of holding
cash is zero, which avoids taxing economic activity that depends on using money.
Where money is deposits at commercial banks that pay interest rate π‘–π‘–π‘šπ‘š , the real demand
for money is 𝑀𝑀𝑑𝑑 /𝑃𝑃 = 𝐿𝐿(π‘Œπ‘Œ, 𝑖𝑖 − π‘–π‘–π‘šπ‘š ), with 𝑖𝑖 − π‘–π‘–π‘šπ‘š now being the opportunity cost of holding
money. Just as there are no resource costs of central banks creating fiat money, similarly,
there are no resource costs of commercial banks creating deposits. Hence, when money is
bank deposits, the equivalent of the Friedman rule that maximises the use of money and
avoids imposing costs on economic activity that depends on money is to have π‘–π‘–π‘šπ‘š = 𝑖𝑖.
To achieve π‘–π‘–π‘šπ‘š = 𝑖𝑖, the real supply of deposits 𝑀𝑀 𝑠𝑠 /𝑃𝑃 must be sufficiently large. As discussed
in Section 7.1, commercial banks need to hold reserves for the payments system to
operate, even when there are no regulatory reserve requirements. Banks’ deposit creation
245
EC2065 Macroeconomics | Chapter 7: Banking and finance
decision studied in Section 7.4 then explains why deposit supply 𝑀𝑀 𝑠𝑠 is discouraged by a low
interest rate π‘–π‘–π‘Ÿπ‘Ÿ paid on reserves (and required reserves π‘–π‘–π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ ) relative to 𝑖𝑖 and also by a high
interest rate 𝑖𝑖𝑏𝑏 for borrowing reserves if there is a shortfall.
If interest π‘–π‘–π‘Ÿπ‘Ÿ is paid on reserves (and π‘–π‘–π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ on required reserves where relevant) then
commercial banks increase their supply of deposits for a given interest margin 𝑖𝑖 − π‘–π‘–π‘šπ‘š . This
shifts the 𝑀𝑀 𝑠𝑠 curve to right and lowers the spread 𝑖𝑖 − π‘–π‘–π‘šπ‘š using the banking-market
equilibrium from Section 7.5. Therefore, by paying interest on reserves, the gap between
π‘–π‘–π‘šπ‘š and 𝑖𝑖 becomes smaller and the economy moves closer to efficiency. An expansion of the
real supply of reserves 𝑅𝑅 𝑠𝑠 /𝑃𝑃 also reduces the spread 𝑖𝑖 − π‘–π‘–π‘šπ‘š because banks paying the
borrowing cost 𝑖𝑖𝑏𝑏 becomes less likely. Paying interest on reserves – and using a floor
system – thus supports efficiency in a similar way to the Friedman rule.
It would still be possible to achieve efficiency by directly following the original Friedman
rule 𝑖𝑖 = 0. However, achieving π‘–π‘–π‘šπ‘š = 𝑖𝑖 has the advantage that the absolute level of interest
rates 𝑖𝑖 is not restricted, and hence, the Fisher equation 𝑖𝑖 = π‘Ÿπ‘Ÿ + πœ‹πœ‹ does not imply that
monetary policy must generate deflation (πœ‹πœ‹ < 0) for money to work best as a medium of
exchange.
Seigniorage revenue and the profitability of the central bank
In a monetary system with physical cash, real seigniorage revenue is 𝑖𝑖𝑖𝑖/𝑃𝑃 as explained in
Section 6.8. Where fiat money is used as reserves 𝑅𝑅 but not as cash by households and
firms, real seigniorage revenues would be 𝑖𝑖𝑖𝑖/𝑃𝑃 if no interest is paid on reserves. However,
if interest π‘–π‘–π‘Ÿπ‘Ÿ is paid on all reserves (assuming π‘–π‘–π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ = π‘–π‘–π‘Ÿπ‘Ÿ as a simplification) then seigniorage
revenues would be (𝑖𝑖 − π‘–π‘–π‘Ÿπ‘Ÿ )𝑅𝑅/𝑃𝑃 instead. This depends on the spread between the bond
interest rate 𝑖𝑖 and the interest rate π‘–π‘–π‘Ÿπ‘Ÿ paid on reserves.
We see that seigniorage revenue is reduced by paying interest on reserves (π‘–π‘–π‘Ÿπ‘Ÿ > 0). With a
floor system (𝑖𝑖 ≈ π‘–π‘–π‘Ÿπ‘Ÿ ), seigniorage would be reduced to zero. Hence, one drawback of paying
interest on reserves is the loss of seigniorage that requires the government to find
alternative sources of tax revenue.
7.6 Bond maturity and the yield curve
When thinking about the role of interest rates and macroeconomics, so far we have
distinguished between nominal and real (inflation-adjusted) interest rates. We have also
made a distinction between interest rates offered to borrowers and savers when there is a
problem of asymmetric information. In addition to these considerations, interest rates also
differ by maturity. Maturity refers to the length of the period of borrowing or lending.
The relationship between interest rates and maturity is known as the term structure of
interest rates. Graphically, this is shown in the yield curve, a plot of interest rates against
maturity for different bonds. Note that the terms interest rate and yield are synonymous.
Figure 7.10 shows the yield curve for US government bonds on 1 September 2021 as an
example. Interest rates for bonds with a short maturity of one year or less were close to
246
EC2065 Macroeconomics | Chapter 7: Banking and finance
zero, while as maturity increases beyond two years, interest rates are higher, reaching
almost 2 per cent for 30-year maturity bonds.
Figure 7.10: A yield curve for US government bonds
What information is conveyed by the yield curve and the term structure of interest rates?
What difference does it make to the economy whether the yield curve is upward sloping,
downward sloping or flat?
The basics of the term structure of interest rates can be illustrated by considering just two
bonds. First, a bond that pays one unit of money next period. Second, a bond that pays one
unit of money two time periods in future. These simple bonds with one payment at maturity
(no coupon payments before maturity) are known as ‘discount’ bonds. The bonds have
maturities of one and two periods respectively. Another feature of these bonds is that they
are nominal bonds, i.e. ones that specify payments in units of money that are not indexed to
inflation.
The yield to maturity (or just yield) on a bond is defined as the discount rate that makes the
present value of the payments promised by the bond equal to the market price at which the
bond currently trades. Suppose the prices in current units of money of the one- and twoperiod bonds are 𝑉𝑉1 and 𝑉𝑉2 respectively, their respective yields 𝑖𝑖 and 𝐼𝐼 are defined by the
price-yield relationships:
𝑉𝑉1 =
1
1 + 𝑖𝑖
and
𝑉𝑉2 =
1
(1 + 𝐼𝐼 )2
By definition, there is necessarily an inverse relationship between yields and bond prices. A
high bond yield 𝑖𝑖 means a one-period bond price 𝑉𝑉1 and similarly for the two-period bond.
Suppose a saver is considering which bond to hold. What returns are obtained between the
current and future periods? Here, we look at nominal returns. For real returns, we would
also need to adjust for inflation as discussed in Chapter 6. If a saver uses a unit of money to
buy one-period bonds at price 𝑉𝑉1 then an amount 1/𝑉𝑉1 is purchased. Each bond pays off a
unit of money in the next period, so the return is (1⁄𝑉𝑉1 ) − 1. Using 𝑉𝑉1 = 1/(1 + 𝑖𝑖), the
247
EC2065 Macroeconomics | Chapter 7: Banking and finance
return is equal to the yield 𝑖𝑖. In general, returns and yields are not the same but the return
on a discount bond that is held to maturity is equal to the yield when purchased, as seen in
this example.
If instead the saver buys two-period bonds, a unit of money can purchase a quantity 1/𝑉𝑉2 of
such bonds. By the next period, the bond has not yet matured, so the only return that can
be realised immediately comes from selling it. With one period remaining until maturity,
what was originally a two-period bond is now equivalent to a one-period bond, so its price is
now 𝑉𝑉1 ′, which denotes the price of a one-period bond in the future period. Using 𝑉𝑉1 ′ =
1/(1 + 𝑖𝑖′) and 𝑉𝑉2 = 1/(1 + 𝐼𝐼 )2 , the return on holding the two-period bond for just one
period is:
Return =
𝑉𝑉1′ − 𝑉𝑉2 (1 + 𝐼𝐼 )2
1 + 𝐼𝐼
=
− 1 = 𝐼𝐼 + (𝐼𝐼 − 𝑖𝑖 ′ ) οΏ½
οΏ½
′
𝑉𝑉2
1 + 𝑖𝑖
1 + 𝑖𝑖 ′
Observe that this return is not equal to the yield 𝐼𝐼, except in the special case 𝑖𝑖 ′ = 𝐼𝐼 because
this bond has not reached maturity over the period the return is calculated.
In choosing what combination of bonds to hold, savers compare the returns they offer.
These returns depend on the yields 𝑖𝑖 and 𝐼𝐼 and the return on the two-period bond also
depends on the future yield 𝑖𝑖′, which is not known with certainty now. An important point
here is that the two-period bond’s return, if it is held for only one period, is in general riskier
than that of the one-period bond. All else equal, savers are assumed to prefer higher
expected returns and lower risk.
7.7 The expectations theory of long-term interest rates
One theory of the term structure of interest rates assumes savers care only about expected
returns on assets. In this case, savers are said to be ‘risk neutral’. Given these preferences,
savers rationally choose to hold the bond with the highest expected return. However, since
all bonds that have been issued must be willingly held by someone in equilibrium, bond
prices, or equivalently, bond yields, must adjust so that all bonds offer the same expected
return. This logic implies a connection between short-term and long-term interest rates that
is called the expectations theory of interest rates.
We will illustrate the expectations theory of interest rates with reference to the one-period
and two-period bonds introduced earlier. We refer to those bonds in what follows as ‘shortterm’ and ‘long-term’ bonds, respectively. The two bonds have yields 𝑖𝑖 and 𝐼𝐼 and, for the
one-period bond, its actual and expected return is simply equal to its yield 𝑖𝑖. The return on
the two-period bond is 𝐼𝐼 + (𝐼𝐼 − 𝑖𝑖 ′ )(1 + 𝐼𝐼)/(1 + 𝑖𝑖 ′ ), which depends on the future oneperiod bond yield denoted by 𝑖𝑖 ′ . As long as 𝑖𝑖′ and 𝐼𝐼 are not too large, we can say that
(𝐼𝐼 − 𝑖𝑖 ′ )(1 + 𝐼𝐼)/(1 + 𝑖𝑖 ′ ) ≈ 𝐼𝐼 − 𝑖𝑖′, which means the approximate return on two-period bond
is 2𝐼𝐼 − 𝑖𝑖′. The expected return is 2𝐼𝐼 − 𝑖𝑖 ′𝑒𝑒 , where 𝑖𝑖 ′𝑒𝑒 denotes the expected value of the
future one-period yield 𝑖𝑖 ′ .
For risk-neutral savers to be willing to hold either bond, the expected returns must be the
same:
248
EC2065 Macroeconomics | Chapter 7: Banking and finance
𝑖𝑖 = 2𝐼𝐼 − 𝑖𝑖 ′𝑒𝑒
Solving this equation for the long-term interest rate 𝐼𝐼 shows that
𝐼𝐼 =
𝑖𝑖 + 𝑖𝑖 ′𝑒𝑒
2
The theory therefore predicts that the long-term interest rate 𝐼𝐼 is equal to an average of the
current and expected future short-term interest rates 𝑖𝑖 and 𝑖𝑖 ′𝑒𝑒 until the maturity of the
long-term bond. This idea applies to bonds with longer than a two-period maturity, where
the long-term yield is an average on expected short-term interest rates over a longer
horizon.
Box 7.4: Forecasting from the shape of the yield curve
If the expectations theory of long-term interest rates is correct, the yield curve can be used
to forecast the future path of short-term interest rates, exploiting the information and
analysis of the participants in the bond market. We will see how to derive market
expectations of interest rates from the yield curve and why an upward-sloping yield curve
implies short-term interest rates expected to rise, and a downward-sloping yield curve
implies short-term interest rates expected to fall. This relationship between the gradient of
the yield curve and the expected direction of the future path of interest rates means that
the yield-curve gradient might be used as a leading indicator of the business cycle.
We consider here just two points on the yield curve corresponding to a short-term (oneperiod) bond with yield 𝑖𝑖 and a long-term (two-period) bond with yield 𝐼𝐼. The expectations
theory of interest rates from Section 7.7 implies that 𝐼𝐼 = (𝑖𝑖 + 𝑖𝑖 ′𝑒𝑒 )/2, where 𝑖𝑖 ′𝑒𝑒 is the
expected future short-term interest rate 𝑖𝑖′. Subtracting 𝑖𝑖 from both sides implies:
𝑖𝑖 ′𝑒𝑒 − 𝑖𝑖
= 𝐼𝐼 − 𝑖𝑖
2
This implies a positive relationship between the gradient of the yield curve as represented
by the term 𝐼𝐼 − 𝑖𝑖 and the expected change in the future short-term interest rate 𝑖𝑖 ′𝑒𝑒 − 𝑖𝑖. An
explicit formula for the expected future interest rate is 𝑖𝑖 ′𝑒𝑒 = 2𝐼𝐼 − 𝑖𝑖, which can be
calculated using the yields 𝑖𝑖 and 𝐼𝐼 currently observed.
Figure 7.11 shows the relationship between spreads of 10-year over three-month US
government bonds and the subsequent changes in three-month Treasury bill yields
averaged over the following ten years. There is indeed a positive relationship, although
weaker than the expectations theory would suggest. Note that the scatterplot makes the
relationship look better than it really is because the vertical coordinates of the data points
are calculated using overlapping 10-year spells, so each point is not an independent
observation.
249
EC2065 Macroeconomics | Chapter 7: Banking and finance
Figure 7.11: Predictions from US yield curve (1934–2011)
7.8 Risk and portfolio choice
How should savers choose between different assets if they care about the riskiness of assets
as well as their expected returns? This section explores how an optimal portfolio trading off
risk and return can be selected.
Consider a saver choosing portfolio of two assets. One is a safe asset offering a risk-free real
return π‘Ÿπ‘Ÿπ‘“π‘“ . Another is a risky asset with an uncertain real return π‘Ÿπ‘Ÿ. We keep the analysis
simple by supposing there are two possible outcomes for the risky asset. In the ‘bad’
outcome, the return is π‘Ÿπ‘Ÿ1 over some period and in the ‘good’ outcome the return is π‘Ÿπ‘Ÿ2 ,
where π‘Ÿπ‘Ÿ1 < π‘Ÿπ‘Ÿ2 . The outcome π‘Ÿπ‘Ÿ1 has probability 𝑑𝑑, and π‘Ÿπ‘Ÿ2 has probability 1 − 𝑑𝑑, where 𝑑𝑑 is a
probability between zero and one. We will assume π‘Ÿπ‘Ÿ1 < π‘Ÿπ‘Ÿπ‘“π‘“ < π‘Ÿπ‘Ÿ2 , so the ‘good’ outcome for
the risky asset is better than the risk-free, while the ‘bad’ outcome is worse. Otherwise, one
asset is unambiguously better than the other and the choice of portfolio is uninteresting.
The saver chooses the fraction π‘₯π‘₯ of wealth to allocate to the safe asset. Suppose that wealth
is equal to 1 for illustration. This means that purchases of the safe and risky assets are π‘₯π‘₯ and
1 − π‘₯π‘₯ respectively. Assume that the investment proceeds will be consumed immediately
after the returns are received. Here, there is no choice of how much to save, or for how long
to save, only which assets to hold. Consequently, in the bad (1) and good (2) outcomes, the
levels of consumption 𝑐𝑐1 and 𝑐𝑐2 are:
𝑐𝑐1 = 1 + π‘Ÿπ‘Ÿπ‘“π‘“ π‘₯π‘₯ + π‘Ÿπ‘Ÿ1 (1 − π‘₯π‘₯ )
and
𝑐𝑐2 = 1 + π‘Ÿπ‘Ÿπ‘“π‘“ π‘₯π‘₯ + π‘Ÿπ‘Ÿ2 (1 − π‘₯π‘₯)
Individuals dislike uncertainty about whether they will get to consume 𝑐𝑐1 or 𝑐𝑐2 . We say they
are ‘risk averse’. More precisely, this means they prefer to get expected consumption 𝑐𝑐 𝑒𝑒 =
𝑑𝑑𝑐𝑐1 + (1 − 𝑑𝑑)𝑐𝑐2 for certain (𝑐𝑐 𝑒𝑒 in both the ‘good’ and ‘bad’ scenarios for the risky asset)
than receive 𝑐𝑐1 with probability 𝑑𝑑 and 𝑐𝑐2 with probability 1 − 𝑑𝑑. We can represent these
250
EC2065 Macroeconomics | Chapter 7: Banking and finance
preferences graphically using indifference curves in a similar way to the two-period
consumption choice model.
In Figure 7.12, the horizontal and vertical coordinates give the two outcomes for
consumption 𝑐𝑐1 and 𝑐𝑐2 . Geometrically, uncertainty increases as we move away from the 45π‘œπ‘œ
line, the set of point where 𝑐𝑐1 = 𝑐𝑐2 , representing cases where the individual faces no
uncertainty about the amount that will be consumed. In the diagram, expected
consumption 𝑐𝑐 𝑒𝑒 = 𝑑𝑑𝑐𝑐1 + (1 − 𝑑𝑑)𝑐𝑐2 remains constant along any straight line with gradient
−𝑑𝑑/(1 − 𝑑𝑑). Indifference curves representing risk averse preferences are convex to the
origin, lying above any tangent line drawn to them. On 45π‘œπ‘œ line, the tangent line to
indifference curves has gradient −𝑑𝑑/(1 − 𝑑𝑑) because this is a point where the individual
faces no risk.
Figure 7.12: Indifference curves over uncertain consumption outcomes
We would like to add to this diagram the equivalent of a ‘budget constraint’, allowing us to
use the familiar framework of constrained utility maximisation to solve the portfolio choice
problem. This can be done by eliminating π‘₯π‘₯ from the equations 𝑐𝑐1 = 1 + π‘Ÿπ‘Ÿ1 + π‘₯π‘₯(π‘Ÿπ‘Ÿπ‘“π‘“ − π‘Ÿπ‘Ÿ1 )
and 𝑐𝑐2 = 1 + π‘Ÿπ‘Ÿ2 − π‘₯π‘₯(π‘Ÿπ‘Ÿ2 − π‘Ÿπ‘Ÿπ‘“π‘“ ), showing that different portfolios lead to different
combinations of 𝑐𝑐1 and 𝑐𝑐2 with:
𝑐𝑐2 = 1 + π‘Ÿπ‘Ÿ2 − οΏ½
π‘Ÿπ‘Ÿ2 − π‘Ÿπ‘Ÿπ‘“π‘“
οΏ½ (𝑐𝑐1 − 1 − π‘Ÿπ‘Ÿ1 )
π‘Ÿπ‘Ÿπ‘“π‘“ − π‘Ÿπ‘Ÿ1
This relationship between 𝑐𝑐1 and 𝑐𝑐2 is plotted in Figure 7.13. Since π‘Ÿπ‘Ÿ1 < π‘Ÿπ‘Ÿπ‘“π‘“ < π‘Ÿπ‘Ÿ2 , the
equation is a downward-sloping straight line. It can be seen the line always passes through
the point (1 + π‘Ÿπ‘Ÿπ‘“π‘“ , 1 + π‘Ÿπ‘Ÿπ‘“π‘“ ), which corresponds to π‘₯π‘₯ = 1, a portfolio comprising only safe
assets. The gradient of the line is − (π‘Ÿπ‘Ÿ2 − π‘Ÿπ‘Ÿπ‘“π‘“ )⁄(π‘Ÿπ‘Ÿπ‘“π‘“ − π‘Ÿπ‘Ÿ1 ) and it also passes through the point
(1 + π‘Ÿπ‘Ÿ1 , 1 + π‘Ÿπ‘Ÿ2 ), which corresponds to π‘₯π‘₯ = 0, that is, holding none of the safe asset in the
portfolio. Unlike budget constraints with no credit-market imperfections, we do not extend
this straight line all the way to the horizontal and vertical axes as the points to the right of
(1 + π‘Ÿπ‘Ÿπ‘“π‘“ , 1 + π‘Ÿπ‘Ÿπ‘“π‘“ ) correspond to π‘₯π‘₯ greater than 1 and the points to the left of (1 + π‘Ÿπ‘Ÿ1 , 1 + π‘Ÿπ‘Ÿ2 )
251
EC2065 Macroeconomics | Chapter 7: Banking and finance
correspond to negative π‘₯π‘₯. Such portfolios would mean ‘negative’ or short positions in one of
the two assets, which we rule out.
The gradient of the constraint can be expressed in terms of the expected risky asset return
π‘Ÿπ‘Ÿ 𝑒𝑒 = π‘‘π‘‘π‘Ÿπ‘Ÿ1 + (1 − 𝑑𝑑)π‘Ÿπ‘Ÿ2 as follows:
π‘Ÿπ‘Ÿ2 − π‘Ÿπ‘Ÿπ‘“π‘“
π‘Ÿπ‘Ÿ 𝑒𝑒 − π‘Ÿπ‘Ÿπ‘“π‘“
𝑑𝑑
1
=
+
οΏ½
οΏ½
π‘Ÿπ‘Ÿπ‘“π‘“ − π‘Ÿπ‘Ÿ1 1 − 𝑑𝑑 1 − 𝑑𝑑 π‘Ÿπ‘Ÿπ‘“π‘“ − π‘Ÿπ‘Ÿ1
The term π‘Ÿπ‘Ÿ 𝑒𝑒 − π‘Ÿπ‘Ÿπ‘“π‘“ is the risk premium, the expected risky-asset return π‘Ÿπ‘Ÿ 𝑒𝑒 minus the risk-free
asset’s certain return π‘Ÿπ‘Ÿπ‘“π‘“ . We see that the gradient of the constraint depends on the risk
premium π‘Ÿπ‘Ÿ 𝑒𝑒 − π‘Ÿπ‘Ÿπ‘“π‘“ relative to the amount of risk taken in holding the risky asset. The term in
the denominator is π‘Ÿπ‘Ÿπ‘“π‘“ − π‘Ÿπ‘Ÿ1 , the size of the loss from holding the risky asset relative to the
safe return on the risk-free asset.
The optimal portfolio choice is found where an indifference curve is tangent to the
constraint, the straight line joining the ‘all risky asset’ portfolio to the ‘all risk-free asset’
portfolio. This is depicted in Figure 7.13. The fractions π‘₯π‘₯ and 1 − π‘₯π‘₯ allocated to the risk-free
and risky assets can be deduced from how far the ray to the origin from the optimal
portfolio lies between the rays to the origin from the extreme portfolios.
Figure 7.13: Optimal portfolio choice
We can see from the formula for the gradient of the constraint that a positive amount of
risky asset can be held (π‘₯π‘₯ < 1) only if π‘Ÿπ‘Ÿ 𝑒𝑒 > π‘Ÿπ‘Ÿπ‘“π‘“ . The risky asset must offer a higher expected
return π‘Ÿπ‘Ÿ 𝑒𝑒 than the certain return π‘Ÿπ‘Ÿπ‘“π‘“ on the safe asset to be attractive to savers who dislike
risk, creating a trade-off between risk and return. A higher expected return compensates for
taking greater risk.
What happens to the optimal portfolio if the asset returns were to change? If the risk-free
return π‘Ÿπ‘Ÿπ‘“π‘“ rises then the gradient of the constraint declines, all else being equal. If the price
of the risky asset rises, which would lower the returns 1 + π‘Ÿπ‘Ÿ1 , 1 + π‘Ÿπ‘Ÿ2 , and 1 + π‘Ÿπ‘Ÿ 𝑒𝑒
proportionately, then the gradient of the constraint also declines. A lower gradient of the
252
EC2065 Macroeconomics | Chapter 7: Banking and finance
constraint in these two cases gives rise to a substitution effect would unambiguously raise
the safe-asset portfolio share π‘₯π‘₯. However, the income effect on π‘₯π‘₯ is ambiguous. We usually
assume preferences where the substitution effect dominates, in which case portfolio shares
adjust in the same direction as assets’ expected returns, all else equal. An example where
the expected return on the risky asset declines is illustrated in Figure 7.14.
Figure 7.14: Lower expected return on risky asset
Box 7.5: The typical shape of the yield curve
Yield curves typically have a positive gradient, that is to say, long-maturity bonds have
higher yields (interest rates) than short-maturity bonds on average. For example, taking US
government bond yields over the period 1934–2021, three-month Treasury bills have an
average yield of 3.4%, while 10-year Treasury bonds have an average yield of 5 per cent.
What explains this phenomenon of yield curves being upward sloping on average?
We investigate the question using one- and two-period discount bonds as examples of
short- and long-maturity bonds. Yields on the short- and long-term bonds are 𝑖𝑖 and 𝐼𝐼, and
their average yields are denoted by πš€πš€Μ… and 𝐼𝐼 ,Μ… respectively. We will argue that investors’
attitude to risk provides an explanation for the observation 𝐼𝐼 Μ… > πš€πš€Μ… .
Suppose investors are considering holding bonds for just one period, so they have a
relatively short horizon. As explained in Section 7.6, the return on holding the short-term
(one-period) bond is 𝑖𝑖, which is the same as its yield 𝑖𝑖 because this bond is held to maturity.
Section 7.6 also shows the return on holding the long-term (two-period) bond for only one
period is 𝐼𝐼 + (𝐼𝐼 − 𝑖𝑖 ′ )(1 + 𝐼𝐼)/(1 + 𝑖𝑖 ′ ) ≈ 2𝐼𝐼 − 𝑖𝑖 ′ , where 𝑖𝑖 ′ is the yield on a one-period bond
in the future. These bonds are nominal bonds and both returns are calculated in nominal
terms. However, given 𝑖𝑖′, the real returns would be equally affected by inflation over the
first period.
253
EC2065 Macroeconomics | Chapter 7: Banking and finance
Observe that there is an important difference in the risk characteristics of the bond returns
over the same period. The nominal return on the one-period bond is known with certainty
when purchased at yield 𝑖𝑖, while the nominal return on the two-period bond (if held for
only one period) is uncertain because 𝑖𝑖′ is not known in advance. This makes the long-term
bond riskier than the short-term bond.
If investors were risk neutral and cared only about expected returns, the average returns
must be same on both bonds. Using the formulas for the returns above, the average return
on the short-term bond is πš€πš€Μ… and the average return on the long-term bond is approximately
2𝐼𝐼 Μ… − πš€πš€Μ… since οΏ½
𝚀𝚀′ = πš€πš€Μ…. Risk neutrality implies 2𝐼𝐼 Μ… − πš€πš€Μ… ≈ πš€πš€Μ…, and hence, πš€πš€Μ… ≈ 𝐼𝐼,Μ… which means the
yield curve would be flat on average, contrary to what is observed.
Now suppose investors are risk averse regarding the return in the first period. Since the
long-term bond has a risky return if it is sold before maturity, the portfolio choice model of
Section 7.8 explains why the long-term bond needs to offer a higher average return. This
risk premium for the long-term bond requires 2𝐼𝐼 Μ… − πš€πš€Μ… > πš€πš€Μ…, hence, implying 𝐼𝐼 Μ… > πš€πš€Μ…. Investors’
dislike of risk can thus explain why the yield curve is upward sloping on average.
7.9 The functions of banks
Although saving is related to investment in equilibrium, many savers do not directly make
loans to firms or hold shares. Instead, banks are intermediaries between those who want to
save and those who want to borrow. What is the value of this intermediation service
provided by banks?
One important aspect of this intermediation is that individuals lack knowledge about
creditworthiness of borrowers and would face large costs of monitoring them closely. Banks
as intermediaries between many individual savers and borrowers with whom they have an
established relationship and thus more knowledge about are able to reduce costs arising
from asymmetries of information. A further advantage of banks coming from their scale is
that they can diversify lending much more easily than lending done by individuals, which is
more concentrated when an individual has limited funds to deploy.
Another important aspect of the intermediation done by banks is known as maturity
transformation. This is where a bank takes deposits from savers, offering to pay interest and
give access to funds on demand but uses depositors’ funds to make loans to firms for
longer-term investment projects. In other words, the bank borrows short from depositors
and lends long. This is a valuable service because borrowers typically want long maturity
loans, while savers want quick access to their funds on demand and for those funds to be
safe.
In what follows, we will focus on the role of banks in maturity transformation. We will see
how bank deposits are a more liquid asset for savers to hold than direct equity investment
in firms, so the maturity transformation done by banks has a social value. However, this
maturity transformation also exposes banks to the risk of bank runs.
254
EC2065 Macroeconomics | Chapter 7: Banking and finance
7.9.1 The Diamond-Dybvig model
We will analyse banks using a framework known as the Diamond-Dybvig model. This model
considers an economy with a large number 𝑁𝑁 of individuals who want to save but who do
not know when in future they will need access to their savings.
The model assumes three periods, 0, 1, and 2. All individuals start with one unit of real
wealth in period 0. Some will be ‘early types’ who want to consume their wealth in period 1,
while others will be ‘late types’ who want to consume at 2. No one knows in advance
whether they will be ‘early’ or ‘late’ types but all independently have probability 𝑑𝑑 of being
an ‘early’ type. Individuals learn their types in period 1.
Let 𝑐𝑐1 denote the consumption received by an early type, and 𝑐𝑐2 the consumption of a late
type. Now consider the preferences of an individual over these consumption outcomes from
the perspective of period 0 when it is not known whether 𝑐𝑐1 or 𝑐𝑐2 will be received.
Individuals prefer more of each of 𝑐𝑐1 and 𝑐𝑐2 to less but, crucially, they are assumed to be
risk averse, i.e. they dislike uncertainty about whether they will consume 𝑐𝑐1 or 𝑐𝑐2 .
We can represent risk-averse preferences over 𝑐𝑐1 and 𝑐𝑐2 graphically using indifference
curves as we did in the portfolio choice model from Section 7.8. Here, an individual
consumes either 𝑐𝑐1 or 𝑐𝑐2 , analogous to the two outcomes in the portfolio-choice model,
although here these occur at different dates. We ignore impatience to focus on risk
aversion, so we treat the two periods symmetrically even though period 2 is further in the
future than period 1.
In a diagram with 𝑐𝑐1 and 𝑐𝑐2 on the axes, indifference curves with risk aversion are convex to
the origin. This means they lie above their tangent lines, or in other words, given outcomes
𝑐𝑐1 and 𝑐𝑐2 with probabilities 𝑑𝑑 and 1 − 𝑑𝑑, an individual prefers 𝑐𝑐 𝑒𝑒 = 𝑑𝑑𝑐𝑐1 + (1 − 𝑑𝑑)𝑐𝑐2 , the
expected level of consumption, for certain. Since we ignore impatience here, the tangent
lines to the indifference curves crossing the 45π‘œπ‘œ line with no consumption uncertainty all
have gradient −𝑑𝑑/(1 − 𝑑𝑑).
We turn now to what can be done with the initial real wealth individuals have in period 0 to
allow them to consume in the future time periods 1 or 2. We assume wealth can be stored
at no cost between periods 0 and 1, and between periods 1 and 2 but no return is earned
from this storage.
Wealth can be put into a long-term investment starting in period 0. One unit invested in
period 0 yields real wealth of 1 + 𝑅𝑅 in period 2 when the investment reaches maturity,
where 𝑅𝑅 denotes the total return between periods 0 and 2. However, if the investment is
abandoned before maturity in period 1 then no return is earned, although the initial
amount invested can be recovered. This means the long-term investment is fundamentally
illiquid – only long-term investors who can tie up funds between periods 0 and 2 can earn a
return on the investment.
255
EC2065 Macroeconomics | Chapter 7: Banking and finance
7.9.2 An economy with no financial intermediaries
In the Diamond-Dybvig model, we can see the benefit of having banks by first analysing
what happens if there are no financial intermediaries. To begin with, suppose there are no
financial markets active after period 0, so individuals simply choose what to do with their
wealth in period 0 and use the proceeds to consume in periods 1 or 2.
Under the assumptions, the long-term investment is never worse than storage because the
initial investment can be recovered if necessary in period 1. Hence, to save for the future, all
individuals initiate one unit of investment in period 0, or lend funds to someone to do it for
them. If they learn they are early types in period 1, they abandon the investment and
recover their initial wealth, which permits consumption 𝑐𝑐1 = 1. If they learn they are late
types, they keep the investment going until the return is received in period 2 and enjoy
consumption 𝑐𝑐2 = 1 + 𝑅𝑅. If the investment return 𝑅𝑅 is large, this means there is a large
difference between the outcomes 𝑐𝑐1 and 𝑐𝑐2 , which is bad for those who are risk averse.
Even without financial intermediaries, perhaps having a financial market where ongoing
investments can be traded for consumption goods might help. Suppose in period 1, there is
a market where ongoing investment projects can be bought or sold at price 𝑝𝑝 in units of real
consumption goods.
Since investments can be liquidated to recover the initial unit of real wealth put in, early
types only gain from selling investments in the market if 𝑝𝑝 > 1. But investments can be
bought only using stored wealth, or the proceeds from liquidating other investments, hence,
late types buying an ongoing investment at price 𝑝𝑝 would forgo 𝑝𝑝 other investments, so they
only gain from buying if 𝑝𝑝 < 1. Since there is no price 𝑝𝑝 ≠ 1 at which both early types want
to sell and late types want to buy, the equilibrium is 𝑝𝑝 = 1 and no trade takes place in the
financial market because there are no gains from trade. Having a financial market where the
investment can be traded does not circumvent its fundamental illiquidity.
7.10 Banking as maturity transformation
In the Diamond-Dybvig model, we will see that banks can improve upon the outcome with
no financial intermediation. Banks will offer to take savers’ funds as deposits and pay them
some interest, while still giving savers the right to withdraw on demand. This makes bank
deposits more liquid than the assets held by banks, namely, the lending that finances the
illiquid long-term investments.
Banks will engage in maturity transformation, issuing short-term liabilities (deposits) to fund
long-term assets (lending). Banks help by pooling the risk of individuals needing early access
to funds because they are able to take deposits from a large number of savers.
7.10.1 Bank deposits
Instead of savers directly investing or lending to investors, a bank offers to take deposits
from them on the following terms (the deposit contract). For each unit of funds deposited in
period 0, a depositor has the right to withdraw either 𝑑𝑑1 in period 1 or 𝑑𝑑2 in period 2.
256
EC2065 Macroeconomics | Chapter 7: Banking and finance
Implicitly, 𝑑𝑑1 and 𝑑𝑑2 define the interest rates π‘Ÿπ‘Ÿ and π‘Ÿπ‘Ÿ′ paid on deposits between periods 0
and 1 (with 𝑑𝑑1 = 1 + π‘Ÿπ‘Ÿ), and periods 1 and 2 (with 𝑑𝑑2 ⁄𝑑𝑑1 = 1 + π‘Ÿπ‘Ÿ′).
An important point is that the bank does not know either ex ante or ex post who is an early
or a late type – think of the true need for funds as being private information. Everyone who
deposits one unit of funds in period 0 has the right to withdraw an amount 𝑑𝑑1 in period 1.
Suppose a large number of individuals 𝑁𝑁 each deposits one unit of funds. Deposit taking is
the only source of funds here for banks – we ignore bank capital because there is no risk
when investments are held to maturity (𝑅𝑅 is known with certainty). A bank chooses the
fraction π‘₯π‘₯ of funds that are stored (a liquid asset) and the fraction 1 − π‘₯π‘₯ put into the longterm investment (an illiquid asset). The bank needs to have enough liquid assets to be able
to meet the demand for withdrawals in period 1.
Suppose for now that only early types withdraw 𝑑𝑑1 in period 1, which requires among other
things that 𝑑𝑑2 ≥ 𝑑𝑑1 (i.e. the bank never offers a negative interest rate π‘Ÿπ‘Ÿ ′ < 0), otherwise
everyone wants to withdraw in period 1. As each of the large number 𝑁𝑁 of depositors has
an independent chance of being an early type with probability 𝑑𝑑, the law of large numbers
implies there will be 𝑑𝑑𝑑𝑑 early types in total. Given π‘₯π‘₯, the value of liquid assets in period 1 is
π‘₯π‘₯π‘₯π‘₯, so 𝑑𝑑1 and π‘₯π‘₯ must satisfy:
𝑑𝑑𝑑𝑑𝑑𝑑1 ≤ π‘₯π‘₯π‘₯π‘₯
By period 2, the (1 − π‘₯π‘₯ )𝑁𝑁 funds put into long-term investments have earned a return and
now have value (1 + 𝑅𝑅)(1 − π‘₯π‘₯ )𝑁𝑁. All early types have withdrawn already and only the
deposits of (1 − 𝑑𝑑)𝑁𝑁 late types remain. The amount 𝑑𝑑2 promised to those withdrawing in
period 2 must satisfy:
(1 − 𝑑𝑑)𝑁𝑁𝑑𝑑2 ≤ (1 + 𝑅𝑅)(1 − π‘₯π‘₯ )𝑁𝑁
7.10.2 Competition between banks
We assume that banks are competitive, so entry of new banks or the threat of entry implies
banks will make zero economic profits from the deposit contract they offer. This means the
feasibility conditions derived above hold with equality, leaving nothing to be paid out as
bank profits in either period 1 or 2. Cancelling 𝑁𝑁 from both sides, the zero-profit conditions
are 𝑑𝑑𝑑𝑑1 = π‘₯π‘₯ and (1 − 𝑑𝑑)𝑑𝑑2 = (1 + 𝑅𝑅)(1 − π‘₯π‘₯). Substituting the first into the second to
eliminate π‘₯π‘₯ and dividing both sides by 1 + 𝑅𝑅 leads to an overall feasibility and zero-profit
condition on the terms (𝑑𝑑1 , 𝑑𝑑2 ) of a bank’s deposit contract:
𝑑𝑑𝑑𝑑1 + (1 − 𝑑𝑑)
𝑑𝑑2
=1
1 + 𝑅𝑅
This condition is the downward-sloping straight line drawn in Figure 7.15, which shows the
combinations of 𝑑𝑑1 and 𝑑𝑑2 that are feasible given the investment returns and the fraction of
early types, and where the bank makes no profits. Points below the line correspond to
positive bank profits, while those above are not feasible because banks would make losses.
The condition was derived supposing a late type is willing to wait until period 2 to withdraw,
257
EC2065 Macroeconomics | Chapter 7: Banking and finance
which is true if 𝑑𝑑2 ≥ 𝑑𝑑1 , assuming other late types wait as well. We therefore ignore the
points below the 45∘ line in the figure.
What interest rates π‘Ÿπ‘Ÿ and π‘Ÿπ‘Ÿ′ can banks offer? By definition, 𝑑𝑑1 = 1 + π‘Ÿπ‘Ÿ and 𝑑𝑑2 =
(1 + π‘Ÿπ‘Ÿ ′ )𝑑𝑑1 . One possibility is π‘Ÿπ‘Ÿ = 0 and π‘Ÿπ‘Ÿ′ = 𝑅𝑅, which is equivalent to 𝑑𝑑1 = 1 and 𝑑𝑑2 = 1 +
𝑅𝑅. However, this deposit contract is not particularly interesting because it simply replicates
the outcome if there were no banks in the economy. This corresponds to the point labelled
N in Figure 7.15. Another possibility is π‘Ÿπ‘Ÿ = (1 − 𝑑𝑑)𝑅𝑅/(1 + 𝑑𝑑𝑑𝑑) and π‘Ÿπ‘Ÿ′ = 0, which is
equivalent to 𝑑𝑑1 = 𝑑𝑑2 = (1 + 𝑅𝑅)/(1 + 𝑑𝑑𝑑𝑑). This deposit contract is special because it
eliminates all liquidity risk for depositors – they receive the same amount whether they
need to withdraw in period 1 or period 2. Every depositor earns interest π‘Ÿπ‘Ÿ < 𝑅𝑅 between
periods 0 and 1. It is the point labelled L on the 45∘ line in the figure.
Figure 7.15: Feasible deposit contracts
7.10.3 The equilibrium deposit contract offered by banks
Competition compels banks to offer in period 0 the deposit contract on the highest
indifference curve of depositors subject to the zero-profit condition. Geometrically, this
corresponds to the point of tangency (𝑑𝑑1∗ , 𝑑𝑑2∗ ) between a depositor’s indifference curve and
the zero-profit line shown in Figure 7.16. If banks did not pick this deposit contract, a new
bank would be able to enter offering depositors a contract they prefer while still making
positive profits for the entrant bank.
258
EC2065 Macroeconomics | Chapter 7: Banking and finance
Figure 7.16: Optimal deposit contract
If individuals are sufficiently risk-averse, meaning that their indifference curves have
enough curvature, then banks offer a deposit contract with 0 < π‘Ÿπ‘Ÿ < 𝑅𝑅 and 0 < π‘Ÿπ‘Ÿ′ < 𝑅𝑅,
which corresponds to a point in the diagram with 𝑑𝑑1 > 1 and 𝑑𝑑2 < 1 + 𝑅𝑅, that is, one to the
right of N but to the left of L. If depositors are not very risk averse then banks might offer a
point at N or even to the left, while only if depositors have the maximum risk aversion (‘L’shaped indifference curves) will banks offer L.
With the equilibrium deposit contract for sufficiently risk-averse depositors, the withdrawals
and consumption levels of the early and late types are 𝑐𝑐1 = 𝑑𝑑1 > 1 and 𝑐𝑐2 = 𝑑𝑑2 < 1 + 𝑅𝑅.
Individuals are able to get to a higher indifference curve than without banks, which
corresponds to the point N where 𝑐𝑐1 = 1 and 𝑐𝑐2 = 1 + 𝑅𝑅.
The optimal deposit contract offered by banks helps with risk sharing. It provides insurance
to those who need access to their funds early by paying some portion of the illiquid
investment return, even though investments have not yet yielded any return. The trade-off
is that late types do not get the full investment return but risk-averse individuals value this
insurance ex ante.
The reason why financial intermediation is valued here is similar to why risk-averse people
value insurance. For example, home insurance and car insurance cover the risk of
unexpectedly needing to find funds to make repairs. Banks provides liquidity insurance –
access on demand to funds without sacrificing all investment returns. A crucial difference is
that insurance policies pay out when an objectively verifiable event occurs, for example, a
fire. However, problems of asymmetric information and moral hazard mean that some risks
are not directly insurable, for example, the risk of unemployment, which is what creates a
need for liquidity insurance instead. While banks help to provide this, we will see that the
unverifiable nature of being an ‘early type’ creates a danger of bank runs.
259
EC2065 Macroeconomics | Chapter 7: Banking and finance
7.11 Bank runs
The deposit contract offered by banks can provide valuable risk sharing for those facing
liquidity shocks. But using demand deposits to fund illiquid investments makes banks
vulnerable to runs. Bank runs are self-fulfilling losses of confidence in the ability of a bank to
honour its contract with depositors. This fragility of banks comes from a maturity mismatch
between their assets and liabilities.
7.11.1 Strategic demands for withdrawals and bank failures
In the analysis of the Diamond-Dybvig model in Section 7.10, it was assumed only the early
types would want to withdraw in period 1. Clearly, all early types do want to withdraw in
period 1 but what about the late types? The bank offers 𝑑𝑑2 > 𝑑𝑑1 if they withdraw in period
2, which appears better than getting 𝑑𝑑1 at 1 and subsequently receiving no return until
wealth is consumed in period 2. But this does not consider whether the bank is able to
honour its contract and how the behaviour of other depositors affects this.
Suppose a bank has 𝑁𝑁 depositors (where 𝑁𝑁 is large) and consider a single late-type
depositor who believes all other late types will attempt to withdraw in period 1. What
should this person do? If all late-type depositors but one try to withdraw in period 1 then
𝑁𝑁 − 1 depositors in total request withdrawals (all early types and all late types but one). The
deposit contract offers 𝑑𝑑1 to those withdrawing in period 1 and remember that withdrawal
in period 1 is not restricted to early types. An individual’s type is private information and the
virtue of the liquidity insurance provided by banks is that it was not necessary for people to
prove they need funds in order to make a withdrawal.
When those who do not need funds in period 1 request withdrawal at that date, we say a
bank is faced with a ‘run’. In period 1, a bank holds π‘₯π‘₯π‘₯π‘₯ of depositors’ funds stored in a liquid
asset and (1 − π‘₯π‘₯ )𝑁𝑁 in a long-term investment. The long-term investment can be liquidated
in period 1 but no return is earned and only the initial funds (1 − π‘₯π‘₯ )𝑁𝑁 are recovered. It
follows the maximum amount recoverable in period 1 by disposing of the bank’s assets is 𝑁𝑁.
Suppose a bank faces 𝑁𝑁 − 1 requests to withdraw 𝑑𝑑1 > 1 each in period 1. In this case, we
assume the bank processes withdrawal requests from depositors in a random queuing order
(‘sequential service’). For large 𝑁𝑁, we have that (𝑁𝑁 − 1)/𝑁𝑁 ≈ 1, so (𝑁𝑁 − 1)𝑑𝑑1 > 𝑁𝑁 because
𝑑𝑑1 > 1. The bank cannot recover enough from its assets to meet all withdrawal requests.
Therefore, faced with a run of 𝑁𝑁 − 1 depositors in period 1, the bank will fail because of the
return promised to depositors (𝑑𝑑1 > 1) combined with the illiquidity of its investments.
Only some of the 𝑁𝑁 − 1 withdrawal requests are met.
When the bank fails, there is nothing left in period 2. A late-type depositor who waits until
period 2 after the bank run to make a withdrawal request gets nothing for sure. This implies
that joining the queue and participating in the bank run is better than waiting because there
is a chance to get back something in period 1 (which can be stored until consumption in
period 2).
260
EC2065 Macroeconomics | Chapter 7: Banking and finance
7.11.2 Multiple equilibria and the possibility of bank runs
We conclude that the Diamond-Dybvig model does not have a unique prediction even
though depositors and bank are individually rational in their behaviour. There is the ‘good’
equilibrium that we have seen in Section 7.10 where only early types request withdrawal in
period 1 and the bank survives and succeeds in facilitating risk sharing. However, there is
also a ‘bad’ equilibrium with a bank run where all depositors request withdrawal in period 1,
the bank fails and risk sharing breaks down.
There are multiple equilibria because bank runs are self-fulfilling. It is individually rational to
join in a run if others are doing so. Although both the good and bad equilibria are consistent
with individual rationality, collectively, many are better off and no-one is worse off in the
good equilibrium compared to the bad equilibrium.
7.11.3 The 2007 Northern Rock bank run
In 2007, with the bank Northern Rock, there was the first bank run in the UK since 1866.
After a rapid expansion, Northern Rock sought liquidity support from the Bank of England on
12 September 2007 and this became public knowledge the next day. Queues outside
branches began to form on 14 September as depositors began to panic. By 17 September,
the UK government had stepped in to guarantee all deposits.
7.11.4 The ‘shadow’ banking system and the 2008 financial crisis
The Diamond-Dybvig model focuses on runs on the retail banking system but financial
markets can sometimes face situations similar to bank runs. A version of this was seen in the
so-called ‘shadow’ banking system during the 2008 financial crisis.
Prior to the financial crisis, there was a large expansion in ‘securitisation’, which is where
mortgages and other long-term loans were originated and then sold in packages to other
financial institutions such as investment banks. This ‘shadow’ banking system financed
purchases of securitised assets through short-term borrowing in the money markets, which
required frequent rollovers. In 2008, investment banks such as Bear Stearns and Lehman
Brothers struggled to roll over short-term financing. The belief that these banks and other
institutions could not repay short-term debts through the sale of illiquid assets caused new
financing to dry up, creating a self-fulfilling freeze in the money markets.
7.12 Deposit insurance and bank regulation
We have seen in Section 7.10 that banks provide a valuable service but the Diamond-Dybvig
model also highlights their inherent fragility, as indicated by the analysis of bank runs in
Section 7.11. Can government intervention prevent bank runs, leaving only the ‘good
equilibrium’ where banks are successful? Some possible policy interventions are:
•
•
•
•
A system of deposit insurance
The central bank acting as ‘lender of last resort’
Imposing bank capital requirements
Imposing reserve requirements.
261
EC2065 Macroeconomics | Chapter 7: Banking and finance
7.12.1 Deposit insurance
Suppose the government guarantees that it will compensate depositors for losses arising
from bank failures, a system of deposit insurance. This is ultimately backed by the
government’s tax revenues if needed. If deposit insurance is complete and credible then the
late types in the Diamond-Dybvig model need never request withdrawal in period 1 because
they are guaranteed to receive 𝑑𝑑2 > 𝑑𝑑1 by waiting until period 2. Therefore, no bank runs
occur and, absent any other problems, no banks will fail and the government would not
actually ever need to pay out on its guarantee to depositors.
This system of deposit insurance appears to provide a ‘free lunch’ of eliminating bank runs
at no cost. However, it comes with the drawback of removing the incentive for depositors to
avoid banks that take too much risk. To the extent that banks are ‘too big to fail’
(systemically important), there is also an implicit government guarantee to other bank
creditors. This leads banks to take excessive risks because these guarantees remove the
discipline otherwise provided by creditors. Note that this line of argument goes beyond the
basic Diamond-Dybvig model, which has no investment risk. It suggests deposit insurance
may indeed avoid self-fulfilling panics but there will still be bank failures owing to losses on
excessively risky investments.
Remember that the bank run in the Diamond-Dybvig model is due to an illiquidity problem.
A bank can repay all late types the promised amount in period 2 but cannot meet
unexpected and collectively irrational demands for withdrawals in period 1. This is different
from an insolvency problem where a bank has suffered losses on its investments that leave
it unable to honour its promises to creditors even without any panic. A system of deposit
insurance protects depositors against bank failures for both reasons but we only want this
protection for the problem of illiquidity – insurance that also operates in the case of
insolvency creates a problem of moral hazard. A serious challenge for bank regulation is that
it is hard to distinguish between illiquidity and insolvency in real time.
7.12.2 The central bank as ‘lender of last resort’
Another way to avoid bank runs is for the central bank to act as ‘lender of last resort’. This
means the central bank operates a discount window or borrowing facility that provides
liquidity to commercial banks by making loans against illiquid assets pledged as collateral.
The central bank usually lends through these facilities at a penalty rate but this is still more
favourable to a bank facing a bank run than attempting to obtain liquidity on commercial
terms and can enable the bank to survive. While ‘lender of last resort’ can stop runs, it also
leads to a moral-hazard problem because of the difficultly of distinguishing illiquidity and
insolvency in real time.
7.12.3 Bank capital requirements
Given the moral-hazard problems of policies designed to avoid bank runs, other bank
regulation is important in reducing the risk of bank insolvency and the cost of bailouts. Bank
capital requirements are a widely used regulatory tool to achieve this (‘bank capital’
narrowly defined refers to a bank’s equity). Capital requirements specify a minimum ratio of
262
EC2065 Macroeconomics | Chapter 7: Banking and finance
bank equity to total bank assets (sometimes weighted by assets’ riskiness). Funds provided
by a bank’s shareholders and retained profits not paid out to shareholders can absorb some
losses on a bank’s investments without jeopardising depositors’ funds.
7.12.4 Reserve requirements
Another regulatory tool is reserve requirements. These restrict banks to hold a minimum
fraction of deposits as reserves. The advantage of this is that reserves are the most liquid
type of asset, which helps ensure banks are able to meet demands for withdrawals
whenever they occur.
In the Diamond-Dybvig model, the liquid asset is ‘storage’, which earns no return. We saw in
Section 7.10 that banks would already choose to hold some liquid assets, so reserve
requirements may not even be binding. Moreover, reserve requirements would not
eliminate the risk of the bank runs studied in Section 7.11, only making their consequences
less severe. The only case where having reserve requirements eliminates bank runs is when
those reserve requirements are 100 per cent of deposits.
In terms of the model from Section 7.4, imposing a higher reserve-to-deposit ratio than
banks would otherwise choose reduces the supply of deposits and increases the interest
margin 𝑖𝑖 − π‘–π‘–π‘šπ‘š in equilibrium, which is inefficient, although this problem can be mitigated if
sufficient interest is paid on the required reserves themselves.
Box 7.6: The 100 per cent reserve requirements
One radical proposal for making the banking system safer is imposing a 100 per cent
reserve requirement on banks. This policy is also known as ‘narrow banking’. As deposits
are fully backed by reserves, banks are always able to satisfy any requests for withdrawals.
Therefore, a bank cannot fail as a result of a bank run, which gives the banking system
much greater resilience.
However, 100 per cent reserve requirements mean that banks cannot make loans financed
by taking deposits, which reduces bank lending if deposits cannot easily be replaced by
other unregulated bank liabilities (bonds or equity). This means banks cannot perform
maturity transformation, which we saw in Section 7.10 has a social value in the DiamondDybvig model.
We now consider how to incorporate reserve requirements into the Diamond-Dybvig
model, which were not explicitly considered in our earlier analysis. There, the liquid asset is
‘storage’, which earns no return and investments in the illiquid asset are recoverable,
although only by sacrificing all of the investment return. The liquidation value of a bank’s
assets in period 1 is thus equal to 1 per depositor. We interpret a minimum reserve-deposit
ratio π‘žπ‘ž as a minimum value of the ratio of the asset liquidation value 1 to the amount 𝑑𝑑1 in
a depositor’s account in period 1, that is, 1⁄𝑑𝑑1 ≥ π‘žπ‘ž. This is equivalent to 𝑑𝑑1 ≤ 1⁄π‘žπ‘ž.
263
EC2065 Macroeconomics | Chapter 7: Banking and finance
We know from Section 7.10 that banks have an incentive to offer 𝑑𝑑1 > 1. If π‘žπ‘ž is low, so 1/π‘žπ‘ž
is high, the reserve requirement is not binding. Reserve requirements ultimately become
binding for higher values of π‘žπ‘ž closer to 1. Once this point is reached, the deposit contract is
limited to 𝑑𝑑1 = 1⁄π‘žπ‘ž, which is less than 𝑑𝑑1 would otherwise be.
Reserve requirements with π‘žπ‘ž < 1 still result in 𝑑𝑑1 > 1. This means a large enough bank run
in period 1 still causes a bank to fail because 𝑑𝑑1 exceeds the liquidation value of all assets,
although a greater number of requests for withdrawals could be satisfied during a run.
Therefore, less than 100 per cent reserve requirements do not rule out self-fulfilling bank
runs in the Diamond-Dybvig model.
The 100 per cent reserve requirements (π‘žπ‘ž = 1) imply that banks must offer 𝑑𝑑1 = 1. In this
case, they can always meet all requests for withdrawals, so self-fulfilling runs cannot occur,
eliminating the bad outcome of the Diamond-Dybvig model. But as shown in Figure 7.17,
the 100% reserve requirements prevent maturity transformation, so the good outcome is
worse.
Figure 7.17: The 100 per cent reserve requirements in the Diamond-Dybvig model
More generally, beyond the Diamond-Dybvig model, reserve requirements reduce the
supply of bank deposits when interest is not paid on required reserves, as explained in
Section 7.4. This reduction in 𝑀𝑀 𝑠𝑠 raises the gap between 𝑖𝑖 and π‘–π‘–π‘šπ‘š . Having 100 per cent
reserve requirements mean that banks supply deposits up to the point where π‘–π‘–π‘šπ‘š = π‘–π‘–π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ ,
where π‘–π‘–π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ is interest paid on required reserves. To avoid a reduction in the supply of
deposits 𝑀𝑀 𝑠𝑠 , the central bank must pay interest on required reserves π‘–π‘–π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ and increase the
supply of reserves to the banking system. Maintaining an efficient supply of money to the
economy after imposing 100 per cent reserve requirements thus entails a fiscal cost.
264
EC2065 Macroeconomics | Chapter 7: Banking and finance
Box 7.7: Central-bank digital currency
Central-bank digital currency (CBDC) refers to an electronic form of fiat money held directly
by non-banks. CBDC would enable households and firms to keep money directly in accounts
at the central bank, similar to the way commercial banks hold reserves in accounts at the
central bank. As of 2021, apart from some small-scale experiments, CBDCs are not currently
in use – but this may soon change.
Unlike deposits at a commercial bank, there cannot be a ‘run’ on central banks offering
CBDC accounts. This is because fiat money not redeemable for anything but itself. Thus,
CBDC would offer the convenience of electronic payments without the risks associated with
commercial-bank deposits. Moreover, it would offer greater security than the physical cash
that is currently the only way non-banks can hold fiat money, removing the anonymity that
is a feature of cash.
Central banks can pay interest on CBDCs in the same way they can pay interest on reserves.
If they were to do so at a sufficiently high rate, this might lead people to switch from
commercial-bank deposits to CBDCs. That would present a similar challenge to commercial
banks – how to fund bank lending – as is present with 100 per cent reserve requirements.
In some ways, a CBDC has similarities to the proposal for 100 per cent reserve
requirements, which is also designed to make the monetary system free of ‘runs’. The
difference is that with CBDC, there would be a choice of whether to hold the CBDC or a
commercial-bank deposit backed by less than 100 per cent reserves, rather than forcing
everyone to hold fully reserve-backed deposits or cash.
CBDC may ultimately be intended as a replacement for physical cash. As discussed in
Section 6.14, this would open the door to significantly negative nominal interest rates.
265
EC2065 Macroeconomics | Chapter 8: Business cycles
Chapter 8: Business cycles
This chapter investigates the causes of booms and busts – cyclical fluctuations in economic
activity. We will also explore whether policy interventions, such as monetary or fiscal policy,
should be used to tame business cycles. Economists have not reached a consensus on the
causes of business cycles, so we will present a number of different theories. We begin by
looking at an economy with nominal rigidity where markets fail to clear because prices are
slow to adjust. When an economy with nominal rigidity is hit by shocks, GDP tends to
fluctuate excessively because markets do not function efficiently. There is scope for central
banks or governments to intervene to deliver better economic outcomes.
We will also examine theories of business cycles that do not depend on nominal rigidity. We
cover real business cycle theory, which argues that the business cycle is simply the
economy’s efficient response to supply shocks. An alternative approach, the coordination
failure model, suggests business cycles are driven by self-fulfilling changes in optimism or
pessimism, rather than fundamental shocks.
Essential reading
•
Williamson, Chapters 13 and 14.
8.1 Nominal rigidity
Most models we have used so far feature ‘market clearing’, with the relevant prices in each
market of the economy adjusting so that desired demand and supply are equalised.
However, there is an important set of ideas about how the economy works that suggests
prices change only slowly to achieve market clearing.
In this chapter, we will explore the implications of prices quoted in units of money
remaining fixed even when supply or demand conditions change. The term ‘nominal rigidity’
is used to refer to any type of prices quoted in money not adjusting as required to clear
markets. Why the focus on prices quoted in units of money? For prices to remain fixed, they
must already have been set and specified in some particular units. We discussed in Chapter
6 the convenience advantage of quoting all prices in terms of money – what we called
money’s ‘unit of account’ function.
One common form of nominal rigidity is the ‘stickiness’ of retail prices faced by consumers,
although there are other nominal rigidities too such as sticky wages and sticky producer
prices. Why would goods prices be sticky? One basic explanation points to physical costs of
adjustment, for example, the costs of printing new price labels. Costs of this type are known
as ‘menu costs’. However, technology and online retailing have greatly reduced such costs.
More broadly, we can also envisage costs of making pricing decisions as a cost of
adjustment, for example, the managerial time and resources needed to select a new price.
266
EC2065 Macroeconomics | Chapter 8: Business cycles
Leaving a price unchanged saves costs because no new price needs to be determined. Firms’
relationships with their customers might also create barriers to price adjustment. For
example, the coordination problem of no firm wanting to go first with a price increase and
antagonise its customers.
Data on observations of individual prices also provides some support for the relevance of
nominal rigidity. The price data shown in Figure 8.1 suggests individual goods prices quoted
in units of money change infrequently, even during times when the economy experiences
shocks.
Figure 8.1: Evidence on sticky prices
Models of the economy where there is a failure of market clearing because of nominal
rigidity are often labelled ‘Keynesian’ models. We will study extensively what is called a
‘new Keynesian’ model. This model is ‘new’ in the sense that it has many of the features of
modern macroeconomic models seen in earlier chapters but in combination with older
Keynesian ideas about the failure of markets to clear.
The only nominal rigidity in the new Keynesian model is stickiness of goods prices. Having
price stickiness in the model implicitly assumes prices are set by firms, not markets. For this
reason, the model also features an imperfectly competitive goods market.
It is important to note that the reasons for nominal rigidity are unlikely to prevent eventual
price adjustment in the long run. Prices will ultimately adjust to shocks, so markets still clear
in the long run. Consequently, the model will make a distinction between the short run
where prices are sticky and the long run where prices are flexible. The transition from the
short run to the long run is studied in Chapter 9.
There is also an important difference compared to our earlier analysis of the failure of
labour-market clearing owing to efficiency wages in Section 5.2. There, firms have incentives
to pay efficiency wages at all times, so the rigidity of wages is not temporary and does not
disappear in the long run. Moreover, efficiency wages impart rigidity to real wages, which is
conceptually distinct from nominal rigidity.
267
EC2065 Macroeconomics | Chapter 8: Business cycles
8.2 The new Keynesian model
The new Keynesian model is essentially the dynamic macroeconomic model introduced in
Chapter 3 with nominal rigidity added to it. The goods market has sticky prices and is
treated as an imperfectly competitive market for consistency. This is in contrast to the
perfect competition and fully flexible prices of the earlier dynamic macroeconomic model.
We start with a simple version of the model where all prices are completely fixed and are
expected to remain so in the near term. Later in Chapter 9, we will add partial price
adjustment to analyse inflation. We can illustrate the main consequences of nominal rigidity
by only having sticky prices in the goods market. Nominal wages are fully flexible and we can
treat the labour market as being perfectly competitive. However, it is possible to combine
our earlier analysis of efficiency wages with the new Keynesian model and this case is also
considered later.
Goods prices being sticky has consequences for our analysis of firms’ labour demand and
the implied level of output supply in the goods market. Recall that with perfect competition,
firms can sell as much output as they like at the market price, which adjusts to clear the
goods market. Each extra unit of labour allows a firm to produce 𝑀𝑀𝑃𝑃𝑁𝑁 units of output, the
marginal product of labour, so a profit-maximising firm hires labour at real wage 𝑀𝑀 up to the
point where 𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑀𝑀. This is the labour demand curve with perfect competition.
With nominal rigidity, if an imperfectly competitive firm does not change the fixed price 𝑃𝑃� of
the good it is selling then it cannot choose how much it sells. If demand falls, shifting the
demand curve for the firm’s product leftwards, it cannot sell as much as before at price 𝑃𝑃�,
and as a result hires less labour. Even if the marginal product of labour exceeds the real
wage, the firm does not hire more labour because it cannot sell the extra output that could
be produced with additional employment. If demand rises, a firm can now sell more at the
same price 𝑃𝑃�, so it hires more labour to meet the additional demand. Strictly speaking, this
is true only as long as 𝑀𝑀 ≤ 𝑀𝑀𝑃𝑃𝑁𝑁 , otherwise the firm would not want to sell more, although
as we will see, cases where this condition fails to hold are not likely to be relevant in
practice.
This logic tells us that the labour demand curve is no longer given by the marginal product of
labour but is instead perfectly wage inelastic as depicted in the right panel of Figure 8.2. The
vertical labour demand curve shifts with the aggregate demand for goods, which we
suppose affects the demand for each individual good. We truncate the vertical labour
demand curve where it goes above 𝑀𝑀𝑃𝑃𝑁𝑁 .
268
EC2065 Macroeconomics | Chapter 8: Business cycles
Figure 8.2: Labour demand with sticky prices
The wage-inelastic labour demand curve 𝑁𝑁 𝑑𝑑 (π‘Œπ‘Œ) is determined using the production function
π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁) to find what level of employment 𝑁𝑁 is needed to produce output π‘Œπ‘Œ sufficient
to meet demand, taking as given the stock of capital 𝐾𝐾 and TFP 𝑧𝑧. This is shown in the left
panel of Figure 8.2. An increase in π‘Œπ‘Œ shifts labour demand to the right.
How do we analyse outcomes in the labour market when goods prices are sticky? Since
labour demand is wage inelastic, the outcome for employment 𝑁𝑁 is directly determined by
the position of 𝑁𝑁 𝑑𝑑 (π‘Œπ‘Œ). For wages 𝑀𝑀, we need to be more specific about how the supply side
of the labour market works.
Let us first suppose that wages are fully flexible. In that case, wages are determined by a
standard upward-sloping labour supply curve 𝑁𝑁 𝑠𝑠 (π‘Ÿπ‘Ÿ) derived from households’ optimality
conditions 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 = 𝑀𝑀 and 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝑙𝑙′ = (1 + π‘Ÿπ‘Ÿ)𝑀𝑀/𝑀𝑀′ as explained in Chapters 1 and 3. With
a representative household, Chapter 3 explains why 𝑁𝑁 𝑠𝑠 (π‘Ÿπ‘Ÿ) is unambiguously upward sloping
owing to the substitution effect of wages and shifts to the right when the real interest rate π‘Ÿπ‘Ÿ
increases (intertemporal substitution). Although we consider a monetary economy, we
ignore here the effect of money’s medium of exchange function on 𝑁𝑁 𝑠𝑠 , that is to say, we
neglect any effects coming from money being less good as a store of value and acting as a
tax on economic activity (see Section 6.5).
With flexible wages, the real wage 𝑀𝑀 adjusts so that the labour market clears with 𝑁𝑁 𝑑𝑑 (π‘Œπ‘Œ) =
𝑁𝑁 𝑠𝑠 (π‘Ÿπ‘Ÿ) as shown in the left panel of Figure 8.3. We will see that employment fluctuates with
aggregate demand, although if there are no impediments to wage adjustment then there is
no unemployment or fluctuations in unemployment. If we want to study unemployment
over the business cycle, we can combine our earlier analysis of efficiency wages from
Section 5.2 with the new Keynesian model. In that case, the real wage is determined by
firms’ desire to pay an efficiency wage and remains constant. Employment is found on the
inelastic labour demand curve but desired labour supply can be higher, so unemployment
exists. Given desired labour supply, unemployment changes in the opposite direction to
changes in employment. This case is depicted in the right panel of Figure 8.3.
269
EC2065 Macroeconomics | Chapter 8: Business cycles
Figure 8.3: The labour market with sticky prices
In an economy with flexible prices, the output supply curve in the goods market represent
the production of goods and services by firms given employment at the labour-market
equilibrium. However, with sticky prices, the demand for labour depends on the aggregate
demand for goods. This means there is no independent decision made by firms about how
much to sell. The supply of goods passively accommodates changes in demand and thus
there is no output supply curve relevant for determining outcomes in the goods market as
long as goods prices remain sticky. One caveat to this logic is that demand must not be so
large that firms do not want to meet it because wages are too high. We require that 𝑀𝑀 ≤
𝑀𝑀𝑃𝑃𝑁𝑁 , which in the goods market diagram is equivalent to remaining on the left of the
hypothetical output supply curve with perfect competition and flexible prices (π‘Œπ‘ŒοΏ½ 𝑠𝑠 ), where
the real wage would be exactly equal to the marginal product of labour. The hypothetical
supply curve is depicted as a dashed line in Figure 8.4. This condition is not likely to be of
concern in most applications of the model.
Figure 8.4: The goods market with sticky prices
The demand curve in the goods market simply represents the same aggregate demand for
goods and services that was found in the earlier dynamic macroeconomic model from
Section 3.12. The π‘Œπ‘Œ 𝑑𝑑 curve represents the equation π‘Œπ‘Œ 𝑑𝑑 = 𝐢𝐢 𝑑𝑑 + 𝐼𝐼 𝑑𝑑 + 𝐺𝐺 as before and is
270
EC2065 Macroeconomics | Chapter 8: Business cycles
shown in Figure 8.4. But since prices are sticky, the economy does not have to be at the
intersection of the π‘Œπ‘Œ 𝑑𝑑 and π‘Œπ‘Œ 𝑠𝑠 curves. Many points on the π‘Œπ‘Œ 𝑑𝑑 curve can be a goods-market
equilibrium in the short run. Note that in Keynesian models, the π‘Œπ‘Œ 𝑑𝑑 curve was traditionally
known as the 𝐼𝐼𝐼𝐼 (investment = saving) curve. The equation for investment 𝐼𝐼 being equal to
national saving π‘Œπ‘Œ − 𝐢𝐢 − 𝐺𝐺 is equivalent to π‘Œπ‘Œ = 𝐢𝐢 + 𝐼𝐼 + 𝐺𝐺.
The point on the output demand curve the economy reaches and the outcome for real GDP
π‘Œπ‘Œ are determined by the level of interest rates. With complete stickiness of prices there is
zero inflation (πœ‹πœ‹ = 0) and, hence, the Fisher equation 𝑖𝑖 = π‘Ÿπ‘Ÿ + πœ‹πœ‹ 𝑒𝑒 implies that π‘Ÿπ‘Ÿ = 𝑖𝑖, so the
real interest rate π‘Ÿπ‘Ÿ is the same as the nominal interest rate 𝑖𝑖. As discussed in Chapters 6 and
7, there are various instruments of monetary policy that can control the nominal interest
rate 𝑖𝑖. With sticky prices, monetary policy also effectively sets the real interest rate π‘Ÿπ‘Ÿ, which
selects a point on the π‘Œπ‘Œ 𝑑𝑑 curve.
In the goods market diagram, we represent the central bank’s choice of nominal interest
rate 𝑖𝑖 (and, hence, π‘Ÿπ‘Ÿ) with a line that we will label 𝑀𝑀𝑀𝑀 (money and monetary policy). The
𝑀𝑀𝑀𝑀 line is usually assumed to be flat or upward-sloping, which is to say that the central
bank either sets some particular interest rate, in which case 𝑀𝑀𝑀𝑀 is horizontal, or we think of
the central bank as systematically adjusting interest rates up and down with GDP. The
intersection between the π‘Œπ‘Œ 𝑑𝑑 (𝐼𝐼𝐼𝐼) curve and the 𝑀𝑀𝑀𝑀 line determines the real interest rate π‘Ÿπ‘Ÿ
and output π‘Œπ‘Œ as seen in Figure 8.4.
8.3 The real effects of monetary policy
What happens when the central bank changes its monetary policy according to the new
Keynesian model? Does monetary policy have real effects?
Suppose the central bank cuts the nominal interest rate 𝑖𝑖. If prices are completely sticky,
inflation remains zero and, consequently, the real interest rate falls. Supposing the stance of
monetary policy is represented by a horizontal 𝑀𝑀𝑀𝑀 line, the interest rate cut shifts the 𝑀𝑀𝑀𝑀
line downwards. The economy moves along the output demand curve as shown in Figure
8.5, with the lower real interest rate π‘Ÿπ‘Ÿ stimulating consumption and investment demand.
This works through a lower cost of borrowing for firms and a substitution effect on
households’ consumption expenditure plans. GDP π‘Œπ‘Œ rises with the resulting increase in
aggregate demand.
271
EC2065 Macroeconomics | Chapter 8: Business cycles
Figure 8.5: Cutting interest rates
In the labour market, the increase in aggregate demand for goods shifts the labour demand
𝑁𝑁 𝑑𝑑 (π‘Œπ‘Œ) curve to the right. This is because firms selling goods at a fixed price want to hire
more workers when they are able sell more output. Employment rises as a result.
When wages are flexible, the real wage 𝑀𝑀 increases because of a movement along the
upward-sloping labour supply curve 𝑁𝑁 𝑠𝑠 (π‘Ÿπ‘Ÿ), which also shifts to the left when π‘Ÿπ‘Ÿ declines. If
real wages are rigid owing to efficiency wages then higher employment and lower desired
labour supply imply a decline in unemployment. These two cases are depicted in Figure 8.6.
Box 8.1: The Volcker disinflation
The new Keynesian model predicts changes in interest rates by central banks have
real effects. Specifically, higher interest rates reduce demand and real GDP. The
most striking evidence for this is seen following substantial increase in interest
rates in USA in the early 1980s. Paul Volcker became Chairman of the US Federal
Reserve in 1979 at a time when inflation had reached double-digit levels and there
was pressure to bring inflation back down.
Figure 8.6: Labour market with interest-rate cut
272
EC2065 Macroeconomics | Chapter 8: Business cycles
As seen in Figure 8.7, the shift in the Federal Reserve’s monetary policy stance
brought about by Volcker saw interest rates rise from below 10 per cent to peak at
15 per cent in 1981. Inflation started to fall and was below 5 per cent by 1983.
While we cannot yet analyse inflation using our basic new Keynesian model with
completely sticky prices, we will study inflation with partial price adjustment in
Chapter 9. For now, we focus on the effects of real interest rates on aggregate
demand and GDP. Volcker’s tightening of monetary policy led the real interest to
rise from negative levels to more than 5 per cent by 1981. This was followed by a
sharp fall in real GDP during the 1981–82 recession.
Figure 8.7: The Volcker disinflation
8.4 Business cycles due to demand shocks
The new Keynesian model provides a framework for understanding business-cycle
fluctuations. Let us analyse how the economy would respond to an unexpected fall in
aggregate demand.
What are the possible sources of such a negative demand shock? One possibility is a decline
in consumption demand 𝐢𝐢 𝑑𝑑 or investment demand 𝐼𝐼 𝑑𝑑 owing to lower confidence about the
future. If expectations of the future are formed rationally, this would be triggered by some
specific bad news about the economy’s future fundamentals, for example, less optimistic
expectations of future TFP 𝑧𝑧′.
Another source of demand shocks is a worsening of credit-market imperfections. As
discussed in Chapter 4, borrowers would face a higher interest rate π‘Ÿπ‘Ÿπ‘™π‘™ even if there is no
change in the risk-free interest rate π‘Ÿπ‘Ÿ received by savers. Further possibilities are shifts in
preferences toward saving more for the future, or an increase in uncertainty about the
future that triggers greater saving owing to concern about future risks, which would both
reduce consumption demand 𝐢𝐢 𝑑𝑑 . Fiscal austerity, where the government reduces its
expenditure 𝐺𝐺 is another possibility.
273
EC2065 Macroeconomics | Chapter 8: Business cycles
Consider a decline in the expected value of future TFP 𝑧𝑧 ′ as a specific example of a demand
shock. This leads to lower 𝐢𝐢 𝑑𝑑 and 𝐼𝐼 𝑑𝑑 and shifts the π‘Œπ‘Œ 𝑑𝑑 curve to the left. Assuming no
automatic or discretionary monetary policy response, that is to say, a horizontal 𝑀𝑀𝑀𝑀 line in
same position, the real interest rate π‘Ÿπ‘Ÿ remains the same and GDP π‘Œπ‘Œ declines as shown in
Figure 8.8.
Figure 8.8: Effects of a negative demand shock
In the labour market, lower aggregate demand π‘Œπ‘Œ reduces labour demand, with 𝑁𝑁 𝑑𝑑 (π‘Œπ‘Œ)
shifting to the left and resulting in lower employment 𝑁𝑁. Owing to the negative wealth
effect of lower 𝑧𝑧′, the labour supply curve 𝑁𝑁 𝑠𝑠 (π‘Ÿπ‘Ÿ) shifts rightwards. With flexible wages, the
real wage 𝑀𝑀 would decline, while with a rigid efficiency wage, higher unemployment would
result. It can also be seen from the production function that the decline in output and
employment would raise average labour productivity π‘Œπ‘Œ/𝑁𝑁, which is the gradient of the ray
from the origin to the production function.
Box 8.2: Can the new Keynesian model match the business-cycle
stylised facts?
It is desirable that any theory of the business cycle is consistent with empirical
evidence on the behaviour of fluctuations of various macroeconomic variables. We
can document a set of business-cycle ‘stylised facts’ using methods described in
Section 3.1. Variables are detrended and their percentage deviations from trend
can be compared to those of real GDP. We have already looked at the behaviour of
fluctuations in consumption and investment in Chapter 3 and unemployment in
Chapter 5. Both consumption and investment are procyclical, meaning that their
deviations from trend are positively correlated with deviations of real GDP from its
trend, while unemployment is countercyclical, i.e. negatively correlated with real
GDP. Consumption is less volatile than GDP – its percentage fluctuations are smaller
than those of real GDP – while investment is more volatile than GDP.
We can also look at a broader range of macroeconomic variables. Figure 8.9 shows
detrended employment, which is procyclical, generally less volatile than
274
EC2065 Macroeconomics | Chapter 8: Business cycles
We can also look at a broader range of macroeconomic variables. Figure 8.9 shows
detrended employment, which is procyclical, generally less volatile than GDP and
slightly lagging. Figure 8.10 shows real wages, for which it is harder to discern a
clear pattern but which is overall weakly procyclical and less volatile than GDP.
Figure 8.11 displays the data for average labour productivity, which is procyclical
and less volatile than GDP. The procyclicality of average labour productivity reflects
the fact that employment typically moves by less in percentage terms than GDP.
Figure 8.9: Fluctuations in employment over the business cycle (USA)
Figure 8.10: Fluctuations in real wages over the business cycle (USA)
Fluctuations of the real interest rate are shown alongside fluctuations of GDP in
Figure 8.12. The cyclicality of the real interest rate appears to change over time,
generally being countercyclical prior to the 1990s and procyclical afterwards. Taking
an overview of the whole period covered by the data, the real interest rate is
weakly countercyclical. Finally, fluctuations of inflation are shown in Figure 8.13.
Here again the relationship with GDP fluctuations appears to have changed over
time. There is a strong countercyclical relationship in the 1970s but at other times
inflation appears procyclical. Overall, we conclude that inflation is weakly
procyclical.
275
EC2065 Macroeconomics | Chapter 8: Business cycles
Figure 8.11: Fluctuations of average labour productivity over the business cycle (USA)
Figure 8.12: Fluctuations of real interest rates over the business cycle (USA)
How do the predictions of the new Keynesian model compare to this evidence on
business-cycle fluctuations? We will consider two different types of shock studied
earlier. First, we will consider a demand shock coming from news that changes
confidence about the future, or a change in the extent of credit-market imperfections.
Second, we will look at a shift in the stance of monetary policy.
Figure 8.13: Fluctuations of inflation over the business cycle (USA)
276
EC2065 Macroeconomics | Chapter 8: Business cycles
Consider the negative demand shock considered earlier that was caused by lower
confidence about the future. A lower expected value of 𝑧𝑧′ shifts the π‘Œπ‘Œ 𝑑𝑑 curve to the
left due to declines in 𝐢𝐢 𝑑𝑑 and 𝐼𝐼 𝑑𝑑 . We assume a horizontal 𝑀𝑀𝑀𝑀 line in an unchanged
position, indicating a passive stance of monetary policy throughout.
Referring to Figure 8.8, the model predicts that GDP π‘Œπ‘Œ falls and consumption 𝐢𝐢 and
investment 𝐼𝐼 are lower. Prices and inflation do not change because all prices fixed. The
real and nominal interest rates π‘Ÿπ‘Ÿ and 𝑖𝑖 are unchanged because of the passive
monetary policy and the absence of any change in inflation. The production function
implies employment 𝑁𝑁 declines and the leftward shift of 𝑁𝑁 𝑑𝑑 and rightward shift of 𝑁𝑁 𝑠𝑠
in the labour market result in a lower real wage 𝑀𝑀. If we assumed firms are paying
efficiency wages instead then the real wage would remain constant and
unemployment would rise owing to the direction of the shifts of 𝑁𝑁 𝑑𝑑 and 𝑁𝑁 𝑠𝑠 .
Using the production function diagram, we can also see the model’s prediction for the
response of average labour productivity 𝐴𝐴𝐴𝐴𝐴𝐴 = π‘Œπ‘Œ/𝑁𝑁. Average labour productivity is
given the gradient of the ray from the origin to the relevant point on the production
function. Figure 8.14 shows that since the production function has a concave shape
and does not shift with a demand shock, a decline in employment and output raises
the gradient of this ray. Intuitively, the shape of the production function comes from
diminishing returns to labour, so a decline in employment raises labour productivity.
In summary, the new Keynesian model with demand shocks predicts that consumption
and investment are procyclical (they both move in the same direction as real GDP),
employment is procyclical and average labour productivity is countercyclical. Apart
from productivity, these predictions are in line with the business-cycle stylised facts.
With a competitive labour market, the model predicts a strongly procyclical real wage
contrary to the empirical evidence but this cyclicality of wages would be weakened by
integrating the model of efficiency wages into the analysis of the labour market.
Adding efficiency wages also allows the new Keynesian model to match the
countercyclicality of unemployment. These predictions of the model and the
corresponding stylised facts from the data are summarised in Table 8.1. Note that the
model makes similar predictions for other forms of demand shock such as a worsening
of credit-market imperfections that raises interest-rate spreads.
Figure 8.14: Prediction of countercyclical average labour productivity
277
EC2065 Macroeconomics | Chapter 8: Business cycles
As the stance of monetary policy remains completely passive following the shock by
assumption, the model predicts an acyclical real interest rate. Having an upwardsloping 𝑀𝑀𝑀𝑀 line, as discussed in Box 8.4, would result in the model predicting a
procyclical real interest rate. While the overall pattern in the data is weak
countercyclicality, there are periods where real interest rates appear procyclical, so
the model’s predictions are not too far from the empirical evidence. With completely
sticky prices, the model predicts that inflation is acyclical. We will see in Chapter 9 that
adding partial price adjustment means that inflation would be procyclical in an
economy with demand shocks, which helps to match the data.
Table 8.1: Predictions of new Keynesian model with demand shock to confidence about
the future
Variable
Model
Data
Consumption
Procyclical
Procyclical
Investment
Procyclical
Procyclical
Real interest
rate
Acyclical
(Procyclical with upwardsloping MM line)
Countercyclical (weakly)
Employment
Procyclical
Procyclical
Real wage
Procyclical
(Acyclical with efficiency wage)
Procyclical (weakly)
Unemployment
Acyclical
(Countercyclical with efficiency
wage)
Countercyclical
Average labour
productivity
Countercyclical
Procyclical
Acyclical
Procyclical
Inflation
It is also possible to consider business cycles triggered by shifts in monetary policy, for
example, the Volcker disinflation discussed in Box 8.1. Suppose the central bank shifts
the stance of monetary policy towards higher interest rates, shifting the 𝑀𝑀𝑀𝑀 line
upwards. This is the opposite of the case depicted in Figures 8.5 and 8.6 from Section
8.3.
The model predicts that consumption 𝐢𝐢 and investment 𝐼𝐼 fall, moving up the π‘Œπ‘Œ 𝑑𝑑 curve
in the goods-market diagram. Real GDP π‘Œπ‘Œ falls and the real interest rate π‘Ÿπ‘Ÿ rises. It can
be seen from the production function and labour-market diagrams that employment
𝑁𝑁 and real wages 𝑀𝑀 are lower, with 𝑁𝑁 𝑑𝑑 shifting to the left and 𝑁𝑁 𝑠𝑠 shifting to the right.
With efficiency wages, unemployment π‘ˆπ‘ˆ rises and the real wage 𝑀𝑀 is constant. Prices
and inflation do not change because all prices are fixed.
278
EC2065 Macroeconomics | Chapter 8: Business cycles
Table 8.2 summarises the predictions of the model and compares them to the
business-cycle stylised facts. Consumption and investment are procyclical, matching
the data. The real interest rate is countercyclical, which fits the overall pattern weakly
present in the data. Employment is procyclical, matching the data. Real wages are
strongly procyclical but that prediction can be tempered by efficiency wages, which
also allows the model to generate the countercyclical unemployment seen in the data.
Inflation is acyclical but partial price adjustment would change that prediction to
procyclicality. As with demand shocks, the model predicts countercyclical average
labour productivity, contrary to the empirical pattern.
In summary, the new Keynesian model is broadly consistent with most of the businesscycle stylised facts when the business cycle is caused by demand shocks, including
shifts in monetary policy. The only major failing is in accounting for the procyclicality
of productivity. One potential reconciliation with the productivity data is discussed in
Box 8.3.
Table 8.2: Predictions of the new Keynesian model with a monetary policy shock
Variable
Model
Data
Consumption
Procyclical
Procyclical
Investment
Procyclical
Procyclical
Real interest
rate
Countercyclical
Countercyclical (weakly)
Employment
Procyclical
Procyclical
Real wage
Procyclical
(Acyclical with efficiency wage)
Procyclical (weakly)
Unemployment
Acyclical
(Countercyclical with efficiency
wage)
Countercyclical
Average labour
productivity
Countercyclical
Procyclical
Acyclical
Procyclical
Inflation
279
EC2065 Macroeconomics | Chapter 8: Business cycles
Box 8.3: Labour hoarding
Empirical evidence indicates average labour productivity (𝐴𝐴𝐴𝐴𝐴𝐴 = π‘Œπ‘Œ⁄𝑁𝑁) is a
procyclical variable, moving in the same direction as real GDP π‘Œπ‘Œ. However, the new
Keynesian model with demand shocks predicts average labour productivity is
countercyclical.
The reason for the model’s prediction of countercyclical productivity is that the
neoclassical production function π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁) features a diminishing marginal
product of labour, while demand shocks do not shift the production function by
affecting current TFP 𝑧𝑧. As the total stock of capital 𝐾𝐾 changes relatively little over
business cycle, when 𝑁𝑁 and π‘Œπ‘Œ fall, the marginal product of labour 𝑀𝑀𝑃𝑃𝑁𝑁 rises. This
implies average labour productivity π‘Œπ‘Œ/𝑁𝑁 rises as employment falls.
The empirical evidence suggests a relationship in the opposite direction. However,
data on employment might not measure accurately the true labour input 𝑁𝑁 going
into the production function π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁). Data on employment is a headcount
of firms’ employees but does not capture how intensively employees are working.
There is data on total hours worked but, for many jobs, this simply measures
contractual hours of work, which does not fully measure the intensity of work. A
true measure of labour input 𝑁𝑁 would account for the intensity of work.
The claim that true labour input 𝑁𝑁 might fall substantially in a recession while firms
retain most of their staff raises the question of why firms do not lay-off workers
when fewer are needed. One argument for this ‘labour hoarding’ is that a recession
is expected to be temporary and firms want to avoid incurring the costs of hiring
again in the future recovery (the costly process of search and matching in hiring
workers was discussed in Chapter 5).
The occurrence of labour hoarding helps explain why measured average labour
productivity is procyclical. As shown in Figure 8.15, if measured labour input falls by
much less than actual labour input, the drop in output will be associated with a fall
in measured productivity (calculated using data on actual output and measured
employment). This measurement problem suggests that the observed procyclicality
of productivity may not be inconsistent with the new Keynesian model of the
business cycle with demand shocks.
280
EC2065 Macroeconomics | Chapter 8: Business cycles
Figure 8.15: Procyclical measured productivity with labour hoarding
An issue similar to labour hoarding arises when measuring total factor productivity
(TFP). This is done by calculating a ‘Solow residual’, that is, the change in GDP not
explained by changes in inputs of labour and capital. However, estimate of the
capital stock do not fully capture changes in the usage of capital by firms because
utilisation of capital might vary over time. This is analogous to the change in labour
utilisation by firms when there is labour hoarding and suggests there is bias
towards detecting procyclicality in TFP.
8.5 The natural rate of interest
Although the objective of the new Keynesian model is to understand the functioning of an
economy with sticky prices, analysing the hypothetical case of fully flexible prices even in
the short run is nonetheless useful. This helps us understand the different predictions the
model makes for the short run and the long run. It also provides guidance on how monetary
policy should be conducted.
8.5.1 Imperfect competition and the output supply curve
We have assumed the goods market is imperfectly competitive to allow for sticky prices. We
now consider how imperfectly competitive firms would set prices if they were always free to
adjust them. In doing this, we make use of a model of monopolistic competition from
microeconomics. Each firm faces a downward-sloping demand curve for its product because
goods produced by different firms are imperfect substitutes. Conditional on aggregate
demand, a firm can only sell more of its product by charging a lower price, unlike perfect
competition where firms are able to sell any amount at the prevailing market price.
Profit maximisation by imperfectly competitive firms implies they will exploit market power
to sell at a price above marginal cost. This is because firms with market power face a
downward-sloping demand curve for their product, so they can charge a higher price by
choosing to sell less.
281
EC2065 Macroeconomics | Chapter 8: Business cycles
Given a production function π‘Œπ‘Œ = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁) with a particular capital stock 𝐾𝐾 and level of TFP
𝑧𝑧, the decision to supply goods π‘Œπ‘Œ is equivalent to a decision to hire labour 𝑁𝑁. For an
imperfectly competitive firm, the effect on real revenue of hiring an extra worker is less
than the physical marginal product of labour because the price of its product relative to
other goods needs to be lowered to sell the extra output. The marginal gain in real revenue
from hiring an extra unit of labour is measured by the marginal revenue product of labour
𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 , which is below the marginal product of labour 𝑀𝑀𝑃𝑃𝑁𝑁 . The relationship between the
two is 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 = (1 − πœ–πœ– −1 )𝑀𝑀𝑃𝑃𝑁𝑁 , where πœ–πœ– is the price elasticity of the demand curve for a
firm’s product.
In choosing how much labour to hire and how much output to produce, each firm compares
the real cost of hiring a worker, the real wage 𝑀𝑀, to the marginal benefit 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 . Hence,
firms’ demand for labour is given by 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑀𝑀 instead of 𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑀𝑀 with perfect
competition. Workers being paid their marginal revenue product is equivalent to firms
pricing (πœ–πœ– − 1)−1 per cent above their marginal cost 𝑀𝑀/𝑀𝑀𝑃𝑃𝑁𝑁 of producing a unit of output.
Assuming each individual firm faces a demand curve for its product with a constant price
elasticity πœ–πœ– > 1, the marginal revenue product curve 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 is simply a scaling down of the
marginal product curve 𝑀𝑀𝑃𝑃𝑁𝑁 as shown in Figure 8.16. Labour demand thus behaves in the
same way as with perfect competition, just at a lower level, all else being equal. An output
supply curve π‘Œπ‘Œ 𝑠𝑠 can then be derived exactly as earlier in the dynamic macroeconomic model
from Section 3.12. The only difference is that π‘Œπ‘Œ 𝑠𝑠 is lower for any given π‘Ÿπ‘Ÿ because
imperfectly competitive firms restrict supply to raise profits. For comparison, what the
output supply curve would look like with perfect competition is shown labelled as π‘Œπ‘ŒοΏ½ 𝑠𝑠 in the
figure.
8.5.2 Market clearing in the absence of nominal rigidities
With flexible prices and wages in the goods and labour markets, the intersection of the
demand and supply curves determines equilibrium in all markets. As shown in Figure 8.16,
there is a market-clearing real interest rate π‘Ÿπ‘Ÿ ∗ . This interest rate π‘Ÿπ‘Ÿ ∗ is known as the ‘natural
rate of interest’. It is the hypothetical real interest rate that would prevail if there were no
nominal rigidities in the economy. The ‘natural’ terminology is also applied to other
variables. The ‘natural level of output’ is the market-clearing level of real GDP π‘Œπ‘Œ ∗ in the
absence of any nominal rigidities. By incorporating efficiency wages into the analysis of the
labour market, there would be a ‘natural rate of unemployment’, i.e. the unemployment
rate occurring with no nominal rigidity (note that efficiency wages are not a nominal rigidity
– they explain why firms do not want to adjust real wages).
282
EC2065 Macroeconomics | Chapter 8: Business cycles
Figure 8.16: Labour and goods markets with flexible prices
8.5.3 The long run and the short run
We now return to the actual assumption made in the new Keynesian model that goods
prices are sticky. The model implies we should think differently about how real GDP is
determined in the short run and the long run.
We take ‘long run’ to mean a situation where current market conditions have been correctly
foreseen and are not expected to change. Even if firms have sticky prices 𝑃𝑃�, all prices are set
appropriately for the current state of the economy. In this case, the new Keynesian model
predicts the real interest rate and output coincide with their ‘natural’ levels. Moreover, as
long as prices do not remain sticky forever, even if shocks do occur, the new Keynesian
model predicts all variables will tend to their respective natural levels in the long run absent
any further changes or shocks to the economy.
The ’short run’ is the time horizon in which market conditions can deviate from what was
expected when prices were originally set in the past. Shocks result in the economy
fluctuating around its natural level of output. Note that it is possible to have GDP above or
below its natural level π‘Œπ‘Œ ∗ . As long as π‘Œπ‘Œ and π‘Ÿπ‘Ÿ lie to the left of the hypothetical perfectcompetition output supply curve π‘Œπ‘ŒοΏ½ 𝑠𝑠 (which is true for π‘Ÿπ‘Ÿ ∗ and π‘Œπ‘Œ ∗ ) then 𝑀𝑀 < 𝑀𝑀𝑃𝑃𝑁𝑁 holds and
firms would willingly sell more if given the chance – and must sell less if demand falls.
When the economy experiences a shock and the actual level of real GDP π‘Œπ‘Œ deviates from its
natural level π‘Œπ‘Œ ∗ , we say there is an ‘output gap’ between π‘Œπ‘Œ and π‘Œπ‘Œ ∗ . We will see that there is
a case for the central bank or government to use demand-management policies to try to
close the output gap, moving GDP π‘Œπ‘Œ towards π‘Œπ‘Œ ∗.
8.6 Optimal stabilisation policy
With sticky prices and imperfect competition, the equilibrium of the economy is not
efficient. Following a shock to the economy, it is possible to obtain a better outcome for
households through a macroeconomic policy intervention. This is an improvement on
waiting for prices to adjust.
When the economy has a representative household, efficiency can be judged easily by
comparing the marginal product of labour 𝑀𝑀𝑃𝑃𝑁𝑁 to households’ marginal rate of substitution
283
EC2065 Macroeconomics | Chapter 8: Business cycles
between leisure and consumption 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 . The 𝑀𝑀𝑃𝑃𝑁𝑁 is what can be produced if people were
able to work more and the 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 is what value (in terms of goods) people put on their
time. The economy has inefficiently low employment and production if 𝑀𝑀𝑃𝑃𝑁𝑁 > 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢
because the value of people’s time is less than the value they put on the extra goods that
can be produced and consumed if there were more economic activity.
How efficiently the economy is operating can be judged from the goods market diagram by
comparing the outcome for π‘Œπ‘Œ and π‘Ÿπ‘Ÿ to the hypothetical perfectly competitive output supply
curve π‘Œπ‘ŒοΏ½ 𝑠𝑠 , on which 𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 . All points to the left of π‘Œπ‘ŒοΏ½ 𝑠𝑠 have 𝑀𝑀𝑃𝑃𝑁𝑁 > 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 ,
meaning that output π‘Œπ‘Œ is too low. Inefficiency is thus measured by how far the economy is
to the left of the π‘Œπ‘ŒοΏ½ 𝑠𝑠 curve.
Why is the market equilibrium in the new Keynesian model generally not efficient? To
simplify the analysis, we ignore some other reasons for inefficiency we have studied
elsewhere that are not central to the new Keynesian model. First, wages are flexible, so the
labour-market equilibrium is always on the labour supply curve. This ignores the persistent
unemployment that results from firms’ incentives to pay ‘efficiency wages’ as seen in
Section 5.2. Second, we ignore the implications for labour supply of needing to use money
as a medium of exchange that were studied in Section 6.5. This neglects any inefficiencies
resulting from a failure of monetary policy to follow the ‘Friedman rule’ as discussed in
Section 6.10. The consequence of these simplifications is that real wages 𝑀𝑀 are always equal
to the marginal rate of substitution 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 between leisure and consumption.
There are two distinct reasons why output is inefficiently low in the New Keynesian model.
First, the natural level of output π‘Œπ‘Œ ∗ is already too low because imperfect competition gives
firms an incentive to reduce production. Even without nominal rigidity, imperfect
competition would result in 𝑀𝑀 = 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 < 𝑀𝑀𝑃𝑃𝑁𝑁 at π‘Œπ‘Œ = π‘Œπ‘Œ ∗ . Second, when prices are sticky, a
negative demand shock pushes GDP π‘Œπ‘Œ below π‘Œπ‘Œ ∗ and, as diminishing returns to labour then
implies 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 and 𝑀𝑀𝑃𝑃𝑁𝑁 rise while 𝑀𝑀 = 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 falls, we have 𝑀𝑀 < 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 and the economy
is even further away from what is efficient.
How can economic policy achieve a better outcome? Let us consider the example studied in
Section 8.4 of a negative demand shock due to a decline in expected future TFP 𝑧𝑧′. As shown
in Figure 8.17, the π‘Œπ‘Œ 𝑑𝑑 curve shifts to the left and output π‘Œπ‘Œ falls below π‘Œπ‘Œ ∗ if the 𝑀𝑀𝑀𝑀 line
remains in its original position. Now, instead of leaving monetary policy unchanged, the
central bank lowers the nominal and real interest rates 𝑖𝑖 and π‘Ÿπ‘Ÿ. Reducing π‘Ÿπ‘Ÿ moves the
economy along the π‘Œπ‘Œ 𝑑𝑑 curve, raising GDP π‘Œπ‘Œ. Since 𝑀𝑀𝑃𝑃𝑁𝑁 > 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 , the representative
household gains from this policy intervention. Fiscal policy could also be used to provide a
stimulus to demand. An increase in public expenditure 𝐺𝐺 would raise π‘Œπ‘Œ, although its effects
are not exactly equivalent because the composition of aggregate demand 𝐢𝐢 + 𝐼𝐼 + 𝐺𝐺 would
be different compared to the case where monetary policy is used.
What is the best monetary policy to follow? To close the output gap between π‘Œπ‘Œ and π‘Œπ‘Œ ∗
exactly, the central bank should set the nominal interest rate 𝑖𝑖 (and, hence, π‘Ÿπ‘Ÿ) equal to the
natural rate of interest π‘Ÿπ‘Ÿ ∗ . This moves the 𝑀𝑀𝑀𝑀 line to where it intersects the π‘Œπ‘Œ 𝑑𝑑 curve in the
same place as the output supply curve π‘Œπ‘Œ 𝑠𝑠 as shown in Figure 8.17. Such a monetary policy
284
EC2065 Macroeconomics | Chapter 8: Business cycles
achieves the same economic outcome as if prices were flexible. The policy intervention thus
neutralises the negative consequences of slow price adjustment following a shock. The
central bank adjusting interest rates compensates for the slow pace of price changes.
Figure 8.17: Optimal stabilization policy
To implement this optimal monetary policy, the central bank needs to know the natural rate
of interest π‘Ÿπ‘Ÿ ∗ . This is a practical problem because π‘Ÿπ‘Ÿ ∗ is not directly observable and needs to
be estimated. A more fundamental challenge is that it must be feasible to reduce the
nominal interest rate 𝑖𝑖 if π‘Ÿπ‘Ÿ ∗ falls. As explained in Section 6.13, nominal interest rates are
subject to a lower bound, so the required interest rate cut might not be possible if the lower
bound on 𝑖𝑖 is reached. Chapter 9 discusses alternative policies that could be used if the
lower bound is binding.
The stabilisation policy described here aims to close the output gap between actual real
GDP π‘Œπ‘Œ and its natural level π‘Œπ‘Œ ∗ . But that does not mean the policy should aim for GDP to be
stable if π‘Œπ‘Œ ∗ itself varies over time. Furthermore, it might be wondered why the policy
intervention should stop when π‘Œπ‘Œ reaches π‘Œπ‘Œ ∗ . That addresses only one of the two sources of
inefficiency in the new Keynesian model – remember output at π‘Œπ‘Œ ∗ is still inefficiently low.
Should the central bank continue to push output above π‘Œπ‘Œ ∗ to where π‘Œπ‘Œ 𝑑𝑑 intersects π‘Œπ‘ŒοΏ½ 𝑠𝑠 and
𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 hold? While at first glance this appears desirable, we will argue in Chapter 9
that such a policy would be unsustainable and give rise to negative side effects.
Box 8.4: Modelling monetary policy using Taylor rules and LM
curves
In the new Keynesian model so far, we have represented monetary policy using a
horizontal 𝑀𝑀𝑀𝑀 line. This shifts vertically if the central bank changes the nominal
and real interest rate. But the shape of the 𝑀𝑀𝑀𝑀 line is not an inherent feature of
the new Keynesian model, it depends on the most appropriate way to describe the
conduct of monetary policy.
285
EC2065 Macroeconomics | Chapter 8: Business cycles
More generally, we consider the 𝑀𝑀𝑀𝑀 (‘money and monetary policy’) line to be a
representation of how the money market and the monetary policy regime imply a
link between interest rates and GDP. We can draw 𝑀𝑀𝑀𝑀 lines for monetary policies
that target the money supply, or an interest rate feedback rule such as a Taylor
rule.
8.6.1 Money supply targets, the LM curve and the IS-LM model
Suppose the central bank’s monetary policy is a target for the money supply 𝑀𝑀 𝑠𝑠 . Assume
the money-supply target 𝑀𝑀 𝑠𝑠 = 𝑀𝑀∗ is exogenous. For this monetary policy regime, the
interest rate 𝑖𝑖 is endogenous. It is still the case that real and nominal interest rates are same
because prices are sticky, hence, π‘Ÿπ‘Ÿ = 𝑖𝑖. With a target for the money supply, the interest
rates 𝑖𝑖 and π‘Ÿπ‘Ÿ and GDP π‘Œπ‘Œ are determined jointly in the goods and money markets. The
equivalent of the 𝑀𝑀𝑀𝑀 line in this case represents money-market equilibrium and it is often
labelled as the ‘𝐿𝐿𝐿𝐿’ curve for this particular monetary policy.
The 𝑀𝑀𝑀𝑀 line/𝐿𝐿𝐿𝐿 curve for a money-supply target is upward sloping, as shown in the goods
market diagram in Figure 8.18. The 𝐿𝐿𝐿𝐿 curve is upward sloping because higher output
increases the real demand for money for transactions, as shown in the left panel of the
figure representing the money market. With a fixed nominal money supply 𝑀𝑀∗ and a sticky
price level 𝑃𝑃�, there is an inelastic supply of real money balances 𝑀𝑀∗ /𝑃𝑃� and the nominal
interest rate 𝑖𝑖 rises to restore equilibrium in the money market. Money-market equilibrium
thus requires higher real interest rates π‘Ÿπ‘Ÿ when real GDP π‘Œπ‘Œ is higher, explaining the upwardsloping 𝐿𝐿𝐿𝐿 curve.
Figure 8.18: The LM curve with a money supply target
Combining the 𝐿𝐿𝐿𝐿 curve (𝑀𝑀𝑀𝑀 line) with the output demand curve π‘Œπ‘Œ 𝑑𝑑 in the goods market
leads to the IS-LM model, which is a special case of our new Keynesian model. What is called
the 𝐼𝐼𝐼𝐼 curve in that model is simply another label for what we call output demand π‘Œπ‘Œ 𝑑𝑑 .
286
EC2065 Macroeconomics | Chapter 8: Business cycles
A change in the money-supply target causes the 𝐿𝐿𝐿𝐿 curve to shift, which has real effects on
the economy. Increasing the money supply 𝑀𝑀∗ implies the supply of real money balances
𝑀𝑀∗ /𝑃𝑃� is larger (the price level remaining constant at 𝑃𝑃�). Given the real demand for money
at a particular level of real GDP π‘Œπ‘Œ, the intersection of money supply and demand occurs at a
lower nominal interest rate 𝑖𝑖 and, hence, also π‘Ÿπ‘Ÿ. Since the 𝐿𝐿𝐿𝐿 curve represents
combinations of π‘Œπ‘Œ and π‘Ÿπ‘Ÿ where the money market is in equilibrium, the 𝐿𝐿𝐿𝐿 curve must shift
downwards and real GDP increases.
8.6.2 The Taylor rule
Another example of a monetary policy is to have the central bank adjust the nominal
interest rate 𝑖𝑖 systematically in response to inflation πœ‹πœ‹ and output π‘Œπ‘Œ, for example, by
following a Taylor rule. We have seen an example of a Taylor rule in Section 6.12 but that
focused only on the response of 𝑖𝑖 to inflation πœ‹πœ‹. In the basic new Keynesian model, prices
are completely fixed, so there is no inflation and response of 𝑖𝑖 to πœ‹πœ‹ not relevant here.
In response to changes in real GDP, the Taylor rule calls for a higher interest rate 𝑖𝑖 in a boom
and a lower 𝑖𝑖 in a recession. With real and nominal interest rates being equal, π‘Ÿπ‘Ÿ = 𝑖𝑖, the
positive response of 𝑖𝑖 to π‘Œπ‘Œ can be represented by an upward-sloping 𝑀𝑀𝑀𝑀 line as depicted in
Figure 8.19.
Figure 8.19: Using a Taylor rule
Why should the central bank want to choose interest rates that are positively related to π‘Œπ‘Œ?
One argument is that this helps to stabilise an economy that is hit by demand shocks,
avoiding large output gaps between actual GDP π‘Œπ‘Œ and the natural level of output π‘Œπ‘Œ ∗ .
Furthermore, if π‘Œπ‘Œ ∗ is known or estimated, the interest-rate rule can be refined to react to
the gap π‘Œπ‘Œ − π‘Œπ‘Œ ∗ , or an estimate of this output gap.
We know from Section 8.6 that the optimal monetary policy is for the central bank to set
𝑖𝑖 = π‘Ÿπ‘Ÿ ∗ , where π‘Ÿπ‘Ÿ ∗ is the natural rate of interest. However, the central bank may not have
perfect information about π‘Ÿπ‘Ÿ ∗ . In that case, Figure 8.20 shows having a positive response of 𝑖𝑖
to π‘Œπ‘Œ can yield a better outcome for the economy than having monetary policy keep 𝑖𝑖
constant.
287
EC2065 Macroeconomics | Chapter 8: Business cycles
Figure 8.20: Demand shocks when a Taylor rule is used
8.7 Real business cycle theory
In the new Keynesian model, the business cycle is the economy’s inefficient response to
shocks, usually demand shocks, owing to the failure of prices to adjust. This justifies policy
interventions to temper the business cycle. An alternative approach argues that business
cycles are simply the economy’s efficient response to variations in its ability to produce
goods due to supply shocks. Policy intervention is futile or counterproductive in this view.
This way of understanding fluctuations in the economy is known as real business cycle (RBC)
theory.
How does such a theory of the business cycle work? The RBC model is essentially just the
dynamic macroeconomic model developed earlier in Chapter 3. The core of the model
features flexible prices and perfectly competitive markets studied in general equilibrium as
we did in Section 3.12.
For completeness, we add a money market alongside the labour and goods markets studied
in the dynamic macroeconomic model. Money demand and supply come from our analysis
in Chapter 6 and we assume the central bank chooses exogenous path of the money supply.
This emphasises the medium of exchange function of money but has no special role for the
unit of account function owing to nominal rigidities unlike the earlier new Keynesian model.
For simplicity, we ignore the effect of money’s medium of exchange function on the labour
supply curve, or we assume the central bank is following the Friedman rule. This means that
the labour supply curve 𝑁𝑁 𝑠𝑠 (π‘Ÿπ‘Ÿ) derives from the household optimality conditions 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 =
𝑀𝑀 and 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝑙𝑙′ = (1 + π‘Ÿπ‘Ÿ)𝑀𝑀/𝑀𝑀′ discussed in Sections 1.4 and 3.10.
The RBC approach to understanding business cycles looks at how the equilibrium of the
economy in the goods, labour and money markets shown in Figure 8.21 is affected by supply
shocks.
288
EC2065 Macroeconomics | Chapter 8: Business cycles
8.8 Business cycles due to supply shocks
RBC theory identifies exogenous shocks to total factor productivity (TFP) as the source of
the business cycle: supply shocks (or ‘technology’ shocks). In the study of economic growth,
technological progress was seen as driving permanent increases in TFP. However, in RBC
theory, supply shocks are increases or decreases in TFP that are eventually reversed.
Figure 8.21: The RBC model
8.8.1 Supply shocks
Supply shocks in the RBC model are usually assumed to be deviations from the trend in TFP
growth that are expected to persist beyond current time period to some extent but which
are not permanent. The empirical counterpart to TFP is the Solow residual, representing
changes in the level of real GDP that cannot be explained by changes in inputs of capital and
labour. The deviations from trend of the Solow residual in the USA are shown in Figure 8.22
alongside the deviations from trend of real GDP. We see that movements in the Solow
residual display a clear positive correlation with real GDP, indicating the Solow residual is
procyclical. While it is less volatile than GDP, we do see transitory fluctuations that could be
a cause of business cycles. As discussed in Box 8.3, it is possible some of this procyclicality
could be the result of measurement error in accounting for factor inputs.
Figure 8.22: Fluctuations of the Solow residual (USA)
289
EC2065 Macroeconomics | Chapter 8: Business cycles
Assuming the evidence from the Solow residual correctly indicates that transitory supply
shocks are hitting the economy, what might such shocks represent? One possibility is an
uneven pace of technological progress, where TFP might rise by more than usual in some
years, or by less in others. However, the Solow residual often falls sufficiently far below its
trend that the implied level of TFP actually declines in absolute terms. That is hard to
understand if TFP is representing technology because we would not expect that to go
backwards.
However, there are a number of other possible sources of transitory changes in TFP:
•
•
•
•
Fluctuations in energy costs, which affect overall production costs
Supply disruptions, for example due to natural disasters, wars, pandemics
Changes in regulations that affect firms’ productivity
Weather (in an agricultural economy).
8.8.2 The predictions of the RBC model
Representing a supply shock as a transitory change in TFP 𝑧𝑧, the effects of a negative shock
to 𝑧𝑧 are shown in Figure 8.23. The production function moves down, shifting the π‘Œπ‘Œ 𝑠𝑠 curve to
the left. Lower 𝑧𝑧 implies lower 𝑀𝑀𝑃𝑃𝑁𝑁 , which shifts the 𝑁𝑁 𝑑𝑑 curve to the left and results in the
π‘Œπ‘Œ 𝑠𝑠 curve moving further to the left.
To the extent that the shock persists for some time, expectations of future TFP 𝑧𝑧′ decline,
which implies lower 𝑀𝑀𝑃𝑃𝐾𝐾′ and shifts the 𝐼𝐼 𝑑𝑑 and π‘Œπ‘Œ 𝑑𝑑 curves to the left. There is lower 𝐢𝐢 𝑑𝑑 and
higher 𝑁𝑁 𝑠𝑠 owing to the negative wealth effect of lower 𝑧𝑧 (and 𝑧𝑧′), although there is
consumption smoothing because the shock is not permanent. These wealth effects imply a
leftward shift of π‘Œπ‘Œ 𝑑𝑑 and a rightward shift of π‘Œπ‘Œ 𝑠𝑠 . The wealth effect on labour supply is smaller
than the impact of 𝑧𝑧 on π‘Œπ‘Œ 𝑠𝑠 both directly and through 𝑀𝑀𝑃𝑃𝑁𝑁 . Hence, the overall effects are
that the π‘Œπ‘Œ 𝑑𝑑 and π‘Œπ‘Œ 𝑠𝑠 curves shift to the left, so real GDP π‘Œπ‘Œ falls.
Figure 8.23: A negative supply shock in the RBC model
In the goods market, the real interest rate π‘Ÿπ‘Ÿ rises if π‘Œπ‘Œ 𝑠𝑠 shifts more than π‘Œπ‘Œ 𝑑𝑑 . The π‘Œπ‘Œ 𝑑𝑑 shift
becomes larger if the drop in TFP lasts longer because that leads to a greater impact on
consumption and investment demand (less consumption smoothing and a greater impact on
the expected future marginal product of capital). The effect on employment 𝑁𝑁 in the labour
290
EC2065 Macroeconomics | Chapter 8: Business cycles
market is ambiguous because 𝑁𝑁 𝑑𝑑 falls but 𝑁𝑁 𝑠𝑠 can rise. The wealth effect on 𝑁𝑁 𝑠𝑠 is smaller
when TFP is expected to recover quickly. If 𝑁𝑁 𝑑𝑑 shifts to the left and 𝑁𝑁 𝑠𝑠 shifts to the right,
the real wage 𝑀𝑀 increases. A smaller wealth effect on 𝑁𝑁 𝑠𝑠 means that 𝑀𝑀 rises by less. Finally,
in the money market, 𝑀𝑀𝑑𝑑 falls with lower π‘Œπ‘Œ, which leads to a higher price level 𝑃𝑃.
8.8.3 Stabilisation policy?
When a recession occurs because of lower TFP 𝑧𝑧, this clearly makes households worse off.
While the recession is bad, in the RBC model, it does not follow that the government should
intervene. The model predicts that policy intervention, even if it succeeds in raising GDP,
makes households worse off. First, with flexible prices, there is limited scope to raise GDP
with monetary policy – it is not possible to improve on following the Friedman rule. Second,
while increasing public expenditure 𝐺𝐺 raises GDP π‘Œπ‘Œ as we saw in Box 4.2, this is inefficient
because it leads people to work more when productivity is low.
Box 8.5: Sources of supply shocks in the RBC model
Considering a transitory positive supply shock, we have seen that the RBC model
makes the following predictions:
•
•
•
•
•
•
GDP π‘Œπ‘Œ rises as the π‘Œπ‘Œ 𝑑𝑑 and π‘Œπ‘Œ 𝑠𝑠 curves shift to the right
The real interest rate π‘Ÿπ‘Ÿ falls as π‘Œπ‘Œ 𝑠𝑠 shifts by more than π‘Œπ‘Œ 𝑑𝑑
Consumption 𝐢𝐢 rises because of the wealth effect and lower π‘Ÿπ‘Ÿ
Investment 𝐼𝐼 rises because of higher expectations of 𝑀𝑀𝑃𝑃𝐾𝐾′ and lower π‘Ÿπ‘Ÿ
Real wage 𝑀𝑀 rises as 𝑁𝑁 𝑑𝑑 shifts to the right and 𝑁𝑁 𝑠𝑠 shifts to the left
Employment 𝑁𝑁 rises if 𝑁𝑁 𝑑𝑑 shifts more than 𝑁𝑁 𝑠𝑠 , which occurs if the wealth
effect on 𝑁𝑁 𝑠𝑠 is weak, as would be the case for a transitory shock.
Finally, Figure 8.24 shows that average labour productivity π‘Œπ‘Œ/𝑁𝑁 can rise with GDP
because 𝑧𝑧 increases. This is in contrast to the prediction for (correctly measured)
average labour productivity in a model with demand shocks and a neoclassical
production function.
Figure 8.24: Average labour productivity in the RBC model
291
EC2065 Macroeconomics | Chapter 8: Business cycles
The RBC model generates business-cycle fluctuations due to occurrence of
transitory supply shocks. Its predictions are summarised in Table 8.3 alongside the
empirical evidence on the business-cycle stylised facts. We see that the fluctuations
of macroeconomic variables implied by the model are consistent with businesscycle stylised facts.
Table 8.3: Predictions of the RBC model with transitory supply shocks
Variable
Model
Data
Consumption
Procyclical
Procyclical
Investment
Procyclical
Procyclical
Real interest
rate
Countercyclical
Countercyclical (weakly)
Employment
Procyclical
Procyclical
Real wage
Procyclical
Countercyclical (weakly)
Average labour
productivity
Procyclical
Procyclical (weakly)
In spite of this success, the source of transitory supply shocks is not obvious in
many business-cycle episodes. We could consider instead supply shocks with
permanent effects that are more easily interpreted as being due to the uneven
pace of technological progress. However, it is much harder to make the RBC model
consistent with the business-cycle stylised facts when supply shocks are highly
persistent.
A permanent productivity shock changes current and future TFPs 𝑧𝑧 and 𝑧𝑧′ by a
similar amount. Considering the case of permanent positive shock, the RBC model
predicts the following in comparison to a transitory shock:
•
•
•
Wealth effects are larger, hence, 𝑁𝑁 𝑠𝑠 shifts further to the left, and 𝐢𝐢 𝑑𝑑 rises
by more, shifting π‘Œπ‘Œ 𝑑𝑑 further to the right
There is a greater incentive for firms to increase investment, and the larger
increase in 𝐼𝐼 𝑑𝑑 means π‘Œπ‘Œ 𝑑𝑑 shifts further to the right
The effect on 𝑁𝑁 𝑑𝑑 is same for a permanent and transitory shock to 𝑧𝑧.
Since π‘Œπ‘Œ 𝑑𝑑 and π‘Œπ‘Œ 𝑠𝑠 shift in the same direction, the RBC model can still generate
fluctuations in GDP π‘Œπ‘Œ with permanent shocks to TFP. However, as Figure 8.25
shows, the model will struggle to generate predictions consistent with the stylised
facts. The main problem is that the stronger wealth effect on labour supply 𝑁𝑁 𝑠𝑠
means that employment 𝑁𝑁 might fall when π‘Œπ‘Œ rises. Employment then becomes
countercyclical, which is clearly contrary to the data. Furthermore, the stronger
shift of 𝑁𝑁 𝑠𝑠 in the opposite direction to 𝑁𝑁 𝑑𝑑 implies much larger fluctuations in the
292
EC2065 Macroeconomics | Chapter 8: Business cycles
real wage, which now becomes very procyclical, contrary to the data. These
predictions are consistent with very long-run trends where hours worked have
fallen even though productivity and wages have risen permanently.
8.9 Coordination failure model
An alternative approach to understanding business cycles emphasises what is called
‘coordination failure’. Business cycles are caused by self-fulfilling changes in beliefs – even
though everyone is rational – because the economy has multiple equilibria. This means that
the requirements for agents to make decisions rationally and for markets to clear do not pin
down unique levels of GDP, employment, wages, and interest rates. Business cycles result
from shifts between optimism and pessimism, even if economy’s fundamentals remain
unchanged. Although an equilibrium with high GDP is preferred by everyone, there is a
difficulty of coordinating expectations on this best outcome.
Our earlier dynamic macroeconomic model has a unique equilibrium, so there are no
business cycles without the occurrence of exogenous shocks. This is true for both new
Keynesian and RBC models of the economy. In the coordination failure model, all the
assumptions of the dynamic macroeconomic model are maintained, except that the
(aggregate) production function has increasing returns to scale. This leads to strategic
complementarities in firms’ employment decisions and implies there can be multiple
equilibria.
Figure 8.25: A permanent productivity shock in the RBC model
8.9.1 Labour productivity spillover across firms
At the level of an individual firm, the production function π‘Œπ‘Œπ‘–π‘– = 𝑧𝑧(𝑁𝑁)𝐹𝐹(𝐾𝐾𝑖𝑖 , 𝑁𝑁𝑖𝑖 ) is assumed to
have the neoclassical properties, in particular, a diminishing marginal product of labour 𝑁𝑁𝑖𝑖 .
Firms are competitive, so their labour demand curve is given by marginal product of labour.
The new feature of the model is that each firm’s TFP level 𝑧𝑧(𝑁𝑁) is positively related to
aggregate employment 𝑁𝑁 but each firm takes 𝑁𝑁 as given when choosing its own 𝑁𝑁𝑖𝑖 . Each
firm benefits from higher employment and output at other firms but this effect is not
internalised. This ‘spillover’ or externality is a source of market failure.
293
EC2065 Macroeconomics | Chapter 8: Business cycles
Why would such positive labour productivity spillover occur? Advocates of coordinationfailure models point to several possibilities. First, there could be complementarities in
employment across firms, for example, if more people are employed writing computer
software, this makes computer hardware more useful, so workers producing the hardware
are now making something of greater value. Another example is that delivery and logistics
are more efficient when such companies have more orders to fulfil.
A second possibility is what are known as ‘thick market’ effects. This is where an increased
amount of activity in a market provides some direct benefit to market participants. For
example, it is harder to attract customers to a shopping mall when not many other shops in
the mall are open. The coordination failure model explores the implications of such thickmarket effects and complementarities. Of course, a priori, it is also possible to envisage
negative spillovers across firms, so the validity of the assumption is debatable.
The spillover effect has important consequences for firms’ demand for labour. If an
individual firm increases its employment 𝑁𝑁𝑖𝑖 , then its marginal product of labour declines, so
the firm-level labour demand curve is downward-sloping as usual. However, if all firms are
increasing employment 𝑁𝑁𝑖𝑖 together, then aggregate employment 𝑁𝑁 rises and 𝑧𝑧(𝑁𝑁)
increases. The rise in 𝑧𝑧(𝑁𝑁) boosts the marginal product of labour in each firm, offsetting the
decline due to higher 𝑁𝑁𝑖𝑖 . This spillover effect might outweigh the declining firm-level
marginal product of labour, so marginal product of labour increases with aggregate
employment 𝑁𝑁. In that case, the aggregate-level labour demand curve becomes upwardsloping. The firm-level and aggregate-level labour demand curves are plotted in Figure 8.26.
Figure 8.26: Aggregate labour demand in the coordination failure model
In what follows, we assume the positive spillover effect from firms’ employment is
sufficiently strong to make the aggregate labour demand curve upward sloping. Weaker
spillovers would mean 𝑁𝑁 𝑑𝑑 remains downward sloping, only becoming flatter. A strong
enough spillover to make 𝑁𝑁 𝑑𝑑 upward sloping implies increasing returns to labour at the
aggregate level, and results in an aggregate production function with a convex shape as
shown in Figure 8.27. Note that we assume a spillover effect – an externality – rather than
assume firms directly have an increasing-returns production function to maintain the
294
EC2065 Macroeconomics | Chapter 8: Business cycles
framework of perfect competition. Increasing returns at the firm level requires a model with
imperfectly competitive firms, similar to that used earlier in the new Keynesian model with
sticky prices.
8.9.2 Implications for the output supply curve
The coordination failure model not only needs an aggregate labour demand curve that is
upward sloping. In addition, 𝑁𝑁 𝑑𝑑 must be steeper than labour supply 𝑁𝑁 𝑠𝑠 (π‘Ÿπ‘Ÿ), as depicted in
Figure 8.28. In the dynamic macroeconomic model studied in Section 3.12, the supply of
output by firms is derived from the equilibrium level of employment in the labour market. In
that model, the output supply curve π‘Œπ‘Œ 𝑠𝑠 is upward sloping, meaning that the supply of goods
is positively related to the real interest rate π‘Ÿπ‘Ÿ. Intuitively, a higher real interest rate is
needed to induce more supply by increasing the desire to save through earning more by
supplying more labour.
Figure 8.27: Aggregate production function in the coordination failure model
The relationship between π‘Ÿπ‘Ÿ and the supply of goods can be reversed with a strong enough
spillover effect in the coordination failure model. High output and high employment
generate a strong productivity-boosting spillover, which means firms are willing to choose
high employment even when a low interest rate π‘Ÿπ‘Ÿ reduces workers’ desire to save by
earning more. As shown in Figure 8.29, the coordination failure model features a
downward-sloping output supply curve π‘Œπ‘Œ 𝑠𝑠 . This profoundly changes the predictions of the
model compared to the standard dynamic macroeconomic model.
295
EC2065 Macroeconomics | Chapter 8: Business cycles
Figure 8.28: Labour market equilibrium in the coordination failure model
Figure 8.29: The output supply curve in the coordination failure model
8.10 Multiple equilibria and business cycles
We now investigate how business cycles can occur in the coordination failure model, even
when there are no exogenous shocks to the economy’s fundamentals. The key feature of
the model is that its output supply curve is downward sloping.
Goods-market equilibrium occurs where the π‘Œπ‘Œ 𝑠𝑠 curve intersects the usual downwardsloping output demand curve π‘Œπ‘Œ 𝑑𝑑 . The derivation of the output demand curve from Section
3.12 is unchanged here because the spillover effect in the coordination failure model works
through the supply side of the economy.
With output demand and supply curves both being downward sloping, there may not be a
unique equilibrium in the goods market because π‘Œπ‘Œ 𝑑𝑑 and π‘Œπ‘Œ 𝑠𝑠 can cross more than once. We
will focus on a case where π‘Œπ‘Œ 𝑑𝑑 and π‘Œπ‘Œ 𝑠𝑠 intersect twice, though this is only one of many
possibilities. This case is illustrated in Figure 8.30.
When there are multiple intersections of output demand and supply curves, the economy
has multiple equilibria. Each equilibrium is consistent with utility maximisation by
households and profit maximisation by firms, and market clearing (in both goods and labour
296
EC2065 Macroeconomics | Chapter 8: Business cycles
markets because every point on the output supply curve represents a point of labourmarket equilibrium given the way the π‘Œπ‘Œ 𝑠𝑠 curve is constructed). Without more assumptions,
it is not possible to say which equilibrium will be the outcome. Usually, we derive the
predictions of a model from looking at what happens in equilibrium but that is not sufficient
when there are multiple equilibria.
Figure 8.30: Two equilibria in the coordination failure model
What explains why the coordination failure model has multiple equilibria? This feature is
due to the presence of what is known as ‘strategic complementarity’. Strategic
complementarity refers to a situation where one person’s desire to perform an action
increases when others are also performing the same action. In the model, an individual
firm’s marginal product of labour rises when other firms are expanding employment and,
hence, it becomes profitable for firms to expand employment when other firms are doing
so. This creates strategic complementarity in firms’ employment decisions and sufficiently
strong strategic complementarity leads to multiple equilibria because different choices
made by individuals are mutually reinforcing.
While both equilibria in Figure 8.30 are fully consistent with rational optimisation by
individual households and firms, nonetheless, households are generally not indifferent
between them. The high-output equilibrium is good. It features high consumption, and
wages and productivity are high. Leisure is low but that choice makes sense because
productivity is high. The low-output equilibrium is bad. Consumption is low, and wages and
productivity are low, only mitigated by high leisure but that is chosen because productivity
is low. The inefficiency of the low-output equilibrium is due to the productivity spillover
effect that individuals fail to internalise.
Although everyone prefers the equilibrium with high output, there can be a coordination
problem in reaching it. At a low level of output, it makes sense for all firms in the economy
collectively to switch to the high-output equilibrium. But at the low-output equilibrium, it is
individually rational for each firm to choose low output. Therefore, the economy could get
stuck at the equilibrium with low GDP for some time, even though everyone would gain by
297
EC2065 Macroeconomics | Chapter 8: Business cycles
coordinating on the high-GDP equilibrium. A recession can thus result from a coordination
failure.
What ultimately determines which of the two equilibria prevails? If everyone believes the
high-output equilibrium will prevail, then it will, while if everyone believes the low-output
equilibrium will prevail, then it will instead. Thus, the outcome depends on whether people
are optimistic or pessimistic about the economy’s prospects. However, both of these are
consistent with rational expectations. There is an independent role for beliefs, which are not
uniquely determined by requirement of rationality.
If the optimism or pessimism of households and firms determines which equilibrium
prevails, what explains how optimistic people should be? Again, the model provides no
direct answer. In principle, any extraneous factor could shift the economy from pessimism
to optimism and thus cause business cycles. Such extraneous factors unrelated to the
economy’s fundamentals are referred to by the term ‘sunspots’. For example, there could
be a media report on an event that triggers a wave of pessimism far beyond the importance
of the event to the economy’s fundamentals.
Business cycles in the coordination failure model can thus occur due to exogenous shifts in
optimism and pessimism, which are consistent with rational expectations because of
multiple equilibria. These induce movements of the economy between the low- and highoutput equilibria shown in Figure 8.30. If the economy goes into a recession then the real
interest rate π‘Ÿπ‘Ÿ rises as π‘Œπ‘Œ falls, moving along both π‘Œπ‘Œ 𝑑𝑑 and π‘Œπ‘Œ 𝑠𝑠 curves. As seen in Figure 8.31,
employment 𝑁𝑁 falls with π‘Œπ‘Œ when there is a movement along the aggregate production
function. Owing to there being increasing returns to labour, average labour productivity
π‘Œπ‘Œ/𝑁𝑁 falls with π‘Œπ‘Œ. Moving along the aggregate labour demand curve 𝑁𝑁 𝑑𝑑 , the real wage 𝑀𝑀
falls with π‘Œπ‘Œ and 𝑁𝑁.,
Figure 8.31: Business cycles in the coordination failure model
298
EC2065 Macroeconomics | Chapter 8: Business cycles
Box 8.6: The strength of strategic complementarities
The coordination failure model of business cycles depends on strategic
complementarity in firms’ employment decisions. This strategic complementarity
comes from a positive spillover from aggregate employment to each individual
firm’s productivity. But how strong does this spillover need to be for the model
successfully to generate business cycles?
It is clear from the workings of the model that explaining business cycles as waves
of optimism and pessimism requires multiple equilibria, which depends on the
output supply curve π‘Œπ‘Œ 𝑠𝑠 being downward sloping. For this to happen, the spillover
must be strong enough that there are increasing returns to labour at the level of
the aggregate economy, i.e. an upward-sloping aggregate labour demand 𝑁𝑁 𝑑𝑑 . Just
having a positive spillover might not be enough to offset the usual diminishing
returns to labour, as shown in Figure 8.32.
Figure 8.32: Spillover effect too weak to generate increasing returns
Even if the aggregate 𝑁𝑁 𝑑𝑑 curve is upward sloping, this is not enough. If labour
supply is insufficiently wage elastic, i.e. 𝑁𝑁 𝑠𝑠 (π‘Ÿπ‘Ÿ) is too steep, then there is a unique
equilibrium even if there are increasing returns to labour in aggregate. This case is
depicted in Figure 8.33.
Figure 8.33: Labour supply insufficiently wage elastic
299
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
Chapter 9: Inflation, expectations and
macroeconomic policy
This chapter considers the link between inflation and business-cycle fluctuations when
prices are neither completely sticky nor completely flexible. We will also explore the
important role of expectations in analysing macroeconomic policy and consider the
challenges faced by the central bank when monetary policy is constrained by a lower bound
on interest rates.
Essential reading
•
Williamson, Chapter 15.
9.1 Inflation and the Phillips curve
This section introduces a model that links inflation to the business-cycle fluctuations studied
in Chapter 8. The basic new Keynesian model from the previous chapter assumes all goods
prices are completely rigid, so that model says nothing about how inflation is determined.
On the other hand, full price flexibility implies the economy’s real interest rate is at its
natural rate and its real GDP reaches its natural level. In that case, Chapter 6 showed how
inflation depends on monetary policy and other real and financial variables but there was
only a very limited effect of monetary policy on real variables for moderate inflation rates
because of the absence of nominal rigidity.
We now consider a model with partial price adjustment to bridge the two extremes above.
This model also implies a close link between inflation and the state of the real economy, the
Phillips curve.
9.1.1 Firms’ incentives to adjust prices
As explained in Section 8.5, with monopolistic competition, each firm wants to set a price
where 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑀𝑀, i.e. the marginal revenue product of labour is equal to the real wage. If
𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 < 𝑀𝑀, the price charged by a firm is too low and it would want to raise its price and
sell less. If 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 > 𝑀𝑀, the firm’s price is too high, and it would want to lower the price and
sell more.
To understand why prices are not always set so that 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑀𝑀, we assume there are costs
of price adjustment. These were discussed in Section 8.1 and include ‘menu costs’ and the
managerial costs of making pricing decisions. Firms compare these costs with the benefits of
price adjustment when deciding whether to set a new price. The gains a firm would make by
adjusting the price of its product and the magnitude of the desired price adjustment both
increase with the size of the gap between 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 and 𝑀𝑀. We will not set up a precise
comparison of the costs and benefits of price adjustment here but, in the background, this
300
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
trade-off is why firms will not change their prices all the time, nor leave their prices constant
forever.
9.1.2 Price changes and economic activity
We now link the incentives to adjust prices to economic activity as measured by real GDP π‘Œπ‘Œ.
When GDP π‘Œπ‘Œ is at its natural level π‘Œπ‘Œ ∗ , firms employ workers up to the point 𝑁𝑁 ∗ where
𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑀𝑀, as shown in Figure 9.1. Firms would have no desire to change prices in this
case. If π‘Œπ‘Œ is below π‘Œπ‘Œ ∗ , employment 𝑁𝑁 is below 𝑁𝑁 ∗ and the diagram shows that 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 > 𝑀𝑀,
so a price cut is desired. If π‘Œπ‘Œ is above π‘Œπ‘Œ ∗ , then 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 < 𝑀𝑀, so a price increase is desired.
This is because the marginal revenue product 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 falls with employment (𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 is
downward sloping), while the real wage 𝑀𝑀 rises (𝑁𝑁 𝑠𝑠 is upward sloping).
Moreover, a larger gap between π‘Œπ‘Œ and π‘Œπ‘Œ ∗ means that firms’ desired price change is larger
and more firms will prefer to adjust prices after taking account of the costs of doing so.
Intuitively, as output and employment increase, the marginal product of labour 𝑀𝑀𝑃𝑃𝑁𝑁
declines (as does 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 ), which increases firms’ marginal cost of production. The wage 𝑀𝑀
must also increase to raise the supply of labour, which additionally adds to the cost of
production. It follows that the direction and size of price changes depends on the ‘output
gap’ between actual GDP π‘Œπ‘Œ and its natural level π‘Œπ‘Œ ∗ .
Figure 9.1: Firms' incentives to change prices
Inflation is defined as the rate of change of the price level 𝑃𝑃 over time. The notation we will
use in this chapter is that πœ‹πœ‹ is the inflation rate between the past and current time periods
(note this is different from our earlier notation in Chapter 6) and πœ‹πœ‹′ is the inflation rate
between the current and future time periods, i.e. πœ‹πœ‹ = (𝑃𝑃 − 𝑃𝑃�)/𝑃𝑃� and πœ‹πœ‹ ′ = (𝑃𝑃′ − 𝑃𝑃)/𝑃𝑃,
where 𝑃𝑃� is the past level of prices. Mechanically, inflation πœ‹πœ‹ is positive when firms are
increasing prices on average, or equivalently, when newly set prices are higher than the
average of past prices 𝑃𝑃�. By the same logic, future inflation πœ‹πœ‹′ is expected to be positive
when firms will set higher prices than the current average 𝑃𝑃 in the future.
301
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
9.1.3 Expectations
When there are costs of making price changes and firms do not expect to be adjusting prices
continually, they also need to consider future conditions when setting current prices. If it is
desirable to set higher prices in future time periods then expected future inflation πœ‹πœ‹ ′𝑒𝑒 will
be positive. Hence, any firm adjusting its prices in current period will choose a larger price
increase when πœ‹πœ‹ ′𝑒𝑒 is higher, so higher expected future inflation πœ‹πœ‹ ′𝑒𝑒 leads to more inflation πœ‹πœ‹
in the current time period. The effect of πœ‹πœ‹ ′𝑒𝑒 on πœ‹πœ‹ is less than one-for-one because less
weight is given to future conditions than current conditions when setting prices now.
9.1.4 The Phillips curve
We have seen that our model of partial price adjustment predicts inflation πœ‹πœ‹ is positively
related to the output gap between π‘Œπ‘Œ and π‘Œπ‘Œ ∗ because higher π‘Œπ‘Œ raises 𝑀𝑀 and lowers 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 ,
leading firms to have a greater desire to make larger price increases. Inflation is also
positively related to expected future inflation πœ‹πœ‹ ′𝑒𝑒 , capturing firms’ desire to raise prices preemptively in response to expected future economic conditions.
We refer to the relationship between inflation πœ‹πœ‹ and the output gap π‘Œπ‘Œ − π‘Œπ‘Œ ∗ as the ‘Phillips
curve’. This is the upward-sloping line or curve 𝑃𝑃𝑃𝑃 in Figure 9.2 with πœ‹πœ‹ on the vertical axis
and real GDP π‘Œπ‘Œ on the horizontal axis. If no future inflation is expected (πœ‹πœ‹ ′𝑒𝑒 = 0), the Phillips
curve passes through the point with πœ‹πœ‹ = 0 when π‘Œπ‘Œ = π‘Œπ‘Œ ∗ because there is no inflationary
pressure when output is at its natural level and no future inflation is expected. The Phillips
curve shifts to the right if the natural level of output π‘Œπ‘Œ ∗ rises and shifts upwards if expected
future inflation πœ‹πœ‹ ′𝑒𝑒 is higher.
Figure 9.2: The Phillips curve
The gradient of the Phillips curve indicates how much inflation πœ‹πœ‹ rises when output π‘Œπ‘Œ
increases. The Phillips curve would be flatter if there is less inflationary pressure because of
more nominal rigidity, for example, fewer firms being willing to change price because the
costs of price adjustment are larger.
302
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
The Phillips curve is also flatter if there is greater ‘real rigidity’. Real rigidity refers to firms’
desired prices being less sensitive to what happens to real GDP π‘Œπ‘Œ. For example, if the
marginal product of labour curve 𝑀𝑀𝑃𝑃𝑁𝑁 is flatter (and hence, also 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 ) because returns to
labour diminish less rapidly, then firms’ cost of production rises by less with output,
reducing inflationary pressure. This also occurs if the labour supply curve 𝑁𝑁 𝑠𝑠 is flatter, so
wages need to rise by less to induce an increase in labour supply. Another source of real
rigidity is the presence of efficiency wage concerns that mean real wages are less sensitive
to economic conditions, for example, when a particular constant real wage level maximises
the amount of effective labour input per unit of wages paid.
9.1.5 Inflation and unemployment
Traditionally, the Phillips curve was viewed as a negative relationship between the inflation
rate and the unemployment rate. By adding efficiency wages to our analysis of the labour
market, we immediately obtain this negative inflation-unemployment relationship from the
‘Phillips curve’ in terms of inflation and the output gap explained earlier.
The logic is that, for a given natural rate of unemployment and natural level of output π‘Œπ‘Œ ∗ , an
increase in π‘Œπ‘Œ raises employment and reduces unemployment relative to the natural rate of
unemployment. Hence, if unemployment below its natural rate then output π‘Œπ‘Œ is above π‘Œπ‘Œ ∗ ,
which increases inflationary pressure, while if unemployment above its natural rate then π‘Œπ‘Œ
is below π‘Œπ‘Œ ∗, which decreases inflationary pressure. The downward-sloping Phillips curve in
terms of inflation and unemployment passes through the natural rate of unemployment at
πœ‹πœ‹ = 0 when πœ‹πœ‹ ′𝑒𝑒 = 0. This Phillips curve shifts with changes in inflation expectations or
changes in the natural rate of unemployment.
9.2 Expectations and aggregate demand
We have seen that expectations of future inflation are an important feature of the Phillips
curve that explains current inflationary pressure. Expectations about the future are also
relevant to the level of aggregate demand for goods and services, which we represented
using the output demand curve π‘Œπ‘Œ 𝑑𝑑 derived in Chapter 3.
First, inflation expectations affect the real interest rate that results from the nominal
interest rate set by the central bank. Inflation can therefore affect incentives to save or
borrow.
Second, expectations of the future state of the economy influence current consumption and
investment demand. This is because households have a desire to smooth consumption in
response to expected changes in future income, and investment demand depends on
expectations of the amount of future employment of labour.
9.2.1 Inflation expectations and real interest rates
In Chapter 3, we saw that both consumption and investment demand depend on the
expected real interest rate between the current and future time periods. Here, with our
notation that distinguishes between actual and expected inflation, we denote the ex-ante
real interest rate by π‘Ÿπ‘Ÿ 𝑒𝑒 . The Fisher equation from Section 6.3 implies:
303
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
π‘Ÿπ‘Ÿ 𝑒𝑒 = 𝑖𝑖 − πœ‹πœ‹ ′𝑒𝑒
Given a nominal interest rate 𝑖𝑖, higher expected future inflation πœ‹πœ‹ ′𝑒𝑒 reduces the expected
real interest rate π‘Ÿπ‘Ÿ 𝑒𝑒 . This means there is less incentive to save and a greater incentive to
borrow. With these substitution effects, and ignoring income effects with a representative
household as explained in Section 3.11, higher πœ‹πœ‹ ′𝑒𝑒 leads to higher 𝐢𝐢 𝑑𝑑 . It also raises
investment demand 𝐼𝐼 𝑑𝑑 according to the model from Section 3.8. Output demand π‘Œπ‘Œ 𝑑𝑑
therefore depends positively on expected future inflation πœ‹πœ‹ ′𝑒𝑒 .
9.2.2 Expectations of the economy’s future GDP
The consumption choice model studied in Section 3.3 shows that households’ optimal
consumption plan satisfies 𝑀𝑀𝑀𝑀𝑆𝑆𝐢𝐢,𝐢𝐢 ′ = 1 + π‘Ÿπ‘Ÿ 𝑒𝑒 , where 𝑀𝑀𝑀𝑀𝑆𝑆𝐢𝐢,𝐢𝐢 ′ is the marginal rate of
substitution between current and future consumption. If consumption in the future 𝐢𝐢′ is
expected to be higher then 𝑀𝑀𝑀𝑀𝑆𝑆𝐢𝐢,𝐢𝐢 ′ rises (see Figure 3.9). Hence, for the same expected
real interest rate π‘Ÿπ‘Ÿ 𝑒𝑒 , current consumption 𝐢𝐢 must increase to satisfy 𝑀𝑀𝑀𝑀𝑆𝑆𝐢𝐢,𝐢𝐢 ′ = 1 + π‘Ÿπ‘Ÿ 𝑒𝑒 . This
reflects that both 𝐢𝐢 and 𝐢𝐢 ′ are normal goods, so for a given relative price of current and
future consumption as determined by the real interest rate, households want both 𝐢𝐢 and 𝐢𝐢′
to rise or fall together. This is the desire for consumption smoothing discussed in Chapter 3.
Therefore, the expectation of higher consumption in future raises consumption demand 𝐢𝐢 𝑑𝑑
in the current period.
The model of investment in Section 3.8 implies that firms invest in capital up to the point
where 𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 = π‘Ÿπ‘Ÿ 𝑒𝑒 . If expectations of future employment 𝑁𝑁 ′ increase, this raises the
expected future marginal product of capital 𝑀𝑀𝑃𝑃𝐾𝐾′ . Note that any neoclassical production
function with capital and labour has the feature that a greater employment of labour
increases the marginal product of capital. Therefore, for a given real interest rate, the
expectation of higher future employment raises current investment demand 𝐼𝐼 𝑑𝑑 .
In summary, we conclude that the output demand curve π‘Œπ‘Œ 𝑑𝑑 shifts to the right if
expectations of future inflation πœ‹πœ‹ ′𝑒𝑒 increase, or there are higher expectations of future
consumption or future employment. To simplify matters, we suppose that future
consumption and employment are both positively related to future real GDP, so π‘Œπ‘Œ 𝑑𝑑 shifts to
the right if expectations of future GDP π‘Œπ‘Œ ′ increase.
9.3 Aggregate demand with market imperfections
In analysing the effects of macroeconomic policy on aggregate demand, we will see that it
can be important to incorporate some of the insights of our study of credit-market
imperfections in Chapter 4 and imperfectly competitive markets in Chapter 8. For example,
it is sometimes claimed that increasing aggregate demand through extra government
expenditure 𝐺𝐺 gives rise to a positive feedback loop – a ‘multiplier’. The argument is that
higher demand raises output and income, and consumption then rises with income, further
increasing demand and so on.
However, this multiplier did not feature in our analysis of the output demand curve π‘Œπ‘Œ 𝑑𝑑 from
the dynamic macroeconomic model of Chapter 3. There we emphasised that higher
304
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
government expenditure 𝐺𝐺 increases the tax burden, reducing the amount of private
consumption 𝐢𝐢 that is affordable. Income π‘Œπ‘Œ did increase but, as seen in Box 4.2, that was
the result of the decision of households to supply more labour when faced with a higher tax
burden. Higher GDP π‘Œπ‘Œ caused by higher 𝐺𝐺 did not in itself make households better off.
9.3.1 Consumption and aggregate demand
In the new Keynesian model from Chapter 8, the presence of imperfect competition means
the level of GDP is not efficient. As 𝑀𝑀𝑃𝑃𝑁𝑁 > 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 , households are made better off when
higher aggregate demand for goods increases employment. Higher income itself makes
households better off because the extra ability to consume is worth more to households
than the extra time spent working. In addition, real wages 𝑀𝑀 can rise with aggregate
demand and employment in the new Keynesian model because 𝑀𝑀 begins below the
marginal product of labour 𝑀𝑀𝑃𝑃𝑁𝑁 , unlike in the standard dynamic macroeconomic model.
The consequence of the market imperfections is that consumption demand 𝐢𝐢 𝑑𝑑 now
depends directly on aggregate demand and income π‘Œπ‘Œ in addition to other factors – it is still
necessary to consider the effect of higher government expenditure 𝐺𝐺 on the tax burden.
The sensitivity of 𝐢𝐢 𝑑𝑑 to higher π‘Œπ‘Œ is labelled the marginal propensity to consume (MPC):
πœ•πœ•πΆπΆ 𝑑𝑑
𝑀𝑀𝑀𝑀𝑀𝑀 =
πœ•πœ•πœ•πœ•
The aggregate demand for output is π‘Œπ‘Œ 𝑑𝑑 = 𝐢𝐢 𝑑𝑑 + 𝐼𝐼 𝑑𝑑 + 𝐺𝐺. Consumption demand 𝐢𝐢 𝑑𝑑 is now a
function of π‘Œπ‘Œ, and in equilibrium, income is equal to aggregate demand (π‘Œπ‘Œ = π‘Œπ‘Œ 𝑑𝑑 ). Figure 9.3
plots the expenditure function π‘Œπ‘Œ 𝑑𝑑 against aggregate output and income π‘Œπ‘Œ.
Figure 9.3: Output demand with multiplier
The gradient of the expenditure π‘Œπ‘Œ 𝑑𝑑 as a function of π‘Œπ‘Œ is given by marginal propensity to
consume πœ•πœ•πΆπΆ 𝑑𝑑 ⁄πœ•πœ•πœ•πœ•. In equilibrium, π‘Œπ‘Œ = π‘Œπ‘Œ 𝑑𝑑 , so the level of output demand must lie on the
45∘ line in the diagram. Any factors affecting aggregate expenditure π‘Œπ‘Œ 𝑑𝑑 apart from π‘Œπ‘Œ itself
cause shifts of the expenditure function.
305
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
If aggregate expenditure π‘Œπ‘Œ 𝑑𝑑 were to increase, for example, owing to higher 𝐺𝐺, the
expenditure function shifts vertically upwards by the same amount. By finding the new
point of intersection with the 45∘ line, we see that the overall effect on π‘Œπ‘Œ is larger than the
size of the shift of the π‘Œπ‘Œ 𝑑𝑑 function. A marginal propensity to consume between 0 and 1
(making the expenditure function upward sloping but less steep than the 45∘ line) amplifies
the effects of changes in output demand π‘Œπ‘Œ 𝑑𝑑 because of a positive feedback loop working
through income and consumption.
The consequences of this for the output demand curve π‘Œπ‘Œ 𝑑𝑑 (a relationship between real GDP
π‘Œπ‘Œ and the real interest rate π‘Ÿπ‘Ÿ) derived in Section 3.12 is that the π‘Œπ‘Œ 𝑑𝑑 curve becomes flatter,
i.e. more sensitive to π‘Ÿπ‘Ÿ and shifts become larger than they would be in the standard
dynamic macroeconomic model of Chapter 3.
9.3.2 Consumption and aggregate demand with credit-market
imperfections
Even in the new Keynesian model, the multiplier effect described above is usually not strong
enough to offset the direct effect of the higher tax burden on consumption when public
expenditure is increased. However, a stronger multiplier is found with the credit-market
imperfections that were studied in Chapter 4. We assume some households face a binding
borrowing limit. They cannot generate enough income in the current period to pay for their
desired level of consumption and they cannot borrow against future income. It is important
to appreciate that not all households in the economy will be in this position – some will be
savers – so we are moving away from the usual assumption of a representative household.
As explained in Section 4.3, households who are credit-constrained have a marginal
propensity to consume of 1 from disposable income (assuming all households are able to
increase the amount they work when aggregate output π‘Œπ‘Œ rises). Moreover, their current
consumption does not respond to future taxes, only to current taxes through their impact
on disposable income. In contrast, households who are not credit-constrained have a much
smaller response of consumption to π‘Œπ‘Œ and adjust consumption in response to the present
value of all current and future taxes.
The economy’s overall marginal propensity to consume is a weighted average of the 𝑀𝑀𝑀𝑀𝑀𝑀 =
1 for credit-constrained households and the low 𝑀𝑀𝑀𝑀𝑀𝑀 of unconstrained households, the
weights depending on how much of aggregate consumption spending comes from the two
groups of households. This overall 𝑀𝑀𝑀𝑀𝑀𝑀 determines the gradient of the expenditure
function seen in Figure 9.3 and the size of the multiplier effect. If there are sufficiently many
constrained households, we will see that the overall 𝑀𝑀𝑀𝑀𝑀𝑀 can be high enough to offset the
negative impact on current consumption of the higher future tax burden when the
government increases public expenditure.
306
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
Box 9.1: Multiplier and crowding-out effects of fiscal policy
We now re-examine the effects of a fiscal stimulus. There are two important
changes compared to our earlier analysis in Box 4.2. First, some households are
credit constrained. Second, prices are sticky (completely sticky here for simplicity)
and we suppose monetary policy is accommodative in the sense that the central
bank holds the real interest rate π‘Ÿπ‘Ÿ constant throughout.
Consider an increase in government expenditure 𝐺𝐺 that is financed by running a
larger budget deficit, so there is no rise in current taxes 𝑇𝑇, only future taxes. The
wealth effect from the higher tax burden reduces consumption demand 𝐢𝐢 𝑑𝑑 but
only for those households who are not credit constrained.
Credit-constrained households consume their disposable income and so have a
marginal propensity to consume of one and their current consumption does not
respond to the higher future taxes. This means that 𝐢𝐢 𝑑𝑑 depends on disposable
income π‘Œπ‘Œ − 𝑇𝑇, which rises with π‘Œπ‘Œ. Ignoring the effects of the tax burden on
unconstrained households, the increase in 𝐺𝐺 raises π‘Œπ‘Œ 𝑑𝑑 more than one-for-one
because of the multiplier effect working through the consumption of the
constrained households.
Compared to Box 4.2, the presence of some credit-constrained households means
there is less crowding out of consumption from the higher tax burden and new
multiplier effect on consumption. With no constrained households, we know the
rightward shift of the output demand curve π‘Œπ‘Œ 𝑑𝑑 is smaller than the increase in 𝐺𝐺.
But as the fraction of credit-constrained households rises, the shift of π‘Œπ‘Œ 𝑑𝑑 becomes
larger and can exceed than the increase in 𝐺𝐺.
The other difference compared to the analysis in Box 4.2 is that sticky prices imply
the real interest rate π‘Ÿπ‘Ÿ is determined by the intersection of the π‘Œπ‘Œ 𝑑𝑑 curve and the
𝑀𝑀𝑀𝑀 line, not π‘Œπ‘Œ 𝑑𝑑 and the output supply curve π‘Œπ‘Œ 𝑠𝑠 . This is shown in Figure 9.4. If
monetary policy is accommodative with a horizontal 𝑀𝑀𝑀𝑀 line remaining in the
same position then the real interest rate π‘Ÿπ‘Ÿ does not rise, in contrast to what
happens in an economy with flexible prices.
307
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
Figure 9.4: Effects of a fiscal stimulus with sticky prices
The constant π‘Ÿπ‘Ÿ reduces crowding out of private consumption and investment expenditure
when 𝐺𝐺 rises. The increase in real GDP π‘Œπ‘Œ is equal to the size of the horizontal shift of π‘Œπ‘Œ 𝑑𝑑 and
we have seen that this can be larger than the increase in 𝐺𝐺 when there are sufficiently many
credit-constrained households. It is therefore possible that ‘multiplier’ effects dominate
‘crowding-out’ effects and GDP rises by more than a deficit-financed increase in government
expenditure.
Box 9.2: Asset prices and the financial accelerator
Shocks to aggregate demand can also be amplified through financial markets. For
example, supposing a decline in GDP leads to a reduction in asset prices, this
tightens credit constraints if those assets are used as collateral for borrowing. The
greater difficulty of borrowing this causes then further reduces aggregate demand
and GDP. A feedback loop of this type is known as a ‘financial accelerator’.
We can illustrate the financial accelerator using the limited-commitment model of
borrowing constraints and house prices from Section 4.6. We add to the earlier
model a reason why lower aggregate demand and GDP reduces house prices, for
example, borrowing constraints linked to the incomes of those buying houses, or
complementarities between housing and consumption expenditure on durable
goods. We assume the collateral constraint in the limited-commitment model is
binding, which implies the consumption of existing homeowners is linked to house
prices.
Suppose a negative shock to the economy reduces GDP. This leads to lower house
prices and causes a reduction in consumption of credit-constrained homeowners,
which in turn reduces aggregate demand and GDP, and depresses house prices
further.
308
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
Box 9.3: The 2008 financial crisis
What are the links between the 2008 financial crisis in the USA and the severity of
the ‘great recession’ that followed? Our analysis of credit-market imperfections
from Chapter 4 allows us to identify a number of channels through which a financial
shock can have a large impact on aggregate demand and GDP.
First, falling house prices reduce the value of collateral available to support
borrowing. Using the limited-commitment model from Section 4.6, this reduces the
consumption demand 𝐢𝐢 𝑑𝑑 of credit-constrained households, causing π‘Œπ‘Œ 𝑑𝑑 to shift to
the left.
A financial crisis also leads to expectations of more defaults on debts, which raises
interest-rate spreads through the asymmetric information mechanism explained in
Section 4.5. The consequences of the higher interest rates faced by borrowers are
analysed in Box 4.5 and result in lower 𝐢𝐢 𝑑𝑑 for borrower households and lower 𝐼𝐼 𝑑𝑑
for firms without sufficient internal funds to finance investment. These effects lead
to a further shift of π‘Œπ‘Œ 𝑑𝑑 to the left.
Furthermore, falling asset prices and defaults result in losses for banks, which
reduces bank capital. As explained in Chapter 7, this means that banks may need to
restrict lending and deposit creation to satisfy bank capital requirements. Overall,
these factors imply a large leftward shift of the output demand curve π‘Œπ‘Œ 𝑑𝑑 as shown
in Figure 9.5.
Figure 9.5: A financial shock
Finally, the Federal Reserve was unable fully to offset the large leftward shift of π‘Œπ‘Œ 𝑑𝑑
by reducing interest rates and shifting down the 𝑀𝑀𝑀𝑀 line. This is because of the
lower bound on the nominal interest rate, an issue we will discuss further in Section
9.7. Assuming the nominal interest rate 𝑖𝑖 cannot fall below zero, the lowest
possible real interest rate was −πœ‹πœ‹ ′𝑒𝑒 , the negative of expected future inflation, and
inflation expectations were relatively low during the 2000s.
309
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
9.4 Inflation, aggregate demand and monetary policy
The Phillips curve derived in Section 9.1 implies a link between economic activity and
inflation. We will now consider which point on the Phillips curve the economy will reach,
which depends on the output demand curve and monetary policy. There is an interaction
with monetary policy because the central bank may adjust interest rates in response to
changes in inflation, for example, if it uses an interest-rate feedback rule such as the Taylor
rule studied earlier in Section 6.12. We will summarise the π‘Œπ‘Œ 𝑑𝑑 curve and the stance of
monetary policy with a single curve that can be drawn in the same diagram as the Phillips
curve and the intersection between the two determines inflation πœ‹πœ‹ and real GDP π‘Œπ‘Œ.
Suppose the central bank sets the nominal interest rate 𝑖𝑖 and increases 𝑖𝑖 in response to
higher inflation πœ‹πœ‹. The ‘Taylor principle’ from Section 6.12 suggests 𝑖𝑖 should rise more than
one-for-one with πœ‹πœ‹. The central bank also increases 𝑖𝑖 in response to GDP π‘Œπ‘Œ being higher
than π‘Œπ‘Œ ∗ , using the natural level of output π‘Œπ‘Œ ∗ as the notion of ‘potential output’. In Section
8.2, in the diagram with the π‘Œπ‘Œ 𝑑𝑑 and 𝑀𝑀𝑀𝑀 curves (Figure 8.4), the 𝑀𝑀𝑀𝑀 curve is now upwardsloping and shifts upwards when inflation πœ‹πœ‹ increases.
Higher inflation πœ‹πœ‹ thus causes an upward shift of the 𝑀𝑀𝑀𝑀 curve. The higher real interest rate
induced by monetary policy causes a reduction in aggregate demand, a movement up the
π‘Œπ‘Œ 𝑑𝑑 curve and π‘Œπ‘Œ is lower. Taking as given expectations of πœ‹πœ‹ ′ and π‘Œπ‘Œ ′ , this implies a negative
demand-side relationship between inflation πœ‹πœ‹ and real GDP π‘Œπ‘Œ. We label this the π‘Œπ‘Œ 𝑑𝑑 − 𝑀𝑀𝑀𝑀
line, coming from the combination of the output demand curve and the stance of monetary
policy. Putting this together with the upward-sloping Phillips curve that represents the
supply side of the economy, the intersection between π‘Œπ‘Œ 𝑑𝑑 − 𝑀𝑀𝑀𝑀 and 𝑃𝑃𝑃𝑃 determines
inflation πœ‹πœ‹ and real GDP π‘Œπ‘Œ. This is illustrated in Figure 9.6.
Figure 9.6: The Phillips curve, output demand, and monetary policy
The π‘Œπ‘Œ 𝑑𝑑 − 𝑀𝑀𝑀𝑀 curve shifts to the right for the same reasons that cause the output demand
curve π‘Œπ‘Œ 𝑑𝑑 to shift to the right. It shifts to the left if there is an exogenous increase in the
tightness of monetary policy. Using the links between expectations of the future and
310
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
aggregate demand studied in Section 9.2, it also shifts to the right with an increase in
expected future inflation πœ‹πœ‹ ′ or expected future GDP π‘Œπ‘Œ ′ .
9.5 The costs of inflation
Our discussions of macroeconomic policy have focused on obtaining a desirable outcome
for real GDP π‘Œπ‘Œ, for example, by closing the output gap between π‘Œπ‘Œ and π‘Œπ‘Œ ∗ . But there are also
costs of inflation that make control of inflation a legitimate objective of policy in its own
right. We now consider these costs of inflation.
Some costs of high inflation, arising from money being a poor store of value, have already
been analysed in Section 6.10. However, the nominal rigidities and the unit of account
function of money studied in Chapter 8 and here give rise to further costs of inflation. They
also suggest the optimal inflation rate might be different from what was found in Chapter 6.
9.5.1 Money being a poor store of value
As explained in Section 6.10, for a given real interest rate on bonds, higher inflation makes
money a worse store of value. This leads to time and resources being wasted in trying to
economise on holding money or creating substitutes for money. It also reduces production
because money being a poor store of value is an implicit tax on economic activity.
Hence, money’s medium of exchange function suggests there are costs of inflation, or to be
precise, costs of anticipated inflation. The earlier analysis of the Friedman rule indicates
these costs are eliminated only when there is deflation at a rate equal to the real interest
rate (πœ‹πœ‹ ′𝑒𝑒 = −π‘Ÿπ‘Ÿ 𝑒𝑒 < 0). This suggests the optimal inflation rate is negative.
9.5.2 Menu costs and relative-price distortions
The nominal rigidities in the new Keynesian model imply there are additional costs of
inflation. Costs incurred by firms in adjusting prices, for example menu costs and managerial
time, are higher as inflation – or deflation – increases. Since price adjustments are not
perfectly synchronised across different firms, positive or negative inflation rates also affect
the relative prices of different goods, causing misallocation of spending.
Costs of these kinds increase as inflation πœ‹πœ‹ rises above or falls below zero and occur for both
anticipated and unanticipated inflation. These considerations suggest aiming for a zero rate
of inflation.
9.5.3 Inflation and redistribution
Another potential problem of unanticipated inflation is the redistribution between creditors
and debtors it causes. To see this, suppose savers hold nominal bonds issued by borrowers.
The ex-ante Fisher equation 𝑖𝑖 = π‘Ÿπ‘Ÿ 𝑒𝑒 + πœ‹πœ‹ ′𝑒𝑒 implies that higher expected inflation πœ‹πœ‹ ′𝑒𝑒 can
result in higher 𝑖𝑖, leaving the expected real return π‘Ÿπ‘Ÿ 𝑒𝑒 on nominal bonds unchanged. The expost Fisher equation is π‘Ÿπ‘Ÿ = 𝑖𝑖 − πœ‹πœ‹′, which indicates the real return on bonds depends on the
realised inflation rate πœ‹πœ‹ ′ but the nominal interest rate 𝑖𝑖 cannot adjust if this inflation is
unexpected. This logic suggests inflation should be as predictable as possible but does not
provide specific guidance on what the optimal inflation rate is.
311
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
Box 9.4: Inflation targeting
In light of the costs of inflation, one widely used monetary policy strategy is to
make controlling inflation the primary goal of monetary policy. Inflation targeting is
where the central bank uses its policy instruments to try to achieve inflation πœ‹πœ‹ =
πœ‹πœ‹ ∗ , where πœ‹πœ‹ ∗ is the target rate of inflation. Inflation-targeting central banks
typically have targets for 2 per cent or 3 per cent inflation with some margin for
error.
The benefits of price stability in the new Keynesian model
Although real-world central banks typically have positive inflation targets, the
nominal rigidities of the standard new Keynesian model suggest a zero inflation
target πœ‹πœ‹ ∗ = 0 is best (we will consider later a reason for targeting a positive
inflation rate). The costs of inflation linked to nominal rigidity are minimised by
ensuring inflation πœ‹πœ‹ is kept close to zero. But what about the consequences for the
stability of real variables such as GDP of this exclusive focus on inflation?
Assume the inflation target πœ‹πœ‹ = πœ‹πœ‹ ∗ = 0 is achieved (and this is expected to
continue in the future as well). The new Keynesian model with partial price
adjustment implies there is no inflation or deflation only if firms are on average
happy with the existing prices they have previously set. With imperfect
competition, these firms have no desire to raise or lower prices when their
marginal revenue product of labour 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 is equal to the real wage 𝑀𝑀. But if
𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑀𝑀 then employment must be such that the resulting supply of output is
on π‘Œπ‘Œ 𝑠𝑠 curve because 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑀𝑀 is what occurs when prices are fully flexible.
This argument indicates that aiming for price stability (zero inflation) in the new
Keynesian model should result in real GDP π‘Œπ‘Œ being at the intersection of the π‘Œπ‘Œ 𝑑𝑑 and
π‘Œπ‘Œ 𝑠𝑠 curves, hence, equal to the natural level of output π‘Œπ‘Œ ∗ . This means there would
be no output gap and the real interest rate would be at the natural rate of interest
π‘Ÿπ‘Ÿ ∗ . The pursuit of price stability thus results in real economic outcomes that are the
same as if prices were completely flexible.
With πœ‹πœ‹ ′𝑒𝑒 = 0, monetary policy needs to set a nominal interest rate 𝑖𝑖 = π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿ ∗ to
achieve the zero inflation target. Interestingly, this is exactly the same monetary
policy as the optimal stabilisation policy from Section 8.6 where there was no need
for any concern about inflation because prices were completely sticky. Here, we
have added an inflation objective alongside the desire to close the output gap, but
the same monetary policy is able to achieve both objectives.
Flexible or strict inflation targeting?
Should central banks interpret an inflation target strictly to the exclusion of other
objectives? Or should meeting the inflation target be ‘flexible’, allowing also for
stabilisation of fluctuations in real GDP and employment?
312
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
In the standard new Keynesian model, we have seen that aiming for πœ‹πœ‹ = 0 strictly
means accepting real GDP π‘Œπ‘Œ = π‘Œπ‘Œ ∗ , which closes the output gap. This suggests a
‘strict’ inflation target is not necessarily bad. One concern might be that real GDP
π‘Œπ‘Œ ∗ is inefficiently low because 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑀𝑀 ∗ = 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 implies 𝑀𝑀𝑃𝑃𝑁𝑁 > 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢
(recalling that 𝑀𝑀𝑃𝑃𝑁𝑁 > 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 ). But as we will see in Box 9.5, aiming for π‘Œπ‘Œ
systematically above π‘Œπ‘Œ ∗ risks an inflation bias.
Putting aside concerns that output π‘Œπ‘Œ ∗ is too low on average, what about
fluctuations in π‘Œπ‘Œ ∗ and, hence, in actual real GDP π‘Œπ‘Œ = π‘Œπ‘Œ ∗ when πœ‹πœ‹ = 0? Will a strict
inflation target cause real GDP to fluctuate too much? Let us consider this point
after when efficiency wages are added to the new Keynesian model. This means the
level of employment 𝑁𝑁 ∗ associated with the natural level of output π‘Œπ‘Œ ∗ is found
where the efficiency wage 𝑀𝑀 ∗ equals 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 , rather than where 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 intersects
οΏ½ is where 𝑀𝑀𝑃𝑃𝑁𝑁 =
the labour supply curve 𝑁𝑁 𝑠𝑠 . The efficient level of employment 𝑁𝑁
𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 , which lies on the 𝑁𝑁 𝑠𝑠 curve as shown in Figure 9.7.
Figure 9.7: Excessive real GDP fluctuations with a strict inflation target
Now consider a negative supply shock that shifts 𝑀𝑀𝑃𝑃𝑁𝑁 and 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 to the left. As
seen in the figure, this shock causes employment 𝑁𝑁 ∗ and output π‘Œπ‘Œ ∗ to fall too much
οΏ½ ). Rigidities such as efficiency wages can therefore
compared to what is efficient (𝑁𝑁
make the levels of employment and output too volatile if the central bank follows a
strict inflation target resulting in 𝑁𝑁 = 𝑁𝑁 ∗ and π‘Œπ‘Œ = π‘Œπ‘Œ ∗ .
In these circumstances, a flexible inflation target generally performs better than a
strict inflation target. Following a temporary negative supply shock, allowing
inflation πœ‹πœ‹ to rise means 𝑁𝑁 will be above 𝑁𝑁 ∗ (this is a movement along the Phillips
curve). This is a better outcome because employment drops too much if πœ‹πœ‹ = 0,
although there is a trade-off between the cost of the positive inflation rate and the
better outcome for real GDP.
313
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
9.6 Time inconsistency
We have seen that macroeconomic policies such as monetary and fiscal policy can have a
direct impact on the economy. But since expectations of the future matter for current
economic outcomes, expectations of future policies can also affect the economy today.
Policymakers will be able to achieve more if they are able to influence expectations as well
as take direct action. However, announcements of future policies might not be credible
because of the problem of ‘time inconsistency’.
A policy is said to be time inconsistent if the policymaker gains in the current period when
the announcement is believed but does not gain by implementing the policy when the time
comes in future – even if nothing fundamental has changed. A time-consistent policy is one
where there is no incentive ex post to follow a different policy from the one it was optimal
to announce earlier. Without the ability to commit to future actions, people have no
incentive to believe a policymaker will follow a time-inconsistent policy because the
announcement lacks credibility.
What makes some policies time inconsistent? Since expectations of the future influence the
economy in the present, the policymaker can achieve more in the current period if
announcements of future policy are believed and change expectations today. But that
benefit lies in the past when the time comes in the future actually to implement the
announcement, so the policy that is now optimal to the policymaker is different.
For example, consider the announcement that tax rates on capital or capital income will be
low. If this is believed, it encourages investment and raises the capital stock. Since building
up new capital takes some time, it is expected future tax rates that matters for investment
decisions. But once capital is accumulated, the government has an incentive to implement a
‘one-off’ capital levy to raise tax revenue.
Another example is announcing a target for a low rate of inflation. If this is believed, it
reduces the nominal interest rate and money becomes seen as a better store of value,
which encourages economic activity. Note that it is expected future inflation that matters
for the interest rate on nominal bonds and decisions about holding money. However, ex
post, the government has an incentive to tolerate a ‘one-off’ burst of inflation to reduce the
real value of government liabilities through an inflation tax.
Box 9.5: The inflation bias problem
In the new Keynesian model, the central bank is able to raise real GDP π‘Œπ‘Œ by lowering the real
interest rate π‘Ÿπ‘Ÿ. We saw in Section 8.6 and Box 9.4 how this makes it possible for the central
bank to pursue stabilisation policy that aims to close the output gap between real GDP π‘Œπ‘Œ and
the natural level of output π‘Œπ‘Œ ∗ , as well as achieve price stability.
However, the natural level of output π‘Œπ‘Œ ∗ is itself inefficiently low, with 𝑀𝑀𝑃𝑃𝑁𝑁 > 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 even at
π‘Œπ‘Œ = π‘Œπ‘Œ ∗ because 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑀𝑀 = 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐢𝐢 and 𝑀𝑀𝑃𝑃𝑁𝑁 > 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 . This inefficiency of π‘Œπ‘Œ ∗ arises
owing to distortions present in the economy in addition to the problem of nominal rigidity
that the stabilisation policy and inflation targeting are able to mitigate. The basic source of
314
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
distortions is the imperfect competition in the goods market that leads firms to produce
too little as a way of boosting profits. But other distortions such as ‘efficiency wages’ or
taxes that reduce the incentive to work also worsen the inefficiency of the natural level of
output.
In light of GDP being inefficiently low, should monetary policy aim to provide additional
stimulus even when π‘Œπ‘Œ = π‘Œπ‘Œ ∗ ? Suppose monetary policy reduces the real interest rate
below the natural interest rate π‘Ÿπ‘Ÿ ∗ to push real GDP above its natural level π‘Œπ‘Œ ∗ . Taking as
given inflation expectations πœ‹πœ‹ ′𝑒𝑒 , this policy moves the economy along a Phillips curve
with output π‘Œπ‘Œ and inflation πœ‹πœ‹ both rising. This seems to improve the outcome for real
GDP, albeit at the cost of some inflation.
However, this policy is not a response to a shock but to economic activity being judged
systematically too low. Hence, the higher inflation that results from it can be anticipated.
This causes inflation expectations to rise, which shifts the Phillips curve upwards. The
upward shift of the Phillips curve worsens the combinations of inflation and real GDP that
can be attained by the central bank. Now, an even higher rate of inflation is required to
achieve a given target for the level of real GDP, and when that is anticipated, this leads to
even higher expected inflation. Eventually, inflation expectations stop rising when further
inflation would be too costly for the central bank to tolerate.
These adverse shifts of the Phillips curve mean that there is higher inflation with only a
limited (or no) gain in terms of real GDP. Monetary policy therefore suffers from an
‘inflation bias’. The freedom to use monetary policy to aim for the efficient level of real
GDP leads to higher inflation, but fails to achieve its original goal. It results in higher costs
of inflation, without gaining much or anything by way of a better outcome for real GDP.
If the central bank were to announce it would target a lower rate of inflation and inflation
expectations fell, the Phillips curve would be in a more favourable position. But then
there would be a temptation to pursue an expansionary monetary policy to raise π‘Œπ‘Œ above
π‘Œπ‘Œ ∗ . This points to the time inconsistency of announcing a goal of lower inflation.
Owing to the time inconsistency problem, an announcement that the central bank will
pursue low inflation is not credible. But there would be gains from being able to reduce
inflation expectations and obtain a Phillips curve in a more favourable position if the
central bank were able to commit itself to a low-inflation monetary policy.
What institutional mechanisms might enable the central bank to do this? First, giving the
central bank independence might insulate it from political pressure to aim for real GDP π‘Œπ‘Œ
above π‘Œπ‘Œ ∗. Second, establishing a framework for monetary policy where the central bank
is given the primary task of controlling inflation and is judged by the public on how well it
performs in this might shift the focus of monetary policy away from the level of real GDP.
An independent central bank given the job of meeting an inflation target is the current
consensus on how best to achieve this. The central bank is usually allowed to interpret its
inflation target ‘flexibly’ and consider business-cycle fluctuations in real GDP in its policy
deliberations but to focus only on inflation in the long run.
315
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
9.7 Unconventional monetary policy at the interest-rate lower
bound
An important obstacle to using monetary policy to stabilise the economy in the way
described in Section 8.6 is the interest-rate lower bound. It was argued the central bank
could close the output gap and obtain π‘Œπ‘Œ = π‘Œπ‘Œ ∗ by setting the nominal interest rate 𝑖𝑖 so that
the implied real interest rate π‘Ÿπ‘Ÿ equals natural rate of interest π‘Ÿπ‘Ÿ ∗ . But as Section 6.13
explains, there is a limit on how far the nominal interest rate 𝑖𝑖 can be reduced by the central
bank. This limit used to be seen as zero but is now thought to be slightly negative (see
Section 6.14 for one reason why).
The ex-ante Fisher equation implies the real interest rate is π‘Ÿπ‘Ÿ = 𝑖𝑖 − πœ‹πœ‹ ′𝑒𝑒 . Taking as given
inflation expectations πœ‹πœ‹ ′𝑒𝑒 , if there is a lower bound 𝑖𝑖 ≥ 0 on the nominal interest rate
(taken to be zero here) then the real interest rate is subject to the lower bound π‘Ÿπ‘Ÿ ≥ −πœ‹πœ‹ ′𝑒𝑒 .
The real interest rate cannot fall below the negative of the rate of inflation that is expected.
The problem is that a large shock to the economy may reduce the natural rate of interest π‘Ÿπ‘Ÿ ∗
below −πœ‹πœ‹ ′𝑒𝑒 , meaning that it is not feasible to get to π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿ ∗ by lowering 𝑖𝑖. This challenge for
monetary policy is illustrated in Figure 9.8.
Figure 9.8: Interest-rate lower bound and the limits to stabilisation policy
The inability to use conventional monetary policy to stabilise the economy once the
interest-rate lower bound is reached has led central banks to use or consider using
unconventional monetary policies instead, such as:
•
•
•
Quantitative easing
Forward guidance
Negative interest rates.
9.7.1 Quantitative easing
If the central bank cannot reduce the nominal interest rate 𝑖𝑖, can it stimulate the economy
by increasing the money supply instead? Policies of this type are known as quantitative
easing (QE).
316
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
Suppose the central bank creates more money and purchases short-term nominal
government bonds. Since the interest-rate lower bound has been reached, these short-term
nominal government bonds have yield 𝑖𝑖 = 0. But as Section 6.13 has explained, at the
interest-rate lower bound, the demand for money becomes perfectly interest elastic
because money and risk-free nominal bonds are perfect substitutes at the margin – the
‘liquidity trap’. As Figure 9.9 shows, this additional money would be passively absorbed into
money demand but there would be no change in interest rates, prices, or real GDP.
Can quantitative easing be adapted so that expansions of the money supply do have an
effect on the economy? What needs to be done differently?
Figure 9.9: The liquidity trap and the limits to stabilisation policy
It is implicit in the example from Figure 9.9 that the expansion of the money supply is only
temporary and would be reversed once the interest-rate lower bound ceases to bind and QE
is no longer needed. As explained in Box 6.3, the effects of a permanent expansion of the
money supply would be quite different, causing an increase in prices and inflation, and
leading here to a movement along the Phillips curve with higher economic activity. Hence, a
commitment to maintain a monetary expansion even in normal times in the future might in
principle be a more effective form of QE, although it may suffer from a time-inconsistency
problem and lack credibility if people think the central bank will change course in the future.
Another proposal is a ‘helicopter drop’ of money, where a monetary expansion is
transferred to the government and given away in the form of lower taxes or increased
transfer payments for households. In principle, this is very similar to the permanent
expansion of the money supply described above – which would deliver a fiscal gain to the
government from reducing the real value of existing money and nominal debt, and
ultimately show up as lower taxes. But one important difference is that it would be more
difficult to reverse because the central bank has given the money away and does not hold
any additional assets that could be used to buy back the money. While this policy might be
more effective, it is widely seen as setting a dangerous precedent and blurring the lines
between monetary and fiscal policy.
317
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
Rather than claim monetary expansions will be permanent, real-world central banks’ QE has
instead attempted to get around the liquidity trap problem by making purchases of longterm government bonds with a positive yield, or private-sector assets such as corporate
bonds or mortgage-backed securities. These assets share the feature that they are risky,
unlike short-term government bonds. Private-sector assets are subject to default risk and
even long-term government bonds are risky for those not holding them until maturity (see
Box 7.5). These risky assets are not perfect substitutes for money, unlike short-term
government bonds at interest-rate lower bound.
We can use the model of portfolio choice from Section 7.8 to analyse the effects of
purchases of risky assets by the central bank. Central-bank purchases of risky assets require
in equilibrium that private investors hold a smaller fraction of risky assets and a larger
fraction of risk-free assets (the money the central bank creates) in their portfolios. As shown
in Figure 9.10, a lower risk premium is needed for this portfolio to be chosen, so centralbank purchases result in a decline in risk premiums. This is known as the ‘portfolio balance
effect’. This then helps an economy at the lower bound by reducing the cost of credit for
risky borrowers, or those borrowing over long periods.
Figure 9.10: The portfolio balance effect
9.7.2 Forward guidance
The term ‘forward guidance’ refers to an announcement made by the central bank about
the future path of interest rates. When the current nominal interest rate is at its lower
bound, the central bank is still able to give an indication of what it will do with future
interest rates, at least to the extent that future interest rates are not already themselves
expected to be at the lower bound. For example, there could be an announcement that
interest rates will remain ‘lower for longer’, i.e. the central bank will not raise interest rates
as quickly as is currently expected.
If interest rates can be lowered in future periods then this would be expected to raise future
output π‘Œπ‘Œ ′ and future inflation πœ‹πœ‹ ′ . Higher expectations of πœ‹πœ‹ ′ and π‘Œπ‘Œ ′ both increase output
demand in the current period through the channels explained in Section 9.2, even though
318
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
there is no immediate change in interest rates. Thus, forward guidance might work through
manipulating expectations of the future in a way that improves current economic outcomes.
However, the policy is not a ‘free lunch’ because future outcomes are worse if future
interest rates are set at an inappropriate level for the conditions then prevailing.
This also points to a potential problem with the credibility of forward guidance. Once the
future period is reached, the central bank may not want to keep interest rates too low. The
benefit of announcing this was better economic outcomes in the past but that is now a
bygone, while the cost is worse current economic outcomes. This time-inconsistency
problem can undermine the credibility of a forward guidance announcement. For people to
believe the announcement, the central bank needs to make a binding commitment but lacks
any straightforward way to tie its hands.
Box 9.6: Inflation targeting and the interest-rate lower bound problem
If the lower bound on nominal interest rates jeopardises macroeconomic stability and
unconventional monetary policies are seen as ineffective or costly are there any reforms
to economic policy that would mitigate the lower bound problem? For example, should
the existing framework of inflation targeting that is used in many countries be changed
or abandoned? Below we explore a number of alternatives.
Raising the inflation target
Suppose a higher target πœ‹πœ‹ ∗ for inflation is chosen. The Fisher equation with the real
interest rate at its natural rate π‘Ÿπ‘Ÿ ∗ in the long run implies 𝑖𝑖 = π‘Ÿπ‘Ÿ ∗ + πœ‹πœ‹ ∗ . A higher inflation
target πœ‹πœ‹ ∗ thus means a higher nominal interest rate 𝑖𝑖 on average, which gives a larger
cushion to adjust 𝑖𝑖 downwards. With πœ‹πœ‹ ′𝑒𝑒 ≈ πœ‹πœ‹ ∗ and a lower bound of zero on the nominal
interest rate 𝑖𝑖, the real interest rate can be reduced to −πœ‹πœ‹ ∗ , which is lower when πœ‹πœ‹ ∗ is
higher.
Therefore, if inflation targets were raised, monetary policy can shift the 𝑀𝑀𝑀𝑀 line further
downwards. This gives the central bank a greater ability to offset larger negative demand
shocks. However, there are costs of having higher πœ‹πœ‹ on average, as discussed in Section
9.5.
Average inflation targeting
It would also be possible to replace the standard form of inflation targeting with what is
called ‘average inflation targeting’. This has the central bank aim for a target based on
the inflation rate averaged over a number of years. Average inflation targeting was
adopted by the US Federal Reserve in August 2020.
Consider an example of this policy where the target is for inflation averaged over two
periods. If this is credible, people expect that an average of current inflation πœ‹πœ‹ and future
inflation πœ‹πœ‹ ′ will remain stable even if πœ‹πœ‹ does not. Suppose πœ‹πœ‹ falls because of a negative
demand shock that the central bank cannot offset because of the interest-rate lower
bound. Under this policy, there is then an increase in expectations of πœ‹πœ‹ ′ – assuming
monetary policy is not also constrained in the future period. As explained in Section 9.2,
higher πœ‹πœ‹ ′𝑒𝑒 reduces the real interest rate and boosts demand in the current period.
319
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
Under this policy, there is then an increase in expectations of πœ‹πœ‹ ′ – assuming monetary
policy is not also constrained in the future period. As explained in Section 9.2, higher πœ‹πœ‹ ′𝑒𝑒
reduces the real interest rate and boosts demand in the current period.
Price-level targeting or nominal GDP targeting
Following a similar logic to average inflation targeting, targets for the level of prices 𝑃𝑃
have also been suggested. Price-level targeting is equivalent to a target for the inflation
rate averaged over a long period. Lower inflation now causes expected future inflation to
rise so that the future price level can return to its target level or path.
Another alternative policy with some similar features is a target for the level of nominal
GDP. If credible, low inflation now means expectation of higher future inflation or higher
future real GDP, and expectations of higher πœ‹πœ‹ ′ or π‘Œπ‘Œ ′ boost current demand.
Box 9.7: Forward guidance and confidence
Central banks have increasingly used ‘forward guidance’ when nominal interest rates are
at their lower bound. With forward guidance, the central bank provides information
about future path of interest rates. But how is that information interpreted by people in
the economy?
In Section 9.7 we described one interpretation, where people believe the central bank
will keep interest rates lower in future when it is not constrained by the lower bound
problem and that interest rates will be lower than future economic conditions warrant.
This future stimulus raises expectations of πœ‹πœ‹ ′ and π‘Œπ‘Œ ′ , which boosts the current level of
demand. In this case, forward guidance is interpreted as a commitment to keep
monetary policy excessively loose in the future.
But another way people might interpret the forward guidance announcement is that the
central bank is more pessimistic about the economy. Hearing the forward guidance
announcement, they think the central bank must believe prospects for the economy
have become weaker if it is predicting future interest rates will be lower than before. If
the private sector revises its own beliefs after hearing the announcement then
confidence about the future declines. This leads to lower expectations of πœ‹πœ‹ ′ and π‘Œπ‘Œ ′ ,
which reduce demand through the channels explained in Section 9.2, the opposite of
what the central bank intended if it was trying to make a commitment.
320
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
Box 9.8: Negative interest rate policies
We saw in Section 6.14 that it is possible for a central bank to reduce the nominal
interest rate 𝑖𝑖 below zero. But there is still a lower bound −β„Ž while physical cash
remains available, where β„Ž is the proportional holding cost of cash taking account
of the need for secure storage. With interest rate lower bound 𝑖𝑖 ≥ −β„Ž, the central
bank can lower the real interest rate π‘Ÿπ‘Ÿ = 𝑖𝑖 − πœ‹πœ‹ ′𝑒𝑒 to −πœ‹πœ‹ ′𝑒𝑒 − β„Ž at most. As shown in
Figure 9.11, this gives further scope for monetary stimulus, which can be effective
through the usual channels.
Figure 9.11: Negative nominal interest rates
By setting negative nominal interest rates, the central bank can shift the 𝑀𝑀𝑀𝑀 line
further downwards along the π‘Œπ‘Œ 𝑑𝑑 curve, which allows it to stabilise the economy for
a larger range of negative demand shocks.
However, there may be costs of this negative interest rate policy. First, there can be
an inefficiently large use of cash to avoid negative rates, which wastes resources on
security and storage costs (see Section 6.14). Negative interest rates also mean
banks cannot break even without negative interest rates charged on deposits. But if
households and firms were to switch from using bank deposits to using cash as
money then this would eliminate the benefits of financial intermediation provided
by banks (see Section 7.10).
321
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Chapter 10: Open-economy macroeconomics
This chapter explores how an economy’s interactions with the rest of the world influence
GDP, its pattern of trade, exchange rates, and welfare.
Essential reading
•
Williamson, Chapters 16 and 17.
10.1 International trade in goods and assets
Globalisation has increased the scope for countries to trade with one another and exploit
gains from trade. We usually think of gains from trade as coming from exchanging different
types of goods. For example, Saudi Arabia has a comparative advantage in extracting oil and
China has a comparative advantage in manufactured goods. But in thinking about trade and
macroeconomics, another type of trade known as ‘intertemporal’ trade is important.
In this type of trade, countries exchange goods at different points in time. We do not usually
observe such intertemporal exchanges of goods taking place direct but this type of exchange
is mediated whenever assets are traded between countries. For example, if a country has a
low income but expects a higher income in the future, firms and households gain by
borrowing from the rest of the world to invest and smooth consumption. This works by
selling bonds to foreigners now and repaying debt in future. By selling bonds, the country
imports goods to invest and consume, and exports more in future to repay its debts.
Similarly, a country expecting low growth or a decline in income might gain by lending to the
rest of the world. It buys bonds from foreigners now and spends the payoff from owing
those bonds later. It exports goods now to acquire assets and is able to import more when it
is repaid. In these examples, countries effectively have comparative advantages in
production at different points in time.
To analyse international trade, recall the terminology of the balance of payments. The
balance of payments 𝐡𝐡𝐡𝐡 is broken down into two accounts, the current account 𝐢𝐢𝐢𝐢 and the
financial account 𝐹𝐹𝐹𝐹:
𝐡𝐡𝐡𝐡 = 𝐢𝐢𝐢𝐢 + 𝐹𝐹𝐹𝐹
The current account is the sum of net exports and net foreign income:
𝐢𝐢𝐢𝐢 = 𝑁𝑁𝑁𝑁 + 𝑁𝑁𝑁𝑁𝑁𝑁
Net exports 𝑁𝑁𝑁𝑁 refers to the value of exports minus value of imports. Net foreign income
N𝐹𝐹𝐹𝐹 is income earned from ownership of foreign assets minus foreign claims to income
earned in domestic economy.
322
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
The terminology related to the financial account (𝐹𝐹𝐹𝐹) has changed over time. You may see it
referred to as the capital account (𝐾𝐾𝐾𝐾) in some textbooks. The financial account comprises
foreign net purchases of domestic assets minus domestic net purchases of foreign assets.
Any purchases or sales of official foreign-exchange reserves are included in 𝐹𝐹𝐹𝐹 in this
course, so the financial account captures all trade in assets, both by the private sector and
the government.
The balance of payments identity is 𝐡𝐡𝐡𝐡 = 0, so the financial and current accounts must in
equilibrium be related according to:
𝐹𝐹𝐹𝐹 = −𝐢𝐢𝐢𝐢
Next, let us see how international trade affects the goods market and the relationship
between saving and investment. Net exports 𝑁𝑁𝑁𝑁 are a component of aggregate
expenditure, so the breakdown of GDP by expenditure is now:
π‘Œπ‘Œ = 𝐢𝐢 + 𝐼𝐼 + 𝐺𝐺 + 𝑁𝑁𝑁𝑁
An open economy also has a separate notion of gross national product (GNP) as well as GDP.
GNP is equal to GDP plus net foreign income:
𝐺𝐺𝐺𝐺𝐺𝐺 = π‘Œπ‘Œ + 𝑁𝑁𝑁𝑁𝑁𝑁
National saving 𝑆𝑆 is the sum of private and public saving, 𝑆𝑆 = 𝑆𝑆 𝑝𝑝 + 𝑆𝑆 𝑔𝑔 , where private saving
is 𝑆𝑆 𝑝𝑝 = (π‘Œπ‘Œ + 𝐼𝐼𝐼𝐼𝐼𝐼 + 𝑁𝑁𝑁𝑁𝑁𝑁 − 𝑇𝑇) − 𝐢𝐢. This is disposable income minus consumption 𝐢𝐢, where
the private sector receives domestic income π‘Œπ‘Œ, interest on government debt 𝐼𝐼𝐼𝐼𝐼𝐼, net
foreign income 𝑁𝑁𝑁𝑁𝑁𝑁, and pays taxes net of transfers 𝑇𝑇. Public saving is 𝑆𝑆 𝑔𝑔 = 𝑇𝑇 − 𝐼𝐼𝐼𝐼𝐼𝐼 − 𝐺𝐺,
with the budget deficit being −𝑆𝑆 𝑔𝑔 . National saving is therefore 𝑆𝑆 = π‘Œπ‘Œ + 𝑁𝑁𝑁𝑁𝑁𝑁 − 𝐢𝐢 − 𝐺𝐺 =
𝐺𝐺𝐺𝐺𝐺𝐺 − 𝐢𝐢 − 𝐺𝐺, and the national accounts and balance-of-payments identities imply:
𝑆𝑆 = 𝐼𝐼 + 𝐢𝐢𝐢𝐢
This means that a country can save either through investment in the domestic capital stock,
or through running a current account surplus (𝐢𝐢𝐢𝐢 > 0). Equivalently, a financial account
deficit (𝐹𝐹𝐹𝐹 < 0) implies lending to the rest of the world. A country can borrow by running a
current account deficit (𝐢𝐢𝐢𝐢 < 0), noting that a financial account surplus (𝐹𝐹𝐹𝐹 > 0) implies
borrowing from the rest of the world.
Saving, investment, and the current account are flow variables. Net foreign assets 𝑁𝑁𝑁𝑁𝑁𝑁 is
net stock of international savings, defined as the value of foreign assets owned by domestic
residents minus value of domestic assets owned by foreign residents. The accounting
equation for the link between current and future net foreign assets denoted by 𝑁𝑁𝑁𝑁𝑁𝑁′ is:
𝑁𝑁𝑁𝑁𝐴𝐴′ = 𝑁𝑁𝑁𝑁𝑁𝑁 − 𝐹𝐹𝐹𝐹 + Net capital gains
For the purposes of this course, we will ignore changes in asset valuations. Hence, the
current account gives the change in net foreign assets:
𝑁𝑁𝑁𝑁𝐴𝐴′ − 𝑁𝑁𝑁𝑁𝑁𝑁 = 𝐢𝐢𝐢𝐢
323
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
We will focus on international trade done by a small open economy (SOE). An SOE is not
large enough to affect what happens in other countries or in world markets. This makes the
country a price taker in competitive world markets. The country can be affected by what
happens in the rest of the world but there is no feedback from it to the world. This
assumption is appropriate for some economies, such as Singapore, although not for others.
However, many of the lessons for SOEs carry over to the analysis of large open economies.
10.2 Gains from trade in assets
We will start by demonstrating how there can be gains from trade even if countries do not
have comparative advantages in producing different types of goods. To that end, suppose
there is a single homogeneous good everywhere in the world and all variables are measured
in real terms in units of this good.
There is a single type of asset traded internationally, namely, a bond with real interest rate
π‘Ÿπ‘Ÿ. Hence, all assets around the world have the same real return π‘Ÿπ‘Ÿ. If a country’s net foreign
assets 𝑁𝑁𝑁𝑁𝑁𝑁 are held in this bond then its net foreign income is 𝑁𝑁𝑁𝑁𝑁𝑁 = π‘Ÿπ‘Ÿ × π‘π‘π‘π‘π‘π‘. Initially, we
rule out any investments in physical capital (𝐼𝐼 = 0), which has the consequence that
national saving (or borrowing) is possible only through the current account because 𝑆𝑆 = 𝐢𝐢𝐢𝐢.
Our analysis is done in a two-period set-up like the consumption choice model from Chapter
3. Since the second period is final, no one wants to hold assets after (𝑁𝑁𝑁𝑁𝐴𝐴′′ = 0), which
requires 𝑁𝑁𝑁𝑁𝐴𝐴′ + 𝐢𝐢𝐴𝐴′ = 0. We start from a blank slate in the first period by considering a
country with no initial assets or debts internationally, that is, 𝑁𝑁𝑁𝑁𝑁𝑁 = 0. This means the
accumulation of future net foreign assets is solely due to the current account surplus in the
current period, 𝑁𝑁𝑁𝑁𝐴𝐴′ = 𝐢𝐢𝐢𝐢. These observations imply the country’s international budget
constraint is:
𝐢𝐢𝐢𝐢 + 𝐢𝐢𝐴𝐴′ = 0
This states that present and future current account balances must offset each other. A
country that accumulates assets today by saving will spend those savings in the future. A
country that borrows today must run a future current account surplus to repay its debts.
This budget constraint takes a more familiar form in terms of net exports. Note that 𝐢𝐢𝐢𝐢 =
𝑁𝑁𝑁𝑁 because 𝑁𝑁𝑁𝑁𝑁𝑁 = π‘Ÿπ‘Ÿ × π‘π‘π‘π‘π‘π‘ = 0, and 𝐢𝐢𝐴𝐴′ = 𝑁𝑁𝑋𝑋 ′ + 𝑁𝑁𝑁𝑁𝐼𝐼 ′ = 𝑁𝑁𝑋𝑋 ′ + π‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿπ‘Ÿ because 𝑁𝑁𝑁𝑁𝐴𝐴′ =
𝑁𝑁𝑁𝑁. Hence, (1 + π‘Ÿπ‘Ÿ)𝑁𝑁𝑁𝑁 + 𝑁𝑁𝑋𝑋 ′ = 0, and dividing both sides by 1 + π‘Ÿπ‘Ÿ implies:
𝑁𝑁𝑁𝑁 +
𝑁𝑁𝑁𝑁′
=0
1 + π‘Ÿπ‘Ÿ
This says that the present discounted value of current and future levels of net exports must
be zero. With no investment in physical capital, net exports are equal to 𝑁𝑁𝑁𝑁 = π‘Œπ‘Œ − 𝐢𝐢 − 𝐺𝐺
and so the country’s international budget constraint becomes the usual consolidated
household-and-government budget constraint in terms of aggregate consumption and
income:
𝐢𝐢 +
𝐢𝐢′
π‘Œπ‘Œ′
𝐺𝐺′
= οΏ½π‘Œπ‘Œ +
οΏ½ − �𝐺𝐺 +
οΏ½
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
324
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Underlying this combined constraint is a representative household choosing a consumption
plan (𝐢𝐢, 𝐢𝐢 ′ ) to get onto highest indifference curve subject to budget constraint:
𝐢𝐢 +
𝐢𝐢′
π‘Œπ‘Œ ′ − 𝑇𝑇 ′
= π‘Œπ‘Œ − 𝑇𝑇 +
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
Current and future income (π‘Œπ‘Œ, π‘Œπ‘Œ ′ ) are treated as exogenous here. We make the usual
assumptions on household preferences such as a desire for smooth consumption.
The government exogenously sets fiscal policy (𝐺𝐺, 𝐺𝐺 ′ ) and (𝑇𝑇, 𝑇𝑇 ′ ) subject to its budget
constraint:
𝐺𝐺 +
𝐺𝐺′
𝑇𝑇′
= 𝑇𝑇 +
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
To understand the gains from international trade, we first look at what happens in ‘autarky’
where there is no possibility of trade. In autarky, net exports and the current account are
necessarily zero, 𝑁𝑁𝑁𝑁 = 𝐢𝐢𝐢𝐢 = 0. With 𝑁𝑁𝑁𝑁 = 0 and 𝑁𝑁𝑋𝑋 ′ = 0, in equilibrium the economy
must have 𝐢𝐢 = π‘Œπ‘Œ − 𝐺𝐺 and 𝐢𝐢 ′ = π‘Œπ‘Œ ′ − 𝐺𝐺′. The economy’s real interest rate must therefore
adjust so that the chosen consumption plan (𝐢𝐢, 𝐢𝐢 ′ ) is consistent with these requirements,
just as seen earlier in Section 3.6. The real interest rate in this case is called the autarky real
interest rate π‘Ÿπ‘Ÿπ‘Žπ‘Ž .
Figure 10.1: Autarky real interest rate
Now consider a country open to international trade. We suppose that capital mobility is
perfect. Perfect capital mobility means there are no restrictions on capital flows. Domestic
residents can freely buy or sell foreign assets and foreigners can freely buy or sell domestic
assets. As domestic and foreign bonds are equivalent, the domestic real interest rate π‘Ÿπ‘Ÿ must
equal the foreign real interest rate π‘Ÿπ‘Ÿ ∗ and a small open economy is not big enough to
influence π‘Ÿπ‘Ÿ ∗ . This means the economy is effectively a participant in a perfectly competitive
world market for bonds with π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿ ∗ .
325
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Since the international budget constraint always passes through the point (π‘Œπ‘Œ − 𝐺𝐺, π‘Œπ‘Œ ′ − 𝐺𝐺 ′ )
that is the autarky equilibrium irrespective of the level of π‘Ÿπ‘Ÿ ∗ , it follows that whenever π‘Ÿπ‘Ÿ ∗ ≠
π‘Ÿπ‘Ÿπ‘Žπ‘Ž , households can get on to a higher indifference curve through trade than they can reach
in autarky. This is because the indifference curve at the autarky point must cut the budget
constraint because the indifference curve has gradient −(1 + π‘Ÿπ‘Ÿπ‘Žπ‘Ž ) there, while the budget
constraint has gradient −(1 + π‘Ÿπ‘Ÿ ∗ ) everywhere.
Figure 10.2: Gains from trade
This account of the gains from international trade also points towards some potential
determinants of the current account. By making an analogy with an individual’s life cycle
and consumption choice, we can see that the stage of development of a country can be one
long-run factor that might affect its current account.
For example, a country with high GDP due to extraction of an exhaustible resource is in a
similar position to a middle-aged person planning for retirement. It makes sense to save to
smooth consumption, which for the country as a whole means running a 𝐢𝐢𝐢𝐢 surplus. This
helps to explain the large current-account surpluses of oil exporters such as Saudi Arabia.
Figure 10.3: Long-run explanations for the current account
326
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
On the other hand, a developing country is in a similar position to a young person who
expects a much higher income in the future. It makes sense to borrow to smooth
consumption, which for the country as a whole means running a 𝐢𝐢𝐢𝐢 deficit. However, there
are many examples where developing countries are seen not to behave in this way, running
large current-account surpluses instead. This is related to the puzzle of why capital does not
flow from rich countries to poorer countries.
Although we have seen that there are gains from international trade, changes in the world
interest rate can increase or decrease the size of the gains from trade, although these
always remain positive relative to autarky. The argument is equivalent to how individual
savers and borrowers are affected by changes in the real interest rate. Recalling the
argument from Section 3.5, a higher interest rate implies a substitution effect reducing
current consumption and increasing 𝐢𝐢𝐢𝐢 for both. The higher interest rate makes a country
with a current-account surplus (a saver) better off but a country with a current-account
deficit (a borrower) worse off. The overall effect on the current account is ambiguous when
it is initially in surplus but, if it is initially in deficit, then the deficit should fall.
Figure 10.4: Increase in world interest rate
327
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Intuitively,
income risk
shocksharing
leads a country to lower its international
Box 10.1:a negative
International
lending or increase its international borrowing. Adjustment of the current account
Suppose a country faces a temporary negative shock to real GDP π‘Œπ‘Œ. In the twothus helps households faced with income shocks to smooth consumption. The
period model, current income falls but expected future income remains the same.
model predicts that the current account falls with a negative income shock, so it is
Figure 10.5 shows the leftward shift of the budget constraint passing through (π‘Œπ‘Œ −
predicted
to be a procyclical variable. Figure 10.6 shows data on the cyclicality of
𝐺𝐺, π‘Œπ‘Œ ′ − 𝐺𝐺 ′ ) as π‘Œπ‘Œ falls. The optimal choice of current consumption 𝐢𝐢 ∗ declines but by
the current account in the USA. Empirically, the US current account is actually
less than π‘Œπ‘Œ does. This reflects the desire of households to smooth consumption by
countercyclical, although
the USA is far from a small open economy and many
choosing a decline in 𝐢𝐢 ′∗ as well to permit a smaller fall in 𝐢𝐢. It follows that the
countries’ own business-cycle fluctuations correlate with what happens in the USA.
current account 𝐢𝐢𝐢𝐢 = (π‘Œπ‘Œ − 𝐺𝐺) − 𝐢𝐢 declines after the income shock leaving the
If global business cycles are correlated across countries but more volatile outside
economy with a smaller 𝐢𝐢𝐢𝐢 surplus or a larger 𝐢𝐢𝐢𝐢 deficit.
the USA, it is less surprising that the USA borrows more from the rest of the world
in booms and lends to the rest of the world (or borrows less) in recessions.
Figure 10.5: Negative shock to current income
Figure 10.6: Cyclicality of the US current account
328
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
329
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Box 10.2: The ‘twin deficits’
How might fiscal policy affect the current account? Can a government budget
deficit cause a current account deficit – the so-called ‘twin deficits’ phenomenon?
In the USA in the 1980s, a rising government budget deficit occurred alongside a
rising current account deficit. We will explore this connection and consider whether
the link between the two deficits depends on whether the budget deficit is due to
higher public spending or tax cuts.
Suppose there is a temporary increase in government expenditure 𝐺𝐺, with 𝐺𝐺 ′
remaining constant. This raises the tax burden faced by households, shifting the
present-value budget constraint to the left in Figure 10.7. Households choose to
reduce 𝐢𝐢 ∗ but, with consumption smoothing, households also reduce 𝐢𝐢 ′∗ , which
means that 𝐢𝐢 falls by less than 𝐺𝐺 rises. This implies that the current account 𝐢𝐢𝐢𝐢 =
𝑆𝑆 = (π‘Œπ‘Œ − 𝐺𝐺 ) − 𝐢𝐢 declines. The increase in government expenditure thus causes a
larger current account deficit.
Figure 10.7: Temporary increase in public expenditure
However, this prediction for the current account is not tightly linked to the budget
deficit. We obtain the same prediction for 𝐢𝐢𝐢𝐢 irrespective of whether current taxes
𝑇𝑇 rise to pay for higher 𝐺𝐺, in which case there is no change in the budget deficit, or
future taxes 𝑇𝑇′ increase and the budget deficit rises. National saving 𝑆𝑆 = π‘Œπ‘Œ − 𝐢𝐢 − 𝐺𝐺
falls by the same amount in both these cases.
This is because Ricardian equivalence implies that timing of taxes does not affect
households’ consumption choices taking as given the government’s spending plans
(𝐺𝐺, 𝐺𝐺 ′ ). Households’ budget constraint passes through the point (π‘Œπ‘Œ − 𝐺𝐺, π‘Œπ‘Œ ′ − 𝐺𝐺 ′ ),
which determines its position. This means private saving 𝑆𝑆 𝑝𝑝 will adjust to offset any
changes in public saving 𝑆𝑆 𝑔𝑔 due to taxes and, consequently, national saving 𝑆𝑆 =
𝑆𝑆 𝑝𝑝 + 𝑆𝑆 𝑔𝑔 is unaffected by the timing of taxes. The same is therefore true of the
330
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
current account. There will be no international financing of a budget deficit
resulting from a tax cut because domestic households will save the whole tax cut.
Allowing for the credit-market imperfections of Chapter 4 leads to Ricardian
equivalence failing and implies that 𝐢𝐢 rises after a tax cut. In this case, national
saving and the current account would fall following a tax cut.
10.3 Sovereign default
The earlier assumption of perfect capital mobility meant that investors were not concerned
with which country’s bonds they hold – all bonds were equivalent. But how can lenders to a
foreign government compel that government to repay its debts? The problem of contract
enforcement is more difficult than when creditor and debtor are in the same legal
jurisdiction. In this section, we study international borrowing subject to limited
commitment: default can be chosen rationally by a government in some circumstances.
In this analysis, to simplify the relationship between households and the government within
a country, we assume that private and public spending are perfect substitutes, so
households have preferences over total spending 𝐢𝐢 + 𝐺𝐺 instead of only 𝐢𝐢 and we
consolidate the nation’s private and public debts. The argument for the latter is that private
debt can often end up as public debt, for example, bailouts during a financial crisis.
Considering the usual two periods, let 𝐡𝐡 denote the nation’s consolidated private and public
debts to rest of world, including principal and interest, at the beginning of the current
period. Unlike earlier versions of the two-period model, the country does not start from
blank slate, with 𝐡𝐡 being determined by decisions made in the past.
Denoting total international debts (principal and interest) by 𝐡𝐡′ at the beginning of the
future period, issuance of bonds during the current period at interest rate π‘Ÿπ‘Ÿ is 𝐡𝐡′ /(1 + π‘Ÿπ‘Ÿ).
The budget equation for current debt issuance is:
𝐢𝐢 + 𝐺𝐺 = π‘Œπ‘Œ − 𝐡𝐡 +
𝐡𝐡′
1 + π‘Ÿπ‘Ÿ
If debts are repaid in full, future private and public spending is limited to income less total
debt in the future period because no further bond issuance is possible at that point:
𝐢𝐢 ′ + 𝐺𝐺 ′ = π‘Œπ‘Œ ′ − 𝐡𝐡′
Dividing both sides of this equation by 1 + π‘Ÿπ‘Ÿ and substituting in place of 𝐡𝐡′ /(1 + π‘Ÿπ‘Ÿ) above
leads to the present-value budget constraint:
𝐢𝐢 ′ + 𝐺𝐺 ′
π‘Œπ‘Œ ′
𝐢𝐢 + 𝐺𝐺 +
= π‘Œπ‘Œ − 𝐡𝐡 +
1 + π‘Ÿπ‘Ÿ
1 + π‘Ÿπ‘Ÿ
To analyse household and government choices, the two-period model diagram now has 𝐢𝐢 +
𝐺𝐺 and 𝐢𝐢 ′ + 𝐺𝐺′ on the horizontal and vertical axes. Indifference curves have the usual convex
331
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
shape. The present-value budget constraint passes through the point (π‘Œπ‘Œ − 𝐡𝐡, π‘Œπ‘Œ ′ ), where
initial debts 𝐡𝐡 are subtracted from current income π‘Œπ‘Œ.
Now suppose that a nation has the option of (full) default in either the current or future
periods. If a default occurs in the future period then the country wipes out debt 𝐡𝐡′ but
incurs a penalty 𝑣𝑣, the effect of which is measured as equivalent to a reduction in output π‘Œπ‘Œ′.
Total spending is then 𝐢𝐢 ′ + 𝐺𝐺 ′ = π‘Œπ‘Œ ′ − 𝑣𝑣. The penalty 𝑣𝑣 can either be direct losses or
disruption from international creditors trying to enforce their claims, or a proxy in a twoperiod model for the loss from being excluded from world credit markets further in future.
The nation can also default in the current period. In this case, the country wipes out initial
debts 𝐡𝐡 but suffers exclusion from world credit markets in the future period as well as the
same future-period penalty 𝑣𝑣 described above. The default has the effect of returning the
country to autarky, with total expenditure limited by income net of penalties in each time
period, that is, 𝐢𝐢 + 𝐺𝐺 = π‘Œπ‘Œ and 𝐢𝐢 ′ + 𝐺𝐺 ′ = π‘Œπ‘Œ ′ − 𝑣𝑣.
Default is a rational choice for the nation if it allows the representative household to reach a
higher indifference curve. In making the default decision, suppose that the government acts
in the interests of the representative household.
Since a nation can choose to default, lenders will only make or roll over current loans if no
default is expected to occur in the future. If there is no default in the second period, the
representative household benefits from expenditure 𝐢𝐢 ′ + 𝐺𝐺 ′ = π‘Œπ‘Œ ′ − 𝐡𝐡′, while a default
allows it to benefit from 𝐢𝐢 ′ + 𝐺𝐺 ′ = π‘Œπ‘Œ ′ − 𝑣𝑣. Comparing the two, it follows that a futureperiod default is rational if 𝐡𝐡′ > 𝑣𝑣. Consequently, rational lenders make loans in the current
period only if 𝐡𝐡′ ≤ 𝑣𝑣. With this restriction, the current-period budget equation implies:
𝐢𝐢 + 𝐺𝐺 ≤ π‘Œπ‘Œ − 𝐡𝐡 +
𝑣𝑣
1 + π‘Ÿπ‘Ÿ
This is referred to as a limited-commitment constraint on borrowing. It takes the form of a
borrowing constraint analogous to those studied in Chapter 4. The present-value budget
constraint is truncated at 𝐢𝐢 + 𝐺𝐺 = π‘Œπ‘Œ − 𝐡𝐡 + 𝑣𝑣/(1 + π‘Ÿπ‘Ÿ), where the amount that expenditure
can exceed income is 𝑣𝑣 ⁄(1 + π‘Ÿπ‘Ÿ) − 𝐡𝐡. Subject to the limited-commitment constraint, if
lending occurs in the current period, debts will be repaid in the future if nothing
unexpectedly changes. When the limited-commitment constraint binds, 𝐢𝐢 ′ + 𝐺𝐺 ′ = π‘Œπ‘Œ ′ − 𝑣𝑣
because 𝐡𝐡′ = 𝑣𝑣 makes the nation indifferent between defaulting or not.
Since the possibility of default explains why a country faces a borrowing constraint, this
analysis can be used to understand why countries might fail to use international financial
markets to smooth consumption by as much as our earlier analysis would suggest.
Now consider the default decision in the current period, taking as given past lending
decisions 𝐡𝐡 (which might turn out ex post to have been unwise if default is rational). If a
default occurs, the country has no access to international financial markets and only one
expenditure point (π‘Œπ‘Œ, π‘Œπ‘Œ′ − 𝑣𝑣) is feasible in autarky. Supposing 𝐡𝐡 ≤ 𝑣𝑣/(1 + π‘Ÿπ‘Ÿ), the default
outcome lies to the left of the point where the limited-commitment constraint binds. It
follows that default is not chosen in this case.
332
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Figure 10.8: Case where no default is chosen
What happens if 𝐡𝐡 > 𝑣𝑣/(1 + π‘Ÿπ‘Ÿ)? In this case, the autarky point A lies to the right of the
point B where the limited-commitment constraint binds, so it might be in the nation’s
interest to default. If limited commitment constraint would bind in the absence of default, it
can be seen that default is definitely preferable when 𝐡𝐡 > 𝑣𝑣 ⁄(1 + π‘Ÿπ‘Ÿ). If the limitedcommitment constraint is not binding with the country not wanting to borrow the
maximum amount, then there is the value of being able to access world capital markets to
consider as well as the present value of the penalty 𝑣𝑣.
Figure 10.9: Case where default is chosen
This analysis points to factors that make default more likely. First, it is immediate that a
greater debt burden 𝐡𝐡 increases the likelihood of default, all else being equal. Default is also
more likely when the losses or penalties 𝑣𝑣 from default are small. For example, bailouts or
debt forgiveness might allow a nation to return to credit markets after a short spell of
exclusion.
333
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
For a country not already borrowing the maximum amount consistent with the limitedcommitment constraint, a negative shock to income π‘Œπ‘Œ makes default more likely. However,
π‘Œπ‘Œ becomes irrelevant to the default decision once the maximum borrowing amount is
reached because there is no option value of being able to borrow more to smooth
consumption.
A final crucial factor in the default decision is the interest rate π‘Ÿπ‘Ÿ faced by a country. If this is
high, default is more likely because a greater sacrifice of expenditure is necessary to roll
over existing debts at a higher interest rate. Countries with large debt burdens might not
have an incentive to default as long as interest rates remain low.
The link between interest rates and default decisions points to the possibility of self-fulfilling
expectations of default. Suppose international lenders believe there is a positive probability
of a country defaulting. If π‘Ÿπ‘Ÿ ∗ is the interest rate on a risk-free asset (a bond issued by a
country that will not default), lenders are only willing to lend to the country with default risk
if they receive a risk premium. This risk premium is a spread between π‘Ÿπ‘Ÿ and π‘Ÿπ‘Ÿ ∗ that depends
on the perceived probability of default.
Self-fulfilling defaults could occur because believing the probability of default is high drives
up π‘Ÿπ‘Ÿ, making it more likely the condition for default will be satisfied. On the other hand, if
there is a low default probability, π‘Ÿπ‘Ÿ is low and default is not chosen, assuming other
fundamentals do not point to default. This logic shows that lack of commitment can lead to
self-fulfilling defaults in some circumstances.
10.4 Open-economy real dynamic model
The simple examples with exogenous income and fiscal policy illustrate the gains from
international trade in assets and how the current account responds to shocks. These
findings apply more broadly in economies where GDP is endogenous, for example, where
firms hire labour supplied by households and invest in capital as we saw in the dynamic
macroeconomic model from Chapter 3. By adding international trade to the dynamic
macroeconomic model, we can also investigate whether GDP responds differently to shocks
in an open economy.
The closed-economy dynamic macroeconomic model developed in Section 3.12 explains
real GDP π‘Œπ‘Œ and the real interest rate π‘Ÿπ‘Ÿ by finding the intersection of the output demand π‘Œπ‘Œ 𝑑𝑑
and output supply π‘Œπ‘Œ 𝑠𝑠 curves. For comparison, we denote by π‘Œπ‘Œπ‘Žπ‘Ž and π‘Ÿπ‘Ÿπ‘Žπ‘Ž the predictions of the
model if the economy were in autarky, i.e. cut off from international trade. Output demand
in a closed economy is given by π‘Œπ‘Œπ‘Žπ‘Žπ‘‘π‘‘ = 𝐢𝐢 𝑑𝑑 + 𝐼𝐼 𝑑𝑑 + 𝐺𝐺. Unlike the new Keynesian model from
Chapter 8, we assume flexible goods prices here, which means we are examining a purely
‘real’ model with no monetary issues. We consider a model with sticky prices later in the
chapter.
10.4.1 Balance-of-payments equilibrium, capital flows and net exports
There are two important differences when we analyse an open economy using our dynamic
macroeconomic model. First, net exports 𝑁𝑁𝑁𝑁 are added as an additional component of
output demand:
334
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
π‘Œπ‘Œ 𝑑𝑑 = 𝐢𝐢 𝑑𝑑 + 𝐼𝐼 𝑑𝑑 + 𝐺𝐺 + 𝑁𝑁𝑁𝑁
Output demand is π‘Œπ‘Œ 𝑑𝑑 = π‘Œπ‘Œπ‘Žπ‘Žπ‘‘π‘‘ + 𝑁𝑁𝑁𝑁, the sum of domestic demand π‘Œπ‘Œπ‘Žπ‘Žπ‘‘π‘‘ , referred to as
absorption and net exports 𝑁𝑁𝑁𝑁. All else being equal, changes in net exports cause shifts of
the π‘Œπ‘Œ 𝑑𝑑 curve. The second important difference is that perfect mobility of capital in an open
economy aligns the domestic real interest rate π‘Ÿπ‘Ÿ with the foreign interest rate π‘Ÿπ‘Ÿ ∗ . This is a
necessary condition for balance-of-payments equilibrium 𝐡𝐡𝐡𝐡 = 𝐢𝐢𝐢𝐢 + 𝐹𝐹𝐹𝐹 = 0. However, if
π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿ ∗ for a small open economy that takes π‘Ÿπ‘Ÿ ∗ as given, what ensures the goods market is in
equilibrium if the domestic real interest rate π‘Ÿπ‘Ÿ cannot adjust to where the π‘Œπ‘Œ 𝑑𝑑 and π‘Œπ‘Œ 𝑠𝑠 curves
intersect? The mechanism is now that capital flows occur until π‘Œπ‘Œ 𝑑𝑑 and π‘Œπ‘Œ 𝑠𝑠 cross at π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿ ∗.
Adjustment of 𝐹𝐹𝐹𝐹 implies changes in the current account 𝐢𝐢𝐢𝐢 = −𝐹𝐹𝐹𝐹 and, hence, net
exports, which shift the π‘Œπ‘Œ 𝑑𝑑 curve.
Consider the example in Figure 10.10. The hypothetical closed-economy output demand
curve π‘Œπ‘Œπ‘Žπ‘Žπ‘‘π‘‘ intersects π‘Œπ‘Œ 𝑠𝑠 at π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿπ‘Žπ‘Ž , which is above the foreign real interest rate π‘Ÿπ‘Ÿ ∗ . In an open
economy with perfect capital mobility, π‘Ÿπ‘Ÿ would fall to π‘Ÿπ‘Ÿ ∗ on the horizontal 𝐡𝐡𝐡𝐡 line
representing balance-of-payments equilibrium. At π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿ ∗ , the output demand curve π‘Œπ‘Œπ‘Žπ‘Žπ‘‘π‘‘ is to
the right of the output supply curve π‘Œπ‘Œ 𝑠𝑠 . With domestic demand exceeding domestic
production, imports rise, reducing net exports 𝑁𝑁𝑁𝑁, which shifts the open-economy output
demand curve π‘Œπ‘Œ 𝑑𝑑 to the left until it intersects π‘Œπ‘Œ 𝑠𝑠 at π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿ ∗ . The fall in the current account
𝐢𝐢𝐢𝐢 is matched by capital inflows that raise 𝐹𝐹𝐹𝐹.
Figure 10.10: Goods market in an open economy
In what follows, we simplify matters by assuming 𝑁𝑁𝑁𝑁𝑁𝑁 = 0, which means that the current
account 𝐢𝐢𝐢𝐢 is the same as net exports 𝑁𝑁𝑁𝑁. We will also consider an economy starting from
a current account close to balance (𝐢𝐢𝐢𝐢 is close to zero). This means that households hold
assets equal to all domestic assets and the economy is neither a net saver or borrower. This
allows us to continue to ignore income effects arising from changes in real wages 𝑀𝑀 and the
real interest rate π‘Ÿπ‘Ÿ (see the discussion in Section 3.11). The consequence of these
simplifications is that the labour demand 𝑁𝑁 𝑑𝑑 and labour supply 𝑁𝑁 𝑠𝑠 curves, and the output
supply curve π‘Œπ‘Œ 𝑠𝑠 , continue to behave as they do in a closed economy. The only change in an
335
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
open economy is the presence of net exports as a component of π‘Œπ‘Œ 𝑑𝑑 . Hence, the adjustment
to balance-of-payments equilibrium (π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿ ∗ ) in an open economy occurs solely through
shifts of the output demand curve as seen in Figure 10.10.
10.4.2 Examples
Let us consider two types of shock to illustrate how the open-economy real dynamic model
works. We proceed by finding the effects of a shock in a closed economy and then ask what
adjustment of net exports (and, hence, what shift of π‘Œπ‘Œ 𝑑𝑑 ) is required to achieve balance-ofpayments equilibrium in an open economy.
First, consider a temporary negative supply shock (a decline in current TFP 𝑧𝑧) of the kind we
used to explain business cycles using the RBC approach from Section 8.8. In a closed
economy, this shock shifts both the output supply π‘Œπ‘Œ 𝑠𝑠 and output demand π‘Œπ‘Œ 𝑑𝑑 curves to the
left but the leftward shift of π‘Œπ‘Œ 𝑑𝑑 is smaller than that of π‘Œπ‘Œ 𝑠𝑠 owing to the desire for
consumption smoothing. This led the equilibrium real interest rate to rise in a closed
economy. In an open economy, the real interest rate cannot rise above π‘Ÿπ‘Ÿ ∗ and balance-ofpayments equilibrium (π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿ ∗ ) is restored by net exports 𝑁𝑁𝑁𝑁 declining instead, shifting π‘Œπ‘Œ 𝑑𝑑
further to the left to match the shift of π‘Œπ‘Œ 𝑠𝑠 . This analysis is illustrated in Figure 10.11. The
negative TFP shock results in a larger decline in output in an open economy as households
smooth consumption by importing more and borrowing from the rest of the world.
Figure 10.11: Temporary TFP shock in an open economy
The second example is a temporary fiscal stimulus, specifically, a temporary increase in
government spending 𝐺𝐺. In a closed economy, this policy change was analysed in Box 4.2.
Both the output demand π‘Œπ‘Œ 𝑑𝑑 and output supply π‘Œπ‘Œ 𝑠𝑠 curves shift to the right but π‘Œπ‘Œ 𝑑𝑑 shifts by
more, leading to an increase in π‘Ÿπ‘Ÿ in a closed economy. In an open economy, balance-ofpayments equilibrium is restored by a decline of 𝑁𝑁𝑁𝑁, shifting π‘Œπ‘Œ 𝑑𝑑 to the left until its overall
shift matches that of π‘Œπ‘Œ 𝑠𝑠 . This is shown in Figure 10.12. The fiscal stimulus raises real GDP by
a smaller amount in an open economy because some of the additional expenditure is spent
on imports rather than buying more goods produced domestically.
336
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Figure 10.12: Temporary fiscal stimulus in an open economy
Box 10.3: Global imbalances
In the 1990s and 2000s, large current-account imbalances relative to GDP open up
in several countries. These became known as ‘global imbalances’, the most
noteworthy of which were the large current-account deficits of the USA and the
large current-account surpluses of China. Here, we use the international real
dynamic model to investigate some possible causes of such imbalances.
First, suppose that good news about future productivity growth creates
expectations of a high return on investments in country. We represent this in the
model by higher 𝑧𝑧′, which raises 𝑀𝑀𝑃𝑃𝐾𝐾′ . This might fit the case of the US economy in
the 1990s, which had a spell of high productivity growth and experienced an
investment boom. In a closed economy, this would shift the output demand curve
to the right (increasing 𝐼𝐼 𝑑𝑑 ) and the output supply curve to the left (when the shock
affects future 𝑧𝑧′ but not current 𝑧𝑧). The closed-economy effects are essentially the
opposite of those studied in Box 3.6, with the real interest rate rising
unambiguously.
In an open economy, adjustment to balance-of-payments equilibrium requires a
leftward shift of π‘Œπ‘Œ 𝑑𝑑 to intersect π‘Œπ‘Œ 𝑠𝑠 at π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿ ∗ and, hence, a decline in net exports.
The economy runs a current-account deficit as borrowing from the rest of world
finances the investment boom. This case is illustrated in Figure 10.13.
Our second example considers a country with underdeveloped financial markets.
We interpret this as a larger spread between the safe real interest rate π‘Ÿπ‘Ÿ and the
interest rate π‘Ÿπ‘Ÿπ‘™π‘™ on domestic lending. As we saw in Section 4.5, a larger spread can
reflect a more severe asymmetric-information problem in underdeveloped financial
markets. The effect of a larger spread between π‘Ÿπ‘Ÿ and π‘Ÿπ‘Ÿπ‘™π‘™ is that the output demand
curve is further to the left than otherwise (for example, through lower 𝐼𝐼 𝑑𝑑 )
337
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
because π‘Ÿπ‘Ÿπ‘™π‘™ is higher for each real interest rate π‘Ÿπ‘Ÿ on the vertical axis. In a closed
economy, this would lead to a lower domestic real interest rate π‘Ÿπ‘Ÿ in equilibrium.
Figure 10.13: Country with a high return on capital
In an open economy, balance-of-payments equilibrium is restored by a rightward
shift of π‘Œπ‘Œ 𝑑𝑑 with net exports rising as illustrated in Figure 10.14. The economy runs a
current account surplus with lending to the rest of the world substituting for
domestic lending. This is because domestic lending is hampered by more severe
asymmetric information problems compared to foreign financial markets. This case
arguably fits the experience of some developing economies that have run
persistently large current-account surpluses.
Figure 10.14: Country with limited financial development
338
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Box 10.4: Capital controls
We have assumed throughout this chapter that capital mobility is perfect, which
effectively makes the financial account 𝐹𝐹𝐹𝐹 of the balance of payments extremely
sensitive to the domestic-foreign interest rate gap π‘Ÿπ‘Ÿ − π‘Ÿπ‘Ÿ ∗ . This leads to π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿ ∗ in
balance-of-payments equilibrium.
One reason for less-than-perfect capital mobility is that there are restrictions in
place that prevent some capital flows, such as limits on foreign purchases of
domestic assets or domestic purchases of foreign assets. These are known as
capital controls and have the effect of making 𝐹𝐹𝐹𝐹 less sensitive to π‘Ÿπ‘Ÿ − π‘Ÿπ‘Ÿ ∗ .
To see the effects of capital controls, we consider the extreme case where all
capital flows are blocked. With a zero financial account (𝐹𝐹𝐹𝐹 = 0), balance-ofpayments equilibrium then requires 𝐢𝐢𝐢𝐢 = 0. Hence, in the open-economy real
dynamic model, the shifts of π‘Œπ‘Œ 𝑑𝑑 through net exports 𝑁𝑁𝑁𝑁 that ensured π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿ ∗ are
prevented by the capital controls. This most extreme form of capital controls thus
leaves the economy at the same outcomes as if it were in autarky. It can be
analysed as if it were a closed economy with the domestic real interest rate
determined by the intersection of π‘Œπ‘Œ 𝑑𝑑 and π‘Œπ‘Œ 𝑠𝑠 with no adjustment of 𝑁𝑁𝑁𝑁. More
generally, capital controls that are less strict than this result in outcomes
somewhere between the autarky and perfect capital mobility extremes.
Figure 10.15: Negative supply shock with capital controls
Taking the example in Section 10.4 of a negative supply shock, Figure 10.15 shows
that GDP π‘Œπ‘Œ falls by less with full capital controls and the real interest rate π‘Ÿπ‘Ÿ rises.
Although GDP falls by less, households are better off with perfect capital mobility
because they can smooth consumption with cheaper international borrowing at
interest rate π‘Ÿπ‘Ÿ ∗ . The high interest rate π‘Ÿπ‘Ÿ is a reflection of the difficulties of
producing domestically after the negative supply shock.
339
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
10.5 The terms of trade
Our analysis in this chapter assumed all goods are homogeneous to emphasise the gains
available from purely intertemporal trade between countries. It was also possible to
develop a model of the determinants of net exports even though exports and imports were
the same type of good. This is because the current account is the opposite of the financial
account in balance-of-payments equilibrium, so international saving or borrowing shows up
as trade surpluses or deficits. Nonetheless, trade in different types of goods is obviously a
feature of the real world and it is important to account for this some contexts, particularly
when we consider international competitiveness later in the chapter.
We now allow for different types of goods to be produced in the domestic and foreign
economies. We denote the relative price of imports in terms of domestically produced
goods (some of which are exported) by π‘žπ‘ž. Supposing all domestically produced goods are
the same, this is a measure of the terms of trade, the relative price of imports to exports. As
defined, higher π‘žπ‘ž means imports become more expensive, which improves the
competitiveness of exports.
The quantity (or volume) of exports is denoted by 𝑋𝑋 and the quantity of imports by 𝑍𝑍. The
value of net exports in terms of domestic goods is:
𝑁𝑁𝑁𝑁 = 𝑋𝑋 − π‘žπ‘žπ‘žπ‘ž
The spending patterns of domestic and foreign consumers on different goods depend on
relative price of imports π‘žπ‘ž. The quantity of exports 𝑋𝑋 demanded increases with π‘žπ‘ž as
competitiveness improves and the quantity of imports 𝑍𝑍 demanded decreases with π‘žπ‘ž as
domestically produced goods become more competitive. We assume the value of net
exports 𝑁𝑁𝑁𝑁 = 𝑋𝑋 − π‘žπ‘žπ‘žπ‘ž rises with π‘žπ‘ž overall, that is, the volume effects of π‘žπ‘ž on 𝑋𝑋 and 𝑍𝑍
dominate import value effect (the direct negative effect of π‘žπ‘ž in the equation for 𝑁𝑁𝑁𝑁). This
occurs when the Marshall-Lerner condition is satisfied: the sum of the absolute value of the
elasticities of 𝑋𝑋 and 𝑍𝑍 with respect to π‘žπ‘ž is greater than 1. The relationship between net
exports and the term of trade is depicted in Figure 10.16.
Figure 10.16: Net exports and the terms of trade
340
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Accounting for domestic and foreign output being different types of goods does not
fundamentally affect the determination of net exports 𝑁𝑁𝑁𝑁 in the two-period model of the
current account from Section 10.2. Given the equilibrium value of 𝑁𝑁𝑁𝑁, the terms of trade π‘žπ‘ž
adjusts to the level that is consistent with it according to Figure 10.16. For example, the
temporary increase in government spending 𝐺𝐺 from Box 10.2 to leads to a fall in net exports
𝑁𝑁𝑁𝑁 and now to a lower terms of trade π‘žπ‘ž, which represents a real appreciation of the terms
of trade, a loss of competitiveness of domestic goods.
In the full real dynamic model, the output demand curve π‘Œπ‘Œ 𝑑𝑑 continues to shift with 𝑁𝑁𝑁𝑁
following shocks to the economy. However, a higher value of 𝑁𝑁𝑁𝑁, associated with a
rightward shift of π‘Œπ‘Œ 𝑑𝑑 requires higher π‘žπ‘ž. This real depreciation of the terms of trade reduces
the purchasing power of the real wage 𝑀𝑀 (which is in terms of domestic goods but
households also want to buy imports), implying a leftward shift of labour supply 𝑁𝑁 𝑠𝑠 and
causing a leftward shift of the output supply curve π‘Œπ‘Œ 𝑠𝑠 . Qualitatively, this does not change
the direction of our earlier results but implies smaller adjustments of 𝑁𝑁𝑁𝑁 and GDP π‘Œπ‘Œ are
needed to restore balance-of-payments equilibrium π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿ ∗ . Hence, we shall usually ignore
this effect in practice.
10.6 Exchange rates
Our analysis of the current account and GDP in an open economy has so far made no
reference to exchange rates or monetary issues more generally (the terms of trade in
Section 10.5 is a relative price of two goods, a real variable). An exchange rate is a relative
price of different currencies in the foreign-exchange market. Exchange rates specify the
international values of currencies, so in order to study them, we must reintroduce money
into our analysis. We will consider both a model of the ‘long run’ with flexible prices based
on the treatment of money in Chapter 6 and a ‘short run’ model with sticky prices based on
the new Keynesian approach to macroeconomics from Chapter 8.
Suppose there are two currencies in the world, one for the domestic economy and one for
the foreign economy. The price of domestic goods in terms of domestic currency is 𝑃𝑃 and
the price of foreign goods in terms of foreign currency is 𝑃𝑃∗ . The exchange rate 𝑒𝑒 between
the domestic and foreign currencies, or to be precise, the nominal exchange rate is defined
as the domestic-currency price of a unit of foreign currency. Note that a higher value of 𝑒𝑒 is
a depreciation of the domestic currency with this definition.
Given the nominal exchange rate 𝑒𝑒 and the prices 𝑃𝑃 and 𝑃𝑃∗ we can calculate the implied real
exchange rate, denoted by π‘žπ‘ž. This is defined as the price of a unit of foreign goods in terms
of domestic goods. The real exchange rate π‘žπ‘ž is calculated by comparing the domestic price
𝑃𝑃 of domestic goods to the price of foreign goods 𝑃𝑃∗ converted into domestic currency 𝑒𝑒𝑃𝑃 ∗:
π‘žπ‘ž =
𝑒𝑒𝑃𝑃∗
𝑃𝑃
341
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
10.6.1 Purchasing power parity
In the long run with flexible prices, one theory of exchange rates builds on the ‘law of one
price’. This states that identical goods should sell at the same price in different locations
after adjusting for prices being quoted in different currencies.
We develop this theory going back to the case where the goods bought and sold in each
country around the world are homogeneous. If a unit of goods can be bought in foreign
markets at price 𝑃𝑃∗ , this can be imported at cost 𝑒𝑒𝑃𝑃 ∗ in terms of domestic currency.
Therefore, the flexible domestic price 𝑃𝑃 should adjust so that it and exchange rate 𝑒𝑒 satisfy
𝑃𝑃 = 𝑒𝑒𝑃𝑃∗ , implying a real exchange rate of π‘žπ‘ž = 1, so the price of goods is the same in both
countries after adjusting for different currencies. In this case, we say that purchasing power
parity (PPP) holds. Note that the argument treats all goods as tradable with no shipping
costs or tariffs, and assumes perfectly competitive markets.
10.6.2 Equilibrium exchange rates in the long run
We now analyse the equilibrium exchange rate with flexible prices, assuming that PPP holds.
The assumption of flexible prices and PPP makes this case more plausible for the long run
than the short run.
We treat the real equilibrium of economy as independent of monetary policy because prices
are flexible. This is exactly (or approximately) justifiable in some circumstances as discussed
in Chapter 6. For a small open economy with perfect capital mobility, π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿ ∗ and real GDP π‘Œπ‘Œ
is found using the real dynamic model from Section 10.4, which is at the point on the π‘Œπ‘Œ 𝑠𝑠
curve where π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿ ∗ (with 𝑁𝑁𝑁𝑁 adjusting so π‘Œπ‘Œ 𝑑𝑑 intersects π‘Œπ‘Œ 𝑠𝑠 at that point).
For monetary variables, prices and exchange rates must PPP, so 𝑃𝑃 = 𝑒𝑒𝑃𝑃∗ and the foreign
price level 𝑃𝑃∗ is exogenous in a small open economy. There is also the Fisher equation for
the nominal interest rate 𝑖𝑖 = π‘Ÿπ‘Ÿ ∗ + πœ‹πœ‹, where πœ‹πœ‹ is the domestic inflation rate and π‘Ÿπ‘Ÿ ∗ is the
foreign and domestic real interest rate.
Since the nominal exchange rate 𝑒𝑒 gives the value of one currency in terms of another, we
need to describe monetary policy to determine the equilibrium exchange rate. We start
with the case where there is monetary policy autonomy, meaning that the domestic central
bank is not required to intervene in the foreign-exchange market. We say there is a ‘floating’
or ‘flexible’ exchange rate. In contrast, in a fixed-exchange rate system, we will see that
domestic monetary policy lacks autonomy.
Monetary policy is specified as an exogenous path of the money supply 𝑀𝑀 𝑠𝑠 = 𝑀𝑀. We can
analyse the determinants of the price level 𝑃𝑃 and inflation rate πœ‹πœ‹ using the approach from
Chapter 6. Given the equilibrium price level 𝑃𝑃 resulting from domestic monetary policy, and
the foreign price level 𝑃𝑃∗ , the exchange rate 𝑒𝑒 is found from the PPP equation as follows:
𝑒𝑒 =
𝑃𝑃
𝑃𝑃∗
Another way to understand this is to consider that the exchange rate 𝑒𝑒 adjusts to clear the
money market, with PPP then determining the price level 𝑃𝑃 = 𝑒𝑒𝑃𝑃∗ . Substituting 𝑃𝑃 = 𝑒𝑒𝑃𝑃∗
and 𝑖𝑖 = π‘Ÿπ‘Ÿ ∗ + πœ‹πœ‹ into the money demand curve 𝑀𝑀𝑑𝑑 = 𝑃𝑃𝑃𝑃(π‘Œπ‘Œ, 𝑖𝑖) derived in Section 6.4, the
342
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
money demand function in terms of the exchange rate is 𝑀𝑀𝑑𝑑 = 𝑒𝑒𝑃𝑃∗ 𝐿𝐿(π‘Œπ‘Œ, π‘Ÿπ‘Ÿ ∗ + πœ‹πœ‹). Given π‘Œπ‘Œ,
π‘Ÿπ‘Ÿ ∗ and πœ‹πœ‹, this rises proportionally with 𝑒𝑒. Higher 𝑒𝑒 means a depreciation of the domestic
currency that raises domestic prices according to PPP, which increases the need for money
to carry out a given real quantity of transactions.
Figure 10.17 plots the money demand curve with the nominal exchange rate 𝑒𝑒 on the
vertical axis. The demand curve is upward sloping, analogous to the upward-sloping money
demand function in terms of the price level 𝑃𝑃 that were seen in Section 6.4. The money
supply curve is vertical because the quantity of money 𝑀𝑀 𝑠𝑠 is exogenous. The equilibrium
nominal exchange rate 𝑒𝑒 is found at the point of intersection between 𝑀𝑀𝑑𝑑 and 𝑀𝑀 𝑠𝑠 .
Figure 10.17: Equilibrium exchange rate
10.7 Exchange-rate regimes
A government can choose to let the exchange rate of its currency float – a ‘flexible’
exchange-rate regime – or intervene to fix the exchange rate. There are also intermediate
exchange-rate regimes between these extremes.
Under a flexible-exchange rate regime, the market determines the exchange rate 𝑒𝑒, which
will depend on the stance of monetary policy among other things. With a fixed-exchange
rate regime, there is a target value 𝑒𝑒̅ of the exchange rate and the government or central
bank intervenes in the foreign-exchange market to ensure 𝑒𝑒 = 𝑒𝑒̅.
A fixed exchange-rate regime depends on the government or central bank maintaining
sufficient foreign-exchange reserves (foreign-currency assets) to support the value of the
domestic currency if needed and being willing to accumulate more foreign-exchange
reserves if necessary. Foreign-exchange market interventions lead to the domestic money
supply 𝑀𝑀 𝑠𝑠 being endogenous.
10.7.1 Intervention in the foreign-exchange market
Suppose foreign-exchange market intervention is done by the central bank. The central
bank’s liabilities are domestic currency in circulation – the supply of fiat money 𝑀𝑀 𝑠𝑠 – and it
holds the following types of assets:
343
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
•
•
Domestic government bonds (acquired through open-market operations)
Foreign-exchange reserves (acquired through foreign-exchange market
intervention).
A sale of foreign currency by the central bank to purchase domestic currency shrinks the
stock of foreign-exchange reserves and the domestic money supply 𝑀𝑀 𝑠𝑠 . The central bank
must have acquired sufficient foreign-currency assets in the past to do this. A sale of
domestic currency by the central bank to purchase foreign currency expands 𝑀𝑀 𝑠𝑠 and the
stock of foreign-exchange reserves. In principle, there is no limit to the extent of such sales
of domestic currency.
It is possible for the central bank to ‘sterilise’ its foreign-exchange market interventions
through open-market operations in the domestic bond market that leave 𝑀𝑀 𝑠𝑠 unchanged, for
example, buying government bonds with newly created money after shrinking the money
supply after selling foreign-exchange reserves. If this does not happen, the intervention is
said to be ‘unsterilised’ and results in a change in the domestic money supply 𝑀𝑀 𝑠𝑠 .
10.7.2 A shock to foreign prices
We illustrate how foreign-exchange market intervention is necessary to support a fixed
exchange rate following a shock. For example, suppose there is a rise in prices 𝑃𝑃∗ in foreign
markets. The equation 𝑒𝑒 = 𝑃𝑃/𝑃𝑃∗ indicates this implies a fall in the exchange rate 𝑒𝑒
consistent with PPP, so there is pressure for the domestic currency to appreciate. A sale of
domestic currency is necessary to maintain the fixed exchange rate 𝑒𝑒 = 𝑒𝑒̅.
In the money-market diagram shown in Figure 10.18, a rise in 𝑃𝑃∗ pivots the money demand
curve 𝑀𝑀𝑑𝑑 rightwards. An increase of the money supply 𝑀𝑀 𝑠𝑠 is needed to keep the
equilibrium exchange rate at 𝑒𝑒 = 𝑒𝑒̅, which corresponds to a purchase of foreign-currency
assets by the central bank by selling domestic currency. Observe that sterilisation of the
foreign-exchange intervention would not be effective – only an unsterilised intervention
shifts 𝑀𝑀 𝑠𝑠 and achieves 𝑒𝑒 = 𝑒𝑒̅. With the exchange rate 𝑒𝑒 = 𝑒𝑒̅ fixed, higher 𝑃𝑃 ∗ implies a higher
domestic price level 𝑃𝑃 = 𝑒𝑒𝑃𝑃∗ . The fixed exchange rate causes inflation in foreign markets to
be ‘imported’ to the domestic economy.
344
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Figure 10.18: Foreign exchange intervention with rise in foreign price level
We now look at what difference it makes if the country has a flexible exchange rate. As
before, the increase in foreign prices 𝑃𝑃∗ implies fall in exchange rate 𝑒𝑒 = 𝑃𝑃/𝑃𝑃∗ consistent
with PPP, leading to pressure for an appreciation of the domestic currency. This is shown in
Figure 10.19, where the rise of 𝑃𝑃∗ pivots 𝑀𝑀𝑑𝑑 rightwards. With domestic monetary policy
unchanged, i.e. with no shift of the money supply curve 𝑀𝑀 𝑠𝑠 , the exchange rate falls from 𝑒𝑒1
to 𝑒𝑒2 . There is an appreciation of the domestic currency and no foreign-exchange
intervention occurs to prevent this. Observe that the nominal exchange rate 𝑒𝑒 adjusts so
that 𝑃𝑃 = 𝑒𝑒𝑃𝑃 ∗ is unaffected (money-market equilibrium 𝑀𝑀 𝑠𝑠 = 𝑀𝑀𝑑𝑑 = 𝑃𝑃𝑃𝑃(π‘Œπ‘Œ, π‘Ÿπ‘Ÿ ∗ + πœ‹πœ‹) is
reached at the same price level 𝑃𝑃). Inflation in foreign markets does not cause a rise in
domestic prices 𝑃𝑃 under a flexible exchange-rate regime.
Figure 10.19: Flexible exchange rate with rise in foreign price level
10.7.3 Monetary policy autonomy with a flexible exchange rate
With a flexible exchange rate policy, the equilibrium value of the exchange rate 𝑒𝑒 is left to
float, leaving the country free to choose its own monetary policy. For example, suppose the
central bank chooses to make a one-off permanent increase of the money supply 𝑀𝑀 𝑠𝑠 . The
effects of this are shown in Figure 10.20.
345
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Figure 10.20: Exchange rate with increase in the money supply
The money supply curve 𝑀𝑀 𝑠𝑠 shifts to the right, and the equilibrium exchange rate 𝑒𝑒 rises in
proportion to the increase in 𝑀𝑀 𝑠𝑠 , implying a depreciation of the domestic currency.
Purchasing power parity 𝑃𝑃 = 𝑒𝑒𝑃𝑃∗ implies the price level 𝑃𝑃 also rises in proportion to 𝑀𝑀 𝑠𝑠 . As
no further inflation expected, there is no shift of 𝑀𝑀𝑑𝑑 through higher 𝑖𝑖 = π‘Ÿπ‘Ÿ ∗ + πœ‹πœ‹ and there
are no real effects because prices are fully flexible. This is the open-economy equivalent of
the money neutrality result from Section 6.9. Monetary policy autonomy is still valuable to
the extent that it gives monetary policy control over domestic prices and the inflation rate.
10.8 Open-economy sticky-price model
Nominal exchange rate fluctuations change international relative prices when the prices of
goods are sticky in units of money. This means that exchange rates have implications for
competitiveness, and it is important to account for this effect when studying an open
economy in the short run. We do this by adapting the new Keynesian model from Chapter 8
to an open economy.
10.8.1 Competitiveness and output demand
The domestic and foreign economies produce different types of goods. The prices of these
goods in terms of a country’s own currency are 𝑃𝑃 and 𝑃𝑃∗ . The nominal exchange rate of the
two currencies is 𝑒𝑒, defined as the domestic currency price of foreign currency. We suppose
there is no ‘pricing to market’ – no price discrimination between the domestic and foreign
markets. Domestic goods are sold at price 𝑃𝑃/𝑒𝑒 in the foreign market and foreign goods are
sold at price 𝑒𝑒𝑃𝑃 ∗ in the domestic market. The relative price of foreign goods to domestic
goods, the terms of trade π‘žπ‘ž, is:
𝑃𝑃∗
𝑒𝑒𝑃𝑃∗
π‘žπ‘ž =
=
𝑃𝑃⁄𝑒𝑒
𝑃𝑃
Since the domestic and foreign goods are different products, this does not have to equal 1
even though the law of one price holds for individual goods. Higher π‘žπ‘ž means a real
depreciation of the terms of trade, increasing the competitiveness of domestically produced
346
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
goods. We assume that net exports 𝑁𝑁𝑁𝑁 are positively related to competitiveness π‘žπ‘ž as
explained in Section 10.5.
Both goods prices 𝑃𝑃 and 𝑃𝑃∗ are sticky in this model for the reasons discussed in Section 8.1.
As explained in the new Keynesian model from Section 8.2, real GDP π‘Œπ‘Œ becomes demand
determined, i.e. the output supply curve π‘Œπ‘Œ 𝑠𝑠 is not relevant in the short run. Unlike the real
dynamic model in Section 10.4, only the output demand curve π‘Œπ‘Œ 𝑑𝑑 is relevant here.
Even though prices of goods are sticky in the currency of the country in which they are
produced, competitiveness π‘žπ‘ž can change because of fluctuations in the nominal exchange
rate 𝑒𝑒. A depreciation of domestic currency, higher 𝑒𝑒, increases competitiveness, higher π‘žπ‘ž,
which raises net exports 𝑁𝑁𝑁𝑁. This shifts output demand π‘Œπ‘Œ 𝑑𝑑 = 𝐢𝐢 + 𝐼𝐼 + 𝐺𝐺 + 𝑁𝑁𝑁𝑁 to the right.
Domestic demand in π‘Œπ‘Œ 𝑑𝑑 depends on the real interest rate π‘Ÿπ‘Ÿ as usual. If prices are expected
to remain constant, the Fisher equation implies the real interest rate π‘Ÿπ‘Ÿ equals the nominal
interest rate 𝑖𝑖 controlled by monetary policy. Although producer prices are sticky, consumer
prices can change with the exchange rate but we assume the share of spending on imports
is sufficiently small that this effect can be ignored when thinking about expected inflation.
10.8.2 Balance-of-payments equilibrium and uncovered interest parity
Assume investors are risk neutral, meaning they care only about the expected returns on
assets. For both domestic and foreign nominal bonds to be willingly held, both must offer
the same expected return. With perfect capital mobility, this means that balance-ofpayments equilibrium requires expected returns on domestic and foreign assets are equal.
Each unit of domestic currency invested in a domestic bond pays off 1 + 𝑖𝑖 units of domestic
currency in the future, where 𝑖𝑖 is nominal interest rate. To invest in a foreign bond with
nominal interest rate 𝑖𝑖 ∗ , a domestic investor converts domestic currency into foreign
currency, receiving 1/𝑒𝑒 per unit of domestic currency, which pays off (1 + 𝑖𝑖 ∗ )/𝑒𝑒 units of
foreign currency in the future when invested in the foreign bond. This is expected to be
worth (1 + 𝑖𝑖 ∗ )𝑒𝑒 ′ /𝑒𝑒 units of domestic currency in future, where 𝑒𝑒′ is the expected future
nominal exchange rate. The two bonds have the same expected return expressed in the
same currency when:
1 + 𝑖𝑖 = (1 + 𝑖𝑖 ∗ )
𝑒𝑒′
𝑒𝑒
Note that 𝑖𝑖 cannot be directly compared to 𝑖𝑖 ∗ because these are nominal returns in different
currencies. Let βˆ† = (𝑒𝑒 ′ − 𝑒𝑒)/𝑒𝑒 denote the expected change in the nominal exchange rate 𝑒𝑒,
where a positive value of βˆ† means an expected depreciation of the domestic currency. The
condition for domestic and foreign bonds to offer the same expected return is:
1 + 𝑖𝑖 = (1 + 𝑖𝑖 ∗ )(1 + βˆ†)
This is known as the uncovered interest parity (UIP) condition.
347
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Assuming βˆ† is not too large relative to 𝑖𝑖 ∗ , the UIP equation becomes 𝑖𝑖 ≈ 𝑖𝑖 ∗ + βˆ†. Ignoring the
approximation, with perfect capital mobility and risk-neutral investors, balance-of-payments
equilibrium requires that the following uncovered interest parity condition holds:
𝑖𝑖 = 𝑖𝑖 ∗ + βˆ†
This says that the domestic nominal interest rate 𝑖𝑖 must be equal to the foreign nominal
interest rate 𝑖𝑖 ∗ plus any expected depreciation βˆ† of the domestic currency relative to the
foreign currency. Given 𝑖𝑖 ∗ , the domestic interest rate 𝑖𝑖 must rise to compensate investors
for a positive expected depreciation βˆ† of the domestic currency.
Assuming consumer-price inflation is approximately zero so π‘Ÿπ‘Ÿ ≈ 𝑖𝑖, we can state the UIP
condition in terms of domestic and foreign real interest rates π‘Ÿπ‘Ÿ and π‘Ÿπ‘Ÿ ∗ :
π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿ ∗ + βˆ†
10.8.3 Monetary policy and the exchange-rate regime
The equilibrium of the economy is found using a goods market diagram with π‘Ÿπ‘Ÿ and π‘Œπ‘Œ on the
axes. There is an output demand curve π‘Œπ‘Œ 𝑑𝑑 representing aggregate demand, which includes
net exports. The position of π‘Œπ‘Œ 𝑑𝑑 depends on the nominal exchange rate because a change in
𝑒𝑒 affects competitiveness π‘žπ‘ž and net exports 𝑁𝑁𝑁𝑁. Higher 𝑒𝑒, a depreciation of the domestic
currency, implies a rightward shift of π‘Œπ‘Œ 𝑑𝑑 . Balance-of-payments equilibrium is represented by
a horizontal 𝐡𝐡𝐡𝐡 line drawn at π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿ ∗ + βˆ†, which is the UIP condition. If no change in the
nominal exchange rate is expected (Δ = 0) then the 𝐡𝐡𝐡𝐡 line is π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿ ∗ . Finally, as in the new
Keynesian model from Section 8.2, there is an 𝑀𝑀𝑀𝑀 line representing the central bank’s
monetary policy stance.
With a flexible exchange rate, the central bank has monetary policy autonomy, meaning it
can choose the position and shape of the 𝑀𝑀𝑀𝑀 line, which describes how it sets the domestic
nominal (and real) interest rate. In a closed economy, the outcome for real GDP is found
where π‘Œπ‘Œ 𝑑𝑑 and 𝑀𝑀𝑀𝑀 intersect but, in an open economy, the outcome must also be consistent
with balance-of-payments equilibrium as represented by the 𝐡𝐡𝐡𝐡 line. This is reached
through adjustment of the exchange rate 𝑒𝑒.
In Figure 10.21, suppose domestic output demand π‘Œπ‘Œπ‘Žπ‘Žπ‘‘π‘‘ = 𝐢𝐢 + 𝐼𝐼 + 𝐺𝐺 intersects 𝑀𝑀𝑀𝑀 at π‘Ÿπ‘Ÿπ‘Žπ‘Ž ,
which is above the 𝐡𝐡𝐡𝐡 line. The exchange rate appreciates (𝑒𝑒 falls), with lower
competitiveness reducing net exports and shifting π‘Œπ‘Œ 𝑑𝑑 to the left until it intersects 𝑀𝑀𝑀𝑀 on
the 𝐡𝐡𝐡𝐡 line. This process determines equilibrium real GDP π‘Œπ‘Œ0 and the exchange rate 𝑒𝑒0 .
Note that the 𝑀𝑀𝑀𝑀 line is drawn as upward sloping. With perfect capital mobility, an 𝑀𝑀𝑀𝑀
line that is horizontal or close to horizontal might require huge movements in the exchange
rate and real GDP to restore balance-of-payments equilibrium. For this reason, we assume
monetary policy is willing to adjust the nominal interest rate if the changes in real GDP
brought about by exchange-rate movements were too large.
If the government has a fixed exchange rate policy then the central bank must use monetary
policy to support the target 𝑒𝑒 = 𝑒𝑒̅ for the exchange rate. If the exchange rate is kept fixed at
𝑒𝑒̅ then there is no shift of the output demand curve π‘Œπ‘Œ 𝑑𝑑 because of 𝑒𝑒. Moreover, if the
348
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
exchange rate is expected to remain fixed in the future then there is no expected
depreciation or appreciation (βˆ† = 0) and the 𝐡𝐡𝐡𝐡 line is at π‘Ÿπ‘Ÿ = π‘Ÿπ‘Ÿ ∗. Given the requirement of
balance-of-payments equilibrium, Figure 10.22 shows that keeping the exchange rate fixed
requires the central bank to adjust its stance of monetary policy so that the 𝑀𝑀𝑀𝑀 line
intersects π‘Œπ‘Œ 𝑑𝑑 on the 𝐡𝐡𝐡𝐡 line.
Figure 10.21: Open-economy sticky-price model with flexible exchange rate
With a fixed exchange rate, having the 𝑀𝑀𝑀𝑀 line shift to intersect 𝐡𝐡𝐡𝐡 and π‘Œπ‘Œ 𝑑𝑑 is equivalent to
having a horizontal 𝑀𝑀𝑀𝑀 line in the same position as the 𝐡𝐡𝐡𝐡 line. Intuitively, this represents
the subordination of monetary policy to defending the fixed exchange rate, with the central
bank being forced set the same interest rate π‘Ÿπ‘Ÿ ∗ as the foreign economy.
Figure 10.22: Open-economy sticky-price model with fixed exchange rate
349
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Box 10.5: The trilemma
There is an important limitation on the government’s policy choices in an open
economy known as the ‘trilemma’. This says it is only possible to have two of the
following three:
1. A fixed exchange rate: a constant nominal exchange rate 𝑒𝑒 = 𝑒𝑒̅
2. Monetary policy autonomy: an independent choice of the money supply or
interest rate to pursue domestic macroeconomic objectives
3. Free capital flows: perfect capital mobility, where the UIP condition 𝑖𝑖 = 𝑖𝑖 ∗ +
βˆ† holds.
With the second and third of these policies, Figure 10.23 shows how a shift in the
monetary policy stance to meet some domestic objective requires that the
exchange rate 𝑒𝑒 must adjust so π‘Œπ‘Œ 𝑑𝑑 intersects 𝑀𝑀𝑀𝑀 and 𝐡𝐡𝐡𝐡, with free capital flows
tying down the position of the 𝐡𝐡𝐡𝐡 line if no change in the exchange rate is
expected. It is therefore not possible to have the first policy of keeping the
exchange rate fixed as well.
Figure 10.23: Monetary autonomy and free capital flows
Choosing the first and third policies of a fixed exchange rate 𝑒𝑒 = 𝑒𝑒̅ and free capital
flows, the position of the output demand curve is tied down at π‘Œπ‘Œ 𝑑𝑑 (𝑒𝑒̅). We have
seen that the stance of monetary policy must adjust so that the 𝑀𝑀𝑀𝑀 line intersects
𝐡𝐡𝐡𝐡 and π‘Œπ‘Œ 𝑑𝑑 , as shown in Figure 10.24. This means it is not possible to have the
second policy choice, an independent monetary policy, that is, the freedom to
choose the position of the 𝑀𝑀𝑀𝑀 line.
Making the first and second policy choices of a fixed exchange rate and monetary
policy autonomy, Figure 10.25 shows that it is not possible simultaneously to be on
the perfect capital mobility 𝐡𝐡𝐡𝐡 line in general. In the absence of large private
350
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
capital flows, a sterilised foreign-exchange intervention could succeed in keeping
the exchange rate fixed at 𝑒𝑒 = 𝑒𝑒̅ given the level of 𝑁𝑁𝑁𝑁, while monetary policy
autonomy is preserved because the intervention is sterilised and does not affect
the money supply. However, capital controls would be needed to prevent a large
adjustment of the financial account 𝐹𝐹𝐹𝐹 to the interest rate differential π‘Ÿπ‘Ÿ − π‘Ÿπ‘Ÿ ∗ , so
the policy choice of free capital flows would have to be abandoned.
Figure 10.24: Fixed exchange rate and free capital flows
Figure 10.25: Monetary autonomy and fixed exchange rate
If the government wants to maintain a fixed exchange rate and allow capital
mobility, are there policies that can substitute for an independent monetary policy?
Figure 10.26 shows how fiscal policy, for example, an increase in government
expenditure 𝐺𝐺, can shift the output demand curve π‘Œπ‘Œ 𝑑𝑑 for a given exchange rate 𝑒𝑒̅.
This affects real GDP because the stance of monetary policy must adjust to ensure
the 𝑀𝑀𝑀𝑀 line intersects both the π‘Œπ‘Œ 𝑑𝑑 curve
351
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
and the 𝐡𝐡𝐡𝐡 line (which can be shown as the 𝑀𝑀𝑀𝑀 line overlapping the 𝐡𝐡𝐡𝐡 line).
Thus, fiscal policy becomes highly effective under a fixed exchange rate regime.
Note that fiscal policy would not be effective under a flexible exchange rate regime
unless accommodated by a change in monetary policy.
Figure 10.26: Fiscal policy with fixed exchange rate and free capital flows
Box 10.6: Currency crises
Currency crises occur when a fixed exchange rate policy is put under pressure by
large capital flows and collapses. There are various reasons why currency crises can
occur and here we look at two examples.
Unsustainable macroeconomic policies
Suppose the government has a fixed exchange-rate policy but is also increasing the
money supply, for example, to obtain seigniorage revenues. The central bank
continues to buy government bonds, increasing 𝑀𝑀 𝑠𝑠 . We analyse these policies in an
economy with flexible prices using the model from Section 10.6.
The left panel of Figure 10.27 shows the rightward shift of the money supply curve
𝑀𝑀 𝑠𝑠 curve is inconsistent with an equilibrium exchange rate at the target 𝑒𝑒 = 𝑒𝑒̅.
There is pressure for 𝑒𝑒 to rise, i.e. for a depreciation of the domestic currency. The
central bank intervenes by selling foreign-exchange reserves and buying back
domestic currency, which reducing 𝑀𝑀 𝑠𝑠 and shifts the money supply curve back to
the left. This maintains the fixed exchange rate but the central bank is left holding
fewer foreign-exchange reserves but more domestic government bonds. If this
continues, eventually, foreign-exchange reserves will be depleted.
With limited foreign-exchange reserves and further expansions of the money
supply, a crisis eventually occurs. Once reserves are depleted, the fixed exchange
352
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
rate cannot be defended any longer and the exchange rate will float. With the
money supply continuing to increase, the exchange rate will depreciate over time,
with PPP implying this will lead to ongoing inflation. A higher inflation rate πœ‹πœ‹ raises
𝑖𝑖 = π‘Ÿπ‘Ÿ ∗ + πœ‹πœ‹ and reduces money demand as shown in the right panel of the figure. At
the time of the collapse of the fixed exchange rate, there is sharp reduction in
demand for domestic currency, triggering a large terminal drop in the central
bank’s foreign-exchange reserves when the crisis occurs.
Figure 10.27: Foreign-exchange interventions to defend fixed exchange rate
In this example, the government is following macroeconomic policies that are
ultimately inconsistent with a fixed exchange rate. A currency crisis is inevitable due
to bad policy.
Self-fulfilling currency crises
Under some circumstances, currency crises can have self-fulfilling features. This
means that a fixed exchange rate collapses owing to this being expected to happen
but, if this is not expected to happen, there is no fundamental reason for the fixed
exchange rate to fail. We explore this possibility using the model with sticky prices
from Section 10.8.
Suppose a currency crisis is expected to happen, implying a positive depreciation
βˆ†> 0 of the domestic currency is expected. As shown in Figure 10.28, this shifts the
𝐡𝐡𝐡𝐡 line upwards from π‘Ÿπ‘Ÿ ∗ to π‘Ÿπ‘Ÿ ∗ + βˆ†, so a higher interest rate is needed to avoid
capital outflows and maintain balance-of-payments equilibrium. In order to defend
the fixed exchange rate 𝑒𝑒 = 𝑒𝑒̅, the central bank must tighten monetary policy,
shifting the 𝑀𝑀𝑀𝑀 line upwards and moving along the π‘Œπ‘Œ 𝑑𝑑 (𝑒𝑒̅) curve. Thus, maintaining
a fixed exchange rate when a currency crisis is expected requires a higher real
interest rate π‘Ÿπ‘Ÿ, which results in lower real GDP π‘Œπ‘Œ.
353
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Figure 10.28: Expected collapse of fixed exchange rate
If the government is not willing to sacrifice real GDP and employment to defend the
fixed exchange rate then it fails to shift the 𝑀𝑀𝑀𝑀 line, which would cause the
exchange rate to depreciate to restore balance-of-payments equilibrium. However,
this justifies the initial expectation that the fixed exchange rate will be abandoned,
so the shift in expectations of the exchange rate is not irrational. This shows the
collapse of the fixed exchange rate regime can be a self-fulfilling prophecy.
However, if no collapse is expected then the 𝐡𝐡𝐡𝐡 line does not shift, the central
bank does not need to change monetary policy, and neither GDP nor the exchange
rate change, confirming the belief of no collapse is not irrational, so there are
multiple equilibria in this example.
We see that currency crisis can be self-fulfilling when the government will not
prioritise the defence of a fixed exchange rate if that jeopardises other objectives
such as GDP.
354
Download