1 A GENERAL THEORY OF SUSTAINED GROWTH WITH MULTIPLE GPTs By Kenneth I. Carlaw Associate Professor Department of Economics University of British Columbia - Okanagan Kelowna, British Columbia Canada Email: kenneth.carlaw@ubc.ca And Professor Emeritus Richard G. Lipsey Department of Economics Simon Fraser University Vancouver, British Columbia Canada Email: rlipsey@sfu.ca 2 I. INTRODUCTION Our evolutionary vision of long term growth is of an historical, irregular, path dependent process in which heterogeneous agents operate in a nonstationary environment of uncertainty. In our version, this environment is occasionally punctuated by the arrival of General Purpose Technologies (GPTs), which transform economic, social and political relations.1 This evolutionary view contrasts with the Newtonian approach found in more traditional neoclassical and endogenous growth models. One of the most important problems in evolutionary growth theory is to bridge the gap between (i) the vast amount that is known about micro behaviour of agents concerned with innovation and technological change and (ii) the aggregate growth behaviour of economies, which we seek to understand. Most evolutionary theories of technological change take a ‘bottom–up’ approach. They start with micro behaviour, rejecting the concept of representative agents and instead build on diversity and selection. However, these micro-based evolutionary approaches encounter the difficult, and as yet unresolved, challenge of aggregating from realistic, heterogeneous, micro behaviour to observed macro relations. Macro growth theories that seek to incorporate evolutionary concepts encounter two basic problems. First, they need to get beyond the flat concept of technology inherent in any aggregate production function. The early models of GPTs took an important step in this direction. However, whenever these models employ such neoclassical maximising techniques as dynamically stationary equilibrium concepts, the attempt to model technology in an increasingly structured way rapidly makes them analytically intractable. This in one important reason why all the first generation models of GPTs in the Helpman (1998) volume omit most of what our historical studies suggest are important characteristics of GPTs. We have circumvented this modelling problem by building non-conventional, dynamic, non-stationary equilibrium models of GPT-driven growth. In them, 1 Lipsey Carlaw and Bekar (2005: 98) (hereafter LC&B) define this concept as follows: “A GPT is a single generic technology, recognisable as such over its whole lifetime, that initially has much scope for improvement and eventually comes to be widely used, to have many uses, and to have many spillover effects.” 3 agents face an uncertain future and so do the best they can with limited knowledge of current relations.2 The second problem is to bridge the micro-macro gap. In spite of such works as John Foster’s (1987) Evolutionary Macroeconomics, the top-down approach still seems a long way from bridging this gap. We are approaching this problem by building a simple non-equilibrium macro model and then amending it incrementally in order to make it approach the kinds of micro behaviour that are observed in our historical studies and that are incorporated in many micro-evolutionary models of innovation and technological change. 3 In our original three-sector model, newly invented GPTs immediately had their full effects felt in the sector that used them. 4 Subsequent versions have introduced the following complications, all of which are found in Chapters 14 and 15 of Lipsey, Carlaw and Bekar (hereafter LC&B): allowing the effects of a new GPT to diffuse through the rest of the system according to a logistic time path; varying the types of expectations on which agents base their resource-allocation decisions allowing for heterogeneity in the effects of a new GPT by adding a second consumption sector that can be affected by a new GPT in a different way than the first sector; allowing for the effects that new GPTs have in requiring alterations to the structure of the economy (what we call the ‘facilitating structure’), which is a first approximation to modelling what Freeman and Perez (1988) and Freeman and Louca (2001) call the ‘crisis of adjustment’. We have used various versions of our single-GPT model for several applications. We refuted by counter-example the Alchian-Friedman hypothesis that any reasonable economy would behave as if all its agent were maximisers, even if they were not. We found some interesting conditions under which rational and adaptive expectations produced identical behaviour of the model and other conditions where they produced radically different behaviour. We were able to show in Monte Carlo simulations that changes in total factor productivity (TFP) do not come even near to tracking changes in technology, thus supporting the more general arguments to this effect given in Lipsey and Carlaw (2004). We were also 2 3 See LC&B Chapters 14 and 15, Carlaw and Lipsey (2006a). For our fuller justification of this approach, see LC&B, 367-9. For the historical evidence, see especially LC&B, Chapters 5 and 6. 4 This model was first given in Carlaw and Lipsey (2001) and finally published in Carlaw and Lipsey (2006a) 4 able to fit one version our model to Australian data, revealing some interesting possibilities. We are now tackling the next two major tasks in our research program. The first is to incorporate more than one GPT whose effects differ from each other, thus allowing for relations of complementarity and competitiveness among existing GPTs. This is an important task since one of the limitations of the first generation of GPT models in Helpman (1998) was that they contained only one GPT whose behaviour determined the macro behaviour of the whole economy, when in fact that behaviour is determined by the interaction of a number of GPTs that always exist simultaneously, some in competition with each other and some that cooperate with each other. In LC&B (2005) we placed great emphasis on this criticism but were not able, in the time available, to build more than a very crude two-GPT model in which each operated in a sector that was independent of the other. This barely scratched the surface of the problem. Modelling more than one GPT simultaneously operating in an economy will allow us to refine further the possibilities for applying the model to issues such as those just mentioned, as well as applying it to a range of issues that could not even be considered within the confines of single-GPT models. Although many applications are possible, each requires a separate paper. This, primarily theoretical paper, lays the groundwork for further applied work by building a model that is general in that it can handle as many GPTs, as many sectors, as many industries within each sector, as many different relations of competitiveness and complementarity among GPTs, and as many variations in the behaviour and expectations of the agents operating in the model’s various sectors and industries, as is desired. The second task is to incorporate one of the key aspects of evolutionary models: heterogeneous behaviour within any one activity and a selection mechanism that determines survival and demise. This extension will come next. II. THE SINGLE- GPT MODEL5 What we call our base-line model contains three sectors, each containing a single industry that produces (i) a consumption good, c, (ii) applied research, a, that improves productivity in the other two sectors and (iii) pure 5 The model described in this section is based on the ‘base-line’ model in LC&B (2005) Chapter 14. We repeat it here because we must amend it substantially to allow it to develop into the multi-GPT model of Part III. 5 knowledge, g, that occasionally leads to the discovery of a new GPT. Each sector has its own distinct aggregate production function. Thus the intraindustry technology is flat, while the technological structure is imposed through the inter-industry relations among these three different production functions. The index n counts the number of times that a new GPT has been invented and adopted; if a GPT is invented but not adopted, n does not change. We use subscripts n, n-1,… to indicate the most recently invented and the previously invented GPTswe speak of different vintages of the one generic type of GPT. When a new GPT is invented we call it the ‘challenger’, while the GPT that was already in place is called the ‘incumbent’. The term t always refers to the sequence of time periods. When subscripted, it refers to one specific time For examples, tn and (t-1)n refer respectively to the time at which the incumbent GPT was invented and adopted and to one time period before that. The Resource Constraint There is a single exogenously determined resource supply, R, that is allocated each period among the three sectors: (1) R = rc,t + ra,t + rg,t . The Consumption Sector Consumption output depends on the resources devoted to it and the productivity coefficient, which is the stock of applied knowledge, At multiplied by parameter, µ, which divides this knowledge between the proportions that are useful in the consumption sector and in the pure research sector, all raised to a positive power that does not exceed unity:6 6 LC&B define technology as follows. “Technological knowledge, technology for short, is the set of ideas specifying all activities that create economic value. It comprises: (1) knowledge about product technologies, the specifications of everything that is produced; (2) knowledge about process technologies, the specifications of all processes by which goods and services are produced; (3) knowledge about organisational technologies, the specification of how productive activity is organised in productive and administrative units for producing present and future goods and services (which thus includes knowledge about how to conduct R&D).” What matters for the present model is that, at our present level of abstraction, no distinction is needed between technological knowledge and the technological artefacts that embody it. So when we use the A 6 (2) ct = ( µ At )α1 rcα,t2 α1 ∈ (0,1], α 2 ∈ (0,1) The restriction on α1 implies non-increasing returns to the new knowledge created by the applied R&D sector. The restriction on α2 implies decreasing returns to the resources applied to this sector.7 The term µ reflects our decision on how to handle the complementarity between applied knowledge and the other two sectors. First, there is an obvious complementarity between applied R&D and the consumption sector because applied knowledge increases productivity directly by shifting the production function for the consumption good. Second, the spillovers from applied to pure research are well attested by many studies.8 There are two basic ways in which we could model this latter type of spillover. We could assume that the whole stock of applied knowledge was useful in the consumption sector, while some fraction of it spilled over as an externality to increase the efficiency of pure research. We have done this in other models, but here we assume that the stock of applied knowledge is divided between what is useful in the consumption sector and what is useful in the pure research sector. This division ensures that we do not introduce any permanent increasing returns to accumulating knowledge, nor any externalities. It has two advantages. First, it allows us to produce a model of sustained endogenous growth without some of the characteristics that are often needed to sustain growth in models of endogenous growth. Second, it creates a model suited to studying the relation between technological change and standard measures of changes in TFP, which usually assume constant returns to scale. Note that there are no lagged relations in (2). Appendix 1 provides a possibly more intuitively appealing description of the discrete time structure with an inter-temporal trade-off, where the A term is lagged by one period and the maximization problem is articulated using a Bellman equation. In the text, we assume that applied R&D knowledge has an immediate impact produced by the applied R&D sector in the consumption goods industry’s productivity parameter, this can represent applied knowledge and the capital goods that embody that knowledge. 7 We assume diminishing returns to the resources allocated to production in each sector for both empirical and theoretical reasons. Empirically, we argue that even in the long term there are fixed resources, such as were broadly included under the category ‘land’ by earlier economistse.g., natural resources, sites for cities and harbours and the carrying capacity of the environment. Thus, growth in the very long term is only possible if technology changes. Theoretically, we wish to show that sustained growth is possible without the assumption of either the increasing or constant returns to resource inputs that is required by many other models of sustained growth. 8 See, for example, LC&B, Chapter 4 and Rosenberg 1982: Chapter 7. 7 on the productivity of consumption. Doing so does not change any of the model’s qualitative behaviour.9 Because the results of applied research in the current period affect the productivity of consumption in the same period, there is no inter-temporal substitution in the model. Thus the maximizing trade off is between the amount of the consumption good directly produced by the resources devoted to its production with a given production function and amount of the consumption good indirectly produced by the resources that go into applied research to improve the consumption sector’s productivity. A marginal reallocation of resources from the consumption sector to applied research directly reduces the production of the consumption good, while indirectly increasing its production by raising the productivity of those resources that remain in the consumption sector. Maximisation of consumption output in any one period requires that no reallocation of resources between the two sectors can increase consumption output. Similar considerations apply to the pure research sector, as detailed later in the paper. In the text, we do not use the Bellman equation notation shown in Appendix 1 for two reasons. First is simplicity. 10 Second, we do not wish to model a closed form, stationary, dynamic equilibrium. So, we will not satisfy the maximum principle and the transversality condition normally applied to such maximization procedures. Instead, the agents generating the behaviour in the model have insufficient foresight and information to be able to solve for the infinite horizon stationary equilibrium. This models the welldocumented, genuine uncertainty that accompanies technological change.11 This approach turns out be beneficial for our modeling. It frees us from the technical feasibility constraints encountered by the first generation GPT models discussed above and allows us to model the long-term growth process as a dynamically non-stationary phenomenon. This seems to us to be consistent with the data of growth and change seen in most economies before such things as filtering and structural break modifications are made to the data. 9 We checked for this by removing all of the model’s many uncertainties and determining that it then behaves exactly like an endogenous balanced growth model 10 There are no technical reasons why we could not introduce lags in this relation, as well as in the later one concerning the payoff from pure research and we show this model in Appendix 1. But it would greatly complicate the calculations, requiring much time discounting, while, as far as we can see, adding nothing new to the results of the model. 11 For a full discussion of this point se LC&B (29-33). 8 The Applied Research Sector The production of new applied knowledge, at, depends on the resources devoted to that sector and on its productivity coefficient which is the useful stock of GPT knowledge, G: (3) at = (Gt ) β1 raβ,t2 β1 ∈ (0,1], β 2 ∈ (0,1) Once again, we make a simplifying assumption about the time-lag structure in the model, allowing the stock of useful pure knowledge to have an immediate impact on the applied R&D sector as soon as it is produced. Appendix 1 provides a possibly more intuitively appealing version where the stock of pure knowledge is lagged one period. The stock of applied knowledge, At, is the current flow of produced applied knowledge plus the depreciated stock of such knowledge accumulated in the past: (4) At = at + (1-ε)At-1 (Although this sector only produces at at time t, we speak of At as being produced by this sector.) The Diffusion of the Effect of an Existing GPT In this section, we investigate the working of the model with exogenously determined GPTs. To do this we need two GPTs, the incumbent and the GPT that it replaced. This change over occurred in the past at time tn. The realised productivity of the incumbent GPT is given at each time period by Gt, which enters the productivity coefficient of the applied R&D sector. In our earlier model (Carlaw and Lipsey 2006), the full potential productivity of the new GPT was felt immediately giving rise to unrealistic spikes in growth when a new GPT was discovered. To model the observed slow diffusion of a new GPT, we use a logistic diffusion process similar to that used in LC&B, Chapters 14 and 15. We let ( Gn−1 )(t −1) be the productivity n of the first GPT to be adopted (and numbered n – 1) evaluated in the final period before it was displaced by the challenger, (t-1)n. We let ϑt Pt be the full potential productivity of the incumbent GPT when it was first adopted at time tn. and, for now, we let ξ t = 1 . (The separate meanings of θ and P are explained in the next section.) n n n 9 eτ +γ (t −tn ) Gt = ξtn ( Gn −1 )( t −1) + ϑtn Ptn − ( Gn −1 )(t −1) τ + γ ( t − tn ) n n 1 + e (5) The equation shows actually useful general purpose knowledge, Gt , becoming available for use in applied research according to a logistic diffusion process in which tn is the adoption date of the incumbent GPT, [ ϑt Pt - ( Gn −1 )(t −1) ] is the difference between the full potential productivity of n n n the incumbent GPT and the productivity of the GPT it replaced evaluated at the time at which that GPT was last used, (t – 1)n. γ and τ are calibration parameters that control the rate of diffusion.12 To see how this diffusion process works, note that when the new GPT was first discovered, the value of the diffusion coefficient in equation (5) was a very small number (since t = tn and τ is set equal to 0.06 for our simulations). Thus, the initial productivity of producing with the new GPT was only slightly more that the current productivity of producing with the previous GPT. Now assume that the diffusion process is complete so that the value of the diffusion coefficient is effectively unity. Then (with ξt set at unity), the productivity of the incumbent GPT, Gt, is approximately equal to the productivity of the previous GPT, (Gn −1 )(t −1) in equation (5) plus the full potential productivity increment conferred by the existing GPT ( i.e., ϑt Pt − (Gn −1 )(t −1) ) in equation (5)) now turned into actually useful knowledge. Between these two dates, the diffusion process feeds the potential productivity increment of the new GPT into its actual productivity logistically. n n n n n Now consider the effect of ξ. It is a random number, which alters the lower asymptote of the logistic curve. The probability function that determines ξ is such that there is a higher likelihood of producing small upward than downward shifts.13 This random variable models two things. The first is the possibility of what LC&B call ‘historical increasing returns’ that exist at the early stages of a new GPT but are exhausted once the new 12 Because we are dealing with discrete time periods rather than continuous time, there are several ways in which the transition from one GPT to another can be handled. Since the subsequent behaviour is unaffected by how we handle the transition, we adopted what was computationally the easiest method. When a new GPT is invented at period tn, the actual value of the G that is used in the applied R&D sector is the value of the replaced GPT’s G at period tn-1, plus one period’s diffusion of the increment in potential productivity brought about by the new GPT. 13 Depending on what simulations we are running, we draw ξ from different probability distributions, but in all casesξ has a mean slightly greater than unity and a small variance. 10 technology is fully embodied.14 The second is surprises that affect how the new GPT operates in practice rather than how it looks on the drawing board. The Pure Knowledge Sector and the Endogenous Production of New GPTs To make the GPTs endogenous, we introduce our model’s third sector. It produces pure knowledge, gt, according to the production function: gt = ( (1 − µ ) At ) θt (rg ,t )σ 2 σ1 (6) σ 1 ∈ (0,1], σ 2 ∈ (0,1) . The restrictions on the σ’s ensure diminishing returns to resources used in producing pure knowledge, rg,t, and either constant or diminishing returns to that portion, 1-µ , of applied knowledge, At, used in the pure knowledge sector.15 To model the uncertainly that no one knows in advance exactly how much useful pure knowledge will be generated by a given amount of resources devoted to pure research, we multiply rg,t by θt, which is a random variable distributed uniformly with support [0.8, 1.2], mean 1, and variance (0.4)2/12. This period’s current stock of potentially useful pure knowledge, Pt, is last period’s stock, suitably depreciated by δ, plus the flow produced this period: Pt = gt + (1 − δ ) Pt −1 (7) A new GPT is invented when the random viable, λt, equals or surpasses a threshold value λ*. The model is calibrated such that this occurs only infrequently. (See below under the heading “Simulation”.) If the new GPT is adopted, its potential productivity is determined by the amount of accumulated pure knowledge at that date, Pt , multiplied by a random variable, ϑt , whose values typically produce small variations around unity. Thus the productivity potential of the newly invented nth GPT is ϑt Pt . The random variable models the observation that the actual potential of any GPT to raise productivity cannot be precisely predicted when it is originally n n n 14 n See LC&B Chapter 12, especially 397-401 Once again the stock of applied knowledge is not lagged for simplicity and the more intuitively appealing lag structure is discussed in Appendix 1. 15 11 being developed.16 It does this by altering the upper asymptote of the logistic diffusion curve shown in equation (5), thus varying the time taken for a GPT to reach the final stages of its development and how much productivity increase it will produce over that time. The slow diffusion of the full potential of a new GPT, as described in equation (5), creates a problem that we did not encounter in Carlaw and Lipsey (2006) where all of a new GPT’s potentially useful knowledge became available to the applied R&D sector at once. In that case, the new GPT was always adopted since the passage of time ensured that it embodied much more knowledge than the previously invented GPT. In the present case, however, the potentially useful new knowledge only becomes actually useful over time and, as shown below in equations (8) and (9) below, the initial impact of the new GPT may be less than that of the incumbent. Furthermore, the lower asymptote of the newly adopted GPT is shifted either up or down by the random variable ξt whose one-time effect is felt during the first period of the new GPTs life. Before making their adoption decision agents, form an expectation, ξt , about the value that this variable will take after adoption. If ξt is less than unity, indicating that agents are sceptical about how well the new technology will work in practice, this increases the chances that the challenging GPT will not initially be expected to be as productive as the incumbent. In this case, it will be rejected. n n n The adoption decision requires some type of comparison of the productivity of the new GPT with that of the incumbent. LC&B consider three adoption criteria in which the new GPT is adopted: (i) always without comparison with the incumbent, (ii) if its expected initial level of the productivity exceeds that of the incumbent and (iii) if its expected initial level of the productivity is at least as high as the incumbent’s and its expected rate of growth is higher in the near future.17 For illustrative purposes in this paper, we use the second criteria and adopt the new GPT if its expected initial level of productivity exceeds that of 16 Both λ and ϑ are derived from beta distributions, where each distribution is defined as νη x (ν −1) x (η −1) with support [0,1], mean (ν/(ν+η)) and variance . beta( x | ν ,η ) = 2 Beta(ν ,η ) (ν + η ) (ν + η + 1) Beta(ν,η) is the Beta function, and ν and η are parameters which take on positive integer values. The value of ϑtn is set with support [0,1.2] by multiplying all draws by 1.2. The parameters of the Beta distribution are set such that the mass is centered on one. 17 See LC&B, 452-3. There they define five cases because they divide each of our cases (ii) and (iii) in the present text into two sub-cases depending on what happens to the rejected GPT. 12 the incumbent. If the GPT fails that test, it is sent back to the pure research sector for further development. It will again be considered for adoption when a new technological breakthrough occurs as signalled by another favourable drawing of λt > λ*. To apply this adoption test, we need to compare the level of productivity under the incumbent GPT with that expected of the challenger if it is adopted. Here we compare what the incumbent’s G value would be at time tn if it were not replaced at that time (shown in the RHS of equation (9)), with the G value that would be produced in the first period of diffusion with the new GPT (as shown in equation (8) and the LHS of (9). This is a simple matter when we have only one GPT in use at a time and only one applied R&D sector. However, the comparisons become more complex in Part III when we have several different types of GPTs and several different vintages of each type, with the possibility that these different vintages are being used at the same time in different applied R&D sectors. Because we will need it in the multi-GPT model, we use an accounting variable, H, to make the relevant comparison here. H t measures what the actual realised productivity would be at time tn if the new GPT were adopted. n H tn = ξtn ( Gn −1 )(t −1) (8) n ( eτ +γ ( t −tn ) + ϑtn Ptn − ( Gn −1 )(t −1) τ + γ ( t − tn ) n 1 + e ) Equation (8) has the same structure as equation (5) with ξt replaced by ξt . But this time it is evaluated at one specific time, the time when the new GPT is invented. Note that since we are calculating the diffusion over only the first period, the gamma terms disappear from the logistic diffusion expression, leaving only the value of τ to determine how much of the potential increment associated with the challenger accrues in the first period. n n We then compare this value with the productivity that the incumbent GPT would have if it were used instead of the challenger at period tn H tn > (Gn −1 )tn (9) If the new GDP is adopted, its logistic diffusion continues as modelled in equation (5) with ξt replaced by ξt , which is determined by another random drawing that alters the value of ξt somewhat from that of ξt . This alteration models the surprise that occurs after the GPT has been adopted, as e.g., when many of the first offices to introduce computers into a structure designed for hard copy procedures found their efficiency actually declining, only much later to recover and then rise rapidly. So if the test in (9) is n n n n 13 passed, the new GPT begins its actual logistic evolution with a surprise drawing of ξt . (With the adoption, tn is reset at the current value of t and the index n is increased by 1.) n If this test is not passed, the GPT is returned to the pure knowledge sector to be worked on and to await another break through, modelled as λ ≥ λ* at some later date. Resource Allocation In this paper, we could use a resource allocation mechanism based either on maximising behaviour or behaviour that more closely follows the assumptions of evolutionary models. We describe both here, however, because one of our goals is to build an evolutionary model of growth driven by technological change that allows us to move from macro to micro behaviour, we use evolutionary behaviour in our simulations. In a later paper, we compare behaviour under this evolutionary modelling with that which follows from a maximisation assumption. What really matters for our model is that agents respond to relative intersectoral differences in perceived rates of returns, which they do in both the maximising and the evolutionary versions. These are just two of the ways in which we can model agents behaving as groping into an uncertain future in a profit oriented way. Maximization: To model maximising behaviour with respect to resource allocation, we need some further assumptions. First, we need to know how agents form expectations of the consumption payoff from resources devoted to pure research. In LC&B and in Carlaw and Lipsey (2006), we assume that agents estimate the payoff on the assumption that pure knowledge is immediately useful in the applied R&D sector, rather than being useful only after a new GPT is discovered. This is equivalent to assuming that they have a zero time discount. We model this assumption in the last two lines of the maximization procedure presented below. Second, we assume, as in LC&B and in Carlaw and Lipsey (2006), that agents seek to maximise their returns measured in consumption units. We assume that agents do not know the future consumption payoffs from any line of expenditure because they do not know the probability distributions that are generating the disturbances on the outcomes. In the LC&B formulation, agents form expectations of the payoffs to expenditures based on their perceptions of the current period’s marginal productivities. Also, 14 since agents cannot anticipate surprises, they make decisions on the assumption that the expected value of the surprises is zero. Thus they allocate resources among the three sectors according to their expected current marginal products measured in consumption units. Under certain assumptions, this is equivalent to perfect competition.18 We could assume that a social planner makes the entire allocation over all three sectors. Alternatively, we could assume that the allocation is made by private price-taking agents in the consumption and applied research sectors19 and by a government that taxes agents in these two sectors to pay for pure research, which has the assumed payoff just described. Recursive substitution of the constraints into the objective function yields the following reduced form: α1 β1 σ σ β (10) ct = µ v ( (1 − µ ) E[ At ]) 1 ( rg ,t ) 2 + (1 − δ )Gt −1 ( ra ,t ) 2 + (1 − ε ) At −1 (r ) α2 c ,t . The expectations operator is applied to the stock of applied knowledge in this equation because there is a problem of simultaneous determination. We adopt the simplest of assumptions about expectations by setting E[At] = At-1. The maximization problem is: max ct = ( µ At )α1 (rc ,t )α 2 {rc ,t ,ra ,t , rg ,t } s.t. Rt = rc ,t + ra ,t + rg ,t (11) At = at + (1 − ε ) At −1 at = ( Gt ) raβ,t2 β1 gt = ( (1 − µ ) At ) rgβ,2t β1 Gt = gt + (1 − ε )Gt −1 where the upper bars indicate expectations of the variables. Resource allocation with mutation and selection: This is the version we use in this paper. It is designed to bring the model closer to evolutionary behaviour. We seed the model with an arbitrary allocation of resources and 18 In all treatments by other authors, agents are modelled as having perfect foresight about the future evolution of new GPTs. Our assumption of no foresight seems closer to what we observe than the assumption that agents are sufficiently foresighted to maximise over the whole of a GPT’s lifetime—a lifetime that can easily extend over more than a century. 19 Digit-Stiglitz style monopolistic competition can be introduced into these two sectors with an increase in complexity and no change in the qualitative behaviour of the model. 15 use a random mutation to change this allocation period by period. The resource allocation behaviour is completed by a selection mechanism. It keeps the mutated allocation if it results in an increase in some objective, that in various experiments is either consumption or knowledge stocks. If the increase does not occur, the previous period’s allocations are retained. The model is flexible enough to accommodate many other assumptions and we have examined some of these elsewhere. III. THE MULTI-GPT MODEL Key Characteristics of the New Model We now move to a model with multiple GPTs. We maintain the same three sectors: pure knowledge, applied knowledge and consumption goods, but now allow for many distinct industries within each sector. For example, each pure knowledge industry produces knowledge relevant to one generic type of GPT. There are several generic types of GPTs, such as materials, information and communication technologies (ICTs), power delivery systems, modes of transportation, and tools.20 Within each generic type, there are several different kinds. We distinguish these by the sequence of their invention, which we refer to as vintages, such as the steam engine, the internal combustion engine, and the electric motor, within the power delivery category. In our model, GPTs are invented by each of the pure knowledge industries at widely separated and randomly determined times. Also many surprises occur in such things as the payoff from pure research, the initial adaptation of new GPTs, the length of life and eventual productivity of new GPTs, how fast they eliminate competitors, and how well they cooperate with other incumbent GPTs. These mirror the observations that with radically new technologies, there is much that cannot be known in advance and much learning by doing and using must occur as agents grope into an uncertain future in a profit-oriented but not profit-maximising manner.21 When a new vintage of some generic type of GPTs is developed, it may drive out previous vintages quickly or only spread slowly from industry to industry while one or more existing GPTs of earlier vintages co-exist for a 20 See LC&B (131-6) for a discussion of various types of GPTs. There is a mass of evidence supporting these statements. For a summary and evaluation see LC&B, especially Chapter 2 21 16 long time. This mirrors the various different performances of GPTs that have been observed throughout history.22 New GPTs feed their knowledge into applied R&D industries, These use the knowledge to produce new applied knowledge that increases the productivity of the consumption goods industries and the pure knowledge industries. This models the mutual causation loops that are observed to exist among applied and pure research.23 Unlike models that contain a single GPT, the macro behaviour of the model is not governed by the behaviour of one dominant GPT and the arrival of new GPTs can rarely if ever be inferred from observing macro behaviour. This suggests that one should be cautious in accepting the arguments of some economic historians that the study of 19th century English macro behaviour can be used to determine whether or not GPTs were responsible for the observed growth. Some Assumptions and Definitions There are J industries, each producing a different consumption good, j∈(1,J), and making use of some or all of the stocks of applied knowledge. There are K applied R&D industries, each producing a distinct type of applied knowledge, k∈(1,K), and making use of all of the existing types of GPTs.24 There are X industries that produce pure knowledge and make use of some applied knowledge. Each of these occasionally invents a new GPT of type x∈( 1, X). One of each type of GPT is initially seeded in each applied industry and new GPTs are invented periodically. They are adopted for use in the various applied industries at various times. There will thus be a sequence of GPTs of each type (vintage) and different vintages of any one generic type may be employed in different applied R&D industries at any one time. These characteristics require us to keep separate track of the vintages and the sequence of adoptions of each type of GPT. We use the index nx to identify the vintages of GPTs of any one type. Every time a GPT of type x is invented and adopted in at least one industry, nx increments by one. If the 22 For evidence on the varied impacts of new GPTs on existing technologies, including other GPTs, see LC&B, Chapters 5 and 6. 23 For an excellent study of such feedback loops see Rosenberg (1982) Chapters 3 and 7. 24 It is a trivial matter to alter the model to make some of the applied R&D industries not use some of the various types of GPTs, although typically in practice all GPTs are widely used. 17 GPT is rejected by all industries, it is returned to its pure research industry for further development and nx does not change. At any one time, we refer to the Gs that have been produced by the latest and the previously invented GPTs of type x as Gnx and G(xn −1) . x x In this model we must also keep track of the adoption of specific GPTs in specific industries since a new GPT does not have to be adopted at once by all industries. We use the index d k , z to count the sequence of adoptions of GPT type z (= x) by applied R&D industry k. Note that x ∈(1,…X) and z∈(1,…X) are two different designations for the same set of pure knowledge producing industries and the GPTs that they produce.25 Later we need to show the sequence of adoptions of a GPT of any type in industry k, which we indicate by dk . We also need to keep track of the date of each GPT’s invention (and adoption in at least one applied R&D industry). We use tn and (t − 1) n identify invention date of the latest vintage of the type-x GPT and one period before that, while t( n −1) is the invention date of the GPT one vintage earlier than the latest vintage. The time stamps are reset every time a new vintage of type-x GPT is invented (and adopted by at least one applied R&D industry). x x x The Resource Constraint Resources must now be allocated among J consumption, K applied research and X pure research industries. So the single-GPT model’s resource constraint in (1) above now becomes: (12) J K X j =1 k =1 x =1 Rt = ∑ rt j + ∑ rt k + ∑ rt x The Consumption Industries 25 We have used x to identify each type of GPT that is discovered by its corresponding pure knowledge industry. However, different GPTs of type-x may be operating simultaneously in the K different applied R&D industries. We distinguish these by their different adoption dates and for this purpose we use z to identify the type of each GPT. The need for these two separate identifications of the same GPT will be clarified later, when we consider the interaction among GPT operating in different industries within the same sector. 18 The J consumption industries are defined in equation (13), which replaces the single-GPT model’s equation (2). 1 (13) K K ctj = ∏ ( µ Atk )α k (rt j )α K +1 , α k ∈ (0,1] ∀k ∈ K , α K +1 ∈ (0,1) k =1 According to (13), the productivity coefficient in each of the j consumption industries at time t is the geometric mean of the amount of each of the k stocks of applied knowledge that is useful in consumption production (each raised to the appropriate power αk) and the amount of resources devoted to producing in that industry (raised to the power αK+1). In the version of the model used in this paper, we make applied research industry-specific. Thus we have the same number of applied research and consumption industries, J = K, and we make the knowledge produced in each of the applied industries, j, useful only in the one corresponding consumption industry, k. This simplifies the production functions for consumption goods industries as follows: (13)’ ctj = ( µ Atk )α k (rt j )α K +1 , α k ∈ (0,1] ∀k ∈ K , α K +1 ∈ (0,1) and j = k Note that in both (13) and (13)’ the proportion of applied knowledge that is useful in consumption and in pure research, µ, is assumed to be the same for all types of applied knowledge.26 The Applied Research Industries Each applied R&D industry utilizes some vintage of each of the X GPTs. Their production functions now have X + 1 arguments, X for all the types of GPT knowledge, and one for the resource input (the time subscript is needed on the Gs because as time passes, the efficiency of each GPT evolves in a manner analogous to equations (7), (8) and (9) for the single-GPT model). Equation (3) for the single-GPT model becomes: 1 (14) X X atk = ∏ (ν dkk,,xz (Gdxk ,z )t ) β x (rt k ) β X +1 x =1 Atk = atk + (1 − ε ) Atk−1 β x ∈ (0,1) ∀ x ∈ X , β X +1 ∈ (0,1) 26 It is a simple matter to make these proportions differ across industries (µj) if there was reason to do so. 19 where the industry specific parameters, ν dk , x , will be used later to allow for surprises about the productivity of specific new GPTs and for complementarities among new and existing GPTs, but for now are all set at unity.27 The first line of (14) tells us is that the output at time t of the kth R&D industry (which produces R&D useful in the jth consumption industry) is determined by the amount of resources devoted to that industry raised to the appropriate power βX+1, and a productivity coefficient, which is the geometric mean of the current knowledge provided by the X GPTs of the particular vintages used in that industry, each raised to its own coefficient βx (and to be modified by its corresponding ν term when these are no longer all set at unity). The second line tells us that the productivity parameter of the kth goods production industry, Atk , is the past accumulated stock of the applied knowledge that is relevant to that industry (suitably depreciated) plus the current production of that knowledge. k ,z The Pure Knowledge Industries There are X industries each producing a different type of pure knowledge. The productivity coefficient in each is the geometric mean of the various amounts of the K different kinds of applied knowledge that are useful in further pure research (one for each applied R&D industry and each raised to its power σk). The output of pure knowledge in industry x is a function of this productivity coefficient and the amount of resources devoted to that industry. Equation (6) for the single-GPT model is now amended as follows: (15) K gtx = ∏ (1 − µ ) Atk k =1 ( 1 ) σk K x x θt rt ( ) σ K +1 , σ k ∈ (0,1], ∀ k ∈ K and σ K +1 ∈ (0,1) . As in the single-GPT model, θtx , which models the uncertainty about the actual results of any given amount of resources devoted to pure research, is a random variable distributed uniformly with support [0.8, 1.2], mean 1, and variance (0.4)2/12.) The stock of potentially useful knowledge in each of the X lines of pure research is accumulated according to: 27 Each ν multiplies a specific vintage of general purpose knowledge identified by superscript x, which is operating in a given applied sector identified by superscript k. The subscript on each ν identifies the adoption sequence of the latest type z GPT to be adopted in applied R&D sector k. In the later section, entitled “relations among GPTs”, the meaning and application of these νs are explained in some detail. 20 Pt x = gtx + (1 − δ ) Pt −x1 (16) which is equation (7) of the single-GPT model amended to allow for the X separate pure knowledge industries. New GPTs are invented in each of the X lines of pure research in the same way as in the single-GPT model, when the drawing of the λ for that industry equals or exceeds the critical value of λ, i.e., λtx ≥ λ * x . For simplicity, we let the critical value of lambda for each of the X industries in the pure knowledge sector be the same: λ * x = λ * ∀ x ∈ X . Now let a new GPT of type x be invented at time tn because λtx ≥ λ * . The index tn , which indicates the time of the latest invention of a type-x GPT (with at least one adoption), is reset to equal the current t. The index n, which indicates the number of inventions of type-x GPTs (with at least one adoption of each) is augmented by one. x x The stock of potentially useful pure knowledge that goes into the new GPT, and that is available to be transformed into actually useful pure knowledge, is ϑt Pt x where, as in Part II, ϑt models the observation that the nx nx nx applied potential of a GPT cannot be precisely predicted when it is originally being developed. (The sets of λ’s and ϑ’s are drawn from beta distributions as in the single-GPT model.) We now need to determine if and when the new GPT will be adopted in each of the applied R&D industries. To do so, we alter the accounting variable for the single-GPT model given in equation (8) to allow for multiple GPTs. This new equation shows the productivity that a newly adopted GPT is expected to have if it is adopted at time t in some particular sector. ( (17) H nxx ) ( t = G(xn −1) x ) ( t −1)nx eτ +γ ( t −tnx ) x x + ϑ P − (G(xn −1) x )(t −1)nx , 1 + eτ +γ (t −tnx ) tnx tnx ( ) The first term on the RHS is the Gx associated with the incumbent, now numbered (n-1)x, evaluated one period before the new GPT is invented, (t − 1)n . The last term is the gap between the Gx just referred to and the full potential of the new GPT. In the initial set of comparisons, the new GPT has just been invented so that tn in the logistic diffusion term is set equal to t. x x Each of the k applied research industries considers whether or not to adopt the new GPT. In each, the productivity of that applied research 21 industry using the original set of GPTs is compared with the expected productivity if the new GPT of type-x were to replace the existing GPT. However, this test is more complex than in the single-GPT model because, as shown in equation (14), the productivity of each of the X GPTs operating in each of the K applied R&D industries is affected by a set of νs. Up to now we have held theνs constant at unity. We now allow the νs in any one applied R&D industry to change whenever a new GPT is adopted in that industry, either increasing or decreasing in a manner specified below. The new GPT of vintage xn comes with a vector of new expected values ν that influence its own productivity in each of the K applied R&D industries. It may also cause theνs associated with the other GPTs in industry j to alter when it is adopted in that industry. If it does so, this indicates a change in the complementarity relations between the type-x GPT and the other types of GPTs with which it cooperates in industry j. We say much more about the determination and interpretation of the νs in the following section. k , xn When considering adopting a new GPT of type x in applied R&D industry k, agents must form expectations on theνs that will be attached to each of the GPTs operating in that industry if the new GPT is adopted. We assume that they expect the ν attached to GPT of type x to change if they adopt the new vintage of that GPT. They do not, however, expect those νs attached to the other (i.e., non-x-type) GPTs used in industry j to change. Thus agents are assuming that the direct productivity of the challenging GPT operating on it own will change from that of the incumbent, but that the challenger will cooperate with the other GPTs operating in that industry precisely as well as did the incumbent. (We will see in the next section that this assumption is a source of surprises for agents when they employ a new GPT.) Since in each R&D industry the only ν that agents expect to change is the one associated with the challenging x-type GPT, we can compare the productivities for any of the k industries by comparing the νGx that would be produced if the incumbent were left in place with the ν G x that is expected to be produced if the challenger were adopted.28 This comparison is made in 28 If the νs attached to the other GPTs operating in industry k were also expected to change, we would have k to calculate the complete value of the at term from (12) as it would be under the existing GPTs and set of νs and then compare it with what that a term would be if the challenger replaced the incumbent in that industry with the new set of ν s. 22 each of the K applied sectors at time t = tn so the test, stated generally for all industries, is: x ( ( ) ) ν (kd,+x1) H nx ≥ ν dk , x Gdx for each k∈(1,K) k ,z x t k ,z t k ,z (18) where ν (kd,+x1) is the expected ν for the GPT of type x (=z) used in industry k. k ,z Because the index on n was incremented when the λ test was passed, but no adoption has yet taken place (so the d index was not incremented), we must identify the expected ν with the (d+1)th adoption in applied R&D industry k. ( H nx )t is as defined in equation (17). ν dk , x is the actual value of ν that was established at the time of the dth adoption of a vintage of type-x GPT. (Gdx )t is the incumbent GPT of type x (adoption number d) in industry k evaluated at time t. If the test is passed, the new GPT is adopted in industry k and the dk index for that industry increments by one. k ,z x k ,z If none of the k applied industries adopts the GPT, it is returned to its pure knowledge industry. The indexes dk and n are returned to their previous values (i.e., incremented downwards by one) and td is returned to its k ,z previous valueit is as if the favourable drawing of λt >λ* had never occurred. Pure research then continues on the GPT and the improved GPT will not again be considered for adoption until there is another major breakthrough (signalled by a λt > λ*). If at least one industry, k, adopts the new GPT, the diffusion process for that GPT starts. Also a new set of νs is established for all of the GPTs operating in that industry. We explain how these are determined in the next section. Equation (5) for the diffusion of a single GPT is now amended to allow for multiple GPTs. In it, we remove the random term, ξ since we are now able to decompose this surprise term on the lower asymptote of the logistic diffusion of each GPT into a more complex set of surprises by use of the ν terms as described in the next section. (19) (G ) = (G x nx t x ( n −1) x ) ( t −1)nx eτ +γ (t −tnx ) x x + ϑ P − (G(xn −1) x )(t −1)nx , 1 + eτ +γ (t −tnx ) tnx tnx ( ) 23 where ( G( n −1) x ) t is evolving according to its own logistic diffusion process with a starting date of tn .29 Notice that the RHSs of (17) and (19) are identical. Equation (19) gives the evolving Gxs in the industries that have actually adopted the latest vintage of that type of GPT. Equation (17) tracks the same evolution in the productivity variable to be available for comparisons in the future when industries have than not already adopted the latest vintage wish to compare their productivities with the evolving productivity of any later vintage of that type of GPT. x To illustrate the adoption process, suppose that applied industry k = 1 adopts GPT nx, but the others do not. The evolution of GPT knowledge useful in the applied R&D industry 1 then follows the path outlined in equation (19). The evolution in those industries that are using the previous vintage (n-1) of the type-x GPT follows that equation with the subscripts altered as follows: (19)’ ( G x ( n −1) x ) = (G t x ( n − 2) x ) ( t − 2)( n−1) eτ +γ ( t −t( n−1)x ) x P x − (G(xn − 2) x )(t − 2)( n−1) , + ϑ 1 + eτ +γ (t −t( n−1) x ) t( n−1) x t( n−1)x x ( x ) Similar alterations to (19) will show the evolution of the Gs in those industries that are using the vintage (n-2) of the type-x GPT, and so on. For those industries that are not using the latest vintage of type-x GPT, the comparative test of equation (18) is applied each period taking into account the improvements in efficiency of both the incumbent GPTs and that of the challenger due to the logistic diffusion process (through the evolution of the H term in (17)). This comparison is made with each newer vintage of the GPT than the one currently being used in the industry in question. If any of these test are passed, the appropriate newer vintage is adopted. The subsequent flow of useful GPT knowledge is then switched to the evolving flow of the newly adopted GPT. The production function of the applied R&D industry in question then becomes equation (14) whose d replaces whatever earlier d adoption vintage had been in use. Pure researchers in industry x continue to amass knowledge relevant to inventing another new GPT, which will come along at some future date according to the random drawings of λx. It is quite possible that a new GPT of type x will be invented before the previously invented GPT has replaced its predecessor in all industries, just as, for example, the jet engine replaced 29 This is a general formulation that will allow for many vintages of GPTs of one type to be in use in the use in the various industries: G( n − 2) x , G( n −3) x etc. 24 the turbo prop long before the turbo prop had replaced the piston engine in all types of aircraft. Thus there may be several vintages of type-x GPTs operating in the various applied R&D industries. The need to keep track of these possibilities is one of the reasons that our notation is so cumbersome. The Relation Among GPTs Substitutability between an incumbent GPT and its challenger: One of each type of GPT is always used in each applied R&D industry. But, a later vintage of type x may be used in some industries while one or more earlier vintages of that same type are still used in other industriesmirroring what we see in reality when a new GPT spreads slowly through the economy, displacing the incumbent GPT industry by industry. Thus, all GPTs of any one type are (more or less good) substitutes for each other. How good a substitute for the incumbent the challenger is expected to be in each industry, k, depends on how the challenger’s expected productivity (modified by its associated expected ν coefficient) compares with the incumbent’s productivity (modified by its associated actual ν coefficient). How good it actually turns out to be initially depends on the realised value of its own ν. The various vintages of any one type of GPT compete with each other in the sense that in each period the industries using the earlier vintages compare their evolving efficiencies with their expected efficiency if they adopt a later vintage. A switch to the later vintage is made whenever it raises expected productivity. One major source of surprise in our model is when the realisedν, deviates from the value it was expected to take, ν , when the adoption decision was made. This is done using a random deviation from the initial expected value. This difference models the surprises when a GPT turns out in practice to be either somewhat less efficient than it was expected to beas, for example, when the first generation of computers introduced into offices organized for hard copy information were initially less productive than expected or more efficient than expected as, for example, when the laser turned out to be far more useful than its original developers thought it would be. Complementarity among different types of GPTs: The second source of surprises models the complementarity among the different types of the GPTs. In any given industry, these GPTs cooperate with each other to a greater or lesser degree. In practice, no one can be quite sure just how 25 complementary with the other GPTs used in that industry a new GPT introduced into industry k will turn out to be. Whether it is better or worse than the incumbent at cooperating with them is typically something that can only be discovered in practicea case of learning by using. According to the test given by inequality (18), the adoption decision on the challenging GPT of type x in the kth applied R&D industry is made in the expectation that the ν coefficients for all of the non-x type GPTs currently being used in that industry will remain unchanged. However, if the GPT is adopted, these νs are altered. These changes model the alterations in the degree of complementarity between the new type-x GPT and all the other non-x type GPTs with which it is now cooperating. An example: Perhaps an example will help to illustrate these points. Let there be three types of GPT (developed by three pure research industries) and three types of applied R&D industries (producing knowledge useful in three types of consumption industries). Thus we need nine individual νs, one for each GPT that is used in each applied R&D industry. Initial values for these are shown in Table 1. (We use superscripts to identify each element in the matrix, rather than the more usual subscripts in order to conform to the notation in the earlier part of the paper.) TABLE 1 DISTRIBUTION OF νs FOR A MODEL WITH THREE-GPTS, AND THREE APPLIED R&D INDUSTRIES 1 K 2 3 1 ν ν ν 3,1 X 2 ν 1,2 ν 2,2 ν 3,2 3 ν 1,3 ν 2,3 ν 3,3 1,1 2,1 The initial seed values of the elements in this matrix can each be set at any desired values to model any desired set of complementarities among the different GPTs operating in each industry. For illustrative purposes, we set them all at unity. Now let a new type-2 GPT be invented. When agents are considering adopting the challenger in applied R&D industry 1, they need to form 26 expectations on the three ν values in the first column vector that contains the νs for that industry. We assume they expect ν1,1 and ν1,3 to be unchanged by the introduction of the new GPT, but that ν1,2 will now take on the value ν 1,2 . Extrapolating this across all of the applied industries, we see that the new GPT brings with it a row vector of expected new values of the νs that will be associated with its operation in each of the three applied R&D industries if it is adopted there, ν 1,2 , ν 2,2 and ν 3,2 , and the expectations of all otherνs in the matrix are set to their previous period’s actual value. If the test shown in inequality (18) is passed, the type-2 challenging GPT is adopted in applied R&D industry 1. As soon as that happens, all three of the νs in the first column change. ν1,2 alters from ν 1,2 . This is the first source of surprise discussed above. ν1,1 and ν1.3 alter from the values shown in Table 2, which were their actual values when cooperating with the incumbent type-2 GPT and their expected values when cooperating with the challenging type-2 GPT. This is the second source of surprises discussed above. It follows that at any one time each column vector of existingνs was determined when the most recent entry of a GPT occurred in that industry. We call that GPT the ‘source GPT’ for the νs in that industry. Similar tests are made for applied R&D industries 2 and 3 and the challenger is accepted or rejected in each. Where it is accepted, the appropriate column of νs is altered; where it is rejected the column of νs stays as it was in Table 1. Next we need to explain how the several distinct sets of random variations of the νs are determined. First, we need to determine the row vector (x = 2) of expected coefficients for the challenging GPT if it were to be adopted in each of the three applied R&D sectors. These expected ν s are calculated as the actual νs for the incumbent GPT of type x modified by the addition of a random variable π. For our general modelling purposes these random variables can be drawn from a number of different probability distributions and for illustrative purposes we chose a uniform distribution with support [-0.05, 0.05]. For the example discussed above the expected ν s are: 27 ν k ,2 ν k ,2 + π if (ν k ,2 + π ) ∈ [0.5,1.5] ν k ,2 + π < 0.5 if = 0.5 1.5 ν k ,2 + π > 1.5 if k∈(1, 3) If the challenger is adopted in applied R&D industry, k = 1, two types of changes occur. In both cases, the realisation of the νs that multiply the GPTs used in that industry differ from the expected values. First, the challenger’s efficiency differs from its own expected value on which the adoption decision was made as follows: ν 1,2 ν 1,2 + π if (ν 1,2 + π ) ∈ [0.5,1.5] = 0.5 ν 1,2 + π < 0.5 if 1.5 ν 1,2 + π > 1.5 if Second, the νs associated with each of the other GPTs used in the given applied industry also change. This is done by adding a random term π drawn, again for illustrative purposes only, from a uniform distribution with support [-0.05. 0.05]. In this example in which a type-2 GPT is adopted in applied R&D industry 1, the new coefficients on GPTs types x = 1 and x = 3 in industry 1 are: ν 1, x ν 1, x + π x if (ν 1, x + π x ) ∈ [0.5,1.5] ν 1, x + π x < 0.5 , = 0.5 if 1.5 ν 1, x + π x > 1.5 if for x ≠ 2 An increase in one of these νs indicates that the new type-2 GPT cooperates with one of the non-type-2 incumbents better than did the replaced type-2 GPT. A decrease indicates the opposite. It remains to note that when programming our simulations we need a full identification of the νs in Table 1. Every time a new GPT is introduced in any one industry, that GPT is a source of changes in all of the νs in that industry’s column. It follows that the matrix of νs that is operative in any one period can have up the lesser of K or X different GPT sources. However the source of the νs in a particular column need not be the latest vintage of that GPT. A newer vintage of that type might have been invented and adopted elsewhere but rejected by the industry in question and no other types of GPTs adopted there in the meantime. Also after several new vintages have been invented and rejected in sector k, a vintage newer than 28 the incumbent but older than the latest vintage could be adapted. These possibilities require that we keep track, among other things, of the vintage of the GPT that is the source of the νs in each column. So the full identification of each ν that is needed for programming our computer simulations requires that we identify (i) the applied R&D industry that is using it, k; (ii) the GPT to which it applies x; (iii) the latest adoption identification number for that type of GPT, d, and the GPT that is its “source”, z. Thus, we have a five dimensional array, any element of which specifies the complete “address” of the given ν as ν dk , x . k ,z It is now apparent why we need two different symbols for each type of GPT. The x indicates the particular G that is being modified by the ν in question. The z indicates the GPT that is the source of that particular ν, i.e., the GPT whose entry caused the industry’s νs to change to their current values. Only in the one cell where the GPT being modified by the ν is also source GPT will x=z. Having defined the νs as a set of random variables for the purposes of laying out the theory, it is important to note they can also serve as a point of contact with empirical research. Instead of allowing the νs to be determined randomly, they could be determined from data on the GPT under examination. Indeed, we have already done this in a paper applying our model to Australian data. This capability to calibrate the model by choosing the νs to approximate from real data suggests a number of policy experiments that we will conduct in later papers. Multi-GPT Resource Allocation The allocation problem for the multi-GPT model now requires the allocation of resources to J lines of final consumption, K lines of applied R&D, and X lines of pure research. Maximization: As in the single GPT model, it is possible allocate resources by maximizing consumption period by period using current marginal products (as did LC&B) and Carlaw and Lipsey (2006)). The maximization problem is altered slightly from the model presented in the single GPT model because we need to relate the multiple industries in the consumption sector to each other. We make the simple assumption that the system can be represented by an additively separable, social (or representative) utility function defined over the J lines of consumption output. This function relates the consumption outputs of each industry to each other through their 29 marginal utilities. We assume that the objective function is an additive utility function defined over the J lines of consumption output. J , rt , rt t } j =1 ϕj ( ) max U (c ) = ∑ ct j k x {r J t j s.t. (20) J K X R = ∑ rt j + ∑ rt k + ∑ rt x j =1 k =1 k αk t x =1 j α K +1 ct = ( µ A ) (rt ) j X a = ∏ (ν dkk,,xz (Gnxx )t ) β x ](rt k ) β X +1 k t x =1 A = atk + (1 − ε ) Atk−1 k t Gtx = gtx + (1 − δ )Gtx−1 K ( g = ∏ (1 − µ ) Atk x t k =1 ) (θ r ) σk x x t t σ K +1 Resource allocation with mutation and selection: As in the single GPT model, we chose an allocation method that is closer to an evolutionary model of technology-driven economic growth. We model resource allocation by seeding J + K + X industries with an arbitrary allocation of resources.30 We then use random mutations to change this allocation period by period. For each type of resource, in each period, we calculate its portion of the total resources allocated to them as follows: (21) qti = pti rti−1 J +K + X ∑ i =1 ∀i ∈ (1, J + K + X ) , i i t t −1 pr where pti is a random variable drawn from a Beta distribution, calibrated to have a very small variance around mean 0.5. We combine this with the selection mechanisms conditioned on either consumption or improvements in the stock of knowledge. The mechanism keeps the set of mutations if they result in an increase in whichever of the objectives is chosen. If the given selection criteria is not met, the previous period’s values for the resource allocation are retained and mutated once again in the next iteration. 30 In this case, each industry is arbitrarily allocated an equi-proportionate share of the total resources. 30 Simulation To simulate the model, we need to put values on the parameters. The specific numbers shown in Table 2 are for the single GPT model and those shown in Table 3 are for the multi-GPT model. The values were chosen for various reasons. The first was to meet the twin criteria of ensuring diminishing returns to resources and achieving an average annual growth rate of approximately 2%. GPTs arrive on average every 35 years, but with a large variance. The second was that several different parameterisations were tested to check the robustness of the qualitative results and these were found to be robust to wide ranges of values satisfying our basic assumptions. The third was that nothing in our subsequent analysis turns on these particular values. The fourth was that we chose some parameter values to ensure that knowledge has constant returns. The fifth was that we utilized symmetry across sectors and across industries within sectors to simplify the illustrations we wished to make with the model. TABLE 2 NUMERICAL SIMULATION OF THE SINGLE-GPT MODEL The following are the parameter values used to simulate the results of the single GPT model as reported in the text and shown in the figures. α1 = 1 α2 = 0.3 β1 = 1 β2 = 0.3 σ1 = 1 σ2 = 0.3 v = 0.1 A0 = 1 G0 = 1 R = rc,t + ra,t + rg,t= 1000 ε = 0.01 γ = 0.06 τ = -6 δ = 0.01 µ = 0.5 For λ we choose ν = 5 and η = 10. For ϑ we choose ν = 10 and η = 5. For ξ we choose ν = 12 and η = 5 and multiply all values of xt by 1.2 so that the support for the distribution that determines ξ is [0,1.2]. 31 TABLE 3 NUMERICAL SIMULATION OF THE MULTI-GPT MODEL The following are the parameter values used to simulate the results of the multi-GPT model as reported in the text and shown in the figures. α j = 1 ∀j ∈ (1, J ) αJ+1 = 0.3 β k = 1 ∀k ∈ (1, K ) βK+1 = 0.3 σ x = 1 ∀x ∈ (1, X ) A0k = 1 ∀k ∈ (1, K ) σX+1 = 0.3 v=1 G0x = 1 ∀x ∈ (1, X ) G0 = 1 ε = 0.01 δ = 0.01 γ = 0.07 τ = -6 µ = 0.5 ϕ j = 1 ∀j ∈ (1, J ) J K X j =1 k =1 x =1 R = ∑ rt j + ∑ rt k + ∑ rt x = 1000 For the λs we choose ν = 5 and η = 10. For the ϑs we choose ν = 10 and η = 5. For ξ we choose ν = 12 and η = 5 and multiply all values of xt by 1.2 so that the support for the distribution that determines ξ is [0,1.2]. For pi we choose n = ν = 100. The calibration of the νs is discussed in the section “The Relation Among GPTs”. Some Illustrative Simulations The model has real potential, both to simulate interesting theoretical possibilities and to track real data by using empirical evidence to select the many parameters that we have chosen arbitrarily for purposes of illustration. Here we present two sets of crude simulation results just to illustrate what can be done with the model. Like most discrete time models, abrupt changes in the slopes of continuous series for levels cause occasional discontinuous jumps in growth rates. These do not mirror anything found in reality. In fully developed models these changes need to be smoothed in the way that we have smoothed the transition from one GPT to another through our logistic diffusion that slowly transfers a GPTs potential to raise productivity into actual increases. This is a modelling exercise that we have not yet completed, so for the moment, we accept these discrete blips in the growth rate as artefacts of the discrete-time model and concentrate on the rest of the time periods. 32 Single GPTs and Macro Behaviour: To demonstrate that the evolution of a single GPT does not determine the evolution of the entire economy, we present the growth rate of aggregate consumption output (i.e., the growth rate of consumption output aggregated across all J consumption industries) and compare this to the growth rates of pure knowledge in each of the K applied R&D industries. The simulation results are taken from a model in which J=3, K=3 and X=3. We look at the GPTs in each applied R&D industry, rather than in some aggregated way, because not all industries are using the same vintage of the three types of GPTs at each moment in time and it is only when these GPTs are adopted in a given applied R&D industry that they become productively useful. Figures 1-3 show the growth rate of aggregate consumption output and the growth rates of the pure knowledge stocks used in each of the three applied R&D industries k = 1,2,3. In all three figures, the spikes in growth rates indicate the adoption of a new vintage of the indicated type of GPT. It is obvious from a comparison of the these figures that new vintages are not typically adopted at the same time in all industries. For example, GPT type 1 is adopted in industry 1 around period 33, in industry 3 around period 42, and in industry 2 not until about period 49. It is obvious from the figures that the growth of aggregate consumption is influenced in all cases by the discreet jump caused by the arrival of the GPT. But two other things are also obvious. First, the overall aggregate consumption growth rate does not track any one of the growth rates of the three types of pure knowledge stock. Second, leaving the artificial spikes aside, there is no consistent behaviour in the aggregate consumption data several periods before and after the entry of a new vintage of any of the GPTs that would allow one to identify that entry. This simulation in the multi GPT, multi-applied industry and multiconsumption industry model shows that the arrival and adoption of any given GPT does not determine the aggregate evolution of growth or output. So analysis of aggregate variables will not necessarily indicate the arrival dates nor the impact of important individual GPTs. This is only the beginning of this type of analysis, but it does suggest caution at least until further study is completed in assessing the attempts by economic historians to judge the existence and importance of GPTs from a consideration of aggregate data for GDP or TFP. 33 Figure 1. Growth rates for aggregate consumption and pure knowledge stocks used in applied R&D k = 1 2.00E-01 1.50E-01 Aggregate Consumption GPT Type1 1.00E-01 GPT Type 2 GPT Type 3 5.00E-02 0.00E+00 1 5 9 13 17 21 25 29 33 37 41 45 49 Time Figure 2. Growth rates fpr aggregate consumption and pure knowledge stocks used in applied R&D industry k = 2 2.00E-01 1.50E-01 Aggregate consumption GPT Type 1 GPT Type 2 GPT Type 3 1.00E-01 5.00E-02 0.00E+00 1 5 9 13 17 21 25 29 33 37 41 45 49 Time 34 Figure 3. Growth rates for aggregate consumption and pure knowledge stocks used in applied industry k = 3 2.00E-01 1.80E-01 1.60E-01 1.40E-01 1.20E-01 1.00E-01 8.00E-02 6.00E-02 4.00E-02 2.00E-02 0.00E+00 Aggregate Consumption GPT Type 1 GPT Type 2 GPT Type 3 1 5 9 13 17 21 25 29 33 37 41 45 49 Different Selection Criteria: Another illustrative simulation shows that different selection criteria matter for the overall growth performance of the system. We define two selection criteria for modelling resource allocation across the various sectors and their industries. The first adopts new allocations for resources generated by mutating the previous period’s allocation of resources if a five-period moving average of the aggregate applied knowledge stock is larger than the same five-period moving average lagged by one period. The second selection criteria adopts the new allocation of resources if a five-period moving average of aggregate consumption is higher than the same five-period moving average lagged by one period. In order to do comparative dynamics, we use the same initial values and set the realizations of all random variables to be identical for the two different simulations runs. The differences in the behaviour of aggregate consumption is thus due only to differences in the selection criteria. The following comparisons use the J = 2, K = 2, and X = 2 version of the multi-GPT, multi-applied and consumption industry model. Figure 4 shows the aggregate level of consumption output for the two different selection criteria. 35 Figure 4 Aggregate Consumption with the indicated selection criteria 4.50E+08 4.00E+08 3.50E+08 3.00E+08 2.50E+08 2.00E+08 1.50E+08 1.00E+08 5.00E+07 0.00E+00 Applied Knowledge Stock Consumption Level 1 20 39 58 77 96 115 134 153 172 191 Time Clearly, the selection criteria that uses a five-period moving average for aggregate consumption performs better than the selection criteria based on a five-period moving average of the aggregate applied knowledge stock. Note that we define “better” as what achieves the higher level of consumption output because we assume that increases in consumption output are the desired growth outcomes in the model. This result is suggestive if we think of the private sector determining the allocation of resources to the consumption industries and the public sector determining the allocation to research through direct spending and tax incentives. If the public sector determines its allocation based on the behaviour of the applied R&D sector it will do worse that if it uses the behaviour of total output (GDP) as its guide. Of course, this is a very simple model and only a crude illustrative application, but does it suggest the kind of issues that can be investigated with a model as flexible as this one. 36 Appendix 1 With the appropriate time lags in place the Bellman equation for the single-GPT model is, V ( At , Gt , t )= max ct + ρ E [V ( At +1 , Gt +1 , t + 1) ] + ρ 2 E [V ( At + 2 , Gt + 2 , t + 2) ] {rc ,t ,ra ,t ,rg ,t } s.t. R = rc ,t + ra ,t + rg ,t ct = ( µ At −1 )α1 rcα,t2 at = ν ( Gt −1 ) 1 raβ,t2 β At = at + (1 − ε ) At −1 g t = ( (1 − µ ) At −1 ) 1 rgβ,2t β Gt = gt + (1 − ε )Gt −1 where the upper bars indicate expected rather than the actual values of gt and Gt, And these expectations are adaptive. This is a complicated problem in two dimensions of state variables. Normally such a system would be solved to satisfy the maximum principle and the transversality condition (or it discrete time analogue) which would yield a stationary dynamic equilibrium. We, however, do not want a stationary equilibrium, nor do we want the behaviour of the model to act under full information. We want the agents to be exposed to uncertainty in the form of incomplete information about the distributions from which our random variables are drawn. As such we violate the Maximum principle and transversality conditions. We also make simplifying assumptions about the time lag structure in our model to make its exposition easier. 37 BIBLIOGRAPHY Carlaw, Kenneth I. and Richard G. Lipsey (2001), ‘Externalities versus Technological Complementarities: A Model of GPT-driven, Sustained Growth’, A Paper Presented at The Conference in Honour of the 20th Anniversary of Nelson and Winter’s Book An Evolutionary Theory Of Economic Change, Albert Denmark, 12-15 June 2001. __________________________________(2006a ), “GPT-Driven Endogenous Growth”, Economic Journal, 116, pp. 155-74. ___________________________________(2006b) Bridging The Macro-Micro Gap In Evolutionary Modelling With A Top-Down Approach Foster, John (1987) Evolutionary Macroeconomics (London: Allen and Urwin). Freeman, Christopher and Francisco Louçã (2001), As Time Goes By: From the Industrial Revolutions to the Information Revolution, (Oxford: Oxford University Press). Freeman, Christopher and Perez, Carlota (1988), “Structural Crises of Adjustment: Business Cycles and Investment Behaviour”, in Dosi et al. (1988), 38–66. Lipsey, Richard G. and Kenneth I. Carlaw (2004), “Total Factor Productivity and the Measurement of Technological Change”, in The Canadian Journal of Economics, November 2004, Vol 31 #4, 1118-1150. Lipsey, Richard G., Kenneth I. Carlaw and Clifford T. Bekar (2005), Economic Transformations: General Purpose Technologies and Long Term Economic Growth, (Oxford: Oxford University Press)