Proceedings of the Seventieth European Study Group with Industry Limerick, Ireland, June 28th − July 3rd 2009 Editors: Stephen O’Brien Martina O’Sullivan Peg Hanrahan William Lee Joanna Mason Jean Charpin Marguerite Robinson Andrei Korobeinikov ii Contents Contents iii Preface v List of delegates vii List of problems xv Partial Wetting Phenomenon in Superhydrophobic Microchannels xvii The Effect of Mechanical Loading on the Frequency of an Oscillator Circuit xix Designing a Green Roof for Ireland xxi Uplift Quadratic Program in Irish Electricity Price Setting Particle Impact Analysis of Bulk Powder During Pneumatic Conveyance Improvement of Energy Efficiency for Wastewater Treatment xxiii xxv xxvii Initiation of Guinness xxix Solar Reflector Design xxxi iii Technical reports 1 Partial Wetting Phenomenon in Superhydrophobic Microchannels 3 The Effect of Mechanical Loading on the Frequency of an Oscillator Circuit 29 Designing a Green Roof for Ireland 45 Uplift Quadratic Program in Irish Electricity Price Setting 65 Particle Impact Analysis of Bulk Powder During Pneumatic Conveyance 85 Improvement of Energy Efficiency for Wastewater Treatment 107 Initiation of Guinness 141 Solar Reflector Design 183 iv Preface The second European Study Group with Industry (ESGI) workshop in Ireland was held by the Mathematics Applications Consortium for Science and Industry (MACSI) in the Department of Mathematics and Statistics at the University of Limerick, Ireland, from 28th June to 3rd July 2009. The 70th ESGI workshop was opened by Professor Brian Fitzgerald, the Vice President of Research at the University of Limerick. The ESGI workshop followed the usual format of presentations from industry on the Monday with intensive working sessions by leading applied mathematicians from across Europe throughout the week and concluded with presentations from the academics on their recommendations to the companies. Delegates were free to choose which problem or problems they contributed to; each problem was assigned a work room as a base for the week. Some participants opted to stay in one room throughout the week, contributing to their problem right through from defining the problem, designing the model, finding and analysing results and writing the presentation and report. Others moved between each room giving fresh insight to all problems, commenting on the approaches adopted and making suggestions for improvements. Postgraduates often find it difficult to contribute to the early modelling stages when the speed of discussions can be very fast and are in areas of mathematics and applications outside of their immediate field of expertise. This usually changes throughout the week when they are able to work on analysis and numerical simulations and contribute to the presentations and reports. For academics, study groups provide an excellent opportunity to work on industrial mathematics problems outside of their usual areas of research with researchers from postgraduates to professors. The highly interactive nature of these events allows participants to exchange ideas and form long lasting collaborative links with other academics and companies to a much greater extent than with a regular conference. For industry, study groups offer the opportunity of calling on the combined and varied expertise of leading specialists in industrial mathematics to make significant progress on problems in a short space and time. They kick start the research process and provide answers quickly at minimal cost. v Over 90 participants registered for ESGI 70 from Ireland, England, Scotland, Denmark, Poland, Spain, Sweden, Germany and USA. All of the MACSI postgraduates and postdoctoral fellows attended the study group and contributed to work on the problems. The problems were varied and were largely representative of the type of industrial mathematics that usually appears in study groups across Europe. Events throughout the week included a guest lecture on Thursday morning by Professor Chris Budd on Geometric integration: how to compute solutions of differential equations which have the right qualitative features. The MACSI study group was a resounding success. Excellent feedback was received from both university researchers and industrial representatives. All of the companies involved have indicated their strong intention to continue work on the problems with MACSI and action plans are already in place to take the projects forward. Thank you to all of the participants in ESGI 70, who made it an enjoyable and successful week of collaborative problem solving. These proceedings illustrate the results of all their expertise, hard work, energy and enthusiasm. We look forward to seeing you at next year’s Irish Study Group! The sponsors of the ESGI70 were: • Science Foundation Ireland (SFI) • National Development Plan (NDP) Ireland MACSI would like to thank the sponsors without whom the European Study Group with Industry workshop could not have taken place. MACSI is funded by the Science Fundation Ireland Mathematics Initiative grant 06/MI/005. vi List of delegates Academics and Post-Docs Adley, Catherine - Department of Chemical and Environmental Science, University of Limerick, Ireland. Bell, Christopher - Department of Bioengineering, Imperial College London, United Kingdom. Borg, Karl - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Budd, Chris - School of Mathematical Sciences, University of Bath, United Kingdom. Burke, Mark - Department of Mathematics and Statistics, University of Limerick, Ireland. Chapwanya, Michael - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Charpin, Jean - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Cooker, Mark - School of Mathematics, University of East Anglia, United Kingdom. Dellar, Paul - OCIAM, Mathematical Institute, University of Oxford, United Kingdom. Doherty, Kevin - National University of Ireland Galway, Ireland. Enright, Ryan - Stokes Institute, University of Limerick, Ireland. Evatt, Geoff - University of Manchester, United Kingdom. vii Fowler, Andrew - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Gaburro, Romina - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Gleeson, James - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Hayes, Martin - Department of Electronic and Computer Engineering, University of Limerick, Ireland. Hegarty, Alan - Department of Mathematics and Statistics, University of Limerick, Ireland. Hjorth, Poul - Department of Mathematics, Technical University of Denmark, Denmark. Houghton, Conor - Department of Mathematics, Trinity College Dublin, Ireland. Jabłońska, Matylda - Department of Mathematics, Lappeenranta University Finland, Finland. Kinsella, John - Department of Mathematics and Statistics, University of Limerick, Ireland. Kirwan, Laura - Agri-Environment Research Department, Teagasc Environment Research Centre, Ireland. Kopteva, Natalia - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Korobeinikov, Andrei - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Korostynska, Olga - Microelectronic and Semiconductor Research Centre, University of Limerick, Ireland. Lacey, Andrew - Department of Mathematics, University Heriot-Watt, United Kingdom. Lee, William - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Lessels, Gordon - Department of Mathematics and Statistics, University of Limerick, Ireland. Linss, Torsten - Department of Mathematics, Czech Academy of Sciences, Dresden, Germany. viii Lukaschuk, Sergei - Department of Engineering, University of Hull, United Kingdom. Mackey, Dana - School of Mathematical Sciences, Dublin Institute of Technology, Ireland. Mason, Joanna - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. McGuinness, Mark - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland / School of Mathematics, Statistics and Operations Research, Victoria University of Wellington, New Zealand. Meere, Martin - Department of Applied Mathematics, National University of Ireland Galway, Ireland. Mitchell, Sarah - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. O’Brien, Stephen - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. O’Sullivan, Catherine - Department of Civil and Environmental Engineering, Imperial College, United Kingdom. O’Sullivan, Martina - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Piiroinen, Petri - Department of Applied Mathematics, National University of Ireland Galway, Ireland. Power, Oliver - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Rachinskii, Dmitrii - Department of Applied Mathematics, University College Cork, Ireland. Rebrova, Natalia - Tyndall Institute, Ireland. Richardson, Giles - School of Mathematics, University of Southampton, United Kingdom. ix Robinson, Marguerite - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Ryan, Gearóid - School of Mathematical Sciences, University College Cork, Ireland. Schwartz, Len - Department of Mechanical Engineering, University of Delaware, United States of America. Soussi, Sofiane - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Vynnycky, Michael - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Ward, Jonathan - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Waters, Thomas - Department of Applied Mathematics, National University of Ireland Galway, Ireland. Wilson, Eddie - Department of Engineering Mathematics, University of Bristol, United Kingdom. Zhelev, Toshko - Department of Chemical and Environmental Science, University of Limerick, University of Limerick, Ireland. Postgraduate students Carey, Michelle - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Cregan, Vincent - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Fay, Gemma - OCIAM, Mathematical Institute, University of Oxford, United Kingdom. Gonzalez, Maria - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. x Hanley, Kevin - College of Science, Engineering and Food Science, University College Cork, Ireland. Healy, Timothy - University College Cork, Ireland. Hewitt, Ian - OCIAM, Mathematical Institute, University of Oxford, United Kingdom. Hurley, Julie - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Lapin, Vladimir - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Liu, Yupeng - School of Mathematical Sciences, Dublin Institute of Technology, Ireland. Mellgren, Niklas - Department of Mechanics, KTH Royal Institute of Technology, Sweden. Murphy, Thomas - Department of Chemical and Environmental Science, University of Limerick, Ireland. O’Brien, Mick - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. O’Donovan, Celine Kemmy Business School, University of Limerick, Ireland. O’Keeffe, Niall - Stokes Institute, University of Limerick, Ireland. Pimenov, Alexander - College of Science, Engineering and Food Science, University College Cork, Ireland. Rojas, Jaime - Department of Chemical and Environmental Science, University of Limerick, University of Limerick, Ireland. Sedakov, Roman - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Simboeck, Hildegard - University of Limerick, Ireland. Tuoi, Vo Thi Ngoc - MACSI, Department of Applied Mathematics, National University of Ireland Galway, Ireland. Viscor, Martin - MACSI, School of Mathematical Sciences, University College Cork, Ireland. xi Wang, Dengli - Business School, Dublin City University, Ireland. Wang, Qi - School of Mathematical Sciences, Dublin Institute of Technology, Ireland. Winstanley, Henry - Mathematical Institute, University of Oxford, United Kingdom / MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Witheephanich, Kritchai - Wireless Access Research Centre, University of Limerick, Ireland. Zubkov, Vladimir - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Undergraduate students Bandeira, Afonso - Universidade de Coimbra, Portugal. Delaney, Niamh - University of Limerick, Ireland. Ganchev, Galin - University of Limerick, Ireland. Kirby, Grainne - University of Limerick, Ireland. Mayrhofer, Arno - University of Limerick, Ireland. McCarthy, Stephen - University College Cork, Ireland. Menzies, Colin - University of Limerick, Ireland. Zhezherun, Andrey - University College Cork. Industrial representatives and guests Bourke, Sile - Bord Gáis Energy, First Floor (BGE), City Quarter, Lapps Quay, Cork, Ireland Browne, Conor - Diageo Ireland, St. James’s Gate, Dublin 8, Ireland Carnus, Tim - Teagasc, Johnstown Castle, Co. Wexford, Ireland Healy, Sandra - Analog Devices, Raheen Industrial Estate, Raheen, Limerick, Ireland xii Hoolan, Sean - Erin Energy Ltd, Applied Research into Solar Technologies, MIRC Building, Athlone Institute of Technology, Dublin Road, Athlone, Co Westmeath, Ireland Hurley, Gavin - Bord Gáis Energy, First Floor (BGE), City Quarter, Lapps Quay, Cork, Ireland Jordan, Rory - Science Foundation Ireland, Wilton Park House, Wilton Place, Dublin 2, Ireland. Kirwan, Billy - GFM Systems, Waterway House, Crag Crescent, Clondalkin Ind. Estate, Dublin 22, Ireland O’Hara, John - Landtech Soils Ltd, 20 Kenyon Street, Nenagh, Co. Tipperary, Ireland O’Mahony, Seamus - Wyeth Nutritionals Ireland, Askeaton, Co. Limerick, Ireland O’Mara, Brian - Analog Devices, Raheen Industrial Estate, Raheen, Limerick, Ireland Swallow, Richard - VGraph UK Ltd, 28 Hudson Way, Radcliffe On Trent, Nottingham, NG12 2PP, United Kingdom xiii Organising committee O’Brien, Stephen - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. O’Sullivan, Martina - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Hanrahan, Peg - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Lee, William - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Mason, Joanna - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Charpin, Jean - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Robinson, Marguerite - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. Korobeinikov, Andrei - MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland. xiv List of problems xvi Partial Wetting Phenomenon in Superhydrophobic Microchannels Industrial Partner: Stokes Institute, University of Limerick Industry representatives • Ryan Enright, Stokes Institute, University of Limerick, Ireland. • Niall O’Keeffe, Stokes Institute, University of Limerick, Ireland. • Cormac Eason, Stokes Institute, University of Limerick, Ireland. Description of the problem Superhydrophobic (SH) surfaces are a class of materials that display extreme wetting behaviour that can result in almost spherical droplets, i.e. the contact angle, theta>180deg, when a liquid is deposited on such a surface. This behaviour is a direct result of a favourable surface potential combined with surface micro/nano-structure and was famously described by Cassie and Baxter [1]. These surfaces have received much attention in the last decade due to the proliferation of microfabrcation techniques allowing large areas of geometrically well defined surface structures to be produced. One application of SH surfaces is as liquid flow boundaries in microfluidics. It has been shown analytically, numerically, and experimentally that SH surfaces can produce an apparent slip effect due to reduced liquid/solid interaction and the presence of a trapped gas/vapour layer at the SH surface. This segregated two-phase flow exists due to surface tension and the stability of the flow structure is controlled primarily by the geometry of the SH surface structures. Energybased calculations performed assuming equilibrium conditions provide criteria for the stability of the wetting regime and have been validated experimentally in droplet experiments. However, recent experimental investigation performed by our group have demonstrated the existence of xvii a ‘partial’ wetting state on pillar-structured SH surfaces that is not predicted by equilibrium analysis. The partial wetting state has been observed using direct imaging of the SH surfaces located in a Hele-Shaw microchannel using laser-scanning confocal microscopy and shows that the gas/liquid/solid contact line forms some distance from the top of the pillar structures upon filling of the microchannel. Investigation of the pillar structures using SEM has shown that the pillar sides are not smooth, but instead demonstrate scallop features, which are a result of the fabrication process. These scallop features represent contact-line pinning sites that are not captured in existing energy calculations, which can explain why the pillars remain partially wetted. However, it is unknown why this partial wetting phenomenon should occur in the first place and it does not appear to have been described in the literature prior to this work. An understanding of possible mechanisms, developed through modelling, is of particular interest for informing the design of future devices. [1] A.B.D. Cassie and S. Baxter. Wettability of porous surfaces. Trans. Faraday Soc., 40:546551, 1944. xviii The Effect of Mechanical Loading on the Frequency of an Oscillator Circuit Industrial Partner: Analog Devices Industry representatives • Sandra Healy, Analog Devices, Raheen Industrial Estate, Raheen, Limerick, Ireland. Description of the problem A particular type of silicon chip produced by Analog Devices is loaded with an arbitrary force of the order of 100N during testing. A key component in the chip is an oscillator circuit whose frequency must reach a 1% accuracy specification over a temperature range of -40-105řC. At present the oscillators do not meet the specification during the loading test. We suspect that during the test, deformation of the chip leads to a piezo-resistive effect that alters the oscillator frequency. This problem has come to light during a critical stage in production and a solution is required urgently. Redesigning the chip is not an option at this stage in the development, however the testing configuration may be altered to achieve the required specification. On a much longer timescale, a better understanding of the mechanics of the chip during loading will aid improvements in future designs to prevent this problem from occurring again. Objectives There are two main goals for this project: • Develop a mechanical model of the chip under loading, • Relate the deformation of the chip to the oscillator frequency. xix Further Information Chip Geometry A detailed diagram of the chip geometry will be supplied. There are two silicon wafers (or die) in the chip, the smaller of which sits on top of the larger. These are cast in a plastic mould with connector pins on the base. Testing Rig An example of the testing rig will be brought to the study group and details of its design (including material properties) will be supplied. The chip is placed in a well in the plastic rig and rests on metal pins that contact the connectors on the base of the chip. A plunger is then pressed against the top of the chip and force is applied via a screw mechanism. The size of this force is not known and may vary from test to test. Preliminary experimental data consisting of coarse measurements of applied force versus oscillator frequency will be available. Details of the plunger head design will also be supplied. Oscillator Circuit The oscillator circuit consists of resistors, inductors and capacitors. These components are built up in thin layers inside the silicon wafer. xx Designing a Green Roof for Ireland Industrial Partners: Landtech Soils Ltd and GFM Systems Industry representatives • Catherine Adley, Department of Chemical and Environmental Science, University of Limerick, Ireland. • Billy Kirwan, GFM Systems, Waterway House, Crag Crescent, Clondalkin Ind. Estate, Dublin 22, Ireland. • Landtech Soils Ltd, 20 Kenyon Street, Nenagh, Co. Tipperary, Ireland. Description of the problem Green roofs are becoming increasingly popular in many countries. The direct benefit of a green roof is it provides an environmentally friendly form of roof insulation for homes and commercial buildings. They also provide significant indirect benefits, such as contributing to: increasing biodiversity and wildlife; water management; reducing air pollution and the urban heat island effect; and, most importantly, they act to recycle carbon dioxide. Aesthetically a green area is pleasing and relaxing. The development of an optimised green roof for Ireland is part of a collaborative research programme in UL. The task to be addressed by the study group is to develop a mathematical model of a green roof focusing on energy and water balances. Energy balances determine the effectiveness of the roof as an insulator, while getting the drainage properties of the roof correct is essential to keeping the integrity of the roof structure; preventing water leakage to the interior; and managing water availability for plant growth and sustenance. This model will be used to help design the experimental phase of the research project by indicating the most important parameters to be measured and, once the model is fully parameterised, used to determine an optimal design. xxi The model must contain descriptions of the following: • The Irish climate: inputs will include sunlight and rainfall. • Energy exchange with the house below. • The biota (plants, microbes and fungi) living on the roof - one aspect of optimal roof design will be choosing the right combination of biota. • The properties of the layers making up the roof: typically soils, plant mix drainage layers, root barriers and waterproof sheeting. xxii Uplift Quadratic Program in Irish Electricity Price Setting Industrial Partner: Bord Gáis Energy Industry representatives • Gavin Hurley, Bord Gáis Energy, First Floor (BGE), City Quarter, Lapps Quay, Cork, Ireland. • Sile Bourke, Bord Gáis Energy, First Floor (BGE), City Quarter, Lapps Quay, Cork, Ireland. Description of the problem Energy prices in the Irish Electricity Market are composed of a Shadow Price and an Uplift Component. The Shadow Price is the cost of meeting a marginal unit of demand, as determined by solving the Unit Commitment Problem. The Unit Commitment Problem is a Mixed Integer Program which determines which units (generators) to commit (turn on), so as to minimize the total system cost of generation, subject to meeting demand in each period and respecting technical characteristics of the units (e.g. ramp-rates, min on/off times, minimum generation). Participants declare their true marginal costs and therefore the Shadow Price alone does not guarantee that generators will recover their fixed running costs, e.g. start-up and no-load. The Uplift component of prices addresses this. It is derived via a Quadratic Program (QP) which ensures that the generators recover those costs. The QP aims to minimize a parameterized combination of • total additional cost due to Uplift • volatility of Uplift xxiii The focus of this study will be on the Uplift Quadratic Program and approaches may include • Analysis of necessary conditions for optimality (Karush-Kuhn-Tucker conditions) • Analysis of Lagrangian multipliers on the constraints • Analysis of market data via MATLAB implementation • Study of impact of different parameterizations • Data mining on ultimate drivers of uplift, e.g. demand, wind variability, etc. We will be able to bring extensive market data and also a MATLAB implementation of the quadratic program described. xxiv Particle Impact Analysis of Bulk Powder During Pneumatic Conveyance Industrial Partners: UCC and Wyeth Industry representatives • Kevin Cronin, University College Cork (UCC), Ireland. • Seamus O’Mahony, Wyeth Nutritionals Ireland, Askeaton, Co. Limerick, Ireland. Description of the problem Particle Impact Analysis In many of Ireland’s process industries, the product is manufactured and conveyed in powder form. This is especially true of food and pharmaceutical materials. During processing and handling, the particles are usually immersed in a flowing gas (air) stream. One important aspect of the behaviour of these products is their response to impact. This can be impact against the walls of the process equipment or impact against each other. Particle geometry is generally (approximately) spherical. Size as measured by diameter is dispersed but is usually within the range of 10 microns (0.01 mm) up to 1000 microns (1 mm). An individual spherical particle can either be solid or hollow and if hollow the wall of the sphere may be solid or porous. In some cases, the bulk product consists of billions of these individual particles. In other cases, the powder is in an agglomerated form as granules where each granule consists of between 10 and 100 individual particles bonded together. The actual micro-structure of the material can be complex; for powders of biological origin (such as foods or bio-active chemicals), the structure consists of some matrix of protein, sugar and fat. However we usually take the structure to be homogeneous. xxv At UCC, we analyse these particles in a number of different systems such as fluidised bed granulation, spray drying, rotary drum processing, pneumatic conveying, etc. For some experiments we use real materials or powders while for others we used model materials such as glass beads (corresponding to hard elastic solid behaviour) or plasticine balls (soft, visco-elastic behaviour). In certain industrial systems, the rheology of the particles changes during processing from soft to hard as a result of heat and mass transfer. In addition, sometimes the particle is coated with a layer of viscous liquid which affects its impact behaviour. A standard or reference case that we wish to understand is the normal impact of two spherical particles (not granules) of different diameters with dry surfaces. Both particles are moving in a gas stream of known density and viscosity. The collision will be defined by a velocity dependent coefficient of restitution. We wish to know the force versus deflection behaviour, particularly the maximum force that is achieved, that occurs during the impact. Knowledge of the force enables us to determine whether the collision will result in rebound, coalescence or breakage of the particles. If the two particles are considered as solid elastic spheres, then Hertzian contact stress provides an analytical expression for the force that is developed. However once the spheres are hollow (even if they are still elastic), then we cannot find a solution. xxvi Improvement of energy efficiency for wastewater treatment Industrial Partner: Department of Chemical and Environmental Science, University of Limerick Industry representative • Toshko Zhelev, Department of Chemical and Environmental Science, University of Limerick, Ireland. Description of the problem The autothermal thermophilic aerobic digestion process for wastewater treatment uses the heat generated by thermophilic bacteria to sterilise wastewater. These bacterial require high temperatures (60 -65 degrees centigrade) and oxygen to operate most efficiently. During (batch) processing of wastewater the temperature in the treatment tank is regularly lowered below this temperature by the introduction of new material. Restoring the tank to an optimal temperature is a time and energy consuming process requiring increased aereation over several hours. There are two parts to this study group problem. The first part is the formulation of a mathematical model of the treatment process. The second part is to use this model to develop an optimised plant design and treatment strategy. The model must be able to describe changes to the plant and the way it is operated, for instance: • Using heat exchangers to increase the temperature of pretreated material using heat recovered from treated material, • Changing the feeding pattern, • Changing the size or number of reactors. xxvii The optimisation problem must take include the following aspects of the problem: • Efficiency criteria (contains energy usage, processing time and sterrilisation), • Practicality constraints, • The nature of the solution space, which contains both discrete variables (due to the batched nature of the processing) and continuous variables, • Account for problem complexity: formulation of problem of parametrical optimisation expanded to structural-parametrical optimisation (optimum regime parameters and process flow structure optimisation = number and size of reactors, number and sizes of heat exchangers). xxviii Initiation of Guinness Industrial Partner: Diageo Industry representatives • Conor Browne, Diageo Ireland, St. James’s Gate, Dublin 8, Ireland. • Richard Swallow, VGraph UK Ltd, 28 Hudson Way, Radcliffe On Trent, Nottingham, NG12 2PP, United Kingdom. Description of the problem Unlike carbonated beers which foam spontaneously, Guinness contains a mixture of nitrogen and carbon dioxide which will not spontaneously exsolve. Diageo use various strategies to promote foaming. In canned and bottled Guinness a pressurised plastic container with a small hole in it, known as a widget, is used to trigger exsolution. The inclusion of the widget adds a small but significant cost to the manufacture of the product which Diageo wishes to minimise. To assist with this, the study group is asked to develop a mathematical description of the way the widget promotes foaming, possibly taking the bottle widget as an specific example. Ideally this model should be sufficiently general to be able to shed light on other methods of foam initiation such as ultrasound or nucleating surfaces. xxix xxx Solar Reflector Design Industrial Partner: Erin Energy Industry representative • Sean Hoolan, Erin Energy Ltd, Applied Research into Solar Technologies, MIRC Building, Athlone Institute of Technology, Dublin Road, Athlone, Co Westmeath, Ireland. Description of the problem Objective To develop a design methodology for solar tracking reflector curve generation that will deliver optimal energy capture, such that a low absorber/reflector ratio is achieved within a shallow construction. Parameters Absorber/reflector aperture ratio. Currently, absorber elements are more expensive than the reflector elements. Therefore the aim is to have a low ratio of absorber area to that of the reflector and thus achieve an optimal cost/performance balance point. Distance between absorber centres. • The optimal pitch between absorbers. There are several examples with the absorbers in close proximity to each other, thus casting a shadow over each other reducing the unit effectiveness. • Impacts on the relative height of the mirror wall relative to the absorber height. (Better to lower mirror wall height so as to allow direct hit on adjacent absorber.) xxxi Depth of curve/reflector. Surface mount markets require that the depth of the reflector curve plus the absorber must be maintained as shallow as possible. Incident angle. The greater the acceptance angle, the more of the daily yield will be captured. xxxii Technical reports 2 Partial Wetting Phenomenon in Superhydrophobic Microchannels Report Contributors: Karl Borg1 , Vincent Cregan1,2, Andrew Fowler1 , Mark McGuinness1,3, Stephen B.G. O’Brien1 , Leonard W. Schwartz4 and Vladimir Zubkov1 Industry Representatives: Niall O’Keefe5, Ryan Enright5 and Cormac Eason5 1 MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland Report coordinator, vincent.cregan@ul.ie 3 School of Mathematics, Statistics and Operations Research, Victoria University of Wellington, New Zealand 4 Department of Mechanical Engineering, University of Delaware, United States of America 5 Stokes Institute, University of Limerick, Ireland 2 1 Introduction Superhydrophobic surfaces are a class of materials that display extreme wetting behaviour that can result in almost spherical liquid droplets. For superhydrophobic surfaces the contact angle between the liquid and the surface may be much greater than 90o . This extreme wetting behaviour is a direct result of a favourable surface potential combined with surface micro/nanostructures. A high degree of surface roughness can lead to a significant increase in the contact angle compared to a smooth surface[1]. Figure 1: Overview of various liquid drop orientations on a surface. (a) Lotus leaf (b) Water strider Figure 2: Examples of superhydrophobic surfaces in nature. The leaves of the lotus plant[2] and the lady’s mantle[3] are well-known examples of superhydrophobic surfaces in nature. Microstructues on the surface of the lotus leaf allows the leaf 4 to be self-cleaning and strongly repellent to water droplets. Superhydrophobicity is not restricted just to the plant world. Water striders posses superhydrophobic legs which allow them to walk along the surface of water and the wings of many butterflies have superhydrophobic properties[3]. An understanding of superhydrophobic wetting behaviour is of huge importance in industry and the phenomenon has received much attention in the last decade due to the proliferation of microfabrication techniques allowing large areas of geometrically well defined surface structures to be produced. Superhydrophobic surfaces are used in the design of waterproof clothing raincoats, windscreens and satellite dishes. Superhydrophobic surfaces are currently playing a major role in the fabrication of silicon wafers[3] via fluidic self-assembly. Another application of these surfaces is as liquid flow boundaries in microfluidics. It has been shown analytically, numerically, and experimentally that superhydrophobic surfaces can produce an apparent slip effect due to reduced liquid/solid interaction and the presence of a trapped gas/vapour layer at the superhydrophobic surface. The Stokes Institute are currently using superhydrophobic surfaces in a number of their experiments. In one particular experimental process, a syringe pump is used to force deionised water through a parallel plate microchannel. The microchannel inlet and outlet are connected to two large fluid reservoirs. The upper wall is a flat no-slip surface while the lower wall is a superhydrophobic surface made up of small cylindrical pillars. (a) Microchannel schematic (b) Experimental apparatus Figure 3: Overview of experimental setup. The breadth, length and height of the microchannel is 10mm, 10mm and 80µm, respectively. Pillar dimensions vary depending on the type of experimental setup but for this report we will use 7.5µm and 25µm as values for the pillar radius and height, respectively. The distance between two pillars (centre to centre) is 20µm. The velocity of the water as it passes through the microchannel is 0.012ms−1 . [4] contains a detailed description of the experimental apparatus. As the water passes through the microchannel, air gets trapped between the pillars and the 5 water. This segregated two-phase flow exists due to surface tension. The stability of the flow structure is controlled primarily by the geometry of the superhydrophobic surface structures. Assuming equilibrium conditions, energy-based calculations performed provide criteria for the stability of the wetting regime and have been validated experimentally in droplet experiments. However, recent experimental investigation performed by Stokes have demonstrated the existence of a ‘partial’ wetting state on pillar-structured superhydrophobic surfaces that is not predicted by equilibrium analysis. The partial wetting state has been observed using direct imaging of the superhydrophobic surfaces located in a Hele-Shaw microchannel using laser-scanning confocal microscopy and shows that the gas/liquid/solid contact line forms some distance from the top of the pillar structures upon filling of the microchannel. Typically, the contact line has been found 10 − 15µm below the pillar tops. In addition, the height of the contact line varies from pillar to pillar. Investigation of the pillar structures using SEM (Scanning electron microscope) has shown that the pillar sides are not smooth, but instead demonstrate scallop features, which are a result of the fabrication process. These scallop features represent contact-line pinning sites that are not captured in existing energy calculations, which can explain why the pillars remain partially wetted. However, it is unknown why this partial wetting phenomenon should occur in the first place and it does not appear to have been described in the literature prior to Stokes’ experimental work. (a) Pillars on superhydrophobic surface (b) Pillar scallops (c) Confocal image of liquid interface in contact with pillars. Stokes are interested in mathematical modelling their experimental set-up and understanding 6 the physical mechanisms involved in their experiment. An understanding of these mechanisms, developed through modelling, is of particular interest for informing the design of future devices. The key problems to be studied may be summarised as follows: • Problem 1: Describe the mechanism behind liquid penetration of the pillars. What is the cause of the liquid going 10 − 15µm down along the pillars? Why does the liquid drop height vary from pillar to pillar? • Problem 2: How do the scallops affect the contact line pinning of the liquid? The modelling work performed during the study group is summarised in this report. The report is split into six main sections and suggestions for future work are found in the concluding section. The content of the sections may be summarised as follows: • Section 2: Review of possible surface wetting surface states • Section 3: Time scales involved in problem • Section 4: Height of water-air interface drop in contact with single cylindrical pillar • Section 5: Partial wetting phenomenon in superhydrophobic microchannels in two dimensions • Section 6: Poiseuille flow across pillars 7 2 Surface wetting states Young’s equation relates the thermodynamic equilibrium contact angle, θc of a liquid drop to the specific energies of the surface-liquid, surface-air and liquid-air interfaces[5] γSG = γSL + γLG cos θC γij denotes the specific energy between medium i and medium j. However, Young’s equation is restricted to smooth, flat surfaces. It was recognised early that surface roughness may lead to deviations in the contact angle predicted by Young’s equation. In the literature, Wenzel’s seminal work was to first to describe the impact of surface roughness and superhydrophobic wetting behaviour in the context of waterproofing of knitted fabrics[7]. Wenzel illustrated how a rough surface can increase the apparent contact angle at the boundary between a liquid and a surface. Wenzel related the standard Young contact angle, θC to the contact angle on the rough surface, θW by the formula cos θW = r cos θC where r is a roughness factor. When a liquid drop occupies the spaces between the surface projections, the drop is said to be in the Wenzel state. Figure 4: Overview of various liquid drop orientations on a superhydrophobic surface with topography. Cassie and Baxter extended Wenzel’s work by considering the wettability or water-repellency of porous surfaces such as natural and artificial clothing[8]. In the Cassie-Baxter (or Fakir) state, a liquid drop remains balanced on the surface projections with air trapped underneath. 8 The associated equilibrium condition is cos θCB = Φ (cos θC + 1) − 1 where Φ is the solid fraction of the surface. [6] contains a detailed description of the various wetting states associated with superhydrophobic surfaces. 9 3 Problem timescales In this section we discuss some of the timescales relevant to the problem. The Stokes Institute advise that it takes about one second for the water to traverse the top of the pillars, with an associated velocity of 10mm s−1 . It is of interest to consider how quickly air that is caught between the water and the bottom of the pillar array can escape through the pillar array. (a) (b) Figure 5: (a) As the water moves through the microchannel, air is free to move around the pillars and escape. (b) Air gets trapped between the water and the pillars once the water has passed completely through the microchannel. Treating the array as a porous medium, with a porosity of 0.6 calculated from the geometry, Darcy’s law gives the volume flux (ms−1 ) q= k ∇p µg where the dynamic viscosity of air is µg = 1.8 × 10−5 kgm−1 s−1 at 20◦ C, and the pressure difference used in the experiments to drive the water, taken to be also driving air flow, is typically about 100Pa over 10mm, so that ∇p ≈ 104 Pam−1 . The permeability is the most uncertain parameter. Here we will use two different ways to estimate k, permeability-porosity plots from hydrogeology and the more formal Carmen-Kozeny relationship. 10 Figure 6: A porosity-permeability plot, from the Bureau of Economic Geology, University of Texas at Austin. The lines indicate pore sizes. Using the permeability-porosity cross plot illustrated in figure (6), the relationship between porosity and permeability for rocks depends on the average pore dimension also. Using the line corresponding to a 20 µm pore size, a porosity of 0.6 is outside the data range, but looks to have a permeability that is k ≈ 2 − 5 × 10−12 m2 . An alternative approach is to use the Carman-Kozeny equation k= 1 φ3 D2p 72τ (1 − φ)2 where τ ≈ 2 is tortuosity, the square of the mean path length taken by the air over the total path length, and Dp is the particle diameter, 20µm. Using φ = 0.6 gives a similar value to the above one, k = 4 × 10−12 m2 . Using k = 4 × 10−12 m2 gives q ≈ 0.002 m s−1 The time for air to move across a 10mm array at this speed is about 5 seconds, comparable to 11 but a little slower than the time taken by the water. Another time scale of interest is the time taken for water to penetrate down the pillars, which may be estimated by considering the experimental results of Moulinet and Bartolo. Their experiment consisted of depositing a small droplet of water in the Fakir state on a superhydrohpobic surface and monitoring the droplet profile evolution. As evaporation of the droplet ensues the droplet radius decreases. The contact lines recedes and the transition from the Fakir state to Wenzel occurs in less than 20ms. This is relatively fast compared to the speed of the water and the air. The implication of these timescales is that it is possible for the water to trap air underneath it, as it passes over the array of pillars and then seals off the escape route at the exit. This motivates the more careful examination of the interaction between water and air that appears later in this report. 12 O 4 Liquid in contact with a single cylindrical pillar Consider a single cylindrical pillar and a liquid in contact with the curved surface of the cylinder. In particular we are interested in the shape of the liquid meniscus. The motivation behind considering this problem is that we can get an approximation for how much the water-air will drop in a distance of 10µm which is half the distance between the centre of the pillars in Stokes’ experimental setup. The radius of the cylinder, R is 7.5µm and the capillary length for water is 2.73 × 10−3 m. [10] and [11] derived asymptotic solutions to the Young-Laplace for the liquid profile in contact with a single cylindrical pillar. Of particular interest is the inner solution in [11] which is valid in an O(R) boundary layer near the cylinder. (a) (b) Figure 7: (a) Schematic of liquid meniscus. (b) Dimensionless plot of liquid height y against distance from cylinder centre, r. We multiply dimensionless quantities by R to convert to dimensional quantities. The contact angle was taken to be 170◦ . For the given parameters, the plot in figure 7 suggests that the water-air interface can drop approximately 12µm in a distance of 10µm. 13 5 Partial wetting phenomenon in superhydrophobic microchannels in two dimensions In this section we study a two-dimensional, Poiseuille flow through a microchannel. Via some geometrical arguments we establish a criterion for which the liquid will go down in between the pillars. Consider the profile of the liquid moving through the channel with velocity u at various instances in time, t0 , t1 , t2 and t3 . Figure 8: Liquid profile at various times as it flows through the microchannel. α and β are the liquid-surface contact angles related to the lower and upper channel surfaces, respectively. The equilibrium condition of the free surface is given by ∆p = γ 1 R where ∆p∗ is the pressure difference across the liquid-air interface and R is the radius of curvature. If we consider the circular liquid front in figure (9) and via some geometrical arguments it may be shown that the center of the circle in figure (9) is (x0 , y0 ) = (−R cos β, −R cos α) Hence, the resulting equation of the circle for the liquid front is (x + R cos β )2 + ( y + R cos α )2 = R2 Rewriting (5.1) for point C = (d, D), we obtain 14 (5.1) d = −R cos β + p R2 − ( D + R cos α )2 (5.2) Points A and B have coordinates A = (xA , 0) and B = (0, yB) where xA = −R cos β + R sin α, yB = −R cos α + R sin β (5.3) Figure 9: Schematic of liquid going through the microchannel. The condition necessary for the liquid front to move down the gap between the pillars is xA > d which may be rewritten using (5.2) and (5.3) to give d cos β − sin α < D 2 cos α (5.4) For the case of β = 105o and α = 105o , (5.4) yields d < 2.36D 15 (5.5) Result (5.5) gives us a condition for which the advancing liquid front will go down between the pillars. However, this result only applies to a two-dimensional system and it does not address the influence of the surrounding pillars on the liquid-air interface in three-dimensions. 16 6 6.1 Poiseuille flow across pillars Local equilibrium A mathematical model is now developed for a pressure driven liquid flow through the microchannel across the hydrophobic pillars behind the advancing liquid front. Air can escape between the pillars under the advancing liquid. Assuming that the liquid-air surface is at local equilibrium, we balance the pressure difference across the surface with the surface curvature to get p − Π = 2γκ (6.1) where p is the liquid pressure, Π is the air pressure, γ is the surface tension and κ is the mean curvature of the liquid-air interface. When the interface is in equilibrium a pressure difference is induced across the interface. We also assume that the liquid-air interface evolves in time as shown in the figure below. Figure 10: Possible liquid-air interface evolution at times T1 , T2 , T3 and T4 . h is the average height of the interface above the base of the pillars and D is the distance between the pillars. At times T1 and T2 , the liquid-air interface is pinned to top of the pillars at z = h0 . At time T1 the liquid pressure is less than the air pressure. Hence, the liquid-air interface is curved upwards. At some point the liquid pressure becomes greater than the air pressure and the interface becomes curved downwards (as shown at time T2 in figure (10)). 17 Increasing liquid pressure causes the interface to move down between the pillars until it reaches the bottom surface between the pillars. At time T4 the liquid-air interface is pinned to the bottom surface of the microchannel. 6.2 Local equilibrium in relation to theoretical function f(h) Assuming that the liquid-air interface progresses with time as shown in figure (10), we theorize that a suitable equilibrium condition for the interface is p−Π = γ f(h) D (6.2) where f(h) is an unknown function of h. With some suitable choice of geometry, interfaces with constant curvature and some physical arguments, the function f(h) can be estimated. Based on figure (10), a plot of f(h) against h will take the form Figure 11: General shape of function f(h). where h0 = (2.5 × 10−5 m) is the top of the pillar and h1 is near the bottom of the pillar. h1 is the point near the bottom of the pillar where the water-air interface goes from being concave to convex. Figure (11) is just a qualitative description of the function f(h). The main features of the function are briefly discussed here and a more detailed derivation of the function will be 18 outlined in future work. At time T1 , f(h) is negative because the pressure difference, p − Π is negative. The change of signs of f(h) at h0 and h1 are related to the change of signs in the curvature of the liquid-air interface. In the time interval from T2 to T3 , f(h) and p − Π reach their maximum value, β. This occurs while the liquid-air interface is concave and moving down the surface of the pillar. The function in figure (11) was derived via a combination of smooth approximations to the Heaviside function, H(h) of the form H(h) ≈ 1 1 + exp (−2 s h) (6.3) where s is a steepness factor used to vary the transition from T1 to T2 and T3 to T4 . Values of h0 , depend on the experimental setup and are easily obtainable. However, exact values of h1 are not directly available. Further experimental work by the Stokes institute is needed to attain a range of values for h1 . 6.3 Poiseuille flow We assume that the flow of the fluid through the microchannel takes the form of a Poiseuille flow. d is the height of the channel and l is the length of the channel. A pressure gradient in the x direction is set up in the microchannel due to the pump used in the experiment. Figure 12: Basic microchannel dimensions. Deoionised water passes through the microchannel. Water may be classified as a Newtonian, visocous, incompressible fluid. The motion of a Newtonian fluid can be described via the 19 Navier-Stokes equations. The Navier-Stokes equations are a dynamical statement of the balance of forces acting at any given region of a Newtonian fluid. The complexity of the Navier equations can be reduced significantly via lubrication theory to obtain the thin film equations. Lubrication theory is applicable to the flow of fluids in a geometry in which one length scale is significantly smaller than the other length scale. As the channel height, d is much smaller than the channel length, l, lubrication theory may be used. The resulting governing equation for the velocity in the x direction, u is ∂p ∂2 u = µl 2 ∂x ∂z (6.4) where µl is the dynamic viscosity of the liquid. Integrating (6.4) twice with respect to z and using the no-slip boundary conditions at z = 0 and z = d yields u = px 2 µl (z 2 − dz ) (6.5) The vertical velocity component, w is found by substituting (6.5) into the incompressibility condition ∂u ∂w + = 0 ∂x ∂z and using the no penetration boundary condition at z = d, to give w = − pxx 2 µl ( z3 3 − dz2 d3 + 2 6 ) (6.6) At the surface of the pillars the downward averaged velocity is w = − d3 pxx 12 µl (6.7) If we consider the pillars to be a porous medium we can use Darcy’s Law to relate the vertical fluid velocity to the pressure difference across the liquid-air interface. Darcy’s law describes the flow of a liquid through a porous medium. The law relates the volume flux per unit area (having units of velocity), q to an applied pressure gradient. Hence, q = − k Πx h µg 20 (6.8) where k is the effective permeability between the pillars and µg is the dynamic viscosity of the air. The downward velocity at the surface of the pillars can be related to the rate at which h is evolving and the pillar-free fraction, φ by the equation w = φ ∂h ∂t (6.9) If w is negative the fluid is moving downward and the height of the liquid above the pillar base, h is decreasing with time. Whereas, if w is positive the fluid is moving upward and h is increasing with time. The continuity equation for h requires that φ ht = −qx . Hence, we have (6.10) w = φ ht = − qx Substituting (6.8) into (6.10) yields − ( ) k ∂ d3 pxx = φ ht = hΠx 12 µl µg ∂x (6.11) We have the boundary conditions for the liquid pressure and the air pressure Πx = 0 ; p = p0 + ∆p ; Π = p = p0 ; at x = l at x = 0 (6.12) (6.13) where ∆p is the water pressure difference across the length of the channel. Integrating (6.11) once with respect to x leads to − d3 k px = hΠx + C1 12 µl µg (6.14) where C1 is an integration constant. Via the boundary condition for the derivative of the air pressure at x = 0, (6.14) reduces to − d3 px = C1 12 µl 21 Hence, we get C1 = A d3 ∆p 12 µl l where A is an O(1) constant. (6.14) becomes − k d3 ∆p d3 px = hΠx + A 12 µl µg 12 µl l (6.15) and dividing both sides by d3 /(12 µl) gives −px = M h Πx + A ∆p l (6.16) where M = (12 k µl)/(d3µg ). Rewriting (6.2) in terms of p and differentiating with respect to x leads to p x = Πx + γ fx D (6.17) Substituting this expression into (6.16) and rearranging in terms of Πx yields Πx ( = − A∆p l + γ D fx ) (6.18) 1 + Mh Substituting (6.18) into (6.11) gives φ ht = − k ∂ µg ∂x γ ( 1 +hM h [ A∆p + f (h)h ]) l D ′ x (6.19) which is a nonlinear Richards type equation in h. Richards type equations are typically used to describe water movement in unsaturated soils[12]. Via (6.18) and boundary condition (6.12) we get γ A∆p = − f ′ (h)hx l D 22 at x = 0 (6.20) Now rearranging in (6.17) in terms of Πx yields Πx = p x − γ fx D (6.21) and putting this in (6.16) leads to px = −A γ Mh ∆p 1 + fx l 1 + Mh D 1 + Mh (6.22) Integrating (6.22) over 0 to l gives Zl 0 ∆p px dx = −A l Zl 0 γ dx + 1 + Mh D Zl 0 Mh fx dx 1 + Mh and since M h ∼ 0.1 << 1, the above expression reduces to Zl 0 ∆p px dx = −A l Zl dx 0 Via the boundary conditions for the liquid pressure, we get A = 1. Hence, (6.20) becomes ∆p γ = − f ′ (h)hx l D at x = 0 (6.23) Summary of dimensional problem The full dimensional problem for the average liquid height may be summarised as follows. φ k ∂ ∂h = − ∂t µg ∂x ( h [ ∆pl + Dγ f (h)h ]) ′ γ ∆p = − f ′ (h)hx l D h = h0 at x = l 23 at x = 0 x (6.24) (6.25) (6.26) Non-dimensionalisation We define the dimensionless variables ^; h = Dh x = l^ x; t = φ D µg l2 ^t kγ and substituting them into (6.24) leads to ^ ^ ∂h ∂ ^ ^ ∂h ) = − h α + f ′ (h) ∂^ x ∂^ x ∂^t ( [ ]) (6.27) where α = (∆p D)/γ is a dimensionless parameter. Boundary conditions (6.25) and (6.26) become ^ ^ ∂h = α −f ′ (h) ∂^ x ^ = h ^0 h at x ^=0 at x ^=1 (6.28) (6.29) ^ 0 = h0 /D. where h Steady state solution ^ the time derivative term in Assuming there is a steady state solution for the liquid height, h, (6.27) vanishes to give ^ d ^ ^ dh h α + f ′ (h) d^ x d^ x ( [ ]) = 0 (6.30) Integrating both sides of the above equation with respect to x ^ leads to ^ h ( α + f (h)^ dd^hx^ ) = C ′ 2 (6.31) where C2 is an integration constant. Via boundary condition (6.28) we get that C2 equals zero 24 to give ^ h ( α + f (h)^ dd^hx^ ) = 0 ′ (6.32) which implies that ^ = 0; h or ^ ^ dh = 0 α + f ′ (h) d^ x (6.33) If follows that the solution to (6.33) is f = −α x ^ + C3 (6.34) where C3 is an integration constant. Using (6.29) we get that ^ = f(h^0) = 0 f(h) at x ^=1 which gives the following two steady state solutions ^ = 0 h f(^ x) = α (1 − x ^) (6.35) (6.36) The first of the steady state solutions simply states that the water-air interface becomes pinned to the top of the pillars and that the interface is horizontal. The second of the steady state solutions is a linear, negative relationship between the function f and the dimensionless length along the microchannel, x ^. From (6.2) we have that f(h) ∼ p − Π. Hence, solution (6.36) suggests as x ^ decreases, f increases and hence, p − Π increases. This increase in the function f is associated with the interval from T1 to T2 in figure (10). Thus, solution (6.36) also only explains a steady state scenario where the water-air interface is pinned to the top of the pillar. In addition, while the interface is pinned the curvature of the interface, changes sign. In other words, the interface goes from being concave to convex. Moreover, solution (6.36) does not provide any information about f as a function of h and how the interface evolves as it moves down between the pillars. To summarise, in the case of α < β (were β is the max value of f(h) and the pressure difference across the water-air interface), the steady state solutions only describe cases where the 25 interface is pinned to the top of the pillar. If ∆p, the pressure difference across the length of the microchannel becomes sufficiently large, then α > β and the interface will move down in between the pillars. A full time dependent solution of (6.27) is necessary to characterise the interface evolution between the pillars. 7 Conclusions and future work This report outlines a mathematical model used to describe an experimental setup presented by the Stokes Institute. In particular we are interested in modelling the interaction between a microchannel whose surfaces are superhydrophobic and deionised water flowing through the microchannel subject to a pressure gradient. In section (2) we reviewed the various possible wetting states associated with superhydrophobic surfaces. Section (3) summarises the time scales involved in the process. The key result in this section being that it is possible for air to get trapped amongst the microchannel pillars underneath the advancing water front. In section (4) classical asymptotic results for the shape of liquid meniscus in contact with the curved surface of a cylinder are used to estimate the distance the water-air interface will drop between two pillars. Simple two-dimensional, geometrical arguments in section (5) are used to establish a criteria for which the advancing water front will go down between the pillars. In the final section we considered a pressure driven Poiseuille flow over the top of the pillars. Lubrication theory provided the downward velocity of the water at the top of the pillars. This result along with Darcy’s Law was used to derive a nonlinear Richards type equation for the average water-air interface height. Steady state solutions were derived and the validity of these solutions was discussed. Future work will include a more thorough derivation of the function f(h) which is proportional to the pressure difference across the water-air interface, p − Π. f(h) may be approximated via hyperbolic functions of the form (6.3). More careful analysis and collaboration with the Stokes Institute is required to determine the exact nature of the function f(h). A full time dependent solution to equation (6.27) is also desirable as the steady state solutions do not provide a sufficient description of the water-air interface progression between the pillars. Finally, this report does not account for the effect of the scallops on the surface of the pillars in determining the water-air contact line. Any future model must address the interaction between the scallops and the water-air interface. Acknowledgments We acknowledge the support of the Mathematics Applications Consortium for Science and Industry (www.macsi.ul.ie) funded by the Science Foundation Ireland mathematics initiative grant 06/MI/005. 26 Nomenclature (x, y, z) u = (u, v, w) (p, Π) κ h φ t q (b, l, d) (R, h0) D U Lc γ θ (µl , µg) k ∆p α β ^0 h Cartesian coordinates vector Velocity vector (Water/air) pressure Curvature Average height of liquid above pillars Pillar free fraction Time Height flux Microchannel (breadth/length/height) Pillar (radius/height) Gap between pillars (centre to centre) Water velocity Capillary length of water Surface tension of water Contact angle (Water/air) dynamic viscosity Permeability Water pressure difference across the array (∆p D)/γ Maximum f(h) value h0 /D 27 (10−2 /10−2 /8 × 10−5 ) (7.5 × 10−6 /2.5 × 10−5 ) 2 × 10−5 0.012 2.73 × 10−3 0.073 (8.9 × 10−4 /1.8 × 10−5 ) 4 × 10−12 100 100 0.0274 1.25 m ms−1 Nm−2 m−1 m ND s ms−1 m m m ms−1 m Nm−1 ND kg m−1 s−1 m2 Pa Pa ND ND Bibliography [1] J .Zhang and D.Y. Kwok. Contact line and contact angle dynamics in superhydrophobic channels Langmuir, 22(11):4998-5004, 2006. [2] A. Dupuis and Y.M. Yeomans, Modeling droplets on superhydrophobic surfaces: equilibrium states and transitions. Langmuir, 21(6):2624-2629, 2005. [3] C. Dorrer and Ruḧe. Some thoughts on superhydrophobic wetting. Soft Matter, 5(1):51-61, 2008. [4] R. Enright, T. Dalton, T.N. Krupenkin et al. Effects of interfacial position on drag reduction in a superhydrophobic microchannel. 6th Int. Conf. Nanochannels, Microchannels, and Minichannels, June 23-25:835-845, 2008. [5] T. Young, An essay on the cohesion of fluids, Philos. T. Roy. Soc. Lon., 95:65-87, 1805. [6] S. Wang and L. Jiang, Definition of superhydrophobic states. Adv. Mater., 19:3423-3424, 2007. [7] R.N.Wenzel. Resistance of solid surfaces to wetting by water. Ind. Chem. Eng., 28(8):988995, 1936. [8] A.B.D. Cassie and S. Baxter. Wettability of porous surfaces. Trans. Faraday Soc., 40:546551, 1944. [9] S. Moulinet and D. Bartolo, Life and death of a fakir droplet: Impalement transitions on superhydrophobic surfaces. Eur. Phys. J. E, 24(3):251-260, 2007. [10] D. James. The meniscus on the outside of a small circular cylinder. J. Fluid Mech., 63(4):657-664, 1974. [11] L.L. Lo. The meniscus on a needle - a lesson in matching. J. Fluid Mech., 132:65-78, 1983. [12] L.A. Richards. Capillary conduction of liquids through porous mediums. Physics 1, 318:318-333, 1931. 28 The Effect of Mechanical Loading on the Frequency of an Oscillator Circuit Report Contributors: Jonathan A. Ward1,2, Vladimir Lapin1 and William Lee1 Study Group Contributors: Chris Budd3, Mark Cooker4 , Paul J. Dellar 5, Martin Hayes6 , Poul Hjorth7 , Olga Korostynska8 and Arno Mayrhofer1 Industry Representative: Sandra Healy9 1 MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland Report coordinator, jonathan.ward@ul.ie 3 School of Mathematical Sciences, University of Bath, United Kingdom 4 School of Mathematics, University of East Anglia, United Kingdom 5 OCIAM, Mathematical Institute, University of Oxford, United Kingdom 6 Department of Electronic and Computer Engineering, University of Limerick, Ireland 7 Department of Mathematics, Technical University of Denmark, Denmark 8 Microelectronic and Semiconductor Research Centre, University of Limerick, Ireland 9 Analog Devices, Raheen Industrial Estate, Raheen, Limerick, Ireland 2 Abstract We investigate the effect of mechanical strain on the frequency of an electronic oscillator embedded in an integrated circuit. This analysis is aimed at explaining a 1% inaccuracy in the oscillator frequency under test conditions prescribed by a leading supplier of semi-conductor devices. During the test the package containing the oscillator was clamped to a circuit board by mechanical pressure. By considering the nature of the oscillator we show that tensile strains of the order of 10−4 could explain the observations via the piezoresistance effect. Both a simple one-dimensional analysis based on the beam equation and an elastic finite element simulation show that strains of this magnitude can be generated during the test. 30 1 Introduction Silicon chips, or Integrated Circuits (ICs), perform the underlying functions of most modern electronic devices. These miniature circuit boards consist of millions of electronic devices such as transistors, capacitors, resistors and diodes. During manufacturing, the components in ICs are built up and connected together in layers on the surface of a silicon substrate, known as a die, via etching, deposition and photo-lithography [1]. The die are then moulded into a plastic case, which we refer to as a package. Chips are typically tested before being sold and may be moved mechanically between a variety of test sites. During testing, a downward force of the order of 100 N is exerted on the chip to ensure good electrical contact. A particular chip manufactured by a leading semi-conductor supplier (who we refer to simply as “the supplier”), contains an oscillator circuit whose frequency must reach a ±1% accuracy specification over a temperature range of −40 to +105◦ C and 100% of the chips are tested. However, the supplier observed that the loads experienced by the chip could lead to significant errors in the accuracy of the oscillator circuit, possibly due to some sort of piezo effect. In this paper we investigate how a vertical load placed on such a chip might result in errors in the oscillator frequency. This problem was presented at the 70th European Study Group with Industry (ESGI), hosted by MACSI at the University of Limerick in 2009, funded by the Science Foundation Ireland mathematics initiative grant. In Section 2 we give a detailed description of the problem including the structure of the package, the test-rig set-up, data concerning the frequency response to loading and how the chips are “trimmed” to the required frequency. We describe in Section 3 how the key components in the oscillator circuit, namely capacitors and resistors, might affect the oscillator frequency when subjected to a vertical load. In Section 4 we discuss mechanical modelling of the elastic response of the package to loading using the beam equation and finite element elasticity calculations. Finally we summarise our results in Section 5. 2 2.1 Detailed Problem Description and Data Package Design Due to commercial sensitivity, details of the specific package and chip design cannot be presented in this paper. A sketch of the package is illustrated in Figure 1(a). There are two die in each package, one of which is smaller and sits on top of the larger (see Figure 1(b)). These are placed on a copper base known as the lead frame. The three layers are aligned at their centres and attached using epoxy adhesives. There are 8 contacts along each side of the base of the package that are connected to the die via small wires. The die, lead frame, contacts and connections are moulded into a plastic case. 31 (c) (a) 6mm 6mm 1111111111111111111 0000000000000000000 0000000000000000000 1111111111111111111 0000000000000000000 1111111111111111111 0000000000000000000 1111111111111111111 0000000000000000000 1111111111111111111 0000000000000000000 1111111111111111111 0000000000000000000 1111111111111111111 0000000000000000000 1111111111111111111 0000000000000000000 1111111111111111111 0000000000000000000 1111111111111111111 0000000000000000000 1111111111111111111 0000000000000000000 1111111111111111111 0000000000000000000 1111111111111111111 0000000000000000000 1111111111111111111 0000000000000000000 1111111111111111111 0000000000000000000 1111111111111111111 0000000000000000000 1111111111111111111 0000000000000000000 1111111111111111111 0000000000000000000 1111111111111111111 Bottom Die (b) Oscillator circuit Contact Lead Frame (d) Plastic Case Top Die 11111111111111111111111111111111111111 00000000000000000000000000000000000000 00000000000000000000000000000000000000 11111111111111111111111111111111111111 00000000000000000000000000000000000000 11111111111111111111111111111111111111 00000000000000000000000000000000000000 11111111111111111111111111111111111111 00000000000000000000000000000000000000 11111111111111111111111111111111111111 00000000000000000000000000000000000000 11111111111111111111111111111111111111 00000000000000000000000000000000000000 11111111111111111111111111111111111111 00000000000000000000000000000000000000 11111111111111111111111111111111111111 00000000000000000000000000000000000000 11111111111111111111111111111111111111 00000000000000000000000000000000000000 11111111111111111111111111111111111111 Contact Glue Test-Rig Base 1mm Socket Lead Frame Contact Figure 1: (a) Sketch of the base of the package (not to scale) illustrating the lead frame and contact positions. The relative positions inside the package of the top and bottom die (dashed boxes) and the oscillator circuit (shaded grey box) are indicated. (b) Cross-section of the package (larger scale than (a)). Again the positions of the lead frame, contacts and the top and bottom die are marked. The epoxy glues are also labelled. (c) Photo of the base of the test-rig in which the package is placed during testing. (d) Magnification of the socket in which the package is placed and the electrical contact pins on which it sits. 2.2 Test Rig Set-Up During testing, each package is transported between test sites (see Figure 1(c)) using a mechanical test rig arm. This device places a small cylindrical nozzle on top of each package and then applies a suction force to attach the package to the arm before it is relocated. In each test, an individual package is placed in a plastic well on the test circuit board, where it rests on top of a set of copper pins that connect with the contacts on the base of the package (see Figure 1(d)). There are three pins positioned slightly off centre that contact the lead frame and one for each of the contacts around the edge of the package. The test rig arm exerts a vertical force on the package to ensure good electric contact, but the magnitude of this force is not known accurately and varies between different test rigs. 2.3 Problems with Frequency Response Due to Loading As stated, the oscillator circuit in the chip must oscillate at a frequency of 128 kHz with a ±1% tolerance. However the supplier found that this error could not be reduced below 1.5%, which they suspected was due to the loads placed on the chip. This was confirmed using a hand test rig consisting of the standard test rig base and a locking top that applies a vertical 32 % Frequency Change % Frequency Change vs. Applied Mass Force (kg equivalent) Figure 2: Data recorded by Analog Devices relating percentage change in frequency to applied mass (i.e. the load placed on top of the package). A decreasing linear response is observed. Note that a load of the order of 100 N (i.e. 10 kg) causes approximately a −0.3% error. Thus the testing process alone wastes a significant amount of the specified tolerance. force to the chip via a hand operated screw mechanism. Although variations in frequency were observed during the hand test, the amount of force applied during the test was not quantifiable in any way. Furthermore, the hand test rig is not used during the actual testing of the chips, so is only an indication that loading of the chip affects the oscillator frequency. To quantify the effect caused by loading the package, the supplier collected a small amount of preliminary quantitative data, illustrated in Figure 2, by applying a range of vertical loads to a given package. The results of this experiment suggest a linear relationship between the load applied and the frequency response of the oscillator circuit, although it is stressed that the data is only indicative. Note that these experiments indicate that for a load of the order of 100 N (i.e. 10 kg), a −0.3% error is observed. Thus a significant amount of the specified tolerance is wasted on the testing process alone, which must be compensated for by higher manufacturing precision. 2.4 Circuit Trimming Banks of components (including resistors and capacitors) are used to tune the oscillator to the required frequency over the given temperature range. This process is known as trimming, and in the traditional sense would mean that particular components were removed permanently from the active circuit. In this instance, a flash memory stores the identities of the trimmed components. 33 3 Piezo Effects in the Oscillator Circuit We now describe how the response of the key components in the oscillator circuit might be affected by mechanical loading. An oscillator circuit provides a continuous, synchronised trigger signal to the rest of the devices on the chip from which information is processed [2]. Due to commercial sensitivity (and its shear complexity), the specific design of the oscillator circuit in question was not provided. However, there are two main types of electronic oscillators, RC circuits (composed of Resistors and C apacitors) and LC circuits (composed of inductors and capacitors). Inductors are not common in ICs, hence we focus on the first type of oscillator. The oscillation frequency of an RC circuit is f = 1/(2πRC) [1], where R and C are the effective resistance and capacitance of the banks of resistors and capacitors on the chip respectively. Typical values of resistance and capacitance are R = 10kΩ and C = 100pF. A small fractional increase, δ, in the product of the resistance and capacitance results in a fractional decrease in frequency, f, of the same amount, f= 1 1 ≈ (1 − δ). 2πRC(1 + δ) 2πRC (3.1) Thus a 1% drop in frequency requires a 1% increase in capacitance or resistance (i.e. δ = 0.01). We now consider whether the kind of mechanical load experienced by the chip can give rise to such changes in the capacitance or resistance. 3.1 Capacitor Response Capacitors consist of two parallel conducting plates with a dielectric material sandwiched between. The plane of the conducting plates is perpendicular to the applied load in this case. The value of capacitance is given by εAC , (3.2) C= d where ε is the permittivity of the dielectric, AC is the area of the plates and d is the distance between them. Thus the capacitance increases in response to a decrease in distance between plates, which might be expected when subjected to a vertical load as in this scenario. However, the dielectric material is typically silicon dioxide, which is a very stiff material. Given a 100N force placed on a package approximately 6mm×6mm, the resulting pressure P is 4 MPa. The Young’s modulus E of silicon dioxide is approximately 100 GPa, thus considering Hooke’s law, the resulting strain (and hence the change in distance between plates) is P ≃ 3 × 10−5 , E which is much smaller then the 1.5% change required. 34 (3.3) Thus the change in capacitance due to the plates being forced closer together cannot be responsible for the observed 1.5% change in frequency. The deformation of the dielectric does not change its permittivity either, since silicon dioxide does exhibit any piezoelectric effects [3]. 3.2 Resistor Response Piezoresistance is a well known effect that occurs when resistors are subject to mechanical deformation [4]. The resistance of a conducting element with a fixed cross section of area AR and length l is given by l R0 = ρ0 , (3.4) AR where ρ0 is resistivity. If the shape of the resistor is changed, the subsequent change in resistance is related to the strain ǫ = ∆l/l via ∆R = Gǫ, (3.5) R0 where G is known as the gauge factor, G := (1 + 2ν) + (∆ρ/ρ0) , ǫ (3.6) and ν is the Poisson’s ratio of the resistor material, which accounts for the change in crosssectional area due to the change in length. It is known that the resistivity change (∆ρ/ρ0 )/ǫ for semiconductors is much larger than the dimensional change (1 + 2ν) [5]. In fact gauge factors as large as 100 have been measured for p-type silicon and as low as −100 in n-type silicon [4, 6]. Thus to observe a 1% decrease in frequency due to piezoresistance, we would need strains of the order of 10−4 . 4 Mechanical Modelling To calculate the strains induced by mechanical loading, we first consider a simple method in which the package is modelled using a one-dimensional beam equation. This approach neglects inhomogeneities in the package and corner effects that arise in the plane perpendicular to the loading due to the stacking of the die. Thus we develop a more sophisticated two-dimensional axisymmetric model of the composite package that we solve using finite element software in COMSOL, from which we obtain the strain field in a cross-section of the package. 4.1 One-Dimensional Model A simple approach to calculating approximate values of the strain in the package is to model the entire assembly using the one-dimensional beam equation [7]. Thus we consider the package 35 Figure 3: Geometry in which the package is modelled as an elastic beam. The pins are modelled as Hookean springs, while the die is modelled as two localised forces. to be elastically homogeneous and made entirely of silicon. The geometry used in this case is shown in Figure 3. The width of the package is 2A and the load is represented by two localised forces F imposed at x = ±B. The pins are modelled as Hookean springs, positioned at the edges of the package (x = ±A) and the centre (x = 0) with spring constants k1 and k2 respectively. The spring constant of the outer pins is chosen to be five times larger than at the centre to reflect that there are more pins around the edge of the package. However, the spring constants are not known accurately,thus we consider two cases: (a), stiff springs, k1 = 500 MN/m and k2 = 100 MN/m ; and (b), flexible springs, k1 = 50 MN/m and k2 = 10 MN/m . As noted, we model the vertical displacement u(x) of the package using the one-dimensional beam equation with point forcing, d4 u EI 4 = F [δ(x + B) + δ(x − B)] − k2 δ(x) u(x), dx (4.1) where E is the Young’s modulus of the package and I is the second moment of area. The boundary conditions at the edges of the package are d3 u EI 3 = −k1 u(±A) and (4.2) dx x=±A d2 u = 0. (4.3) dx2 x=±A Note that at each discontinuity, the zeroth, first and second derivatives must be equal. These equations form a linear system whose solution is a piecewise cubic polynomial in x. The polynomial coefficients in each of the four regions along the beam can be easily calculated using computer algebra, however they are somewhat complicated and hence we do not reproduce them here. The parameters used in the calculation are given in Table 1. The displacement along the beam u(x) is illustrated in Figure 4(a) for the stiff springs case and in Figure 4(b) for the case flexible springs case. These illustrate that the stiffness of the pins can affect the overall shape of the package: stiffer pins result in a “W”–shape, whereas flexible pins give rise to a concave shape. 36 Parameter A B h E F Value 3 mm 2 mm 1 mm 150 GPa 50 N Table 1: Values of parameters used in the calculation. In addition, two different types of spring constant are considered: stiff springs, k1 = 500 MN/m and k2 = 100 MN/m ; and flexible springs, k1 = 50 MN/m and k2 = 10 MN/m . Figure 4: Displacement calculated from equations 4.1–4.3 for the cases (a) (stiff springs) and (b) (flexible springs). Given the displacement and the height of the package h, we can calculate the tensile strain, ǫxx h d2 u , = 2 dx2 (4.4) at the base of the package. This is plotted in Figure 5(a) and (b) for the stiff and flexible pin cases respectively. In both instances, the magnitude of the strains are large enough to account for the observed frequency variations due to piezoresistive effects. The stiffer springs used in Figure 4(a) result in a region of compression at the centre of the package. Note that in both cases, the largest strains occur in regions of extension (positive strain) where the downward load is applied. The strain is zero at the edges due to the boundary conditions (4.3). 4.2 Axisymmetric Finite Element Model Although the simple approach using the beam equation shows some promising results, it is based on a very restrictive set of assumptions, in particular that the package is homogeneous and can be represented as thin rod. A linear elasticity model of the composite package can be 37 Figure 5: Tensile strain at the base of the package for cases (a) (stiff springs) and (b) (flexible springs). computed numerically using finite element methods. To facilitate this, we used the Structural Mechanics Module of COMSOL 3.4 software package [8]. This includes a stationary linear solver for the equilibrium equation from linear elasticity theory, σij,j = 0 where σij,j is the derivative with respect to j of the ij component of the stress tensor. This is supplemented with the linear stress-strain and strain-displacement relationships, σij = λǫkk δij + 2µǫij , ǫij = 1/2 (ui,j + uj,i ) , respectively where ǫ is the strain tensor, u is the displacement and λ and µ are Lamé’s parameters, which can be expressed in terms of the Poisson’s ratio and Young’s modulus. We used 2nd-order Lagrange finite elements with ideal constraints. The simulation set-up is illustrated in Figure 6. We consider an axisymmetric model in cylindrical coordinates to reduce the complexity of the simulation. Consequently the model of the package is disc shaped. In this geometry, we model the full composite structure of the package including the copper lead frame and contacts, the silicon die, the epoxy glues and the plastic casing. Material properties of each of the component parts are listed in Table 2. The load is applied over a 1mm wide annular region on top of the package. The upward forces exerted by the pins are positioned under the contact on the outside edge of the bottom of the package and at the centre under the lead frame. We distribute the downward load between the inner and outer pins with a 1:5 ratio respectively to reflect the fact that there are more pins around the outside of the package. This is an approximation since the pins are not completely stiff and hence may distribute the load differently. The point at the base of the edge of the package is held fixed. In Figure 7, we illustrate the deformation under loading of a cross-section of the package. Note that the scale of the displacements have been increased to accentuate the change in shape. We 38 z P (g) (e) (a) (f) (b) (c) (d) F0 FL r Figure 6: Cross-section of the numerical simulation set-up for the linear elasticity calculation in axisymmetric coordinates. The composite package is modelled, including: (a) top die, (b) bottom die, (c) lead frame, (d) contact, (e) and (f) epoxy glues, (g) plastic case. The applied load is labelled P and the reaction forces from the pins on the contacts and lead frame are labelled FL and FL respectively. These forces are distributed over the coloured regions. observe that the maximum displacement (shaded in dark red) occurs where the downward force is applied and, due to the boundary conditions, there is no displacement at the bottom of the outer edge (shaded dark blue). The centre of the package (r = 0) is also displaced vertically since there is less support from the pins. Thus a slice through the full diameter of the package would resemble Figure 4(a), i.e a W–shape. Furthermore, if we reduce the load supported at the centre of the package, we find that the deformed package becomes concave, as in Figure 4(b). The ǫrr strain field in the lead frame, top and bottom die are illustrated in Figure 8. We have omitted the strain field in the rest of the package since the circuitry lies in a thin layer on the surfaces of the two die, i.e. in the (r, θ) plane. Furthermore, the plastic casing and epoxy glues are much softer than the die and lead frame and consequently the magnitudes of their strains are much larger. Note that the strains on the top surfaces of the die where the electrical components are placed are much less on the top die than on the bottom. This accounts for the fact that the problems with the accuracy of the frequency of the oscillator circuit came to light after it was moved from the top die to the bottom die. 39 Young’s Modulus E, GPa Lead frame (Cu) 127 Contacts (Cu) 127 Die (Si) 150 Epoxy Glue - bottom 3.1 Epoxy Glue - top 0.3 Plastic case 0.2 Table 2: Material properties used in linear elasticity simulation. nm z r Figure 7: Deformation of the package under loading. The black out line illustrates the original position and the shaded region shows the subsequent deformation. The colour correspond to the magnitude of the total displacement. These have been greatly enhanced in this picture to illustrate the qualitative behaviour of the package. 40 z r Figure 8: The coloured shading indicates the ǫrr strain field in the lead frame, top and bottom die. Note that the lead frame is stretched, however the bottom die is compressed. (i) ǫrr ǫθθ r (ii) r Figure 9: The strains ǫrr (a) and ǫθθ (b) along the top surface of the bottom die in the rdirection. 41 In Figures 9(a) and (b) we plot the ǫrr and ǫθθ strains respectively against r at the z–position corresponding to the top surface of the bottom die. The oscillator circuit lies in this plane, between the edge of the top die (r = 0.75) and the edge of the bottom die (r = 1.5). The magnitude of the strains there are large enough to account for the observed change in frequency due to the piezoresistive effect. Note that because of the W–shape of the package, there is a change in the sign of the strain. Furthermore, toward the outer edge of the package, the ǫθθ strain is less than the ǫrr strain. Thus a qualitative feature of our simulation results is that an appropriate choice of position and orientation of the components of the oscillator circuit on the bottom die would minimise the strain they experience. We stress however that computing the strains within the package with enough precision to achieve this would require extending the geometry to three dimensions in rectangular coordinates (which requires far greater computational power) as well as a detailed knowledge of the components themselves. 5 Conclusion and Recommendations This report investigates the possibility that inaccuracies in the frequency of an electronic oscillator measured under test conditions may be due to mechanical deformation of the chip containing the oscillator that occur during testing. We show that the most likely explanation is that the oscillator frequency is modified by the piezoresistive effect and estimate that the observed variations could be produced by tensile strains of the order of 10−4 . We carried out two elasticity calculations: a highly simplified calculation based on the beam equation and a second, considerably more sophisticated, treatment using finite element analysis. These simulations both demonstrated that strains the of the order of 10−4 in magnitude occur within the die, which explains the drift in oscillator frequency during test conditions. From our simulations, we observe that the strain is largest directly underneath the nozzle that pushes down on the package and that the strains are significantly lower in the top die. Thus one might consider relocating the oscillator circuit or changing the shape of the nozzle that applies the downward force in order to reduce the strain in the package. Another possible solution could involve tuning the method in which the oscillator circuit is trimmed such that components which undergo lower values of strain are chosen preferentially. Acknowledgements All contributors would like to thank Sandra Healy from Analog Devices for introducing the problem and assisting in answering questions during the entire week. We acknowledge the support of the Mathematics Applications Consortium for Science and Industry (www.macsi.ul.ie) funded by the Science Foundation Ireland mathematics initiative grant 06/MI/005. 42 Bibliography [1] A.R. Hambley. Electronics. Prentice-Hall, second edition, 2000. [2] Colin D. Simpson. Industrial Electronics. Prentice-Hall, first edition, 1996. [3] L.D. Landau, E.M. Liftshitz, and L.P. Pitaevskii. Electrodynamics of Continuous Media. Butterworth-Heinemann, second edition, 1984. [4] S. Middelhoek and S.A. Audet. Silicon Sensors. Academic Press Ltd., first edition, 1989. [5] Yozo Kanda. Piezoresistance effect of silicon. Sensor Actuat. A, 28:83–91, 1991. [6] Charles S. Smith. Piezoresistance effect in germanium and silicon. Phys. Rev., 94(1):42–49, 1954. [7] L.D. Landau and E.M. Liftshitz. Theory of Elasticity. Butterworth-Heinemann, third edition, 1986. [8] COMSOL AB. COMSOL Multiphysics Modeling Guide. COMSOL Ltd, 3.4 edition, 2007. 43 44 Designing a Green Roof for Ireland Report Contributors: Ian Hewitt1 , Andrew Lacey2, Niklas Mellgren 3, Michael Vynnycky4, Marguerite Robinson4,5 and Mark Cooker6 Study Group Contributors: Andrew Fowler4 , Maria Gonzalez4, Hilda Simboek4, Thomas Murphy 7, Michael Devereux4, Roman Sedakov4 , Grainne Kirby 4, Gemma Fay 1 and Qi Wang 8 Industry Representatives: Catherine Adley 7, Billy Kirwan 1 9 and John O’Hara 10 OCIAM, Mathematical Institute, University of Oxford, United Kingdom Department of Mathematics, University Heriot-Watt, United Kingdom 3 Department of Mechanics, KTH Royal Institute of Technology, Sweden 4 MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland 5 Report coordinator, marguerite.robinson@ul.ie 6 School of Mathematics, University of East Anglia, United Kingdom 7 Department of Chemical and Environmental Science, University of Limerick, Ireland 8 School of mathematical sciences, Dublin Institute of Technology, Ireland 9 GFM Systems, Waterway House, Crag Crescent, Clondalkin Ind. Estate, Dublin 22, Ireland 10 Landtech Soils Ltd, 20 Kenyon Street, Nenagh, Co. Tipperary, Ireland 2 Abstract A model is presented for the gravity-driven flow of rainwater descending through the soil layer of a green roof, treated as a porous medium on a flat permeable surface representing an efficient drainage layer. A fully saturated zone is shown to occur. It is typically a thin layer, relative to the total soil thickness, and lies at the bottom of the soil layer. This provides a bottom boundary condition for the partially saturated upper zone. It is shown that after the onset of rainfall, well-defined fronts of water can descend through the soil layer. Also the rainwater flow is relatively quick compared with the moisture uptake by the roots of the plants in the roof. In a separate model the exchanges of water are described between the (smaller-scale) porous granules of soil, the roots and the rainwater in the inter-granule pores. 46 1 Introduction Green roofs are becoming increasingly popular around the world. The many benefits of a green roof include assistance in the management of storm water, pollution control, building insulation, recycling of carbon dioxide, in addition to being aesthetically pleasing. A green roof is subject to various stresses from the weather, in particular wind-loading, which we ignore in this report, and rainfall: it is the flow, drainage and uptake of rainwater that we model. An understanding of where the water goes is essential to design a roof able to achieve sustained healthy plants and loads that lie within the safe capacity of the supporting structure. The main focus of this report is on the transport of water through the green roof structure. Inadequate drainage can lead to the undesirable occurrence of a fully saturated soil which will cut off the air supply to the plants. Conversely, if the saturation levels are too low plants will die from lack of water. Ideally a degree of saturation that is less than eighty per cent should be maintained at all times. Our goal is to model the distribution of the degree of saturation through the depth of the soil layer, and to see how it changes due to spells of rain, and under the influence of moisture-uptake by plant roots. The basic structure of a common green roof is shown in Fig. 1. A waterproof root barrier protects the underlying roof structure. A drainage layer sits atop this barrier. The typical thickness of this layer is 8/15/20 mm depending on the type of roof. The soil and drainage layers are separated by a thin sheet of perforated hard plastic containing holes approximately 2 mm in diameter and spaced 2 cm apart. There are two layers of soil at the top of the structure separated by a layer of felt. A thin layer (< 2 cm) of refined rooting soil contains the plant life, mainly sedum for thinner roofs and, for thicker ones, low growing grasses such as common bent grass and/or other plants, such as cowslip and ladies bedstraw. Beneath the rooting soil are pellets of lightweight expanded clay. This layer is 5-10 cm thick. Grain sizes are typically < 2 mm for rooting soil and 4-8 mm for expanded clay pellets. 2 The Model We model the dynamics of water flow through the soil layer. We consider a single soil layer with thickness L ≈ 10−1 m and we ignore the presence of the felt layer. We assume that the soil-drainage-layer interface is located at z = 0 and the soil surface at z = L. We consider two possible scenarios: (i ) the entire region 0 ≤ z ≤ L is unsaturated, so that the soil saturation S is always less than 1 and (ii ) a saturated region forms at the bottom of the soil layer for 0 ≤ z ≤ h. Note that the model as presented here is one-dimensional, and represents a horizontal roof, but can be easily extended to two (or three) dimensions, and to account for sloping roofs. 47 Figure 1: Green roof structure. 2.1 The Unsaturated Region We first assume the entire region 0 ≤ z ≤ L is unsaturated (S < 1). The basic model for this region follows that outlined in [1] and [2]. The one-dimensional Richards’ equation for water flow in the unsaturated soil is ∂S ∂ ∂S D0 D(S) = + K0 K(S) − R, (2.1) φ ∂t ∂z ∂z where S = S(z, t), φ is the constant porosity of the soil, taken here to be 0.25, D0 D(S) and K0 K(S) are the water diffusivity and hydraulic conductivity respectively, with the functions D(S) and K(S) given by K(S) = S1/2 [1 − (1 − S1/m )m ]2 , (2.2) D(S) = [1 − (1 − S1/m )m ]2 , S1/m−1/2 (1 − S1/m )m (2.3) where 0 < m < 1. The value of m for the expanded-clay soil was not known but, for later use in simulations and analysis of the model, was taken to be m = 12 . Likewise the values of the constants K0 , the conductivity for saturated soil, and D0, a representative value of diffusivity, were not known. The water flux in the z direction is q = −(D0 D(S)Sz +K0 K(S)). Water uptake by the plant roots is incorporated into the model through the last term in (2.1) and is given by R = 2πakr ld (pa − pc f(S) − pr ) , (2.4) where kr is the radial conductivity of water, a is the root radius, ld is the root length density, pa is atmospheric pressure, pr is an effective pressure in the roots (although it can be negative), pc f(S) is the capillary pressure in the soil, with pc another constant characterising the partly saturated pellets, and 1 f(S) = (S− m − 1)1−m . (2.5) 48 We take parameter values from Roose and Fowler [2] and let 2πakr = 7.85 × 10−16 m2 s−1 Pa−1 , ld = 5 × 103 m−2 and pc = 104 N m−2 . The root pressure pr will be determined from conservation of water within the root. Finally we must prescribe boundary conditions at the top and bottom of the soil layer. At the soil surface we take D0 D(S) ∂S + K0 K(S) = Qin (t) at z = L, ∂z (2.6) where Qin is the rainfall rate averaged over the surface area of the ground. We assume, in this unsaturated case, no outflow at the base of the soil layer and set D0 D(S) ∂S + K0 K(S) = 0 at z = 0. ∂z (2.7) We nondimensionalise the equations by scaling z = L^ z, pr = |P|p^r , t= L ^t, K0 p = pa + pc p ^, ^ R = 2πakr ld |P|R, ^ Qin = Qtyp Q, (2.8) where P is the (negative) root pressure at the soil surface and we set |P| = 106 N m−2 . The time scale used here is that for flow though the soil layer under the action of gravity. The dimensionless Richards’ equation (2.1) then has the form ∂S ∂S ∂ φ δD(S) + K(S) − η(θ − εf(S) − p ^ r ), (2.9) = ∂^ z ∂^ z ∂^t where δ= D0 ≈ 10−4 , LK0 η= 2πakr ld |P|L ≈ 4 × 10−6 , K0 θ= pa ≈ 10−1 , |P| ε= pc ≈ 10−2 . (2.10) |P| Roose and Fowler [2] give values of D0 for different soil types and we can reasonably take D0 = 10−6 m2 s−1 . However, the value of K0 is more difficult to determine as it varies significantly with different soil types. The parameter values in (2.10) are given for K0 = 10−1 m s−1 . We note that η ≪ 1 suggesting that water uptake by the roots is negligible over the chosen timescale. The dimensionless forms of the boundary conditions are given by δD(S) ∂S ^ at z^ = 1, + K(S) = νQ ∂^ z (2.11) ∂S + K(S) = 0 at z^ = 0, ∂^ z (2.12) δD(S) where ν= Qtyp ≈ 3 × 10−6 , K0 (2.13) with Qtyp taken to be some typical rainfall. We set Qtyp = 3 × 10−7 m s−1 for a “wet day" in Ireland. 49 2.2 The Saturated Region When the soil becomes saturated we assume that a moving boundary forms at z = h(t) and the soil saturation is identically one for z ∈ [0, h]. Our governing equation can now be written in the (dimensionless) form ∂ 1 ∂^ p 1+ − η (θ + ε^ p−p ^ r ) = 0, (2.14) ∂^ z γ ∂^ z where ρgL ≈ 10−1 . (2.15) pc The flux through the membrane at z = 0 is prescribed to occur at a rate proportional to the pressure difference across it; Qmem = κ(p − pa ) dimensionally, where pa is the atmospheric pressure in the drainage layer beneath, p is the pressure at z = 0, and κ ≈ 10−5 m s−1 Pa−1 (determined experimentally in the next sub-section). This gives the dimensionless condition γ= 1+ p 1 ∂^ = α^ p at z^ = 0, γ ∂^ z c ^ ^t) ≡ where α = κp ≈ 1. At the saturation front z^ = h( K0 continuity of fluid flux requires K(S) + δD(S) ∂S 1 ∂^ p =1+ ∂^ z γ ∂^ z (2.16) h(t) , L p ^ = 0 (atmospheric), and ^ at z^ = h. (2.17) ^ along with Neglecting the η term in (2.14) for this saturated region, and using p ^ = 0 at z^ = h (2.16), gives ^ ^t) − z^) γ(h( p ^ (^ z, ^t) = , (2.18) ^ ^t) 1 + αγh( so that (2.17) becomes K(S) + δD(S) ^ αγh ∂S = ^ ∂^ z 1 + αγh ^ at z^ = h. (2.19) ^ ^t) determine h ^ in terms In principle equation (2.19) and boundary condition S = 1 at z^ = h( of the flux from the unsaturated region. However, we can simplify things if we notice from (2.11) that the dimensionless flux will in general be small, of order ν (due to the rainfall). If ^ required to satisfy (2.19) will be small. Physically, this is this is the case, then the value of h because for the typical size of fluid flux considered, the pressure required to force it through the membrane according to (2.16) is provided by the hydrostatic head of a very thin layer of water (dimensionally, h is calculated to be much less than 1 mm). Thus if a saturated region is created at the bottom of the soil layer, it will quickly grow to a depth which is sufficient to drain exactly the same amount of water through the membrane as 50 is arriving from the unsaturated region above. Provided this depth is substantially less than the depth of the soil, the saturated region can be ‘collapsed’ (mathematically) onto the line z^ = 0, and the boundary condition applied to the problem in the unsaturated zone for some of the numerical solutions of sub-section 2.5 is then ∂ ∂S K(S) + δD(S) = 0 at z^ = 0, 11 (2.20) ∂^ z ∂^ z which states that whatever flux arrives there from above is allowed through the membrane (the build-up of the small saturated layer which would physically enable this is therefore assumed to happen instantaneously), with the uptake of water by the roots being neglected. We apply (2.20) while K(S) + δD(S) ∂S > 0 at z^ = 0 since water is trying to exit, not enter, the soil layer. ∂^ z 2.3 Experimental measurement of κ The value of κ was deduced from a simple experiment, which involved puncturing a 2 mm diameter hole in a plastic bottle, made with material similar to what we believe the drainage membrane is made of. The rate of drainage through the hole driven by the hydraulic head in the bottle was measured, and used to determine the coefficient of proportionality between pressure difference across the membrane ∆p and the water flux through it q. Writing q = k∆p, (2.21) where ∆p = ρgh, the water depth in the bottle, h, satisfies the equation Abottle dh = −kρgh, dt (2.22) where Abottle is the cross-sectional area of the bottle. Thus log h = − kρg t. Abottle (2.23) Measurements of h against t made during the experiment are in Fig. 2, and the best fit value of the time constant tc = Abottle /kρg was 74 seconds. The flux through an individual hole can be converted into an average velocity through a membrane, using the area of the membrane Amembrane that is drained by each hole. Thus ū = κ∆p, κ= Abottle . Amembrane ρgtc (2.24) Taking Amembrane = π cm2 , and using the cross-sectional area of the bottle Abottle = 25 cm2 , ρ = 103 kg m−3 , and g = 10 m s−2 , gives κ ≈ 10−5 m s−1 Pa−1 . 11 More accurately the bottom condition might be specified in linear complementary form (1 − S)q = 0 with 1 − S ≥ 0 (for no super-saturation) and q ≤ 0 (for upward flux). We use (2.20) for its convenience in some of the numerical simulations, but for others we approximate the more correct linear complementary form, as in (2.33). 51 1 h (cm) 10 0 20 40 60 t (s) 80 100 Figure 2: Experimental measurements of h against t. 2.4 The Root Pressure To determine the root pressure pr in equation (2.9), we assume that the root extends through the full thickness of the soil layer of depth L. Conservation of water inside the root yields kz d2 pr + 2πakr (pa − pc f(S) − pr ) = 0, dz2 (2.25) where kz = 10−14 m6 s−1 N−1 is the root axial conductivity and f(S) is defined in equation (2.5). Zero axial flux at the root tip implies dpr + ρg = 0 at z = 0. dz (2.26) In addition we prescribe a driving pressure at the root base yielding pr = pa + P at z = L. (2.27) In dimensionless form the root pressure will satisfy d2 p ^r + τ (θ − εf(S) − p ^ r ) = 0, d^ z2 (2.28) d^ pr = −εγ at z^ = 0, d^ z (2.29) subject to 52 p ^ r = θ − 1 at z^ = 1, where τ= (2.30) 2πakr L2 ≈ 10−3 . kz 2 The parameters τ ≪ 1 and εγ ≪ 1 which implies dd^zp^2r ≈ 0 subject to root pressure is thus given by p ^ r = θ − 1. (2.31) d^ pr d^ z = 0 on z^ = 0. The (2.32) The complete model is now given by (2.9), with the definitions (2.2), (2.3), (2.5) and (2.32), with boundary condition (2.11) at z^ = 1 and (2.12) if S < 1, or (2.20) if S = 1, at z^ = 0. An initial condition is also needed. The diffusion term in (2.9) is small, so the equation is essentially a first order non-linear wave equation; the boundary condition (rainfall) is transmitted downwards as a wave. If rain starts suddenly, there is a shock front that propagates quickly down to the bottom of the soil; if it stops suddenly there is an expansion fan. 2.5 Numerical Solutions The governing equation for the unsaturated region (2.9) was solved subject to boundary conditions (2.11) and (2.12). As a first approach the η term in (2.9) is neglected so that we are just considering drainage of the soil layer under gravity. The initial saturation was taken to be uniform throughout the soil layer. Three different initial values of the saturation Sinit : 0.05, 0.1, 0.15 were considered. The profiles obtained for S in each case when the computation was stopped are shown in Fig. 3; a corresponding semilog plot is shown in Fig. 4, in order to demonstrate the boundary layer of thickness δ1/2 in S at z^ = 0 that is predicted by an asymptotic analysis, and which is captured by the numerical solution, but which is not visible in Fig. 3. For Sinit = 0.1 and 0.15, computations were stopped when the value of S at z^ = 0, Sbottom , reached 1; for Sinit = 0.05, Sbottom is still far from 1, even for the value of dimensionless time (100) shown here. The time evolution of Sbottom is shown in Fig. 5, while that for S at z^ = 1, Stop , is shown in Fig. 6. Thus, the results suggest an appreciable difference in the time at which complete saturation is achieved at the bottom of the soil when Sinit is increased from 0.05 to 0.1. The effect of the rainfall boundary condition (2.11) has (by the end of the simulations) only affected the tiny region at the right of Fig. 3, where there is the beginning of a shock front propagating downwards from z^ = 1; since δ has been taken to be very small, the shock looks very sharp, and the values on either side of it are the initial condition (below, or left, of the shock), and the ^ (above, or right, of the shock - this value is expectedly independent value given by K(S) = νQ of the initial condition, as shown in Fig. 6). 53 1 sinit=0.05 0.9 sinit=0.1 sinit=0.15 0.8 0.7 s 0.6 0.5 0.4 0.3 0.2 0.1 0 0 0.2 0.4 0.6 0.8 1 z^ Figure 3: S vs. z^ for three different initial conditions (Sinit = 0.05, 0.1, 0.15) at either dimensionless time 100 (Sinit = 0.05) or when S reaches 1 at z^ = 0 (Sinit = 0.1, 0.15). Parameter values are m = 1/2, δ = 10−4 , ν = 3 × 10−6 . 1 sinit=0.05 0.9 sinit=0.1 sinit=0.15 0.8 0.7 s 0.6 0.5 0.4 0.3 0.2 0.1 0 −10 10 −8 10 −6 −4 10 10 −2 10 0 10 z^ Figure 4: A semilog plot of S vs. z^ for three different initial conditions (Sinit = 0.05, 0.1, 0.15) at either dimensionless time 100 (Sinit = 0.05) or when S reaches 1 at z^ = 0 (Sinit = 0.1, 0.15). Parameter values are m = 1/2, δ = 10−4 , ν = 3 × 10−6 . 54 1 sinit=0.05 0.9 sinit=0.1 sinit=0.15 0.8 sbottom 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 0 20 40 60 80 100 Dimensionless time Figure 5: Sbottom vs. dimensionless time for three different initial conditions (Sinit = 0.05, 0.1, 0.15). Parameter values are m = 1/2, δ = 10−4 , ν = 3 × 10−6 . 0.16 sinit=0.05 sinit=0.1 0.14 sinit=0.15 stop 0.12 0.1 0.08 0.06 0.04 0 20 40 60 80 100 Dimensionless time Figure 6: Stop vs. dimensionless time for three different initial conditions (Sinit = 0.05, 0.1, 0.15). Parameter values are m = 1/2, δ = 10−4 , ν = 3 × 10−6 . 55 The complete problem, with a small saturated region allowed for by using boundary condition (2.20), and with η 6= 0, was also solved by discretising in space and solving with the method of lines using ode15s in Matlab. To apply the switch in boundary conditions smoothly, the condition q0 = q1 e−1000(1−S) , (2.33) was applied for the flux at the bottom node q0 in terms of the flux at the node above q1 ; thus when S is close to 1 this becomes ∂^ q/∂^ z = 0, and when S is less than 1 it becomes q0 = 0. The diffusion coefficient is infinite when S = 1, but this does not cause any issues in the numerics, possibly because the above boundary condition ensures S never quite reaches 1. This seems to allow for steady states when rainfall is constant; if there is more rainfall than is taken up by the roots, the saturation at the bottom is 1 and there is a Rboundary layer of width ^ − η L R (Fig. 7). If there is δ1/2 in which it adjusts to the value as determined by K(S) ≈ νQ 0 less rainfall than is taken up by the roots, the saturation at the bottom decreases almost to 0. ^ = 0.1 to Q ^ = 10, which shows the Fig. 7 shows the result of a sudden increase in rainfall from Q initial shock front travelling down into the soil and the eventual steady state. The saturation at the bottom does not increase towards 1 until the shock front arrives there. Fig. 8 shows the ^ = 0.1. Note that the time intervals shown are longer; result of a sudden decrease back to Q there is an initial expansion fan on the timescale of the gravity drainage which reduces the ^ followed by a slower decay (on the timescale saturation to the value given by K(S) = νQ, 1 O( η )) as the roots take up the remaining water. 3 Two-Porosity Model The expanded clay pellets used in green roof construction are quite large but contain lots of pore space. The difference in pore sizes between these, and the inter-pellet space means water can be drawn into the pellets and retained there for longer than it would otherwise remain in the soil. Thus a two-porosity model would seem appropriate. This is an outline of a “box" or “lumped" model for water storage in the macro-pores between soil particles, which have saturation S, and in the micro-pores within the particles, which have saturation SP . Transport of water into or out of the particles is parameterised to occur at a rate proportional to the saturation difference S − SP .12 The roots do not penetrate into individual particles so provide a sink term R only from the macro-pores. This root uptake R(S) is primarily due to the large negative pressure in the root system, but as saturation decreases a large capillary pressure acts to counteract this; thus R(S) is roughly constant for S close to 1 but decreases at small S (as in the model above). 12 A variant of this model might assume that water transfer into the particles occurs at a rate proportional to the pressure difference pcP f(SP ) − pc f(S); since the capillary pressure in the micropores would be larger than in the macropores (pcP > pc ), this would cause more water to be transferred into the micropores, and a larger supply would be maintained there for the roots to take up. 56 1 0.9 0.9 0.8 0.8 0.7 0.7 0.6 0.6 z/L z/L 1 0.5 0.5 0.4 0.4 0.3 0.3 0.2 0.2 0.1 0.1 0 0 0.2 0.4 0.6 0.8 0 1 S 0 0.2 0.4 0.6 R/R 0.8 1 0 Figure 7: Profiles of saturation and root uptake at time intervals of 1 (in the dimensionless units); the arrow shows the direction of increasing time. This is the result of a sudden increase ^ = 10, from the steady state when Q ^ = 0.1, and the dashed line shows the in rainfall to Q steady state that results. Parameter values are m = 1/2, δ = 10−2 , η = 4 × 10−4 , ν = 3 × 10−4 , ε = 10−2 , γ = 10−1 . 57 1 0.9 0.9 0.8 0.8 0.7 0.7 0.6 0.6 z/L z/L 1 0.5 0.5 0.4 0.4 0.3 0.3 0.2 0.2 0.1 0.1 0 0 0.2 0.4 0.6 0.8 0 1 S 0 0.2 0.4 0.6 R/R 0.8 1 0 Figure 8: Profiles of saturation and root uptake at time intervals of 10 (in the dimensionless units); the arrow shows the direction of increasing time. This is the result of a sudden decrease ^ = 10, to Q ^ = 0.1. Parameter values are m = 1/2, δ = 10−2 , η = 4 × 10−4 , in rainfall from Q −4 −2 ν = 3 × 10 , ε = 10 , γ = 10−1 . 58 The following equations are dimensionless, and the time scale has been chosen to be that due to uptake by the roots (the time scale differs from that used previously by a factor η). Drainage from the volume of soil is supposed to occur due to gravity at a rate K(S), and occurs on a time scale η compared to the uptake by the roots (see above). Rainfall provides a source which is scaled to be the same size as the gravity drainage (note this is different to above the scale for the rainfall here is large and is intended to represent the size of heavy showers; the dimensionless r(^t) will be 0 most of the time, when it is not raining, and O(1) when it is raining heavily). dS 1 1 = r(^t) − K(S) − λ(S − SP ) − R(S), η η d^t dSP (1 − φ)φP = λ(S − SP ), d^t ^ φP is the porosity of the pellets and where r = νQ, φ K(S) = S1/2 [1 − (1 − S2 )1/2 ]2 , which comes from equation (2.2) with m = 1 2 (3.1) (3.2) (3.3) and R(S) = 1 − ε (1 − S2 )1/2 . S (3.4) The use of K(S) for the gravity drainage in equation (3.1) is motivated by the fact that the water flow in the earlier part of the report is (since δ is small) essentially determined by this hydraulic conductivity. The time scale for water to diffuse into individual particles is estimated using their dimensions LP ∼ 1 cm and a diffusion coefficient DP ∼ 10−9 m2 s−1 . L2P /DP is comparable to the time scale for uptake by the roots (∼ 105 s), so the parameter λ is order 1. In equation (3.1) η is very small and in equation (3.4) ε is also small, and we consider especially the distinguished case ε ∼ η2/7 . The behaviour of solutions to this model is quite straightforward, and an example solution for a large rain storm followed by dry weather is in Fig. 9. When it is raining, r is order 1, and on a fast time scale, ^t ∼ O(η), the saturation S relaxes towards the equilibrium given by K(S) = r(^t). This causes water to then transfer into the particles on an O(1) time scale according to (3.2). By balancing terms in equation (3.1) we seek a similarity solution for small 9 S for which K(S) ≈ 41 S 2 . When it stops raining r = 0, and the saturation S decreases quickly due to gravity drainage on an O(η) time scale; approximately φ dS 1 ∼ K(S). d^t η (3.5) 2 Thus, neglecting the small O(η 7 ) terms suggests S tends towards 0 as 2/7 2φη S∼ . 7^t 59 (3.6) 1 0.8 0.4 S r / r0 0.6 0.4 0.2 0.2 0 0 0.5 1 1.5 t / t0 2 2.5 0 3 0 0.5 1 1.5 t / t0 2 2.5 3 0 0.5 1 1.5 t / t0 2 2.5 3 1 1 0.8 p 0.6 0.6 S R / R0 0.8 0.4 0.4 0.2 0.2 0 0 0.5 1 1.5 t / t0 2 2.5 0 3 Figure 9: Solutions for macro-scale and particle-scale saturations S and SP , and root uptake R, as a result of rainfall r(t) which represents a large rain shower. Dashed lines show the limiting behaviour. Parameter values are η = 10−4 , λ = 1, ε = 10−2 . On the O(1) time scale, therefore, S ∼ η2/7 , and the equations are ε 2/7 2/7 2/7 , O(η ) = O(η ) − λ(O(η − SP ) − 1 − S (3.7) dSP = λ(O(η2/7 ) − SP ). (3.8) ^ dt Thus, ignoring the O(η2/7 ) terms, Sp decays exponentially and the water coming out into the macropores is immediately taken up by the roots: R = λSP = −(1 − φ)φP dSP /d^t, as shown in figure 9. Root uptake is maintained for a much longer period after it ceases to rain than would be the case with no micropores, when S decreases rapidly towards 0. (1 − φ)φP 3.1 A Model for Fast Saturation Assuming instead fast saturation of the pellets, SP = SO H(S) , (3.9) where H denotes the Heaviside function, SP denotes the saturation of the individual pellets, SO is the porosity of an individual pellet, and S is the saturation of the inter-pellet pores. The required short time scale can arise from high capillary pressures associated with the very small pores within the pellets. 60 Taking now φ = 41 to be the total proportion of space occupied by air and water within the 1 soil, and SO = 15 , the inter-pellet porosity is ϕ = 16 (given by ϕ + 15 (1 − ϕ) = 41 ). Equation (2.9) can then be replaced by 1 ∂ ∂ ∂S ^, (S + 3H(S)) = K(S) + δD(S) − ηR (3.10) 16 ∂^t ∂^ z ∂^ z ^ ∼ 1, from (2.9) and (2.32). (Equation (3.10) might be better written in terms of the total with R water content, ST = 161 (S + 3H(S)), so that S on the right-hand side is replaced by S(ST ) = 0 3 for 0 ≤ ST ≤ 163 , S(ST ) = 16(ST − 163 ) for 16 ≤ ST ≤ 14 .) Where the pellets are saturated, S > 0 and H(S) = 1, the equations are as in Section 2. Here, for simplicity, an initially dry soil is considered, so that at ^t = 0, S ≡ SP ≡ 0. For ^t > 0, a ^ ^t) < z^ < 1 has become wet: region W( ^, S = H(S) = 0 in 0 < z^ < W ^ < z^ < 1 . S > 0 and H(S) = 1 in W (3.11) To obtain an order-one sized wet region, the relevant time scale must be that for the rainfall (days) so that time has to be rescaled: ^t = t̃/ν . (3.12) Note that this time scale is similar to that for the up-take of water by the plants’ roots. It is also appropriate, from the top boundary condition, to rescale the saturation: S = ν2/9 S̃ , (3.13) where, since we have assumed that m = 21 , K(S) ∼ 41 S9/2 and D(S) ∼ 14 S5/2 for small S. Neglecting the time-derivative term (now effectively of order ν2/9 ), the partial differential equation becomes ! ∂ S̃ 1 ∂ ^. = η̃R (3.14) S̃9/2 + δ̃S̃5/2 4 ∂^ z ∂^ z 1 Here η̃ = η/ν ≈ 31 and δ̃ = δ/ν2/9 ≈ 600 , using the values of Section 2. Although the value of δ̃ is small here, because of the uncertainty in the values of the physical parameters describing water transport through the soil, it could conceivably be of order one and it is therefore retained in (3.14), for the present. The differential equation is subject to the top boundary condition ! ∂ S̃ 1 ^ in at z^ = 1 =Q S̃9/2 + δ̃S̃5/2 4 ∂^ z (3.15) and, assuming that the diffusive, δ̃, term is retained, a lower boundary condition ^ t̃) . S̃ = 0 at z^ = W( 61 (3.16) ^ ^t) between dry and wet soil, conservation Finally, to fix the position of the free boundary z^ = W( of mass of water at this point, where SP jumps from 0 to SO , leads to ! ^ ∂ S̃ 4 dW ^ t̃) . at z^ = W( (3.17) S̃9/2 + δ̃S̃5/2 =− 3 ∂^ z dt̃ (Since, for δ̃ > 0, S̃ = point, the second term on the right-hand side should then be 0 at this ∂S̃ interpreted as δ̃ lim S̃5/2 .) ^ ∂^ z z^→W Of course, if the pellets were already partially saturated, (3.17) would be suitably modified, leading to a faster-moving free boundary. ^ in + z^ − 1 Note also that if the diffusion can be neglected, (3.14) and (3.15) lead to 41 S̃9/2 = Q so (3.17) gives ^ dW 16 ^ ^ −1 . − Qin + W (3.18) = 3 dt̃ ^ t̃ ≤ 0. An The free-boundary condition (3.17) only applies for an advancing wet region, dW/d alternative form is needed for when this region contracts, which will happen when the rainfall decreases sufficiently. In any part of the soil between the lowest location of the free boundary and its current position, the roots can continue to remove water from the pellets, thereby reducing SP . As described in this report we could could now have at least four types of region within the soil layer: 1. Dry zone, where S = SP = 0; 2. Damp or moist (unsaturated) zone I, where S = 0, 0 < SP < SO ; 3. Damp or moist (unsaturated) zone II, where 0 < S < 1, SP = SO ; 4. Wet (saturated) zone, where S = 1, SP = SO . 4 Conclusions In this report a one-dimensional time dependent mathematical model has been described for the development of the saturation in the soil layer of a flat green roof. Our model suggests that a fully-saturated (S = 1) region forms at the base of the soil layer and this region can be thin relative to the total soil thickness. 62 From an initial dry state and from the onset of persistent rain, fronts of saturation were computed to descend through the layer. The decrease of saturation from unity following a decrease in rainfall was also described. The end result is that most of the rainwater falls through the soil layer and exits through the network of holes in the bottom supporting sheet. On a smaller scale, the pellets and soil particles are themselves porous and made up of micropores. The water flow in and out of a typical particle is modelled using the flux between (a) the macropores (whose saturation is as modelled above) and (b) the root system. This two-porosity model suggests that during the time between spells of rain the micropores can retain (for long periods of time) water that is available to be taken up by the roots. Further work might include adapting the soil thickness L to rainfall at the site of the building with the aim of making L as small as possible, while avoiding problems with saturation and aridity. A first step towards this goal would be to carry out experiments to more accurately determine the values of the constants. Further simulations using more extensive rainfall data could then be carried out to determine the optimum soil thickness. In addition small modifications could be made to include the influence of a sloped roof. Acknowledgements We acknowledge the support of the Mathematics Applications Consortium for Science and Industry (www.macsi.ul.ie) funded by the Science Foundation Ireland mathematics initiative grant 06/MI/005. 63 Bibliography [1] M. T. Van Genuchten. A closed-form equation for predicting the hydraulic conductivity of unsaturated soil. Soil Sci. Soc. Am. J., 44:892-898, 1980. [2] T. Roose and A.C. Fowler. A model for water uptake by plant roots. J. Theoret. Biol., 228:155-171, 2004. 64 Uplift Quadratic Program in Irish Electricity Price Setting Report Contributors: Michelle Carey1,2, Conor Houghton 3, Matylda Jabłońska4 and John Kinsella5 Study Group Contributors: James Gleeson1, Gavin Hurley1 , Julie Hurley1 , Arno Mayrhofer1 , Colin Menzies1, Celine O’Donovan6 , Gearóid Ryan7 , Dengli Wang8 and R. Eddie Wilson9 Industry Representatives: Gavin Hurley10 and Sile Bourke10 1 MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland Report coordinator, michelle.carey@ul.ie 3 Department of Mathematics, Trinity College Dublin, Ireland 4 Department of Mathematics, Lappeenranta University Finland, Finland 5 Department of Mathematics and Statistics, University of Limerick, Ireland 6 Kemmy Business School, University of Limerick, Ireland 7 School of Mathematical Sciences, University College Cork, Ireland 8 Business School, Dublin City University , Ireland 9 Bristol Centre for Applied Nonlinear Mathematics, University of Bristol, United Kingdom 10 Bord Gáis Energy, First Floor (BGE), City Quarter, Lapps Quay, Cork, Ireland 2 Abstract This report summarises progress made towards the problem submitted by Bord Gáis at the 70th European Study Group with Industry organised by Mathematics Applications and Consortium for Science and Industry (MACSI) and held in the University of Limerick from the 28thJune 3rd July 2009. Bord Gáis required a deeper insight into the dynamics of Uplift prices. Therefore, the aim of the group was to apply a variety of analytical tools to the problem in order to satisfy Bord Gáis requirements. The group conducted a KKT Optimality Analysis of the quadratic program used to determine the Uplift prices, performed statistical analysis to identify the binding constraints and their sensitives to the Uplift prices, simulated a synthetic stochastic process that is consistent with the Uplift pricing series and investigated alternative objective functions for the quadratic program. 66 Introduction Energy prices in the Irish Electricity Market are composed of a Shadow Price and an Uplift Component. The Shadow Price is the cost of meeting a marginal unit of demand, as determined by solving the Unit Commitment Problem. The Unit Commitment Problem is a Mixed Integer Program which determines which units (generators) to commit (turn on), so as to minimize the total system cost of generation, subject to meeting demand in each period and respecting technical characteristics of the units (e.g. ramp-rates, min on/off times, minimum generation). Participants declare their true marginal costs and therefore the Shadow Price alone does not guarantee that generators will recover their fixed running costs, e.g. start-up and no-load. The Uplift component of prices addresses this. It is derived via a Quadratic Program (QP) which ensures that the generators recover those costs. The QP aims to minimize a parameterized combination of total additional cost due to Uplift and the volatility of Uplift. The problem involves investigating the quadratic program used to determine the Uplift component of the prices and to examine the behaviour of the this Uplift component. Our aim for the week was to exploit the expertise of the group to develop analytical tools that provided Bord Gáis with a deeper insight into the uplift process. The report begins by describing the structure of the quadratic program, followed by analysis of the optimal Uplift components using the Karush_Kuhn_Tucker (KKT) conditions. One can also employ statistical analysis to identify the generators contributing to Uplift prices, this method is described and the results produced. The subsequent sections involves simulating a stochastic process that reproduces the Uplift series to a quantitatively comparable level and investigating other objective functions for the QP. Finally, the report concludes with results and suggestions for further research. 67 1 Problem Formulation The “System Marginal Price" (SMP), the incremental cost of supplying the next MWh of energy demanded for each half-hourly period h, of day t is SMPh = SPh + UPh . Where SPh represents the “Shadow prices”, the marginal price of electricity per MWhr in each half-hour based on the information provided by the generators and UPh symbolizes the “Uplift”, the correction factor that is applied retrospectively to the shadow prices to cover the fixed running costs. The SEMO (Single Electricity Market Operator) determines the Uplift function by solving a quadratic program that minimises Uplift revenues (the Cost objective) and minimises the shadow price distortion (the Profile objective) " # X X X (SPh + UPh ) Qgh + β min F(UPh ) ≡ α UP2h , (1.1) UPh ,h=1...48 | subject to, X h h {z Cost objective [(SPh + UPh )Qgh ] ≥ CRg , UPh ≥ 0, g } | h {z } Profile objective for g = (1, . . . , G), (1.2) for h = (1, . . . , 48). (1.3) Given that Qgh denotes the volume (quantity) of electricity provided by each generator g in each half-hour period h and CRg is the total cost of running for generator g, given by X CRg = [Qgh Cu + NLCg I(Qgh > 0)] + STg , (1.4) h Where Cu is the variable (fuel) cost per unit, NLCg is the no-load cost of generator g, which is the element of operating costs which is invariant with the actual level of output and STg is the start-up cost of generator g, the cost of turning on generator g at any time h during day t. The first constraint given in equation (1.2) ensures that each generator g covers its costs CRg and the second constraint given in equation (1.3) certifies that all Uplift values are positive. For convenience equations (1.1,1.2,1.3) were converted into Matlab-like notation as follows; min F(u) ≡ αeT Q(s + u) + βuT u, u∈R48 (1.5) subject to, Q(s + u) ≥ c, u ≥ 0. 68 (1.6) (1.7) Please refer to the conversion table in the appendix for the details. The problem was attacked using four separate but complementary approaches. These approaches all provide a insight into the composition of the Uplift process and our outlined in the subsequent sections. Section one examines the optimality of the quadratic program using the Karush_Kuhn_Tucker(KKT) conditions, section two describes a statistical analysis of both the actual and re-constructed data set, section three develops a stochastic model that mimics the underlying dynamics of the Uplift prices and finally section five investigates the validility of alternative objective functions. 2 KKT Optimality Analysis of the QP The KKT 1st -order necessary conditions for a general inequality-constrained problem are: X λi ∆u ci (u) = 0 (2.1) ∆u L = 0 ≡ ∆u F(u) − i∈I where I is the index set for the inequality constraints. Computing the gradients with respect to u; αQT e + 2βu − QT λ1 − λ2 = 0, (2.2) where λ1,g ≥ 0 for g = (1, . . . , G) when the corresponding constraints are active and zero otherwise, and λ2,h ≥ 0 for h = (1, . . . , 48) when the corresponding constraints are active and zero otherwise. As the QP is convex, the KKT conditions are both necessary and sufficient for optimality (i.e the lagrange multipliers are unique). Moreover, the lagrange multiplier λ2 must be zero for all h. Therefore one can rewrite equation (2.2) as; 2βu = −αQT e + QT λ1 . (2.3) Under the Single Electricity Market (SEM) trading and settlement code the regulatory authorities determine the weighting parameters α, β in equation (2.2) for each year. The first parameter α governs the importance of the Uplift cost objective, such that 0 ≤ α ≤ 1. The second parameter β governs the importance of the Uplift profile objective, such that 0 ≤ β ≤ 1 and such that α + β = 1. The regulatory authorities have found that setting the parameters to α = 0 and β = 1 provides the most appropriate balance of cost and price stability. Therefore applying these conditions to equation (2.2), given that several generation constraints represented by g ∈ Ag0 are binding yields an optimal set of positive uplifts u which are a non-negative linear combination of the columns of QT (the rows of Q) corresponding to the binding constraints. X λ1,g (qg ) . (2.4) 2u = g∈Ag0 69 replacemen The binding constraints now take the form (Qu)g = cg − (Qs)g for g ∈ Ag0 . 2.1 Results Equation (2.4), motivated an analysis of historical data to determine which generation constraints have been binding in each 24-hour period. This is illustrated in figure (1), which clearly shows that the generators have an intrinsic band structure. Figure 1: Recurrence of Binding Generators Two small subsets of generators can be identified from figure (1). A set of the same generators that are clearly binding from day to day and a set of generators that are never binding. This provoked a further examination of the classification and frequency of the binding generators. The classification of generators that are binding and therefore influencing Uplift prices are shown in figure (2). It appears that it is almost always the thermal plants (gas, oil, distillate and coal) that have binding constraints. The frequency of generators that are typically binding is given in figure (3). It is clear that in general there is only one (45% of all occurrences) or two generators (34% of all occurrences) binding with very few occasions with more than four generators binding (.01% of all occurrences). 70 Figure 2: Classification of binding generators Figure 3: Frequency of binding by generator type Figure 4: Frequency of binding by generator type 3 Factor Analysis Using statistical analysis the group tried to identify factors that were contributing to Uplift prices, in order to state the sensitivity of Uplift prices to certain known factors. The Uplift 71 prices are an adjustment price and therefore can have zero values, in this case 40% of the Uplift price values are equal to zero. A factor model of Uplift prices was employed to decompose the random prices into factor related and specific prices. u = α + β1 f1 + . . . + βk fk + ε, where u denotes the time series of Uplift prices, α is a single coefficient introduced in order to ensure E[ε] = 0, βi is the sensitivity of Uplift to the factor fi and ε represents the error or Uplift specific price’s. If the vector of factors f is known one can model this using a macroeconomic factor model, however if they are unknown and therefore must be estimated one uses a statistical factor model. 3.1 Macroeconomic model The results from the KKT conditions shown in equation (2.2) are structurally similar to a macroeconomic factor model. The equation states that the factors contributing to Uplift prices u are qg , the quantity of electricity provided by each generator g in each half-hour period h. Using the macroeconomic factor model, one can verify the results from the KKT conditions with minimal assumptions on the underlying dynamics of the problem. The advantage to using this method is that it can be applied to both the reconstructed Uplift data and the actual Uplift data, producing the binding generation constraints in both cases. From figures (2) and (3), one can establish that 45% of the generators bind only once, 86% of generators have less than three binding generators and that wind, pumped storage, peat and the interconnector generators don’t contribute to Uplift prices. Exploiting this and the fact that all the parameters in equation (2.2) are positive one can identify the binding generators by analyzing the correlation structure between the output of each of the 85 generators that influence Uplift prices and the Uplift prices themselves. The results are shown in figures (5,6). In figure (5,6), the positive correlation values represented by the light green, yellow, orange and red give the set Ag of all generators that could be binding in any given day t, for the 385 days considered. From this one can identify how many generators are binding, which generators are binding and the sensitivity of Uplift prices to the binding generators on each day t. For the reconstructed data, there are 76 days in which one generator is binding this corresponds to generators which have a correlation coefficient value greater than or equal to 0.9999. To identify the binding generators when more than one generator is considered one would construct a macroeconomic factor model with the Uplift regressed against all possible generator’s identified by the correlation coefficients (> 0 and < 0.9999), the max number of possible binding generators in this case is 42. Therefore one can develop a stepwise algorithm which selects the combination of generators that are binding based on the AIC (Akaike’s information criterion) of each combination. AIC makes comparisons between a number of possible models on the basis 72 Figure 5: Correlations between generator’s and Uplift prices for reconstructed data Figure 6: Correlations between generator’s and Uplift prices for reconstructed data of the following statistic: AIC= −2 × log(maximum partial likelihood) + 2 × b, where b is the number of β coefficients in each model under consideration. The AIC is based on maximum likelihood and a penalty for each parameter. The best model has the smallest AIC value. This algorithm produces the binding generators and their sensitivities to Uplift prices on any given day t. This procedure can also be run on the actual Uplift prices. Thus producing the binding constraints and their sensitivities to actual Uplift prices. Figures (7 and 8) illustrate the results. There are only 5 days for which only one generator is binding. From figures (5) and (7) one can clearly identify a band structure indicating that there is a recurring group of generators that are binding. 73 Figure 7: Correlations between generator’s and Uplift prices for actual data Figure 8: Correlations between generator’s and Uplift prices for actual data 4 4.1 Uplift as an individual stochastic process Motivation for stochastic analysis In terms of forecasting, electricity prices are one of the most difficult types of financial time series. This is due to their highly volatile behavior. However, the Uplift prices exhibit some specific statistical features that can be exploited in order to reconstruct the series up to some qualitatively comparable level. The aim of the group was to reproduce a synthetic stochastic process which would behave similarly to the real Uplift prices, that would be suitable for Monte Carlo simulations or some other stochastic purposes. The following subsections present specific Uplift features that were identified and employed in simulations to generate the synthetic process and an examination of the synthetic process produced. 74 4.2 Identified features of the Real Uplift process Initially the group examined the path of the Uplift data over consecutive days. This is illustrated in figure (9). One can clearly identify a jump structure inherent in the data. Figure 9: Path covering a few consecutive days of Uplift half-hourly data Considering that by definition Uplift must be positive for all time t and the length of the plateaus may be similar to waiting times. One can assume a Poisson distribution to model the lengths of plateaus in the Uplift prices and an investigation of the empirical Uplift jump distributions could be used to verify the jump heights. Since electricity is highly dependant on half-hourly demand (contributing to half-hourly, daily and annual seasonality) it is reasonable to expect the Uplift prices to also reflect some seasonality features. These features can be clearly identified by the autocorrelation function (ACF) and partial autocorrelation function (PACF) for the Uplift price series presented in figure (10). As desired, we clearly see peaks in ACF repeating with 48−lag regularity. Additionally, they are slightly locally maximal points for every 336th lag (48 half-hours times 7 week days), portraying weekly seasonality but this is not as significant as the daily seasonality. Thus, in the simulation, in order to reproduce the daily seasonality the current observation is regressed against the observation that occurred 48 half-hour periods earlier. One would suspect that there is also an annual seasonality unfortunately there is an insufficient amount of data to verify this claim. 4.3 Uplift jump waiting times A Poisson distribution was employed to model jump waiting times in the Uplift process. However, due to time limitations, the parameters were set using heuristic methods. It is evident from Figure (9), that waiting times are not distributed uniformly, as there is a clear dependency on the current price level. In particular, the process zero level components are considerably long, whereas at high levels (40-100), the process jumps down almost immediately. Therefore, 75 Figure 10: Autocorrelation and partial autocorrelation functions for the Uplift price series. it seems sensible to set four different price levels, defined by various values of the Poisson λ parameter. These values are as follows; • λ = 50 for uh = 0. • λ = 10 for 0 < uh ≤ 40. • λ = 2 for 40 < uh ≤ 100. • λ = 0.05 for uh > 100. 4.4 Jump up/down probability and Jump heights One can also clearly identify a jump structure in figure (9). The higher the value of current Uplift, the higher the probability of a negative jump. Another trait that becomes apparent from examining the data and figure (9) is that if the current price level crosses some considerably high price level (in this case uh > 50), then the process exhibits continuous negative jumps until it reaches zero. Sometimes, these negative jumps occur in a few time steps but often within one time step only. Two empirical cumulative distribution functions conveying jump direction are derived from the occurrences of positive and negative movements in the Uplift series for the purpose of random number generations. Figure (11) is a histogram of the occurrences of jumps up and down for a given Uplift level. The histograms structure conveys an exponential curve. The distribution of the jump magnitude conditional on the price level is derived by computing the density function of the jump magnitudes. This is illustrated in figure (12). 76 Figure 11: Histograms for occurrences of jumps up/down depending on current Uplift level. Figure 12: Density function of the jump magnitudes. 4.5 Plateaus within night hours Plateaus during the night half-hour intervals are also apparent in the data these portray a high probability of constant (usually zero) Uplift during this period. Once the base process was simulated one could now induce this attribute by setting the night half-hour intervals equal to zero, based on a uniformly distributed probability of zero level continuing for 12 to 20 half-hour periods. 4.6 Algorithm The general algorithm for the simulation approach is as follows; 1. For h = (1, . . . , 48) set ua,h = us,h . Where ua,h is the actual Uplift price for the half-hour interval h and us,h is the simulated Uplift price for the half-hour interval h. 2. Given us,h for a particular half hour h, identify the corresponding Poisson parameter. 77 Using this parameter generate the length l of the plateaus (amount of half hour intervals h for which the process remains constant). 3. Set the current time point hc , to the previous time point hp plus the length of the plateau l. hc = hp + l. 4. For l > 1 then h = (hp + 1, . . . , hp + l) let us,h = us,h−1 and for l = 1, then h = (hp + 1) let us,h = us,h−1 . 5. Then, Given that us,h is the last price value after the last jump. • Set a limiting threshold T . If us,h > T , force the process into negative jumps until us,h = 0. • If us,h = 0, force the process to only allow positive jump’s generated from the empirical distribution. • If 0 < us,h < T , determine from the empirical distribution of jump direction whether the process jumps up or down, and then based on the empirical distribution of jump magnitude determine the magnitude of the jump. Sum the sampled jump magnitude and the regressed price value that occurred 48 periods ago. Set hc = hc + 1. 6. Repeat until the hc exceeds the target process length to be simulated. 4.7 Results The simulation was implemented to construct a sample of n half-hour intervals of the Uplift series. Figure (13) presents the simulated Uplift series which is quantitatively (by distribution parameters) comparable with the original Uplift series. Moreover, it is clear that our simulation is also able to produce values significantly differing from the process mean, which is also reflected in the real data. Figure 13: The real and simulated Uplift process. 78 In order to identify if the structural elements of the simulated Uplift process mimic those evident in the actual process, one would compare the two data sets microstructure. This is illustrated in figure (14). The figure confirms that the jump structure is accurately modelled by the simulated process and that the plateaus (more persistent when price at zero level) occur with different lengths at different price levels in both the actual and simulated data. Figure 14: The real and simulated Uplift process. Finally, figure (15) illustrates the ACF and PACF of the actual and simulated Uplift data. The half-hourly seasonality, apparent in the actual data has been reconstructed in the simulated data up to a significant level. Additionally, the PACF’s of both the original and simulated data have similar features. Figure 15: Autocorrelation and partial autocorrelation functions of the real and simulated Uplift process. 79 5 Other Objectives The group noticed that the objective formula equation (1.5) is not dimensionally consistent, the cost objective function has units of money whereas the profile objective function has units of money per load by time, all squared. This is dissatisfying, it means for example the currency has to specified as well as α and β and it has the practical problem that there is no reason to expect the cost objective and the profile objective to have a similar scale in fact, for the uplifts calculated with the β = 1 objective C̄ = 684651 P̄ = 13478. (5.1) Where C̄ is the mean of the cost objective and P̄ is the mean of the profile objective, taken over all the days considered. Of course, even if α and β were fine tuned to make these similar, the changes as UPh varies would not be comparable. In order to solve this problem of inconsistency one would have to identify an objective function that explicitly referred to total cost and to variance so that the balance between the two could be decided more transparently as a matter of policy by the industry. Hopefully this would also allow the actual economic cost of reducing variance to be computed as the mixture of these two terms was varied. To do this in a dimensionally consistent way seems to require that the same method of averaging is used for both terms, at the moment C the cost objective effectively averages over load while P the profile objective averages over time; for a given quantity xt , lets use hxt it = for the time average and for the load average; at the moment 1 X xt 48 t (5.2) P gt Qgt xt hxt iQ = P gt Qgt (5.3) C ≈ hut iQ P ≈ hu2t it (5.4) There are, correspondingly, two consistent objectives which would separate out cost and variance E2t = µhut i2t + (1 − µ) hu2t it − hut i2t (5.5) and E2Q = µhut i2Q + (1 − µ) hu2t iQ − hut i2Q (5.6) Both are quadratic objectives, both are dimensionally consistent and both have separate variance and cost terms. 80 Another approach to the objective is to decide that the Uplift price should be chosen so that the time course of the Uplift reflects the actual time course of the cost of generation; this may have the advantage of communicating economic signals to the consumers about the actual half-hour by half-hour cost of producing electricity. In short, imagine there wasn’t a single Uplift, but instead a time-constant Uplift ug for each generator period. This would spread the economic cost of g’s electrical production evenly across g’s load: cg t Qgt ug = P (5.7) The most obvious way to use this generator-period Uplift to calculate ut would be to take the maximum ut at every half-hour. This has the aesthetic advantage of matching this method to the method used to calculate the shadow price; it is however very expensive, it has an average cost of C̄ = 2690010 (5.8) An alternative would be to calculate a load weighted average of the ug : P g Qgt ug ũt = P g Qgt (5.9) This Uplift would be very cheap; C̄ = 72627 (5.10) and would give an Uplift time course well matched to the time course of the electricity generation cost, however, it does not satisfy constraints. The idea, therefore, would be to use this to pose an objective X Ẽ = (ũt − ut )2 (5.11) t Other profiling goals could be posed; for example related to the shadow price. Similarly, it might be possible to find a linear objective; C is already linear, all that would be required would be a linear approach to imposing the profiling goal. One method would be to specify a restricted space for ut ; it could be required that it tracks the total load, or the loads of the relevant generator periods or the shadow price or some positive weighted combination of these. 5.1 Conclusion This report uses various modelling approaches to gain a deeper insight into the underlying dynamics of the Uplift prices. Initially, the group examined the optimality of the quadratic program used to derive the Uplift prices using the KKT conditions. This resulted in equation (2.4) which states that the optimal set of positive uplifts u are a non-negative linear combination of the columns of QT (the rows of Q), corresponding to the binding constraints. This 81 result motivated an analysis of historical data to determine which generation constraints have been binding in each 24-hour period. In order to identify these binding generation constraints one can compute brute force searches or employ statistical analysis. The statistical analysis in section (3) exploits the results computed from section (2) and utilizes correlation coefficients and factor analysis to identify the binding generation constraints for each day t and the sensitives of Uplift prices to these binding generation constraints. While the statistical and brute force searches are both very separate approaches, the results from both methods were equivalent for the reconstructed data. Surprising results were that there is only a small amount of binding generators and that hydro plants were contributing to Uplift prices. Both methods also identified a band structure to the binding generation constraints, implying that there is a recurring group of generators that are binding. The advantage of using the statistical analysis is that the algorithm can be applied to the actual Uplift prices, producing the binding generation constraints and the sensitives of actual Uplift prices to these binding generation constraints. The group then developed a synthetic stochastic process which behaves similarly to the Uplift pricing series. By identifying specific statistical features of the Uplift prices the group was able to simulate a stochastic process which is qualitatively comparable with the actual Uplift prices, conveys structural elements that are inherent in the actual Uplift prices and portrays the halfhourly seasonality evident in the actual Uplift prices. Finally, the group investigated possible alternative objective functions for the quadratic program used to compute Uplift prices as the objective function used by the SEMO is dimensionally inconsistent. Alternative objectives were posed and discussed. Propositions for further research; a statistical factor analysis, where the parameters of interest are proxies which are influencing Uplift prices such as Oil prices, weather conditions etc, an investigation into the parameters of the Poisson distribution for modelling the waiting times and the validation of a linear profiling objective. A distributional analysis of Uplift for each day t and an examination of the effect of annual seasonality with the aid of extra data can be found in ([3]). Acknowledgments All contributors would like to thank Gavin Hurley and Sile Bourke for introducing the problem and assisting in answering questions and formulating the models. We also acknowledge support of the Mathematics Applications Consortium for Science and Industry (www.macsi.ul.ie) funded by the Science Foundation Ireland Mathematics Initiative Grant 06/MI/005. 82 Bibliography [1] Single Electricity Market Committee, SMP Uplift Parameters 2010, Consultation Paper, 7 June 2009, SEM-09-066. [2] All Island Project, SMP Uplift Methodology and Parameters, Decision paper, 15 March 2007, AIP-SEM-07-51. [3] Matylda Jablonska, Arno Mayrhofer and James Gleeson, Title, Journal (in preparation). Appendix Notation In industry standard notation the QP is: " # X X X UPh2 (SPh + UPh ) Qgh + β min F(UPh ) ≡ α UPh ,h=1...48 g h (5.12) h subject to X h [(SPh + UPh )Qgh] ≥ CRg , for g = 1 . . . G (5.13) UPh ≥ 0, for h = 1 . . . 48 (5.14) Here the range of h is h = 1 . . . 48 and the range of g is g = 1 . . . G. Writing the above in a more abbreviated notation where: u = (u1 , . . . , u48 ), uh ≡ UPh , h = 1 . . . 48 s = (s1 , . . . , s48 ), sh ≡ SPh , h = 1 . . . 48 c = (c1, . . . , cG ), cg ≡ CRg , g = 1 . . . G we have min uh ,h=1...48 F(uh ) ≡ α X h " (sh + uh ) X g 83 # Qgh + β X h u2h (5.15) subject to X h [(sh + uh )Quh] ≥ cg , for g = 1 . . . G (5.16) uh ≥ 0, for h = 1 . . . 48 (5.17) The above can be written in a Matlab-like notation as min F(u) ≡ αeT Q(s + u) + βuT u (5.18) Q(s + u) ≥ c u≥0 (5.19) (5.20) u∈R48 subject to where e is a column vector of size G of ones. 84 Particle Impact Analysis of Bulk Powder During Pneumatic Conveyance Report Contributors: Giles Richardson1 , Mark Cooker2 , Niamh Delaney3 , Kevin Hanley4 , Poul Hjorth5 , Dana Mackey6, Joanna Mason3,7, Sarah Mitchell3 and Catherine O’Sullivan8 Industry Representative: Seamus O’Mahony9 1 School of Mathematics, University of Southampton, United Kingdom School of Mathematics, University of East Anglia, United Kingdom 3 MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland 4 Department of Process and Chemical Engineering, University College Cork, Ireland 5 Department of Mathematics, Technical University of Denmark, Denmark 6 School of Mathematical Sciences, Dublin Institute of Technology, Ireland 7 Report coordinator, joanna.mason@ul.ie 8 Department of Civil and Environmental Engineering, Imperial College London, United Kingdom 9 Wyeth Nutritionals Ireland, Askeaton, Co. Limerick, Ireland 2 Abstract Fragmentation of powders during transportation is a common problem for manufacturers of food and pharmaceutical products. We illustrate that the primary cause of breakage is due to inter-particle collisions, rather than particle-wall impacts, and provide a statistical mechanics model giving the number of collisions resulting in fragmentation. 86 1 Introduction Pneumatic conveyance of food and pharmaceutical products in dehydrated, powdered form, is very common in Ireland’s process industries. During conveyance, powder particles may break into smaller pieces, due to impacts, resulting in a change in overall bulk density. A leading manufacturer of infant formula approached the 70th European Study Group with Industry to investigate the change in bulk density that they observe during transportation of this powder. The powder is inhomogeneous and consists of approximately six main ingredients, and around twenty additional micro-nutrients, which are smaller particles. An average particle is approximately spherical and around 100 microns in diameter. Particles consist of proteins, sugars and fats and can be either solid or hollow. If hollow, the size of the wall is typically one fifth of the diameter, and the walls can be solid or porous see Figure 1a. Initially the powder is produced from a liquid in the spray dryer. During this phase the particles can coalesce into clusters (see Figure 1b), which is desirable as agglomerates of particles are better at dissolving in water, when reconstituting the infant formula. The particles are blown along pipes by an air flow, and then intermediately stored in silos, before packing into cans. During transportation particles impact both the walls of the pipe and also each other. A schematic diagram of an industrial pneumatic conveyance line is shown in Figure 2. Typical parameters for both the particles, and the pneumatic conveyance line are listed in Table 3. Infant formula is usually measured out by volume using a scoop. Therefore bulk density of the powder is an essential quality control parameter for infant formula manufacturers. If there is too much breakage during transportation, the average size of the particle clusters decreases because the particles can pack together more tightly, and as a result the bulk density increases. This results in the formula becoming too nutrient rich when reconstituted. Conversely, if there is unusually little breakage during transportation, the final bulk density could be too low, and the rehydrated formula will be insufficient in meeting the infant’s nutritional requirements. As a certain amount of breakage during transportation is inevitable, manufacturers currently estimate the change in bulk density from empirical data. The aim of the study group is to devise a mathematical model for the amount of breakage during transportation, and thus quantify the bulk density change. The purpose of this report is twofold: to outline the main mechanisms of impact, and to provide an analytical expression for the rate of particle breakage during transportation. The outline is as follows. We begin in Section 2 by describing Dean flow, which is the standard way of describing flow in a curved pipe. In Sections 3 and 4 we consider particle flow in the pipe. We initially consider a toy two-dimensional problem, and then extend our analysis to the full three-dimensional problem. An overview of Hertzian contact laws, which we can use to estimate the energy at which particle impacts result in fracture, is presented in Section 5. For Hertzian theory to apply, it is assumed that the impacting bodies are solid. In Section 5.2 we outline an argument to illustrate that the walls of the particles under consideration are sufficiently thick that we may justifiably model them as solid. Section 6 focuses on constructing a statistical 87 M Figure 1: SEM images of a typical particle and clusters of particles. A scale bar is shown below each image. Photographs reproduced with permission from Kevin Hanley. 3m Tall−form Intermediate spray storage silo dryer Air out 20m 0.1m Fluidised bed 5m Air in Rotary feeder To Air input Pneumatic conveying line canning line Figure 2: A schematic diagram of an industrial pneumatic conveyance line for the transport of infant formula from the spray drying phase to the storage silos. 88 Quantity Particle radius Radius of pipe Radius of curvature of pipe Estimated density of particles in the pipe Estimated velocity of particles in the pipe Poisson’s ratio Young’s Modulus Coefficient of restitution Yield stress symbol a0 R κ0 ρ v0 ν E σf value(s) 5 × 10−6 –5 × 10−4 0.05 1 250–350 0.5–1 0.3 3 × 109 (from [10]) 0.2 5 × 105 units m m m kg/m3 m/s Pa Pa Table 3: Typical particle and pipe parameters provided by Kevin Hanley, unless otherwise stated. mechanics model of collisions resulting in fragmentation, which implicitly incorporates Hertzian theory. Finally, in Section 7 we provide some concluding remarks and outline areas for future work. 2 The flow in the pipe The pipe has an approximately circular cross-section and has a centreline that is weakly curved (i.e. has typical radius of curvature L ≫ 2σ where 2σ is the diameter of the pipe cross section. Flow in curved pipes is usually described as a Dean flow [4] and reviewed in [3]. Such flows are characterised by the so-called Dean number De and δ the ratio of the typical pipe radius (or halfwidth in the case of non-circular pipes) σ to its typical radius of curvature L. Unfortunately the Dean number is not defined in the same manner in all works. Here we shall adopt the definitions ρGσ3 (2σ/L)1/2 De = , µ2 δ= σ , L (2.1) where G is a typical pressure gradient along the pipe, µ is the viscosity of the fluid and ρ its density. At low Dean numbers (below about 100) this definition is equivalent to another commonly used definition namely 1/2 2σρW̄0 ^ = 4Re 2σ De , (2.2) where the Reynolds number Re = L µ where W̄0 is the mean velocity along the pipe. However these two definitions do not coincide for larger Dean numbers. 89 At low Dean numbers De ≪ 1, and for loosely coiled pipes δ ≪ 1, the flow is, to a good approximation, a Poiseuille flow along the axis of the pipe with a small circulatory component in the transverse cross-section of the pipe. In the study group we initially made the assumption that significant flow (as far as motion of particles in the flow is concerned) occurs mainly in the axial direction. However, as we shall briefly outline here, this assumption is probably invalid so that further investigation should be directed to considering flow in the cross-section of the pipe. In fact the high Dean number limit turns out to be of more relevance to the flows in the conveyance line than the low Dean number limit. The structure of the flow at high Dean number. The high Dean number limit has been considered previously by [5, 6, 14] by using a mixture of numerical and asymptotic techniques. They define the flow velocity u = wt + v(sin αb − cos αn) + u(cos αb + sin αn) (here t is the unit vector along the axis of the pipe, n is the unit vector in the plane of the pipe centreline directed towards the inside of the this curve, b is the other unit normal vector (see Figure 3)). In the limit De ≫ 1 δ ≫ 1 [5, 6, 14] have shown that in the bulk of the flow the scalings for the various velocity components are 1/2 ! ν ν ν L u = O De1/3 , v = O De1/3 , w = O De2/3 . (2.3) σ σ σ 2σ In addition they identify a boundary layer of thickness O(σDe−1/3 ) around the edge of the pipe in which the solution is such that 1/2 ! ν ν ν L u = O De2/3 , v = O De1/3 , w = O De2/3 . (2.4) σ σ σ 2σ At these high Dean numbers the Dean number is related to the mean axial velocity W̄ by (2.3c), that is by 3/2 9/4 W̄ σ . (2.5) De = O ν3/2 L3/4 Here we give no further details of this solution except to say that it has to be calculated numerically, even in the boundary layer. Turbulent pipe flow. The conveyance line flow occurs at high Reynolds number. So for example, we estimated the Reynolds number as 20,000 by taking the viscosity of the flow to be that of air. In practice the net viscosity of the mixture of air and particles will be more than this, so the Reynolds number will be less than for air alone. Nevertheless the flow may still be turbulent. The boundary layer structure of a turbulent flow in a straight pipe is described to a reasonable approximation by the so-called Law of the Wall (see for example [1, 16]). It is less clear however what the equivalent structure is in a curved pipe. 90 M b u v α n (a) t (b) Figure 3: Illustration of the coordinates used to describe the flow in the pipe. Panel (a) shows the definitions of the vectors {t, n, b} while panel (b) shows the definition of the velocity components (here t is directed into the paper). 3 Can collisions of particles with the wall of the pipe account for particle fragmentation? Here we consider particle motion in the pipe flow. Since particles are small such motion is usually dominated by the fluid drag, rather than inertia. This motivates us to ask the question can particles ever collide with the side of the pipe with sufficient velocity in order to cause their fragmentation (i.e. velocities of O(1ms−1 ))? In particular we note that it is the centrifugal force on the particle that is likely to be the cause of motion across streamlines towards the edge of the pipe and, further, that since the flow velocity at the boundary of the pipe is zero we would not expect the centrifugal force on a particle close to the edge of the pipe to be large. However it is still possible to envisage a situation in which there is a narrow boundary layer in the vicinity of the pipe wall, across which the fluid velocity drops rapidly to zero, where the particle has sufficient inertia to cross from the region of high fluid velocity to the pipe wall without losing most of its kinetic energy. In order to investigate this possibility we will consider the toy problem of a particle moving in a two-dimensional flow. 3.1 The 2D problem Here we consider a toy two-dimensional problem noting that where the two-dimensional boundary layer structure is similar to that of the real flow the resulting particle trajectories will be qualitatively similar. Since the particle Reynolds number is typically small we can approximate 91 its equation of motion by balancing particle inertia against linear Stokes drag to give d2 x dx m 2 = 6πµrp v − . dt dt (3.1) Here m is the mass of one particle, rp is the radius of the particle, x and v are the position and velocity of the particle respectively, and µ is the fluid viscosity. Note that the Stokes drag formula is assumed because the flow of air relative to the moving particle has a low (local) Reynolds number, quite different from the high Re for the air flow down the pipe. We would like to consider two particular scenarios. In the first (i) the particle moves in a pipe flow around a circular bend in which the predominant flow is in the direction of the tangent to the centreline t (i.e. takes the form u = wt). The second scenario (ii) aims to model the situation where the secondary Dean flows in the cross-section of the pipe are of more significance to particle-wall collisions than the primary flow along the axis of the pipe. In order to model these situations we introduce polar coordinates for the particle describing its position by x = R(t)er so that, on differentiation with respect to t we find ẋ = Ṙer + Rθ̇eθ , ẍ = (R̈ − Rθ̇2 )er + (2Ṙθ̇ + Rθ̈)eθ , where er and eθ are the unit radial and azimuthal vectors in the coordinate directions of the plane polars r and θ, whose origin is at the pipe centreline. Furthermore in both scenarios (i) and (ii) the flow is predominantly in the azimuthal direction; this motivates us to write v = g(R, θ)eθ . We non-dimensionalise (3.1) by writing x = R0 x∗ , R = R0 R∗ , v = V0 v∗ , t = R0 ∗ t, V0 which gives, on dropping ′ ∗s, substituting x∗ = R∗ er and equating components in the radial and azimuthal directions, R̈ − Rθ̇2 = −Γ̃ Ṙ, 2Ṙθ̇ + Rθ̈ = Γ̃ (g(R, θ) − Rθ̇). (3.2) where Γ̃ = 6πµrpR0 . mV0 In scenario (i) we are typically interested in a pipe with radius of curvature which is much greater than the pipe radius whereas in scenario (ii) we are interested in flow occurring in a narrow boundary layer around the edge of the pipe. This suggests that we should further write R = 1 − εz where R = 1 denotes the outer edge of the pipe and where the small parameter ε = XRbl0 gives the ratio of the boundary layer thickness to the typical radius of curvature of the 92 pipe. On making this substitution in (3.2), writing θ̇ = Ω and on rescaling time by writing t = εΓ̃ τ we obtain the following system d2 z dz = −(1 − εz)Ω2 − Υ 2 dτ dτ dΩ dz Υ (1 − εz) =g ^(z, θ) − (1 − εz)Ω − 2εΩ dτ dτ dθ = ε1/2 Υ−1/2 Ω. dτ (3.3) (3.4) (3.5) where Υ = ε−1 Γ̃ −2 . The key parameter in this problem is Υ (recall that ε ≪ 1). This parameter corresponds to the ratio of to the timescale for drag to accelerate a particle to the velocity of the flow to that for a particle to traverse the boundary layer. Thus if Υ ≪ 1 the particle moves, to a good approximation, with the velocity of the flow. If however Υ = O(1) the particle can cross the boundary layer to the wall of the pipe without being decelerated to almost zero velocity at the wall. In other words if Υ ≪ 1 it will impact the wall (at z = 0) with very small velocity (an asymptotic analysis reveals that the dimensionless velocity normal to the wall on impact will at most be O(Υ2 ) corresponding to a dimensional velocity of at most O(V0 ε1/2 Υ5/2 )). For Υ = O(1), however, the particle impacts the wall with O(1) dimensionless velocity normal to the wall corresponding to a dimensional velocity of O(V0 ε1/2 ). 10 In summary we expect that the chance of particle impacts on the wall leading to particle fracture is negligible if Υ= m2 V02 ≪ 1, (6πµrp)2 R0 Xbl where Xbl is the width of the flow boundary layer and R0 is the typical radius curvature of the problem (in case (i) flow along the axis of the pipe R0 is the typical radius of curvature of the pipe centreline while in case (ii) in which secondary flows dominate the process R0 is the radius of the pipe) . Where Υ = O(1) the particle impacts the wall with a velocity of O(V0 ε1/2 ) and there is a significantly greater chance of particle fracture on impact. Even where Υ ≪ 1 inter-particle collisions give another possible mechanism giving rise to particle fracture. This will be considered further in §4. Relevance to Dean flow. Consider first the flow along the pipe, parallel to its centreline, corresponding to case (i). Here we can identify R0 with the radius of curvature of the pipe centreline L, the velocity can be read off from (2.3c) and (2.4c) so that the key parameters are 1/2 L 2/3 ν R0 = L, V0 = De , Xbl = σDe−1/3 , σ 2σ 10 Note that where Υ ≫ 1 the asymptotic analysis reveals an inertial particle trajectory in which both w and dz/dτ are unchanged to leading order; in order to retain more information it is necessary to rescale distance normal to the boundary to give an O(1) value of Υ. 93 u u R θ R θ (i) (ii) Figure 4: The flows considered for the toy problem and thus give Υ= m2 ν2 De5/3 (6πµrp)2 2σ4 in case (i). (3.6) In case (ii) we can identify R0 with the radius of the pipe σ, while the typical velocity is that in the azimuthal direction (which can be read off from (2.4a)). The key parameters are thus R0 = σ, ν V0 = De2/3 , σ Xbl = σDe−1/3 , from which it follows that m2 ν2 De5/3 Υ= (6πµrp)2 σ4 in case (ii). (3.7) We can relate these expressions for Υ to the mean axial velocity in the pipe W̄ via (2.5). It is immediately apparent that Υ is the same order of magnitude in both cases signifying that the azimuthal and longitudinal velocities occurring in Dean flow have comparable effects on the motion of particles across the boundary layer adjacent to the pipe boundary. Thus any quantitative analysis requires that we consider the full three-dimensional problem (rather than the toy model treated here). 94 4 Particle transport in the pipe The particle Reynolds number is relatively small and so we can approximate drag force by the linear Stokes relation. Balancing inertia with the drag force gives d2 x dx m 2 = 6πµa v − , (4.1) dt dt where a is the radius of the particle, x and v are the position and velocity of the particle respectively, and µ is the fluid viscosity. Since the radius of the pipe is generally considerably smaller than its radius of curvature it is helpful to introduce a local coordinate system based about the centre line of the pipe, r = q(s), defined by (4.2) x = q(s) + ξn(s) + ηb, where n(s) and b are the principal unit normal p and unit binormal vectors respectively, s measures arclength along the centre line, and η2 + ξ2 measures the distance of the point from the centre line. We use the standard Serret-Frenet formulae (see, for example [12]) to relate the various quantities in (4.2), namely dq = t, ds dt = κn, ds dn = τb − κt, ds (4.3) where κ is the pipe curvature, τ is the torsion and t is the tangent vector. To make the equations complete we also have db/dt = τn. We assume a planar pipe and so τ = 0, therefore in the subsequent analysis we can ignore the n component of the equations of motion. In order to track an individual particle at position x(t) we rewrite (4.2) as x(t) = q(s(t)) + ξ(t)n(s(t)) + ηb and use (4.1) to find differential equations for ξ and s. Before substituting x(t) into (4.1) we nondimensionalise by choosing x= x∗ , κ0 q= q∗ , κ0 r = Rr∗ , ξ = Rξ∗ , a = a0 a∗ , t = η = Rη∗ , κ = κ0 κ∗ , t∗ , v = v0 v∗ , κ0 v0 s= s∗ , κ0 (4.4) where R is the radius of the pipe, κ0 is the typical curvature of the pipe centreline, a0 is a typical particle radius and v0 is a typical fluid velocity. On dropping the ∗’s the dimensionless equations (4.1) and (4.2) become d2 x dx Γ , (4.5) = 2 g(r)t − dt2 a dt 95 and x = q(s) + ǫ(ξn(s) + ηb), (4.6) where we have assumed that the flow is primarily p parallel to the centreline of the pipe and has the nondimensional form v = g(r)t, with r = η2 + ξ2 . We assume that the coordinate η is constant while the particle moves. Note that equation (4.3) remains unchanged. The dimensionless parameters in (4.5) and (4.6) are given by Γ= 9µ 2a20 v0 κ0 ρ ǫ = Rκ0 . , (4.7) Typical sizes of these parameters, using Table 3, are Γ ≈ 11 and ǫ ≈ 0.1, which allows us to set Γ = γ/ǫ where γ = O(1). Using the Serret-Frenet equations (4.3) we differentiate x in (4.6) to give dx = ṡ(1 − ǫξκ)t + ǫξ̇n dt d2 x dκ 2 s̈(1 − ǫξκ) − ǫ ṡ 2 ξ̇κ + ξ = ṡ t + κ(1 − ǫξκ) ṡ + ǫ ξ̈ n. dt2 ds (4.8) (4.9) Upon substitution into (4.5) and equating coefficients of t and n we obtain the following equations relating ξ and s γ dκ [g(r) − ṡ(1 − ǫξκ)] (4.10) s̈(1 − ǫξκ) − ǫṡ 2ξ̇κ + ξ ṡ = ds ǫa2 γ κ(1 − ǫξκ)ṡ2 + ǫξ̈ = − 2 ξ̇. (4.11) a Assuming nǫ ≪ 1 we find that the leading order terms satisfy p a2 κ(s) 2 ṡ . (4.12) ṡ = g( η2 + ξ2 ), ξ̇ = − γ Eliminating ṡ gives the expression i2 a2 κ(s) h p 2 g( η + ξ2 ) , γ (4.13) i2 2a2 κ(s)v20 ρ h p g( (η/R)2 + (ξ/R)2) . 9µ (4.14) ξ̇ = − or in dimensional form ξ̇ = − Further work could treat a particle on the median plane η = 0. Also quite generally, since decreases to zero as the particle g(r = 1) = 0 at the wall, we can at least be sure that dξ dt 2 approaches the wall. Although we can’t set g(r) = 1 − r , because the Reynolds number is high, such a dependence in the boundary layer near r = 1 tells us that 1 − r = 1 − ξ(t) (on η = 0) decreases to zero with exponential decay, hence there is no impact velocity. 96 5 5.1 Hertz theory for binary collisions Critical Energy Hertz theory is an analysis of stresses at the contact of elastic solids. It was initially developed for static loading, but subsequently extended to quasi-static impacts (e.g. of spheres). We use Hertz theory to give an estimate for the critical energy, i.e., the energy needed to break a given particle of some radius a, here we give a brief overview of the key equations. As in [15], consider a (collinear) collision between two particles, modelled as smooth, homogeneous, solid, elastic spheres, of radii a1 and a2 respectively. The elastic properties of the spheres are given by their Young moduli Ei and Poisson’s ratios νi (i = 1, 2). We look at a quasi-static process and thus consider an instant where the mutual force of compression is P. The spheres compress elastically, with a contact area of radius ac , see Figure 5. P z Figure 5: From left to right: elastic sphere forced onto an elastic plane with a force P, and two spherical particles of radii a1 and a2 colliding collinearly. z is the amount of compression, and ac is the radius of the contact disk. The distribution of pressure across the contact area, as a function of distance r away from the centreline, is r p = p(r) = p0 1 − , (5.1) ac where ac is the radius of the contact zone between the particles, and the maximum pressure p0 is given by 3P p0 = . (5.2) 2πa2c 97 The effective values of the Young’s modulus E∗ and radius a∗ are defined via 1 − ν1 2 1 − ν2 2 1 + , = E∗ E1 E2 1 1 1 + , = a∗ a1 a2 and respectively. From Hertz theory, we then have the following relations between the applied force P, the effective Young’s modulus E∗ , the effective radius a∗ , and the compression distance z: 4 P = E∗ a∗1/2 z3/2 , 3 where z is given by a2c = a∗ z, and ac = 3Pa∗ 4E∗ 13 . (5.3) (5.4) The critical compression distance, at which fracture occurs, is denoted zf . For a collision that just reaches this critical compression length zf , the work done by the compression force just equals the initial kinetic energy Er , and thus: Z zf 1 2 Er = mVf = P(z)dz. (5.5) 2 0 V is the relative impact velocity, and Vy is defined as the yield velocity. Provided V < Vy the interaction is assumed to be elastic. Inserting the above expression for P(z) we get Er = 8 E∗ z5f , 15 a∗2 (5.6) We can relate the critical compression distance zf to the contact yield stress σf of the material by σf ≡ p0 (zf ), From equation (5.2) and (5.3) we have σf = 2E∗ zf . πa∗ (5.7) Rearranging for zf , and substituting this expression into (5.5) we obtain the energy at which the particle is estimated to fracture: Er = π5 a∗3 σ5f . 60 E∗4 (5.8) We shall return to this expression in Section 6, where we shall consider the statistics of particle collisions leading to fragmentation. 98 5.2 Particle structure Hertz theory, as mentioned in the previous section, assumes that the two impacting spheres are solid. However, the particles that make up infant formula, see Figure 1 are clearly hollow. The study group was asked to investigate how far into the particle there are significant increases in stress, and therefore whether modelling the spheres as hollow should be incorporated into the model. Here, we consider the simpler case of two impacting cylinders, although it should be straightforward to extend the result to two spheres. Recall that z is the distance from the contact point. If we take the origin of the z-axis to be at the point where the two cylinders first contact, then z = 0 is the location of a plane of contact between the two cylinders i.e., the point at which the contact force is distributed over a rectangle of width 2ac and length equal to the length of the shortest cylinder. This is the smallest possible area over which the contact force is distributed, and therefore the point at which the stress has its maximum value. As the compression increases, the rectangular area becomes larger, because the width of the rectangle constantly increases due to the curvature of the circular cross-section. Since stress is force/area, the stresses decrease as z increases. From [7] we have an expression for σz , the stress in the z direction ac σz , = −p p0 a2c + z2 (5.9) ∗ 1 2 where p0 is the maximum pressure, and for two cylinders in contact is given by p0 = PE πR (see [7]). (Note that the convention in mechanical engineering is to take compressive stresses to be negative.) In Figure 6 we plot a graph of the stress divided by the maximum pressure ( pσ0z ) against the distance from the contact point, zac , for a constant ac and typical particle parameters. At z = 0, the compressive stress is at its maximum. We also observe that the stresses diminish extremely rapidly as you move further away from the contact point. We expect an entirely analogous result in the case of two impacting spheres. If the stress changes are non negligible at a depth that is close to the wall thickness, then Hertz theory is not applicable. If the extent of the zone where there is a measurable increase in stress is small, then Hertz theory can be used to represent the contact mechanics of these particles. As the walls of the infant formula particles are approximately a fifth of the diameter, we conclude that approximating the particles as solid spheres is a reasonable assumption. 99 z 300 ac 250 200 150 100 50 0 −1 −0.8 −0.6 −0.4 σz p0 −0.2 0 Figure 6: We consider two cylinders in contact. The graph shows the stress divided by the maximum pressure ( pσ0z ) plotted against the distance from the contact point, ( azc ), for a constant ac . 6 The statistics of collisions leading to fragmentation Consider a particle size distribution such that the proportion of particles with radius lying in the range (a, a + da) is n(a)da. Furthermore take the probability of breaking a particle of size a when it undergoes a collision in which an energy in the range (E, E + dE) is released to it is pb (E, a)dE. We note that at present both these distributions will have to be derived from experiment, though an ultimate goal of this type of work might be to formulate a Smoluchowski model of the evolution of the size distribution n(a). In §4 we calculated the transverse velocity ξ̇, due to centrifugal force, of a particle being advected in a curved pipe as a function of its radius. Notably this was proportional to the square of the radius of the particle (bigger particles travel more quickly than smaller ones). We also saw that this velocity slowed markedly as the particle approached the outer edge of the pipe, a consequence of the slowing of the flow along the pipe in the immediate vicinity of the pipe wall. This lead us to conclude that collisions of particles with the pipe wall give rise to insignificant particle fragmentation and to hypothesise that the main cause of particle breakage is collisions between particles of different radii travelling with different transverse velocities ξ̇. It is the aim of this section to quantify this process. We start by considering the total number of collisions between particles of sizes in the range (a, a + da) with those in the range (α, α + dα). In the direction of the principle normal n they approach each other with velocity |ξ̇(a)− ξ̇(α)|, see Figure 7. Since they will only collide if their 100 a |ξ̇(a) − ξ̇(α)| b α Figure 7: A schematic diagram of a particle with radius a approaching a particle with radius α. b denotes the distance between their centres, and |ξ̇(a) − ξ̇(α)| their approach velocity. centres come within a distance (a + α) of each other the average number of collisions occurring to a particle of radius a with particles with radii in the range (α, α + dα) in a small time dt is Nπ(a + α)2 |ξ̇(a) − ξ̇(α)|n(α)dαdt, where N is the total number density of all particles. It follows that The number of collisions in time dt per unit volume between particles with radii in range (a, a + da) with those with radii in range (α, α + dα) = N2 π(a + α)2 |ξ̇(a) − ξ̇(α)|n(α)n(a)dαdadt Of these collisions we need to work out how many lead to fracture. We have already calculated the energy released in an impact between a particle radius a and one with radius α when their centres are offset by a distance b (see Figure 7). Such collisions release energy 1 b2 m(a)m(α) 2 Er (a, α, b) = (v(a) − v(α)) 1 − . 2 m(a) + m(α) (a + α)2 Of this we hypothesise that the energy divides between the particles proportional to their mass, that is Energy available for fracture of particle a Energy available for fracture of particle α m(a) Er (a, α, b), m(a) + m(α) m(α) Er (a, α, b), = m(a) + m(α) = (6.1) (6.2) where Er is the energy at which the particle is estimated to fracture, and is given in Section 5. The fraction of collisions for which the offset distance between the centres of the particles (at collision) lies in (b, b + db) is (2πbdb)/(π(a + α)2 ) (see Figure 8b) and the energy released in such a collision is Er (a, α, b)db. 101 α b+db b α a a+ α Figure 8: Here the first figure shows a large particle of size a approaching a smaller particle size α. The second figure shows the end view of the cylinder with the smaller particle lying at the centre. Provided the centre of the larger particle (with radius a) lies within the cylinder the particles will collide. Referring to (6.2) we find that Z Number of particles size 2 range (a, a + da) per unit time = N π (a + α)2 |ξ̇(a) per unit volume which fracture Z Emax m(a) m(a) pb −ξ̇(α)|n(a)n(α) Er d Er dα. (6.3) m(a) + m(α) m(a) + m(α) 0 Translating (6.3) into terms of the separation b between the particle centres (perpendicular to the velocity) upon impact this can be written as Number of particles size Fracture rate = range (a, a + da) per unit time = F(a, x) per unit volume Z∞ m(a)(a + α)2 2 |ξ̇(a, x) = Nπ 0 m(a) + m(α) Z a+α m(a) ∂Er −ξ̇(α, x)|n(a)n(α) pb Er (a, α, b) db dα. (6.4) m(a) + m(α) ∂b 0 7 Conclusions We have provided a model for the flow in the curved pipe which we used to show that the velocity of particles decreases rapidly as they approach the pipe wall. We believe that particle102 wall collisions give rise to insignificant particle fragmentation, and that the main mechanism for particle breakage is collisions between particles of different sizes. We have analysed inter-particle collisions, using Hertzian contact laws to provide a criterion for when a binary collision leads to breakup. In addition we have justified modelling these particles as solid, rather than hollow. Finally, we used a statistical mechanics approach to generate an expression for the total rate of particle breakup in the flow. Future work would involve experimental verification of this model. Acknowledgements This work was carried out at the 70th European Study Group with Industry, hosted by the Mathematics Applications Consortium for Science and Industry (MACSI), funded by the Science Foundation Ireland (SFI) Mathematics initiative 06/MI/005. We would like to thank William Lee for his comments on an earlier draft. 103 Bibliography [1] P. Bradshaw and G.P. Huang, The Law of the Wall in turbulent flow, Proc. Roy. Soc. Lond. A 451, 165–188, (1995). [2] N.V. Brilliantov, F. Spahn, J-M. Hertzsch, T. Pöschel, The collision of particles in granular systems. Phys. A, 321, 417–424, (1996). [3] S.A. Burger, L. Talbot and L.-S. Yao, Flow in curved pipes. Ann. Rev. Fluid Mech. 15, 461–512, (1983). [4] W.R. Dean, The stream-line motion of fluid in a curved pipe. Phil. Mag. 5, 673 (1928). [5] S.C.R. Dennis and N. Riley, On the fully developed flow in a curved pipe at large Dean number. Proc. R. Soc. Lond. A 434, 473–478, (1991). [6] H. Ito, Laminar flow in curved pipes. Z. agnew Math. Mech. 49, 653–663, (1969). [7] K.L. Johnson, Contact Mechanics, Cambridge University Press, (1985). [8] F. Kun and H.J. Herrmann, Transition from damage to fragmentation in collision of solids. Phys. Rev. E, 59, 3, (1999). [9] T.J. Pedley, Mathematical modelling of arterial fluid dynamics. J. Eng. Math. 47, 419–444, (2003). [10] M. Perkins, S.J. Ebbens, S. Hayes, C.J. Roberts, C.E. Madden, S.Y. Luk, N. Patel, Elastic modulus measurements from individual lactose particles using atomic force microscopy. Int. J. Pharm., 332, 168–175, (2007). [11] T. Pöschel and N. Brilliantov (editors), Granular Gas Dynamics, Springer Lecture Notes in Physics, (2003). [12] John Roe, Elementary Geometry, Oxford University Press, (1993). [13] J.H. Siggers and S.L. Waters, Steady flow in pipes with finite curvature. Phys. Fluids 17, 077102 (2005). [14] F.T. Smith, Steady motion within a curved pipe. Proc. R. Soc. Lond. A 347, 345–370, (1976). 104 [15] C. Thornton, Coefficient of restitution for collinear collisions of elastic-perfectly plastic spheres. J. Appl. Mech., 64, 383–386, (1997). [16] http://www.maths-in-industry.org/miis/105/ 105 106 Improvement of Energy Efficiency for Wastewater Treatment Report Contributors: Mark Burke1 , Michael Chapwanya2,3, Kevin Doherty4 , Ian Hewitt 5, Andrei Korobeinikov2 , Martin Meere6, Stephen McCarty 7, Mick O’Brien2 , Vo Thi Ngoc Tuoi6 and Henry Winstanley2,5 Industry Representatives: Toshko Zhelev8 and Jaime Rojas8 1 Department of Mathematics and Statistics, University of Limerick, Ireland MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland 3 Report coordinator, m.chapwanya@ul.ie 4 National University of Ireland Galway, Ireland 5 OCIAM, Mathematical Institute, University of Oxford, United Kingdom 6 Department of Applied Mathematics, National University of Ireland Galway, Ireland 7 University College Cork, Ireland 8 Department of Chemical and Environmental Science, University of Limerick, Ireland 2 Abstract Wastewater treatment requires the elimination of pathogens and reduction of organic matter in the treated sludge to acceptable levels. One process used to achieve this is Autothermal Thermophylic Aerobic Digestion (ATAD), which relies on promoting non-pathogenic thermophilic bacteria to digest organic matter and kill pathogens through metabolic heat generation. This process requires continuous aeration that may be energy consuming, and the final aim of the study is to identify how the process design can minimize the energy input per mass of treated sludge. Appropriate modeling of the reactor process is an essential ingredient, so we explore properties of an existing model and propose a simplified alternative model. 108 (a) A single-reactor design. An example of this design is found in Spain. (b) A two-reactor design. Semi continuous or continuous treatment requires multiple reactors to achieve pasteurization. An example of this design is found in Killarney, Ireland. 109 1 Introduction 1.1 Background Wastewater treatment employs a variety of processes to reduce the environmental and health impacts of effluent. Treatment is an environmental necessity but is itself very energy-intensive, and this energy use is one of the principal costs and environmental impacts of the treatment. Wastewater impurities include a wide range of inorganic and organic chemical species and microorganisms. A filtered, concentrated ‘sludge’ is produced at initial stages of treament. A key subsequent stage is to eliminate potential pathogenic organisms and reduce the organic chemical content which might act as a substrate for further microbial growth. One process that is used to achieve these treatment goals is autothermal thermophilic aerobic digestion (ATAD). ATAD makes use of bacterial growth within the sludge both to reduce organic chemical content and to kill pathogenic bacteria. Aeration of the sludge promotes the growth of aerobic bacteria which feed on and reduce the organic substrates available in the sludge. This metabolic activity generates heat and raises the temperature of the sludge. Thermophilic bacteria in the sludge thrive at high temperatures (Fig. 1). Most pathogens, meanwhile, are mesophiles: their total metabolic activity reduces rapidly at temperatures above 40 – 45 o C, and they are eventually killed by sufficiently long exposure (Fig. 2). Aerobic thermophiles proliferate and dominate in the hot sludge, and ATAD uses their metabolic activity and heat generation to achieve the treatment goals of pasteurization and reduction of organics. 8 Thermophilic decay rate, [1/day] Thermophilic growth rate, [1/day] 25 20 15 10 µ −6.086/(T−34) HT = 155.4e −7.734/(T−34) − 132.8e 5 Experimental data Numerical fitting 0 35 40 45 50 55 o Temperature, T [ C] 60 HT = 0.001T2 + 0.11T − 2.8 6 5 4 3 2 65 b 7 Experimental data Numerical fitting 35 40 45 50 55 o Temperature, T [ C] 60 65 Figure 1: Temperature dependence of the thermophilic growth and decay rate (adapted from [6]). The time and cost implications of running full-scale experiments for different plant arrangements and operating conditions render such an approach infeasible for process design and operational management. Laboratory scale experiments have provided understanding (to varying degree) of many of the microbiological and chemical processes involved. By encapsulating much of this 110 ′ Mesophilic decay rate, [1/day] Mesophilic growth rate, [1/day] 80 Experimental data Numerical fitting 18 16 14 12 −42.95/T µ = 72e 10 H −135.6/T − 201.5e 70 −159.9/T 60 b =1075e H 50 40 30 20 10 8 25 30 35 40 45 50 o Temperature, T [ C] 55 60 0 65 Experimental data Numerical fitting 25 30 35 40 45 50 o Temperature, T [ C] 55 60 65 Figure 2: Temperature dependence of the mesophilic growth and decay rate (adapted from [6]). understanding in mathematical models, engineers have provided tools which allow computer simulations to inform the design process. The ‘Activated Sludge Model 1’ (ASM1) [5] and subsequent variants [3, 4] have achieved a broad level of acceptance in the wastewater treatment world. Based on the dominance of the ASM1 model (and family) in the field, its extension to the ATAD process as set out by [6, 2] appears to have created a de facto standard for modeling this process, despite drawbacks summarized below. We identify this existing model with the label ASM1. 1.2 Objective The intended purpose of the Study Group problem was to use a model of the ATAD treatment process to provide insights into the optimization of both plant design and operating conditions in order to reduce the energy consumed per mass of treated sludge while achieving the required treatment goals. Approaching the problem as a multidimensional constrained optimization based on the ASM1 model, the problem presenters had identified some generic indications for performance improvements, but encountered numerical difficulties due to the high dimensionality and sensitivity to parameters inherent in the ASM1 model. Their next intention was to identify rational means of reducing the dimensionality of the optimization. 1.3 Report Overview This report addresses the problem in the following way: • Section 2 provides a more detailed overview of the problem and the considerations raised 111 during the Study Group. • In Section 3 we discuss the ASM1 model, perform a nondimensionalization based on typical parameter values (adapted from [2] and [6]), and identify certain regimes within the timecourse of a single typical batch reaction for which asymptotic approximations may be appropriate. • In Section 4 we present a simplified ‘ESGI70’ model proposed by the Study Group with the intention of being the minimal model required to describe the observed process. The model comprises a system of ODEs with four dependent variables. • In Section 5 we compare our simulations to experiments and present a sensitivity analysis of the ESGI70 model. • In Section 6 we present an optimal strategy for pasteurization. 2 2.1 Problem Overview Treatment Goals Legal requirements for sludge treatment in Ireland and USA set minimum standards for the pasteurization and stabilization of the treated outflow sludge ([8]). Stabilization refers to the reduction of volatile solids concentration between sludge intake Xfeed and outflow Xout , and the vs vs required minimum reduction is set at 38%: − Xout Xfeed vs vs ≥ 0.38. Xfeed vs (2.1) Pasteurization is achieved by the sludge being subjected to high temperature for a prolonged period of time. Rather than requiring direct microbiological testing of the outflow, the legal requirement for pasteurization specifies a minimum value of 1.0 for a derived ‘lethality’ statistic which is an empirically-based function of sludge temperature and time elapsed in the ATAD reaction process: Zt 1 bT (t) L(T, t) = 10 dt ≥ 1, (2.2) t0 a where temperature T is measured in Celsius, elapsed time t in days, and a = 5.007 × 107 and b = 0.14 ◦C−1 are defined parameter values. Lethality L is only calculated in continuous time intervals in which temperature T > 50 ◦C, and it is equal to zero when T ≤ 50 ◦ C. 112 Air In: Raw sludge Out: Pathogen free solids Figure 3: Schematic representation of a reactor. The ATAD process must have a feed control mechanism which ensures that the organic feed is above a certain minimum concentration for adequate heat production. The solids must also be below a maximum concentration to allow adequate mixing. See for example [8]. 2.2 Treatment Plant and Process ATAD is operated as a batch or semi-batch process. A large reactor containing sludge receives an additional volume of untreated sludge at the start of a batch via a feed inlet (see schematic in Fig. 3). During the batch reaction air is pumped continuously into the reactor and provides both the oxygenation required for aerobic bacterial growth and the physical mixing of the sludge in the whole reactor. Digestion of organic substrates proceeds hand in hand with bacterial growth, predominantly of thermophiles due to the elevated temperature. At the end of the batch reaction time, a volume of treated outflow sludge is removed and is immediately replaced by the next batch of intake sludge. Thus the outflow sludge has the same composition as the mixed sludge in the reactor at the end of the batch reaction time tbatch . Batch outflow and inflow cannot be open at the same time since untreated inflow sludge might be drawn off with and taint the outflow sludge. The time between successive batch intakes is typically fixed at 24 hours for staffing reasons. In order to achieve the desired treatment outcomes, treatment plant designs may include a single ATAD reactor stage, or two reactors in series with outflow from the first reactor being inflow into the second. In some cases, the treatment plant consist of an ATAD reactor and an anaerobic mesophilic digester in series (cf. [8]). The method of operation and the number of reactors used depends on the fluctuation of hydraulic loading and on the quantity of sludge requiring treatment. 113 2.3 Optimization As indicated in Section 1.2, the primary objective is to optimize the plant design and operating conditions so that the minimum energy consumed per mass of treated sludge. The plant’s energy consumption is dominated by the submerged aeration pumps’ energy requirements. For a sludge aeration rate A (air volumes per reactor sludge volume per day), reactor volume V, and batch inflow (and outflow) volume Vin , the optimization corresponds to finding the set of controls U which yield Z V min A(t)dt, (2.3) U Vin subject to the treatment requirement constraints Eqs. (2.1),(2.2). The full range of optimization controls U comprises both plant design issues and operating conditions. Design choices include reactor volume and the use of single or series reactors. Other non-standard possibilities can also be assessed such as parallel heat exchangers to pre-warm the sludge inflow from sludge outflow, reactor, or exhaust air heat sources. Controls on operating conditions in the reactor are limited to the aeration rate and batch reaction time tbatch , and controls on the batch inflow volume, temperature and substrate concentration. For staffing reasons, current standard operating conditions generally use a 24-hour batch cycle and maintain a constant aeration rate A during the whole process. The inflow volume can be expressed as a reactor batch exchange fraction Q = Vin /V ∈ (0, 1] and is typically in the range 0.1 – 0.2. The inflow is usually at ambient temperature and existing plants are not set up for inflow heating either directly or by heat exchangers. The substrate concentration of inflow sludge can be increased by addition of a bulking organic component at minimal expense. 2.4 Study Group Approach As with many biological systems, there is considerable uncertainty in inferring both the functional description of processes and the accuracy of parameterizations when applying laboratoryderived models to the real system. The current models in the wastewater literature are motivated by microbiological and engineering interest and have been devised to incorporate as much as possible of the apparent understanding of underlying processes. As a result they tend to include a relatively large number of variables, processes, and parameters. This stands in contrast to the paucity of data accessible from ATAD plants in operation: aeration rate and sludge temperature are easily monitored, but data on sludge composition at inlet and outlet and during the course of the batch reaction is typically limited to the total dry weight of solids and the chemical oxygen demand of the ‘volatile solids’ (by controlled chemical oxidation of the sludge). Volatile solids include the biomass of bacteria in the sludge together with organic growth substrates and metabolically inert organic compounds, so no distinction is made between these components in the measured data. In addition to the known model being over-parameterized, commonly-used values of several 114 parameters are inferred from curve-fitting predicated on the model itself. In such an exercise, the discrepancy between the large numbers of variables and parameters in the model and their poor representation in the available data clearly motivates the use of a simplified model of the process. The key focus of the Study Group was modeling the core process of a single reactor system during a batch reaction. Such a model is trivially set up to run successive batches and can easily be extended later to a multi-reactor system and other design variants. Two approaches were followed: one being to investigate the existing ASM1 model for mathematical simplifications (Section 3), the other being to construct a new model using a plausible minimal set of interactions to see if it can reproduce the observed behavior (Section 4). The optimization of the process based on such a model was not fully addressed in the Study Group, though some sensitivity analysis and initial investigations into its optimization were tackled. The focus on the core process model and the relative lack of suitably realistic estimates for many of the plant-specific parameters suggested greater priority be placed on single reactor operating conditions as optimization controls. Thus the initial focus was on a single reactor system of fixed design, with plant design optimization left as a task for future work. 2.5 Basic Modeling Assumptions Some simplifying assumptions permit us to reduce the complexity of the model while at the same time retaining the essential aspects of the processes: • There is complete mixing inside the reactor(s). • All the biological activity takes place only in the reactors. • The batch outflow and inflow stages are of sufficiently short duration that biological activity during them is negligible. • Aeration is sufficient that anaerobic metabolic activity is negligible. 3 The Existing ASM1 Model The ASM1 model incorporates all the processes identified by microbiologists as being relevant to modeling the ATAD process. A transfer diagram of the ASM1 model is shown in Fig. 4 with the solid curve cycle showing the processes included in the original ASM1 model (prior to the ATAD extension). Here XBH denotes the concentration of active mesophilic biomass, SO is the concentration of oxygen and XP is the concentration of inert organic matter. Organic substrate is separated into readily biodegradable substrate SS and slowly degradable substrate XS . Untreated sludge in the batch inflow contains a certain concentration of both these substrates. 115 The slow substrate XS consists of organic macromolecules which can be hydrolyzed to produce the fast substrate SS . Microbial decay adds to the pool of XS : cell lysis on death releases the cell’s constituents into the extracellular space, a fraction fP of which is inert matter and (1 −fP ) is hydrolyzable macromolecular substrate. Fast substrate SS and oxygen SO are consumed in growth of the bacteria. All the metabolic process rates are assumed to depend nonlinearly on substrate and oxygen concentrations through dual Michaelis–Menten kinetics terms. The ATAD-specific modifications to the ASM1 model are (broken curve cycle in Fig. 4) via the introduction of a new component: thermophilic biomass, in active XBHT and inactive XSP forms. While mesophiles are relevant to batch reactions starting from cold, typical ATAD plants are operated with batches following in rapid succession precisely in order to maintain the temperature of the reactor. Under these conditions, the mesophiles respire but the temperatures are beyond their optimum level and so they decay very fast and thus do not make noticeable contribution to the process, see Fig. 2. XSP Activation XBHT Thermophilic decay Thermophilic growth Thermophilic respiration S XBH O Decay Growth XI Respiration S S Thermophilic hydrolysis XS Hydrolysis Figure 4: Transfer diagram of the processes in the ASM1 model (solid lines) and the proposed extended model (broken lines). In Section 3.1 we present the mathematical representation of the extended model. (adapted from [7]) 3.1 The ASM1 Model at Thermophilic Temperatures The ASM1 model at thermophilic temperatures tracks the evolution in time of eight quantities in the reactor, these being: SS (t) the readily biodegradable substrate, XS (t) the slowly biodegradable substrate, XSP (t) the inactivate thermophilic biomass, XBH (t) the active mesophilic biomass, XBHT (t) the active thermophilic biomass, XP (t) inert organic matter from decay, SO (t) the dissolved oxygen, and T (t) the temperature. 116 For simplicity, we disregard the temperature dependence here, and so do not display an equation for the temperature. We take fixed values for the model parameters and choose these values for reactor operation at 50o C. We also neglect the effect of thermophilic activation, on the basis that it occurs rapidly at the start of the batch reaction, and so do not consider inactive biomass XSP (t) in the model. Under these assumptions, the governing equations from [6] are SS 1 SS 1 dSS SO SO XBH − µHT XBHT = − µH dt YH KS + SS KO + SO YHT KST + SS KOT + SO XS SO XS SO +kH XBH + kHT XBHT , KX XBH + XS KO + SO KXT XBHT + XS KOT + SO XS SO dXS = bH (1 − fP )XBH + bHT(1 − fPT )XBHT − kH XBH dt KX XBH + XS KO + SO XS SO −kHT XBHT , KXT XBHT + XS KOT + SO SS SO dXBH XBH − bH XBH , (3.1) = µH dt KS + SS KO + SO dXBHT SS SO XBHT − bHT XBHT , = µHT dt KST + SS KOT + SO (1 − YH ) SS (1 − YHT ) SS SO SO dSO =− µH XBH − µHT XBHT dt YH KS + SS KO + SO YHT KST + SS KOT + SO +kLa S¯O − SO , dXP XS XS SO SO = fP kH XBH + fPT kHT XBHT . dt KX XBH + XS KO + SO KXT XBHT + XS KOT + SO The parameters appearing in the model are explained in Table 5. We see that the first five of these equations can be solved independently of the sixth, and so we do not display the equation for the inert organic matter XP again. We model the operation of a single tank system for one batch, and so solve (3.1)1 – (3.1)5 subject to initial conditions SS = SoS , XS = XoS , XBH = XoBH , XBHT = XoBHT , SO = SoO at t = 0. (3.2) For the illustrative numerical results used in the current analysis, we take SoS = 5 gl−1 , XoS = 15 gl−1 , XoBH = 1 gl−1 , XoBHT = 1 gl−1 , SoO = 10−3 gl−1 . We scale the system of ordinary differential equations by choosing t ∼ tbatch , SS ∼ SoS , XS ∼ XoS , XBH ∼ XoBH , XBHT ∼ XoBHT , SO ∼ SoO , where tbatch = 1 d (one day). With this choice of scaling, the dimensionless form for (3.1)1 – 117 (3.1)5 , (3.2) is given by, dSS SS SS SO SO = −δ1 XBH − δ4 XBHT dt δ2 + S S δ3 + S O δ5 + S S δ6 + S O XS SO SO XS XBH + δ9 XBHT , + δ7 δ8 XBH + XS δ3 + SO δ10 XBHT + XS δ6 + SO dXS XS XS SO SO XBH − δ14 XBHT , = δ11 XBH + δ12 XBHT − δ13 dt δ8 XBH + XS δ3 + SO δ10 XBHT + XS δ6 + SO SS SO dXBH = δ15 XBH − δ16 XBH , (3.3) dt δ2 + S S δ3 + S O dXBHT SS SO = δ17 XBHT − δ18 XBHT , dt δ5 + S S δ6 + S O SS SS SO SO dSO = −δ19 XBH − δ20 XBHT + δ21 (δ22 − SO ) , dt δ2 + S S δ3 + S O δ5 + S S δ6 + S O subject to initial conditions SS = 1, XS = 1, XBH = 1, XBHT = 1, SO = 1 at t = 0, where the definition of, and values for, the non-dimensional parameters δi (1 ≤ i ≤ 22) are given in Table 5. In Figs. 5 and 6, we show the solution to the system of equations (3.3) for the parameter values listed in Table 5. It is noteworthy from the parameter values chosen, the readily degradable material is depleted over a period of approximately one half of a day, but that the slowly degradable material persists for approximately five days. We should note, however, that that some of the parameter values used are uncertain, and that they can depend on temperature and time, as well as other factors, such as the character of the inflow sludge and the reactor system. Since we have assumed the parameter values to be fixed, the limitations of such an approach are clear. However, our goal in this section is to gain some insight into aspects of the ASM1 model, rather than attempting to model a particular reactor system in detail. 3.2 A Subcase: Asymptotic Analysis of the ASM1 Model As mentioned above, under normal ATAD operating conditions a relatively high temperature is maintained in the reactor. Commonly, the temperature after adding a new inflow batch and mixing remains higher than the mesophilic temperature range, and hence their metabolic activity is negligible. Therefore we neglect mesophiles in the model and set XBH = 0; equations 118 1.4 ←I 1.2 Concentrations 1 III → II 0.8 IV 0.6 0.4 S (t) S X (t) S XBH(t) 0.2 X (t) BHT S (t) O 0 0 0.1 0.2 0.3 0.4 0.5 Time 0.6 0.7 0.8 0.9 1 Figure 5: Plot of the solution to system of equations (3.3) over one day and for the parameter values displayed in Table 5. 4.5 4 Concentrations 3.5 ← I ← III 3 2.5 IV V VI II 2 1.5 S (t) S X (t) 1 S XBH(t) X 0.5 (t) BHT S (t) O 0 0 1 2 3 4 5 6 7 Time Figure 6: The same solution as in Figure 5, plotted over one week. Over this time, the slowly biodegradable substrate is removed, and the dissolved oxygen achieves its saturation value. 119 Table 4: The set of parameters for the ASM1 model, physical parameters. Experiments have shown that these parameters can depend, among other things, on temperature and time (see Figs. 1 and 2). Symbol KS KO KST KOT S¯O µH µHT bH bHT kH kHT kLa YH YHT fP fPT KX KXT Description Value Physical parameters mesophilic half saturation constant for SS mesophilic half saturation constant for SO thermophilic half saturation constant for SS thermophilic half saturation constant for SO oxygen saturation concentration maximum growth rate of XBH maximum growth rate of XBHT decay rate of mesophiles decay rate of thermophiles maximum mesophilic hydrolysis rate maximum thermophilic hydrolysis rate oxygen mass transfer coefficient mesophilic yield thermophilic yield inert fraction of mesophilic biomass inert fraction of thermophilic biomass half saturation constant for mesophilic hydrolysis half saturation constant for thermophilic hydrolysis 120 0.02 2 × 10−4 0.03 2 × 10−4 5 × 10−3 16.8 24.3 43 5.2 10 9.2 1000 0.6 0.6 0.3 0.3 0.03 0.03 Units g/l g/l g/l g/l g/l 1/d 1/d 1/d 1/d 1/d 1/d 1/d – – – – – – Table 5: The set of parameters for the ASM1 model, dimensionless parameters. Experiments have shown that these parameters can depend, among other things, on temperature and time (see Figs. 1 and 2). Here we assume these parameters to be constant (i.e., temperature independent). Symbol δ1 δ2 δ3 δ4 δ5 δ6 δ7 δ8 δ9 δ10 δ11 δ12 δ13 δ14 δ15 δ16 δ17 δ18 δ19 δ20 δ21 δ22 Description Value Dimensionless parameters µH tbatch XoBH /(YHT SoS ) KS /SoS KO /SoO µHT tbatch XoBHT /(YHT SoS ) KST /SoS KOT /SoO kH tbatch X∗BH /S∗S KX XoBH /XoS (kHT tbatch ) (X∗BHT /SoS ) KXT XoBHT /XoS (1 − fP )bH tbatch XoBH /XoS (1 − fPT )bHT tbatch XoBHT /XoS kH tbatch XoBH /XoS kHT tbatch XoBHT /XoS µH tbatch bH tbatch µHT tbatch bHT tbatch (1 − YH )µH tbatch XoBH /(YH SoO ) (1 − YHT )µHT tbatch XoBHT /(YHT SoO ) kLa tbatch S¯O /SoO 121 5.60 4 × 10−3 0.2 8.10 6 × 10−3 0.2 2.0 2 × 10−3 1.84 2 × 10−3 2.01 0.25 0.67 0.61 16.8 43.0 24.3 5.2 11.2 × 103 16.2 × 103 103 5.0 Units – – – – – – – – – – – – – – – – – – – – – (3.3) then reduce to dSS SS XS SO SO = −δ4 XBHT + δ9 XBHT , dt δ5 + S S δ6 + S O δ10 XBHT + XS δ6 + SO XS SO dXS = δ12 XBHT − δ14 XBHT , dt δ10 XBHT + XS δ6 + SO dXBHT SS SO XBHT − δ18 XBHT , = δ17 dt δ5 + S S δ6 + S O SS SO dSO = −δ20 XBHT + δ21 (δ22 − SO ) , dt δ5 + S S δ6 + S O SS = 1, XS = 1, XBHT = 1, SO = 1 at t = 0. (3.4) In Figs. 7 and 8, we display numerical solutions of system of equations (3.4) for the parameter values given in Table 5. Comparing these with the solutions given in Figs. 5 and 6, one notes that there is good qualitative agreement and reasonable quantitative agreement, as would be expected. Motivated by the relevant parameter values listed in Table 4, we denote δ5 = ε, δ10 = θ1 ε, δ20 = θ2 /ε, δ21 = θ3 /ε, where θ1 , θ2 , θ3 = O(1), and consider the limit ε → 0. For reference, it is worth re-displaying equations (3.4) now in terms of the parameter ε dSS SS SO XS SO = −δ4 XBHT + δ9 XBHT , dt ε + S S δ6 + S O θ1 εXBHT + XS δ6 + SO XS SO dXS = δ12 XBHT − δ14 XBHT , dt θ1 εXBHT + XS δ6 + SO SS SO dXBHT = δ17 XBHT − δ18 XBHT , dt ε + S S δ6 + S O dSO SS SO ε XBHT + θ3 (δ22 − SO ) , = −θ2 dt ε + S S δ6 + S O SS = 1, XS = 1, XBHT = 1, SO = 1 at t = 0. (3.5) The limit ε → 0 is singular, and the asymptotic structure is indicated in Figs. 7 and 8. We now briefly discuss some of the asymptotic regions (time-scales) arising. 3.3 Region I, t = O(ε) This very short initial time-scale, which is not even visible in Fig. 7, appears at t = O(ε) and marks a region of rapid change in the dissolved oxygen concentration. We rescale t = ε^t, and at leading order we have SS ∼ 1, XS ∼ 1, XBHT ∼ 1, 122 1.4 ←I 1.2 Concentrations 1 III → II 0.8 IV 0.6 0.4 S (t) S X (t) S 0.2 X (t) BHT S (t) O 0 0 0.1 0.2 0.3 0.4 0.5 Time 0.6 0.7 0.8 0.9 1 Figure 7: Plot of the solution to system of equations (3.4) over one day for the parameter values displayed in Table 5. The asymptotic regions for ε → 0 are indicated. 4.5 4 Concentrations 3.5 ← I ← III 3 2.5 IV V VI II 2 1.5 S (t) S 1 X (t) S X 0.5 (t) BHT S (t) O 0 0 1 2 3 4 5 6 7 Time Figure 8: The same solution as in Figure 7, but plotted over one week. The asymptotic regions for ε → 0 are indicated. 123 and pose SO ∼ SIO (^t), to obtain dSIO SIO + θ3 (δ22 − SIO ) , = −θ2 δ6 + SIO d^t where SIO = 1 at ^t = 0, and dSIO /d^t → 0, SIO → ρ1 as ^t → ∞, with ρ1 given by s 2 θ2 1 θ2 + 4δ22 δ6 . + ρ1 = δ22 − δ6 − δ22 − δ6 − 2 θ3 θ3 For the parameter values given in Table 5, ρ1 = 0.087, which is in good agreement with the numerical solution displayed in Fig. 7. We have to note that ρ1 > 1 is also possible, so that the dissolved oxygen is not necessarily depleted over this time-scale; the competing effects of aeration and oxygen consumption by the biomass enter at leading order here. 3.4 Region II, t = O(1) and t < tS In t = O(1), we pose SS ∼ SIIS (t), XS ∼ XIIS (t), XBHT ∼ XIIBHT (t), SO ∼ SIIO (t), for ε → 0, to obtain SIIO dSIIS XII , = − (δ4 − δ9 ) dt δ6 + SIIO BHT SIIO dXIIS XII , = δ12 XIIBHT − δ14 dt δ6 + SIIO BHT dXIIBHT SIIO XII − δ18 XIIBHT , = δ17 dt δ6 + SIIO BHT SIIO XII = θ3 (δ22 − SIIO ) . θ2 δ6 + SIIO BHT (3.6) We do not pursue to solve the system of equations (3.6) here, and confine ourselves instead to considering the system’s qualitative behavior. It is clear, that the growth of the biomass is not limited by the availability of SS , and that hydrolysis is not limited by the availability of XS . However, both biomass growth and hydrolysis do depend on the available dissolved oxygen here. From (3.6)1 , it is clear that dSIIS /dt ≤ 0 for δ4 > δ9 , or, in dimensional terms, µHT/YHT > kHT . This corresponds to the case of SS being consumed faster by the biomass than it is generated by hydrolysis. We restrict our discussion here to this case. For this case, there is tS = O(1) such that SIIS (tS ) = 0, and this indicates a region that we will discuss next. 124 3.5 Region III, t∗ = O(1) This time-scale corresponds to t∗ = O(1) where t = tS + εt∗ , and gives the location of a region where the availability of SS becomes a limiting factor in the further growth of the biomass. Also, the oxygen concentration undergoes a rapid change over this time-scale. In our numerical solutions (see Fig. 8), t = tS gives the time when the biomass XBHT achieves its maximum. For t∗ = O(1), we have XS ∼ XIIS (tS ), XBHT ∼ XIIBHT (tS ), where both these quantities are determined as part of the solution to the leading order problem in Region II. In t∗ = O(1), we pose SS ∼ εSIII (t∗ ), SO ∼ SIII (t∗ ), S O to obtain SIII SIII SIII dSIII O S O S II X (t ) + δ XIIBHT (tS ), = −δ S 9 4 BHT III III dt∗ 1 + SIII δ + S δ + S 6 6 S O O III III dSIII S S O S O XIIBHT (tS ) + θ3 (δ22 − SIII ). = −θ2 O III dt∗ 1 + SIII δ + S 6 S O (3.7) ∗ III ∗ III III As t∗ → ∞, dSIII S /dt → 0, dSO /dt → 0, SS → ρ2 , SO → ρ3 , where ρ2 = δ9 , δ4 − δ9 (recall that we are considering δ4 > δ9 here), and s 2 θ2 δ9 II 1 θ2 δ9 II XBHT (tS ) + X (tS ) + 4δ22 δ6 . δ22 − δ6 − ρ3 = δ22 − δ6 − 2 θ3 δ4 θ3 δ4 BHT These predictions are in good agreement with the numerical solution displayed in Fig. 7. 3.6 Region IV, t = O(1), tS < t < tX This is the time-scale over which most of the XS is degraded. In this region, we have SS ∼ ε δ9 , δ4 − δ9 and we have XS , XBHT , SO = O(1). For brevity, we do not display or discuss the leading order equations here, and simply note that there is a tX = O(1) such that XS (tX ) = 0, which determines the time when the availability of XS first becomes a limiting factor for hydrolysis. We omit discussion of Regions V and VI, other than to give their locations and the scalings. Region V is located at t† = O(1), where t = tX + εt† , and in which SS , XS = O(ε), XBHT , SO = O(1). Region VI is at t = O(1), t > tX and here we also have SS , XS = O(ε), XBHT , SO = O(1). 125 4 The ESGI70 Model The full ASM1 model has large numbers of variables and interactions, and consequently many parameters. The motivation for including these in the model is predominantly microbiological, and seems to arise from the urge to include all possible components about which there is some knowledge. However, many of the parameter values are poorly known: they are dependent on the chemical and microbiological make-up of the particular sludge, and therefore typically used as fitting parameters. With such a highly parameterized model, sets of parameter values can be found to fit most data but it is unclear if the model remains reliable in such situations. Applying a parameterized model under different (but physically reasonable) operating conditions may give spurious modeling artifacts. And this may also cause numerical difficulties in attempting any optimization. These considerations motivate us to seek a minimal model which should be capable of describing the essential mechanisms of the ATAD process, but ignores issues which — though microbiologically ‘known’ — are not material to the process within the likely range of operating conditions. It is hoped that such a model might describe the process sufficiently accurately while providing a more suitable basis for optimization. As indicated in our analysis of the ASM1 model, we opt to ignore the presence of mesophiles since the mixed sludge is hot enough to render their metabolic activity negligible. A biomass concentration variable XB corresponds to the concentration of thermophiles in the sludge. Any mesophiles in the inflow sludge can effectively be considered as already lyzed into additional contributions to substrate and inerts. We also choose to ignore the distinction between the two pools of substrate SS and XS , and consider a single substrate pool X. This is motivated by the urge to simplify to a minimal model, rather than through an assumption that one or other pool may be neglected because it is never rate-limiting under normal operating conditions. The effect of this simplification is greatest after substrate is depleted and becomes limiting (e.g. from Region III of Fig. 7 in the ASM1 model): in our model substrate availability for metabolism is directly dependent on biomass decay, while in the ASM1 model it is buffered by the slow substrate reservoir. The size of this effect remains to be demonstrated as well as its importance in relation to achieving the treatment goals. For a single batch and a single reactor, we introduce the following governing ordinary differential equations, 126 dXB dt dX dt dS dt dT dt = G − b(T )XB , = −αG + b(T )(1 − f)XB , (4.1) = A(S̄ − S) − βG, = λG − h(T ), where biomass concentration XB (t) has units [g/l], substrate concentration X(t) has units [g/l], S(t) is the oxygen concentration [mg/l], and T (t) is temperature [o C]. Biomass is assumed here to have a growth rate G given by the dual Michaelis–Menten kinetics expression G = µ(T ) S X XB . Ks + S Kx + X (4.2) The biomass is also subject to linear decay with rate constant b which may be temperaturedependent. Growth of biomass is proportional to the substrate and oxygen consumptions with the consumption efficiency 1/α and 1/β respectively. Substrate X is also produced as a result of decay of biomass, of which a fraction (1 − f) is recycled as substrate. Dissolved oxygen is replenished through aeration that is subject to the saturation concentration S̄. The sludge temperature increases at a rate proportional to the biomass growth rate and is decreased by heat loss from the reactor. This heat loss occurs through the reactor walls, via the exhaust gas stream from aeration and in the sludge outflow. The heat loss is dominated by losses through latent heat of evaporation in the exhaust gas stream. And while the other heat loss pathways are proportional to T −Tamb , this heat loss pathway is less temperature-dependent so we model it with a simple constant loss rate. The parameters for this model are summarized in Table 6. Most of the parameters relate to the thermophile metabolism and sludge properties alone, but those dependent on reactor design are aeration A, heating rate coefficient λ, and heat loss rate h. In the lack of specific plant information, it was deemed appropriate at this stage to focus on operating conditions rather than plant design in any optimization approaches. This model describes the batch reaction period from just after a batch intake is mixed into the reactor until batch outflow. The required initial conditions are those of the sludge immediately following mixing of the inflow with the sludge remaining in the reactor from the previous batch, since the reactor volume is only partially replaced at each batch. The pre-existing sludge conditions depend on the history of previous batches in the reactor. With repeated batches of the same type, we expect the batch cycle to settle to state in which successive batches are identical and the pre-mixing reactor sludge conditions are effectively independent of the initial conditions of the first batch reactor sludge. We therefore provide initial conditions for the inflow sludge added at each batch and also reactor sludge initial conditions for the first batch. The model can be run to simulate successive batches until these settle to a periodic batch cycle, as 127 Table 6: Set of parameters for the ATAD model. Some of the units have been converted from their COD (chemical oxygen demand) equivalent. Symbol Description Value Units Reference Kx Ks µ b α β λ h S̄ Physical parameters half-saturation constant for substrate 0.1 half-saturation constant for oxygen 0.17 growth rate 10 decay rate 5 substrate transformation 1.5 oxygen transformation 0.5 heat production 6.5 heat loss rate 7 oxygen saturation constant 10 g/l mg/l 1/d 1/d – – o Cl/g o C/d mg/l [1] " " " " " " – – A Xfeed B Xfeed Sfeed Vin /V T feed Baseline case parameters aeration rate 200 biomass input/feeding rate 1 sludge input/feeding rate 15 oxygen input/feeding rate 1 ratio of volume input 0.1 temperature of the feed 13 1/d g/l g/l mg/l – o C [1] " " " " – 128 indicated in Fig. 9. The linking of successive batches and initial mixing of inflow sludge for each batch can be described by adding additional terms into the system: dXB dt dX dt dS dt dT dt = G − bXB + Qδ (Xfeed − XB ), B = −αG + b(1 − f)XB + Qδ (Xfeed − X), (4.3) = A(S̄ − S) − βG + Qδ (Sfeed − S), = λG − h + Qδ (T feed − T ), where we define the Qδ as Qδ = Vin δ(t − tin ), V and δ(t − tin ) is a Dirac delta function. The treatment goals for stabilization and pasteurization can be calculated at any solution time after solving the system up to that time. The stabilization target (2.1) is defined in terms of the volatile solids Xvs , which can be calculated by integrating dXvs = G(1 − α) + Qδ (Xfeed − Xvs ). vs dt (4.4) The pasteurization target can be calculated by performing the integration (2.2) while T > 50o C and setting L = 0 at other times. Our first approach is to neglect the temperature dependence of the parameters so that the temperature equation decouples from the system. As typical ATAD operating temperatures are higher than 45o C, we note from Fig. 1 that this will tend to give overestimates of biomass growth and temperature. 4.1 Notes on Optimization The main operating optimization controls are the aeration rate A, the exchanged volume fraction Q, and the batch reaction time tbatch . The inflow sludge properties are largely fixed by the nature of the waste rather than being optimization controls. The exception to this is the possibility of enriching the inflow with additional substrate. Operational restrictions mean that the aeration rate is generally fixed as constant throughout a batch and the batch time is fixed to give a daily cycle. If we adhere to these restrictions, the controls are reduced to the constant aeration rate A and the exchanged volume fraction Q, and optimization will correspond to the treatment goals being achieved at the end of the daily cycle. In this situation, the energy consumption per mass of treated sludge is proportional to A/Q. 129 0.7 15 Volatile solids (g/l) Biomass (g/l) 0.6 0.5 0.4 14.5 14 0.3 0.2 25 26 27 28 29 30 31 32 13.5 25 33 1.6 4 1.4 3.5 Growth rate (g/l/day) Substrate (g/l) 1.2 1 0.8 0.6 0.4 26 27 28 29 30 31 32 Lethality, L(T,t) Oxygen (mg/l) 30 31 32 33 26 27 28 29 30 31 32 33 26 27 28 29 30 31 32 33 26 27 28 29 30 time (days) 31 32 33 3 2 1.5 2.5 6 4 2 2 1.5 1 0.5 26 27 28 29 30 31 32 0 25 33 0.46 59 0.45 out feed /X vs vs 60 58 Stabilisation, 1−X Temperature (degC) 29 3 8 57 56 55 54 53 25 28 2.5 0.5 25 33 10 0 25 27 1 0.2 0 25 26 26 27 28 29 30 time (days) 31 32 0.44 0.43 0.42 0.41 0.4 25 33 Figure 9: Multiple batch steady-state numerical simulations of the ESGI70-ATAD model. 130 The effect of aeration rate A is apparently straightforward: additional oxygenation promotes faster biomass growth and therefore quicker stabilization and temperature increase, giving quicker pasteurization. For a fixed batch time and an exchanged volume fraction, we would reduce the aeration rate until the more stringent treatment goal is achieved at the end of the batch. While energy consumption increases linearly with aeration rate, the growth rate’s increase with oxygenation is less than linear, approaching linearity near zero oxygen concentration. This would suggest that it may be desirable to maintain the lowest aeration rate which provides sufficient mixing and limits anaerobic metabolism, though other considerations such as plant throughput may become overriding. The effect of the exchanged volume fraction Q is less clear. The inflow sludge brings new substrate into the reactor, which leads to increased growth rate, digestion, and heating. However, it also cools the sludge mixture which slows the pasteurization process. These two effects have competing impacts on the time to achieve the pasteurization goal. The stabilization treatment goal is expressed in terms of the reduction of volatile solids between inflow and outflow (which is extracted directly from the reactor sludge mixture). Similarly, while inflow brings fresh substrate to fuel the metabolism and promote growth, increasing the exchanged fraction also decreases the apparent stabilization just after inflow mixing which can be expressed as ∗ (1 − Q)(1 − X∗vs /Xfeed vs ), where Xvs relates to the pre-mixing reactor sludge remaining from the previous batch. A typical ATAD batch reaction follows a timecourse similar to that shown in Fig. 5. The influx of new substrate on initial mixing leads to a rapid increase in metabolic activity and concomitant depletion of oxygen. Substrate is initially in abundance and is digested through the batch reaction, while oxygen is constantly supplied through aeration. At the start of a batch reaction oxygen rapidly becomes limiting while the substrate remains abundant. In this phase the biomass growth rate is mostly determined by the aeration rate and is largely independent of the substrate. As substrate concentration starts to become depleted the control of the process transfers to the diminishing substrate level. The effect of inflow substrate enrichment is to delay the depletion of substrate, extending the duration of the initial oxygen-depleted phase. It has minimal effect on the growth rate or temperature (and hence treatment goals) until the time at which substrate would otherwise have started to become limiting. As substrate enrichment directly increases inflow volatile solids, it means that a greater quantity of volatile solids must be digested to achieve stabilization. Once substrate is sufficiently depleted that it is limiting, the oxygen level under constant aeration will generally increase. At this stage, additional substrate comes only from biomass decay (in the ASM1 scheme, via the remaining slow substrate pool), so the growth rate must decrease from this point until decay dominates and then finally the temperature will start to drop. It would therefore appear to continue the reaction (and aeration) significantly into the stage of a decreasing reaction rate except to achieve an unsatisfied treatment goal. As a broad guideline for optimization, it would therefore appear ideal to adjust the aeration rate and exchanged volume fraction so that the substrate should become depleted close to the 131 end of the batch time, and this should coincide with the two treatment goals being achieved. It is uncertain whether this is achievable with just these controls in a fixed plant design. Increasing aeration accelerates both treatment goals, but directly increases energy consumption. Substrate enrichment enhances pasteurization more than stabilization. Increasing exchanged volume fraction probably delays achievement of both treatment goals, though pasteurization may be greater affected if the post-mixing temperature drops significantly; it also decreases energy consumption by increasing the sludge throughput. 5 Numerical Simulations The derived model consists of a system of first order ordinary differential equations which are integratated in time using Matlab’s stiff solver, ODE15s. We validate and test the practical applicability of the model by comparing the numerical simulations with experimental data for a specific choice of parameters given in Table 6. A comparison between the experimental data and numerical simulations is given in Figs. 10 and 11 for dynamical operational conditions, i.e., the input volume is not the same for each batch, cf. [2]. Note that here we ignore any temperature dependence of the kinetic parameters. The experiments were carried out under different hydraulic loading which is consistent with the Killarney site, (see for example [8]). Despite the limitations in the available data, the comparison shows a very good fit. 70 68 Experimental data Numerical simulation Temperature, T [oC] 66 64 62 60 58 56 54 52 50 0 2 4 6 8 10 12 Time, [days] 14 16 18 20 22 Figure 10: A comparison between experimental data and numerical simulations of the ESGI70ATAD model for multiple batches. In the next sections we run the simulations until steady-state conditions are attained. This is done by checking that the solutions between two consecutive batches are within some threshold value. This occurs within 28 days using the baseline case parameters. The numerical simulations for a base case run (and multiple batches) are shown in Fig. 9 where all the parameters are fixed as given in Table 6. Results presented here are for a single reactor and we assume that stabilization is the main goal for bio-treatment. However, for completeness we will also show 132 20 19 Experimental data Numerical simulation Volitile solids, Xvs [g/l] 18 17 16 15 14 13 12 11 10 0 2 4 6 8 10 12 Time, [days] 14 16 18 20 22 Figure 11: A comparison between experimental data and numerical simulations of the ESGI70ATAD model for multiple batches. simulations for the case when pasteurization is the main goal. For the purpose of comparison; all simulations are shown for day 28, whether there is stabilization or not, and baseline case simulations are highlighted with a ∗ in the figures. 5.1 Sensitivity to the Feed Temperature, T feed In this section, we study the sensitivity of the model to variations of feed temperature to investigate the impact of this on the bio-treatment goals, i.e., pasteurization and stabilization. The fresh sludge can be preheated before loading into the reactor using recovered heat. We observe from Fig. 12 that, for the given parameters, stabilization is not sensitive to the feed temperature. However, pasteurization is very sensitive to feed temperature which is consistent with plants with more than one reactor. 5.2 Sensitivity to Sludge Feed, Xfeed In this section we consider variations of the sludge feeding pattern to investigate the impact of this on the bio-treatment goals. Fig. 13 suggests the organic feed must be within certain concentrations to attain the required levels of pasteurization and stabilization. 5.3 Sensitivity to Aeration Rate, A Since there is some uncertainty in the choice of the aeration rate, we run numerical simulations for different values of A. In fact it is an open question whether continuous aeration is necessary 133 ′ 10 τ feed =5 C τ feed = 13* C τ feed = 30 C τ feed 5 out feed /X vs vs 0.46 o o Stabilisation, 1−X Lethality, L(T,t) 15 o o = 50 C 0.45 0.44 0.43 0.42 τ feed =5 C τ feed o = 13*oC τ feed = 30 C τ feed = 50 C o o 0.41 0 0 0.2 0.4 0.6 Time [days] 0.8 0.4 0 1 0.2 0.4 0.6 Time [days] 0.8 1 Figure 12: Lethality and stabilization for different feed temperatures. 5 Xfeed = 5g/l = 10g/l feed = 15*g/l feed = 20g/l X 4 X 3 Stabilisation, 1−X Lethality, L(T,t) X 0.5 out feed /X vs vs feed 2 1 0 0 0.2 0.4 0.6 Time [days] 0.8 0.45 0.4 0.35 0.3 0.25 0 1 feed X = 5g/l feed X = 10g/l feed X = 15*g/l feed X = 20g/l 0.2 0.4 0.6 Time [days] 0.8 1 Figure 13: Lethality and stabilization for different sludge feed concentration. (such as in the Killarney plant). Numerical results (for a single reactor) suggest that the required treatment goals cannot be achieved for aeration rate lower that 100 1/d, see Fig. 14. 5.4 Sensitivity to Oxygen Feed, Sfeed In this section we consider varying the oxygen feed concentration. We run simulations for values in the range Sfeed = [0.01 − 10] mg/l and results (not shown here) suggest stabilization and pasteurization are not sensitive to the oxygen feed concentration. This suggests that the plant can operate efficiently at levels lower than Sfeed = 1.0 mg/l while maintaining the required treatment goals. 134 ′ 3.5 0.45 out feed /X vs vs 2.5 A = 50 1/d A = 100 1/d A = 200* 1/d A = 300 1/d 2 Stabilisation, 1−X Lethality, L(T,t) 3 0.5 1.5 1 0.5 0 0 0.4 0.35 A = 50 1/d A = 100 1/d A = 200* 1/d A = 300 1/d 0.3 0.25 0.2 0.2 0.4 0.6 Time [days] 0.8 1 0 0.2 0.4 0.6 Time [days] 0.8 1 Figure 14: Lethality and stabilization for different aeration rates. 6 Optimal Strategy for Pasteurization In this section we provide an alternative discussion of the optimization problem. We mentioned that the treatment has two different Objectives, namely: 1. Killing the pathogen by pasteurization, and 2. Stabilization; that is reducing the organic matter contents to an acceptable level (66–80% of the initial). However, one can expect that when one of these Objectives is achieved, another is already accomplished in the process as well. For example, when the treated sludge is pasteurized and the pathogen are killed or reduced to safe level, this portion of sludge is also already stabilized (that is the organic content is reduced to on acceptable level). This observation allows us to substitute one difficult problem that has two Objectives by two simple separable problems that have a single Objective each. In this Section we consider an optimal control policy that is aimed to pasteurization, assuming that the stabilization will be reached in the process of pasteurizing. Pasteurization implies exposing the sludge to high temperature for a prolonged time. To achieve this objective in an optimal way, we may consider a simple practical analogue. The optimal policy for pasteurization will be 1. Firstly, heat the contents to as high as possible temperature, and do it as quickly as possible. 2. After heating, maintain this temperature for the necessary time. The temperature of the sludge and the heat influx for this process are shown in Fig. 15. 135 Stage 2 Heat Influx Stage 1 0 t1 Time t2 Figure 15: Sludge temperature (given by the curve) and the proposed heat influx (given by the rectangles). This involved four questions, namely: Question 1 : What is the temperature of the sludge should be? Answer : The temperature should be as high as it can be maintained for prolonged time. It is obvious that the highest temperature that can be maintained is the temperature when the rate of metabolism balances the bacteria rate of decay (Fig. 16) Question 2 : How long Stage 1 (heating) should be? Answer : As long as needed to reach the pasteurization temperature at the highest possible heating rate Question 3 : How much heat is needed to maintain the temperature through Stage 2? Answer : By the model, dT = Heat Influx − Heat Losses. dt At equilibrium dT/dt = 0, and hence Heat Influx = Heat Losses. The heat losses are know for a given temperature, and hence we have the heat influx. For instance, if the heat losses are linearly proportional to the temperature, that is Heat Losses = hT , and the heat influx is proportional to the rate of metabolism, Heat Influx = µ S X XB Ks + S Kx + X 136 e r at m lis bo et a M y ca te Ra e fd o Temperature Figure 16: The equilibrium of the metabolism rate and the rate of decay. then the task is to maintain the oxygen concentration S at such a level that S ≈ 1. Ks + S Question 4 : How long Stage 2 should be? Answer : As long as needed to reach the required level of pasteurization (this time is well known and is given by a know formula). 6.1 Recommendations 1. Thermal-isolation and utilizing the waste heat. Thermal-isolation reduces heat losses and hence the heat influx can be also reduced. 2. Splitting the process by stages. We suggest that the process should be divided into two stages, namely the heating and the maintaining the temperature. It is advisable to conduct these two processes in different reactors. 3. Preheating the water (utilizing waste heat). Preheating the water would shortening the first stage, the heating and the thermal shock will be smaller. 4. Start every round of the process at the highest possible initial concentration of bacteria. This again should shortening the first stage (the heating) and hence reduce the energy losses.. 137 7 Concluding remarks During the 70th European Study Group, the participants mathematically addressed a problem of optimizing the process of waste water bio-remediation. It was noted by the participants that the models that currently exist are greatly oversized, while the reliable data are rather scarce. Furthermore, it was also noted that the existing models are very sensitive to the parameter values. It was suggested, therefore, that under the circumstance a simpler model is needed. Such a model, that includes only the essence of the process, has been developed during the Study Group. This model includes four crucial variables, namely the substrate concentration (the nutrient), the biomass, the oxygen concentration and the temperature of the sludge. The model describes the process of bio-treatment for the case of a single reactor. It has been shown, that for correctly re-calibrated parameters the ESGI model and the original large-scale models give effectively the same outcome. Development and verification of this model was the major outcome of this study. An extension of this model to the case of a few sequential reactors is reasonable straightforward and possesses no difficulty. In the analysis that are presented in this report, we applied this model to consider two different regimes, namely, when either oxygen, or the substrate (the nutrient) are limiting factors. It appears that one possible operating strategy can be applying a low aeration rate to preserve the oxygen concentration a limiting factor for longer, with the aim that the two objective of the bio-treatment were met at the same moment at the end of the cycle. We would also like to remark that the participant realize that the problem of optimizing of the bio-treatment, as it was presented to the Study Group, is essentially a problem of optimal control. However, taking into consideration that the problem is fundamentally non-linear, addressing this optimal control problem analytically is a highly non-trivial task. This problem in such circumstances may be explored numerically, applying the technique that is known as dynamic control. However, a reliable and reasonably simple model is needed for this at the first instance, and we believe that the model that was developed can serve as the basis for such study. Study of the model also gives an important insight into the nature of the process, and enables us to gain a number of practically relevant recommendations. Thus, it appears that pre-heating the input sludge will be beneficial, provided it can be done in an energy efficient way (recycling the wasted heat that is generated in the process). One avenue for further research is an extension of the ESGI70 model to reactors in series. This study would be of assistance for a more complex physical plant design. 138 Acknowledgement We would like to thank Jaime Rojas and Prof. Toshko Zhelev for providing us with the valuable experimental data and background information in formulating a mathematical model for the autothermal thermophylic aerobic digestion process, especially being available throughout the week. We acknowledge the support of the Mathematics Applications Consortium for Science and Industry (www.macsi.ul.ie) funded by the Science Foundation Ireland mathematics initiative grant 06/MI/005. 139 Bibliography [1] J. M. Gómez. Digestión Aerobic Termófila Autosostenida (ATAD) de fangos. Estudio experimental a escala real y modelización matemática del reactor. PhD thesis, Universidad de Navarra, San Sebastińa, Spain, Octubre 2007. [2] J. M. Gómez, M. de Gracia, E. Ayesa, and J. L. Garcia-Heras. Mathematical modelling of Autothermal Thermophilic Aerobic Digesters. Water Res., 41:1859–1872, 2007. [3] W. Gujer, M. Henze, T. Mino, T. Matsuo, M. C. Wentzel, and G. R. Marais. The Activated Sludge Model No. 2: biological phosphorus removal. Water Sci. Technol., 31(2):1–11, 1995. [4] W. Gujer, M. Henze, T. Mino, and M. van Loosdrecht. Activated Sludge Model No. 3. Water Sci. Technol., 39(1):183–193, 1999. [5] M. Henze, C. P. L. Grady Jr., W. Gujer, G. R. Marais, and T. Matsuo. Activated Sludge Model No. 1. IAWPRC, London, 1987. [6] R. Kovács and P. Miháltz. Untersuchungen der kinetik der aerob-thermophilen klaerschlammstabilisierung - einfluss der temperatur, 17. Fruehlingsakademie und Expertentagung. Balatonfuered, Germany. May, 2005. [7] R. Kovács, P. Miháltz, and Z. Csikor. Kinetics of Autothermal Thermophilic Aerobic Digestion – Application and extension of Activated Sludge Model No. 1 at thermophilic temperatures. Water Sci. Technol., 56(9):137–145, 2007. [8] N. M. Layden. An evaluation of Autothermal Thermophilic Aerobic Digestion (ATAD) of municipal sludge in Ireland. J. Environ. Eng. Sci., 6:19–29, 2007. 140 The initiation of Guinness Report Contributors: Oliver A. Power1,2, William Lee1, Andrew C. Fowler1 , Paul J. Dellar3 , Len W. Schwartz4 , Sergei Lukaschuk5 , Gordon Lessells6 , Alan F. Hegarty6 , Martina O’Sullivan1 and Yupeng Liu7 Industry Representatives: Conor Browne8 and Richard Swallow9 1 MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland Report coordinator, oliver.power@ul.ie 3 OCIAM, Mathematical Institute, University of Oxford, United Kingdom 4 Department of Mechanical Engineering, University of Delaware, United States of America 5 Department of Engineering, University of Hull, United Kingdom 6 Department of Mathematics and Statistics, University of Limerick, Ireland 7 School of Mathematical Sciences, Dublin Institute of Technology, Dublin, Ireland 8 Diageo Ireland, St. James’s Gate, Dublin 8, Ireland 9 VGraph UK Ltd, 28 Hudson Way, Radcliffe On Trent, Nottingham, NG12 2PP, United Kingdom 2 Abstract Guinness Draught is a distinctive beer, having a black body and a thick white head of foam. It is unusual in that it is always dispensed with a gas mixture of about 21 mole % carbon dioxide and 79 mole % nitrogen. Consequently, it does not foam spontaneously but must instead be ‘initiated’. The initiation methods employed by Diageo in canned and bottled Guinness Draught add a significant cost to the final product. Hence, Diageo asked the Study Group to develop a mathematical model for the initiation of Guinness, focusing on any of the current initiation methods. We decided to focus on the ‘surger’ unit, which vibrates ultrasonically to initiate Guinness. It does so mainly by the process of rectified diffusion, although concepts from passive and convective diffusion are also important. In this paper, we briefly review the initiation methods currently employed, the theory of bubble nucleation and growth, and passive, rectified and convective diffusion. A qualitative explanation of how Guinness is initiated is given. Finally, a mathematical model for the initiation of Guinness by the surger unit is developed, the solutions of which are quantitatively reasonable. 142 1 The problem with draught Guinness Guinness is a distinctive beer that originated in the brewery of Arthur Guinness (1725–1803) at St. James’s Gate, Dublin, Ireland [1]. It is based on the ‘porter’ that first appeared in London in the early 18th century. Porter incorporated roasted barley for the first time and it is this ingredient which endows Guinness with its characteristic burnt or roasted flavour. The perfect pint of draught Guinness is black, or officially a very dark shade of ruby, with a thick ‘creamy’ head of foam. The head is an integral part of the experience of drinking Guinness. It is created as a result of gas bubbling out of the beer when it is being served (the famous ‘surge and settle’). Guinness Draught is unusual in that it is packaged and dispensed using a gas mixture of carbon dioxide (CO2 ) and nitrogen (N2 ), in contrast to Guinness Extra Stout, and indeed most beers, which use only carbon dioxide. There seems to have been two motivating factors for the use of nitrogen. First, the introduction of metal kegs allowed beer to be stored and served at high pressures (2 − 4 times atmospheric pressure). For most beers this pressure is achieved using only CO2 and, consequently, the CO2 concentration can be relatively high in the beer (∼ 6 g l−1 ). An acidic ‘fresh’ taste is associated with the dissolved CO2 (as it reacts with water to form carbonic acid), which is not to everyone’s liking [2]. In contrast, the gas mixture used to dispense Guinness is mainly N2 . In particular, at 3.8 bar and 8 ◦ C the gas mixture is about 79 mole % N2 and 21 mole % CO2 . This yields the desired CO2 concentration of ∼ 2 g l−1 and the desired N2 concentration of ∼ 0.072 g l−1 (Section 8). For comparison, a beer dispensed at the same pressure using only CO2 would have a CO2 concentration of ∼ 9.5 g l−1 . Clearly, the Guinness would have the less acidic flavour, the nitrogen being tasteless. In fact, to avoid over-carbonation when high gas pressures are required, such as when pushing beer through long lines or up vertical lifts, a gas mixture of nitrogen and carbon dioxide is used to dispense all beers, the proper gas blend being chosen from the supplier’s blend selection chart. The second advantage to using nitrogen is related to its low solubility. N2 is 100 times less soluble than CO2 at 8 ◦ C. Consequently, the N2 concentration in Guinness is very low. The CO2 concentration is also low but this is because the gas mixture is mainly N2 . Once bubbles form in Guinness, they remain small due to the low dissolved gas concentrations and, consequently, low growth rates. The small bubbles give Guinness Draught a finer ‘creamier’ head of foam than that on a carbonated beer like Guinness Extra Stout. Furthermore, the head is longer-lasting since Ostwald ripening, the main process by which foam becomes coarser, is slowed down by a factor of 100 due to the lower solubility of N2 [3]. The relative concentrations of CO2 and N2 in air also helps: air is about 79 mole % N2 and only 0.03 mole % CO2 , meaning that the tendency for gas molecules to escape from the foam bubbles into the air is weaker for N2 than for CO2 . However, a major disadvantage of using nitrogen is that, unlike carbonated beers which foam spontaneously, the head on Guinness does not form spontaneously and must be ‘initiated’. A good example of this is the Guinness from a ‘surger-ready’ can. If such a can is carefully opened and poured into a clean glass, little or no head forms. However, if the glass is now placed on a ‘surger’ unit, which vibrates ultrasonically, a surge of bubbles appears in the body of the 143 Guinness and a thick creamy head forms as these bubbles collect on top of the Guinness. As well as being available in kegs and surger-ready (widget-free) cans, draught Guinness is also sold in widget cans and widget bottles, the ‘widget’ being a plastic capsule that initiates the Guinness when the seal on the container is broken. In all these forms, the container is pressurised to 3.8 bar at 8 ◦C with a 79 mole % nitrogen and a 21 mole % carbon dioxide gas mixture. In order to better understand the existing initiation methods and motivate new ones, Diageo asked us, at the 70th European Study Group with Industry, to develop a mathematical model describing bubble formation and growth in Guinness. First, we will review the existing methods of initiation. 2 Existing methods of initiation 1. Keg Draught Guinness: Draught Guinness first became available to licensed premises around 1959. To dispense the Guinness, a tap with two lines is attached to the keg. One line leads to a premixed gas cylinder containing nitrogen and carbon dioxide. Pressure gauges on this line allow the pressure in the system to be set. The other line leads to the dispensing tap. On its way to the spout of this tap, the Guinness is forced at high speed through an orifice plate with five small holes. From Bernoulli’s theorem, the pressure drops in the beer as it passes through the holes. This allows existing microbubbles to expand and also induces cavitation in the beer, as evidenced by the characteristic surge of bubbles seen in the dispensed beer. This initiation method is very effective, the result being a thick creamy head. 2. Canned and Bottled Draught Guinness: To meet the demand for draught Guinness that could be poured at home, bottled draught Guinness was launched in 1979. To initiate the Guinness, a syringe, supplied with the bottles, was used to suck up some of the Guinness. This was then squirted back into the glass of (headless) Guinness creating a surge of bubbles and, consequently, the head. The syringe initiator was replaced by the can ‘widget’ in 1989. The widget is a plastic capsule with a tiny hole connecting its interior to the surroundings. When the can is opened, a small amount of beer and nitrogen, previously trapped in the widget, shoots out into the beer. Originally, the widget was used only in canned draught Guinness and was in the form of an insert wedged into the bottom of the can. In 1997, the widget became a floating spherical capsule (it can be heard rattling in the can). A floating bottle widget in the shape of a rocket was introduced in 1999. The bottle widget is optimised for drinking straight from the bottle, while the can widget is optimised for pouring into a glass. The operation of the widget deserves further explanation. In the case of the can, a controlled amount of liquid nitrogen is added just before it is sealed. When this liquid nitrogen evaporates it pressurises the can to 3.8 bar. The pressure equalises in the widget, 144 the headspace and the beer, forcing some beer into the widget as it does so. When the can is opened, the pressure in the headspace and beer rapidly drops to about 1 bar (atmospheric pressure). The contents of the widget then decompress by squirting gas (mainly N2 ) and some beer into the Guinness. In the case of the bottle, no liquid nitrogen is added: the bottle is sealed in an environment already at the desired gas pressure. 3. Surger-Ready Draught Guinness: The ‘surger’ was launched in Tokyo in 2004. It is an electrical device that recreates the ‘surge and settle’ effect without the need for the kegand-tap dispensing system. The interface of the surger unit is a metal plate that vibrates ultrasonically. First a little water is poured onto a plate to ensure good mechanical contact. Then a can of surger-ready (widget-free) Guinness is poured into a glass and placed on the plate. The Guinness initially has little or no head. When a button is pressed, the plate vibrates ultrasonically. The associated pressure fluctuations in the Guinness almost immediately create a surge of bubbles and, consequently, the head. 3 Bubble nucleation and growth Many good introductions to the formation and growth of bubbles in supersaturated liquids can be found in the literature (e.g. [4, 5, 6, 7, 8]). Let us consider a container (keg, can or bottle) filled to the desired level with Guinness and pressurised by some method. In the case of the can, the container is sealed after a controlled amount of nitrogen has been dropped onto the liquid surface, while the bottle is sealed in a closed environment already at the desired gas pressure. In either case, after the container is sealed, an equilibrium is established between the dissolved gas in the liquid and the gas in the headspace (and in the widget, if present). This equilibrium obeys Henry’s law, which states that the concentration cl of gas dissolved in a liquid is proportional to the partial pressure p of that gas above the liquid: cl = Hp. (3.1) Clearly, the solubility of a gas in a liquid increases as the pressure of that gas over the liquid increases. The proportionality constant H is known as the Henry’s law constant. It is temperature dependent, decreasing as the temperature increases; that is, for a fixed partial pressure, gas solubility decreases as the temperature increases. This is the reason that bubbles are seen to form on the surface of a glass of water when it is left in the sun and the reason that effervescence (bubbling) is initially greater in a hot carbonated beverage than a cold one. When the container is opened, the pressure p in the headspace falls abruptly and Henry’s law is no longer satisfied. The liquid is said to be supersaturated with gas molecules. To reestablish equilibrium or, equivalently, to satisfy Henry’s law (3.1), the dissolved gas concentration cl must decrease. The gas molecules can escape either by diffusing through the free (top) surface 145 of the liquid or by forming bubbles1 . Although the laws of thermodynamics favour the formation of gas bubbles, there is a kinetic barrier to the production of gas bubbles. To see why, we note that the gas pressure pb inside a spherical bubble of radius R is given by Laplace’s law: pb = pl + 2γ , R (3.2) where pl is the pressure in the liquid and γ is the surface tension of the liquid in contact with the gas. Laplace’s law states that the pressure inside a bubble is greater by 2γ/R than that in the surrounding liquid due to the fact that the surface tension γ of the liquid tends to contract the bubble surface. Clearly, the gas in the bubble is in equilibrium with its concentration cl in the liquid if Henry’s law is satisfied: cl = Hpb ≡ cb , (3.3) If Henry’s law is not satisfied then the bubble either shrinks or grows. If cl < cb then the bubble shrinks as the gas in it redissolves, whereas if cl > cb then the bubble grows as gas diffuses into it from the liquid. Using Laplace’s law (3.2) to eliminate pb yields 2γ . c l = H pl + R Approximating pl by pa 2 and rearranging terms yields R= 2γ ≡ Rc , cl /H − pa where Rc is called the critical radius. If R = Rc then the gas bubble is in equilibrium with the gas concentration in the liquid (i.e. cl = cb ) and it neither shrinks nor grows. If R < Rc then cl < cb and the bubble shrinks. If R > Rc then cl > cb and the bubble grows (Figure 1). The critical radius is often given in terms of the supersaturation ratio S which is defined by S= cl − 1, Hpa so that 2γ . (3.4) pa S If p0 is the gas pressure over the liquid in the sealed container, then the gas concentration in the liquid is initially cl = Hp0 . Immediately after opening the container, the gas concentration Rc = 1 What are widely called ‘bubbles’ in liquids are actually cavities: a true bubble is a region in which gas is trapped by a thin film, whereas a cavity is a gas-filled hole in a liquid. 2 Strictly speaking, pl is the sum of the atmospheric pressure pa and the hydrostatic pressure ρl g (h − z), the bubble being at a depth h − z. Since pa ≈ 105 Pa and ρl gh . 103 Pa, the hydrostatic pressure can be neglected. 146 Figure 1: The (idealised) dissolved gas concentration close to a postcritical (R > Rc ) bubble. ∆c is the difference between the dissolved gas concentration cl in the liquid bulk and the dissolved gas concentration cb = Hpb in the bubble ‘skin’, which is a thin layer next to the bubble surface and in equilibrium with the gas pressure in the bubble. The concentration falls from cl to cb over a diffusion boundary layer of thickness δN . in the liquid is still cl = Hp0 so that the initial supersaturation ratio is S = p0 /pa − 1. For beer (γ ≈ 40 mN m−1 [3]) and an initial supersaturation ratio of 2.8, which is typical for draught Guinness, the critical radius is 0.3 µm. If enough gas molecules cluster together to form an embryonic bubble with a radius greater than this critical radius, then that bubble will survive and grow. The random thermal motion of the gas molecules is the process that provides these clusters. However, in order to cluster together, the gas molecules must first push their way through the polar water molecules that are strongly attracted to each other. This places a considerable energy barrier before bubble formation. This energy barrier can be lowered by reducing the critical radius, which also increases the likelihood that enough gas molecules will randomly cluster together. To this end, the critical radius can be reduced by lowering the surface tension γ3 or by increasing the supersaturation ratio S. However, calculations, verified by experiment, show that the random nucleation of bubbles, called homogeneous nucleation, requires a supersaturation ratio of several hundred to over a thousand to be observable [9, 10]. In practice, bubbles form in even weakly supersaturated liquids, such as carbonated soft drinks (S ∼ 1 [11]) or champagne (S ∼ 5 [12]). When these liquids are poured into a glass, it is easy to observe that bubbles form on the surface of the glass rather than in the bulk of the liquid. When the bubbles reach a critical size, they detach from the glass surface and rise up through the liquid. Over most of the glass surface, the bubble production is very slow, produces large (∼ 1 mm) bubbles, and soon stops altogether. However, at a few spots on the 3 E.g. we found that when washing-up liquid is added to a carbonated soft drink and then placed on the surger unit, a head consisting of small bubbles is formed. Without the washing-up liquid, the head consists of much larger bubbles and also collapses much faster. 147 Figure 2: Bubbles in a glass of carbonated beverage appear to rise in streams from spots on the surface of the glass. As they rise, they grow larger and spread further apart. glass surface, the bubble production is very rapid and a continuous stream of small (< 1 mm) bubbles can be seen rising from these spots. As the bubbles rise, they grow larger and spread further apart (Figure 2). The surface of the glass is assisting bubble formation by providing nucleation sites, which are usually in the form of preexisting gas pockets [13]. This is termed heterogeneous nucleation. The conventional view is that when a glass is filled with liquid, air can become trapped in crevices on the glass surface [13]. If the radius of curvature of any of these gas pockets is greater than the critical radius, then that gas pocket grows into a macroscopic bubble as dissolved gas diffuses into it (Figure 3). Eventually, the increasing buoyancy force on the bubble exceeds the capillary force holding it to the glass surface and it detaches. Some gas may be left behind in the crevice so that the process of bubble production can repeat itself. However, recent work has shown that cellulose fibres, which either fall out of the air or are left after drying with a towel, can be much better nucleation sites than glass crevices [12, 14]. These cellulose fibres are roughly cylindrical and hollow, about 100 µm long and several µm wide. When a liquid is poured into a glass, air is trapped inside these fibres. Since the radius of curvature of these gas pockets is greater than the critical radius, they grow as dissolved gas diffuses into them. When a gas pocket reaches the tip of a fibre, a bubble is released, but some of the gas is left behind in the fibre so that the process of bubble production can repeat itself (Figure 4). This results in a characteristic bubble stream (Figure 2). As the gas concentration in the liquid decreases, the frequency of bubble formation decreases and the critical radius increases. When the critical radius is equal to the radius of curvature of a gas pocket, bubble production from that gas pocket stops. In summary then, the bubbles in a glass containing a carbonated beverage originate from gas pockets trapped either in glass crevices or cellulose fibres, the bubbles from cellulose fibres being much smaller and more frequent, and thereby forming bubble streams. In passing, we note that a dramatic example of heterogeneous nucleation is the addition of a spoonful of sugar to a carbonated soft drink. The sugar grains most likely contain many small gas pockets. 148 Figure 3: Bubble production cycle from a crevice in a glass wall. (a) Gas diffuses into trapped air pocket. (b) Bubble forms. (c) Buoyancy force FB eventually matches the capillary force Fc . (d) Bubble detaches. Adapted from [15]. (a) (b) (d) (c) Figure 4: Bubble production cycle from a cellulose fibre. (a) Gas diffuses into trapped air pocket. (b) Pocket reaches tip of fibre. (c) Bubble forms. (d) Bubble detaches. 149 After a bubble is released from its nucleation site, it grows as it makes its way to the surface. This is due to the continuous diffusion of dissolved gas into the bubble (the drop ρl gh in the liquid pressure makes a minor contribution to the expansion of the bubble). The buoyancy force increases as the bubbles grow, causing them to accelerate and spread further apart. The rising bubbles also experience a friction force (i.e. a drag) in the viscous liquid. If the radius of a bubble were fixed, then the bubble would rapidly reach a constant velocity, termed the terminal velocity, at which the upward buoyancy force would be balanced by the downward drag force. However, for a growing bubble, the buoyancy force (∝ R3 ) always remains slightly larger than the drag force (∝ R) so that the bubble continuously accelerates as it rises. The final size of a bubble is affected by three main factors: (i) the distance the bubble has travelled, (ii) the concentration of dissolved gas, and (iii) the concentration of organic compounds. The first two are obvious factors, whereas the third needs further explanation. Beer contains many organic compounds which have a water-soluble (hydrophilic) part and a water-insoluble (hydrophobic) part. Hence, they like to gather around the surface of a bubble with their hydrophobic ends pointing into the gas bubble and their hydrophilic ends in the liquid. They are thus called surfactants (SURFace-ACTive AgeNTS). As a bubble rises, it collects these surfactants. According to fluid mechanics, a rigid (i.e. solid) sphere rising through a liquid experiences a greater drag than a fluid (e.g. gas) sphere. Essentially, the no-slip condition at the surface of a solid sphere retards its movement, whereas the circulation of gas within a gas bubble ‘lubricates’ its movement through the liquid. However, surfactants reduce this lubricating effect, causing the drag on the bubble to increase. The bubble continues to grow as it rises and if it grows fast enough then the fraction of the bubble surface that is contaminated by surfactants remains small. This is the case for clear liquids like champagne, which has a relatively high CO2 content (∼ 12 g l−1 [12]) and thus bubble growth rate, and a low surfactant concentration (∼ 1 mg l−1 [12]), so that champagne bubbles experience a drag similar to that which would be experienced by clean bubbles [16]. In contrast, beer bubbles experience a drag similar to that which would be experienced by a solid sphere of the same radius, the combination of low CO2 content (∼ 6 g l−1 ) and high surfactant concentration (∼ 100 mg l−1 [12]) eliminating the lubricating effect associated with a clean bubble [5, 16]. Furthermore, in liquids with zero or low surfactant concentrations, such as pure water, carbonated soft drinks and champagne, bubbles burst almost immediately on reaching the surface. In beer, the surfactant coating allows the bubbles to collect at the surface of the liquid to form a foam, called the head. Guinness, being weakly supersaturated, will have no observable homogeneous nucleation within the body of the liquid. Since surger-ready Guinness can be poured with very little or no head forming, the heterogeneous nucleation rate on the surface of the glass must also be very low. It can be made appreciable by adding a spoonful of sugar. The sugar crystals provide many nucleation sites. Furthermore, as the sugar dissolves into the liquid, it readily liberates the bubbles forming at these sites. The result is a surge of small bubbles. Bubbles do form on the surface of the glass but they seem reluctant to detach, probably because the low dissolved gas concentrations in Guinness prevent them from becoming large enough to detach. 150 4 Qualitative explanation of initiation There are basically two ways to promote bubble formation: (I) we can introduce postcritical (R > Rc ) bubbles or gas pockets (‘seeding’), or (II) we can grow the precritical (R < Rc ) bubbles that are popping into and out of existence. The methods for achieving type I bubble formation are • Entrapment of gas bubbles: If a container is shaken, then some of the gas above the liquid level is mixed into the liquid as small bubbles. When the container is opened, the pressure falls and these bubbles expand. Perhaps more importantly, dissolved gas will diffuse into these bubbles if R > Rc . This vigorous bubble growth can cause shaken carbonated beverages to spray out of their containers. Splashing a liquid into a glass, rather than pouring it slowly down the side of the glass, can also entrap air bubbles with R > Rc , as can banging a glass of liquid down or swirling the glass. By opening a can of surger-ready Guinness at an appropriate angle, it is possible to initiate the Guinness. It is likely that as the gas rushes out of the can, it causes waves to form on the liquid surface via the Kelvin-Helmholtz instability. These waves then entrap gas bubbles as they crash against the tab, the walls of the can or back into the liquid. Ocean foam has similar origins, wind creating the crashing waves. • Cavitation: In the keg-and-tap system, the Guinness is forced through an orifice plate. In accordance with Bernoulli’s theorem, the pressure decreases and the velocity increases as the liquid passes through the holes. If the pressure decrease is large enough, then the liquid can be torn apart and cavities (i.e. bubbles) can form in the Guinness. The distinction between type I and type II bubble formation is blurred here since a liquid is more likely to be torn apart where it is weakest, such as where a precritical bubble exists. • Injection of gas: In widget cans and bottles, a jet of gas shoots from the widget into the Guinness when the container is opened. This jet breaks up into bubbles. If the jet is too short, then some of the Guinness at the bottom of the container remains uninitiated. When the Guinness is poured out of the container, the uninitiated beer passes through or by the foam, providing a fresh supply of nitrogen to existing bubbles and thereby refreshing the head; that is, pouring completes initiation of the Guinness. The method for achieving type II bubble formation is • Pressure fluctuations: Precritical bubbles will expand if the local liquid pressure drops. For example, if the expansion is isothermal, then a drop in liquid pressure from 1 bar to 0.5 bar causes a precritical Guinness bubble with R = 0.25 µm to expand until R ≈ 0.31 µm, while the critical radius drops from 0.29 µm to 0.24 µm. Hence, dissolved gas in the liquid diffuses into the bubble. If the liquid pressure then returns to its original value, the bubble may or may not survive. It will survive if enough gas has diffused into the 151 Initiation method Keg-and-tap Syringe Widget Shaking container Banging glass down Opening can at an angle Ultrasound Method of bubble formation Cavitation Cavitation; entrapment of air bubbles Injection of gas bubbles Entrapment of gas bubbles Entrapment of gas bubbles Entrapment of gas bubbles Pressure fluctuations Type I I I I I I II Table 7: Initiation methods classified according to the method and type of bubble formation employed. bubble to keep its radius greater than the critical radius. Even if this is not the case, the bubble can be encouraged to grow by repeatedly subjecting it to pressure fluctuations. The Guinness surger unit employs this method by vibrating at ultrasonic frequencies. In table 7, we have classified the initiation methods for Guinness according to the method and type of bubble formation employed. Since the surger unit is the ‘cleanest’ experiment, we decided at the Study Group to focus on this initiation method. 5 The mathematical models First, we describe a mathematical model for the diffusion of dissolved gas into a bubble which is not rising in the liquid. We call this passive diffusion. By assuming that the gas bubbles are purely nitrogen, we can introduce a small parameter which allows us to obtain some analytical results. These results emphasise the role played by the critical radius. Next, we introduce the effect of a sound field and apply analytical results found in the literature. It is shown that above a critical threshold for the amplitude of the sound field, precritical bubbles grow. This is termed rectified diffusion. Next, we consider how diffusion is altered when the bubble is rising, as it does in practice. We call this convective diffusion, since convection is now assisting the transfer of mass to the bubble. Finally, a mathematical model for describing the initiation of Guinness by the surger unit is described and solved numerically. 5.1 Bubble growth by passive diffusion Consider a spherical bubble of radius R (t) at rest in an infinite incompressible inviscid liquid (Figure 5). Far away from the bubble the pressure in the liquid is p∞ (t) and this is taken to 152 Liquid r Gas R(t) Bubble surface Figure 5: A spherical bubble of radius R (t) in a liquid. be a known input which regulates the growth or collapse of the bubble. We assume that, as its radius changes in time t, the bubble remains spherical so that the bubble surface moves only in the radial direction. It follows that the motion induced in the fluid is also radial: ur = u (r, t) , uθ = uφ = 0, where ur , uθ , uφ are the components of the fluid velocity expressed in spherical coordinates (r, θ, φ). The origin of the coordinate system is taken at the centre of the bubble so that r is distance from the centre of the bubble. Because of the spherical symmetry, all physical quantities are functions of at most r and t. Heat transfer is neglected by assuming that the temperature T is constant in time and uniform in space. Furthermore, the pressure pb in the bubble is taken to be a function only of t. We also assume that only one gas species (i.e. nitrogen) is present in the liquid and the bubble. The governing equations are the incompressibility condition: 1 ∂ 2 r u = 0, r2 ∂r (5.1) the radial component of the Navier-Stokes equation governing the motion of the liquid: ∂u ∂u 1 ∂pl +u =− , ∂t ∂r ρl ∂r (5.2) and the diffusion-convection equation governing the gas concentration c (r, t) in the incompressible liquid: ∂c ∂c D ∂ 2 ∂c r . (5.3) +u = 2 ∂t ∂r r ∂r ∂r The mass flux of gas into the bubble is given by ∂c 2 , q = 4πR D ∂r r=R(t) 153 and can be equated to the rate of increase of the bubble mass: ∂c d 4 3 2 . πR ρb = 4πR D dt 3 ∂r r=R(t) (5.4) At the bubble surface r = R (t), we also have the following boundary conditions: u = Ṙ at r = R (t) , pl = pb − 2γ R (5.5) at r = R (t) , at r = R (t) . c = Hpb (5.6) (5.7) The kinematic boundary condition (5.5) holds as the fluid motion is induced by the growth or collapse of the bubble. Equation (5.6) is simply Laplace’s law (3.2), and (5.7) is simply an application of Henry’s law (3.1). Far away from the bubble at r = r∞ , the pressure and concentration are prescribed by pl = p∞ (t) at r = r∞ , (5.8) c = c∞ (t) at r = r∞ . (5.9) We also require the following initial conditions: R = R0 at t = 0, (5.10) pl = p0 at t = 0, (5.11) c = Hp0 at t = 0. (5.12) Here, p0 corresponds to the gas pressure in the sealed container so that (5.12) follows from an application of Henry’s law. The contribution of gravity to the liquid pressure is neglected so that the initial liquid pressure can be set equal to p0 . From (3.4), the critical radius is 2 4 × 10−2 N m−1 2γ = ≈ 0.3 µm, Rc = ∆p 2.8 × 105 N m−2 where ∆p = p0 − pa . Solving (5.1) for u and applying the boundary condition (5.5) yields u= R2 Ṙ . r2 (5.13) Substituting (5.13) into (5.2) yields ∂ ∂t R2 Ṙ r2 ! R2 Ṙ ∂ + 2 r ∂r 154 R2 Ṙ r2 ! =− 1 ∂pl . ρl ∂r Hence, 2 2 2 d R Ṙ 2 R Ṙ 1 ∂pl 1 − =− . 2 5 r dt r ρl ∂r Integrating from r = R (t) to r = r∞ yields Z 2 Z r∞ 1 d R2 Ṙ Z r∞ 1 1 r∞ ∂pl 2 dr − 2 R Ṙ dr = − dr, 2 5 dt ρl R ∂r R r R r and so, applying the boundary condition (5.8) and assuming that r∞ ≫ R, 2 1 d R Ṙ Ṙ2 p∞ (t) − pl (R) − =− . R dt 2 ρl This can be rewritten as pl (R) = p∞ (t) + ρl 3Ṙ2 RR̈ + 2 ! . Using (5.6) to eliminate pl (R), the liquid pressure at the bubble surface, we obtain ! 3Ṙ2 2γ . + ρl RR̈ + pb (t) = p∞ (t) + R 2 (5.14) This is the well-known Rayleigh-Plesset (RP) equation. In the steady state, it reduces to Laplace’s law. Substituting (5.13) into (5.3) yields ∂c R2 Ṙ ∂c D ∂ + 2 = 2 ∂t r ∂r r ∂r 2 ∂c r . ∂r (5.15) Assuming that the gas in the bubble behaves as a perfect gas, we have ρb = pb , Rs T (5.16) where Rs is the specific gas constant. Substituting (5.16) into (5.4) yields 1 d R3 pb ∂c 2 . =R D 3Rs T dt ∂r r=R(t) Using (5.14) to eliminate the bubble pressure pb finally yields " !# 2 1 d 2γ ∂c 3 Ṙ 3 2 . R p∞ + =R D + ρl RR̈ + 3Rs T dt R 2 ∂r r=R(t) 155 (5.17) Equations (5.15) and (5.17) govern c (r, t) and R (t). The remaining initial and boundary conditions are R = R0 at t = 0, (5.18) at t = 0, !# 3Ṙ2 RR̈ + 2 (5.19) c = Hp0 " c = H p∞ + 2γ + ρl R c ≈ c∞ (0) = Hp0 at r = R (t) , at r = r∞ . (5.20) (5.21) In condition (5.21), we have assumed that for a single bubble in an infinite liquid, the concentration far away from the bubble remains unchanged from its initial value. Initial bubble growth We now nondimensionalise the governing equations by letting R∗ = R , Rc r∗ = r , Rc t t∗ = , τ c∗ = c − Hpa , H∆p (5.22) where asterisks denote dimensionless variables and τ= R2c , Dε ε= H∆p , ρa ρa = pa . Rs T The time scale τ corresponds to the time it takes a mass 4πρaR3c /3 to enter a sphere of radius Rc if the mass flux density is DH∆p/3Rc . For nitrogen τ ≈ 0.8 ms and ε ≈ 0.06. Substituting (5.22) into the governing equations (5.15) and (5.17) yields (asterisks omitted) ! 2 ∂c R Ṙ ∂c 1 ∂ 2 ∂c ε = 2 r , (5.23) + 2 ∂t r ∂r r ∂r ∂r 1 d 3 dt where R 3 " p∞ ∆p 3Ṙ + + δ RR̈ + pa pa R 2 ∆p ≈ 2.8, pa δ= 2 !# ∂c . =R ∂r r=R(t) 2 (5.24) ρl D2 ε2 ≈ 10−9 . pa R2c Omitting the small terms in (5.23) and (5.24) yields 1 ∂ 2 ∂c r = 0, r2 ∂r ∂r p∞ ∆p 1 d 3 2 ∂c . R + =R 3 dt pa pa R ∂r r=R(t) 156 (5.25) (5.26) Dropping the terms of order ε in (5.23) means that we can no longer satisfy the initial condition (5.19). The other initial and boundary conditions in dimensionless form are (asterisks omitted) R= pa c= ∆p R0 Rc at t = 0, (5.27) (5.28) p∞ ∆p + −1 pa pa R at r = R (t) , r∞ , (5.29) Rc where we have omitted terms of order δ in (5.28). Since r∞ /Rc ≫ 1, we can rewrite (5.29) as c = 1 at r = c → 1 as r → ∞. (5.30) Solving (5.25) for c and applying the boundary conditions (5.28) and (5.30) yields Rpa p∞ ∆p 1 + − 1 − 1 + 1. c= r∆p pa pa R Hence, pa p∞ ∆p 1 ∂c =− + −1 −1 . ∂r r=R R∆p pa pa R Substituting (5.31) into (5.26) yields p∞ Rpa p∞ ∆p 1 ∆p 1 d 3 R =− + + −1 −1 . 3 dt pa pa R ∆p pa pa R (5.31) (5.32) When the container is opened, the liquid pressure rapidly equilibrates with the atmospheric pressure so that p∞ = pa . Then (5.32) becomes ∆p 1 d 3 R 1+ = R − 1, 3 dt pa R or dR R−1 = 2 . dt R + 2R∆p/ (3pa ) (5.33) If R < 1 then Ṙ < 0 and the bubble collapses as the gas in it redissolves. If R > 1 then Ṙ > 0 and the bubble grows as gas in the liquid diffuses into it. If R = 1 then Ṙ = 0 and the bubble is in an unstable equilibrium state; that is, a perturbation from this state results in the bubble either growing or collapsing, depending on whether the perturbation is an increase or decrease in R. This confirms that Rc is the critical radius, determining whether the bubble will grow or collapse by diffusion. Solving (5.33) for R and applying the initial condition (5.27) yields 2∆p R20 R0 R−1 1 2 R − 2 + 1+ R− + ln = t. 2 Rc 3pa Rc R0 /Rc − 1 As shown in Figure 6, the bubble collapses if R0 < Rc and grows if R0 > Rc . For R0 > Rc , √ R → 2t as t → ∞. 157 1.8 1.6 1.4 R/R c 1.2 1 0.8 R =1.2R 0.6 0 c R =1.1R 0 0.4 c R =1.0R 0 c R =0.9R 0.2 0 c R =0.8R 0 0 0 1 2 3 4 c 5 t/τ Figure 6: The bubble collapses if R0 < Rc and grows if R0 > Rc . Bubble maturation Now consider a finite amount of liquid with many (identical) bubbles in it. Initially, the dissolved gas concentration is c = Hp0 , but as the bubbles grow, the dissolved gas concentration falls. Eventually, it is too low and the bubbles stop growing. At this stage, the gas in each bubble is in equilibrium with its concentration in the liquid so that Henry’s law is satisfied: 2γ , c = Hpb = H pa + R∞ where R∞ is the ultimate radius of each bubble. Also at this stage, the gas in each bubble has been extracted from a volume 34 πr3∞ of the liquid, where r∞ is the catchment radius. Ignoring the diffusion of gas across the free surface of the liquid and recalling that the liquid is incompressible, conservation of mass implies initial mass of remaining mass mass of gas gas in liquid of of gas in liquid − . (5.34) = in bubble (of same volume) volume 4 πr3 3 ∞ or, in mathematical notation, 4 3 4 3 4 3 2γ . πR ρb = πr∞ Hp0 − πr∞ H pa + 3 ∞ 3 3 R∞ Solving for r∞ and assuming that 2γ/R∞ ≪ pa , we obtain r∞ = R∞ . ε1/3 158 (5.35) To study how the bubble grows to its final radius R∞ , we rescale distance and time by letting r∗ → Λr∗ , R∗ → ΛR∗ , t ∗ → Λ2 t ∗ , (5.36) where Λ = R∞ /Rc is a large number (Λ = 200 for a 60 µm bubble). The length scale is then R∞ (≈ 60 µm, say) and the time scale is R2∞ / (Dε) (≈ 30 s). Substituting (5.36) into (5.23) and (5.24) yields (asterisks omitted) ! ∂c R2 Ṙ ∂c 1 ∂ 2 ∂c ε = 2 r , + 2 ∂t r ∂r r ∂r ∂r 1 d 3 dt R 3 " 3Ṙ2 RR̈ + 2 p∞ ∆p δ + + 2 pa pa ΛR Λ !# Omitting the small terms in these equations, we obtain 1 ∂ 2 ∂c r = 0, r2 ∂r ∂r 1 d R3 p∞ 2 ∂c . =R 3 dt pa ∂r r=R(t) ∂c =R . ∂r r=R(t) 2 The appropriate boundary conditions are (asterisks omitted) pa p∞ c= −1 at r = R (t) , ∆p pa c= c∞ (t) − Hpa H∆p at r = r∞ 1 = 1/3 , R∞ ε (5.37) (5.38) (5.39) (5.40) where we have neglected terms of order Λ−1 and δΛ−2 in (5.39). Note that we can no longer approximate c∞ (t) by c∞ (0) as we did in (5.21) as we are now considering a finite liquid in which there are many bubbles and the concentration is everywhere falling from its initial value Hp0 . Although ε−1/3 ≈ 2.6, we will assume that it is much larger so that (5.40) can be replaced by c → c∗∞ (t) as r → ∞, (5.41) where c∞ (t) − Hpa . H∆p Solving (5.37) for c and applying the boundary conditions (5.39) and (5.41) yields R ∗ Rpa p∞ c∞ . −1 + 1− c= r∆p pa r c∗∞ (t) = Thus 1 pa p∞ ∂c ∗ =− − 1 − c∞ . ∂r r=R R ∆p pa 159 (5.42) (5.43) Substituting (5.43) into (5.38) yields pa p∞ 1 d R3 p∞ ∗ = −R − 1 − c∞ . 3 dt pa ∆p pa (5.44) Applying the conservation of mass (5.34) when the radius of the bubble is R (t), we obtain, in mathematical notation and dimensional variables, Z (r3∞ +R3 )1/3 4 3 4 3 πR ρb = πr∞ Hp0 − 4π c (r, t) r2 dr. 3 3 R Using the perfect gas law to eliminate ρb and the RP equation to eliminate pb , we obtain " 4 3 2γ ρl p∞ πR ρa + + 3 pa pa R pa 3Ṙ2 RR̈ + 2 !# Z (r3∞ +R3 )1/3 4 3 = πr∞ Hp0 − 4π c (r, t) r2 dr. 3 R In terms of dimensionless variables (asterisks omitted): R 3 " ∆p δ p∞ + + 2 pa pa ΛR Λ 3Ṙ2 RR̈ + 2 !# Z (r3∞ /R3∞ +R3 )1/3 r3∞ εp0 pa = 3 r2 dr. c+ − 3ε R∞ ∆p ∆p R Using (5.35) to eliminate r∞ and then neglecting small terms yields p∞ R3 p0 = − 3ε pa ∆p Z ε−1/3 pa c+ r2 dr. ∆p R Substituting (5.42) for c yields p0 p∞ R3 = − 3ε pa ∆p Z ε−1/3 R Rpa r∆p pa 2 p∞ R ∗ c∞ + r dr. −1 + 1− pa r ∆p Upon integrating, we obtain p0 p∞ 1 Rpa R pa p∞ R3 ∗ = −1 + − 3ε − c∞ + pa ∆p 2ε2/3 ∆p pa 3ε 2ε2/3 3ε∆p 3 R pa p∞ R3 ∗ R3 pa + 3ε . − 1 − c∞ + 2∆p pa 6 3∆p 160 1 R/Λ R c 0.8 0.6 0.4 0.2 0 0 0.2 0.4 0.6 0.8 1 t/Λ2τ Figure 7: For large times t ∼ Λ2 τ ∼ 30 s, the bubble radius approaches a limiting value ΛRc = R∞ . Neglecting terms of order ε1/3 and smaller (ε) and solving for c∗∞ yields c∗∞ (t) = 1 − p∞ 3 R. pa Thus, substituting (5.45) into (5.44) yields pa p∞ p∞ 3 1 d R3 p∞ = −R −1 −1+ R . 3 dt pa ∆p pa pa (5.45) (5.46) If p∞ = pa then (5.46) yields dR 1 − R3 = . dt R Solving (5.47) for R and applying the initial condition R → 0 as t → 0, we obtain 2R + 1 1 1 π 1 −1 2 √ − ln (1 − R) + √ = t. ln R + R + 1 − √ tan 6 3 3 3 6 3 (5.47) As t → ∞, R → 1 as can be seen in Figure 7. 5.2 Bubble growth by rectified diffusion Consider a bubble with R0 < Rc . In the absence of pressure fluctuations (i.e. a sound field), it would gradually dissolve and disappear (Figure 6). However, if a sound field is introduced, then it can cause the bubble to pulsate and this may lead to the diffusion of gas into the bubble by a process called ‘rectified diffusion’. This process competes with the normal diffusion of gas out of 161 the precritical bubble. If the amplitude of the sound waves is greater than some threshold value, then rectified diffusion dominates and the bubble grows. Rectified diffusion is composed of three effects [17]: (i) When the bubble contracts, the concentration of gas in the bubble increases and gas tends to diffuse out of the bubble. When the bubble expands, the concentration of gas decreases and gas tends to diffuse into the bubble; (ii) Since the surface area of the bubble is greater during the expansion, more gas will enter during the expansion of the bubble than will leave during the contraction; (iii) During the expansion, the thickness of the diffusion boundary layer adjacent to the bubble decreases. As a result, the concentration gradient across this layer increases. Since the mass flow into the bubble is proportional to the concentration gradient, it is enhanced during expansion. The opposite is true during contraction. To model the effect of a sound field on a bubble at rest in a liquid, we can set p∞ (t) = pa +ps (t), where ps = Ps sin (ωt) is the sound field. As shown by Eller and Flynn (1965), progress can then be made by first assuming that the instantaneous bubble radius R (t) is periodic with a bubble period Tb and then averaging the equations over one bubble period. This was found to be a valid approximation if Re ≫ (D/ω)1/2 , where Re is the ‘equilibrium’ radius of the pulsating bubble such that 3 2γ Re pb = pa + . Re R For D = 2 × 10−9 m2 s−1 and ω = 250π × 103 rad s−1 , this requires Re ≫ 0.05 µm. The final solution for the rate of change of Re with time is [18] 2γ R p0 DHRs T hR/Re i dRe E 1+ , (5.48) = −D 4 dt Re [1 + 4γ/ (3pa Re )] Re pa p R a e (R/R ) e where angle brackets denote a time average over one bubble period, and 2 R Ps 2 = 1 + Kα , Re pa * + 2 4 R Ps 2 = 1 + (3 + 4K) α , Re pa 1 , 3 [1 + 4γ/ (3pa Re ) − β2 ] 4 − β2 /4 + 4γ/ (3pa Re ) , K= 1 + 4γ/ (3pa Re ) α= β2 = ρl ω2 R2e . 3pa (5.49) (5.50) (5.51) (5.52) (5.53) In deriving (5.49) and (5.50), it is assumed that that the ratio Ps /pa is small, an assumption that does not appear to be satisfied in the case of the surger unit. 162 60 50 Ps,T/pa 40 30 20 10 0 0 0.2 0.4 0.6 R /R e 0.8 1 c Figure 8: Ps,T versus Re for p0 = 3.8 bar, pa = 1 bar, γ = 40 mN m−1 , ρl = 103 kg m−3 , ω = 250π × 103 rad s−1 . The threshold amplitude Ps,T for rectified diffusion is obtained by setting Ṙe = 0 and solving for Ps to obtain s 1 + 2γ/ (pa Re ) − p0 /pa pa Ps,T = . (5.54) α (3 + 4K) p0 /pa − [1 + 2γ/ (pa Re )] K The threshold amplitude increases as Re decreases, as shown in Figure 8. Given R0 , the initial bubble radius, and Ps , (5.48) can be solved for Re (t) using the Matlab ODE solver ode15s. Some examples are shown in Figure 9. Those precritical bubbles for which Ps > Ps,T grow, while those for which Ps < Ps,T shrink. 5.3 Bubble growth by convective diffusion Since the diffusion coefficient D is small (D ≈ 2 × 10−9 m2 s−1 for N2 in water), the rate of mass transfer into a bubble at rest is also small. In practice, the bubble rises in the liquid due to its buoyancy and this motion enhances the rate of mass transfer, just as stirring tea enhances the dissolution of any added sugar. This transport of matter by the combined processes of convection and diffusion will be called convective diffusion. Consider then a bubble with radius R (t) ≫ Rc rising steadily through a supersaturated liquid. The mass flux q into the bubble would be expected to be proportional to the surface area 4πR2 of the bubble and to the concentration drop ∆c = cl − cb between the liquid bulk and the bubble ‘skin’ (Figure 1): q = k4πR2 ∆c, where the proportionality constant k is known as the mass transfer coefficient. This can be 163 2.5 R =1.2R 0 c R =1.1R 0 c R =1.0R 2 0 c R =0.9R 0 c R =0.8R 0 1.5 c R =0.7R /Rc 0 c R =0.6R c R e 0 1 0.5 0 0 1 2 3 4 5 t/τ Figure 9: Re versus t for Ps = 2 bar, p0 = 3.8 bar, pa = 1 bar, γ = 40 mN m−1 , ρl = 103 kg m−3 , ω = 250π × 103 rad s−1 . equated to the rate of increase of the bubble mass: d 4 3 πR ρb = k4πR2 ∆c. dt 3 (5.55) Empirically, k is proportional to the diffusion coefficient of the gas in the liquid, so that (5.55) can be rewritten as 4πR2D∆c d 4 3 . (5.56) πR ρb = dt 3 δN Comparing (5.4) and (5.56), the Nernst diffusion layer thickness δN is seen to be given by ∂c ∆c = . ∂r r=R δN As shown in Figure 1, δN is the thickness which the diffusion layer would have if the concentration profile were a straight line with slope (∂r c)r=R . If we assume that the gas obeys the perfect gas law ρb = pb/ (Rs T ) with pb ≈ pa as R ≫ Rc , and if we also assume that T is constant, then ρb is also approximately constant. Then d 4 3 dR πR ρb = 4πR2 ρb . (5.57) dt 3 dt Substituting (5.57) into (5.55) yields dR k∆c = . dt ρb (5.58) Experimental evidence suggests that k is approximately constant [5, 12]. During the rise time of the bubble, the gas concentration cl in the liquid is also approximately constant. Furthermore, 164 since cb = Hpb and pb ≈ pa , cb is constant. Integrating (5.58) then yields R = R0 + κt, where R0 is the initial (t = 0) radius of the bubble and the growth rate of the bubble is k∆c . ρb κ= (5.59) For champagne bubbles originating from cellulose fibres, R0 ≈ 20 µm and κ ≈ 350 − 400 µm s−1 [12]. For a carbonated beer, values of κ ≈ 100 − 150 µm s−1 [12] and κ ≈ 40 µm s−1 [5] have been reported. The low gas concentrations in Guinness imply smaller values of κ, as we will see. In the literature, the mass transfer coefficient k is usually expressed in terms of the Sherwood number (also known as the Nusselt mass transfer number): Sh = 2kR , D which is itself a function of the Reynolds number Re = 2ρlRU , µ and the Peclet number 2RU . D If Pe ≪ 1, then mass transfer is dominated by diffusion. If Pe ≫ 1, then mass transfer is dominated by convection. In order to determine k, we first need to estimate U, which we can do by considering the forces on a rising bubble. By Archimedes principle, the buoyancy force on a body immersed in a liquid is equal to the weight of the liquid displaced by that body. Thus, for a spherical bubble, the buoyancy force is Pe = 4 FB = πR3ρl g. 3 Since ρb ≪ ρl , the weight of the bubble is negligible in comparsion to the buoyancy force and the bubble rises. As the bubble rises it experiences resistance due friction between its surface and the surrounding viscous liquid. This friction force is expressed as Fd = Cd ρl U2 2 πR , 2 where Cd is the friction factor or drag coefficient. The buoyancy force is nearly always balanced by the drag force so that we can equate them to obtain 4 3 ρl U2 2 πR ρl g = Cd πR , 3 2 165 or, solving for Cd , 8Rg . (5.60) 3U2 This equation can be used to determine Cd experimentally. It is found that beer bubbles ultimately experience a drag similar to that experienced by solid spheres, for which [16] Cd = Cd = 24 1 + 0.15 Re0.687 , Re Re < 800. Over the rise time of the bubble though, a good approximation is Cd ≈ 48/ Re [12]. Solving (5.60) for U then yields ρl gR2 U= . (5.61) 9µ As R increases, U increases and the bubble accelerates. Small bubbles rise slower than large bubbles. Hence, they can be entrained in the downward liquid flow at the glass walls and actually sink. These sinking bubbles are easily seen in a pint of Guinness. For bubbles with radii of order 50 µm in Guinness (ρl ≈ 103 kg m−3 , µ ≈ 2 × 10−3 Pa s [19]), U ≈ 0.1 cm s−1 , Re ≈ 0.07 and Pe ≈ 68. In carbonated beers, U is a couple of cm s−1 in good accordance with Equation (5.61) [5]. For the case of stationary (U, Re, Pe = 0) bubbles, Sh = 2 as can be verified by rewriting (5.32) in dimensional form. For rising bubbles (nonzero U, Re, Pe), flow inside and outside the bubble affects the mass transfer. The Sherwood number can be calculated from the semiempirical expression [8] ! 1/3 0.096 Re , Re < 105 . 1+ Sh = 1 + 1 + Pe1/3 1 + 7 Re−2 For large Pe, Sh ≈ Pe1/3 so that, from the definitions of Sh and Pe, 2/3 D 1/3 D k= Pe = U1/3 . 2R 2R Substituting (5.61) for U yields k= D 2 2/3 ρl g 9µl 1/3 . Because of the R2 dependence of U, k becomes independent of R and hence time, as expected. The growth rate (5.59) becomes 2/3 1/3 D ∆c ρl g κ= 2 9µl ρb Over time cl drops, so that κ drops. For pure nitrogen bubbles in Guinness, we obtain κ ≈ 3 µm s−1 , much smaller than the corresponding values for champagne and carbonated beers. Although Guinness bubbles rise slower and therefore have longer to grow, they remain smaller than those in carbonated beverages because of this very low growth rate. 166 5.4 The surger In the case of the surger, precritical bubbles are initially grown by rectified diffusion. Once they become postcritical, they grow by convective diffusion as they rise through the Guinness. The equations governing the growth of any individual bubble are much like those we have already seen, except now we must consider the combined effects of a sound field and convection. We must also consider the whole distribution of bubbles within the Guinness, as well as the dissolved gas concentration, if we want to model the formation of the head. Consider a (pure nitrogen) bubble which is rising steadily through the liquid as it grows. We assume that the rate of increase of mass in the bubble is equal to the sum of the diffusive and the convective fluxes into the bubble: rate of increase rate of addition rate of addition of mass of gas of mass by of mass by = + , in bubble diffusion convection or, in mathematical notation, d dt 4 3 πR ρb 3 = 4πR2 D∆c + πR2 fU∆c. R (5.62) The factor f is unknown initially; through trial-and-error, a value which makes the results quantitatively reasonable is chosen. Of course, if the Sherwood number were known then Equation (5.55) could be used in place of (5.62). Using the perfect gas law to eliminate ρb and Henry’s law to eliminate cb , we obtain 1 fU 1 d R3 pb 2 (cl − Hpb ) . (5.63) =R + 3Rs T dt R 4 The bubble is assumed to be rising with the Stokes’ terminal velocity: U= 2ρl gR2 , 9µ (5.64) which seems appropriate if wall effects are negligible [16], as they would be for bubbles which originate in the liquid bulk. Neglecting the inertial terms in the Rayleigh-Plesset equation, we have 2γ . (5.65) pb = p∞ + R We set p∞ (t) = pa + ps (t), where ps = Ps sin (ωt) is the ultrasound field. Ps can be estimated as follows: Assuming the plate of the surger unit oscillates sinusoidally with amplitude A, its position is given by z = z0 − A sin (ωt). The corresponding acceleration is g = z̈ = ω2 A sin (ωt). This creates the pressure field ps = ρl g (h − z) = ρl ω2 A (h − z) sin (ωt) , 167 Figure 10: The fluxes into the shaded region ∆R∆z in phase space (R, z). inside the liquid (as verified experimentally in a vibrating vessel [20]). Inserting appropriate values (ρl = 103 kg m−3 , ω = 250π × 103 rad s−1 , A = 89 nm, h = 10 cm), we find that Ps varies from 55 bar at the bottom (z = 0) of the liquid to 0 bar at the top (z = h). From Figure 8, we can see that 55 bar is above the threshold amplitude for most of the precritical bubbles. It does, however, seem like an unusually large pressure and there is uncertainty regarding the exact value of A. For simplicity, we ignored the dependence of ps on z. Now suppose that the bubble distribution in the liquid is given by F (R; z, t) such that FdRdz is the number of bubbles with radii from R to R + dR at heights from z to z + dz at time t. As shown in Figure 10, the rate of increase of F (R; z, t) ∆R∆z, the number of bubbles in the shaded region, is given by rate of increase of number of bubbles in shaded region rate of addition of bubbles due to = bubbles growing rate of addition of precritical rate of addition bubbles by of bubbles due , + + homogeneous to bubbles rising nucleation 168 or, in mathematical notation, ∂F ∆R ∆R ∆R∆z = F R − ; z, t − F R + ; z, t Ṙ∆z ∂t 2 2 ∆z ∆z + F R; z − , t − F R; z + , t U∆R + s (R; t) ∆R∆z, 2 2 where s (R, t) ∆R∆z is the number of precritical bubbles being created or destroyed in the shaded region per unit time. Dividing both sides by ∆R∆z and taking the limit as ∆R, ∆z → 0 yields ∂F ∂F ∂F = − Ṙ − U + s (R; t) . ∂t ∂R ∂z The dissolved gas concentration is given by rate of increase rate of addition of mass of gas of mass of gas = per unit volume per unit volume by diffusion of liquid or, in mathematical notation, rate of loss of mass of gas per − unit volume by bubble growth (5.66) , Z ∂2 c l d 4 3 ∂cl =D 2 − F πR ρb dR. ∂t ∂z dt 3 Using the perfect gas law to eliminate ρb , we obtain Z d R3 pb ∂cl ∂2 c l 4π F =D 2 − dR. ∂t ∂z 3Rs T dt (5.67) Equations (5.63) to (5.67) form a closed set for {R, U, pb, F, cl} and govern the growth of an initial distribution of bubbles in the presence of an ultrasound field ps and a source s of precritical bubbles. However, solving these equations is complicated by the problem of choosing the time step. A naïve simulation would use the ultrasound period 2π/ω ≈ 8 µs over the duration of the surge and settle (∼ 1 min). To speed up the simulation, we instead split it into two parts, the first part of which computes the rate at which postcritical bubbles are being produced by the ultrasound field, and the second part of which uses this rate to study the behaviour of the postcritical bubbles. The final results from this part are shown in Figures 11, 12, and 13. Figure 11 shows the bubble radii as a function of time, each bubble corresponding to one cross. Initially, all the bubbles are precritical and thus their radii are clustered together. As time progresses, the radii increase with the maximum radius being about 400 µm. As bubbles reach the top of the liquid, they are removed from the graph. As time progresses further, the mean bubble radius drops to about 50 µm because the bubble growth rate decreases as the dissolved gas concentration in the liquid falls. In Figure 12, the corresponding heights of the bubbles are shown. Finally, the height of the head is shown in Figure 13; it is approximately 12 mm after 5 s. 169 Figure 11: Bubble radii as a function of time. Figure 12: Bubble heights as a function of time. 170 Figure 13: Head height as a function of time. 6 Conclusions Diageo asked us to develop a mathematical model describing bubble formation and growth in Guinness. To this end, we considered the growth of single bubbles by passive, rectified and convective diffusion. In the passive case, the role of the critical radius Rc was emphasised: in the absence of a sound field, precritical (R < Rc ) bubbles shrink. In the presence of a sound field, we showed that precritical bubbles will grow by rectified diffusion provided that the amplitude of the sound field is above the threshold value given by (5.54). We also considered how the growth of bubbles by convective diffusion is modelled mathematically. Applying these ideas to the surger unit, and some new ideas regarding the evolution of the bubble distribution and the dissolved gas concentration, we successfully modelled the initiation of Guinness. The results, however, are more qualitative than quantitative since 1. We have completely ignored the dissolved CO2 . Since the CO2 concentration is much higher than the N2 concentration, it must surely have a significant effect on the growth of the bubbles (cf. Section 8). In particular, a model taking both gases into account should be able to explain quantitatively why the head on Guinness does not form spontaneously. Furthermore, there may also be some water vapour in the bubble, although for low temperatures the ratio of vapour to gas should remain small. The surger equations, Equations (5.63) to (5.67), are easily modified to take the CO2 into 171 account by replacing Equation (5.63) by 1 d R3 xi pb = ki R2 (cl,i − Hi xi pb ) , 3Rs,i T dt where we are now using the mass transfer coefficient ki , and Equation (5.67) by Z d R3 xi pb ∂2 cl,i 4π ∂cl,i F = Di − dR, ∂t ∂z2 3Rs,i T dt (6.1) (6.2) where xi is the mole fraction of gas species i in the bubble so that X xi = 1. Equations (6.1) and (6.2) constitute four equations, two each for the CO2 (i = CO2 ) and the N2 (i = N2 ). 2. We have not considered the diffusion of gas across the top surface of the Guinness and into the atmosphere. This could be significant: for champagne, 80% of the dissolved CO2 molecules escape by this method [12]. 3. For sufficiently large bubbles and high frequencies, the isothermal (T = const) approximation becomes questionable [21]. 4. Assuming that the rise velocity is simply the Stokes velocity is an over-simplification. In practice, the high number of bubbles creates a complex flow pattern, with liquid and bubbles rising in the centre of the glass and liquid and small bubbles descending at the edges. 5. The bubble distribution and dissolved gas concentration will vary horizontally as well as vertically. 6. The addition of surfactants significantly affects the accuracy of theoretical predictions regarding rectified diffusion [18]. 7. We studied noninteracting spherical bubbles, but bubbles can deform, oscillate, break up and coalesce. A noticeable feature of using the surger unit is the slight delay (∼ 1 s) before the surge of nitrogen bubbles is seen. It is possible that this delay is due to the so-called Bjerknes force holding the bubbles in place while the surger unit is vibrating [20]. When the surger unit stops vibrating (it only vibrates for 1 − 2 s after pressing the activation button), there is no Bjerknes force and the bubbles are free to rise. As they rise, some bubbles move towards the glass walls where they become visible. Although we focused on the surger, the equations we have used can be applied to other methods of initiation simply by modifying the input pressure p∞ (t) and the source term s (R; t). 172 At the Study Group, some ideas for new initiation methods were discussed. Perhaps the most interesting is the possibility of using a ‘nanowidget’; that is, an insert at the bottom of the can which has nanometer features, making it superhydrophobic. When liquid is poured over such a surface, gas is trapped in the cavities between the features. Upon pressurising the can, some of the gas will dissolve. Upon opening the can, the surviving gas pockets will act as nucleation sites, initiating the Guinness. Of course, the effectiveness and cost of such a device would have to be investigated. Another interesting idea is to coat the inside of the can with a surfactant to promote bubble nucleation. For example, it is known that the surfactant coating on Mentos (gum arabic) is partly responsible for the explosivity of the Diet Coke and Mentos reaction [22]. 7 List of symbols In the following list of symbols, the corresponding SI unit, if any, is given in brackets. A amplitude of oscillation of surger plate (m) Cd drag coefficient c gas concentration in liquid (kg m−3 ) cb gas concentration in liquid at bubble surface (kg m−3 ) cl gas concentration in liquid bulk (kg m−3 ) c∞ gas concentration in liquid at r = r∞ (kg m−3 ) D diffusivity of gas in liquid (m2 s−1 ) F bubble distribution function (m−2 ) FB buoyancy force (N) Fc capillary force (N) Fd drag force (N) f function defined in Equation (5.62) g acceleration due to gravity (m s−2 ) H Henry’s law constant (kg m−3 Pa−1 ) h height of liquid (m) K function defined in Equation (5.52) k mass transfer coefficient (m s−1 ) L circumference of nucleation site (m) 173 M molar mass (kg mol−1 ) nb = pb Vb / (Rg T ) number of moles of gas in bubble (mol) Ps amplitude of sound field (Pa) Ps,T threshold amplitude of sound field (Pa) Pe = 2RU/D Peclet number p partial pressure of gas above liquid (Pa) pa atmospheric pressure (Pa) pb gas pressure in bubble (Pa) pl liquid pressure (Pa) ps = Ps sin (ωt) sound field (Pa) p0 gas pressure in sealed container (Pa) p∞ liquid pressure at r = r∞ (Pa) q mass flux (kg s−1 ) R bubble radius (m) Ṙ = dR/dt time derivative of R (m s−1 ) Rc critical radius (m) Re ‘equilibrium’ radius of a pulsating bubble (m) R0 initial radius (m) R∞ ultimate radius (m) Re = 2ρlRU/µ Reynolds number Rg universal gas constant (J K−1 mol−1 ) Rs = Rg /M specific gas constant (J K−1 kg−1 ) r distance from centre of bubble (m) r∞ catchment radius (m) S supersaturation ratio Sh = 2kR/D Sherwood number s precritical bubble source (m−2 s−1 ) T temperature (K) 174 t time (s) (ur , uθ , uφ) liquid velocity components (m s−1 ) u = ur radial component of liquid velocity (m s−1 ) U velocity of a rising bubble (m s−1 ) Vb = 43 πR3 volume of bubble (m3 ) x mole fraction of gas z distance measured upward from bottom (z = 0) of liquid (m) α function defined in Equation (5.51) β function defined in Equation (5.53) γ surface tension (N m−1 ) ∆p = p0 − pa (Pa) ∆c = cl − cb (kg m−3 ) δ = ρl D2ε2 / pa R2c small parameter δN Nernst diffusion layer thickness (m) ε = H∆p/ρa small parameter θ colatitude coordinate in spherical coordinates (r, θ, φ) (rad) θc contact angle (rad) κ bubble growth rate (m s−1 ) Λ = R∞ /Rc large parameter µ dynamic viscosity of liquid (Pa s) ρa = pa /Rs T density scale (kg m−3 ) ρb gas density in bubble (kg m−3 ) ρl liquid density (kg m−3 ) τ = R2c / (Dε) time scale (s) φ longitude coordinate in spherical coordinates (r, θ, φ) (rad) ω angular frequency of oscillation of surger plate (rad s−1 ) 175 8 Gas concentrations in Guinness Dissolved gas concentrations are frequently given in g l−1 (grams of dissolved gas per litre of solution) or g kg−1 (grams of dissolved gas per kilogram of solution). They can also be expressed in parts per million (ppm), Volumes (Vol.), or as a volume over volume percentage (% v/v). These three units are defined by ppm of dissolved gas = ( g kg−1 of dissolved gas) × 103. volume of dissolved gas at 0 ◦ C and 1 atm , volume of solution % v/v of dissolved gas = (Vol. of dissolved gas) × 100. Vol. of dissolved gas = Thus, to convert from g kg−1 to ppm, we multiply by 103 . For example, the desired CO2 concentration in Guinness is about 2 g l−1 . If 1 l of Guinness has a mass of 1 kg, then this is equivalent to 2 g kg−1 or 2000ppm. To convert to Vol., we must first determine the conversion factor from g l−1 to Vol.. From the perfect gas law, one mole of gas occupies 22.4 litres at 0 ◦ C and 1 atm. Thus, one gram of gas occupies 22.4/M litres, where M is the molar mass of the gas in g mol−1 . Hence, Vol. of dissolved gas = g l−1 of dissolved gas × Since the molar mass of CO2 is 44.0 g mol−1 , 22.4 . M Vol. CO2 = g l−1 CO2 × 0.509. Thus, to convert from g l−1 CO2 to Vol. CO2 , we multiply by 0.509, so that 2 g l−1 CO2 is equivalent to 1.02Vol. CO2 . This means that the CO2 in one litre of Guinness would occupy 1.02 litres if brought to a temperature of 0 ◦C and a CO2 gas pressure of 1 atm. Since the molar mass of N2 is 28.0 g mol−1 , Vol. N2 = g l−1 N2 × 0.800. To obtain more accurate conversion factors, the deviation from perfect gas behaviour must be taken into account. To convert from Vol. to % v/v, we multiply by 100. Thus, the desired CO2 concentration is 102% v/v. Henry’s law relates the solubility c of a gas to its partial pressure p above the solution: c = Hp, where H is the Henry’s law constant. The Henry’s law constants of CO2 and N2 in pure water are given in table 8 for different temperatures. The desired CO2 concentration is about 2 g l−1 From Henry’s law this requires a partial pressure of cCO2 2 g l−1 pCO2 = = 0.8 bar . = HCO2 2.5 g l−1 bar−1 176 Temperature (◦ C) 0 8 15 20 HCO2 (g l−1 bar−1 ) 3.42 2.50 2.05 1.76 HN2 (g l−1 bar−1 ) 0.0283 0.0241 0.0209 0.0192 Table 8: Henry’s law constants for CO2 and N2 in pure water. Sources: [23] and [24]. In the sealed container the total pressure is 3.8 bar. Thus the partial pressure of N2 in the headspace (and widget) is pN2 = 3 bar. From Henry’s law this yields a solubility of cN2 = HN2 pN2 = 0.0241 g l−1 bar−1 (3 bar) = 0.0723 g l−1 , or 5.78% v/v. If 1 l of Guinness has a mass of 1 kg, then the solubility of N2 is 72.3ppm. The mole fractions in the gas phase are xCO2 = 0.8 bar = 0.21, 3.8 bar xN2 = 3 bar = 0.79, 3.8 bar so that the gas mixture is 21 mole % CO2 and 79 mole % N2 . Suppose now that the container is opened and the gas concentrations are allowed to come to equilibrium with their partial pressures in the atmosphere at 15 ◦ C. The partial pressures of CO2 and N2 in air are about 3 × 10−4 bar and 0.78 bar, respectively. Hence, from Henry’s law, the corresponding solubilities are cCO2 = 2.05 g l−1 bar−1 3 × 10−4 bar = 0.0006 g l−1 , cN2 = 0.0209 g l−1 bar−1 (0.78 bar) = 0.016 g l−1 . It follows that about 0.0282 g of N2 and 1 g of CO2 must escape from every pint (∼ 500 ml) of Guinness to bring the dissolved gas concentrations into equilibrium with their atmospheric partial pressures. This corresponds to about one billion 50 µm radius bubbles, 50 µm being close to the average bubble radius [19]: total number of excess moles of gas in liquid total number = of bubbles total number of moles in one bubble 1 g /MCO2 + 0.0282 g /MN2 = ≈ 109 bubbles. nb This number of bubbles would create a head 14 cm thick! In practice, most of the CO2 cannot escape as bubbles and must instead diffuse through the free liquid surface. To see why, let us consider what the CO2 partial pressure, xCO2 pb , in a postcritical bubble tends towards. From Henry’s law CO2 will diffuse into the bubble if the dissolved CO2 concentration cCO2 in the liquid is greater than HCO2 xCO2 pb . Noting that initially cCO2 = HCO2 (0.8 bar) and that for a postcritical bubble pb ≈ 1 bar, we find that dissolved CO2 diffuses into the bubble if xCO2 < 0.8. 177 1.4 R/(50µm) x 1.2 CO 2 1 0.8 0.6 0.4 0.2 0 0 1 2 3 4 5 t/0.94s Figure 14: R and xCO2 versus t for R (0) = 1 µm and xCO2 (0) = 0.0003. The CO2 mass flux into the bubble will be greater than the N2 mass flux because of the higher concentration of CO2 . Hence, the CO2 mole fraction in the bubble will tend towards 0.8. In fact, solving (6.1) for the case of stationary bubbles (Sh = 2, ki = Di /R) and constant dissolved gas concentrations (cl,i = const), we find that the CO2 mole fraction tends towards 0.7 (figure 14). It is therefore appropriate to consider how many bubbles, each with xN2 = 0.3 and R = 50 µm, can be created before the N2 runs out: total number of excess N2 moles in liquid total number = of bubbles total number of N2 moles in one bubble 0.0282 g /MN2 ≈ 1.5 × 108 bubbles. = xN2 nb This reduces the height of the head to about 2 cm, which is reasonably close to the observed value of about 1 cm considering the approximations made. The amount of excess CO2 left dissolved in the liquid when the N2 runs out is given by CO2 mass in initial excess dissolved mass of − = one bubble CO2 mass excess CO2 left number × of bubbles = 1 g − (MCO2 xCO2 nb ) 1.5 × 108 ≈ 0.9 g . Hence, 90% of the excess CO2 remains and can only escape by diffusing through the free surface. 178 Acknowledgements The authors would like to thank Diageo, and especially their representatives Conor Browne and Richard Swallow, for bringing this interesting problem to the Study Group and for answering all our questions. The authors would also like to acknowledge the support of the Mathematics Applications Consortium for Science and Industry (MACSI) funded by the Science Foundation Ireland mathematics initiative grant 06/MI/005. 179 Bibliography [1] Guinness Storehouse website. www.guinness-storehouse.com [accessed 27th July 2009]. [2] Campaign for Real Ale website. www.camra.org.uk [accessed 25th August 2009]. [3] Bamforth C.W. (2004). The relative significance of physics and chemistry for beer foam excellence. J. Inst. Brew., 110 (4), 259-266. [4] Walker J. (1981). Reflections on the rising bubbles in a bottle of beer. Am. Sci., 245, 124-132. [5] Shafer N.E. and Zare R.N. (1991). Through a beer glass darkly. Physics Today, 44 (10), 48-52. [6] Weiss P. (2000). The physics of fizz. Sci. News, 157 (19), 300-302. [7] Liger-Belair G. (2003). The science of bubbly. Sci. Am., 288 (1), 80-85. [8] Zhang Y. and Xu Z. (2008). “Fizzics” of bubble growth in beer and champagne. Elements, 4 (1), 47-49. [9] Wilt P.M. (1986). Nucleation rates and bubble stability in water-carbon dioxide solutions. J. Colloid Interface Sci., 112 (2), 530-538. [10] Gardner R.J. (1973). The mechanism of gushing - A review. J. Inst. Brew., 79, 275-283. [11] Planinšič G. (2004). Fizziology. Phys. Ed., 39, 65-68. [12] Liger-Belair G. (2005). The physics and chemistry behind the bubbling properties of champagne and sparkling wines: A state-of-the-art review. J. Agric. Food Chem., 53, 2788-2802. [13] Jones S.F., Evans G.M. and Galvin K.P. (1999). Bubble nucleation from gas cavities - a review. Adv. Colloid Interface Sci., 80, 27-50. [14] Liger-Belair G., Polidori G. and Jeandet P. (2008). Recent advances in the science of champagne bubbles. Chem. Soc. Rev., 37, 2490-2511. [15] Liger-Belair G. (2002). Kinetics of gas discharging in a glass of champagne. Langmuir, 18, 1294-1301. 180 [16] Liger-Belair G., Marchal R., Robillard B., Dambrouck T., Maujean A., Vignes-Adler M. and Jeandet P. (2000). On the velocity of expanding spherical gas bubbles rising in line in supersaturated hydroalcoholic solutions. Langmuir, 16, 1889-1895. [17] Eller A. and Flynn H.G. (1965). Rectified diffusion during nonlinear pulsations of cavitation bubbles. J. Acoust. Soc. Am., 37, 493-503. [18] Crum L.A. (1980). Measurements of the growth of air bubbles by rectified diffusion. J. Acoust. Soc. Am., 68, 203-211. [19] Robinson M., Fowler A.C., Alexander A.J. and O’Brien S.B.G. (2008). Waves in Guinness. Phys. Fluids, 20, 067101. [20] Crum L.A. (1975). Bjerknes forces on bubbles in a stationary sound field. J. Acoust. Soc. Am., 57, 1363-1370. [21] Apfel R.E. (1981). Acoustic cavitation prediction. J. Acoust. Soc. Am., 69, 1624-1633. [22] Coffey T. (2008). Soda pop fizz-ics. The Physics Teacher, 46, 473-476. [23] Horvath A.L. (1975). Physical properties of inorganic compounds - SI units. Arnold, London. [24] Murer G. (2005). Measuring dissolved gases in packaged beverages. Brauwelt International, 2, 100-107. 181 182 Solar Reflector Design Report Contributors: Afonso Bandeira1 , Christopher G. Bell 2, Jean P.F. Charpin3,4, Romina Gaburro3 , Sofiane Soussi3 and R. Eddie Wilson5 Study Group Contributors: Giles Richardson6 and Galin Ganchev3 Industry Representative: Sean Hoolan7 1 Universidade de Coimbra, Portugal Department of Bioengineering, Imperial College London, United Kingdom 3 MACSI, department of Mathematics and Statistics, University of Limerick, Limerick, Ireland 4 Report coordinator, jean.charpin@ul.ie 5 Department of Engineering Mathematics, University of Bristol, United Kingdom 6 School of Mathematics, University of Southampton, United Kingdom 7 Erin Energy Ltd, Applied Research into Solar Technologies, MIRC Building, Athlone Institute of Technology, Dublin Road, Athlone, Co Westmeath, Ireland 2 Abstract The design of solar panels is investigated. Different aspects of this problem are presented. A formula averaging the solar energy received on a given location is derived first. The energy received by the collecting solar panel is then calculated using a specially designed algorithm. The geometry of the device collecting the energy may then be optimised using different algorithms. The results show that for a given depth, devices of smaller width are more energy efficient than those of wider dimensions. This leads to a more economically efficient design. 184 Mirror Energy absorber Figure 1: Typical configuration 1 Introduction In recent years, considerable research was carried out on renewable sources of energy. Designing cheap and efficient devices has become a priority for the industry. Erin Energy, based in Athlone, Westmeath, Ireland, is focusing on solar energy. A typical example of design they study may be seen on Figure 1. This device should be placed on the roof. The solar energy is collected by the energy absorber, transported and then stored in a chemically based energy tank. Ideally, the roof should be completely covered with energy collectors but they are quite expensive. To reduce the amount of collectors necessary, the roof is covered with mirrors. The solar energy may then be collected either directly or after one or more reflections on the mirrors. The shapes of the mirrors and collectors are key to the efficiency of the system. Erin Energy currently investigates a cylindrical collector coupled with parabolic or involute shaped mirrors. They would like to evaluate the efficiency of their current design and if possible, find more optimised shapes. This problem was submitted to the the 70th European Study Group in Industry held in Limerick in June-July 2009. This report reflects the work of the group. Different directions were investigated to tackle this problem. First of all, the incoming solar energy was considered. This parameter varies with the position of the device, its inclination, the time of the day and the time of the year. These different aspects were considered and are detailed in Section 2. The reflections of rays and energy over the mirror was then considered. An exact and fast ray tracing algorithm was developed in MATLAB and is presented in Section 3. Several methods were then considered to optimise the shape of the mirrors. An approach based on inverse methods is introduced in Section 4. An alternative was developed to maximise the direct and first reflections. This aspect is detailed in Section 5. Finally a more general optimisation based on a pattern search method was used and the results may be found in Section 6. 185 Aperture Mirror Absorber Figure 2: Sketch of a trough-like solar concentrator. 2 Model for annual energy collection To calculate the annual energy collected by a stationary trough-like solar concentrator such as that pictured in Figure 2 requires two steps. Firstly we must find the energy incident on the aperture as a function of the position of the sun, and then we must calculate how much of this energy is transmitted to the absorber via reflections from the mirror. In all of this work, we only consider direct beam radiation and ignore the effects of diffuse radiation. For a concentrator situated at latitude angle φ, a simple model for the total annual energy collected may be written as Z δmax Z ωlim (δ, φ) F(δ, ω, φ) T (δ, ω, φ) dω dδ. (2.1) Eannual = A −δmax 0 The energy function F(δ, ω, φ) models the energy striking a unit area of the aperture, and multiplying by the area A of the aperture gives the total energy entering the aperture. The transmission function T (δ, ω, φ) (0 ≤ T ≤ 1) models the fraction of the energy entering the aperture which hits the absorber. Integrating over δ and ω sums the total energy collected over the year. The parameter δ is the declination angle [1, 2], which varies between ±δmax = ±0.40928 radians and determines the height of the Sun above the equatorial plane throughout the year. The variation of δ as a function of day number N, with N = 1 being the 1st January, is depicted in Figure 3. The rotation of the Earth is measured by the hour angle, ω [1, 2], and the range of integration [0, ωlim(δ, φ)] is determined by the hours of daylight available to the concentrator. This 186 Solstice, Jun 21st/22nd δ, radians 0.4 0.3 Equinox, Sep 23rd 0.2 0.1 0 100 K 200 300 ν 0.1 N K 0.2 K 0.3 K 0.4 Equinox, Mar 22nd Solstice, Dec 21st/22nd Figure 3: Graph of how the declination angle δ varies as a function of the day number N. depends both on the height of the Sun in the sky (which depends on δ and φ) and on the orientation and angle of tilt of the concentrator. The function F(δ, ω, φ) models the intensity of the light striking the aperture of the concentrator as a function of δ and ω, corresponding to particular times of day and season. A general formula for F(δ, ω, φ) may be written as [3] 43200 × 365.25 × 2π2 0.034 2π 1+ cos × 173 sin δ × sin(δmax ) 365.25 k × a0 + a1 exp − cos δ cos ω cos φ + sin δ sin φ s(δ, ω, φ) · n× F(δ, ω, φ) = 4 × Isc × cos δ q . (2.2) sin2 (δmax ) − sin2 δ In this formula, the first line quantifies the amount of radiation striking the outer edge of the earth’s atmosphere. The parameter Isc = 1367 W m−2 is the solar constant [2] and the multiplicative factor results from changing the integration parameters from time to angle. The second line adjusts for the ellipticity of the earth’s orbit around the Sun [2]. The third line models the reduction in the intensity of the light striking the earth’s surface depending on how far it has travelled through the atmosphere. We use the Hottel model [4], and we take the 23km Visibility Haze parameters at zero altitude: a0 = 0.1281, a1 = 0.7569, k = 0.3872. (2.3) The fourth line measures the ‘cosine effect’ and modifies the energy striking the aperture according to the angle of incidence. Here s(δ, ω, φ) is a vector pointing in the direction of the 187 Sun and n is the normal vector to the aperture of the collector. Finally the last line comes from the Jacobian due to the change in variables of integration from time to angle. For trough-like concentrators, the problem of calculating the fraction of energy, T (δ, ω, φ), transmitted from the aperture to the absorber is essentially two-dimensional. Away from the ends of the trough, the component of the energy flux along the longitudinal axis may be ignored and the energy collected is simply a function of the angle of incidence in the plane orthogonal to the longitudinal axis of the concentrator. We call this angle ξ(δ, ω, φ) (−π/2 ≤ ξ ≤ π/2), and measure it relative to the normal to the aperture of the concentrator as shown in Figure 4. This angle of incidence is a function of the position of the Sun (through δ, ω and φ), and also depends on the orientation and tilt of the concentrator. For a particular shape of mirror and position of absorber, it is easy to calculate T (ξ) for any angle of incidence ξ by using ray tracing, and if we know how ξ depends on δ, ω and φ, we may calculate the energy integral. n ξ s1 Aperture Figure 4: Sketch showing the angle of incidence ξ in the plane orthogonal to the longitudinal axis of the concentrator. The vector s1 is the projection of the vector s pointing in the direction of the Sun into this plane and n is the normal to the aperture. To assess the performance of a concentrator, there are therefore four functions to determine before the annual energy may be calculated • ωlim (δ, φ) - the hours of daylight available to the concentrator; • s(δ, ω, φ) · n - the ‘cosine effect ’ due to the angle of incidence of the energy onto the aperture; • ξ(δ, ω, φ) - the angle of incidence in the plane orthogonal to the longitudinal axis of the concentrator; • T (ξ) - the fraction of incident energy transmitted to the absorber as a function of the angle of incidence ξ. The first three functions depend on the orientation and tilt of the concentrator, while the last depends on the shape of the mirror and the position of the absorber. 188 In reference [3], we have considered the above problem for a horizontal east-west aligned concentrator tilted at an angle θn from the vertical. If we restrict the location of the concentrator to be between the Antarctic and Arctic circles so that π π (2.4) δmax − ≤ φ ≤ − δmax , 2 2 and we restrict the angle of tilt θn so that π π −φ − − δmax ≤ θn ≤ −φ + − δmax , 2 2 then it is possible to show that the hours of daylight are specified by the upper limit ωlim (δ, φ, θn ) = min arccos − tan δ tan(φ + θn ) , arccos − tan δ tan φ . (2.5) (2.6) The restriction on the angle of tilt θn is reasonable, since as a general rule of thumb using the polar mount (or θn = −φ) for an east-west concentrator is close to maximizing the annual energy incident on the aperture [1]. The ‘cosine effect’ is specified by the relationship s(δ, ω, φ) · n = cos δ cos ω cos(φ + θn ) + sin δ sin(φ + θn ), and finally the angle ξ may be calculated through the relationship [5] tan δ −1 ξ(δ, ω, φ) = −θn − φ + tan . cos ω (2.7) (2.8) For further information on determining the above functions for different orientations and tilts, see references [1, 2]. As a final comment, in reference [3] for a horizontal tilted east-west concentrator, we show how to change integration variables from (δ, ω) to (δ, ξ) so that the energy integral may be written in the form ZZ Eannual = A F(δ, ξ, φ) T (ξ) dδ dξ. (2.9) It is then easy to substitute in for the transmission function as a direct function of ξ. In the next section we show how to develop a ray tracing algorithm to calculate T (ξ). 3 3.1 Ray-tracing package Motivation and scope Later in this report we shall propose several different methods for generating reflector shapes. These are based on either heuristic techniques, or on the formal optimisation of energy return in some highly simplified setting — for example, we may neglect all secondary reflections or 189 consider only a reduced model for the trajectory of the sun through the sky and its effective intensity as a function of inclination. Whatever approach is used to generate reflector shapes, there is a need to definitively evaluate the return in a realistic setting which captures the full details of energy collection throughout the year and which may also cope with extra effects such as diffuse radiation (which is thought to be important in an Irish setting). This definitive evaluation can only be carried out numerically, and the computational framework that we implemented is based on the classical approach of ray-tracing. There are in fact many ray-tracing packages already in the public domain which are equal to our task, but none that we found are specialised to the particular geometry etc. of solar reflector design, yet flexible enough for our computational experiments. In summary, it seemed best to implement our own ray-tracing package during the Study Group itself, using the high-level computational language Matlab. Our code took only about one man-day to complete and forms the basis of a very pleasing tool with which reflector design can be investigated. The user has access to all the powerful programming and visualisation features of the Matlab environment, yet at the same time, the computational kernel is sufficiently fast that meaningful calculations can be carried out almost instantaneously on even modest laptop hardware. Later in the report, we describe how we drove the ray-tracing code in the inner loop of a black box optimisation algorithm, thus enabling formal optimisation of the reflector profile even for realistic models of incident radiation. This is an extremely demanding problem from the computational point of view, yet the optimisation proved tractable provided the search was restricted to relatively simple reflector designs. Of course, it will be possible to accelerate run times significantly, and thus attempt more sophisticated optimisation (over wider search spaces and resolving more detail of the reflector profile), by either translating the computational kernel into a low-level language such as C or Fortran, or porting the code to specialised parallel hardware such as GPU machines. These tasks remain for future work, as does the development of a polished graphical front-end. Since we consider only reflectors with a fixed cross-section, we may simplify matters by only tracing rays constrained to the plane of the cross-section. In reality, at most times of day real-world rays will have an out-of-plane component, but this may be compensated for by a reduction in the intensity factor. Here we assume that the reflector is much longer than it is wide so that end-effects may be neglected. 3.2 Interface to the computational routine In summary, we need only perform ray-tracing in two dimensions, where the reflector is modelled by a curve and the absorber element by a circle. As a final simplification, we disretise the reflector by approximating it by a set of straight line segments, since the intersection of 190 straight lines (rays) with straight lines (reflector segments) is a particularly simple computational problem. The accuracy of results may then be refined by increasing the number of line segments used in the discretisation: order several hundred to a thousand segments is tractable with standard hardware. The inputs to our routine are thus: • The radius R > 0 of a single circular absorber element krk = R, with centre placed at the origin, so that all other coordinates are expressed relative to it. • A collection of i = 1, 2, . . . , m straight line reflector segments, described in terms of the position vectors a(i) and b(i) of their end-points. – In practice, we will often have a(2) = b(1) , a(3) = b(2) , . . ., so that the segments join together to give a contiguous curve, but these conditions are not required by the code, so reflectors consisting of several disconnected components may also be tested. (j) • A collection of j = 1, 2, . . . , n rays, prescribed in terms of their starting points r0 and (j) unit directions ^t . In practice, this ensemble may consist of: 1. Sets of parallel rays with different starting points arranged uniformly along a straight line segment above the reflector so that the complete set of rays is incident across the width of the reflector’s aperture. This set-up tests the energy collection for a single incident angle. 2. Ensembles of ensembles of type 1., so many incident angles can be tested simultaneously. 3. Ensembles of non-parallel rays to test diffuse radiation. • The maximum number p of reflections that should be computed for each individual ray. The output of the routine is: • The eventual ‘fate’ of each ray. The possibilities are: 1. Ray hit absorber element after k reflections on the solar reflector, 0 ≤ k ≤ p. 2. Ray ‘escapes to infinity’ after k reflections, 0 ≤ k ≤ p, so that it could not be incident upon the absorber even if the parameter p were increased. 3. Ray is ‘in-play’ after p reflections, in that it has yet to hit the absorber or escape to infinity. The fate of the ray can only be decided by experimenting with larger values for p. In practice we fix p at a sufficiently large value (e.g. 5 or 6) so that only a very small proportion of rays fall in this category. 191 β Figure 5: Simple output from the ray-tracing routine. A small ensemble of parallel rays are shown incident upon a parabolic reflector discretised into 200 straight line segments. Reflections to infinity are not shown. In this configuration no ray is reflected more than twice, but larger numbers of reflections may be tracked. Here the circular absorber intercepts the caustic of the reflector and so is particularly efficient at capturing the incident radiation. In verbose mode, the user also has access to diagnostic information giving the full itinerary of each ray in terms of the indices of the reflector segments that it hit, and the points at which it was reflected. The latter data may be used to drive a plotting routine: example output is shown in Fig. 5. An energy collection calculation must then weight the output of the routine according to the nature of each ray (position of the sun etc., incorporating the factors from Section 2) which hits the absorber and by αk , where k is the number of reflections it has suffered in the mean time and α is the reflectivity. Rays which are of types 2. or 3. are given zero weight. The total energy return is then just the scalar obtained by summing over all rays. 3.3 Computational method We now give a brief description of how the computational kernel works. In essence, it operates serially over the reflection number k = 1, 2, . . . , p but for a given k the code is vectorised over all rays and reflector segments. This vectorisation means all the hard number crunching is performed internally in Matlab commands rather than by interpreted loops — and this is the key factor in the code’s speed. This structure also means that the code should port to parallel architectures rather simply. • For reflection numbers from k = 0 to k = p in sequence: – For all combinations of admissible reflector segments i = 1, 2, . . . , m and rays j = 1, 2, . . . , n, find potential intersections by solving the 2 × 2 system (j) (j) λ(i,j) a(i) + (1 − λ(i,j) )b(i) = r0 + s(i,j)^t for λ(i,j) and s(i,j) simultaneously. 192 (3.1) – Mark intersections as allowable if 0 ≤ λ(i,j) < 1 and s(i,j) > 0, since each reflector (j) component is a finite line segment and each ray is in fact a half-line starting at r0 . Here we consider half-open reflector segments to avoid double counting problems where the segments are knotted together at their end-points. – For each ray j, compute s∗(j) = min s(i,j) , (3.2) allowed i (j) and let i∗ denote the arg min. This means that ray j will next hit reflector segment (j) (j) i∗ after distance s∗ , unless it hits the absorber first. – For each ray j, also consider intersections with the absorber by solving the quadratic (j) (j) kr0 + s̃(j)^t k2 = R2 (3.3) for s̃(j) . Here we are of course interested only in real solutions with s̃(j) > 0. In the case of two such solutions, we take the least (corresponding to the first intersection). In fact, the case of real solutions with opposite signs will generally not occur in practice since this corresponds to rays that start inside the absorber element! – Some control logic is now applied to each ray (but this is also vectorisable). If there is an allowable s̃(j) > 0 then either (j) - s̃(j) < s∗ , in which case ray j next hits the absorber element, and the number of reflections k(j) suffered so far is recorded. No further computation is required for this ray and it should be removed from the computation: however, the vectorisation works most efficiently if it is left in the computation with a dummy flag indicating that the control logic steps should not be applied to it in future. (j) (j) - s̃(j) > s∗ , in which case ray j next hits reflector segment i∗ . In this case, the (j) new starting point r0 of the ray is re-set to (j) (j) r0 + s∗(j)^t , (j) and its new unit direction ^t (3.4) is then set to (j) (j) ^t(j) − 2(^t(j) · n ^ (i∗ ) )^ n(i∗ ) , (3.5) corresponding to reflection, where n ^ (i) is the unit normal of reflector segment i, which is computed one-shot at the start of the routine from a(i) and b(i) . (j) Finally, reflector segment i∗ is marked as an inadmissible hit for ray j at the next iteration and remains so for further iterations until ray j hits some other reflector segment. This fix is to avoid ‘double hit’ problems due to rounding (j) error, where r0 is placed marginally on one (the wrong) side of the reflector segment, so that it almost instantaneously next hits the same segment. The consequent spurious effect is that some rays appear to pass straight through reflector segments without deviation — hence the admissibility flag which fixes this problem is of key importance. 193 - There are also a number of ‘empty’ cases which must be covered but which we shall not enumerate in detail. For example, in the case where there is no allowable hit with the absorber, the first hit with the reflector should still be processed. If there are no allowable / admissible hits with the reflector, the first hit with the absorber should still be processed. If there are no allowable / admissible hits with either the absorber or the reflector, then the ray in question has escaped to infinity and is marked as such. • Loop over reflection number k. Note. Here the sense n ^ (i) of the reflector segment is not considered, so in effect each reflector segment is a two-sided mirror. The generalisation to one-sided mirrors with a matt reverse side is easily achievable, but requires extra control logic which we have yet to implement. Using the models and algorithms developed in this section and Section 2, work will now focus on the shape of the mirror. Three approaches will be presented: the inverse problem will be detailed first, followed by two optimisation methods. 3.4 Computational cost The run-time of our routine scales like O(mnp), i.e., it is linearly proportional to each of the number of reflector segments, rays, and reflections considered. In addition, the memory constraints scale like O(mn) due to the Matlab vectorisation techniques used, but a C or Fortran code could be optimised to scale like O(m) in this respect. In practice, we have found that the routine takes just a couple of seconds on a laptop when mnp ∼ 106 . Clearly the bottleneck in our approach is the consideration at each iteration of all possible intersections between all rays and all reflector segments. It may be possible to speed-up matters by reducing the search space: for example, we may segment the plane into rectangular subregions and index reflector segments by which rectangle they belong to. The itinerary of each ray through the sub-regions may then be used to seek intersections √ with reduced lists of reflector segments. It seems plausible that this approach may give a O( m) or O(log m) speed-up, but with a significant coding burden which we have yet to undertake. Moreover, this refinement would break the Matlab vectorisation so would only be worthwhile for a low-level language implementation. These features remain for future work. 4 Inverse problem In this section we consider the 2-dimensional case in which the absorber is a point source located at the origin of the xy axes in R2 and the position of the Sun is given by an angle θ, for 0 ≤ θ ≤ π (anticlockwise). Although in a more realistic situation the absorber would be 194 described by a circle centred at the origin (0, 0), the case of a point source collector seems the most appropriate simplified situation to be investigated. The modification in the mathematical model required for the more realistic circle case will be considered later on. Let us recall that the goal is to find the best shape of a mirror placed nearby the source in order to maximise the number of reflections of rays coming from the Sun into the absorber. In the inverse problem we consider the following approach. The case in which radiation is emitted from the absorber is considered. The electromagnetic radiation is emitted in all directions and reflected from the nearby mirror. The goal is to model the reflected rays in term of the incident field coming from the absorber and the shape of the mirror. The inversion consists then in two steps: i) image the mirror in terms of the reflected rays, ii) find the best shape for the mirror by prescribing the reflected rays with some conditions that maximise the number of reflections from the mirror to the general receiver’s location (sun’s location). Let us specify our problem. 4.1 The mathematical model The incident field Let us denote by X = (x, y) any point in R2 . We consider the simple scalar wave equation to model the wave propagation. We make the following assumption. Assumption 4.1. We assume that the mirror is well separated from the region where the sensors are located and that in the intervening region c(x) = c0 , where c0 is the (assumed constant) speed of light in air. The above assumption is reasonable since the sensors location coincide with the sun’s location. The incident field due to a delta source located at (0, 0) at time t = 0 satisfies 1 2 ∇ − 2 ∂t Uin (t, X) = −δ(X)δ(t), c0 (4.1) therefore, at the point X, at the time t′ , it is simply the Green’s function G0 associated to the wave operator appearing on the left hand side of (4.1) Uin (t′ , X) = G0 (t′ , X) = The reflected field We start by giving two definitions. 195 δ t′ − |X| c0 4π|X| . DEFINITION 4.1. We denote by M = {(x, f(x)) | x ∈ R} the graph of some function f : R → R which represents the shape of the mirror. DEFINITION 4.2. We denote by R(θ) = (Rx (θ), Ry(θ)) the location of the receiver (the sun) at the angle θ from the ground. If c0 is the speed of light in the air introduced above, we assume that the perturbation in wave speed c takes the form V(X) := 1 1 = Ṽ(x)δ(y − f(x)), − 2 2 c0 c (X) (4.2) where V is the reflectivity function and it represents the change in speed of the ray when it hits the mirror at the point X = (x, f(x)). We write the total field U as the sum U = Uin + Urf , where Urf represents field reflected from the mirror. The total field U satisfies ∇2 − 1 ∂t U(t, X) = −δ(X)δ(t) c2 (X) (4.3) By combining (4.1) and (4.3) we obtain ∇2 − 1 ∂t Urf (t, X) = −V(X)∂2t U. c2 (x) (4.4) Linearised problem We rewrite (4.4) in integral form and make use of the Born Approximation (single scattering approximation) to replace the total field U on the right hand side of (4.4) with the incident field Uin rf U (t, θ) = FV(t, θ) = Z e−iω(t−(|R(θ)−X|+|X|)/c0 ) 2 ω V(X)dω dX. (4π)2 |R(θ) − X||X| (4.5) This approximation linearises the problem because the product of unknowns VU on the right hand side of (4.4) is replaced by the product of the unknown V with the known incident field Uin . If we denote by 196 A(X, θ, t, ω) := ω2 , (4π)2|R(θ) − X||X| we obtain rf U (t, θ) = FV(t, θ) = Z e−iω(t−(|R(θ)−X|+|X|)/c0 ) A(X, θ, t, ω) V(X)dω dX, (4.6) where ω denotes the angular frequency and F is the forward map which maps the reflectivity function into the reflected ray collected at different location θ and different travel time t. Remark 4.2. Notice that the support of V, supp(V) satisfies supp(V(x, y)) ⊂ {(x, y) ∈ R2 | y = f(x)}, i.e. finding V is equivalent of finding the curve y = f(x). 4.2 Inversion Microlocal analysis F is a Fourier Integral Operator (FIO) given by the oscillatory integral (4.6) with phase function h i φ(θ, t, x, ω) = −ω t − (|R θ − X| + |X|)/c0 (4.7) FIOs are well-studied operators and the imaging scheme planned to be used here will be based on FIO theory, which is part of microlocal analysis [6, 7, 8]). Singularity in the scene are mapped into singularity in the data by the so-called canonical relation of F, ΛF ΛF = (θ, t; σ, τ), (X, ξ) | Dω (θ, t, X, ω) = 0 σ = Dθ φ τ = Dt φ ξ = DX φ . For F given by (4.6) we have 197 (4.8) ΛF = (θ, t; σ, τ), (X, ξ) | 1 (|R(θ) − X| + |X|) c0 τ = −ω τ \ − X · Ṙ(θ) σ = R(θ) c0 τ \ b . ξ = (R(θ) − X − X) c0 t= (4.9) The first condition in (4.9) is a travel time condition t2 = |R(θ) − X|2 + |X|2 = (h cos(θ) − x)2 + (h sin(θ) − f(x))2 + x2 + f2 (x) and it represents the set of points of intersection between the ellipse with foci the sun’s location R(θ) and the absorber’s location (0, 0) with the curve y = f(x) we want to reconstruct. The second condition of (4.9) (σ-condition) σ= τ \ R(θ) − X · Ṙ(θ) c0 gives us the correct point we want to image X and its reflection across the velocity field Ṙ(θ): this is not relevant since the mirror is obviously located below the R(θ), therefore this condition will not introduce artefacts in the image. Formation of the image We use a filtered back projection operator to form the image, i.e. we apply a filtered adjoint of the forward operator to the data: this operator has the same phase as the adjoint of F, with the filter to be chosen later. The image of X is given by Z I(X) := eiω(t−(|R(θ)−X|+|X|)/c0 ) B(X, θ, t, ω)Urf(θ, t)dωdθdt. (4.10) If we combine (4.10) together with (4.6) we obtain I(X) = Z K(X, X′)V(X′ )dX′ , where 198 (4.11) ′ K(X, X ) = Z ′ e−iω((|R(θ)−X|+|R(θ)−X |)/c0 ) A(Z, θ, |X′ − R(θ)|, ω)B(X, θ, |X − R(θ)|, ω)dωdθ. The kernel K(X, X′) is the imaging point spread function and it arises because it is the image that would result from a delta function located at the point X′ . If we perform a large-ω stationary phase analysis of K, it turns out that the main contribution to K does indeed arise from the point X = X′ as predicted in subsection 2.1. This is quite a well known machinery that have been used in the community working on this kind of problem [9, 10, 11, 12, 13, 14]). By making use of the above method we can therefore obtain an image of the mirror y = f(x) in terms of the reflected rays. The work that needs to be done in the future is the understanding of how optimising the shape of the mirror would influence the general reflected ray Urf (θ, t). We believe that an approach to investigate should be the one of maximising the number of possible travel times of the total field arriving at the same location θ. 5 One reflection construction In this section the shape of the mirrors is determined to maximise the amount of energy collected by the energy collector for a given width of the aperture. The optimisation is carried out by considering direct illumination and first reflections through the entire day. The position of the collector is determined first, then the amount of reflected energy is calculated. Finally, a simple optimisation algorithm is used to determine the shape of the mirror. The geometry is determined using the following assumptions: • The length of the aperture is fixed, • The collector is a cylinder of given radius, • The device is optimised for a flat position, • The position of the collector is maximised for a Sun angle of incidence θ0 < θ < π − θ0 , where θ0 is a small given angle. 5.1 Position of the collector To maximise direct energy hits, the collector should be positioned as high as possible above the roof surface. However, if placed too high, a collector will create shadow on the next element and limit its efficiency. A typical configuration is shown on Figure 6. 199 2R h θ1 θ2 2L Figure 6: Position of the collector. A simple geometry calculation shows that the height H of the centre of the collector may be expressed as: h1 = L sin θ1 + R , cos θ1 h2 = L sin θ2 − R cos θ2 where L denotes half the length of the aperture and R is the radius of the collector cylinder. If the collector is positioned to maximise angles θ0 < θ < π − θ0 , as is obvious from the figure, the shadow of the cylinder on the next element, described by the angle θ2 , will be the limiting factor and the height of the cylinder should be: h= 5.2 L sin θ0 − R . cos θ0 (5.1) First reflection The shape of the mirror will now be defined. To start with, the energy reflected by one element of mirror will be calculated and then a simple optimisation algorithm will be developed. Energy reflected We consider a cylindrical absorber of radius R placed above a reflector having some arbitrary shape extending between edges A± with the coordinates (±d, h). The problem is supposed to be invariant by translation along the absorber axis which allows us to consider the problem in a 2D geometry. We use the Cartesian coordinate system (O, x, y) as shown on Figure 7. The Sun position is given by the angle θ between the horizontal and the line going through the Earth and the Sun centres. The Sun is assumed to have a uniform circular motion so that there is an equivalence between the variable θ and the time. For a general point M taken on 200 the reflector surface, we denote by (xM , yM ) its coordinates in our coordinate system. , τ the tangent vector oriented from A− to A+ , and ν the normal vector that is τ rotated by π/2. Finally we call α the algebraic angle between the x-axis and the vector τ. S y θ −d R xM d O −A x A h ν yM α M τ Figure 7: Problem statement. A point M of the mirror will be illuminated by the for angles of Sun verifying (5.2) θ ∈ (β− , β+ ) \ (γ− , γ+). where all angles are defined on Figure 8. This corresponds to the red area. The four angles may be expressed as function of the point position and other geometry parameters: ! ±d − xM β± = arccos p , (±d − xM )2 + (h − yM )2 ! ! xM R γ± = π − arccos p 2 ± arctan p 2 . 2 xM + yM xM + y2M The absorber will collect energy from M when M is lit and the Sun rays are reflected on the absorber. A necessary condition for the Sun rays to be reflected on the absorber by M is that θ ∈ (γ−′ , γ+′ ). where γ±′ = 2α + arccos xM p x2M + y2M ! ± arctan p R x2M + y2M ! . This is illustrated by Figure 9. Combining this result with (5.2), the absorber will collect solar energy from M exactly for θ ∈ IM := (β− , β+) \ (γ− , γ+) ∩ (γ−′ , γ+′ ). 201 S y θ −d R xM d O −A ω β+ ν γ0 γ+ β− M x A h γ− yM α τ Figure 8: Enlightenment of M. S y O′ θ γ0′ ω −d R xM d O −A ω β+ ν β− M h γ+ γ− x A γ0 yM α τ Figure 9: Image of the absorber by the reflector. 5.3 Optimisation algorithm The mirror will now be determined using the following algorithm: • The mirror is (numerically) divided into planar elements of width ds, • The algorithm starts at the extremity of the aperture, • Each element is positioned to maximise the energy reflected towards the collector. 202 50 Depth (mm) 0 -50 -100 -150 -200 -200 -150 -100 -50 0 50 Length (mm) 100 150 200 Figure 10: Mirror shape. The energy reflected by an element of size ds inclined at an angle α to the horizontal as defined on Figure 9 is: Z dE = P sin (θ − α)dθ , θ∈IM where P is the energy radiated in the normal cross section of the solar beam. The value of dE should be maximised for each mirror element with respect to the inclination angle α. Note this parameter appears in the integral but also in the limits of the interval IM . Figure 10 shows the position of the collector and the shape of the mirror for a collector radius R = 24mm and an aperture of 440mm. As could be expected, most of the collector is located outside the mirror and this guarantees maximum direct energy hits for most of the day. The mirror shape resembles a parabola but its width differs by up to 20%. The shape is about 200mm deep. This is far from ideal from a building perspective but the model only includes first reflections. The shape of the mirrors should be significantly modified when further reflections are considered, particularly at the vertical of the collector and this would certainly reduce the depth of the mirror. The shape is also dependent on the optimisation method. Another optimisation method could lead to a different mirror shape. Further investigations are required to complete the study. 6 Optimisation In this final section, the mirror shape is optimised for a device with a circular absorber located in Limerick, at latitude 50◦ N40’. The concentrator is assumed to be horizontal, aligned eastwest and polar mounted, i.e. tilted south by the latitude angle. For each shape, the annual energy collected by the device is calculated using the theory in Section 2 and the ray tracing algorithm in Section 3, which determines the transmission function, T (ξ). Optimisation of 203 the mirror shape was performed using a swarm optimisation algorithm [15] coupled with basic Monte Carlo methods, and the depth of the device was constrained to be less than or equal to the typical depth of a frame installation. The algorithm was initialised with a given shape of the mirror, such as a parabola, involute or any other adequate geometry, and specified by a specific number of points. The optimisation process is extremely time consuming as the annual energy collected must be recalculated for every change in the shape, which involves recomputing the transmission function using the ray tracing algorithm at every step. Hence in practice, the mirror shape was only defined by 20 points. The results presented in this section are calculated for the standard element width of 440mm. The figures are scaled with the collector radius; 1 unit on the figure corresponds to 24mm. The designs are evaluated using the four following parameters: • OOD is the Output Of Driver2_line is the ratio the energy collected and the actual energy available in the aperture. • EPA is the Energy Per Absorber. This may be expressed as the OOD multiplied by the size of a frame and divided by the numbers of absorbers. • TE stands for Total Energy. This is the product of the EPA and the number of absorbers. In the present study, the total size of the frame is 10. • EPC describes the total Energy Per unit Cost. This is the parameter to be maximised and may be expressed as the Total Energy TE divided by the cost of the unit frame and absorbers (40 euros for the frame and 20 euros for the absorber) 6.1 Example 1 In this first example, the half aperture corresponds to 10 times the radius of the collector. There is one collector for each standard 440mm frame. The shape of the mirror may be seen in Figure 11. The coordinates of the points are x y x y coordinate coordinate coordinate coordinate -10 2.6557 -4.4 -1.0434 -8.9 3.2461 -3.3 -2.0939 -7.8 1.0585 -2.2 -1.8143 -6.7 -0.7331 -1.1 -2.1132 This geometry leads to the following results OOD 0.2955 EPA 2.955 204 TE 2.955 EPC 4.925 -5.5 -2.0026 0 -2.0893 8 Depth (ND) 6 4 2 0 -2 -10 -5 0 Length (ND) 5 10 Figure 11: Collector 1. 6.2 Example 2 The half aperture corresponds now to 5 times the radius of the collector. There are two collectors for each standard 440mm frame. The shape of the mirror may be seen in Figure 12. 4 3 Depth (ND) 2 1 0 -1 -2 -4 -2 0 Length (ND) Figure 12: Collector 2. 205 2 4 The coordinates of the points are x y x y coordinate coordinate coordinate coordinate -5 3.6882 -2.2 -0.6145 -4.4 3.6881 -1.7 -1.0438 -3.9 1.9397 -1.1 -1.2990 -3.3 0.8423 -0.6 -1.2802 -2.8 -0.0158 0 -1.0180 This geometry leads to the following results OOD 0.5974 6.3 EPA 2.987 TE 5.974 EPC 7.4675 Example 3 In this example, the half aperture corresponds to 3.3 times the radius of the collector. There are three collectors for each standard 440mm frame. The shape of the mirror may be seen in Figure 13. 2.5 Depth (ND) 2 1.5 1 0.5 0 -0.5 -1 -1.5 -3 -2 -1 0 1 Length (ND) Figure 13: Collector 3. 206 2 3 The coordinates of the points are x y x y coordinate coordinate coordinate coordinate -3.3 2.7680 -1.5 -0.8910 -3 1.3661 -1.1 -1.0479 -2.6 0.4321 -0.7 -1.0910 -2.2 -0.1994 -0.4 -1.0156 -1.9 -0.6211 0 -1.3600 This geometry leads to the following results OOD 0.7189 6.4 EPA 2.396 TE 7.189 EPC 7.189 Example 4 In this final example, the half aperture corresponds to 2.5 times the radius of the collector. There are four collectors for each standard 440mm frame. The shape of the mirror may be seen in Figure 14. 3 2.5 Depth (ND) 2 1.5 1 0.5 0 -0.5 -1 -3 -2 -1 0 1 Length (ND) Figure 14: Collector 4. 207 2 3 The coordinates of the points are x y x y coordinate coordinate coordinate coordinate -2.5 2.4792 -1.1 -0.8888 -2.2 1.1390 -0.8 -1.0460 -1.9 0.1641 -0.55 -1.0977 -1.6 -0.3419 -0.3 -1.0092 -1.4 -0.6071 0 -1.1762 This geometry leads to the following results OOD 0.7361 EPA 1.840 TE 7.361 EPC 6.314 The EPC is maximum for two or three collectors per standard frame. The length of the aperture should therefore be reduced to maximise the cost efficiency of the device when a shallow depth is imposed. 7 Conclusion and future work This reports details several aspects of the modelling problem submitted by Erin Energy. The solar radiation was studied and a ray tracing algorithm was developed. Several techniques were then presented to calculate the shape of the device. The optimisation process shows that wide (and shallow) reflectors are not as energy efficient as smaller devices: for the standard 440mm length, two to three devices should be more economically efficient than a single optimised device when the depth of the mirror is forced to remain shallow. Future work could be done in the following directions: • The optimisation should be performed much more systematically. The present study limited the research to 20 points in the last section, many more should be used. • The optimisation process was initialised using a limited number of shapes. More should be considered, such as the shapes calculated using the inverse method or the one reflection approach. • Finally the optimisation should be performed for other orientations of the concentrator, e.g. with north-south alignment, and inclined by the angle of pitch of a typical roof. Acknowledgements The contributions of Giles Richardson and Galin Ganchev are here kindly acknowledged. All contributors would like to thank Mr Sean Hoolan from Erin Energy Ltd for introducing the problem and assisting in answering questions during the entire week. J.P.F. Charpin, R. Gaburro 208 and S. Soussi acknowledge the support of the Mathematics Applications Consortium for Science and Industry (www.macsi.ul.ie) funded by the Science Foundation Ireland mathematics initiative grant 06/MI/005. 209 Bibliography [1] J.A. Duffie and W.A. Beckman. Solar Engineering of Thermal Processes. John Wiley and Sons Inc, third edition, 2006. [2] W.B. Stine and R.W. Harrigan. Solar Energy Systems Design. John Wiley and Sons Inc, 1985. [3] C.G. Bell, M. McCulloch, H. Ockendon and J.R. Ockendon. Annual energy distribution on stationary solar collectors. To be submitted. [4] H.C. Hottel. A simple model for estimating the transmittance of direct solar radiation through clear atmospheres. Sol. Energy, 18(2):129-134, 1976. [5] A. Rabl. Comparison of solar collectors. Sol. Energy, 18(2):93-111, 1976. [6] J.J. Duistermaat. Fourier Integral Operators. Birkhauser, Boston, 1996. [7] A. Grigis and J. Sjöstrand. Microlocal Analysis for Differential Operators: an Introduction, volume 196 of London Mathematical Society Lecture Note Series. Cambridge University Press, 1994. [8] F. Treves. Introduction to Pseudodifferential and Fourier Integral Operators, volume I and II. Plenum Press, New York, 1980. [9] M. Cheney. A mathematical tutorial on synthetic aperture radar. SIAM Rev., 43:301-312, MR:1861084, 2001. [10] R. Gaburro and C. J. Nolan. Enhanced imaging from multiply scattered waves. Inverse Probl. Imag., 2:1-20, 2008. [11] R. Gaburro, C. J. Nolan, T. Dowling and M. Cheney. Imaging from multiply scattered waves. SPIE 6513, 2007. Article 651304. [12] C. J. Nolan and M. Cheney. Synthetic aperture inversion for arbitrary flight paths and non-flat topography. IEEE T. Image Process., 12:1035-1043, MR:2006858, 2003. [13] C. J. Nolan and M. Cheney. Synthetic aperture inversion. Inverse Probl., 18:221-236, 2002. 210 [14] C. J. Nolan, M. Cheney, T. Dowling and R. Gaburro. Enhanced angular resolution from multiply scattered waves. Inverse Probl., 22:1817-1834, MR:2261268, 2006. [15] A.I.F. Vaz and L.N.Vicente. A particle swarm pattern search method for bound constrained global optimization. J. Global Optim., 39:197-219, 2007. 211