Grid Economics and Business Models: A Gridbus Perspective Grid User Meet, Bengaluru, India Rajkumar Buyya Grid and Distributed Systems (GRIDS) Laboratory Dept. of Computer Science and Software Engineering The University of Melbourne Melbourne, Australia www.gridbus.org WW Grid Outline 2 Introduction Utility Networks and Grid Computing Security. Resource management, pricing and service models Global Grids and Challenges Service Oriented Grids and Grid Economy SOGA, Grid Market Directory, Grid Bank, Broker Architecture, Design and Implementation A Case Study in High Energy Physics Grid Service Broker Performance Evaluation: Experiments in Creation and Deployment of Applications on Global Grids Summary 4 Essential Utilities and Delivery Networks (1) Water (2) Electricity (3) Gas (4) Telephone 3 (5) IT services as the fifth utility (water, electricity, gas, telephone, IT) 4 eScience eBusiness eGovernment eHealth Multilingual eEducation … Leading to Grid (computing) Paradigm: Cyberinfrastructure for sharing resources •Inspired by Power Grid! •* A service-oriented/utility computing paradigm that enables seamless sharing of geographically distributed, autonomous resources (for PROFIT). •* 5 This was the original aim of building Internet although it ended up in giving birth to email! A Bird Eye View of World-Wide Grid Environment Grid Information Service Grid Resource Broker R2 R3 R5 Application database R4 RN Grid Resource Broker R6 Grid Information Service 6 R1 Resource Broker Various Types of Grid Services Computational Services – CPU cycles SETI@Home, NASA IPG, TeraGrid, I-Grid,… Data Services ASP Grid The way knowledge is acquired and managed— data mining. Data Grid Utility Computing Services 7 Interaction Grid eLearning, Virtual Tables, Group Communication (Access Grid), Gaming Knowledge Services Access to remote software/libraries and license management—NetSolve Interaction Services Knowledge Grid Application Services Data replication, management, secure access-LHC Grid/Napster Utility Grid Towards a market-based Grid computing: Leasing and delivering Grid services as ICT utilities. Computational Grid Worldwide Grid Spending After the year 2006, business popularity of Grid computing is expected to be accelerate exponentially: Especially, the financial services and ERP services is expected to take major parts in the expense Source: Insight Research Corp. 8 Grid Challenges Security Computational Economy Uniform Access Resource Discovery 9 System Management Data locality Resource Allocation & Scheduling Application Construction Network Management Some Grid Initiatives Worldwide Australia Brazil China 120million – 5 yrs Europe 450million – 5 yrs 486million – 5 yrs India 1.3 billion (Rs) Japan Nimrod-G Gridbus DISCWorld GrangeNet. 27 million APACGrid ARC eResearch UK eScience EU Grids.. and many more... Singapore NGP Global Grid Forum Australian Grid Forum Conferences: 10 1.3 billion – 3 yrs IBM On Demand Computing HP Adaptive Computing Sun N1 Microsoft - .NET Oracle 10g Satyam – Grid Practice Infosys, Wipro, TCS StorageTek –Grid.. Public Forums NAGERI Globus NASA IPG AccessGrid TeraGrid Cyberinfrasture Industry Initiatives ChinaGrid – Education CNGrid - application 1 billion – 5yrsKorea... N*Grid OurGrid, EasyGrid LNCC-Grid + many others Garuda USA CCGrid Grid HPDC E-Science http://www.gridcomputing.com 2? billion Grid (Market) Participant Types and Application Category free trading Public computing (SETI@Home, Alchemi, UD) Private enterprises (Satyam, IBM, Sun) Sharing Model National provider (K*Grid, TeraGrid, Garuda/IndiaGrid, UKGrid, AusGrid) regulation scientific 11 Application Category commercial mix-and-match (service) Object-oriented Internet/partial-P2P Network enabled Solvers Economic-based Utility / Service-Oriented Computing Nimrod-G 12 The Gridbus Project @ Melbourne: Enable Leasing of ICT Services on Demand Distributed Data WWG Gridbus World Wide Grid! On Demand Utility Computing 13 http://www.gridbus.org The TheGridbus GridbusProject Project@ @GRIDS GRIDSLab, Lab,The TheUniversity Universityof ofMelbourne: Melbourne: Toolkit -Research Applications Toolkitfor forCreating Creatingand andDeploying Deployingee-Research Applicationson onUtility UtilityGrids Grids • • Gridbus is a “open source” Grid R&D project with focus on Grid Economy, Utility Grids and Service Oriented Computing. Gridbus Middleware components include: – Alchemi: .NET-based Enterprise Grid – Grid Market Directory and Web Services – Grid Bank: Accounting and Transaction Management – Visual Tools for Creation of Distributed Applications – Grid Service Broker and Scheduling – Workflow Management Engine – GridSim Toolkit – Libra: SLA-based Resource Allocation 14 Distributed Data Gridbus Grid Economy: Methodology for Sustained Resourced Sharing and Managing Supply-and-Demand for Resources 15 New challenges of Grid Economy Resource Owners 16 I decide prices ? (economic models?) I specify them ? I translate price to resource allocation ? I enforce them ? I advertise & attract consumers ? I do accounting and handle payments? How do How do How do How do ….. I decide expenses ? I express QoS requirements ? I trade between timeframe & cost ? I map jobs to resources to meet my QoS needs? Resource Consumers How do How do How do How do How do How do ….. They need mechanisms and technologies for value expression, value translation, and value enforcement. Grid Entities and Architecture Grid consumer GSP site scheduler broker accounting 17 GSP global scheduler GSP site scheduler Resource Provider Resource Provider Market Maker End users Private enterprises National providers A Reference Service-Oriented Architecture for Utility Grids Data Catalogue Grid Bank Programming Environments Applications Sign-on Info ? Grid Explorer Job Control Agent Grid Market Services Health Monitor Grid Node N Secure Schedule Advisor QoS Grid Node1 Pricing Algorithms Trade Server Trade Manager Trading … Deployment Agent JobExec Grid Resource Broker Misc. services Resource Allocation Storage Grid Middleware Services Accounting Resource Reservation R1 Grid Consumer 18 Information Service R2 … Rm Grid Service Providers Gridbus and Complementary Technologies – realizing Utility Grid Science Commerce … MPI ExcellGrid Grid Brokers: Grid Economy Alchemi NorduGrid Windows Gridscape Unicore … XGrid JVM Solaris Collaboratories Workflow Workflow Engine Nimrod-G Globus .NET Engineering Grid Storage Economy Condor Linux … IRIX Libra Core Grid Middleware Grid Market Directory Tomcat Mac OSF1 G R I D S I M Grid Fabric Software Grid Fabric Hardware CDB PDB 19 User-Level Middleware (Grid Tools) Gridbus Data Broker SGE AIX X-Parameter Sweep Lang. Grid Exchange & Federation Grid Bank PBS … Grid Applications Portals Worldwide Grid On Demand Assembly of Services: Putting Them All Together Application Code Explore data 1 Visual Application Composer 10 2 Data Catalogue 5 6 Grid Resource Broker 4 Grid Info Service 3 ASP Catalogue 9 12 Grid Market Directory 7 8 Grid Service (GS) (Globus) Alchemi Bill GS CPU or PE 20 GSP (e.g., IBM) Cluster Scheduler PE GSP (e.g., UofM) PE GTS GSP (e.g., VPAC) 11 Gridbus GridBank GSP (Accounting Service) Alchemi: .NET-based Enterprise Grid Platform & Web Services Alchemi Manager Web Services Internet Alchemi Users Internet 21 •SETI@Home like Model •General Purpose •Dedicated/Non-dedicate workers •Role-based Security •.NET and Web Services •C# Implementation •GridThread and Job Model Programming •Easy to setup and use • Widely in use! Alchemi Worker Agents Some Users of Alchemi Tier Technologies, USA Large scale document processing using Alchemi framework Satyam Computers Applied Research Laboratory, India Micro-array data processing using Alchemi framework CSIRO, Australia Natural Resource Modeling The University of Sao Paulo, Brazil The Alchemi Executor as a Windows Service The Friedrich Miescher Institute (FMI) for Biomedical Research, Switzerland Patterns of transcription factors in mammalian genes 22 stochastix GmbH, Germany Asynchronous Excel Tasks using ManagedXLL and Alchemi .Net Grid Computing framework. Many users in Universities: See next for an example. The Gridbus Grid Service Broker for Data Grid Applications Builds on the Nimrod-G Computational Grid Broker and Computational Economy [Buyya, Abramson, Giddy, Monash University, 1999-2001] And Extends its notion for Data and Service Grids Gridbus Broker Architecture Gribus Client Gridbus Client Gridbus Client App, T, $, Opt (Bag of Tasks Applications) (Data Grid Scheduler) Gridbus Farming Engine Schedule Advisor Trading Manager Record Keeper Grid Dispatcher Grid Explorer Grid Middleware TM $ TS GE GIS, NWS Grid Info Server RM & TS $ $ U Data Node C G Globus enabled node. 24 G L Unicore enabled node. A RM: Local Resource Manager, TS: Trade Server Alchemi enabled node. Data Catalog Gridbus Services for eScience applications Application Development Environment: Resource Allocation and Scheduling Dynamic discovery of optional computational and data nodes that meet user QoS requirements. Hide Low-Level Grid Middleware interfaces 26 XML-based language for composition of task farming (legacy) applications as parameter sweep applications. Task Farming APIs for new applications. Web APIs (e.g., Portlets) for Grid portal development. Threads-based Programming Interface Workflow interface and Gridbus-enabled workflow engine. Globus, Alchemi, Unicore, NorduGrid, XGrid, etc. Click Here for Demo Drug Design Made Easy! 27 Economy-based Data Grid Scheduling CLICK HERE TO SKIP IF RUNNING OUT of TIME High Energy Physics as eScience Application Case Study Case Study: High Energy Physics and Data Grid The Belle Experiment 29 KEK B-Factory, Japan Investigating fundamental violation of symmetry in nature (Charge Parity) which may help explain the universal matter – antimatter imbalance. Collaboration 400 people, 50 institutes 100’s TB data currently Australian Belle Data Grid Testbed Certificate Authority Analysis Request Analysis Results Virtual Organization Replica Catalog NWS NameServer Grid Service Broker Globus Gatekeeper GRIS Globus Gatekeeper GRIS NWS Sensor NWS Sensor GridFTP Dual Intel Xeon 2.8 Ghz, 2 GB RAM GridFTP Dept. of Physics, University of Sydney Dual Intel Xeon 2.8 Ghz, 2 GB RAM AARNET GRIDS Lab, University of Melbourne Globus Gatekeeper GRIS Globus Gatekeeper GRIS NWS Sensor GridFTP Globus Gatekeeper GRIS NWS Sensor GridFTP NWS Sensor GridFTP Dual Intel Xeon 2.8 Ghz, 2 GB RAM ANU, Canberra VPAC Melbourne 30 Intel Pentium 2.0 Ghz, 512 MB RAM Dept. of Physics, University of Melbourne Dual Intel Xeon 2.8 Ghz, 2 GB RAM Dept. of Computer Science, University of Adelaide Case Study: Event Simulation and Analysis B0->D*+D*-Ks • Simulation and Analysis Package - Belle Analysis Software Framework (BASF) • Experiment in 2 parts – Generation of Simulated Data and Analysis of the distributed data Analyzed 100 data files (30MB each) were distributed among the five nodes 31 Belle Data Grid (GSP CPU Service Price: G$/sec) Certificate Authority Analysis Request Analysis Results Virtual Organization Replica Catalog NWS NameServer Grid Service Broker Globus Gatekeeper GRIS Globus Gatekeeper GRIS NWS Sensor NWS Sensor GridFTP G$4 Dual Intel Xeon 2.8 Ghz, 2 GB RAM GridFTP Dept. of Physics, University of Sydney NA Dual Intel Xeon 2.8 Ghz, 2 GB RAM Globus Gatekeeper AARNET GRIDS Lab, University of Melbourne GRIS Globus Gatekeeper GRIS NWS Sensor Globus Gatekeeper GridFTP GRIS NWS Sensor GridFTP NWS Sensor Dual Intel Xeon 2.8 Ghz, 2 GB RAM GridFTP ANU, Canberra VPAC Melbourne 32 G$6 Intel Pentium 2.0 Ghz, 512 MB RAM Dept. of Physics, University of Melbourne G$2 Dual Intel Xeon 2.8 Ghz, 2 GB RAM Data node Dept. of Computer Science, University of Adelaide G$4 Belle Data Grid (Bandwidth Price: G$/MB) Certificate Authority Analysis Request Analysis Results Virtual Organization NWS NameServer Replica Catalog Grid Service Broker Globus Gatekeeper Globus Gatekeeper GRIS 32 33 36 NWS Sensor GRIS 31 GridFTP NA 30 31 GRIDS Lab, University of Melbourne Dept. of Physics, University of Sydney NWS Sensor Globus Gatekeeper GridFTP GRIS GRIS NWS Sensor GridFTP NWS Sensor Dual Intel Xeon 2.8 Ghz, 2 GB RAM GridFTP ANU, Canberra VPAC Melbourne 33 G$6 Intel Pentium 2.0 Ghz, 512 MB RAM Dept. of Physics, University of Melbourne G$4 Dual Intel Xeon 2.8 Ghz, 2 GB RAM Globus Gatekeeper AARNET GRIS GridFTP 34 38 Dual Intel Xeon 2.8 Ghz, 2 GB RAM Globus Gatekeeper NWS Sensor G$2 Dual Intel Xeon 2.8 Ghz, 2 GB RAM Data node Dept. of Computer Science, University of Adelaide G$4 Deploying Application Scenario A data grid scenario with 100 jobs and each accessing remote data of ~30MB Deadline: 3hrs. Budget: G$ 60K Scheduling Optimisation Scenario: 35 Minimise Time Minimise Cost Results: SUMMARY OF EVALUATION RESULTS Scheduling strategy Total Time Compute Data Taken Cost Cost (mins.) (G$) (G$) Total Cost (G$) Cost Minimization Time Minimization 34425 58390 71.07 48.5 26865 50938 7560 7452 Time Minimization in Data Grids fleagle.ph.unimelb.edu.au belle.anu.edu.au belle.physics.usyd.edu.au brecca-2.vpac.org 80 70 Number of jobs completed 60 50 40 30 20 10 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 Time (in mins.) 36 Results : Cost Minimization in Data Grids fleagle.ph.unimelb.edu.au belle.anu.edu.au belle.physics.usyd.edu.au brecca-2.vpac.org 100 90 80 Number of jobs completed 70 60 50 40 30 20 10 0 1 37 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 41 43 45 47 49 51 53 55 57 59 61 63 Time(in mins.) Observation Organization 38 SUMMARY OF EVALUATION RESULTS Scheduling strategy Total Time Compute Data Taken Cost Cost (mins.) (G$) (G$) Total Cost (G$) Cost Minimization Time Minimization 34425 58390 Node details 71.07 48.5 26865 50938 Cost (in G$/CPUsec) 7560 7452 Total Jobs Executed Time Cost CS,UniMelb belle.cs.mu.oz.au 4 CPU, 2GB RAM, 40 GB HD, Linux N.A. (Not used as a compute resource) -- -- Physics, UniMelb fleagle.ph.unimelb.edu.au 1 CPU, 512 MB RAM, 40 GB HD, Linux 2 3 94 CS, University of Adelaide belle.cs.adelaide.edu.au 4 CPU (only 1 available) , 2GB RAM, 40 GB HD, Linux N.A. (Not used as a compute resource) -- -- ANU, Canberra belle.anu.edu.au 4 CPU, 2GB RAM, 40 GB HD, Linux 4 2 2 Dept of Physics, USyd belle.physics.usyd.edu.au 4 CPU (only 1 available), 2GB RAM, 40 GB HD, Linux 4 72 2 VPAC, Melbourne brecca-2.vpac.org 180 node cluster (only head node used), Linux 6 23 2 Grid and Gridbus Technologies for Various Grid (Market) Types free trading Public computin g (Alchemi) Private enterprises (Libra, Gridbus, Globus) Sharing Model National provider (Globus, Gridbus,..) regulation scientific 39 Application Category commercial Summary and Conclusion Grids exploit synergies that result from cooperation of autonomous entities: 41 Resource sharing, dynamic provisioning, and aggregation at global level. Grid Economy provides incentive needed for sustained cooperation. Grid Network has potential to serve as Cyberinfrastructure for Utility Computing Grids offer enormous opportunities for realizing eScience and eBusiness at global level. Any Questions ? Gridbus Project - http://www.gridbus.org 42 Thanks for your attention! The Gridbus Cooperation! http://www.gridbus.com 43 Backup Slides Deadline (D) and Budget (B) Constrained Scheduling Algorithms Algorithm Execution Execution Compute Time (D) Cost (B) Grid Cost Opt Limited by D Minimize Yes Cost-Time Opt Minimize if possible Minimize Yes Time Opt Minimize Limited by B Yes Conservative -Time Opt Minimize Limited by B, jobs have guaranteed minimum budget Yes 47 Data Grid Yes Yes