Workshop on Sustainable National Grid Services Edinburgh, Feb 22 – 23, 2007 and Submitted as a position statement to the eSI Workshop on Realising and Coordinating e-Research Endeavours Edinburgh, March 14 – 16, 2007 D-Grid Progress Towards Sustainability Wolfgang Gentzsch D-Grid March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 1 Today’s Topics A little history of D-Grid D-Grid: a few details 2 ways towards a sustainability strategy: 1. Learn from others (analysis of major grids) 2. Learn from our own requirements’ analysis 1. Analysis of major grid projects 2. D-Grid Sustainability Workshop Conclusions March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 2 eSI Workshop and Sustainability Free resource usage versus unlimited demand ? How to overcome problems with software licenses and data ? Ensure sustainability for adopted technologies Lock-in to technologies which may become obsolete over time ? How to switch to newer technologies if necessary ? ........ March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 3 My Short Answer: March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 4 My Short Answer: There has to be a need to ensures sustainability ! March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 5 My Short Answer: There has to be a need to ensures sustainability ! And now my long answer : March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 6 Sustainability: Ideas in the Beginning Learn from others, collaborate with others Sustainability in architecture (standards), technology (robust), users (applications), market, legal, government,… Start with a plan for sustainability before you start with a plan for the grid Users and applications are main drivers of sustainability Develop clear benefits for users Make everything easy to use Political and policy landscape has to be right March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 7 Learn from Others: March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 8 e-Science Grid Initiatives Investigated Initiative Funding People *) UK e-Science-I: 2001 - 2004 UK e-Science-II: 2004 - 2006 $180M $220M 900 1100 TeraGrid-I: TeraGrid-II: 2001 - 2004 2005 - 2010 $90M $150M 500 850 Res. Res. ChinaGrid-I: ChinaGrid-II: 2003 - 2006 2007 – 2010 20M RMB 50M RMB *) 400 1000 Res. Res. NAREGI-I: NAREGI-II 2003 - 2005 2006 - 2010 $25M $40M EGEE-I: EGEE-II: 2004 - 2006 2006 - 2008 $40M $45M D-Grid-I: D-Grid-II: 2005 - 2008 2007 - 2009 $25M $25M March, 2007 Time *) Wolfgang Gentzsch, D-Grid & RENCI *) estimate Users Res. Res. Ind. 150 250 Res. Res. Ind. 800 1000 Res. Res. Ind. 220 220 (= 440) Res. Res. Ind. 9 Components of e-Infrastructures for Science (Tony Hey, 2003) 1. Resources: Networks with computing and data nodes, etc. 2. Development/support of standard middleware & grid services 3. Internationally agreed AAA infrastructure 4. Discovery services and collaborative tools 5. Data provenance, curation and preservation 6. Open access to data and publications via interoperable repositories 7. Remote access to large-scale facilities: Telescopes, LHC, ITER, .. 8. Industrial collaboration 10 Sustainability in Current Projects UK e-Science: National Grid Service (NGS), Grid Operations Support Center (GOSC), National e-Science Center (NeSC), Regional e-Science Centers, Open Middleware Infrastructure Institute (OMII), Digital Curation Center (DCC),…. EGEE: Plans to establish a European Grid Initiative (EGI), together with NGIs, to provide persistent grid service federating national grid programmes starting in 2008 ChinaGrid: Increasing numbers of grid applications using CGSP grid middleware packages NAREGI: Software will be managed and maintained by Cyber Science Infrastructure Center of National Institute of Informatics TeraGrid: NSF Cyberinfrastructure Office: 5 year Coop. Agreement. Partnerships with peer grid efforts and commercial web services activities in order to integrate broadly D-Grid: DGI WP 4: sustainability, services strategies, and business models; D-Grid 1,2,3; Layers: resources, middleware, SLAs, Knowledge; Centres of Excellence; more communities – more users March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 11 Challenges for Research and Industry • • • • • • • • • • • • Sensitive data, sensitive applications (medical patient records) Different organizations get different benefits Accounting, who pays for what (sharing!) Security policies: consistent and enforced across the grid ! Lack of standards prevent interoperability of components Current IT culture is not predisposed to sharing resources Not all applications are grid-ready or grid-enabled Open source is not equal open source (read the small print) SLAs based on open source (liability?) “Static” licensing model don’t embrace grid Protection of intellectual property Legal issues (e.g. FDA, HIPAA, multi-country grids) March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 12 Lessons Learned and Recommendations – Continuity: Grid infrastructure should be modified and improved in large cycles only: applications depend on infrastructure ! – Sustainability: Funding should be available after end of project, to guarantee services, support and continuous improvement. – Interoperability: Use open-source software and standards especially in the infrastructure and application middleware layer. – Collaboration: between infrastructure developers and the applications, to best utilize grid services and to avoid application silos. – User-Friendliness: for easy adoption for new communities. Infrastructure group should offer installation, operation and support services. – Grid Services: Centers of Excellence should specialize on specific services, e.g. integration of new communities, grid operation, utility services, training, support, etc. – Participation of Industry: has to be industry-driven. Push from outside, even with govmnt funding, is not promising. Success comes only from real needs e.g. through already existing collaborations between research and industry. More Info: www.renci.org Publications Reports March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 13 Learn from our own requirements’ analysis: March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 14 History of D-Grid Initiative • 01/2003: German scientists started D-Grid Initiative (‘UK pressure’); report with recommendations for German Government • 03/2004: BMBF announced 100 ME e-Science Initiative for Germany • 08/2004: BMBF Call for Proposals for e-Learning, Knowledge Networks, and Grid Computing • • • 09/2005: D-Grid-1: 25 ME, early adopters, ‘Services for Science’ 06/2007: D-Grid-2: new communities and services providers 06/2008: D-Grid-3 (?): Service Grids for research, industry, society March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 15 D-Grid Im Wissensnetz ... WIKINGER ONTOVERSE WISENT Textgrid Knowledge Management MediGrid IN-Grid HEP-Grid C3-Grid Astro-Grid D-Grid-1 Generic Grid Middleware and Grid Services Integration Project March, 2007 Wolfgang Gentzsch, D-Grid & RENCI Courtesy Helmut Loewe, BMBF 16 D-Grid-2 Services Level Agreements Im Wissensnetz ... WIKINGER ... ONTOVERSE WISENT Knowledge Management Textgrid MediGrid IN-Grid HEP-Grid C3-Grid Astro-Grid D-Grid-1 + 2 Generic Grid Middleware and Grid Services Integration Project March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 17 D-Grid-3 Services Level Agreements Im Wissensnetz ... WIKINGER ... Knowledge Management ONTOVERSE Knowledge Management WISENT Textgrid MediGrid IN-Grid HEP-Grid C3-Grid Astro-Grid D-Grid-1 + 2 + 3 Generic Grid Middleware and Grid Services Integration Project March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 18 D-Grid-3 Virtual Competence Centers for Services LevelSupport, Agreements Middleware, Resources, Knowledge,... ? Im Wissensnetz ... WIKINGER ... ONTOVERSE Knowledge Management WISENT Textgrid MediGrid IN-Grid HEP-Grid C3-Grid Astro-Grid D-Grid-1 + 2 + 3 Generic Grid Middleware and Grid Services March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 19 D-Grid Middleware Stack User Application Development and User Access GAT API GridSphere Nutzer Scheduling Workflow Management High-level Grid Services Monitoring Data management Basic Grid Services March, 2007 UNICORE LCG/gLite Accounting Billing User/VO-Mngt Globus 4.0.1 Security Resources in D-Grid Plug-In Distributed Data Archive Data/ Software Network Infrastructur Wolfgang Gentzsch, D-Grid & RENCI Distributed Compute Resources 20 D-Grid: Towards a Sustainable Infrastructure for Science and Industry Govt is looking into changing policies for resource acquisition (HBFG ! ) to enable a service model 2nd Call: Focus on Service Provisioning for Sciences & Industry Strong collaboration with: Globus Project, EGEE, Deisa, CrossGrid, CoreGrid, GridCoord, GRIP, UniGrids, NextGrid, … Application and user-driven, not infrastructure-driven Focus on implementation and production, not grid research, in a multi-technology environment (Globus, Unicore, gLite, etc) D-Grid is the Core of the German e-Science Initiative March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 21 D-Grid Workshop Sustainability in D-Grid Oct 9 – 10 2006, 50 Participants Sustainability in Grids S. and the funding Organization (Govt) S. and monitoring, accounting, billing S. of the D-Grid Infrastructure S. and application communities Example DFN German Research Network S. and Industry S. and support The European Grid Initiative (EGI) March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 22 Results of the Workshop Requirements There is a general need for a sustainable infrastructure Funding agency demands cost-neutral operation Securing long-term investments and ROI But: not only monetary considerations: enables long-term research collaboration and competitive research infrastructure Benefits for all constituencies Access to resources AND long-term data preservation International integration Acceptance of infrastructure through ease of use Long-term planning important for attracting new communities Include learning (GridKa), testing, support, and production March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 23 Results of the Workshop Challenges Heterogeneous middleware complicates building sustainable grid, but necessary for international collaboration => sustainability Today: grids are complex and user unfriendly environments Integrating new hardware from new partners & communities; How ? Currently, D-Grid is not a ‘legal’ entity, problem for contractual collab Long-term financing of resources and their usage/operation unclear Grid-enabled software licensing models still missing Broadening community grids beyond their current core members Germany: “Laender” investments restricted to ‘regional’ usage March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 24 Results of the Workshop Steps towards a sustainable infrastructure Significantly increase number of grid users, far beyond early adopters Govt funding for resources dedicated to D-Grid was key Support of several middlewares important Long-term goal: independence of D-Grid from funding … … but mid-term funding: prototype production Encourage Govt to change current funding policies for resources Professional user support of utmost importance (DGI & CGs) Excellent support excellent people long-term contracts Industry participation as users (SMEs) and providers (IT companies) March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 25 Results of the Workshop Conclusions and Recommendations D-Grid seems to be on track towards a sustainable infrastructure A centralized resource infrastructure is important, but the how still has to be discussed (DGI vs CGs) Implementation of sustainable D-Grid only together with users (CGs) Sustainable usage (business) models only with users (CGs) Integration of D-Grid in European infrastructure is important Central D-Grid institution should encourage broad acceptance of D-Grid, incl certification of and support for resources Role of industry unclear, but participation possible today March, 2007 Wolfgang Gentzsch, D-Grid & RENCI 26