TeraGrid Overview Cyberinfrastructure Days Internet2 10/9/07 Mark Sheddon Resource Provider Principal Investigator San Diego Supercomputer Center sheddon@sdsc.edu TeraGrid • Funded by the Office of Cyberinfrastructure (OCI) within the National Science Foundation (NSF) – Grid Infrastructure Group (GIG) • Coordination software development and deployment • Integration and tracking of general partnership activities • Lead by University of Chicago/Argonne National Labs (Dane Skow) – Nine Computational Resource Providers (RP’s) – Four Software Integration Partners TeraGrid Facility Partners UW Grid Infrastructure Group (UChicago) PSC UC/ANL NCAR PU NCSA Caltech UNC/RENCI IU ORNL USC/ISI SDSC Resource Provider (RP) TACC Software Integration Partner August 2007 TeraGrid Objectives • DEEP Science: Enable Petascale Science – Make science more productive through an integrated set of very-high capability computational resources • Address key scientific challenges prioritized by users • WIDE Impact: Empower Scientific Communities – Bring TeraGrid capabilities to the broad science community • Partner with science community leaders – e.g. “Science Gateways” • Create an OPEN Infrastructure, OPEN Partnership – Provide a coordinated, general purpose, reliable set of services and resources • Partner with campuses and facilities More Than Just Fast Computers Start-up and 9 Large Allocations Resource Providers Over 300 Tflops Computing Security and Access Applications Packages Security Input Security Archive 2D . and 3D Shibboleth Retrieval Data Storage & Collections Authorization Search Central Help Desk Human Support Science Gateways Ontologies Training Authentication Over 100 Collections Schema Metadata Display Tools Accounting Training Education And Outreach ASTA Policy Software Visualization Servers Resource Providers Governance Campus Partners Scientific Instruments Control Data Generation ORNL SNS Over 1,000 PIs Researchers Educators Students Over 3,200 Users Faculty Users Come From Many Scientific Disciplines TeraGrid Projects by Institution Blue: 10 or more PI’s Red: 5-9 PI’s Yellow: 2-4 PI’s Green: 1 PI TeraGrid allocations are available to researchers at any US educational institution by peer review. Exploratory allocations can be obtained through a biweekly review process. See www.teragrid.org. 1000 projects, 3200 users How Can You Get Involved? • Apply for an Allocation – Computing Resources – Applications Support • Access through a Science Gateway • Participate in EOT Activities – Institutes, Workshops, Online Tutorials – TG Conference • Become a Resource Provider Requesting Allocations of Computer Time • TeraGrid resources are provided for free to academic researchers and educators through peer review process – Development Allocations Committee (DAC) for start-up accounts up to 30,000 hours of time are requests processed in two weeks – Medium Resource Allocations Committee (MRAC) for requests of up to 500,000 hours of time are reviewed four times a year – Large Resource Allocations Committee (LRAC) for requests of over 500,000 hours of time are reviewed twice a year Helping Applications Take Best Advantage of TG Resources Advanced Support for TeraGrid Applications (ASTA) Large Data; Virtualized Resources: Earthquake Simulation Olsen (SDSU), Okaya (USC), Jordan (USC), Southern California Earthquake Center Movie SDSC TeraGrid Science Gateways Initiative: Community Interface to Grids • Common Web Portal or application interfaces (database access, computation, workflow, etc). • “Back-End” use of TeraGrid computation, information management, visualization, or other services. Gateways are Growing in Numbers • 10 initial projects as part of TG proposal • >20 Gateway projects today • No limit on how many gateways can use TG resources – Prepare services and documentation so developers can work independently • Open Science Grid (OSG) • Special PRiority and Urgent Computing Environment (SPRUCE) • National Virtual Observatory (NVO) • Linked Environments for Atmospheric Discovery (LEAD) • Computational Chemistry Grid (GridChem) • Computational Science and Engineering Online (CSE-Online) • GEON(GEOsciences Network) • Network for Earthquake Engineering Simulation (NEES) • SCEC Earthworks Project • Network for Computational Nanotechnology and nanoHUB • GIScience Gateway (GISolve) • Biology and Biomedicine Science Gateway • Open Life Sciences Gateway • The Telescience Project • Grid Analysis Environment (GAE) • Neutron Science Instrument Gateway • TeraGrid Visualization Gateway, ANL • BIRN • Gridblast Bioinformatics Gateway • Earth Systems Grid • Astrophysical Data Repository (Cornell) Wide Variety of HPC Training/Education Options • Live/Access Grid Sessions include: – – – – – – – Introduction to UT Grid Rodeo Using the National Microbial Pathogen Data Resource BlueGene Applications Workshop Introduction to Parallel Computing Summer Institutes Toward Multicore Petascale Applications Introduction to Scientific Visualization • On-line Self-Paced Tutorials – Over 35 topics and growing • Curricular Focused Workshops – Introduction to Interdisciplinary Computational Science Education for Educators – Computational Biology for Biology Educators – Computational Chemistry for Chemistry Educators and – Computational Physics for Physics Educators – Computing in the Humanities, Arts and Social Sciences – Parallel and Cluster Computing Workshop Sites 2007 TeraGrid RP 2/4 Yr. College Research 1 Univ. Minority Serving Institution Workshop TeraGrid ‘07 Conf. Conference Tutorial (e.g. SC, AAAS, AAPT) Broadening Participation in TeraGrid Keynote: Anita Jones (U. VA) Technology Keynote: Paul Strong (eBay) Science Keynote: Phil Maechling (USC/SCEC) Science, Technology, and Outreach Sessions Student Competitions for high school, undergraduate and graduate students! Over 350 Attendees www.teragrid.org/events/teragrid07 Coming Down the Track TeraGrid Resource Futures • TACC – Sun (Ranger), 500 TF (peak), Dec 2007 • LSU – IBM Blade, 25 TF (peak), 1Q08 • U. Tennessee – Cray XT, 1 PF (peak), 1Q09 • NCSA – IBM Power, ~1 PF (sustained), 2011 ONWARD! www.teragrid.org Helping Applications Take Best Advantage of TG Resources Advanced Support for TeraGrid Applications (ASTA) Large Data; Virtualized Resources: Earthquake Simulation Olsen (SDSU), Okaya (USC), Southern California Earthquake Center Sources: Tom Jordan (USC). Images SDSC.