NSF HPC Task Force Report Summary Presented to the TeraGrid Scientific Advisory Board Thomas Zacharia University of Tennessee Oak Ridge National Laboratory Jim Kinter Center for Ocean-Land-Atmosphere Studies Richard Loft National Center for Atmospheric Research Arlington, Virginia 20 January 2011 HPC Task Force: Membership Thomas Zacharia, Chair, University of Tennessee/Oak Ridge National Laboratory Jim Kinter, Co-Chair, Center for Ocean-Land-Atmosphere Studies Ravi Arimilli, IBM, Inc. Michael Macy, Cornell University Ron Cohen, Carnegie Institution of Washington Dick McCombie, Cold Spring Harbor Laboratory Larry Davis Dave Randall, Colorado State University Tiziana Di Matteo, Carnegie Mellon University Steve Scott, Cray, Inc. Bill Harrod, Defense Advanced Research Projects Agency Horst Simon, Lawrence Berkeley National Laboratory George Karniadakis, Brown University Thomas Sterling, Louisiana State University Rubin Landau, Oregon State University Nancy Wilkins-Diehr, San Diego Supercomputing Center Rich Loft, National Center for Atmospheric Research Theresa Windus, Iowa State University Rob Pennington, NSF Liaison 2 NSF Advisory Committee for Cyberinfrastructure High Performance Computing Task Force HPC Task Force: Charter • Provide specific advice on the broad portfolio of HPC investments that NSF should consider • Time span: 5–10 years • Scope: – Cyberinfrastructure to support research – Research on cyberinfrastructure (hardware and software) – Training 3 NSF Advisory Committee for Cyberinfrastructure High Performance Computing Task Force HPC Task Force: Activities • 3 community workshops (Arlington, Virginia): University, industry, and government participants – Sustainability and User Requirements, 4 December 2009 – Applications at the Exascale, 29 July 2010 – Broader Engagement, 3 December 2010 • Community input solicited • Presentations, reports, and position papers posted: http://www.nics.tennessee.edu/workshop 4 NSF Advisory Committee for Cyberinfrastructure High Performance Computing Task Force Sustainability and User Requirements Workshop: 4 December 2009 Stan Ahalt, Renaissance Computing Institute Fred Johnson, Retired Phil Andrews, University of Tennessee Amy Apon, University of Arkansas Tim Axelrod, University of Arizona Pete Beckman, Argonne National Laboratory Kirk Jordan, International Business Machines Jay Boisseau, University of Texas Rubin Landau, Oregon State University Rich Loft, National Center for Atmospheric Research William Kramer, University of Illinois Bill Camp, Intel Corporation Ronald Cohen, Carnegie Institute of Washington John Connelly, University of Kentucky Rene Copeland, Silicon Graphics, Incorporated Thom Dunning, University of Illinois Geoffrey Fox, University of Indiana Tom Furlani, State University of New York at Buffalo Omar Ghattas, University of Texas Galen Gisler, University of Oslo Matthias Gobbert, University of Maryland Robert Harrison, Oak Ridge National Laboratory Bill Harrod, Defense Advanced Research Projects Administration Eric Jakobsson, University of Illinois 5 NSF Advisory Committee for Cyberinfrastructure High Performance Computing Task Force Sid Karin, University of California, San Diego Jim Kinter, Center for Ocean-Land-Atmosphere Studies, Co-chair Patricia Kovatch, University of Tennessee Rubin Landau, Oregon State University Paul Messina, Argonne National Laboratory Ravi Nair, International Business Machines Esmond Ng, Lawrence Berkley National Laboratory Mike Norman, University of California, San Diego Brian O’Shea, Michigan State University Rob Pennington, National Science Foundation Dave Randall, Colorado State University Ralph Roskies, University of Pittsburgh Steve Scott, Cray, Inc. Thomas Sterling, Louisiana State University Dennis Wall, Harvard University Thomas Zacharia, University of Tennessee and Oak Ridge National Laboratory, Co-chair 38 attendees Sustainability and User Requirements: Recommendations for NSF Direct supercomputing program evolution to sustain and promote resource stability • Allow researchers and HPC centers to select best value in computational and data platforms • Enable centers to offer continuous service to the community 6 NSF Advisory Committee for Cyberinfrastructure High Performance Computing Task Force Commit to stable and sustained funding for HPC centers • Allow centers to recruit and develop the expertise needed to maximize potential of hardware investments • Use rigorous review and oversight processes to provide assurance that centers are meeting expectations Encourage HPC centers to build long-term relationships with multiple vendors • Provide researchers with benefits of planned roadmap for several generations of chip technology and with continuity in architecture and software environments • Apply results-oriented acquisition strategies to ensure that vendor performance meets center and NSF needs Applications at the Exascale Workshop: 29 July 2010 Amy Apon, University of Arkansas Patricia Kovatch, University of Tennessee Bill Barth, Texas Advanced Computing Center William Kramer, University of Illinois Thomas Cheatham, University of Utah John Levesque, Cray, Inc. John Connolly, University of Kentucky Rick Linger Carnegie-Mellon University Rhonda Dias, Silicon Graphics, Incorporated Bruce Loftis, University of Tennessee Mark Fahey, University of Tennessee Dick McCombie, Cold Spring Harbor Laboratory Rob Fowler, Renaissance Computing Institute Phil Maechling, Southern California Earthquake Center Tom Furlani, State University of New York at Buffalo Bronson Messer, Oak Ridge National Laboratory Omar Ghattas, University of Texas George Michaels, Intel Corporation Galen Gisler, University of Oslo Jose Munoz, National Science Foundation Matthias Gobbert, University of Maryland, Baltimore County Bonnie Nestor, Oak Ridge National Laboratory Dennis Goo, Intel Corporation Esmond Ng, Lawrence Berkeley National Laboratory David Halstead, National Radio Astronomy Observatory Rob Pennington, National Science Foundation Robert Harrison, Oak Ridge National Laboratory Irene Qualters, National Science Foundation Thuc Hoang, National Nuclear Security Administration Dave Randall, Colorado State University Eric Jakobsson, University of Illinois Klaus Schulten, University of Illinois at Urbana-Champaign Brad Jones, Silicon Graphics, Incorporated Mark Shephard, Rensselaer Polytechnic Institute Homa Karimabadi, University of California, San Diego Jeff Vetter, Oak Ridge National Laboratory and Georgia Institute of Technology George Karniadakis, Brown University Nancy Wilkins-Diehr, San Diego Supercomputing Center Dan Katz, University of Chicago Thomas Zacharia, University of Tennessee and Oak Ridge National Laboratory Jim Kinter, Center for Ocean-Land-Atmosphere Studies 7 NSF Advisory Committee for Cyberinfrastructure High Performance Computing Task Force 44 attendees Applications at the exascale: Recommendations for NSF • Expand partnerships with industry, academia, and other agencies to accelerate development of exascale systems and applications – Co-design will be critical in development of applications that can deliver predictive capability • Expand efforts to engage new user communities in HPC – Focused outreach program to industry – Engagement of data-intensive fields of research • Provide a funding framework for co-location of archive and community data resources with compute and visualization resources • Establish a continuing process for soliciting community input on plans for HPC investments 8 NSF Advisory Committee for Cyberinfrastructure High Performance Computing Task Force Broader Engagement Workshop: 3 December 2010 9 Stan Ahalt, Renaissance Computing Institute Ashok Krishnamurthy, Ohio Supercomputing Center Gabrielle Allen, Louisiana State University Sander Lee, National Nuclear Security Administration Amy Apon, University of Arkansas David Lifka, Cornell University Atinuke Arowojolu, National Nuclear Security Administration Bruce Loftis, University of Tennessee David Bader, Georgia Institute of Technology Yashema Mack, University of Tennessee Bill Barth, Texas Advanced Computing Center Jose Munoz, National Science Foundation Jay Boisseau, University of Texas Jim Myers, Rensselaer Polytechnic Institute Rhonda Dias, Silicon Graphics, Incorporated Henry Neeman, Oklahoma University Jim Ferguson, University of Tennessee Bonnie Nestor, Oak Ridge National Laboratory Sue Fratkin, Coalition for Academic Scientific Computation Esmond Ng, Lawrence Berkeley National Laboratory Matthias Gobbert, University of Maryland, Baltimore County Mike Norman, San Diego Supercomputing Center Dennis Goo, Intel Corporation Mike Papka, University of Chicago/Argonne National Laboratory Milt Halem, University of Maryland, Baltimore County Rob Pennington, National Science Foundation David Halstead, National Radio Astronomy Observatory Irene Qualters, National Science Foundation Robert Harrison, Oak Ridge National Laboratory Dan Reed, Microsoft Corporation Elliot Hirshman, University of Maryland ,Baltimore County Ralph Roskies, University of Pittsburgh Larry Hoelzeman, Cray, Inc. Mark Shephard, Rensselaer Polytechnic Institute Raquell Holmes, University of Connecticut Health Center David Skinner, Lawrence Berkeley National Laboratory Homa Karimabadi, University of California, San Diego Alex Szalay, John Hopkins Jim Kinter, Center for Ocean-Land-Atmosphere Studies John Towns, National Center for Supercomputing Applications Patricia Kovatch, University of Tennessee Jeff Vetter, Oak Ridge National Laboratory and Georgia Institute of Technology William Kramer, University of Illinois Thomas Zacharia, University of Tennessee and Oak Ridge National Laboratory NSF Advisory Committee for Cyberinfrastructure High Performance Computing Task Force 41 attendees Broader Engagement: Preliminary recommendations to NSF • Continue and grow education, outreach, and training programs to expand awareness and encourage use of high-end modeling and simulation capabilities – Industry – New user communities – Expanded workforce development programs, starting with K–12 • Measure program impacts – Expansion in use of HPC – Delivery of science • Consider establishment of a computational science program or division 10 NSF Advisory Committee for Cyberinfrastructure High Performance Computing Task Force HPC Taskforce: Summary recommendations to NSF • Develop a sustainable model to provide the academic research community with access, by 2015–2016, to a rich mix of HPC systems that: – Deliver sustained performance of 20–100 petaflops on a broad range of science and engineering applications – Are integrated into a balanced, comprehensive, national cyberinfrastructure environment – Are supported at national, regional, and/or campus levels • Invest now to prepare for exascale systems that will be available by 2018–2020 – Co-design partnerships – Data cyberinfrastructure (networking, disk, and storage) for data-driven science • Broaden outreach to include industry and new user communities • Establish a continuing process for soliciting community input on plans for HPC investments 11 NSF Advisory Committee for Cyberinfrastructure High Performance Computing Task Force Questions & Discussion 12 NSF Advisory Committee for Cyberinfrastructure High Performance Computing Task Force