HPCx: Multi-Teraflops in the UK A World-Class Service for World-Class Research Director

advertisement
HPCx: Multi-Teraflops in the UK
A World-Class Service for World-Class Research
Dr Arthur Trew
Director
HPCx rationale
UK academic research is increasingly
dependent upon high-end compute facilities
recent technological and Grid advances
highlighted the need to upgrade UK resources
HPCx objectives are thus:
– aim “to deliver optimum service resulting in worldleading science”
– address “the problems involved in scaling to the
capability levels required”
What is HPCx?
 Consortium of leading UK organisations committed to
creating and managing the new resource for the next
6 years
– led by University of Edinburgh
 multi-stage project to deliver a world-class academic
computing resource, the largest in Europe
 £54M/$100M budget
 Grid-enabled, a key component in the UK e-Science
programme
HPCx Consortium Members
 UoE HPCx Ltd – wholly-owned subsidiary of the
University of Edinburgh and lead contractor
 EPCC, University of Edinburgh
 Daresbury Laboratory, CCLRC
 IBM
 the University of Edinburgh is one of the top 5 research
universities in the UK
 EPCC is the leading computer centre in Europe, bridging
the gap between academia and industry
 … and provides both HPC and novel computing solutions
to a wide range of problems and users
 long experience of providing national HPC services
including:
– Meiko Computing Surfaces
– Thinking Machines CM200
– Cray T3D/T3E
EPCC overview
Technology Transfer
Academic:
o National HPC facilities
o Research
o Support
Industry:
o Projects
o Consultancy
o Standards
Training:
o Academia
o Industry
European leadership:
o Visitor programmes
o Technology Transfer
o Strategic Planning
Daresbury Laboratory
 A multi disciplinary research lab with
over 500 people
 Provides large-scale research
facilities both for UK academic and
industrial research communities
 Runs the UK’s Collaborative
Computational Projects
 Daresbury hosts the HPCx
hardware
 IBM provides the technology for HPCx
 Long standing involvement in HPC including the
development of a number of ASCI machines and 4 of the
top dozen machines in the 21st TOP500 list:
– ASCI White: Rmax = 7.3 TFlop/s
– SP Power3 (6656 Processors): Rmax = 7.3 TFlop/s
– xSeries (1920 Processors): Rmax = 6.6 TFlop/s
– HPCx (1280 processors): Rmax = 3.2 TFlop/s
 IBM has the long term technology road map essential to
a 6 year project such as HPCx
HPCx in Place
HPCx: Phase 1
 System will be commissioned in three main stages
Phase 1 covering 2002-2004 consists of:
– 40 Regatta-H SMP nodes, 1280 processors
– Peak performance 6.6 TFlop/s, with 3.4 Tflop/s sustained
Linpack
• currently 16th in the Top500
– 1.28 TB total memory capacity
– Over 50 TB of storage capacity
– Double plane Colony switch with total
peak bandwidth of 250 MB/s per
processor
The e-Science Grid
CeSC
(Cambridge)
HPCx Phases 2 & 3
 Phase 2 (2004-2006)
– aiming for 6 TFlop/s sustained on Linpack and 2.5 TFlop/s on
sPPM
– O(48) Regatta-H+ SMP nodes
– interconnect upgraded to Federation switch
– doubling of I/O and storage
• already built a cluster with 8 Regatta-H+ frames and a prerelease Federation switch
• undertaking a phased upgrade during 1H2004
 Phase 3 (2006-2008)
– target of 12 TFlop/s sustained on Linpack
– may be additional nodes or alternative technology
HPCx Science Support
18 staff in 5 dual-centre functional support teams
Outreach
Life sciences
New applications
Applications Support
Users
Helpdesk
Training
Liaising with users
Terascaling
Capability applications
Scalable algorithms
Performance optimisation
Software Engineering
Underpinning technology
Grid/e-Science
Systems & Networking
Flexible and responsive capability computing service
Smooth transitions between phases
Technology
CPU Usage by Job Size
80%
128
Jan-03
Feb-03
Mar-03
Apr-03
May-03
Jun-03
Jul-03
Aug-03
Sep-03
Oct-03
Nov-03
Dec-03
256
1000000
>1024
2000000
40%
512
1500000
0%
AUs
HPCx Status: Usage
Utilisation of Capability Region
100%
60%
1024
20%
64
500000
32
0
8
Jan-03
Feb-03
Mar-03
Apr-03
May-03
Jun-03
Jul-03
Aug-03
Sep-03
Oct-03
Nov-03
Dec-03
16
HPCx Status: Application
Areas
 currently 28 active consortia, and over 450 users
Usage by Application Area
Atomic &
Molecular Physics
Science Support
Materials
Chemistry
Engineering
Environmental
Science
 Life Sciences outreach activity supported by IBM
Atomic and Molecular
Physics
 The UK Multiphoton, Electron Collision and Bose Einstein
Condensates (MECBEC) HPC Consortium
 Two flagship projects model two-electron atoms (helium)
and molecules (hydrogen) exposed to intense, ultra-short
laser pulses
 Modelling involves the grid solution of multidimensional, time-dependent partial differential
equations)
 Requires large amounts of computing power
 Visualisation techniques crucial in
extracting information
Simulation of Double Ionization of laser-driven helium performed at
Queen’s University Belfast
Environmental Science:
POLCOMS
 POLCOMS is a multi-disciplinary model developed at the
Proudman Oceanographic Laboratory
 … a 3-D hydrodynamic model
integrating coasts and oceans using a
wide range of associated models
 POLCOMS is a step towards real
time modelling of coastal zones,
enabling better analysis of impacts
to, and sustainability of, the marine
environment
A simulation of Chlorophyll density in UK
waters using the POLCOMS model
Material Science: Crystal
 CRYSTAL computes electronic structure and related properties of
periodic systems
 Developed jointly by Daresbury and the University of Turin
 A Fortran90 and MPI program that performs Hartree-Fock, density
functional and other approximation calculations
 On HPCx, CRYSTAL used to calculate the structure of the Crambin
molecule, the largest Hartree-Fock calculation ever converged (1284
atoms)
– next, the Rusticyanin molecule (6284 atoms)
Engineering: UKTC
UK Turbulence Consortium is developing world
leading turbulence simulation codes using HPCx
essential that they can easily facilitate the
scientific usage of the resulting data
the calculated data are transferred from HPCx to a
remote site for analysis
so, the Grid is becoming increasingly important
Chemistry: RealityGrid
 RealityGrid is a UK collaboration
– which aims to grid-enable the realistic modelling and
simulation of complex condensed matter structures at
the meso and nanoscale levels
 HPCx terascaling team has worked on parallelisation &
optmisation of RealityGrid codes, such as LB3D
 RealityGrid also uses the Grid for data transfer,
computational steering, and remote visualisation
– using Globus on HPCx
– TeraGyroid won award at SC2003
Summary
HPCx is the new UK HPC resource
– capability computing for world-leading science
– largest academic computer in Europe
Series of IBM pSeries clusters
– 6.6TF  12TF  22TF
– remaining very competitive until 2008+
Very successful first year
HPCx: Supporting the
Future of UK Research
Summary: Website
 For further information:
http://www.hpcx.ac.uk
The front page of
the HPCx website
Download