HPCx
Power for the Grid
Dr Alan D Simpson
HPCx Project Director
EPCC Technical Director
HPCx Overview
• UK’s major HPC facility, primarily funded by EPSRC
• £53M, 6-year contract awarded to UoE HPCX Ltd
– wholly-owned subsidiary of University of Edinburgh
– work subcontracted to CCLRC (DL), EPCC and IBM
• Largest academic supercomputer in Europe
– doubling in performance every 2 years
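As a rough illustration of what doubling every 2 years implies over the 6-year contract, the sketch below computes performance relative to the starting system. It assumes smooth exponential growth, which is a simplification (upgrades to a real machine arrive in discrete phases), and the function name is purely illustrative.

```python
def relative_performance(years: float, doubling_period: float = 2.0) -> float:
    """Performance relative to the starting system after `years`,
    assuming smooth doubling every `doubling_period` years (an idealisation)."""
    return 2.0 ** (years / doubling_period)


# Relative performance at the start and after each 2-year period of a 6-year contract.
for year in (0, 2, 4, 6):
    print(f"year {year}: {relative_performance(year):.0f}x")
```

Under this assumption the system at the end of year 6 would be 8 times as powerful as at the start.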
HPCx
September 2003
HPCx Objectives
• Deliver capability computing for world-leading science
• Capability Computing
– jobs which use a significant fraction of the resource, e.g. at least 512 CPUs
• Collaboration between HPCx and users through the Terascaling process
• Maximise benefits to the UK’s computational science and engineering community
• Forging links between HPC and e-Science
• High quality support is the key to success
Partnership
• EPCC and CCLRC
– are partners in C3ES (Consortium for Capability Computing and e-Science)
– underpinned by MoU between UoE and CCLRC
– combines Europe’s foremost academic HPC, e-Science and technology transfer centres
– virtual organisation facilitated by Access Grid
– significant experience of:
• operating national HPC services
• developing capability applications
– the strongest UK partnership ever to support scientific computing
Virtual Organisation
• Dual-centre functional support teams, spanning users to technology
– Outreach: life sciences, new applications
– Applications Support: helpdesk, training, liaising with users
– Terascaling: capability applications, scalable algorithms, performance optimisation
– Software Engineering: underpinning technology, Grid/e-Science
– Systems & Networking: flexible and responsive capability computing service, smooth transitions between phases
HPCx Utilisation
[Figure: stacked monthly usage, January to August 2003, broken down by job size (8, 16, 32, 64, 128, 256, 512, 1024 and >1024 CPUs); vertical axis "Usage", 0 to 800000]
• successful first 9 months
• >75% utilisation for last 6 months
• capability usage has increased to 35%
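The capability figure can be read as the share of total usage that comes from jobs at or above the capability threshold (at least 512 CPUs, per the objectives). A minimal sketch of that calculation follows; the usage numbers are hypothetical and chosen only for illustration, not read from the chart, and the function and constant names are assumptions.

```python
CAPABILITY_THRESHOLD = 512  # CPUs; the "at least 512 CPUs" figure from the objectives


def capability_fraction(usage_by_cpus: dict[int, float]) -> float:
    """Fraction of total usage from jobs with at least CAPABILITY_THRESHOLD CPUs.

    `usage_by_cpus` maps job size (number of CPUs) to usage in any consistent unit.
    """
    total = sum(usage_by_cpus.values())
    if total == 0:
        return 0.0
    capability = sum(
        usage for cpus, usage in usage_by_cpus.items() if cpus >= CAPABILITY_THRESHOLD
    )
    return capability / total


# Hypothetical one-month usage keyed by job size in CPUs (illustrative numbers only).
example = {64: 200.0, 128: 250.0, 256: 200.0, 512: 250.0, 1024: 100.0}
print(f"capability usage: {capability_fraction(example):.0%}")
```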
HPCx and the Grid
• Key responsibility for Software Engineering team
– led by Dr Stephen Booth
• who is also responsible for EPCC’s Grid operations
• HPCx is committed to supporting access via the Grid
– currently provided through Globus 2
– Globus 3 support when appropriate
• HPCx is a key part of the UK collaboration with the Extensible Teragrid Facility (ETF) project in the US
– promoting UK science on the world stage
ETF Collaboration
• Focus is exploiting unique features of Grid + HPC systems for capability computing
– ‘HPCy-class’ applications
• Initial experiment planned for SC2003
– RealityGrid computational steering
– HPCx is major compute resource
• Current challenges are:
– network bandwidth
– lack of direct network connections to compute nodes
• developing port-forwarding software to allow Globus IO connections to batch jobs
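To illustrate the general idea of port forwarding, the sketch below is a minimal TCP relay: a process on a node that is reachable from outside accepts connections and copies bytes in both directions to a target that is not directly reachable. It is a generic illustration only; it is not the HPCx software, it does not use Globus IO, and all names in it are hypothetical.

```python
import socket
import threading


def pipe(src: socket.socket, dst: socket.socket) -> None:
    """Copy bytes from src to dst until src closes, then signal end-of-stream to dst."""
    try:
        while True:
            data = src.recv(4096)
            if not data:
                break
            dst.sendall(data)
    except OSError:
        pass  # peer went away; fall through to the shutdown below
    finally:
        try:
            dst.shutdown(socket.SHUT_WR)
        except OSError:
            pass


def handle(client: socket.socket, target: tuple[str, int]) -> None:
    """Relay one client connection to the target in both directions."""
    try:
        upstream = socket.create_connection(target)
    except OSError:
        client.close()
        return
    with client, upstream:
        back = threading.Thread(target=pipe, args=(upstream, client), daemon=True)
        back.start()
        pipe(client, upstream)
        back.join()


def start_forwarder(listen_host: str, listen_port: int,
                    target: tuple[str, int]) -> socket.socket:
    """Listen on (listen_host, listen_port) and forward each connection to target.

    Returns the listening socket so the caller can read the bound port.
    """
    server = socket.create_server((listen_host, listen_port))

    def accept_loop() -> None:
        while True:
            try:
                client, _ = server.accept()
            except OSError:
                break  # listening socket closed or failed
            threading.Thread(target=handle, args=(client, target), daemon=True).start()

    threading.Thread(target=accept_loop, daemon=True).start()
    return server
```

In a setting like the one described, such a relay would run on a front-end node and be pointed at the host and port where the batch job is listening; a production version would also need authentication and a way to discover where each job is running, which this sketch leaves out.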
Summary
• HPCx builds on significant complementary experience at EPCC and DL
• Very successful start
– …with capability usage already up to 35%
• Committed to e-Science and the Grid
– strong links with NeSC and CCLRC e-Science Centre
– ETF experiment at SC2003
• HPCx is focussed on capability computing
– world-class service for world-class research