Korea Institute of Science and Technology Information Visit to NeSC

advertisement
Korea Institute of
Science and Technology
Information
Visit to NeSC
Malcolm Atkinson
Director
www.nesc.ac.uk
24th March 2004
Outline
The National e-Science Centre
Role and mission
The e-Science Institute
The UK e-Science Programme
Funding and organisation
The UK Grid
The European dimension
The Essence of e-Science
NeSC Roles
Help coordinate and lead UK e-Science
Community building, training & outreach
Help establish the UK’s international role
The focus for presenting UK e-Science
Run the e-Science Institute
A meeting place
Workshops and conferences
Research visitors and events
Undertake R&D projects
Reliable middleware (OGSA-DAI, SunDCG, …)
Engage industry (IBM, Sun, Microsoft, HP, Oracle, …)
Stimulate the uptake of e-Science technology
Training Team
The Primary Requirement …
Enabling People to Work Together on Challenging Projects: Science, Engineering & Medicine
Events Held
(from 1 Aug 2002 to 29 Feb 2004 – 31 months)
We have run 197 events (just over 6 per month):
3 conferences (including GGF5 with 900 participants)
20 project meetings
16,444 delegate days
23 research meetings
197 events
61 workshops
4 schools
6,825 delegates
32 training sessions
339 event days
27 outreach events
9 international meetings
18 e-Science management meetings
(though the definitions are fuzzy!)
The Website – a Resource
National e-Science Centre
http://www.nesc.ac.uk/
Mission, Background, Foundation, Locations, Staff, Resources, Projects
Register interest, Mailing lists, NeSCForge
Regional associations and Collaborations
News, Notices
Presentations and Lectures
http://www.nesc.ac.uk/presentations/
National e-Science Institute
http://www.nesc.ac.uk/esi/
Mission, Events (Future and Past)
Register for Events, Visitor Programme, GridNet
UK e-Science
Map and Index of Centres
Technical Papers
Index of >100 Projects
Task Forces
http://www.nesc.ac.uk/centres/
http://www.nesc.ac.uk/technical_papers/
http://www.nesc.ac.uk/projects/
http://www.nesc.ac.uk/teams/
General Information
Glossary, Bibliography, Who’s who
Comprehensive
Widely used byand
UK, Growing
USA andSource
the restofofInformation
the world
Web site Statistics
Since going ‘live’ in 2001 to 31 Jan 2004 (figures in
last week to 27 Feb 2004)
> 3.0 million successful requests (‘hits’)
transferring 166 gigabytes of data (2.20 GB)
Average hits per day 3237 (5430)
Distinct files served 18,980 (2821)
… to 96,610 (3705) distinct hosts
Average data transferred per day 182 MB
(typical file size about 100 kB) (322 MB)
Web site Statistics 2
18,000
200000
AFRICA
AMERICA - NORTH
180000
AMERICA - OTHER
16,000
ASIA
EUROPE - UK
160000
EUROPE - NON UK
14,000
MIDDLE EAST
OCEANIA - PACIFIC
UNKNOWN
140000
HITS
12,000
100000
8,000
80000
6,000
60000
4,000
40000
2,000
20000
2004/02
2004/01
2003/12
2003/11
2003/10
2003/09
2003/08
2003/07
2003/06
2003/05
2003/04
2003/03
2003/02
2003/01
2002/12
2002/11
2002/10
2002/09
2002/08
2002/07
2002/06
2002/05
2002/04
2002/03
0
2002/02
0
Successful Hits
10,000
2002/01
Volume (MB)
120000
Technical Reports
The Virtual Observatory as a Data Grid, Bob Mann, Sep 03
UK Experience with OGSA, Dave Berry, Sep 03
e-Science Gap Analysis, Geoffrey Fox, David Walker, Jun 03
Scientific Data Mining, Integration and Visualisation, Bob Mann, Roy Williams,
Malcolm Atkinson, Ken Brodlie, Amos Storkey, Chris Williams, Nov 02
A Rough Guide to Grid Security, Mike Surridge, Sep 02
Multi-Site Videoconferencing for the UK e-Science Programme, Stephen Booth, John
Brooke, Kate Caldwell, Liz Carver, Michael Daw, David De Roure, Alan Flavell,
Philippe Galvez, Brian Gilmore, Henry Hughes, Ben Juby, Ivan Judson, Jim Miller,
Harvey Newman, Chris Osland, Sue Rogers, Oct 02
Database Access and Integration Services on the Grid, Norman W Paton, Malcolm P
Atkinson, Vijay Dialani, Dave Pearson, Tony Storey, Paul Watson, Feb 02
Research Agenda for the Semantic Grid: A Future e-Science Infrastructure, David De
Roure, Nicholas Jennings, Nigel Shadbolt, Dec 01
Databases and the Grid, Paul Watson, Dec 01
Technical Reports
A Grid Application Framework based on Web Services Specifications and Practices,
Savas Parastatidis, Jim Webber, Paul Watson, Thomas Rischbeck, Aug 03
Grid Information Systems 2003 (Draft), Rob Allan, Dharmesh Chohan, Xiao Dong
Wang, Andy Richards, Mark McKeown, John Colgrave, Matthew Dovey, Mark Baker,
Steve Fisher, Dec 03
Towards tractable toolkits for the Grid: a plea for lightweight, usable middleware,
Jonathan Chin, Peter Coveney, Feb 04
Portals and Portlets 2003, Rob Allan, Chris Awre, Mark Baker, Adrian Fish, Mar 04
IMAGE 03: Images, Medical Analysis and Grid Environments, Dave Berry, Derek Hill,
Steve Pieper, Joel Saltz, Cécile Germain-Renaud, Mar 04
Open Issues in Grid Scheduling, Alain Andrieux, Dave Berry, Jon Garibaldi, Stephen
Jarvis, Djamila Ouelhadj, Mar 04
Data Provenance and Annotation, Peter Buneman, Michael Wilde.
e-Science Workflow Services, Matthew Addis, Dave Berry, Earl Ecklund, Carole
Goble
Outline
The National e-Science Centre
Role and mission
The e-Science Institute
The UK e-Science Programme
Funding and organisation
The UK Grid
The European dimension
The Essence of e-Science
UK e-Science Budget
(2001-2006)
Total: £213M
EPSRC Breakdown
MRC (£21.1M)
10%
EPSRC (£77.7M)
37%
HPC (£11.5M)
BBSRC (£18M)
15%
8%
NERC (£15M)
7%
Applied (£35M)
Staff
45%
costs Grid Resources
funded separately
Core (£31.2M)
40%
PPARC (£57.6M)
27%
CLRC (£10M)
5%
ESRC (£13.6M)
6%
+ Industrial Contributions
Source: Science Budget 2003/4 – 2005/6, DTI(OST)
The e-Science
Centres
Globus Alliance
Open
Middleware
Infrastructure
Institute
Digital
Curation
Centre
e-Science
Institute
Grid
Operations
Centre
?
CeSC (Cambridge)
EGEE
Digital Curation Centre
communities
of practice:
users
curation
organisations
community
support &
outreach
Collaborative
Associates
Network of
Data
Organisations
services
management
& coordination
research
research
collaborators
development
testbeds
& tools
Industry
standards bodies
Task Forces
Directors’ Forum
Helped build a community
Engineering Task Force
Built the UK Grid
Architecture Task Force
UK Adoption of OGSA
OGSA Grid Market
Future approaches
Security Task Force
Database Task Force (now disbanded)
OGSA-DAI (www.ogsadai.org.uk)
GGF DAIS-WG
Usability Task Force
The e-Science
Grid
Engineering Task
Force
(Contributions
from e-Science
Centres)
1280 x CPU
AIX
512 x CPU
Irix
HPC(x)
20 x CPU
18TB Disk
Linux
Grid Support
Centre / Grid
Operations
Centre
OGSA Test Grid
projects
CeSC (Cambridge)
64 x CPU
4TB Disk
Linux
Access Grid
Cameras
Crucial for
management
meetings
Requires IP
multicast
throughout
the network
Microphones
Some UK e-Health Projects
eDIaMoND (with IBM and Mirada)
Breast Cancer Project
IXI (with GSK and Philips Medical)
Information from medical images
MIAS Devices
Mobile sensors for healthcare
CLEF
Integrating medical information
eDiaMoND – Compute
Mammograms have different
appearances, depending on image
settings and acquisition systems
Standard
Mammo
Format
Temporal
mammography
Computer
Aided
Detection
3D View
eDiaMoND – Non-Functional
Anonymisation
Grid
Screening
Screening
Screening
Screening
Diagnosis
Diagnosis
Diagnosis
Teaching
Training
Teaching
Teaching
Epidemiology
Epidemiology
Epidemiology
Epidemiology
Ethics
Legal
Security
Performance
Scalability
Manageability
Auditability
……
Lossless Compression
Encryption
256MB & 5 secs
response
~100 Centres
Systems Administration
Non-Repudiation
KCL, Imperial and Oxford
http://www.ixi.org.uk
Automatic registration technology
Rigid registration of
MR and CT images
of the head
Inter-subject image warping
CLEF - Integrating information
Need high quality, integrated clinical information for:
clinical research
evidence-based health care
the clinical application of genetic and genomic research
The capture, integration, and presentation of descriptive
information is a major barrier to achieving an integrated
framework
Data includes:
clinical histories
radiology and pathology reports
annotations on genomic and image databases
technical literature and Web based resources
MIAS Devices Project
Sensor bus
Easy Plug and Play of Sensors
Wireless connection using
802.11
Positioning information from
GPS
Mobile medical technologies on
a distributed Grid
GPS ariel
The European dimension
EGEE: Enabling Grids for E-Science in Europe
… and beyond
32M Euro, 10 regions, 70 partners
Additional funding from NSF (USA)
50% production, 30% development, and 20%
dissemination and training
“The Grid Infrastructure in Europe”
Deploy a production Grid across Europe
Initially based on LHC Computing Grid
Examples of
HealthGRID
applications
Grids for medical development
Preparation and follow-up of
medical missions in developing
Clermont-Ferrand/Paris
countries
Support to local medical
centres in terms of second
diagnosis, patient follow-up
and e-learning
Ibagué
Hand surgery
Medical centre
2 missions (Ibagué &
Chuxiong) with the
french NPO « Chaîne de
l’Espoir » used as test
cases
Chuxiong
The grid impact :
•Improved telemedecine
services
• Federation of patient
databases
•Interactive e-learning (high
bandwidth network required)
eHealth
eScience
DataGrid :
status of biomedical applications
Bio-informatics
deployed
tested on EDG
under preparation
Phylogenetics : BBE Lyon (T. Sylvestre)
Search for primers : Centrale Paris (K. Kurata)
Bio-informatics web portal : IBCP (C.
Blanchet)
Parasitology : LBP Clermont, Univ B. Pascal
(N. Jacq)
Data-mining on DNA chips : Karolinska (R.
Médina, R. Martinez)
Geometrical protein comparison : Univ.
Padova (C. Ferrari)
GATE MonteCarlo simulation
platform for
nuclear
medecine
Medical imaging
Local_Monopro1500MHz
X10
X20
X50
X100
160
Temps en minutes
MR image simulation : CREATIS (H. BenoitCattin)
Medical data and metadata management :
CREATIS (J. Montagnat)
Mammographies analysis ERIC/Lyon 2 (S.
Miguet, T. Tweed)
Simulation platform for PET/SPECT based on
Geant4 : GATE collaboration (L. Maigne)
180
140
120
100
80
60
40
20
0
Parallelisation
Outline
The National e-Science Centre
Role and mission
The e-Science Institute
The UK e-Science Programme
Funding and organisation
The UK Grid
The European dimension
The Essence of e-Science
What is e-Science?
Invention and exploitation of advanced
computational methods
to generate, curate and analyse research data


From experiments, observations and simulations
Quality management, preservation and reliable evidence
to develop and explore models and simulations


Computation and data at extreme scales
Trustworthy, economic, timely and relevant results
to enable dynamic distributed virtual organisations


Facilitating collaboration with information and resource sharing
Security, reliability, accountability, manageability and agility
e-Science >> Grid & Web Services
It is what you do with them that counts
Fundamental & Growing Assets
Understanding of Processes & Requirements
International and Multi-disciplinary Skill base
Experience composing & adapting existing
technologies
and of building new components
Experience Supporting Developers and Users
Experience Establishing Virtual Organisations
across Enterprise boundaries
Embedded in People & Teams, Growing – they need nurture
Primary Multi-Enterprise Issues
Combining subsystems built independently in
different enterprises
Deploying, Starting and Managing Applications
and Production Operations
Assume Benefits of
Shared Infrastructure:
Using a set of combined facilities
How much?
Independently built
One size fits all?
Autonomously managed
Developing software independently
Expecting to integrate later
All for VO communities that retain independence
Relative Importance
What envelopes you put your messages in
How they are delivered
Infrastructure to organise a common technical
platform – the foundations of communication
Relative Importance
What envelopes you put your messages in
How they are delivered
Infrastructure to organise a common technical platform – the foundations of
communication
What information you send in your messages
Their patterns of Use - sequences that mean
something
Their Contents
The Grammar and Vocabulary of Communication
Agreed Interpretations
Relative Importance
What envelopes you put your messages in
Technical
Experts
How they are delivered
Infrastructure to organise a common technical platform – the foundations of communication
What information you send in your messages
Their patterns of Use - sequences that mean something
Their Contents
The Grammar and Vocabulary of Communication
Agreed Interpretations
What you do when you get a message
The Application Code you Execute
The Middleware Services

Security, Privacy, Authorisation, Accounting, Registries, Brokers, …
Integration Services

Multi-site Hierarchical Scheduling, Data Access & Integration, …
Portals, Workflow Systems, Virtual Data, Semantic Grids
Tools to support Application Developers, Users & Operations
Relative Importance
What envelopes you put your messages in
How they are delivered
Infrastructure to organise a common technical platform – the foundations of communication
What information you send in your messages
Their patterns of Use - sequences that mean something
Their Contents
The Grammar and Vocabulary of Communication
Agreed Interpretations
What you do when you get a message
The Application Code you Execute
The Middleware Services

Security, Privacy, Authorisation, Accounting, Registries, Brokers, …
Integration Services

Domain
Specialists
Multi-site Hierarchical Scheduling, Data Access & Integration, …
Portals, Workflow Systems, Virtual Data, Semantic Grids
Tools to support Application Developers, Users & Operations
Creative Actions and Judgements of
Researchers, Designers & Clinicians
Data, Models & Analyses
In Silico Experiments, Design, Diagnosis & Planning
Creating the Scientific Record
Where Next for e-Infrastructure
Put people and teams first
The creative force
The repository of Experience, Skills and Knowledge
Focus on Major Priorities
Developing well-defined Flexible Agreements

Embraced as standards
High-level Software Investment

Applications & Requirements led
Explore & Evolve Common & Shared Infrastructure
Recognise and respond to differences
Celebrate and support commonalities
Download