Integration Services - National e

advertisement
Dr. Huai Jinpeng
(Vice President,
Beihang University)
Visit to NeSC
Malcolm Atkinson
Director
www.nesc.ac.uk
10th May 2004
Outline
The National e-Science Centre
Role and mission
The e-Science Institute
The UK e-Science Programme
Funding and organisation
The UK Grid
The European dimension
Technical Options
The Essence of e-Science
NeSC Roles
Help coordinate and lead UK e-Science
Community building, training & outreach
Help establish the UK’s international role
The focus for presenting UK e-Science
Run the e-Science Institute
A meeting place
Workshops and conferences
Research visitors and events
Undertake R&D projects
Reliable middleware (OGSA-DAI, SunDCG, …)
Engage industry (IBM, Sun, Microsoft, HP, Oracle, …)
Stimulate the uptake of e-Science technology
Training Team
The Primary Requirement …
Enabling People to Work Together on Challenging Projects: Science, Engineering & Medicine
Events Held
(from 1 Aug 2002 to 29 Feb 2004 – 31 months)
We have run 197 events (just over 6 per month):
3 conferences (including GGF5 with 900 participants)
20 project meetings
16,444 delegate days
23 research meetings
197 events
61 workshops
4 schools
6,825 delegates
32 training sessions
339 event days
27 outreach events
9 international meetings
18 e-Science management meetings
(though the definitions are fuzzy!)
The Website – a Resource
National e-Science Centre
http://www.nesc.ac.uk/
Mission, Background, Foundation, Locations, Staff, Resources, Projects
Register interest, Mailing lists, NeSCForge
Regional associations and Collaborations
News, Notices
Presentations and Lectures
http://www.nesc.ac.uk/presentations/
National e-Science Institute
http://www.nesc.ac.uk/esi/
Mission, Events (Future and Past)
Register for Events, Visitor Programme, GridNet
UK e-Science
Map and Index of Centres
Technical Papers
Index of >100 Projects
Task Forces
http://www.nesc.ac.uk/centres/
http://www.nesc.ac.uk/technical_papers/
http://www.nesc.ac.uk/projects/
http://www.nesc.ac.uk/teams/
General Information
Glossary, Bibliography, Who’s who
Comprehensive
Widely used byand
UK, Growing
USA andSource
the restofofInformation
the world
Outline
The National e-Science Centre
Role and mission
The e-Science Institute
The UK e-Science Programme
Funding and organisation
The UK Grid
The European dimension
Technical Options
The Essence of e-Science
UK e-Science Budget
(2001-2006)
Total: £213M
EPSRC Breakdown
MRC (£21.1M)
10%
EPSRC (£77.7M)
37%
HPC (£11.5M)
BBSRC (£18M)
15%
8%
NERC (£15M)
7%
Applied (£35M)
Staff
45%
costs Grid Resources
funded separately
Core (£31.2M)
40%
PPARC (£57.6M)
27%
CLRC (£10M)
5%
ESRC (£13.6M)
6%
+ Industrial Contributions
Source: Science Budget 2003/4 – 2005/6, DTI(OST)
The e-Science
Centres
Globus Alliance
Open
Middleware
Infrastructure
Institute
Digital
Curation
Centre
e-Science
Institute
Grid
Operations
Centre
?
CeSC (Cambridge)
EGEE
Digital Curation Centre
communities
of practice:
users
curation
organisations
community
support &
outreach
Collaborative
Associates
Network of
Data
Organisations
services
management
& coordination
research
research
collaborators
development
testbeds
& tools
Industry
standards bodies
The e-Science
Grid
Engineering Task
Force
(Contributions
from e-Science
Centres)
1600 x CPU
AIX
512 x CPU
Irix
HPC(x)
20 x CPU
18TB Disk
Linux
Grid Support
Centre / Grid
Operations
Centre
OGSA Test Grid
projects
CeSC (Cambridge)
64 x CPU
4TB Disk
Linux
Access Grid
Cameras
Crucial for
management
meetings
Requires IP
multicast
throughout
the network
Microphones
Some UK e-Health Projects
eDIaMoND (with IBM and Mirada)
Breast Cancer Project
IXI (with GSK and Philips Medical)
Information from medical images
MIAS Devices
Mobile sensors for healthcare
CLEF
Integrating medical information
eDiaMoND – Compute
Mammograms have different
appearances, depending on image
settings and acquisition systems
Standard
Mammo
Format
Temporal
mammography
Computer
Aided
Detection
3D View
eDiaMoND – Non-Functional
Anonymisation
Grid
Screening
Screening
Screening
Screening
Diagnosis
Diagnosis
Diagnosis
Teaching
Training
Teaching
Teaching
Epidemiology
Epidemiology
Epidemiology
Epidemiology
Ethics
Legal
Security
Performance
Scalability
Manageability
Auditability
……
Lossless Compression
Encryption
256MB & 5 secs
response
~100 Centres
Systems Administration
Non-Repudiation
KCL, Imperial and Oxford
http://www.ixi.org.uk
Automatic registration technology
Rigid registration of
MR and CT images
of the head
Inter-subject image warping
MIAS Devices Project
Sensor bus
Easy Plug and Play of Sensors
Wireless connection using
802.11
Positioning information from
GPS
Mobile medical technologies on
a distributed Grid
GPS ariel
The European dimension
EGEE: Enabling Grids for E-Science in Europe
… and beyond
32M Euro, 10 regions, 70 partners
Additional funding from NSF (USA)
50% production, 30% development, and 20%
dissemination and training
“The Grid Infrastructure in Europe”
Deploy a production Grid across Europe
Initially based on LHC Computing Grid
Examples of
HealthGRID
applications
Grids for medical development
Preparation and follow-up of
medical missions in developing
Clermont-Ferrand/Paris
countries
Support to local medical
centres in terms of second
diagnosis, patient follow-up
and e-learning
Ibagué
Hand surgery
Medical centre
2 missions (Ibagué &
Chuxiong) with the
french NPO « Chaîne de
l’Espoir » used as test
cases
Chuxiong
The grid impact :
•Improved telemedecine
services
• Federation of patient
databases
•Interactive e-learning (high
bandwidth network required)
eHealth
eScience
DataGrid :
status of biomedical applications
Bio-informatics
deployed
tested on EDG
under preparation
Phylogenetics : BBE Lyon (T. Sylvestre)
Search for primers : Centrale Paris (K. Kurata)
Bio-informatics web portal : IBCP (C.
Blanchet)
Parasitology : LBP Clermont, Univ B. Pascal
(N. Jacq)
Data-mining on DNA chips : Karolinska (R.
Médina, R. Martinez)
Geometrical protein comparison : Univ.
Padova (C. Ferrari)
GATE MonteCarlo simulation
platform for
nuclear
medecine
Medical imaging
Local_Monopro1500MHz
X10
X20
X50
X100
160
Temps en minutes
MR image simulation : CREATIS (H. BenoitCattin)
Medical data and metadata management :
CREATIS (J. Montagnat)
Mammographies analysis ERIC/Lyon 2 (S.
Miguet, T. Tweed)
Simulation platform for PET/SPECT based on
Geant4 : GATE collaboration (L. Maigne)
180
140
120
100
80
60
40
20
0
Parallelisation
EGEE - NA3 Challenges
Building an Integrated training team
Developing training material and delivery platforms
Attracting enough potential users
Converting enough of them to successful EGEE users
Sustaining both high quality and high volume training
Achieving and delivering advanced skills & knowledge
EGEE - NA3 Partners
NeSC
Edinburgh
UK & Ireland
IHEP
IMPB RAS
ITEP
JINR
Moscow
Russia
Moscow
Russia
Moscow
Russia
Dubna
Russia
KU-NATFAK
PNPI
RRCKI
Copenhagen
Denmark
Petersburgh
Russia
Moscow
Russia
GUP
Linz
Austria
FZK
Karlsruhe
Germany
Innsbruck
Austria
GRNET
Athens
Greece
INFN
CESNET
Rome
Italy
Prague
Czech Rep.
BUTE
Budapest
Hungary
II-SAS
Bratislava
Slovakia
ICM
PSNC
ICI
Warsaw
Poland
Poznan
Poland
Bucharest
Romania
ELUB
Budapest
Hungary
MTA SZTAKI
TAU
Budapest
Hungary
Tel Aviv
Isreal
e-Infrastructure Issues
Discussion of Technical options
OGSI & GT3 investment
Three forms of investment
Architectural effort
This
OGSA: Use cases, Design Patterns, …
investment
Factoring & describing a complex engineering domain
carries
Standardisation effort
OGSI, DAIS, WS-Agreement, etc.
forward
WSDL 2.0, WSDM, WS-Security, etc.
into
Implementation effort
Combined OGSI & Grid component work
WSRF
Combining
Grid and Web Services – First try
Technology intercept
is not easy
Grid
Started
far apart
in apps
& tech
Web
Have been
converging
?
People accepted
OGSI
• Several (some partial) implementations
• Issues: technical, political & commercial
• Successes: a number of operational grids
Combining
Grid and Web Services: Second try
Grid
Started
far apart
in apps
& tech
Web
Technology intercept
is still not easy
Have been
converging
WSRF
Not the only
possible
technical
solution
Support from major WS vendors
especially service management suppliers
e.g., CA, HP, IBM, Fujitsu, BEA, SAP, …
Core Ideas in WSRF
Preserves OGSI functionality
Lifetime, properties, notification, error types, …
Separates service from resource
Service is static and stateless
Resource is dynamic and stateful
Builds on WS-Addressing
Is WS-I compliant
But note that WS-I alone doesn’t make the problems go
away, still need to worry about how to manage lifetime,
naming, state, …
GT & WSRF Timeline
GW OASISGGF10 interop
TC
techPre
1
Improved robustness,
scalability, performance,
usability
GT3.2
2004
3.2
March
4.0 b
Q2
2005
GT4.0
Not waiting for
finalisation
of WSRF specs.
Use as submitted
4.0
Q3
WSRF; some new
functionality; further
usability, performance
4.2
enhancements
Q2 ‘05
GT4.2
Numerous new WSRF-based services
Components in GT 3.2
GSI
WU GridFTP
Pre-WS
GRAM
WS-Security
RFT
(OGSI)
WS GRAM
(OGSI)
CAS
(OGSI)
RLS
SimpleCA
OGSA-DAI
MDS2
JAVA
WS Core
(OGSI)
WS-Index
(OGSI)
OGSI
C Bindings
OGSI
Python Bindings
(contributed)
pyGlobus
(contributed)
XIO
Security
Data
Management
Resource
Management
Information
Services
WS
Core
Planned Components in GT 4.0
GSI
New GridFTP
Pre-WS
GRAM
WS-Security
RFT
(WSRF)
WS-GRAM
(WSRF)
CAS
(WSRF)
RLS
CSF
(contribution)
SimpleCA
OGSA-DAI
Authz
Framework
XIO
Security
Data
Management
MDS2
JAVA
WS Core
(WSRF)
WS-Index
(WSRF)
C WS Core
(WSRF)
pyGlobus
(contributed)
Resource
Management
Information
Services
WS
Core
Importance of collaboration: VDT
A highly successful
collaborative effort
VDT Working Group
VDS (Chimera/Pegasus) team

Provides the “V” in VDT
Condor Team
Globus Alliance
NMI Build and Test team
EDG/LCG/EGEE

Middleware, testing, patches,
feedback …
PPDG

Hardening and testing
Pacman


Provides easy installation capability
Currently Pacman 2, moving to
Pacman 3 soon
Thanks to Miron Livny
Used by many projects
Systematic testing
Rich integration of
components
The UK should be part of
this – exploit test bed
contribute components
Conclusions
WSRF
Good enough for recurrent platform requirements
Has significant commercial and technical momentum
Improves engagement with industry
Only sensible flag to rally behind
Must collaborate internationally
Scale of challenge & international virtual organisations
Discourage localised alternatives
Avoid effort fragmentation and unnecessary arguments
Coping well with transitions …
Is a primary Darwinian selector!
Outline
The National e-Science Centre
Role and mission
The e-Science Institute
The UK e-Science Programme
Funding and organisation
The UK Grid
The European dimension
The Essence of e-Science
What is e-Science?
Invention and exploitation of advanced
computational methods
to generate, curate and analyse research data


From experiments, observations and simulations
Quality management, preservation and reliable evidence
to develop and explore models and simulations


Computation and data at extreme scales
Trustworthy, economic, timely and relevant results
to enable dynamic distributed virtual organisations


Facilitating collaboration with information and resource sharing
Security, reliability, accountability, manageability and agility
e-Science >> Grid & Web Services
It is what you do with them that counts
Fundamental & Growing Assets
Understanding of Processes & Requirements
International and Multi-disciplinary Skill base
Experience composing & adapting existing
technologies
and of building new components
Experience Supporting Developers and Users
Experience Establishing Virtual Organisations
across Enterprise boundaries
Embedded in People & Teams, Growing – they need nurture
Primary Multi-Enterprise Issues
Combining subsystems built independently in
different enterprises
Deploying, Starting and Managing Applications
and Production Operations
Assume Benefits of
Shared Infrastructure:
Using a set of combined facilities
How much?
Independently built
One size fits all?
Autonomously managed
Developing software independently
Expecting to integrate later
All for VO communities that retain independence
Relative Importance
What envelopes you put your messages in
How they are delivered
Infrastructure to organise a common technical
platform – the foundations of communication
Relative Importance
What envelopes you put your messages in
How they are delivered
Infrastructure to organise a common technical platform – the foundations of
communication
What information you send in your messages
Their patterns of Use - sequences that mean
something
Their Contents
The Grammar and Vocabulary of Communication
Agreed Interpretations
Relative Importance
What envelopes you put your messages in
Technical
Experts
How they are delivered
Infrastructure to organise a common technical platform – the foundations of communication
What information you send in your messages
Their patterns of Use - sequences that mean something
Their Contents
The Grammar and Vocabulary of Communication
Agreed Interpretations
What you do when you get a message
The Application Code you Execute
The Middleware Services

Security, Privacy, Authorisation, Accounting, Registries, Brokers, …
Integration Services

Multi-site Hierarchical Scheduling, Data Access & Integration, …
Portals, Workflow Systems, Virtual Data, Semantic Grids
Tools to support Application Developers, Users & Operations

Incremental deployment tools, diagnostic aids, performance monitoring, …
Relative Importance
What envelopes you put your messages in
How they are delivered
Infrastructure to organise a common technical platform – the foundations of communication
What information you send in your messages
Their patterns of Use - sequences that mean something
Their Contents
The Grammar and Vocabulary of Communication
Agreed Interpretations
What you do when you get a message
The Application Code you Execute
The Middleware Services

Security, Privacy, Authorisation, Accounting, Registries, Brokers, …
Integration Services

Domain
Experts
Multi-site Hierarchical Scheduling, Data Access & Integration, …
Portals, Workflow Systems, Virtual Data, Semantic Grids
Tools to support Application Developers, Users & Operations
Creative Actions and Judgements of
Researchers, Designers & Clinicians
Data, Models & Analyses
In Silico Experiments, Design, Diagnosis & Planning
Creating the Scientific Record
Where Next for e-Infrastructure
Put people and teams first
The creative force
The repository of Experience, Skills and Knowledge
Focus on Major Priorities
Developing well-defined Flexible Agreements

Embraced as standards
High-level Software Investment

Applications & Requirements led
Explore & Evolve Common & Shared Infrastructure
Recognise and respond to differences
Celebrate and support commonalities
Download