ppt - The Australian Virtual Observatory

advertisement
Funding Sources for Academic Research
Nearly all academic research in the UK is funded by the government
through Research Councils.
There are 6 Research Councils in total:
EPSRC – Engineering and Physical Sciences Research Council
NERC – Natural Environment Research Council
PPARC – Particle Physics and Astronomy Research Council
BBSRC – Biotechnology and Biological Science Research Council
ESRC – Economic and Social Research Council
MRC – Medical Research Council
And:
CCLRC – Council for the Central Laboratory of the Research Councils
The UK e-Science Programme
Kerstin Kleese van Dam
(For Tony Hey
Director of UK e-Science Core Programme
Tony.Hey@epsrc.ac.uk)
e-Science and the Grid
‘e-Science is about global collaboration in key
areas of science, and the next generation of
infrastructure that will enable it.’
John Taylor
Director General of Research Councils
Office of Science and Technology
GRID Vision
Computing resources
Instruments
Complex problem
Data
Knowledge
GRID
Solution
People
The Grid as an Enabler
for Virtual Organisations
Ian Foster, Carl Kesselman and Steve Tueke
• ‘The Grid is a software infrastructure that
enables flexible, secure, coordinated resource
sharing among dynamic collections of
individuals, institutions and resources’
•
includes computational systems and
data storage resources and specialized
facilities
• Enabling infrastructure for transient ‘Virtual
Organisations’
UK e-Science Initiative: First Phase
• £120M Programme over 3 years from April 2001
• £75M is for Grid Applications in all areas of
science and engineering
• £10M as first installment for UK HPC(X)
• £35M ‘Core Program’ to encourage development
of generic ‘industrial strength’ Grid middleware
 Require £20M additional ‘matching’
funds from industry
UK e-Science Programme
Director’s
Awareness and Co-ordination Role
Director’s
Management Role
Generic Challenges
Pilot Application
Programme
PPARC (£26M)
BBsrc (£8M)
MRC (£8M)
NERC (£7M)
Esrc (£3M)
EPsrc (£17M)
CLRC (£5M)
Research Councils (£74M)
EPsrc (£15M), DTI (£20M)
Collaborative projects
Industrial Collaboration (£20M)
Technical
Advisory
Group
UK e-Science Management
DGRC/CERCs
e-Science
Steering Committee
Research Councils
e-Science Directors
Relevant National/
International bodies:
e.g. JISC, CERN
CEO/EPsrc
Director
e-Science Core
Programme
e-Science Support
Based at EPsrc
and at DTI
Technical Advisory
Group
Core Programme
Project Teams
Excerpt from e-Science CP
Director’s job objectives
‘Develop effective collaborative Core
Programme projects between the science
base, industry and national funding
agencies, and ensure the application and
outcomes from the projects.’
UK e-Science Projects
£75M for e-Science Grid Application ‘pilots’
- spanning all sciences and engineering
 Particle Physics and Astronomy (PPARC)
- £17M GridPP and £5M AstroGrid
 Engineering and Physical Sciences (EPSRC)
- funding 6 projects at around £3M each
 Biology, Medical and Environmental Science
- funding projects with total value of £23M
UK Grid Projects: First Phase (1)
Particle Physics and Astronomy (PPARC)
• GRIDPP
• ASTROGRID
Engineering and Physical Sciences (EPsrc)
• Comb-e-Chem
• DiscoveryNet
• GEODISE
• myGrid
• RealityGrid
Comb-e-Chem Project
Video
Simulation
Diffractometer
Properties
Analysis
Structures
Database
X-Ray
e-Lab
Properties
e-Lab
Grid Middleware
GEODISE Project
Engineer
GEODISE
PORTAL
Knowledge
repository
Ontology for
Engineering,
Computation, &
Optimisation and
Design Search
Reliability
Security
QoS
Visualization
Session
database
Traceability
OPTIMISATION
Globus, Condor, SRB
OPTIONS
System
Optimisation
archive
APPLICATION
SERVICE
PROVIDER
Intelligent
Application
Manager
CAD System
CADDS
IDEAS
ProE
CATIA, ICAD
COMPUTATION
Licenses
and code
Analysis
CFD
FEM
CEM
Design
archive
Parallel machines
Clusters
Internet Resource Providers
Pay-per-use
Intelligent
Resource
Provider
Computational science
• Molecular dynamics
• Mesoscale modelling
• High throughput experiments
• High performance visualization
• Computational steering
• Terascale parallel computing
myGrid Project
• Imminent ‘deluge’ of
data
• Highly heterogeneous
• Highly complex and
inter-related
• Convergence of data
and literature archives
Discovery Net Project
In Real Time
Scientific
Information
Scientific
Discovery
Real Time Integration
Workflow Construction
Literature
Databases
Operational
Data
Dynamic Application
Integration
Interactive Visual
Analysis
Using Distributed Resources
Images
Instrument
Data
How It Works
Interactive
Editor &
Visualisation
Nucleotide Annotation Workflows
Download
sequence
from
Reference
Server
Inter
Pro
SMART
KEGG
EMBL
NCBI
SWISS
PROT
TIGR
SNP
GO
Save to
Distributed
Annotation
Server
1800 clicks
 500 Web access
200 copy/paste
 3 weeks work
in 1 workflow
and few second
execution
Execute
distributed
annotation
workflow
UK Grid Projects: First Phase (2)
Natural Environment Applications (NERC)
• Climateprediction.com
• Oceanographic Grid
• Molecular Environmental Grid
• NERC DataGrid (with CP)
Biotechnology and Biological Sciences (BBsrc)
• Biomolecular Grid
• Proteome Annotation Pipeline
• High-Throughput Structural Biology
• Global Biodiversity
BioSim GRID
1st Level Metadata –
Describing the Simulation
Data…
York
Nottingham
Level Metadata – Describing the
Results of Generic Analyses…
2nd
Birmingham
Oxford
RAL
distributed ‘raw’ data
London
…
Southampton
Structure of the proposed biosimulation database
A biosimulation GRID for the UK
Integrating Different Levels of Simulation
molecular
cellular
organism
Sansom et al. (2000) Trends
Biochem. Sci. 25:368

An e-science challenge – non-trivial

NASA IPG as a possible paradigm

Need to integrate rigorously if to deliver accurate
& hence biomedically useful results
Noble (2002) Nature Rev.
Mol. Cell.Biol. 3:460
UK Grid Projects: First Phase (3)
Medical Applications (MRC)
• Biology of Ageing (with BBsrc)
• Sequence and Structure Data
• Molecular Genetics
• Cancer Management (with PPARC)
• Clinical e-Science Framework
• Neuroinformatics Modeling Tools
CLEF - Clinical e-Science Framework
Partners:
• AstraZeneca, GSK, BMJ Publishing Group
• CSW Informatics, iSoft plc, Sun Microsystems Limited
• UK National Health Service
–
–
–
–
NHS Information Authority Stakeholder Relations
Camden & Islington Health Authority
Central Manchester and Manchester Childrens' Health Authority
Royal Brompton and Harefield NHS Trust
• Universities of Cambridge, Manchester, Freiburg and
University College London
CLEF - Integrating information
• High quality, integrated clinical information is key to:
– clinical research
– evidence-based health care
– the clinical application of genetic and genomic research
• Capture, integration, and presentation of descriptive
information is a major barrier to achieving an integrated
framework
• Data includes:
–
–
–
–
clinical histories
radiology and pathology reports
annotations on genomic and image databases
technical literature and Web based resources
e-Science and Grid Middleware
‘e-Science is about global collaboration in key areas
of science, and the next generation of infrastructure
that will enable it.’
John Taylor
 Requirements of e-Science Grid Application Projects
determine services required by Grid middleware
 UK Projects focus more on Grid Data Services than
Teraflop/s HPC systems
e-Science Core Program: First Phase

£15M OST + £20M DTI + £20M Industry
1. Network of e-Science Centres
 UK e-Science Grid
2. Support for e-Science Applications
3. Grid Network Issues
4. Generic/Industrial Grid Middleware
5. e-Health Grid ‘Grand Challenges’
6. Outreach/International Activities
UK e-Science Grid
Edinburgh
Glasgow
Newcastle
Belfast
Manchester
DL
Cambridge
Oxford
Cardiff
RAL
London
Southampton
Hinxton
UK e-Science Grid
• All e-Science Centres donating resources plus four
dedicated compute/data clusters
– Supercomputers, clusters, storage, facilities
• All Centres run same Grid Software
– Starting point is Globus 2 and Condor: Storage
Resource Broker (SRB)
• Standard Grid middleware supported
– e-Science Grid now at ‘Level 2’: moving towards
production Grid with real users
Access Grid – Group Conferencing
Multi-site group-to-group
conferencing system
Continuous audio and video
contact with all
participants
Globally deployed
All UK e-Science
Centres have AG
rooms
Widely used for
technical and
management meetings
Support for e-Science Projects
• Grid Support Centre in operation
– supported Grid middleware & users
– see www.grid-support.ac.uk
• National e-Science Institute
– Research Seminars
– Training Programme
– See www.nesc.ac.uk
• National Certificate Authority
– Issue digital certificates for projects
– Goal is ‘single sign-on'
Anatomy of a Digital Certificate
Public Key
ABCDEFGHIJKLMNOPQRSTUV
A text string
Validity Data
Signature from CA’s private key
Extensions
How a certificate is issued
• The Registration Authority (RA) approves a request
for a certificate. The RA is local to the users.
• The CA then issues the corresponding certificate.
How does it work?
1. Scientist wishes to access a
resource, so he sends a copy
of the certificate to the resource
2. Resource says: prove it’s your
certificate
Challenge
Private Key
3. Scientist proves that he has
the corresponding private key
4. Resource is convinced that
scientist is who he claims to be
and decides to give him access
Response
UK CA Statistics, February 2003
•
•
•
•
•
•
250 valid certificates issued
24 RAs (more waiting for approval/training etc)
Issuing 60 certificates /month
Adding 3 RAs / month
Adding 6 RA operators /month
UK certificates recognized by EU and US
projects
Grid Network Team
• Expert group to identify end-to-end network
bottlenecks and other network issues
- e.g. problems with multicast for Access Grid
• Identify e-Science project requirements
• Funding (with PPARC and EPSRC) a number of
network QoS, scheduling and monitoring projects
• ‘UKLight’ lambda connection to Chicago and
Amsterdam now approved
UK Backbone Infrastructure
• Based on SuperJANET4 academic network
run by UKERNA for JISC
• WorldCom(!) providing national backbone
for SJ4 – now at 20Gbps
• Connections to universities via MANs at up
to 2.5Gbps
• ‘Last mile’ problem?
• Research network use versus teaching, websearching, email – differential services?
SuperJANET4
Access Grid
Multicast
One source sending same data to 3 receivers
only has to have one copy of data (more
copies are made only when necessary)
Networking Research Projects
GRID
Infrastructure
GRS, GRID resource management
‘
FutureGRID, P2P architecture
Service
Infrastructure
Network
Infrastructure
GridMcast, Multicastenabled data distribution
MB-NG, QoS Features
GRIDprobe, backbone
passive monitoring at
10Gbps
CP Collaborative Industrial
Projects: First Phase
•
•
•
•
•
9 Centres with ring-fenced allocations
£11M CP + £11M Industry funding
£5M Open Call Projects
All First Phase funds now committed
Over 60 Companies involved
CP Centre Projects
6 projects CeSC,
4 OeSC
5 NEReSC
4 NeSC),
5 SeSC
2 LeSC
5 WeSC
7 eSNW
5 BeSC
Total of 43 projects
68 different companies
Range of disciplines (IT, Engineering,
Pharma, Environmental etc)
New sectors engaged (broadcasting,
defence, banking etc)
Industrial Funds more than match
DTI funds
All Centres have spent money
allocated or have projects under
consideration
CP Open Call Projects
Visualization Middleware for e-Science
e-Science Technologies in the Simulation of Complex Materials
Performance-based Middleware for Grid Computing
A scalable monitoring platform for the GRID (GridProbe)
eDiamond distributed mammographic archive
End-to-End traffic management services
Information eXtraction from Images (IXI)
Deductive Synthesis Techniques to the Rapid Assembly of Grid Applications
Trustworthy GRID Resource Management
A Grid-based approach to the validation and testing of lubrication models
Self-Organising GRID Resource Management
Jigsaw: Distributed and dynamic visualisation generation
FutureGRID: a program for long-term research into GRID systems architecture
Total of 13 projects
OGSA – DAI Project
• Design Specification completed
– Papers for GGF WG on Database Access and
Integration Services
• Three Prototypes delivered:
– Distributed Query Service
– XML Database Interface
– Relational Database Interface
• Alpha versions delivered January 2003
– Integrate with Globus GT3
Open Grid Services Architecture
• Development of Web Services
• OGSA will provide
Naming /Authorization / Security / Privacy/…
 Projects looking at higher level services: Workflow,
Transactions, DataMining, Knowledge Discovery…
 Exploit Synergy: Commercial Internet
with Grid Services
IRC ‘Grand Challenge’ Projects
• Equator: Technological
innovation in physical and
digital life
• AKT: Advanced Knowledge
Technologies
• DIRC: Dependability of
Computer-Based Systems
• MIAS: From Medical Images
and Signals to Clinical
Information
e-Health Grid ‘Grand Challenges’
• Grid-Enabled Knowledge Services for Medical
Informatics
- Triple Assessment in Breast Cancer:
Clinical, Radiological and Cytological
data fusion
• Grid-based Medical Devices for Everyday Health
- Patient sensors, mobile wireless
communication
• eDiamond Digital Mammography
- Normalized archive of mammograms
- Oxford, IBM (£2M), Mirada and Hospitals
eDiamond
Mammograms have
different appearances,
depending on image
settings and acquisition
systems
SMF is a normalised
representation
independent of
scanner settings
eDiamond
Training and
Differential Diagnosis
Applications of SMF
Teleradiology and QC
VirtualMammo
“Find one like it”
?
Advanced CAD
SMF-CAD workstation
Epidemiology
SMFcomputed
breast density
International Involvement
• Funding UK participation in the Global Grid Forum
Research/Working Groups
• Funding for International CS ‘Grid Fellowships’
– CERN DataGrid and USA iVDGL
• International members on TAG
• Participation in EU FP5 Grid Activities
– e.g. EU DataGrid and DataTAG projects
• Development of FP6 Grid Projects
– First call closes April/May
– EGEE, EU Open Middleware Infrastructure Institute?
e-Science Demonstrators
•
•
•
•
•
•
•
•
•
Dynamic Brain Atlas
Biodiversity
Chemical Structures
Mouse Genes
Robotic Astronomy
Collaborative Visualisation
Climateprediction.com
Medical Imaging/VR
Seamless Access to Multiple Databases
UK e-Science Funding
First Phase: 2001 –2004
• Application Projects
– £74M
– All areas of science
and engineering
• Core Programme
– £35M
– Collaborative
industrial projects
Second Phase: 2003 –2006
• Application Projects
– £96M
– All areas of science and
engineering
• Core Programme
– £16M + £25M (?)
– Core Grid Middleware
Core Programme 2
Overall Rationale: Four major functions of CP
– Assist development of essential, wellengineered, generic, Grid middleware usable
by both e-scientists and industry
– Provide necessary infrastructure support for
UK e-Science Research Council projects
– Collaborate with the international e-Science
and Grid communities
– Work with UK industry to develop
industrial-strength Grid middleware
Core Programme 2
1.
2.
3.
4.
5.
6.
6 Key Activities for Second Phase
UK e-Science Grid/Centres and e-Science
Institute
Grid Support Centre and Network Monitoring
Core Middleware engineering
National Data Curation Centre
e-Science Exemplars/New Opportunities
Outreach and International involvement
Core Grid Middleware
•
Need to develop open source, open standard
compliant, Grid Middleware stack that will
integrate and federate with industrial solutions
• Software Engineering focus as well as R&D
Aim is to produce robust, well-documented,
re-usable software that is maintainable and
can evolve to embrace emerging Grid
Service standards
 Major focus of Core Programme 2
National Data Curation Centre
•
In next 5 years e-Science projects will produce
more scientific data than has been collected in the
whole of human history
• In 20 years can guarantee that the operating and
spreadsheet program and the hardware used to
store data will not exist
 Need to research and develop technologies and
best practice for curating digital data
 Need to liaise closely with individual research
communities and data archive centres
Director General
OST
HPC
Centres
Research Council
Pilots
CCLRC
Projects
e-Science
Operations
Committee
e-Science
EPsrc/DTI
Steering
Finance
Committee
Grid
Support
Team
4 IRC
+Projects
DIRECTOR
CORE PROGRAMME
Deputy Director
Technical Advisory
Group
International
Grid
Network
Team
Reports
9 Grid
Demos
National
Centre
Programme
Open
Call Projects
NHSNet
NERC
DTI
“HEFCE”
Keyworth
Dir Gen
MRC
Pilots
Outreach
OST SR2002
Pilots
Hinxton
BBsrc
Web sites
Bid
HPC
Pilots
Publicity
e-Science
Esrc
Centres
4 IRCs
EPsrc/DTI
Information
Steering
Pilots
5
Projects
Finance
Committee
CCLRC
9 Grid
e-Science
e-Science
Projects
Demos
Institute
Operations
EPsrc
8 Regional
Director
Committee
Pilots
Centres
Core Programme
National
£20M of
Centre
PPARC
Deputy Director
Grid
50 Projects
Pilots
Technical Advisory
CCLRC Support
CCLRC
Open
Open
Group
RAL
&
DL
RAL
&
DL
Team
CERN
CallProjects
Projects
Call
ICT
ICT
Grid
Grid
Suppliers
Suppliers
GEANT
GEANT
Reports
Grid
Reports
International
Network
Grid
Network
USERS
EU
Gridnet
Team
Team
Security USERS
Gridnet
Security
Framework
Grid
Grid
Taskforce
UKERNA
Taskforce
UKERNA
Projects
Fellowships
Fellowships
Architecture
Architecture
JISC
JISC
Other
Other
Taskforce
Taskforce
Data Base
Base
International
International
Data
Taskforce
Projects
Network
Projects
Network
Taskforce
Monitoring
Global
GlobalGrid
Grid
Monitoring
Forum
Forum
USUS
Players
Players
NHSNet
NERC
DTI
“HEFCE”
Keyworth
Dir Gen
MRC
Pilots
Outreach
IBM
Qinetiq
OST
Pilots
Hinxton
SR2002
BBsrc
Web sites
Microsoft
Data Systs
Bid
HPC
Pilots
Sun
Roche
Publicity
e-Science
Esrc
Centres
4 IRCs
EPsrc/DTI
Logica
BMT
Information
Steering
Pilots
5
Projects
Finance
SGI
CCDC
Committee
CCLRC
9 Grid
e-Science
BAE Systems Fujitsu
e-Science
Projects
Demos
Institute
Operations Rolls Royce Met Office
EPsrc
8 Regional
CFS Cons
Committee Welcome Director
Pilots
Centres
Compaq
BP
Core Programme
National
Oracle
Pallas
£20M of
Centre
PPARC
Deputy Director
AVS
Grid Platform
50 Projects
Pilots
Technical Advisory
CCLRC SupportAvaki
RTZ
Open
Group
RAL
&
DL
Entropia
Epistemics
Team
CERN
Call Projects
ICT
HP
Fluent
Industry
Grid
Suppliers
ABB
BNFL
GEANT
Reports
Grid
& Commerce
International
Network
Bayer
Delta Dot
USERS
EU
Gridnet
Team
Security
Intel
RVCO ltd
Framework
Grid Pfizer
Taskforce
Infosense
UKERNA
Projects
Fellowships
NAG
Merck
Architecture
Avantium JISC
AstraZeneca
Other
Taskforce
GSK
Unilever Data Base
International
Taskforce
Network
Projects
Technical
Monitoring
Global Grid
Advisory Group
Forum
US
Players
NHSNet
NERC
DTI
“HEFCE”
Keyworth
Dir Gen
MRC
Pilots
Outreach
IBM
Qinetiq
OST
IBM
Pilots
Hinxton
SR2002
BBsrc
Web sites
Microsoft
Data Systs
Bid
HPC
Pilots
Sun
Roche
Publicity
e-Science
Esrc
Centres
4 IRCs
EPsrc/DTI
Logica
BMT
Information
Steering
Pilots
5
Projects
Finance
SGI
CCDC
Committee
CCLRC
9 Grid
e-Science
BAE Systems Fujitsu
e-Science
Projects
Demos
Institute
Operations Rolls Royce Met Office
EPsrc
8 Regional
CFS Cons
Committee Welcome Director
Pilots
Centres
Compaq
BP
Core Programme
National
Oracle
Pallas
£20M of
Centre
PPARC
Deputy Director
AVS
Grid Platform
50 Projects
Pilots
Technical Advisory
CCLRC SupportAvaki
RTZ
Open
Group
RAL
&
DL
Entropia
Epistemics
Team
CERN
Call Projects
ICT
HP
Fluent
Industry
Grid
Suppliers
ABB
BNFL
GEANT
Reports
Grid
& Commerce
International
Network
Bayer
Delta Dot
USERS
EU
Gridnet
Team
Security
Intel
RVCO ltd
Framework
Grid Pfizer
Taskforce
Infosense
UKERNA
Projects
Fellowships
NAG
Merck
Architecture
Avantium JISC
AstraZeneca
Other
Taskforce
GSK
Unilever Data Base
International
Taskforce
Network
Projects
Technical
Monitoring
Global Grid
Advisory Group
Forum
US
Players
NHSNet
NERC
DTI
“HEFCE”
Keyworth
Dir Gen
MRC
Pilots
Outreach
OST SR2002
Pilots
Hinxton
USA
BBsrc
Web sites
Bid
France
HPC
Pilots
Publicity
e-Science
Esrc
Germany
Centres
4 IRCs
EPsrc/DTI
Information
Steering
Pilots
Brazil
5
Projects
Finance
Committee
CCLRC
9 Grid
e-Science
Holland
e-Science
Projects
Demos
Institute
Japan
Operations
China
EPsrc
8 Regional
Director
Committee
Pilots
Centres
Core Italy
Programme
National
Scandinavia
£20M of
Centre
PPARC
Deputy
Director
Australia
Grid
50 Projects
Pilots
Technical
Advisory
Switzerland
CCLRC Support
Open
Group
Austria
RAL
&
DL
Team
CERN
Call Projects
Singapore
ICT
Grid
Belgium
Suppliers
GEANT
Reports
Grid
International
Network
Canada
USERS
EU
Gridnet
Team
Ireland
Security
Framework
Grid
Poland
Taskforce
UKERNA
Projects
Fellowships
Spain
Architecture
SouthJISC
Other
Taskforce
America
Data Base
International
Taskforce
Network
Projects
Monitoring
Global Grid
Forum
US
Players
NHSNet
NERC
DTI
“HEFCE”
Keyworth
Dir Gen
MRC
Pilots
Outreach
OST SR2002
Pilots
Hinxton
BBsrc
Web sites
Bid
HPC
Pilots
Publicity
e-Science
Esrc
Centres
4 IRCs
EPsrc/DTI
Information
Steering
Pilots
5
Projects
Finance
Committee
CCLRC
9 Grid
e-Science
e-Science
Projects
Demos
Institute
Operations
EPsrc
8 Regional
Director
Committee
Pilots
Centres
Core Programme
National
£20M of
Centre
PPARC
Deputy Director
Grid
50 Projects
Pilots
Technical Advisory
CCLRC Support
Open
Group
RAL
&
DL
Team
CERN
Call Projects
ICT
Grid
Suppliers
GEANT
Reports
Grid
International
Network
USERS
EU
Gridnet
Team
Security
Framework
Grid
Taskforce
UKERNA
Projects
Fellowships
Architecture
JISC
Other
Taskforce
Data Base
International
Taskforce
Network
Projects
Monitoring
Global Grid
Forum
US
Players
A viable Core Programme must
have this scope and an
infrastructure to support it!
e-Science and the Grid
‘e-Science will change the dynamic of the way
science is undertaken.’
John Taylor, 2001
 Need to convince university IT Directors!
e-Government and the Grid
‘[The Grid] intends to make access to
computing power, scientific data repositories
and experimental facilities as easy as the
Web makes access to information.’
Tony Blair, 2002
Download