e-Science and the David De Roure University of Southampton

advertisement
e-Science
and the
David De Roure
University of Southampton
Outline
1.
2.
e-Science and e-Research
Enabling Technologies


3.
4.
July 2004
Grid
Semantic Web
Semantic Grid
Building Bridges
IAAI Panel
2
Vision: e-Science
e-Science is about global collaboration in
key areas of science and the next
generation of [computing] infrastructure
that will enable it. e-Science will change
the dynamic of the way science is
undertaken
John Taylor, Director General of UK Research Councils
July 2004
IAAI Panel
3
Vision: e-Science
‘[The Grid] intends to make access to
computing power, scientific data
repositories and experimental facilities as
easy as the Web makes access to
information.’
Tony Blair, 2002
July 2004
IAAI Panel
4
UK funding context
Particle Physics and
Astronomy
Engineering and
Physical Sciences
Natural Environment
Economic and Social
Medical
Biotechnology and
Biological Sciences
CCLRC
(Arts and Humanities)
July 2004
Dept of Trade
and Industry
Companies
University
R&D
European
Commission
Research Councils
Joint Information
Systems Committee
IAAI Panel
5
UK e-Science Funding
First Phase:
2001 –2004
 Application Projects
 £74M
 All areas of science
and engineering
 Core Programme
 £15M Research
infrastructure
 £20M Collaborative
industrial projects
Second Phase:
2003 –2006
 Application Projects
 £96M
 All areas of science
and engineering
 Core Programme
 £16M Research
Infrastructure
 £10M DTI Technology
Fund
Across all areas
Application-led
Core program
July 2004
IAAI Panel
6
e-Science Core Program
Four major functions:

Assist development of essential, wellengineered, generic, Grid middleware

Provide necessary infrastructure
support for UK e-Science Research
Council projects

Collaborate with the international
e-Science and Grid communities

Work with UK industry to develop
industrial-strength Grid middleware
July 2004
IAAI Panel
7
myGrid pilot project





Bioinformatics
Imminent ‘deluge’ of
data
Highly heterogeneous
Highly complex and
inter-related
Convergence of data
and literature archives
July 2004
IAAI Panel
8
Combe Chem pilot project
Video
Simulation
Diffractometer
Properties
Analysis
Structures
Database
X-Ray
e-Lab
Properties
e-Lab
Grid Middleware
July 2004
IAAI Panel
9
UK e-Science Grid
Edinburgh
Glasgow
DL
Belfast
Newcastle
Manchester
Cambridge
Oxford
Cardiff
RAL
London
Hinxton
Southampton
July 2004
IAAI Panel
10
UK e-Science: Phase 2
Three major new activities:
1. National Grid Service and Grid Operation
Centre
2. Open Middleware Infrastructure Institute
for testing, software engineering and UK
repository
3. Digital Curation Centre to look at longterm data preservation issues
July 2004
IAAI Panel
11
Grid Operation Support Centre
 Deploy production ‘National Grid Service’ based
on four dedicated compute and data nodes plus
two UK Supercomputers
 Develop operational policies, security, …
 Gain experience with genuine users
 Develop Web Services based e-Science Grid
 Work with EU EGEE project, the NSF
Cyberinfrastructure Program and A-P Grid
activities
July 2004
IAAI Panel
12
Open Middleware Infrastructure Institute







July 2004
Repository for UK-developed Open Source
‘e-Science/Cyber-infrastructure’ Middleware
Documentation, specification, QA and standards
Fund work to bring ‘research project’ software
up to ‘production strength’
Fund Middleware projects for identified ‘gaps’
Work with US NSF, EU Projects and others
Supported by major IT companies
Southampton selected as the OMII site
IAAI Panel
13
Digital Curation Centre
In next 5 years e-Science projects will produce
more scientific data than has been collected in
the whole of human history
In 20 years can guarantee that the OS and
spreadsheet program and the hardware used to
store data will not exist





July 2004
Research curation technologies and best practice
Need to liaise closely with individual research
communities, data archives and libraries
Edinburgh with Glasgow, CLRC and UKOLN
selected as site of DCC
IAAI Panel
14
Typical Science Grid
Service such as Research
Database or simulation
Campus or
Enterprise
Administrative
Grid
Learning Management
Grid
Science Grids
Bioinformatics
Earth Science …….
Transformed by Grid Filter
to form suitable for education
Publisher
Grid
Education Grid
Digital
Library
Grid
Teacher Educator
Grids
Student/Parent …
Community Grid
Informal
Education
(Museum)
Grid
Education as a Grid of Grids (thanks to Geoffrey Fox)
July 2004
IAAI Panel
15
Vision: e-Research

Not just new Science








e-Social Science
e-Humanities
e-Arts
e-Research
e-Business
e-Anything
…



And new disciplines!
July 2004
IAAI Panel
Researchers working in all
disciplines are faced daily with a
wide variety of tasks necessary
to sustain and progress their
research activity
These involve the analytical
aspects of their work, access to
resources, collaboration with
fellow researchers, and project
management and admin
These tasks rapidly increase in
scale and complexity as
collaborations grow larger,
become more geographically
distributed and involve a wider
range of disciplines
JISC
16
Vision: HASTAC
Humanities, Arts, Science and Technology
Advanced Collaboratory
HASTAC is an international, interdisciplinary
consortium which seeks to create, develop, advance
and utilize a broad range of leading computing and
information systems while contributing to an
understanding of the interconnections between the
human sciences, natural sciences, arts, and
technology in a complex global society
July 2004
IAAI Panel
17
Vision: Joining up

These visions are all about joining resources
and people together in new ways in order
to create new things





Researchers can focus on the real research
The research process is accelerated
New research results are possible
New research areas are possible
NB s/research/business/
July 2004
IAAI Panel
19
Vision: The Grid
July 2004
IAAI Panel
Courtesy of Ian Foster
20
Vision: The Grid
Grid computing has emerged as an important new
field, distinguished from conventional distributed
computing by its focus on large-scale resource
sharing, innovative applications, and, in some cases,
high-performance orientation...we [define] the "Grid
problem”…as flexible, secure, coordinated resource
sharing among dynamic collections of individuals,
institutions, and resources - what we refer to as
virtual organizations
From "The Anatomy of the Grid: Enabling Scalable Virtual
Organizations" by Foster, Kesselman and Tuecke
July 2004
IAAI Panel
21
Challenges: Unanticipated Re-use

myGrid
Wish to reuse
 Data
 Services
 Software
 Knowledge
Combechem
July 2004
IAAI Panel
22
Outline
1.
2.
The e-Vision and its challenges
Enabling Technologies


3.
4.
July 2004
Grid
Semantic Web
Semantic Grid
Building Bridges
IAAI Panel
26
Two infrastructure enablers
Grid
Computing



Semantic
Web
On demand transparently
constructed multiorganisational federations
of distributed services
Distributed computing
middleware
Computational Integration
July 2004



IAAI Panel
An automatically
processable, machine
understandable web
Distributed knowledge and
information management
Information integration
27
July 2004
IAAI Panel
28
Five Myths busted!
1.
2.
Isn’t it just for Physics?

No – Grids for Life Science and Medicine will
dominate Grid applications

Think of the range and scale of data and the
community!
Isn’t it just High Performance computing?

No – it’s a generic mechanism for forming,
managing and disbanding dynamic federations of
services

Data integration, data access, data transport will
dominate

Application integration is the key
July 2004
IAAI Panel
29
Five Myths busted!
3.
4.
5.
Isn’t it just a bag of protocols glued together?

No – the Open Grid Service Architecture gives a
well specified middleware stack built on industry
standard web services
Isn’t it just Globus toolkit?

No – that is one reference implementation.
Isn’t it just a bunch of academic physicists?

No –all the commercial vendors are making serious
investment. IBM DB2 and Oracle 10g will be gridcompliant
July 2004
IAAI Panel
30
Grid Services
Specific services: drug discovery
pipeline, sky surveys
Grid Applications
Open Grid Service Architecture
Web Service Resource Framework
Web Service-Notification
Web Services
July 2004
Standard services: agreement,
data access and integration,
workflow, security, policy,
brokering…
Standard interfaces and
behaviours for distributed systems:
naming, service state, lifetime
management, notification
Standard mechanisms for
describing and invoking services:
WSDL, SOAP, WS-Security etc
IAAI Panel
31
July 2004
IAAI Panel
32
Origins of the Semantic Web
The Semantic Web is an extension of
the current Web in which information
is given a well-defined meaning, better
enabling computers and people to
work in cooperation.
It is the idea of having data on the
Web defined and linked in a way that it
can be used for more effective
discovery, automation, integration and
reuse across various applications.
The Web can reach its full potential if
it becomes a place where data can be
processed by automated tools as well
as people.
W3C Activity Statement
July 2004
IAAI Panel
33
Layers of Languages
Attribution
Explanation
We are here!
Rules & Inference
Ontologies
Metadata annotations
Standard Syntax
Identity
July 2004
IAAI Panel
34
Resource Description Framework





Common model for
metadata
A graph of triples
Query over and link
together
RDQL, repositories,
integration tools,
presentation tools
The Network Effect
July 2004
IAAI Panel
Graphic courtesy of Tim Berners-Lee
35
OWL Web Ontology Language
DARPA
Agent Markup DAML
Language
EU/NSF Joint Ad hoc
Committee
The most popular ontology
language in the world ever!
A W3C
Recommendation
July 2004
OIL
Ontology
Inference
Layer
RDF
DAML+OIL
OWL
IAAI Panel
All
influenced
by RDF
OWL Lite (thesaurus)
OWL DL (reason-able)
OWL Full (anything goes)
36
5 More Myths Busted!
Isn’t it just AI and distributed agents (again)?
1.
No – It is primarily metadata integration and querying

Don’t you need all that reasoning stuff?
2.
No – A little bit of semantics goes a long way! (Hendler)

It only applies to the Web?
3.
No – the technologies are being used for Enterprise
integration, exposing data in a common model, common
ontology languages, representing terminologies.

One big ontology of everything never works!
4.
No – multiple ontologies; multiple everything!

One big Semantic Web!
5.

July 2004
No – lots of Semantic Web-lets, and expect it to break!
IAAI Panel
37
Outline
1.
2.
The e-Vision and its challenges
Enabling Technologies


3.
4.
July 2004
Grid
Semantic Web
Semantic Grid
Building Bridges
IAAI Panel
38
The Semantic Grid Report 2001


At this time, there are a number of grid
applications being developed and there is a whole
raft of computer technologies that provide
fragments of the necessary functionality.
However there is currently a major gap between
these endeavours and the vision of e-Science in
which there is a high degree of easy-to-use and
seamless automation and in which there are
flexible collaborations and computations on a
global scale.
www.semanticgrid.org
July 2004
IAAI Panel
39
Scale of
Interoperability
Semantic Grid
Semantic
Web
Semantic
Grid
Classical
Web
Classical
Grid
Scale of data and computation
July 2004
IAAI Panel
Based on an idea by Norman Paton
40
Semantics in and on the Grid
The Semantic Grid is
an extension of the
current Grid in which
information and
services are given
well-defined meaning,
better enabling
computers and people
to work in cooperation
July 2004
IAAI Panel
41
Underpinnings of e-Science
Grid
Computing
The
Semantic
Grid
The
Semantic
Web
Web Services
Contrast with…
July 2004
IAAI Panel
42
Knowledge Grid
July 2004
IAAI Panel
43
Grid Computing trajectory
There are SG
technologies
available today
for immediate
deployment
cost
Virtual organisations
with dynamic access to
unlimited resources
For all
Sharing of apps and know-how
With controlled set of
unknown clients
Sharing standard scientific
process and data, sharing
of common infrastructure
Between trusted partners
CPU intensive workload
Grid as a utility, data Grids,
robust infrastructure
Intra-company, intra community
e.g. Life Science Grid
CPU scavenging
time
July 2004
IAAI Panel
45
Semantics in e-Science





Ontology-aided
workflow
construction
RDF-based service
and data registries
RDF-based metadata
for experimental
components
RDF-based
provenance graphs
OWL based controlled
vocabularies for
database content
OWL based
integration
July 2004
IAAI Panel
RDF-based semantic
mark up of results,
logs, notes, data
entries
46
Engineering Design
Engineer
GEODISE
PORTAL
Knowledge
repository
Ontology for
Engineering,
Computation, &
Optimisation and
Design Search
Reliability
Security
QoS
Visualization
Session
database
Traceability
OPTIMISATION
Globus, Condor, SRB
OPTIONS
System
Optimisation
archive
APPLICATION
SERVICE
PROVIDER
Intelligent
Application
Manager
July 2004
CAD System
CADDS
IDEAS
ProE
CATIA, ICAD
COMPUTATION
Licenses
and code
Analysis
CFD
FEM
CEM
Design
archive
Parallel machines
Clusters
Internet Resource Providers
Pay-per-use
IAAI Panel
Intelligent
Resource
Provider
47
Ontologies for e-Science



User-oriented, scalable
environment for domain
experts to acquire,
develop and use
ontologies
Based on OilEd and
Protégé 2000
Transatlantic cooperation
on the development of
ontologies for e-Science
Universities Manchester and Southampton, UK
Stanford University, USA
July 2004
IAAI Panel
48
Collaboration tools
BuddySpace
Access Grid
Node
NetMeeting
awareness of
colleagues’ ‘presence’
virtual meetings
recovering information
from meetings
Replay
mapping real time
discussions/group
sensemaking
Compendium
enacting decisions/
coordinating activities
synthesising artifacts
I-X Tools
July 2004
IAAI Panel
49
NASA Scenario
1. Astronauts debrief on EVA
Compendium maps from trained
compendium astronaut
Mars
Video and
Science Data
Remote Science Team (RST) on
earth e.g. geologists
Plan for next
Day’s EVA
2. Virtual meeting of RST
using CoAKTinG tools
July 2004
IAAI Panel
50
Finding collaborators
Using scaleable triple store and AKT ontology
July 2004
IAAI Panel
51
GGF9 Semantic Grid Workshop








The Role of Concepts in myGrid Carole Goble
Planning and Metadata on the Computational Grid Jim
Blythe
Semantic support for Grid-Enabled Design Search in
Engineering Simon Cox
Knowledge Discovery and Ontology-based services on the
Grid Mario Cannataro
Attaching semantic annotations to service descriptions Luc
Moreau
Semantic Matching of Grid Resource Description
Frameworks John Brooke
Interoperability challenges in Grid for Industrial
Applications Mike Surridge
Semantic Grid and Pervasive Computing David De Roure
July 2004
IAAI Panel
52
GGF11 Semantic Grid Workshop



Engineering semantics: Costs and
Benefits Simon Cox
Designing Ontologies and
Distributed Resource Discovery
Services for an Earthquake
Simulation Grid Marlon Pierce
Exploring Williams-Beuren
Syndrome Using myGrid Carole

Schroeter


Goble



Distributed Data Management and
Integration Framework: The
Mobius Project Shannon Hastings
eBank UK - Linking Research Data,
Scholarly Communication and
Learning David De Roure
Using the Semantic Grid to Build
Bridges between Museums and
Indigenous Communities Ronald

Collaborative Tools in the Semantic
Grid David De Roure
The Integration of Peer-to-peer
and the Grid to Support Scientific
Collaboration
OWL-Based Resource Discovery for
Inter-Cluster Resource Borrowing
Hideki YOSHIDA

Semantic Annotation of
Computational Components Peter
Vanderbilt

Schroeter
July 2004
Using the Semantic Grid to Build
Bridges between Museums and
Indigenous Communities Ronald
IAAI Panel
Interoperability and
Transformability through Semantic
Annotation of a Job Description
Language Jeffrey Hau
53
E-Science Special Issue

IEEE Intelligent Issue Special Issue on
E-Science, Jan-Feb 2004


De Roure, Gil, Hendler
Challenges:




July 2004
Realizing the network effect
Moving beyond centralized stores
Automated assembly
Collaboration tools
IAAI Panel
54
Self-Organizing Semantic Grid
…Our self-organizing Semantic Grid is now
a constantly evolving organism, with
ongoing, autonomous processing rather
than on-demand processing. This evolving,
organic Grid can generate new processes
and new knowledge.
David De Roure, Trends and Controversies
IEEE Intelligent Systems, August 2003
July 2004
IAAI Panel
55
Outline
1.
2.
The e-Vision and its challenges
Enabling Technologies


3.
4.
July 2004
Grid
Semantic Web
Semantic Grid
Building Bridges
IAAI Panel
56
Building bridges
July 2004
IAAI Panel
57
Semantic
Pervasive
July 2004
IAAI Panel
Grid
58
Closing Remarks




The Semantic Grid is needed to realise the
Grid ambition and the e-Anything vision
Both Grid and Semantic Web are about
joining things up – building bridges
To create this infrastructure we also need to
build bridges – it needs the engagement of
multiple research communities
What can the Semantic Grid do for you, and
what can you do for the Semantic Grid?
July 2004
IAAI Panel
61
Contact



David De Roure
University of Southampton, UK
dder@ecs.soton.ac.uk
Carole Goble
University of Manchester, UK
carole@cs.man.ac.uk
See www.semanticgrid.org
July 2004
IAAI Panel
62
Acknowledgements
myGrid
July 2004
Combechem
IAAI Panel
63
Download