The Grid: Abstract

advertisement
2
Abstract
The Grid:
Reality, Technology, and
Applications
The Grid seems to be everywhere, with announcements of
sales from major computer vendors, deployment in a wide
range of application spaces, and many national and
international scale infrastructure deployment. However, in
spite of the popularity of the term, there is often confusion
as to what the Grid is and what problems it solves. Is there
any "there there" or is it all just marketing hype?
In this talk, I will address these questions, describing what
the Grid is, what problems it solves, and what technology
has been developed to build Grid infrastructure and create
Grid applications. I will review the current status of Grid
infrastructure and deployment and give examples of where
Grid technology is being used not only to perform current
tasks better, but to provide fundamentally new types of
capabilities that are not possible otherwise.
Ian Foster
Argonne National Laboratory
University of Chicago
Globus Alliance
Univa Corporation
3
4
Overview
Overview
?
Grid motivation and definition
?
Grid motivation and definition
?
The need for standards
?
The need for standards
?
The power of open source
?
The power of open source
?
Grid in practice
?
Grid in practice
?
Summary
?
Summary
5
Why the Grid?
Origins: Revolution in Science
The Grid
Enable “coordinated resource sharing
& problem solving in dynamic, multiinstitutional virtual organizations.”
?
Pre -Internet
?
(Source: “The Anatomy of the Grid ”)
?
Access to shared resources
? Virtualization, allocation, management
?
?
With predicable behaviors
? Provisioning, quality of service
In dynamic, heterogeneous environments
? Standards - based interfaces and protocols
?
6
Theorize &/or experiment, alone
or in small teams; publish paper
Post-Internet
?
Construct and mine large databases of
observational or simulation data
?
Develop simulations & analyses
?
Access specialized devices remotely
?
Exchange information within
distributed multidisciplinary teams
1
Open Science Grid :
Astronomy Mosaic Construction
Origins in Science
Engage via telepresence in an
experiment at a remote facility
8
Construct custom mosaics on demand from multiple data sources
User specifies projection, coordinates, size, rotation, spatial
sampling
Discover & access a genome analysis
service (running on high-end computer)
Integrate data from multiple sources
in support of global change research
Harness computers across
sites to process data from
a physics experiment
NVO/NASA Montage: A small (1200 node) workflow
Work by Ewa Deelman et al., USC/ISI and Caltech
9
Why the Grid?
New Driver: Revolution in Business
?
?
?
?
Pre -Internet
?
?
?
?
Enterprise computing is highly distributed,
heterogeneous, inter- enterprise (B2B)
?
Business processes increasingly
computing - & data- rich
?
?
Growing complexity & need for
more efficient management
CRM: Workforce
Management (WFM)
A
1
1
2
7
1
2
?
9
Use Globus to discover
& provision resources
9 Servers Each
Strategy B:
Improve
Performance
1
2
1
14
2
IPC
SAP AG R/3 Internet Pricing
& Configurator (IPC)
12
Resource allocation & management
?
Application- driven provisioning
Service -sharing federation and management
Access to programs, hardware, data, services
across departmental boundaries
? Deliver: enhanced performance &/or lower
total cost of ownership
B
1
9
IPC
Delegation of
Dispatcher Request
Response: PricelistServer
Depending on:
- Time
- Discount
- Number of Items
-… 3
?
?
A
1
IPC
Server
2
Common virtualization/provisioning interfaces
1
B
Request:
Price Query
Bridging the Gap Between
Application & Infrastructure
?
B
1
(typically several thousand requests)
Adjust to varying
demand & resources
11 Servers
Strategy A:
Reduce TCO
Web Browsers / Batch Processes
SCM: Advanced Planner
& Optimizer (APO)
11
Cheaper or Faster ...
A
CRM: Internet Pricing Configurator (IPC)
Applications modified to:
?
Outsourcing becomes feasible ?
service providers of various sorts
18 Servers
3 Globus-enabled applns:
?
Central data processing facility
Post-Internet
?
10
eBusiness Use of Grid:
SAP Demonstration @ TechEd
2
?
E.g., large scale data management
?
O n-demand resource allocation
16 Servers Each
2
13
Avoiding Silos & Vendor Lock-In:
Open Standards & Software
Overview
?
Grid motivation and definition
?
The need for standards
?
The power of open source
?
?
?
Grid in practice
Summary
?
Standardized & interoperable mechanisms
for secure & reliable:
?
Authentication, authorization, policy, …
?
Representation & management of state
?
Initiation & management of computation
?
Data access & movement
?
Communication & notification
Good quality open source implementations
to accelerate adoption & development
?
15
?
?
Web services provide
?
Interface definition & service invocation
?
Nice semantics & extensibility
?
First steps towards security, workflow, etc.
?
Considerable commercial adoption
A silver bullet
?
A complete solution
?
Fixed in stone
Web Services
and Stateful Resources
?
?
“State” appears in almost all applications
?
Data in a purchase order
?
Current usage agreement for resources
?
Metrics associated with work load on a server
Web services can model, access and manage
state in many different ways
?
Ad- hoc, per- application approaches
?
WSRF proposes a standard approach
Modeling Stateful Resources with Web Services,
Foster, Frey, Graham, et al. www.globus.org/wsrf , 2004.
16
Applications of the framework
(Compute, network, storage provisioning,
job reservation & submission, data management,
application service QoS, …)
WS-Agreement
(Agreement negotiation)
But are not
?
E.g., Globus Toolkit
Grid Requires that We
Define WS-Based Frameworks
Web Services as a Foundation
14
WS Distributed Management
(Lifecycle, monitoring, …)
WS-Resource Framework & WS-Notification
(Resource identity, lifetime, inspection, subscription, …)
Web services
(WSDL, SOAP, WS-Security, WS-ReliableMessaging, …)
17
18
WSRF & WS-Notification
?
Naming and bindings (basis for virtualization)
?
?
Lifecycle (basis for fault resilient state mgmt)
?
?
?
Every resource can be uniquely referenced, and has one or
more associated services for interacting with it
Resources created by services following factory pattern
Resources destroyed immediately or scheduled
Information model (basis for monitoring, discovery)
?
?
?
Resource properties associated with resources
Operations for querying and setting this info
Asynchronous notification of changes to properties
?
Service groups (basis for registries, collective svcs )
?
Base Fault type
?
Group membership rules & membership management
3
19
Bringing it All Together
WS Distributed Management
?
Scenario: Resource management & scheduling
OASIS Web Services Distributed Management
(WSDM) technical committee
?
?
?
20
Management using Web Services
HP submitted its Web Services Management
Framework (WSMF) to WSDM in July 2003
Notification
Framework that builds on WSRF to provide
(among other things):
?
Service lifecycle (state) models
?
Service monitoring abstractions
?
Service relationships
Grid
Grid “Jobs” and “tasks
” Scheduler
is a
Other
are
kinds
alsoofmodeled
resources
using
are also
“modeled”
WS-Resources
as WS-Resources
andWeb Service
Grid
Service
Scheduler
Resource Properties
Level
Local processor manager
J
J
is “front-ended” with
J
A Web service interface
Blades
R
R
R
WS-Resource used to
A
Service
“model”Level
physical
Agreement
processor resources
is modeled as a
Network
Storage
WS-Resource
WS-Notification
can be
used to “inform ” the Lifetime of SLA
R
scheduler
R
R Properties
when
R R
WS-Resource
Resource
tiedRto
processor
utilization
“project”
processor
status the duration of
changes
(like utilization)
the agreement
B
M
I
I B M
21
22
Overview
?
Grid motivation and definition
?
The need for standards
?
The power of open source
?
Grid in practice
?
Summary
Globus Toolkit Main Components
?
Core Web services
?
Security
?
Execution management
?
Data management
?
Monitoring
?
?
?
?
?
Infrastructure for building new services
Apply uniform policy across distinct systems
Provision, deploy, & manage services
Discover, transfer, & access large data
Discover & monitor dynamic services
Globus Toolkit: Open Source Grid S/W
CAS
OGSA-DAI
[Tech Preview]
WS
Authentication
Authorization
Reliable
File
Transfer
(RFT)
Grid
Resource
Allocation Mgmt
(WS GRAM)
Monitoring
& Discovery
System
(MDS4)
Java WS Core
Pre-WS
Authentication
Authorization
GridFTP
Grid
Resource
Allocation Mgmt
(Pre-WS GRAM)
Monitoring
& Discovery
System
(MDS2)
C Common
Libraries
CLIENT
Non- WS
Your
Your
CC
Client
Client
Your
Your
Java
Java
Client
Client
Your
Your
Python
Python
Client
Client
Interoperable
WS-I-compliant
SOAP messaging
Your
Your
Python
Python
Client
Client
X.509 credentials =
common authentication
Execution
Management
Information
Services
Common
Runtime
SERVER
Java Services in Apache Axis
Plus GT Libraries and Handlers
pyGlobus
WS Core
C WS
Core
Python hosting,
GT Libraries
Pre-W S M D S
Your
C
Service
RLS
Your
Python
Service
Pre-WS GRAM
Your
Your
Java
Java
Service
Service
MyProxy
XIO
Credential
Management
Data
Management
Your
Your
CC
Client
Client
Components
Replica
Location
Service
(RLS)
Security
Your
Your
Java
Java
Client
Client
GridFTP
G
T
4
Web
Services
Components
SimpleCA
G
T
3
GT4 Components
C WS Core
CAS
OGSA-DAI
GTCP
G
T
2
Python WS Core
[contribution]
Index
Trigger
Archiver
G
T
3
Community
Scheduler
Framework
[contribution]
Delegation
Service
GRAM
RFT
Delegation
G
T
4
24
C Services using GT
Libraries and Handlers
4
25
26
?
Control access to shared services across
administrative domains
?
?
Local policy authorities rule
Compute
Center
UK eScience Grid
NEESgrid
TeraGrid
BIRN Biomedical Grid
Earth System Grid
Access Grid
Fusion Grid
Rights ’’
CAS or VOMS
issuing SAML
or X.509 ACs
VO
Users
Rights
Users able to set up dynamic trust domains
?
MPICH-G2
on user’s behalf)
Access
Multi-user collaborations
Federate through mutually trusted services
28
SSL/WS-Security
with Proxy
Services (running
Certificates
Authz Callout
Driven by autonomous management, e.g.,
different policy in different work- groups
?
IBM Grid Toolbox
GT4 Grid Security:
Standards-Based Delegation
Grid Security
?
…
Globus Toolkit
WSDL, SOAP, WS-Security
27
?
EU DataGrid
WS-Addressing, WSRF,
WS-Notification
Butterfly Grid
Custom
GT4
WSRF Web WSRF Web
Services
Services
Virtual Data Toolkit
Custom
Web
Services
Registry
Administration
GT4 Container
User Applications
NSF Middleware Init.
GT-Based Grid Tools & Solutions
Platform Globus
Application Development with GT4
Personal collection of resources working
together based on trust of user
Local Policy
on VO identity
or attribute
authority
MyProxy
VO
Rights ’
29
KCA
30
GT4’s Use of Security Standards
Overview
?
Grid motivation and definition
?
The need for standards
?
The power of open source
?
Grid in practice
?
Summary
5
31
32
A Typical eScience Use of Globus
How it Really Happens
Web
Browser
Compute
Server
Simulation
Tool
Web
Portal
Compute
Server
Registration
Service
Data
Viewer
Tool
Chat
Tool
Credential
Repository
Camera
Telepresence
Monitor
Camera
Database
service
Data
Catalog
Database
service
Database
service
Certificate
authority
Links instruments, data,
computers, people
Users work
with client
applications
Application services
organize VOs & enable
access to other services
Collective services
aggregate &/or
virtualize resources
Resources implement
standard access &
management interfaces
33
34
How it Really Happens
(without Globus)
Web
Browser
Application
Developer
Simulation
Tool
Web
Portal
13
Globus
Toolkit
0
Grid
Community
0
Data
Viewer
Tool
Compute
Server
B
Compute
Server
Chat
Tool
Credential
Repository
Application services
organize VOs & enable
access to other services
Web
Browser
GRAM
Globus
GRAM
Globus Index
Service
CHEF
C
Database
service
D
Database
service
E
Database
service
Application
Developer
2
Off the
Shelf
9
Globus
Toolkit
4
Grid
Community
4
Data
Viewer
Tool
CHEF Chat
Teamlet
MyProxy
Telepresence
Monitor
Compute
Server
Users work
with client
applications
35
Application services
organize VOs & enable
access to other services
Camera
Globus
DAI
Globus
MCS/RLS
Globus
DAI
Globus
Certificate
Authority
Resources implement
standard access &
management interfaces
University of Texas
Compute
Server
Camera
Camera
Data
Catalog
Collective services
aggregate &/or
virtualize resources
Globus
Simulation
Tool
Camera
Telepresence
Monitor
Certificate
authority
Users work
with client
applications
A
Registration
Service
9
Off the
Shelf
How it Really Happens
(with Globus)
DAI
Collective services
aggregate &/or
virtualize resources
Database
service
Database
service
Database
service
Resources implement
standard access &
management interfaces
Global
Community
6
37
38
Early Commercial Applications
Novartis
Grid Services Market Opportunity 2005
Leading adopters (Oct 2003) *
• Financial services: 31%
• Life sciences: 26%
• Manufacturing: 18%
PC Grid links 3,700
desktop systems
Manufacturing
Financial
Services
Energy
Derivatives
Analysis
Seismic
Analysis
Statistical
Analysis
Reservoir
Analysis
Portfolio
Risk
Analysis
?
Mechanical/
Electronic
Design
LS /
Bioinformatics
Process
Simulation
Cancer
Research
Entertainment
Finite
Element
Analysis
Drug
Discovery
Digital
Rendering
Protein
Folding
Failure
Analysis
Other
Massive
Multi-Player
Games
Protein
Sequencing
Streaming
Media
?
Web
Applications
Weather
Analysis
Code
Breaking/
Simulation
?
Academic
?
“Gridified ” Infrastructure
Research & development
applications
Potentially mainstream
business computing
> 5 teraflop/s computing
power
“We have projects we
calculate would take 6
years on a single
supercomputer.
Today, the run time is
12 hours.”
Estimate savings of
$200M over 3 years
Peter Sany,
Novartis CIO
Sources: IDC, 2000 and Bear Stearns- Internet 3.0 - 5/01 Analysis by SAI
*Grids 2004: From Rocket Science To Business Service, The 451 Group
Globus Consortium
(www.globusconsortium.com)
?
39
Acknowledgments
?
“The Globus Consortium is comprised of
global computing leaders who support the
Globus Toolkit for use in enterprise data
centers. Through the Globus Consortium,
vendors of hardware, software and their
customers can work together to accelerate
use of the Globus Toolkit in the enterprise
by hardening features and speaking with
one voice on emerging Grid standards.”
?
?
?
?
?
HP, IBM, Intel, Sun, Nortel, Univa are
founding members
?
Carl Kesselman and Steve Tuecke, my long time Globus co-conspirators
Gregor von Laszewski, Kate Keahey, Jennifer
Schopf, Mike Wilde, other Argonne colleagues
Globus Alliance members at Argonne,
U.Chicago , USC/ISI, Edinburgh, PDC
Miron Livny, U.Wisconsin Condor project, Rick
Stevens, Argonne & U.Chicago
Other partners in Grid technology, application,
& infrastructure projects
DOE, NSF, NASA, IBM for generous support
41
42
Overview
?
Grid motivation and definition
?
The need for standards
?
The power of open source
?
Grid in practice
?
Summary
Summary
?
A significant opportunity
?
Federate resources to increase capability
&/or reduce cost
?
Requires new technology
?
Open standards & software are important
?
We ’re perhaps at an inflection point
?
?
Bridge application- infrastructure gap
To address needs without vendor lock in
?
Major adoption for eScience
?
Growing adoption within industry
7
43
For More Information
?
Globus Alliance
?
?
?
?
www.ggf.org
Background information
?
?
www.globusconsortium.com
Global Grid Forum
?
?
www.globus.org
Globus Consortium
www.mcs.anl.gov/~foster
GlobusWORLD 2005
?
Feb 7-11, Boston
2nd Edition
www.mkp.com/grid2
8
Download