The Gridbus Toolkit

Grid Economics and Business Models:
A Gridbus Perspective
Grid User Meet, Bengaluru, India
Rajkumar Buyya
Grid and Distributed Systems (GRIDS) Laboratory
Dept. of Computer Science and Software Engineering
The University of Melbourne
Melbourne, Australia
www.gridbus.org
WW Grid
Outline
• Introduction
  – Utility Networks and Grid Computing
• Global Grids and Challenges
  – Security, resource management, pricing and service models
• Service Oriented Grids and Grid Economy
  – SOGA, Grid Market Directory, Grid Bank, Broker
• Grid Service Broker
  – Architecture, Design and Implementation
• Performance Evaluation: Experiments in Creation and Deployment of Applications on Global Grids
  – A Case Study in High Energy Physics
• Summary
4 Essential Utilities and
Delivery Networks
(1) Water
(2) Electricity
(3) Gas
(4) Telephone
(5) IT services as the fifth utility
(water, electricity, gas, telephone, IT)
eScience
eBusiness
eGovernment
eHealth
Multilingual
eEducation
…
Leading to Grid (computing) Paradigm:
Cyberinfrastructure for sharing resources
• Inspired by the Power Grid!
• A service-oriented/utility computing paradigm that enables seamless sharing of geographically distributed, autonomous resources (for PROFIT).*
* This was the original aim of building the Internet, although it ended up giving birth to email!
A Bird's-Eye View of the World-Wide Grid Environment
Grid Information Service
Grid Resource Broker
R2
R3
R5
Application
database
R4
RN
Grid Resource Broker
R6
Grid Information Service
R1
Resource Broker
Various Types of Grid Services
• Computational Services – CPU cycles (Computational Grid)
  – SETI@Home, NASA IPG, TeraGrid, I-Grid,…
• Data Services (Data Grid)
  – Data replication, management, secure access: LHC Grid/Napster
• Application Services (ASP Grid)
  – Access to remote software/libraries and license management: NetSolve
• Interaction Services (Interaction Grid)
  – eLearning, Virtual Tables, Group Communication (Access Grid), Gaming
• Knowledge Services (Knowledge Grid)
  – The way knowledge is acquired and managed: data mining.
• Utility Computing Services (Utility Grid)
  – Towards market-based Grid computing: leasing and delivering Grid services as ICT utilities.
Worldwide Grid Spending

After 2006, the business popularity of Grid computing is expected to accelerate exponentially:

Financial services and ERP services, in particular, are expected to account for a major part of the spending.
Source: Insight Research Corp.
Grid Challenges
Security
Computational Economy
Uniform Access
Resource Discovery
System Management
Data locality
Resource Allocation
& Scheduling
Application Construction
Network Management
Some Grid Initiatives Worldwide


Australia






Brazil



China
120million – 5 yrs




Europe

450million – 5 yrs
486million – 5 yrs

India
1.3 billion (Rs)


Japan

Nimrod-G
Gridbus
DISCWorld
GrangeNet.
27 million
APACGrid
ARC eResearch









UK eScience
EU Grids..
and many more...




Singapore
NGP


Global Grid Forum
Australian Grid Forum
Conferences:




1.3 billion – 3 yrs
IBM On Demand Computing
HP Adaptive Computing
Sun N1
Microsoft - .NET
Oracle 10g
Satyam – Grid Practice
Infosys, Wipro, TCS
StorageTek –Grid..
Public Forums

NAREGI
Globus
NASA IPG
AccessGrid
TeraGrid
Cyberinfrastructure
Industry Initiatives

ChinaGrid – Education
CNGrid - application
1 billion – 5 yrs
Korea...
N*Grid


OurGrid, EasyGrid
LNCC-Grid + many others
Garuda
USA
CCGrid
Grid
HPDC
E-Science
http://www.gridcomputing.com
2? billion
Grid (Market) Participant Types and
Application Category
free trading
Public
computing
(SETI@Home,
Alchemi, UD)
Private enterprises
(Satyam, IBM, Sun)
Sharing
Model
National provider
(K*Grid, TeraGrid,
Garuda/IndiaGrid,
UKGrid, AusGrid)
regulation
scientific
Application Category
commercial
mix-and-match (service)
Object-oriented
Internet/partial-P2P
Network enabled Solvers
Economic-based Utility /
Service-Oriented Computing
Nimrod-G
The Gridbus Project @ Melbourne:
Enable Leasing of ICT Services on Demand
Distributed Data
WWG
Gridbus
World Wide Grid!
On Demand Utility Computing
http://www.gridbus.org
The Gridbus Project @ GRIDS Lab, The University of Melbourne:
Toolkit for Creating and Deploying e-Research Applications on Utility Grids
• Gridbus is an “open source” Grid R&D project with a focus on Grid Economy, Utility Grids and Service Oriented Computing.
• Gridbus Middleware components include:
– Alchemi: .NET-based Enterprise Grid
– Grid Market Directory and Web Services
– Grid Bank: Accounting and Transaction
Management
– Visual Tools for Creation of Distributed
Applications
– Grid Service Broker and Scheduling
– Workflow Management Engine
– GridSim Toolkit
– Libra: SLA-based Resource Allocation
Distributed Data
Gridbus
Grid Economy: Methodology for Sustained Resource Sharing and Managing Supply-and-Demand for Resources
New challenges of Grid Economy
• Resource Owners
  – How do I decide prices? (economic models?)
  – How do I specify them?
  – How do I translate price to resource allocation?
  – How do I enforce them?
  – How do I advertise & attract consumers?
  – How do I do accounting and handle payments?
  – …..
• Resource Consumers
  – How do I decide expenses?
  – How do I express QoS requirements?
  – How do I trade between timeframe & cost?
  – How do I map jobs to resources to meet my QoS needs?
  – …..
They need mechanisms and technologies for value expression, value translation, and value enforcement.
Grid Entities and Architecture
Grid consumer
GSP site
scheduler
broker
accounting
GSP
global
scheduler
GSP site
scheduler
Resource
Provider
Resource
Provider
Market
Maker
End users
Private enterprises
National providers
A Reference Service-Oriented
Architecture for Utility Grids
Data Catalogue
Grid Bank
Programming
Environments
Applications
Sign-on
Info ?
Grid Explorer
Job
Control
Agent
Grid Market
Services
Health
Monitor
Grid Node N
Secure
Schedule Advisor
QoS
Grid Node1
Pricing
Algorithms
Trade Server
Trade Manager
Trading
…
Deployment Agent
JobExec
Grid Resource Broker
Misc. services
Resource Allocation
Storage
Grid Middleware
Services
Accounting
Resource
Reservation
R1
Grid Consumer
Information
Service
R2
…
Rm
Grid Service Providers
Gridbus and Complementary
Technologies – realizing Utility Grid
Science
Commerce
…
MPI
ExcellGrid
Grid Brokers:
Grid Economy
Alchemi
NorduGrid
Windows
Gridscape
Unicore
…
XGrid
JVM
Solaris
Collaboratories
Workflow
Workflow Engine
Nimrod-G
Globus
.NET
Engineering
Grid
Storage
Economy
Condor
Linux
…
IRIX
Libra
Core Grid
Middleware
Grid
Market
Directory
Tomcat
Mac
OSF1
GRIDSIM
Grid
Fabric
Software
Grid
Fabric
Hardware
CDB
PDB
User-Level
Middleware
(Grid Tools)
Gridbus Data Broker
SGE
AIX
X-Parameter Sweep Lang.
Grid
Exchange &
Federation
Grid
Bank
PBS
…
Grid
Applications
Portals
Worldwide Grid
On Demand Assembly of Services: Putting
Them All Together
Application Code
Explore
data
1
Visual Application Composer
10
2
Data Catalogue
5
6
Grid
Resource Broker
4
Grid Info Service
3
ASP Catalogue
9
12
Grid Market Directory
7
8
Grid Service (GS)
(Globus)
Alchemi
Bill
GS
CPU
or
PE
GSP
(e.g., IBM)
Cluster Scheduler
PE
GSP
(e.g., UofM)
PE
GTS
GSP
(e.g., VPAC)
11
Gridbus
GridBank
GSP
(Accounting Service)
Alchemi: .NET-based Enterprise Grid
Platform & Web Services
Alchemi Manager
Web
Services
Internet
Alchemi Users
Internet
• SETI@Home-like Model
• General Purpose
• Dedicated/Non-dedicated workers
• Role-based Security
• .NET and Web Services
• C# Implementation
• GridThread and Job Model Programming
• Easy to set up and use
• Widely in use!
Alchemi Worker Agents
Some Users of Alchemi
Tier Technologies, USA
Large scale document processing using Alchemi framework
Satyam Computers Applied Research Laboratory, India
Micro-array data processing using Alchemi framework
CSIRO, Australia
Natural Resource Modeling
The University of Sao Paulo, Brazil
The Alchemi Executor as a Windows Service
The Friedrich Miescher Institute (FMI) for Biomedical Research, Switzerland
Patterns of transcription factors in mammalian genes
stochastix GmbH, Germany
Asynchronous Excel Tasks using ManagedXLL and Alchemi .Net Grid
Computing framework.
Many users in Universities: See next for an example.
The Gridbus Grid Service
Broker for Data Grid
Applications
Builds on the Nimrod-G Computational
Grid Broker and Computational Economy
[Buyya, Abramson, Giddy, Monash
University, 1999-2001]
And
Extends its notion for Data and Service
Grids
Gridbus Broker Architecture
Gridbus Client
Gridbus Client
Gridbus Client
App, T, $, Opt
(Bag of Tasks Applications)
(Data Grid Scheduler)
Gridbus Farming Engine
Schedule Advisor
Trading Manager
Record
Keeper
Grid Dispatcher
Grid Explorer
Grid Middleware
TM
$
TS
GE
GIS, NWS
Grid Info Server
RM & TS
$
$
U
Data
Node
C
G
Globus enabled node.
G
L
Unicore enabled node.
A
RM: Local Resource Manager, TS: Trade Server
Alchemi enabled node.
Data
Catalog
Gridbus Services for eScience applications
• Application Development Environment:
  – XML-based language for composition of task farming (legacy) applications as parameter sweep applications.
  – Task Farming APIs for new applications.
  – Web APIs (e.g., Portlets) for Grid portal development.
  – Threads-based Programming Interface
  – Workflow interface and Gridbus-enabled workflow engine.
• Resource Allocation and Scheduling
  – Dynamic discovery of optimal computational and data nodes that meet user QoS requirements.
• Hide Low-Level Grid Middleware interfaces
  – Globus, Alchemi, Unicore, NorduGrid, XGrid, etc.
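To illustrate the task-farming idea of composing a legacy application as a parameter sweep, here is a minimal sketch in plain Python. It is not the Gridbus XML language or any Gridbus API; the function name `expand_sweep`, the command template and the parameter names are all hypothetical.

```python
from itertools import product


def expand_sweep(template, parameters):
    """Return one command string per combination of parameter values."""
    names = sorted(parameters)  # fixed order keeps the task list deterministic
    tasks = []
    for values in product(*(parameters[name] for name in names)):
        binding = dict(zip(names, values))
        tasks.append(template.format(**binding))
    return tasks


# Hypothetical legacy application, run once per parameter combination.
tasks = expand_sweep(
    "simulate --energy {energy} --seed {seed}",
    {"energy": [10, 20], "seed": [1, 2, 3]},
)
for task in tasks:
    print(task)
```

Each generated command is an independent task, which is what lets a broker farm the tasks out to different Grid nodes.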
Drug Design
Made Easy!
Economy-based Data Grid
Scheduling
High Energy Physics as eScience
Application Case Study
Case Study:
High Energy Physics and Data Grid

The Belle Experiment
• KEK B-Factory, Japan
• Investigating fundamental violation of symmetry in nature (Charge Parity), which may help explain the universal matter-antimatter imbalance.
• Collaboration: 400 people, 50 institutes
• 100s of TB of data currently
Australian Belle Data Grid Testbed
[Figure: testbed topology. Components shown: Certificate Authority, Virtual Organization, Replica Catalog, NWS NameServer and Grid Service Broker (takes the analysis request, returns the analysis results). Each compute site runs a Globus Gatekeeper, GRIS, NWS Sensor and GridFTP; the sites are linked over AARNET.]
Sites: GRIDS Lab, University of Melbourne; Dept. of Physics, University of Sydney; ANU, Canberra; VPAC, Melbourne; Dept. of Physics, University of Melbourne; Dept. of Computer Science, University of Adelaide.
Hardware: Dual Intel Xeon 2.8 GHz, 2 GB RAM (four nodes); Intel Pentium 2.0 GHz, 512 MB RAM (Dept. of Physics, University of Melbourne).
Case Study: Event Simulation and
Analysis
B0->D*+D*-Ks
• Simulation and Analysis Package - Belle Analysis Software Framework (BASF)
• Experiment in 2 parts: generation of simulated data and analysis of the distributed data
• The 100 data files to be analyzed (30 MB each) were distributed among the five nodes
Belle Data Grid (GSP CPU Service Price: G$/sec)
[Figure: the Belle Data Grid testbed topology annotated with each site's CPU service price.]
• Dept. of Physics, University of Melbourne: G$2
• ANU, Canberra: G$4
• Dept. of Physics, University of Sydney: G$4
• VPAC, Melbourne: G$6
• GRIDS Lab, University of Melbourne: NA
• Dept. of Computer Science, University of Adelaide: data node
Belle Data Grid (Bandwidth Price: G$/MB)
[Figure: the Belle Data Grid testbed topology with each network link labelled with its bandwidth price; the labelled prices lie between 30 and 38 G$/MB.]
Deploying Application Scenario
• A data grid scenario with 100 jobs, each accessing remote data of ~30 MB
• Deadline: 3 hrs.
• Budget: G$ 60K
• Scheduling Optimisation Scenario:
  – Minimise Time
  – Minimise Cost
Results:
SUMMARY OF EVALUATION RESULTS
Scheduling strategy   Total Time Taken (mins.)   Compute Cost (G$)   Data Cost (G$)   Total Cost (G$)
Cost Minimization     71.07                      26865               7560             34425
Time Minimization     48.5                       50938               7452             58390
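The figures in the summary of evaluation results can be cross-checked with a little arithmetic. The short sketch below uses only the numbers reported in the summary; the variable names are illustrative. It checks that compute cost plus data cost equals total cost for each strategy, then compares the two strategies.

```python
# Figures copied from the summary of evaluation results (time in minutes, costs in G$).
results = {
    "Cost Minimization": {"time": 71.07, "compute": 26865, "data": 7560, "total": 34425},
    "Time Minimization": {"time": 48.5, "compute": 50938, "data": 7452, "total": 58390},
}

# Total cost should equal compute cost plus data cost for each strategy.
for name, r in results.items():
    assert r["compute"] + r["data"] == r["total"], name

cost_opt = results["Cost Minimization"]
time_opt = results["Time Minimization"]

# Fraction of G$ saved by optimising for cost rather than time.
cost_saving = 1 - cost_opt["total"] / time_opt["total"]
# Fraction of extra time paid for that saving.
time_penalty = cost_opt["time"] / time_opt["time"] - 1

print(f"cost saving: {cost_saving:.1%}, extra time: {time_penalty:.1%}")
```

On these figures, cost minimization spends about 41% less than time minimization and takes about 46.5% longer. Both runs stay within the stated 3-hour deadline and G$ 60K budget.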
Time Minimization in Data Grids
[Chart: number of jobs completed (axis 0 to 80) against time (1 to 42 mins.), one series each for fleagle.ph.unimelb.edu.au, belle.anu.edu.au, belle.physics.usyd.edu.au and brecca-2.vpac.org.]
Results: Cost Minimization in Data Grids
[Chart: number of jobs completed (axis 0 to 100) against time (1 to 63 mins.), one series each for fleagle.ph.unimelb.edu.au, belle.anu.edu.au, belle.physics.usyd.edu.au and brecca-2.vpac.org.]
Observation
Organization                 Host                        Configuration                                         Cost (in G$/CPU-sec)                     Total Jobs Executed (Time)   Total Jobs Executed (Cost)
CS, UniMelb                  belle.cs.mu.oz.au           4 CPU, 2 GB RAM, 40 GB HD, Linux                      N.A. (not used as a compute resource)    --                           --
Physics, UniMelb             fleagle.ph.unimelb.edu.au   1 CPU, 512 MB RAM, 40 GB HD, Linux                    2                                        3                            94
CS, University of Adelaide   belle.cs.adelaide.edu.au    4 CPU (only 1 available), 2 GB RAM, 40 GB HD, Linux   N.A. (not used as a compute resource)    --                           --
ANU, Canberra                belle.anu.edu.au            4 CPU, 2 GB RAM, 40 GB HD, Linux                      4                                        2                            2
Dept of Physics, USyd        belle.physics.usyd.edu.au   4 CPU (only 1 available), 2 GB RAM, 40 GB HD, Linux   4                                        72                           2
VPAC, Melbourne              brecca-2.vpac.org           180 node cluster (only head node used), Linux         6                                        23                           2
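The per-node job counts can be checked against the 100-job scenario. The sketch below assumes that the "Time" and "Cost" sub-columns under "Total Jobs Executed" refer to the time-minimization and cost-minimization runs respectively; the dictionary layout and variable names are illustrative.

```python
# host -> (CPU price in G$/CPU-sec, jobs run under time minimization,
#          jobs run under cost minimization), read from the node details.
nodes = {
    "fleagle.ph.unimelb.edu.au": (2, 3, 94),
    "belle.anu.edu.au": (4, 2, 2),
    "belle.physics.usyd.edu.au": (4, 72, 2),
    "brecca-2.vpac.org": (6, 23, 2),
}

time_jobs = sum(t for _, t, _ in nodes.values())
cost_jobs = sum(c for _, _, c in nodes.values())

# Cheapest compute node and its share of jobs in the cost-minimization run.
cheapest = min(nodes, key=lambda host: nodes[host][0])
share_on_cheapest = nodes[cheapest][2] / cost_jobs

print(time_jobs, cost_jobs, cheapest, share_on_cheapest)
```

Under that reading, both runs account for all 100 jobs, and cost minimization places 94% of them on the cheapest node (G$2 per CPU-sec), while time minimization spreads work towards the G$4 and G$6 nodes.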
Grid and Gridbus Technologies for
Various Grid (Market) Types
free trading
Public
computing
(Alchemi)
Private
enterprises
(Libra, Gridbus,
Globus)
Sharing
Model
National provider
(Globus, Gridbus,..)
regulation
scientific
Application Category
commercial
Summary and Conclusion

• Grids exploit synergies that result from cooperation of autonomous entities:
  – Resource sharing, dynamic provisioning, and aggregation at global level.
• Grid Economy provides the incentive needed for sustained cooperation.
• Grid Network has the potential to serve as Cyberinfrastructure for Utility Computing.
• Grids offer enormous opportunities for realizing eScience and eBusiness at global level.
Any Questions ?
Gridbus Project - http://www.gridbus.org
Thanks for your attention!
The Gridbus Cooperation!
http://www.gridbus.com
Backup Slides
Deadline (D) and Budget (B) Constrained Scheduling Algorithms
Algorithm               Execution Time (D)     Execution Cost (B)                                  Compute Grid   Data Grid
Cost Opt                Limited by D           Minimize                                            Yes            Yes
Cost-Time Opt           Minimize if possible   Minimize                                            Yes
Time Opt                Minimize               Limited by B                                        Yes            Yes
Conservative-Time Opt   Minimize               Limited by B, jobs have guaranteed minimum budget   Yes
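As a rough illustration of the Cost Opt and Time Opt rows, the sketch below implements two simplified greedy heuristics. It is not the Nimrod-G or Gridbus broker algorithm: the `Resource` class, the flat per-job prices, the fixed per-job run times and the example numbers are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class Resource:
    name: str
    price_per_job: float    # G$ per job (hypothetical flat rate)
    minutes_per_job: float  # run time of one job (hypothetical)


def cost_optimise(resources, n_jobs, deadline):
    """Cost Opt: fill the cheapest resources first, limited by the deadline."""
    plan, remaining = {}, n_jobs
    for r in sorted(resources, key=lambda r: r.price_per_job):
        capacity = int(deadline // r.minutes_per_job)  # jobs it can finish in time
        take = min(capacity, remaining)
        if take:
            plan[r.name] = take
        remaining -= take
    if remaining:
        raise ValueError("deadline cannot be met")
    return plan


def time_optimise(resources, n_jobs, budget):
    """Time Opt: give each job to the earliest-finishing resource, limited by budget."""
    plan = {r.name: 0 for r in resources}
    spent = 0.0
    cheapest = min(r.price_per_job for r in resources)
    for done in range(n_jobs):
        reserve = (n_jobs - done - 1) * cheapest  # keep enough for the jobs still to come
        affordable = [r for r in resources
                      if spent + r.price_per_job + reserve <= budget]
        if not affordable:
            raise ValueError("budget cannot be met")
        best = min(affordable, key=lambda r: (plan[r.name] + 1) * r.minutes_per_job)
        plan[best.name] += 1
        spent += best.price_per_job
    return {name: n for name, n in plan.items() if n}


def total_cost(resources, plan):
    return sum(plan.get(r.name, 0) * r.price_per_job for r in resources)


def makespan(resources, plan):
    return max(plan.get(r.name, 0) * r.minutes_per_job for r in resources)


# Hypothetical resources: cheaper ones are slower.
pool = [Resource("cheap", 2, 10), Resource("mid", 4, 5), Resource("fast", 6, 2)]
cost_plan = cost_optimise(pool, n_jobs=10, deadline=40)
time_plan = time_optimise(pool, n_jobs=10, budget=60)
print(cost_plan, total_cost(pool, cost_plan), makespan(pool, cost_plan))
print(time_plan, total_cost(pool, time_plan), makespan(pool, time_plan))
```

With these made-up numbers the cost-optimised plan is cheaper but uses the whole deadline, while the time-optimised plan finishes sooner and spends more of the budget. That is the same trade-off the evaluation results show.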