Overview of the CrossGrid Project

advertisement
X#
Overview of the CrossGrid
Project
Marian Bubak
Institute of Computer Science & ACC CYFRONET
AGH, Kraków, Poland
and
Michał Turała
Institute of Nuclear Physics, Cracow, Poland
Cracow Grid Workshop, November 5-6, 2001
Towards the CrossGrid
X#
– 1st meeting January 24, 2001, to join DataGrid
– CPA9 Call
– Extended collaboration meeting at GGF1 (March 7)
• 23 partners
• New type of applications
–
–
–
–
Proposal submitted – April 22, 2001; 22 partners
Comments of reviewers and PO
Negotiations October 24, 2001; 21 partners
...
Cracow Grid Workshop, November 5-6, 2001
X#
CrossGrid Collaboration
Ireland:
Poland:
TCD Dublin
Netherlands:
UvA Amsterdam
Germany:
FZK Karlsruhe
TUM Munich
USTU Stuttgart
Austria:
Portugal:
LIP Lisbon
U.Linz
Spain:
CSIC Santander
Valencia &
RedIris
UAB Barcelona
USC Santiago
& CESGA
Italy:
DATAMAT
Cyfronet
& INP Cracow
PSNC Poznan
ICM & IPJ Warsaw
Slovakia:
II SAS Bratislava
Greece:
Algosystems
Demo Athens
AuTh Thessaloniki
Cracow Grid Workshop, November 5-6, 2001
Cyprus:
UCY Nikosia
Main Objectives
– New category of Grid enabled applications
•
•
•
•
–
–
–
–
computing and data intensive
distributed
near real time response (a person in a loop)
layered
New programming tools
Grid more user friendly, secure and efficient
Interoperability with other Grids
Implementation of standards
Cracow Grid Workshop, November 5-6, 2001
X#
CrossGrid Architecture
Interactive, Compute and Data Intensive Applications
(WP1)
Interactive simulation and visualisation of a
biomedical system
Flooding crisis team support
Distributed data analysis in HEP
Weather forecast and air pollution modelling
HLA
Grid
Visualisation
Kernel
MPI code debugging and
verification
Metrics and benchmarks
Interactive and semiautomatic
performance evaluation tools
New Grid Services and Tools (WP3)
Datagrid
Services
GriPhyN
Data Mining
Grid Application
Programming Environment
(WP2)
Portals and roaming access
Grid resource management
Grid monitoring
Optimisation of data access
...
Globus Middleware
Fabric Infrastructure
Cracow Grid Workshop, November 5-6, 2001
X#
X#
Key functionalities of applications
– Data gathering
• Data generators and data bases geographically distributed
• Selected on demand
– Processing
• Needs large processing capacity on demand
• Interactive
– Presentation
• Complex data require versatile 3D visualisation
• Support interaction and feedback to other components
Cracow Grid Workshop, November 5-6, 2001
Why Interactive Computing?
X#
– Goal: From Data, via Information to Knowledge =>Planning
and Management
– Complexity: Huge data-sets, complex processes
– Approach: Parametric exploration and sensitivity analyses:
• Combine raw (sensory) data with simulation
• Person in the loop:
• Sensory interaction
• Intelligent short-cuts
Cracow Grid Workshop, November 5-6, 2001
X#
Common issues of applications
– Inherently distributed applications profit from grid
approach
– All tasks require high performance & MPI
• 1.1 and 1.2 - interactive, near-real time
• 1.3 and 1.4 - high throughput
– Data mining
• 1.3 and 1.4
– Data discovery
• 1.2 and 1.4
Cracow Grid Workshop, November 5-6, 2001
X#
Example – medical application
Cracow Grid Workshop, November 5-6, 2001
Architecture
Cracow Grid Workshop, November 5-6, 2001
X#
X#
Distributed Data Analysis in HEP
Complementarity with DataGrid HEP application package:
• Crossgrid will develop interactive final user application for physics
analysis, will make use of the products of non-interactive simulation & dataprocessing preceeding stages of Datagrid
• Apart from the file-level service that will be offered by Datagrid, Crossgrid
will offer an object-level service to optimise the use of distributed
databases:
-Two possible implementations (will be tested in running experiments):
–Three-tier model accesing OODBMS or O/R DBMS
–More specific HEP solution like ROOT.
• User friendly due to specific portal tools
Cracow Grid Workshop, November 5-6, 2001
X#
Distributed Data Analysis in HEP
•Several challenging points:
–Access to large distributed databases in the Grid.
–Development of distributed data-mining techniques.
–Definition of a layered application structure.
–Integration of user-friendly interactive access.
•Focus on LHC experiments (ALICE, ATLAS, CMS and LHCb)
Cracow Grid Workshop, November 5-6, 2001
X#
WP2 - Grid Application Programming Environments
Objectives
•
•
•
•
specify
develop
integrate
test
tools that facilitate the development and tuning of parallel
distributed
high-performance and high-throughput computing applications on
Grid infrastructures
Cracow Grid Workshop, November 5-6, 2001
X#
WP2 - Grid Application Programming Environments
Six Tasks in WP2
2.0 Co-ordination and Management
2.1 Tools requirement definition
2.2 MPI code debugging and verification
2.3 Metrics and benchmarks
2.4 Interactive and semiautomatic performance evaluation tools
2.5 Integration, testing and refinement
Cracow Grid Workshop, November 5-6, 2001
X#
WP2 - Components and relations to other WPs
Benchmarks
(2.3)
Performance analysis (2.4)
Automatic
analysis
Application
WP1
running on
testbed WP4
Grid
monitoring
(3.3)
Performance
measurement
Visualization
Analytical
model
MPI
verification
(2.2)
Application
source code
Cracow Grid Workshop, November 5-6, 2001
WP3 Objectives
X#
• Tools for development of interactive compute- and dataintensive applications
• To address user-friendly Grid environments
• To simplify the applications and Grid access by
supporting the end user
• To achieve a reasonable trade-off between resource
usage efficiency and application speedup
• To support management issues while accessing
resources
Cracow Grid Workshop, November 5-6, 2001
WP3
Portals
(3.1)
Roaming Access
(3.1)
Applications
WP1
End Users
Grid Resource
Management
(3.2)
Grid Monitoring
(3.3)
Performance
evaluation tools
(2.4)
Optimisation of
Data Access
(3.4)
Tests and
Integration
(3.5)
Testbed
WP4
WP1, WP2,
WP5
Cracow Grid Workshop, November 5-6, 2001
X#
X#
Testbed Organisation (WP4)
– Testbed setup and incremental evolution
• from several local testbeds to fully integrated one
– Integration with DataGrid
• common design, environment for HEP applications
– Infrastructure support
• flexible fabric management tools and network support
– Verification and quality control
• reliability of the middleware and network infrastructure
Cracow Grid Workshop, November 5-6, 2001
X#
CrossGrid WP4 - International Testbed Organisation
Partners in WP4
TCD Dublin
PSNC Poznan
U v Amsterdam
ICM & IPJ Warsaw
FZK Karlsruhe
USC Santiago
CYFRONET Cracow
II SAS Bratislava
CSIC Santander
LIP Lisbon
WP4 lead by
CSIC (Spain)
CSIC Madrid
Auth Thessaloniki
U A Barcelona
CSIC Valencia
DEMO Athens
Cracow Grid Workshop, November 5-6, 2001
UCY Nikosia
X#
WP4 - International Testbed Organisation
Testbed site responsibles:
– CYFRONET (Krakow) A.Ozieblo
– ICM(Warsaw) W.Wislicki
– IPJ (Warsaw) K.Nawrocki
4.0 Coordination and management
– UvA (Amsterdam) D.van Albada
(task leader: J.Marco, CSIC, Santander)
– FZK (Karlsruhe) M.Kunze
–Coordination with WP1,2,3
– IISAS (Bratislava) J.Astalos
–Collaborative tools (web+videoconf+repository)
– PSNC(Poznan) P.Wolniewicz
–Integration Team
– UCY (Cyprus) M.Dikaiakos
– TCD (Dublin) B.Coghlan
4.1 Testbed setup & incremental evolution – CSIC (Santander/Valencia) S.Gonzalez
– UAB (Barcelona) G.Merino
(task leader:R.Marco, CSIC, Santander)
– USC (Santiago) A.Gomez
–Define installation
– UAM (Madrid) J.del Peso
–Deploy testbed releases
– Demo (Athenas) C.Markou
–Trace security issues
– AuTh (Thessaloniki) D.Sampsonidis
– LIP (Lisbon) J.Martins
Tasks in WP4
Cracow Grid Workshop, November 5-6, 2001
X#
WP4 - International Testbed Organisation
Tasks in WP4
4.2 Integration with DATAGRID (task leader: M.Kunze, FZK)
–Coordination of testbed setup
–Exchange knowledge
–Participate in WP meetings
4.3 Infrastructure Support (task leader: J.Salt, CSIC, Valencia)
–Fabric management
–HelpDesk
–Provide Installation Kit
–Network support
4.4 Verification & quality control (task leader: J.Gomes, LIP)
–Feedback
–Improve stability of the testbed
Cracow Grid Workshop, November 5-6, 2001
Technical Coordination
X#
– Merging of requirements
– Specification and refinement of the GrossGrid architecture
(protocols, APIs; HLA, CCA ...)
– Establishing standard operational procedures
•
•
•
•
repository acces procedures
problem reporting mechanism
handling changed requests mechanism
release preparation procedure
– Specification of the structure of deliverables
– Approach: rapid prototyping and iterative engineering
Cracow Grid Workshop, November 5-6, 2001
Project Phases
X#
M 4 - 12: first development phase: design,
1st prototypes, refinement of requirements
M 25 -32: third development phase:
complete integration, final code versions
M 1 - 3: requirements
definition and merging
M 33 -36: final phase:
demonstration and documentation
M 13 -24: second development phase:
integration of components, 2nd prototypes
Cracow Grid Workshop, November 5-6, 2001
Clustering with # Projects
– Objective – exchange of
• information
• software components
– Our partners
•
•
•
•
DATAGRID
DATATAG
GRIDLAB
EUROGRID and GRIP
– GRIDSTART
– Participation in GGF
Cracow Grid Workshop, November 5-6, 2001
X#
Expected Results of the CrossGrid
–
–
–
–
–
–
Grid enabled interactive applications
Elaborated methodology
Generic application architecture
New programming tools
New Grid services
Extension of the Grid in Europe and to new virtual
organisations
Cracow Grid Workshop, November 5-6, 2001
X#
Dissemination & Exploitation
– Methods & software developed will be available to
scientific community
– Each collaboration partner
• topic conferences, GGF, national Grid initiatives
• MSc, PhD and lectures on Grid technology
– Centralised
•
•
•
•
CrossGrid vortal
workshops, seminars, user/focus groups
newsletter, brochures
industrial deployment
Cracow Grid Workshop, November 5-6, 2001
X#
X#
Overall Links between WPs and Tasks
1.0
Coordination
2.0
Coordination
3.0
Coordination
1.1-1.4
Applications
2.1
Requirements
3.1-3.4
Services
5.1
Coordination
&
Management
5.2
Architecture
Team
2.2-2.4
Tools
4.0
Coordination
GGF
2.5
Tests
3.5
Tests
4.2
Integration with
DataGrid
4.1, 4.3, 4.4
Testbeds
5.3
Dissemination
& Exploitation
DataGrid Other Grid Projects
Cracow Grid Workshop, November 5-6, 2001
X#
Ready to start
January 1, 2002
Cracow Grid Workshop, November 5-6, 2001
Download