WP2+5: Data and Storage Management Gavin McCance – Jens Jensen -

advertisement
WP2+5: Data and Storage
Management
Gavin McCance – University of Glasgow
Jens Jensen - RAL
University of Glasgow
GridPP9 NeSC, Edinburgh
DataGrid is a project funded by the European Union
GridPP is funded by PPARC
GridPP9 – 5 February 2004 – Data Management
EDG Summary Report
WP2: Data Management
Gavin McCance – University of Glasgow
GridPP9 – 5 February 2004 – Data Management – n° 2
The team
 Based
at Glasgow University

William Bell

David Cameron

Tony Doyle

Gavin McCance

Paul Millar

Caitriana Nicholson
 We
formed close collaboration with WP2 of European DataGrid

Became an integral part of the WP2 team

All our products were delivered jointly with EDG
Gavin McCance – University of Glasgow
GridPP9 – 5 February 2004 – Data Management – n° 3
What we set out to do?
 Initial
 Grid



work-plan was in two key areas:
Optimisation
Simulating the Grid from the data-centric viewpoint
Applying the results from the simulations to develop algorithms
for optimisation of replicas
Development of optimisation software for replica management
 Metadata

management
Development of secure metadata services for the experiments
Gavin McCance – University of Glasgow
GridPP9 – 5 February 2004 – Data Management – n° 4
What we actually did…
 Grid




Optimisation
Developed a data-centric Grid simulation [OptorSim] (fully
documented final release available now)
Compared of a variety of replica optimisation techniques and
algorithms (many publications)
Worked on developing and refining these algorithms
Exploitation of some of the early work, we developed the Replica
Optimisation Service (ROS) component of the Replica
Management System

provides file access cost estimation to the user
and the WP1 resource broker.
Gavin McCance – University of Glasgow
GridPP9 – 5 February 2004 – Data Management – n° 5
What we actually did…
 Metadata





Management
Early production of the Spitfire browser service allowing remote
access to RDBMS via a web browser
Early adoption of GSI security into Spitfire to allow secure use of
the service using Grid certificates
Initial adoption of web services into EDG with the creation of the
web services API version of Spitfire, allowing programmatic access
to remote RDMBS
The web services technology built for Spitfire was subsequently
factored out and used in all other WP2 services
The security built into Spitfire was subsequently factored out and
led to the creation of the WP2 Java security modules
Gavin McCance – University of Glasgow
GridPP9 – 5 February 2004 – Data Management – n° 6
What we actually did…
 Replica





Management Services
As focus of EDG changed to “stability, stability, stability” and
production, we changed with it
Helped factor-out the web services and security frameworks from
Spitfire
Moved into the core replica management services

Replica Manager

Replica Location Service

Replica Metadata Catalogue

Replica Optimisation Service
Provided continued support for these products to EDG and LCG
Led the thorough performance testing of our products
for WP2’s final EU deliverable
Gavin McCance – University of Glasgow
GridPP9 – 5 February 2004 – Data Management – n° 7
What we actually did…
 WP2

Integration and Testing
Significant integration expertise developed
Paul Millar is the WP2 integration team member


Glasgow ran the WP2 CVS repository

We created the testing framework for WP2 software
 WP2

Testbed support
Provided support for the WP2 mini-testbed
Machines managed at ScotGrid and at CERN

 WP2

Management
Gavin McCance is WP2 deputy
 Major
contributions to WP2 deliverables
Gavin McCance – University of Glasgow
GridPP9 – 5 February 2004 – Data Management – n° 8
What we actually did…
 Standards

Contribution of GGF Data Area documents
 Publications

Work was fully reported: 15 or so papers published…
Optimisation
Metadata
Replication
GGF
Deployment
6
3
2
3
1
Gavin McCance – University of Glasgow
GridPP9 – 5 February 2004 – Data Management – n° 9
EDG Effort Delivered
 Substantial
UK effort delivered to EDG : ~150% of contracted
Project Months
Integrated Effort
90
80
70
60
50
40
30
20
10
0
Integrated effort PM
Contracted effort PM
1
2
3
4
5
6
7
8
9
10 11 12
Quarter
Gavin McCance – University of Glasgow
GridPP9 – 5 February 2004 – Data Management – n° 10
EDG Summary Report
WP5: Storage Management
(Jens Jensen)
Gavin McCance – University of Glasgow
GridPP9 – 5 February 2004 – Data Management – n° 11
The team
 Based
at RAL

Tim Eaves

Tim Folkes

John Gordon

Jens Jensen

Glen Johnson

Tara Shah

Bonny Strong

Owen Synge

Regina Tam
 UK
WP5 managed the WP5 team at CERN, SARA and IN2P3.
Gavin McCance – University of Glasgow
GridPP9 – 5 February 2004 – Data Management – n° 12
What we did…
 Developed

 SE

Storage Element
uniform control interface to MSS: disk, ADS, CASTOR, HPSS
provides additional functionality for storage systems
e.g. directory hierarchies for mass storage with flat filenames
 Modular
design allows easy deployment in front of different
backing mass storage systems
 SE
provides disk cache in front of mass storage
 SE
can provide secure access (authentication and
authorisation) to the mass storage resource.
Gavin McCance – University of Glasgow
GridPP9 – 5 February 2004 – Data Management – n° 13
What we did…
 Demonstrated
3rd party transfers between different mass
storage systems
 Work
on interfacing with Replica Management service and with
the information services
 LCG

now requires SRM v1 to access storage
EDG SE is architected for easy migration to this
 Significant

work in standards development
Input into the SRM standards process
Gavin McCance – University of Glasgow
GridPP9 – 5 February 2004 – Data Management – n° 14
What we did…

Overall WP5 Management


WP5 Integration and Testing


Significant integration expertise developed

Owen Synge and Regina Tam were EDG integration team members

Regina Tam was on the EDG Software Quality team
EDG Architecture Team


John Gordon and Jens Jensen
Jens Jensen was WP5 member of Architecture team.
UK responsible for all WP5 deliverables
Gavin McCance – University of Glasgow
GridPP9 – 5 February 2004 – Data Management – n° 15
EDG effort delivered
 Substantial
UK effort delivered to EDG : ~150% of contracted
Integrated Effort
160
Project Months
140
120
100
Integrated effort (PM)
80
Contracted effort (PM)
60
40
20
0
1
2
3
4
5
6
7
8
9 10 11 12
Quarter
Gavin McCance – University of Glasgow
GridPP9 – 5 February 2004 – Data Management – n° 16
WP2 + WP5: What we learned
 Software

sometimes learned the hard way…!
 Much

development process…
feedback from our users on the software
Very useful for the design process in the next projects
 The
amount of effort required to make production quality
services out of our software
 Take
all this forward into GridPP2!
Gavin McCance – University of Glasgow
GridPP9 – 5 February 2004 – Data Management – n° 17
Download