Contents
• e-Science
• e-Infrastructure (~ cyberinfrastructure)
• Grid concepts
• NGS: National Grid Service (UK)
• EGEE: Enabling Grids for e-Science (EU funded)
• Building future infrastructure: European Grid Initiative

'e-Science is about global collaboration in key areas of science, and the next generation of infrastructure that will enable it.'
John Taylor, Director General of Research Councils, Office of Science and Technology, 2000

e-Infrastructure

e-Infrastructure = Networks + Grids .. + Operations, Support, Training… + Data centres, archives, instruments…
• Networks connect resources
• Grids enable flexible use of networked resources: "virtual computing"

Grid concepts

Grids: a foundation for e-Science

• Enabling a whole-system approach
• Effect > Σparts

[Figure: the Grid linking computers, software, sensor nets, instruments, colleagues and shared data archives]

Virtual organisations and grids

• What is a Virtual Organisation?
– People in different organisations seeking to cooperate and share resources across their organisational boundaries
– E.g. A research collaboration
• Each grid is an infrastructure enabling one or more "virtual organisations" to share and access resources
• Each resource is exposed to the grid through an abstraction that masks heterogeneity, e.g.
– Multiple diverse computational platforms
– Multiple data resources
• Resources are usually owned by VO members. Negotiations lead to VOs sharing resources
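
The idea of exposing each resource through an abstraction that masks heterogeneity can be sketched in a few lines of Python. This is an illustrative sketch only: the class names (`GridResource`, `ComputeCluster`, `DataArchive`), the `resources_for` helper and the example VO names are all hypothetical and do not correspond to any real grid middleware API.

```python
from abc import ABC, abstractmethod
from typing import List, Set


class GridResource(ABC):
    """Hypothetical uniform view of a shared resource, whatever its type."""

    def __init__(self, name: str, owner: str, shared_with: Set[str]):
        self.name = name
        self.owner = owner              # organisation that owns the resource
        self.shared_with = shared_with  # VOs granted access through negotiation

    def accessible_to(self, vo: str) -> bool:
        """True if the named VO has been granted access to this resource."""
        return vo in self.shared_with

    @abstractmethod
    def describe(self) -> str:
        """Type-specific summary; callers never need to know the concrete type."""


class ComputeCluster(GridResource):
    def __init__(self, name: str, owner: str, shared_with: Set[str],
                 cpus: int, scheduler: str):
        super().__init__(name, owner, shared_with)
        self.cpus = cpus
        self.scheduler = scheduler

    def describe(self) -> str:
        return f"compute: {self.cpus} CPUs under {self.scheduler}"


class DataArchive(GridResource):
    def __init__(self, name: str, owner: str, shared_with: Set[str],
                 terabytes: float):
        super().__init__(name, owner, shared_with)
        self.terabytes = terabytes

    def describe(self) -> str:
        return f"data: {self.terabytes} TB"


def resources_for(vo: str, resources: List[GridResource]) -> List[str]:
    """Names of the resources a VO may use, in the order given."""
    return [r.name for r in resources if r.accessible_to(vo)]


if __name__ == "__main__":
    pool = [
        ComputeCluster("hpc-a", "University A", {"geo-vo"}, cpus=512, scheduler="PBS"),
        DataArchive("archive-b", "Institute B", {"geo-vo", "bio-vo"}, terabytes=40.0),
    ]
    for resource in pool:
        print(resource.name, "->", resource.describe())
    print(resources_for("bio-vo", pool))
```

The point of the sketch is that a VO member asks the same questions of every resource (who may use it, what it offers) regardless of the platform behind it.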

Typical current grid

• Virtual organisations bring and/or negotiate access to resources
• Grid middleware runs on each shared resource
• Provides
– Data services
– Computation services
– Single sign-on
• Distributed services (both people and middleware) enable the grid

The Role of the Virtual Organisation (VO)

[Figure: a VO Service shown between two Compute Centers]

The many scales of grids

National datacentres, HPC, instruments
Institutes' data
Wider collaboration, greater resources
International instruments,..
International grid (EGEE)
UK: National Grid Service
Regional grids
Campus grids
Condor pools, clusters
Desktop

Little interoperability across these scales of grids – yet.

(Some of the) Basic grid services

• In both EGEE and NGS:
– Authentication and authorisation underpin it all
Grid Security Infrastructure: X.509 – issued by Certificate Authority
Additional VO credentials – "VOMS"
– Compute services
Broker – user submits job "to the grid"
Jobs run in batch mode under e.g. LSF, PBS,…
– Data services - Next slide!
– VO-specific and "Higher level services" build on these
Portals,… Application hosting services
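
To make the broker idea concrete, here is a toy Python sketch of how a job submitted "to the grid" might be matched to a site. It is a deliberate simplification under stated assumptions: the `Site` and `Job` records, the `broker` function and the "most free CPUs wins" rule are all hypothetical, and real brokers in EGEE or NGS apply far richer matchmaking. The `vo` field stands in for a VOMS-style credential and `owner_dn` for the subject of the user's X.509 certificate; no actual certificate checking is performed.

```python
from dataclasses import dataclass
from typing import List, Optional, Set


@dataclass
class Site:
    name: str
    batch_system: str        # e.g. "LSF" or "PBS"; jobs run in batch mode there
    supported_vos: Set[str]  # VOs this site has agreed to serve
    free_cpus: int


@dataclass
class Job:
    owner_dn: str  # assumed: subject name from the user's X.509 certificate
    vo: str        # assumed: VO membership asserted by a VOMS-style credential
    cpus: int      # number of CPUs requested


def broker(job: Job, sites: List[Site]) -> Optional[Site]:
    """Pick a site for the job, or None if no site is suitable.

    Toy policy: the site must support the job's VO and have enough free
    CPUs; among those, choose the one with the most free CPUs.
    """
    candidates = [
        s for s in sites
        if job.vo in s.supported_vos and s.free_cpus >= job.cpus
    ]
    if not candidates:
        return None
    return max(candidates, key=lambda s: s.free_cpus)


if __name__ == "__main__":
    sites = [
        Site("site-a", "PBS", {"geo-vo"}, free_cpus=4),
        Site("site-b", "LSF", {"geo-vo", "bio-vo"}, free_cpus=16),
        Site("site-c", "PBS", {"bio-vo"}, free_cpus=100),
    ]
    job = Job(owner_dn="/C=UK/O=eScience/CN=example user", vo="geo-vo", cpus=8)
    chosen = broker(job, sites)
    print(chosen.name if chosen else "no suitable site")
```

The user never names a site: authorisation (the VO check) and resource selection both happen inside the broker, which is what submitting "to the grid" means here.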

2 main types of data services on Grids

• Simple data files on grid-specific storage
• Middleware supporting
– Replica files: placed close to where you want computation, and kept for resilience
– Logical filenames
– Catalogue: maps logical name to physical storage device/file
– Virtual filesystems, POSIX-like I/O
– Services provided: storage, transfer, catalogue that maps logical filenames to replicas.
• Solutions include
– gLite data service (EGEE)
– Globus: Data Replication Service
– Storage Resource Broker
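
The catalogue that maps a logical filename to its physical replicas can be illustrated with a minimal Python sketch. Everything here is hypothetical: the `Replica` record, the `ReplicaCatalogue` class, its method names and the example hostnames are invented for illustration and are not the interface of gLite, the Globus Data Replication Service or the Storage Resource Broker.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Replica:
    host: str  # storage element holding this physical copy
    path: str  # location of the file on that storage element


class ReplicaCatalogue:
    """Toy catalogue: logical filename -> list of physical replicas."""

    def __init__(self) -> None:
        self._entries: Dict[str, List[Replica]] = {}

    def register(self, logical_name: str, replica: Replica) -> None:
        """Record a physical copy of a logical file (duplicates are ignored)."""
        replicas = self._entries.setdefault(logical_name, [])
        if replica not in replicas:
            replicas.append(replica)

    def replicas(self, logical_name: str) -> List[Replica]:
        """All known physical copies, in registration order."""
        return list(self._entries.get(logical_name, []))

    def closest(self, logical_name: str, compute_host: str) -> Replica:
        """Prefer a copy on the host where the job runs; otherwise fall back
        to the first registered copy. Raises KeyError if none is known."""
        copies = self.replicas(logical_name)
        if not copies:
            raise KeyError(logical_name)
        for copy in copies:
            if copy.host == compute_host:
                return copy
        return copies[0]


if __name__ == "__main__":
    catalogue = ReplicaCatalogue()
    catalogue.register("lfn:/geo/run42.dat", Replica("se.site-a.example.org", "/data/run42.dat"))
    catalogue.register("lfn:/geo/run42.dat", Replica("se.site-b.example.org", "/store/geo/run42.dat"))
    print(catalogue.closest("lfn:/geo/run42.dat", "se.site-b.example.org"))
```

Users and jobs refer only to the logical name; which physical copy is read is decided at lookup time, which is what makes both locality and resilience possible.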

• Other data, e.g. …
– Structured data: RDBMS, XML databases,…
– Files on project's filesystems
– Data that may already have other user communities not using a Grid
• Require extendable middleware tools to support
– Computation near to data
– Controlled exposure of data without replication
• Basis for integration and federation
• OGSA-DAI
– In Globus 4
– Not (yet...) in gLite

National Grid Service

EGEE – international e-infrastructure

A four year programme (from April 2004):
• Build, deploy and operate a consistent, robust, large-scale production grid service that
– Links with and builds on national, regional and international initiatives
• Improve and maintain the middleware in order to deliver a reliable service to users
• Attract new users from research and industry and ensure training and support for them

Pan-European Grid Operations, Support and training
• Collaboration
• Network infrastructure & Resource centres

Production service Sites

Size of the infrastructure today:
• 192 sites in 40 countries
• ~25 000 CPUs
• ~3 PB disk, plus tape MSS

The Vision of the NGS

• National infrastructure services which allow researchers to:
– systematically create, process, preserve and publish digital information;
– easily navigate through the available resources;
– be confident in the quality of the services available;
– tie into international efforts
• To achieve this, the NGS will
– Lead the deployment of a common grid infrastructure
– Promote common open standards
– Through the NGS Partnership programme, integrate services to access a growing number, scale and variety of resources

• A production Service

NGS & Partners, 2006

To e- or not to e-, that is the question

• And in the geo world it's a no-brainer.
– Integrating resources (data, CPUs, expertise) across semantic and admin domains
– Orchestrating services: data, computation and models
– Collaborating in key research & public service support
• BUT
– are the foundations of grids strong enough?
– Do NGS, EGEE have adequate Authentication & Authorisation? Often, yes! But richer authorisation services are needed
• Perhaps the biggest problems of all
– Are we willing to invest in and sustain production quality services for others to use? An ecology of geo-services….
– Will "The People Grid" grow? Will competition squash cooperation?
This workshop & OGF-GIS WG,… are reasons for optimism

EGEE - Further information

• EGEE www.eu-egee.org
• EGEE digital library: http://egee.lib.ed.ac.uk/
• gLite http://www.glite.org/
• Real-time monitors: http://gridportal.hep.ph.ic.ac.uk/rtm
• EGEE training: http://egee.nesc.ac.uk

NGS Information

• http://www.ngs.ac.uk
• Wiki: http://wiki.ngs.ac.uk
• To see what's happening: http://ganglia.ngs.rl.ac.uk/
• Training events: http://www.nesc.ac.uk/training