Executive Summary

advertisement
PUBLISHABLE EXECUTIVE SUMMARY
Enabling Grids for E-sciencE
Accomplishments during the first
reporting period
April 2004 to December 2004
Project Goals
The EGEE project involves partners from over 27 countries with the common
aim of developing an international computing grid infrastructure in Europe
which has access to major computing resources worldwide. This grid
infrastructure will be available to scientists around-the-clock and independent
of their geographic location, and is based on the EU Research Network
GÉANT. The goals of the EGEE project are:
* to build a consistent, robust and secure grid that attracts new computing
resources;
* to improve and maintain the middleware in order to deliver a reliable service
to users;
* to attract new users from science and industry and ensure training and
support for them.
Funded by the European Commission, the EGEE project consists of 12
partner federations, with 70 contractors and over 30 non-contracting
participants. Two pilot application domains have been selected for the project:
the Large Hadron Collider Computing Grid (LCG¹) supporting data-intensive
High Energy Physics (HEP) experiments and the Biomedical Grids,
addressing bioinformatics and healthcare data. The project comprises eleven
activities which are summarised below.
¹ The international grid project for particle physics. See http://lcg.web.cern.ch/LCG/
EGEE is a project funded by the European Commission - contract number INFSO-RI-508833
PUBLISHABLE EXECUTIVE SUMMARY
Project Management (NA1)
Prior to the project launch, the project management ensured submission of the
EU contract and Consortium Agreement and coordinated preparation of the
programme of work.
The main boards and committees established and coordinated by the project
management are: the Project Management Board (PMB) which deals with
managerial issues and assesses project progress (quarterly); the
Administrative Federation Meeting (AFM) which monitors, discusses and
solves administrative issues; the Project Executive Board (PEB) in which
activity leaders meet with the Project Director and Technical Director and
monitor technical progress of the project (weekly); the Collaboration Board
(CB) composed of representatives from all EGEE official partners (meet biannually at project conferences); the External Advisory Committee (EAC)
whose members attend the project conferences and provide feedback and
advice on the project’s progress to the Project Director.
During the first nine months of the project, the management has completed the
following tasks: instigated the writing of the execution plan documents from
each activity during the project start-up phase (specifies work breakdown,
individuals involved and deliverables); established the reporting procedures for
the quarterly reporting exercise in collaboration with the quality assurance
activity manager (includes timesheet reporting process, which was
successfully automated in the second quarter of the project); prepared the first
contract amendment, resolving inconsistencies and completing outstanding
sections; developed an open source licence for all software produced in the
project; participated in major technical conferences and public events worldwide on behalf of the project; established and trained the project office; and
managed and coordinated all project activities.
Dissemination and Outreach (NA2)
NA2 has designed and developed a consistent brand and style for all project
dissemination material and has produced a project folder, written a variety of
information sheets about key areas of the project and designed templates for
generic posters, technical posters, word documents (information sheets and
news releases) and presentations.
On the internet, the EGEE main portal gives users the opportunity to enter the
external site, technical site or training site. The external website targets a
broad audience and the NA2 partners have developed national language
versions using similar layouts. Journalists and the public can register on the
site to join the EGEE mailing list. A monthly EGEE newsletter is produced by
the CERN project office since September 2004.
To date there have been three press releases issued centrally by TERENA and
CERN and 24 issued in different languages (mainly translations of the centrally
issued release). EGEE has been featured in over 90 publications (online and
print) and promoted at over 100 different events worldwide.
Two EGEE conferences have been organised so far, one in Ireland (in April
2004) with 300 participants and one in The Netherlands (November 2004) with
401 participants. NA2, in collaboration with NA1 and NA4, also organised an
EGEE stand and live demonstrations at the IST 2004 event at The Hague,
November 2004.
EGEE is a project funded by the European Commission - contract number INFSO-RI-508833
PUBLISHABLE EXECUTIVE SUMMARY
User Training and Induction (NA3)
Over the reporting period, the EGEE training activity (NA3) has rapidly built up
capacity to ensure consistent delivery of quality training. Some 39 induction
and training events have been held, attended by over 1000 participants in
total. All of the federations have delivered training activities and the material
from all events is available via the NA3 web pages.
The GILDA testbed and GENIUS portal, developed by NA4, provide an
important resource for EGEE training which gives new users the opportunity to
access grid applications without waiting for certification. This allows a deeper
interaction with applications after becoming certified but before joining a Virtual
Organisation (VO). It also allows application developers to gain experience in
a grid environment.
Applications Identification and Support (NA4)
EGEE has already achieved its two-year target of having five disciplines
making use of the infrastructure. All four LHC HEP experiments (ALICE,
ATLAS, CMS and LHCB) have successfully used the EGEE infrastructure for
data challenges during 2004. There is a growing decentralized usage by
biomedical application developers, with about 20,000 jobs submitted
corresponding to approximately 30,000 hours CPU hours. Earth Sciences
applications, selected by the EGEE Generic Applications Advisory Panel
(EGAAP) in June 2004, are already active on the production infrastructure.
Astrophysics and computational chemistry are using the GILDA facility for
further testing.
EGEE support of LCG has helped ensure migration of HEP pilot applications,
through the creation of the HEP ARDA project. This project evaluates the new
EGEE middleware and has provided the experiment analysis prototype
systems to all 4 LHC experiments. Other HEP experiments, including DØ,
CDF and BaBAr, are also starting to make use of the EGEE infrastructure.
EGEE is a project funded by the European Commission - contract number INFSO-RI-508833
PUBLISHABLE EXECUTIVE SUMMARY
Policy and International Cooperation (NA5)
NA5 cooperates with national and international grid initiatives (US NSF), for a
(GGF) and other projects, and supports the e-Infrastructures and eInfrastructures Reflection group (eIRG) events. The purpose of the eIRG is to
facilitate dialogue between technical experts and policy makers in all
European countries.
In close collaboration with Project Management, NA5 has concentrated on:
cooperation between EGEE and the US future NSF Cyberinfrastructure
initiative; establishment of Russian and US grid federations in the EGEE
project management structure; preparation of a Memorandum of
Understanding (MoU) signed with South Korea to cover collaboration on grids
for physics and biology; planning the extension of the EGEE infrastructure to
the Baltic States and EGEE training events there in autumn 2004; and further
extensions to Latin American countries and other non-European areas.
This activity helped organise the e-Infrastructures workshops and meetings
and edited the eIRG White Paper, as well as revisions of this produced for the
eIRG meetings in April 2004 and November 2004. During the World
Conference on Information Technology, the activity supported the organisation
of a special forum dedicated to grid technology and services. NA5 has also
engaged with other EU grid projects, including the EU RI DEISA project, to
help develop a roadmap for the European Research Infrastructure.
Collaborative agreements have been made with SEE-GRID and GRIDCC, and
there has been consultation with related projects and initiatives such as
DILIGENT and the UK Open Middleware Infrastructure Institute (OMII).
In conjunction with the Project Office, NA5 organised and contributed to the
first Concertation meeting on e-Infrastructures in November 2004, which was
arranged in combination with the second EGEE conference. More than 240
people attended from over 15 European Research Infrastructure and Grid
Research projects.
EGEE is a project funded by the European Commission - contract number INFSO-RI-508833
PUBLISHABLE EXECUTIVE SUMMARY
Grid Support, Operations and Management (SA1)
Leveraging resources provided by LCG, SA1 has assembled and operates a
production grid with more than 90 resource centres providing in excess of
9000 processors, thereby greatly surpassing the two-year target of 50 sites. All
EGEE federations are actively using this grid and have resources available on
it at a level close to or exceeding project commitments (Annex 1 of contract).
There are more than 700 registered users of this grid. There are some 2000
CPU provided through non-EGEE sites in Taiwan, China, US, Japan, Canada,
Pakistan and India.
The establishment of the Regional Operations Centres (ROCs) in all the
regions is complete and they provide the first level support for deployment and
operations within their regions. The establishment of the Core Infrastructure
Centres (CICs) is well advanced, and the development plan is being
implemented. The four CICs currently take weekly turns to provide a (5x8)
service to monitor infrastructure problems and their resolution. The activity has
established tools to support this work, and written operational procedures that
are being implemented and further developed. This service started in
November 2004.
The processes for the creation and management of new Virtual Organisations
(VOs) have been established. The BioMed VO, Earth Resources VO, MAGICS
VO, EGEODE VO and Computational Chemistry VO are active alongside the
HEP VOs. In total there are more than 50 VOs deployed on the infrastructure,
including around 40 VOs of a regional nature.
Significant effort has been invested in the support of the LCG data challenges.
This is the first major use of a grid service of this size, providing valuable
feedback on middleware functionality, operations procedures, and debugging
and management tools. Peaks of over 8000 concurrent jobs have been
reached.
The pre-production service is active, using the LCG-2 middleware, with a few
gLite (the new EGEE middleware) components deployed on top of the basic
service. Technical discussions are taking place with other grid projects
(Grid3/Open Science Grid (OSG) and NorduGrid) to achieve interoperability
between the different infrastructures.
EGEE is a project funded by the European Commission - contract number INFSO-RI-508833
PUBLISHABLE EXECUTIVE SUMMARY
Network Resource Provision (SA2)
A document containing a comprehensive view of network services in GÉANT
and European NRENs was compiled during the summer, with the help of
TERENA which includes a detailed report about the implementation of network
services. The operational interface was drafted and then discussed at the
second EGEE conference. The updated version defines the major tasks and
outlines the main issues and solutions to them. Operational interface
components have been proposed, including Trouble Ticket Manager (TTM),
EGEE Network Operations centre (ENOC), Topology Database and SLA
database.
SA2 is due to produce a document "Open issues and model for network
services provisioning from GÉANT and the NRENs towards EGEE" early in
2005. A document with EGEE requirements was compiled. It focuses mainly
on two areas: Bandwidth Allocation and QoS, and Network Performance
Monitoring.
A document on terms and definitions used for SLR formulation was prepared,
discussed with EGEE members and published. It proposes the structure and
syntax of Service Requests. A study of service classes currently available in
GÉANT was also produced. It proposes new service classes, suitable for
serving the SLRs of EGEE applications.
Middleware Re-engineering and Integration (JRA1)
The architecture of the next generation grid middleware (gLite) being
developed by JRA1 has been documented. The gLite grid services follow a
Service Oriented Architecture, facilitating interoperability among grid services
and allowing easier compliance with upcoming standards such as WSRF. The
architecture of this set of services is not bound to specific implementations and
can be deployed and used independently.
The definition of the gLite grid service interfaces enables other grid
middleware implementations to offer the same interfaces allowing different
suppliers to interoperate and be interchanged. The interface design of the
gLite grid services has been completed and documented.
Existing services have been selected where possible from previous projects
such as AliEn, EU DataGrid and VDT. gLite was first installed on a distributed
testbed spanning CERN and Wisconsin University. Expert users from NA4
assessed the interfaces and semantics of the grid services of this prototype.
This testbed was not foreseen in the description of work but has proved
invaluable for gLite development.
gLite components passing integration and testing phases have been delivered
to SA1 for deployment on a pre-production service. Services developed within
gLite are: Workload Management Services; Data Management Services;
Information and Monitoring Services; Virtual Organisation Membership
Services (VOMS); Grid Access Services (GAS); and Package Manager (PM)
suites. Links to other projects have been established, including OGSA-DAI,
Unicore, and GEMSS. Contacts have been made with Platform Computing
and PPDG in the US, the University of Manchester (WSRF:Lite) and the
MammoGrid project. Existing contacts with DILIGENT and GridLab have been
intensified. Various activity members contribute to ongoing standardisation
efforts through GGF research and working groups.
EGEE is a project funded by the European Commission - contract number INFSO-RI-508833
PUBLISHABLE EXECUTIVE SUMMARY
Quality Assurance (JRA2)
The Quality Assurance Group (QAG), composed of quality representatives
from each activity, meets monthly. The Quality Plan, including the EGEE
Measurement Plan, has been delivered, and defines quality measures
applying throughout the project. It ensures standards and procedures are
established and followed throughout the project life cycle, defines quality
criteria by which the services can be evaluated as well as procedures to define
and monitor services (e.g. documentation, user support) within operations.
The main project procedures (document management and associated
templates, EU deliverables review, etc.) have been implemented on the JRA2
pages of the technical website. The main project metrics destined to evaluate
the project progress have been identified and the Measurement Plan
initialised.
Security (JRA3)
The middleware security group (MWSG), chaired by JRA3, is where activities
JRA1, JRA3, SA1, NA4, the Joint Security Group, Global Grid Forum (GGF),
Open Science Grid (OSG) and others meet to solve security issues
concerning the middleware design. Security requirements from all parts of the
project are prioritised by MWSG and handed over to JRA3 for implementation.
Together with the Project Office and NA5, this activity supports cross-project
collaboration aiming at a shared security infrastructure via the Joint Security
Policy Group (JSPG). Other achievements include: middleware security work
for the guiding documents Global Security Architecture and OGSA SEC
service initial recommendations for reengineering; supporting preparation of
security components for the JRA1 middleware (gLite) as part of software
maintenance and development; preparation of the document Security
operational procedures and incident handling, definition of a common grid
incident format in coordination with JSPG and OSG.
The activity’s security architect has been appointed area director of Security at
GGF12, facilitating future standardisation efforts. The activity also contributes
to integrating the Globus Dynamic Account System (DAS), and to coordinating
and collaborating with JRA1 security work (VOMS) and LCG security work
(VOMS admin). JRA3 is supporting LCG for legacy components LCAS and
LCMAPS and the grid-ftp server, and has completed the set-up of the PMA for
European Certification Authorities (CAs). Since then a recurring task has been
to chair the EUGridPMA and to expand the number of members.
EGEE is a project funded by the European Commission - contract number INFSO-RI-508833
PUBLISHABLE EXECUTIVE SUMMARY
Network Services Deployment (JRA4)
After an initial requirements gathering exercise, JRA4 initiated its prototype
implementation stage. In the network performance monitoring sub-activity
(NPM) a common interface for publishing and accessing network performance
information was agreed upon. This interface is based upon the NM-WG GGF
recommendation and involved GÉANT, participating NRENs and EGEE. An
early prototype not intended for inclusion in gLite was produced.
In the bandwidth allocation and reservation (BAR) on-demand sub-activity the
first high-level specification of the components and interfaces required to
request and access bandwidth services from the network providers has been
produced. A draft architecture was generated and the first BAR prototype is
being developed. A joint meeting at the second EGEE conference, with 55
participants, resulted in collaboration with JRA1 and JRA3. The Technical
Network Liaison Committee (TNLC) meeting at the same event strengthened
links with SA2. For the IPv6 task, contacts were established with JRA1 and
NA3, as well as with 6NET, to plan raising awareness. IPv6 training events
have been planned for EGEE partners to promote uptake.
EGEE is a project funded by the European Commission - contract number INFSO-RI-508833
Download