Enabling Grids for E-sciencE Earth Science Activity in EGEEII and DEGREE M. Petitdidier (IPSL/CETP), monique.petitdidier@cetp.ipsl.fr In collaboration with EGEE and DEGREE EUproject partners www.eu-egee.org INFSO-RI-031688 Geoinformatics Workshop – M. Petitdidier –8 March 2007 Planet Earth : a complex system GOME total ozone assimilation Stratospheric Ozone Industrial Emissions Atmospheric profiles Topography & Motion Land cover & vegetation Marine SST, SSH& colour Currents, bathymty & ice 10 y displacement of Etna 1992-01 SSA- IST 2005-034619 Geoinformatics Workshop – March 2007 Acute Questions for Earth Science Forecasting of meteorological events Extreme events: storm, tornado, hurricane, earthquake, tsunami… Water management: precipitations, flood, aquifer.. Pollution,,… => To provide real time information: real time data access, data assimilation and modelling Long term prediction : Climate Change Climatology i.e. trend of parameter variations like temperature, precipitations…. Polar ozone hole Impact on agriculture, weather… => Long series of multiple data sets, Intense processing, Discoveries and Dissemination of the knowledge How it works, Why…. These are Questions for Operational and Science organisations SSA- IST 2005-034619 Geoinformatics Workshop – March 2007 GRID: a solution GRID infrastructure – Considered an “open platform” for handling computing resources, data, tools… not limited to high-performing computing • Partner can use a lot more resources than the ones he (she) brings in – Impressive number of shared resources • EGEEII around 32,000 CPUS distributed in 200 sites • 13 PB storage – A collaborative possible platform among teams and/or countries • interactive collaboration to avoid effort duplication – Secure and restricted access to resources, data, tools… • Same data and software policy as outside Grid – Grid will open new fields of investigation – on Earth Science SSA- IST 2005-034619 Geoinformatics Workshop – March 2007 GRID: Nearly world-wide deployment Enabling Grids for E-sciencE – European Union § § § § § § § BalticGrid EELA EUChinaGrid EUIndiaGrid EUMedGrid SEE-GRID NorduGrid – USA § OSG – Japan § Naregi – Africa- Unesco Programme § Algeria, Ghana, Nigeria, § Senegal, Zimbabwe INFSO-RI-031688 Geoinformatics Workshop – M. Petitdidier –8 March 2007 EGEEII – ES VO Enabling Grids for E-sciencE • 2 Virtual Organisations : – Due to different data policy for Academic and private research • VO ESR (Earth Science research) – As an average around 30 – not always the same persons – Mainly scientists, few tests carried out by Company – Bulgaria, Finland, France, Germany, Greece, Italy, Russia, Slovakia,Switzerland, The Netherlands – Around 1500CPUs/day for ESR • VO EGEODE (Expanding GEOsciences on DEmand) – centered around Geocluster (software from Compagnie Générale de Géophysique) around 10-20 persons – Recently French Academic institutes already using Geocluster join the VO INFSO-RI-031688 Geoinformatics Workshop – M. Petitdidier –8 March 2007 Earth Science Applications in EGEEII Enabling Grids for E-sciencE Flood of a Danube riverCascade of models (meteorology,hydraulic ,hydrodynamic….) UISAV(SK)ESA, UTV(IT), KNMI(NL), IPSL(FR)Production and validation of 7 years of Ozone profiles from GOME Rapid Earthquake analysis (mechanism and epicenter) 50- 100CPUs Geocluster for Academy and industry CGG(FR)- IPGP(FR) DKRZ(DE)- Data access studies, climate impacts on agriculture Mars atmosphere CETP( FR): INFSO-RI-031688 Specfem3D: Seismic application. Benchmark for MPI (2 to 2000 CPUs) (IPGP,FR) Data mining Meteorology & Space Weather (GCRAS, RU) Air Pollution model- BAS(BG) Modelling seawater intrusion in costal aquifer (SWIMED) CRS4(IT),INAT(TU), Univ.Neuchâtel(CH)- Geoinformatics Workshop – M. Petitdidier – 8 Mars 2007 Earthquake Characteristics Enabling Grids for E-sciencE Fast Determination of mechanisms of important earthquakes (IPGP: E. Clévédé, G. Patau; IPSL: D. Weissenbach) Application to run on alert Collect data of 30 seismic stations from GEOSCOPE worldwide network Select stations and data Define a spatial 3D grid +time based on the assumed earthquake location Run for each grid point or group of grid points a job => ~ 50-100jobs Results obtained ~6hr after the earthquake Important for emergency action and other related researches All major earthquakes so treated: 21/24 in 2006 => catalogue INFSO-RI-031688 Geoinformatics Workshop – M. Petitdidier –8 March 2007 Enabling Grids for E-sciencE Cellular automaton for geomorphological research IPGP: C. Narteau , O. Rozier • Objective : understanding landscapes formation and • Computationnal evolution resources needed : • Examples : erosion of the • CPU time processing (one mountains, dynamic of simulation ~ 4 hours) dunes ... • RAM requirements • Algorithm : very simple (200x200x200 cells) transition rules between cells of different states (transport, deposition ...). INFSO-RI-031688 Geoinformatics Workshop – M. Petitdidier –8 March 2007 Landscape Evolution Enabling Grids for E-sciencE INFSO-RI-031688 Geoinformatics Workshop – M. Petitdidier –8 March 2007 SpecFEM3D [1] Enabling Grids for E-sciencE (moguilny@ipgp.jussieu.fr) • Resolution of regional scale seismic wave propagation problems to model wave propagation at high frequencies and for complex geological structures, with use of the spectral-element method. • Application first written by D. Komatitsch (Université de Pau), used in more than 75 laboratories in the world,especially for the study of earthquakes. • Very scalable application using F90 + MPI needing NFS mounted homes and requiring the successive launch of two mpirun on the SAME nodes, allocated in the SAME order. • Project: to be distributed on EGEEII to authorized users INFSO-RI-031688 Geoinformatics Workshop – M. Petitdidier –8 March 2007 Hydrology Enabling Grids for E-sciencE Management of water resources in Mediterranean area (SWIMED): G. Lecca (CRS4 Italy), P. Renard (Unine, CH), J. Kerrou (INAT, Tunisia), R. Ababou (IMFT, Fr) CODESA-3D: Density-dependent 3D coupled groundwater flow and transport simulations Data requirement: Geology, Topography, Meteorology, Water extraction by the farmer, Aquifer properties (Soil maps, Land use) GRID Impacts: Enhancing the data sharing among the field geologists, modelers, and water managers Allowing the water managers in Tunisia to investigate the potential impacts of management decision through a remote use of the grid : web interface - EUMEDGrid INFSO-RI-031688 Geoinformatics Workshop – M. Petitdidier –8 March 2007 Final remarks Enabling Grids for E-sciencE • In EGEEII improvement and/or implementation of functionalities more adapted to Earth Science – – – – License server Data management Workflow Portal • New tools needed to use the whole Grid potential – Due to Change in scale of computing power – Need of Exploration of huge data sets – Creation of Platform integrating web services, computing power, information systems…. • New conceptual approach of Earth Science – Role of Scientist § Interactive collaboration -> less duplication of development and/or adaptation § More time for new ideas, new research § Confidentiality of the research – Application development ( access to several large data sets, more CPUs…) DEGREE EU project is preparing the future INFSO-RI-031688 Geoinformatics Workshop – M. Petitdidier –8 March 2007 Dissemination and Exploitation of GRids in Earth sciencE • Strategic objectives – Bridge the ES and GRID communities throughout Europe – Ensure that ES requirements are satisfied in next Grid generation – Ensure the integration of emerging technologies for managing ES knowledge The DEGREE team: IISAS, Slovakia (Coordinator) CNRS, France KNMI, The Netherlands UNINE, Switzerland CRS4, Italy SCAI, Germany GCRAS, Russia ESA-ESRIN, Italy CGG, France Dutch Space, The Netherlands http://www.eu-degree.eu Project Vision Build a bridge linking the ES and Grid communities SSA- IST 2005-034619 Geoinformatics Workshop – M. Petitdidier –March 2007 ESA Earth Observation Grid GRID infrastructure dedicated to Earth Science applications at ESA – 150+ CPUs, around 100 TBytes storage – Earth Science dedicated portal – 40+ applications demonstrated in ESRIN – Environment integrates specialized ESA developed toolboxes (BEAM, BEST, BEAT…) – Recently opened to selected external PIs via Call for Proposals Infrastructure is used to support GRID processing/data access applications as well as to support new technologies (Digital Libraries, e-collaboration …) SSA- IST 2005-034619 Geoinformatics Workshop – March 2007 Example of ESA GRID Applications (1) (A)ATSR volcano monitoring: Extract time series in large data archives (1 volcano, 3 years of data: 1h using 20 CPU) •Operational global MERIS, (A)ATSR, and SAR large scale mapping: Land, ocean, and atmosphere Automatic coregistration of (A)SAR time series: Extract time series in large data archives SSA- IST 2005-034619 Large scale (A)SAR / Landsat ortomapping: Multi resolution data storage; many hundred images processing / integration Geoinformatics Workshop – March 2007 First work Collect Earth Science applications already or not ported on a grid infrastructure –not super computer – Classification as a function of their complexity – Analysis of the requirements – Catalogue of requirements Other applications are welcome ….. SSA- IST 2005-034619 Geoinformatics Workshop – March 2007 Role of the other Workpackages Focus on specific tools or functionalities – data management – job control,workflow,.. – portals, etc, – => to provide a cross overview of what exists in the ES community and in Grid projects. – => to provide a list of potential tools adapted to ES requirements DEGREE is a SSA => tests to be done by other partners or in other projects like EGEE… SSA- IST 2005-034619 Geoinformatics Workshop – March 2007 DEGREE Objectives Roadmap and New Challenges – Develop the concept and specify the technology base for an Earth Sciences Grid Services Platform, taking inputs from the DEGREE technical WPs: • • • • ES Application requirements on the Grid middleware and services ES Application requirements for Grid data management ES Application management workflows and job control ES Portals requirements Identify new technical challenges and make recommendations for future actions – Particular emphasis on supporting and meeting the needs of: • ES Data Assimilation Application • Knowledge Infrastructures – Develop the roadmap for building and promoting a Grid Services Platform for new ES applications and value added services • Portals, workflow, data management • Identify new challenges ES Community Outreach – Generate new contacts through existing Earth Science research and industry bodies – Promote and advance complementary technologies for providing application services based on Grid, e-Collaboration, SOA, and Semantic Web SSA- IST 2005-034619 Geoinformatics Workshop – March 2007 Earth Science Expectations Pushing frontiers of scientific discovery by exploiting advanced computational methods. www.eu-degree.eu News Letter, example of ES applications … EGEEII User Forum, Manchester, UK – 9-11 May 2006 (www.eu-egee.org/uf2 SSA- IST 2005-034619 Geoinformatics Workshop – March 2007