Korea Institute of Science and Technology Information Visit to NeSC Malcolm Atkinson Director www.nesc.ac.uk 24th March 2004 Outline The National e-Science Centre Role and mission The e-Science Institute The UK e-Science Programme Funding and organisation The UK Grid The European dimension The Essence of e-Science NeSC Roles Help coordinate and lead UK e-Science Community building, training & outreach Help establish the UK’s international role The focus for presenting UK e-Science Run the e-Science Institute A meeting place Workshops and conferences Research visitors and events Undertake R&D projects Reliable middleware (OGSA-DAI, SunDCG, …) Engage industry (IBM, Sun, Microsoft, HP, Oracle, …) Stimulate the uptake of e-Science technology Training Team The Primary Requirement … Enabling People to Work Together on Challenging Projects: Science, Engineering & Medicine Events Held (from 1 Aug 2002 to 29 Feb 2004 – 31 months) We have run 197 events (just over 6 per month): 3 conferences (including GGF5 with 900 participants) 20 project meetings 16,444 delegate days 23 research meetings 197 events 61 workshops 4 schools 6,825 delegates 32 training sessions 339 event days 27 outreach events 9 international meetings 18 e-Science management meetings (though the definitions are fuzzy!) The Website – a Resource National e-Science Centre http://www.nesc.ac.uk/ Mission, Background, Foundation, Locations, Staff, Resources, Projects Register interest, Mailing lists, NeSCForge Regional associations and Collaborations News, Notices Presentations and Lectures http://www.nesc.ac.uk/presentations/ National e-Science Institute http://www.nesc.ac.uk/esi/ Mission, Events (Future and Past) Register for Events, Visitor Programme, GridNet UK e-Science Map and Index of Centres Technical Papers Index of >100 Projects Task Forces http://www.nesc.ac.uk/centres/ http://www.nesc.ac.uk/technical_papers/ http://www.nesc.ac.uk/projects/ http://www.nesc.ac.uk/teams/ General Information Glossary, Bibliography, Who’s who Comprehensive Widely used byand UK, Growing USA andSource the restofofInformation the world Web site Statistics Since going ‘live’ in 2001 to 31 Jan 2004 (figures in last week to 27 Feb 2004) > 3.0 million successful requests (‘hits’) transferring 166 gigabytes of data (2.20 GB) Average hits per day 3237 (5430) Distinct files served 18,980 (2821) … to 96,610 (3705) distinct hosts Average data transferred per day 182 MB (typical file size about 100 kB) (322 MB) Web site Statistics 2 18,000 200000 AFRICA AMERICA - NORTH 180000 AMERICA - OTHER 16,000 ASIA EUROPE - UK 160000 EUROPE - NON UK 14,000 MIDDLE EAST OCEANIA - PACIFIC UNKNOWN 140000 HITS 12,000 100000 8,000 80000 6,000 60000 4,000 40000 2,000 20000 2004/02 2004/01 2003/12 2003/11 2003/10 2003/09 2003/08 2003/07 2003/06 2003/05 2003/04 2003/03 2003/02 2003/01 2002/12 2002/11 2002/10 2002/09 2002/08 2002/07 2002/06 2002/05 2002/04 2002/03 0 2002/02 0 Successful Hits 10,000 2002/01 Volume (MB) 120000 Technical Reports The Virtual Observatory as a Data Grid, Bob Mann, Sep 03 UK Experience with OGSA, Dave Berry, Sep 03 e-Science Gap Analysis, Geoffrey Fox, David Walker, Jun 03 Scientific Data Mining, Integration and Visualisation, Bob Mann, Roy Williams, Malcolm Atkinson, Ken Brodlie, Amos Storkey, Chris Williams, Nov 02 A Rough Guide to Grid Security, Mike Surridge, Sep 02 Multi-Site Videoconferencing for the UK e-Science Programme, Stephen Booth, John Brooke, Kate Caldwell, Liz Carver, Michael Daw, David De Roure, Alan Flavell, Philippe Galvez, Brian Gilmore, Henry Hughes, Ben Juby, Ivan Judson, Jim Miller, Harvey Newman, Chris Osland, Sue Rogers, Oct 02 Database Access and Integration Services on the Grid, Norman W Paton, Malcolm P Atkinson, Vijay Dialani, Dave Pearson, Tony Storey, Paul Watson, Feb 02 Research Agenda for the Semantic Grid: A Future e-Science Infrastructure, David De Roure, Nicholas Jennings, Nigel Shadbolt, Dec 01 Databases and the Grid, Paul Watson, Dec 01 Technical Reports A Grid Application Framework based on Web Services Specifications and Practices, Savas Parastatidis, Jim Webber, Paul Watson, Thomas Rischbeck, Aug 03 Grid Information Systems 2003 (Draft), Rob Allan, Dharmesh Chohan, Xiao Dong Wang, Andy Richards, Mark McKeown, John Colgrave, Matthew Dovey, Mark Baker, Steve Fisher, Dec 03 Towards tractable toolkits for the Grid: a plea for lightweight, usable middleware, Jonathan Chin, Peter Coveney, Feb 04 Portals and Portlets 2003, Rob Allan, Chris Awre, Mark Baker, Adrian Fish, Mar 04 IMAGE 03: Images, Medical Analysis and Grid Environments, Dave Berry, Derek Hill, Steve Pieper, Joel Saltz, Cécile Germain-Renaud, Mar 04 Open Issues in Grid Scheduling, Alain Andrieux, Dave Berry, Jon Garibaldi, Stephen Jarvis, Djamila Ouelhadj, Mar 04 Data Provenance and Annotation, Peter Buneman, Michael Wilde. e-Science Workflow Services, Matthew Addis, Dave Berry, Earl Ecklund, Carole Goble Outline The National e-Science Centre Role and mission The e-Science Institute The UK e-Science Programme Funding and organisation The UK Grid The European dimension The Essence of e-Science UK e-Science Budget (2001-2006) Total: £213M EPSRC Breakdown MRC (£21.1M) 10% EPSRC (£77.7M) 37% HPC (£11.5M) BBSRC (£18M) 15% 8% NERC (£15M) 7% Applied (£35M) Staff 45% costs Grid Resources funded separately Core (£31.2M) 40% PPARC (£57.6M) 27% CLRC (£10M) 5% ESRC (£13.6M) 6% + Industrial Contributions Source: Science Budget 2003/4 – 2005/6, DTI(OST) The e-Science Centres Globus Alliance Open Middleware Infrastructure Institute Digital Curation Centre e-Science Institute Grid Operations Centre ? CeSC (Cambridge) EGEE Digital Curation Centre communities of practice: users curation organisations community support & outreach Collaborative Associates Network of Data Organisations services management & coordination research research collaborators development testbeds & tools Industry standards bodies Task Forces Directors’ Forum Helped build a community Engineering Task Force Built the UK Grid Architecture Task Force UK Adoption of OGSA OGSA Grid Market Future approaches Security Task Force Database Task Force (now disbanded) OGSA-DAI (www.ogsadai.org.uk) GGF DAIS-WG Usability Task Force The e-Science Grid Engineering Task Force (Contributions from e-Science Centres) 1280 x CPU AIX 512 x CPU Irix HPC(x) 20 x CPU 18TB Disk Linux Grid Support Centre / Grid Operations Centre OGSA Test Grid projects CeSC (Cambridge) 64 x CPU 4TB Disk Linux Access Grid Cameras Crucial for management meetings Requires IP multicast throughout the network Microphones Some UK e-Health Projects eDIaMoND (with IBM and Mirada) Breast Cancer Project IXI (with GSK and Philips Medical) Information from medical images MIAS Devices Mobile sensors for healthcare CLEF Integrating medical information eDiaMoND – Compute Mammograms have different appearances, depending on image settings and acquisition systems Standard Mammo Format Temporal mammography Computer Aided Detection 3D View eDiaMoND – Non-Functional Anonymisation Grid Screening Screening Screening Screening Diagnosis Diagnosis Diagnosis Teaching Training Teaching Teaching Epidemiology Epidemiology Epidemiology Epidemiology Ethics Legal Security Performance Scalability Manageability Auditability …… Lossless Compression Encryption 256MB & 5 secs response ~100 Centres Systems Administration Non-Repudiation KCL, Imperial and Oxford http://www.ixi.org.uk Automatic registration technology Rigid registration of MR and CT images of the head Inter-subject image warping CLEF - Integrating information Need high quality, integrated clinical information for: clinical research evidence-based health care the clinical application of genetic and genomic research The capture, integration, and presentation of descriptive information is a major barrier to achieving an integrated framework Data includes: clinical histories radiology and pathology reports annotations on genomic and image databases technical literature and Web based resources MIAS Devices Project Sensor bus Easy Plug and Play of Sensors Wireless connection using 802.11 Positioning information from GPS Mobile medical technologies on a distributed Grid GPS ariel The European dimension EGEE: Enabling Grids for E-Science in Europe … and beyond 32M Euro, 10 regions, 70 partners Additional funding from NSF (USA) 50% production, 30% development, and 20% dissemination and training “The Grid Infrastructure in Europe” Deploy a production Grid across Europe Initially based on LHC Computing Grid Examples of HealthGRID applications Grids for medical development Preparation and follow-up of medical missions in developing Clermont-Ferrand/Paris countries Support to local medical centres in terms of second diagnosis, patient follow-up and e-learning Ibagué Hand surgery Medical centre 2 missions (Ibagué & Chuxiong) with the french NPO « Chaîne de l’Espoir » used as test cases Chuxiong The grid impact : •Improved telemedecine services • Federation of patient databases •Interactive e-learning (high bandwidth network required) eHealth eScience DataGrid : status of biomedical applications Bio-informatics deployed tested on EDG under preparation Phylogenetics : BBE Lyon (T. Sylvestre) Search for primers : Centrale Paris (K. Kurata) Bio-informatics web portal : IBCP (C. Blanchet) Parasitology : LBP Clermont, Univ B. Pascal (N. Jacq) Data-mining on DNA chips : Karolinska (R. Médina, R. Martinez) Geometrical protein comparison : Univ. Padova (C. Ferrari) GATE MonteCarlo simulation platform for nuclear medecine Medical imaging Local_Monopro1500MHz X10 X20 X50 X100 160 Temps en minutes MR image simulation : CREATIS (H. BenoitCattin) Medical data and metadata management : CREATIS (J. Montagnat) Mammographies analysis ERIC/Lyon 2 (S. Miguet, T. Tweed) Simulation platform for PET/SPECT based on Geant4 : GATE collaboration (L. Maigne) 180 140 120 100 80 60 40 20 0 Parallelisation Outline The National e-Science Centre Role and mission The e-Science Institute The UK e-Science Programme Funding and organisation The UK Grid The European dimension The Essence of e-Science What is e-Science? Invention and exploitation of advanced computational methods to generate, curate and analyse research data From experiments, observations and simulations Quality management, preservation and reliable evidence to develop and explore models and simulations Computation and data at extreme scales Trustworthy, economic, timely and relevant results to enable dynamic distributed virtual organisations Facilitating collaboration with information and resource sharing Security, reliability, accountability, manageability and agility e-Science >> Grid & Web Services It is what you do with them that counts Fundamental & Growing Assets Understanding of Processes & Requirements International and Multi-disciplinary Skill base Experience composing & adapting existing technologies and of building new components Experience Supporting Developers and Users Experience Establishing Virtual Organisations across Enterprise boundaries Embedded in People & Teams, Growing – they need nurture Primary Multi-Enterprise Issues Combining subsystems built independently in different enterprises Deploying, Starting and Managing Applications and Production Operations Assume Benefits of Shared Infrastructure: Using a set of combined facilities How much? Independently built One size fits all? Autonomously managed Developing software independently Expecting to integrate later All for VO communities that retain independence Relative Importance What envelopes you put your messages in How they are delivered Infrastructure to organise a common technical platform – the foundations of communication Relative Importance What envelopes you put your messages in How they are delivered Infrastructure to organise a common technical platform – the foundations of communication What information you send in your messages Their patterns of Use - sequences that mean something Their Contents The Grammar and Vocabulary of Communication Agreed Interpretations Relative Importance What envelopes you put your messages in Technical Experts How they are delivered Infrastructure to organise a common technical platform – the foundations of communication What information you send in your messages Their patterns of Use - sequences that mean something Their Contents The Grammar and Vocabulary of Communication Agreed Interpretations What you do when you get a message The Application Code you Execute The Middleware Services Security, Privacy, Authorisation, Accounting, Registries, Brokers, … Integration Services Multi-site Hierarchical Scheduling, Data Access & Integration, … Portals, Workflow Systems, Virtual Data, Semantic Grids Tools to support Application Developers, Users & Operations Relative Importance What envelopes you put your messages in How they are delivered Infrastructure to organise a common technical platform – the foundations of communication What information you send in your messages Their patterns of Use - sequences that mean something Their Contents The Grammar and Vocabulary of Communication Agreed Interpretations What you do when you get a message The Application Code you Execute The Middleware Services Security, Privacy, Authorisation, Accounting, Registries, Brokers, … Integration Services Domain Specialists Multi-site Hierarchical Scheduling, Data Access & Integration, … Portals, Workflow Systems, Virtual Data, Semantic Grids Tools to support Application Developers, Users & Operations Creative Actions and Judgements of Researchers, Designers & Clinicians Data, Models & Analyses In Silico Experiments, Design, Diagnosis & Planning Creating the Scientific Record Where Next for e-Infrastructure Put people and teams first The creative force The repository of Experience, Skills and Knowledge Focus on Major Priorities Developing well-defined Flexible Agreements Embraced as standards High-level Software Investment Applications & Requirements led Explore & Evolve Common & Shared Infrastructure Recognise and respond to differences Celebrate and support commonalities