Funding Sources for Academic Research Nearly all academic research in the UK is funded by the government through Research Councils. There are 6 Research Councils in total: EPSRC – Engineering and Physical Sciences Research Council NERC – Natural Environment Research Council PPARC – Particle Physics and Astronomy Research Council BBSRC – Biotechnology and Biological Science Research Council ESRC – Economic and Social Research Council MRC – Medical Research Council And: CCLRC – Council for the Central Laboratory of the Research Councils The UK e-Science Programme Kerstin Kleese van Dam (For Tony Hey Director of UK e-Science Core Programme Tony.Hey@epsrc.ac.uk) e-Science and the Grid ‘e-Science is about global collaboration in key areas of science, and the next generation of infrastructure that will enable it.’ John Taylor Director General of Research Councils Office of Science and Technology GRID Vision Computing resources Instruments Complex problem Data Knowledge GRID Solution People The Grid as an Enabler for Virtual Organisations Ian Foster, Carl Kesselman and Steve Tueke • ‘The Grid is a software infrastructure that enables flexible, secure, coordinated resource sharing among dynamic collections of individuals, institutions and resources’ • includes computational systems and data storage resources and specialized facilities • Enabling infrastructure for transient ‘Virtual Organisations’ UK e-Science Initiative: First Phase • £120M Programme over 3 years from April 2001 • £75M is for Grid Applications in all areas of science and engineering • £10M as first installment for UK HPC(X) • £35M ‘Core Program’ to encourage development of generic ‘industrial strength’ Grid middleware Require £20M additional ‘matching’ funds from industry UK e-Science Programme Director’s Awareness and Co-ordination Role Director’s Management Role Generic Challenges Pilot Application Programme PPARC (£26M) BBsrc (£8M) MRC (£8M) NERC (£7M) Esrc (£3M) EPsrc (£17M) CLRC (£5M) Research Councils (£74M) EPsrc (£15M), DTI (£20M) Collaborative projects Industrial Collaboration (£20M) Technical Advisory Group UK e-Science Management DGRC/CERCs e-Science Steering Committee Research Councils e-Science Directors Relevant National/ International bodies: e.g. JISC, CERN CEO/EPsrc Director e-Science Core Programme e-Science Support Based at EPsrc and at DTI Technical Advisory Group Core Programme Project Teams Excerpt from e-Science CP Director’s job objectives ‘Develop effective collaborative Core Programme projects between the science base, industry and national funding agencies, and ensure the application and outcomes from the projects.’ UK e-Science Projects £75M for e-Science Grid Application ‘pilots’ - spanning all sciences and engineering Particle Physics and Astronomy (PPARC) - £17M GridPP and £5M AstroGrid Engineering and Physical Sciences (EPSRC) - funding 6 projects at around £3M each Biology, Medical and Environmental Science - funding projects with total value of £23M UK Grid Projects: First Phase (1) Particle Physics and Astronomy (PPARC) • GRIDPP • ASTROGRID Engineering and Physical Sciences (EPsrc) • Comb-e-Chem • DiscoveryNet • GEODISE • myGrid • RealityGrid Comb-e-Chem Project Video Simulation Diffractometer Properties Analysis Structures Database X-Ray e-Lab Properties e-Lab Grid Middleware GEODISE Project Engineer GEODISE PORTAL Knowledge repository Ontology for Engineering, Computation, & Optimisation and Design Search Reliability Security QoS Visualization Session database Traceability OPTIMISATION Globus, Condor, SRB OPTIONS System Optimisation archive APPLICATION SERVICE PROVIDER Intelligent Application Manager CAD System CADDS IDEAS ProE CATIA, ICAD COMPUTATION Licenses and code Analysis CFD FEM CEM Design archive Parallel machines Clusters Internet Resource Providers Pay-per-use Intelligent Resource Provider Computational science • Molecular dynamics • Mesoscale modelling • High throughput experiments • High performance visualization • Computational steering • Terascale parallel computing myGrid Project • Imminent ‘deluge’ of data • Highly heterogeneous • Highly complex and inter-related • Convergence of data and literature archives Discovery Net Project In Real Time Scientific Information Scientific Discovery Real Time Integration Workflow Construction Literature Databases Operational Data Dynamic Application Integration Interactive Visual Analysis Using Distributed Resources Images Instrument Data How It Works Interactive Editor & Visualisation Nucleotide Annotation Workflows Download sequence from Reference Server Inter Pro SMART KEGG EMBL NCBI SWISS PROT TIGR SNP GO Save to Distributed Annotation Server 1800 clicks 500 Web access 200 copy/paste 3 weeks work in 1 workflow and few second execution Execute distributed annotation workflow UK Grid Projects: First Phase (2) Natural Environment Applications (NERC) • Climateprediction.com • Oceanographic Grid • Molecular Environmental Grid • NERC DataGrid (with CP) Biotechnology and Biological Sciences (BBsrc) • Biomolecular Grid • Proteome Annotation Pipeline • High-Throughput Structural Biology • Global Biodiversity BioSim GRID 1st Level Metadata – Describing the Simulation Data… York Nottingham Level Metadata – Describing the Results of Generic Analyses… 2nd Birmingham Oxford RAL distributed ‘raw’ data London … Southampton Structure of the proposed biosimulation database A biosimulation GRID for the UK Integrating Different Levels of Simulation molecular cellular organism Sansom et al. (2000) Trends Biochem. Sci. 25:368 An e-science challenge – non-trivial NASA IPG as a possible paradigm Need to integrate rigorously if to deliver accurate & hence biomedically useful results Noble (2002) Nature Rev. Mol. Cell.Biol. 3:460 UK Grid Projects: First Phase (3) Medical Applications (MRC) • Biology of Ageing (with BBsrc) • Sequence and Structure Data • Molecular Genetics • Cancer Management (with PPARC) • Clinical e-Science Framework • Neuroinformatics Modeling Tools CLEF - Clinical e-Science Framework Partners: • AstraZeneca, GSK, BMJ Publishing Group • CSW Informatics, iSoft plc, Sun Microsystems Limited • UK National Health Service – – – – NHS Information Authority Stakeholder Relations Camden & Islington Health Authority Central Manchester and Manchester Childrens' Health Authority Royal Brompton and Harefield NHS Trust • Universities of Cambridge, Manchester, Freiburg and University College London CLEF - Integrating information • High quality, integrated clinical information is key to: – clinical research – evidence-based health care – the clinical application of genetic and genomic research • Capture, integration, and presentation of descriptive information is a major barrier to achieving an integrated framework • Data includes: – – – – clinical histories radiology and pathology reports annotations on genomic and image databases technical literature and Web based resources e-Science and Grid Middleware ‘e-Science is about global collaboration in key areas of science, and the next generation of infrastructure that will enable it.’ John Taylor Requirements of e-Science Grid Application Projects determine services required by Grid middleware UK Projects focus more on Grid Data Services than Teraflop/s HPC systems e-Science Core Program: First Phase £15M OST + £20M DTI + £20M Industry 1. Network of e-Science Centres UK e-Science Grid 2. Support for e-Science Applications 3. Grid Network Issues 4. Generic/Industrial Grid Middleware 5. e-Health Grid ‘Grand Challenges’ 6. Outreach/International Activities UK e-Science Grid Edinburgh Glasgow Newcastle Belfast Manchester DL Cambridge Oxford Cardiff RAL London Southampton Hinxton UK e-Science Grid • All e-Science Centres donating resources plus four dedicated compute/data clusters – Supercomputers, clusters, storage, facilities • All Centres run same Grid Software – Starting point is Globus 2 and Condor: Storage Resource Broker (SRB) • Standard Grid middleware supported – e-Science Grid now at ‘Level 2’: moving towards production Grid with real users Access Grid – Group Conferencing Multi-site group-to-group conferencing system Continuous audio and video contact with all participants Globally deployed All UK e-Science Centres have AG rooms Widely used for technical and management meetings Support for e-Science Projects • Grid Support Centre in operation – supported Grid middleware & users – see www.grid-support.ac.uk • National e-Science Institute – Research Seminars – Training Programme – See www.nesc.ac.uk • National Certificate Authority – Issue digital certificates for projects – Goal is ‘single sign-on' Anatomy of a Digital Certificate Public Key ABCDEFGHIJKLMNOPQRSTUV A text string Validity Data Signature from CA’s private key Extensions How a certificate is issued • The Registration Authority (RA) approves a request for a certificate. The RA is local to the users. • The CA then issues the corresponding certificate. How does it work? 1. Scientist wishes to access a resource, so he sends a copy of the certificate to the resource 2. Resource says: prove it’s your certificate Challenge Private Key 3. Scientist proves that he has the corresponding private key 4. Resource is convinced that scientist is who he claims to be and decides to give him access Response UK CA Statistics, February 2003 • • • • • • 250 valid certificates issued 24 RAs (more waiting for approval/training etc) Issuing 60 certificates /month Adding 3 RAs / month Adding 6 RA operators /month UK certificates recognized by EU and US projects Grid Network Team • Expert group to identify end-to-end network bottlenecks and other network issues - e.g. problems with multicast for Access Grid • Identify e-Science project requirements • Funding (with PPARC and EPSRC) a number of network QoS, scheduling and monitoring projects • ‘UKLight’ lambda connection to Chicago and Amsterdam now approved UK Backbone Infrastructure • Based on SuperJANET4 academic network run by UKERNA for JISC • WorldCom(!) providing national backbone for SJ4 – now at 20Gbps • Connections to universities via MANs at up to 2.5Gbps • ‘Last mile’ problem? • Research network use versus teaching, websearching, email – differential services? SuperJANET4 Access Grid Multicast One source sending same data to 3 receivers only has to have one copy of data (more copies are made only when necessary) Networking Research Projects GRID Infrastructure GRS, GRID resource management ‘ FutureGRID, P2P architecture Service Infrastructure Network Infrastructure GridMcast, Multicastenabled data distribution MB-NG, QoS Features GRIDprobe, backbone passive monitoring at 10Gbps CP Collaborative Industrial Projects: First Phase • • • • • 9 Centres with ring-fenced allocations £11M CP + £11M Industry funding £5M Open Call Projects All First Phase funds now committed Over 60 Companies involved CP Centre Projects 6 projects CeSC, 4 OeSC 5 NEReSC 4 NeSC), 5 SeSC 2 LeSC 5 WeSC 7 eSNW 5 BeSC Total of 43 projects 68 different companies Range of disciplines (IT, Engineering, Pharma, Environmental etc) New sectors engaged (broadcasting, defence, banking etc) Industrial Funds more than match DTI funds All Centres have spent money allocated or have projects under consideration CP Open Call Projects Visualization Middleware for e-Science e-Science Technologies in the Simulation of Complex Materials Performance-based Middleware for Grid Computing A scalable monitoring platform for the GRID (GridProbe) eDiamond distributed mammographic archive End-to-End traffic management services Information eXtraction from Images (IXI) Deductive Synthesis Techniques to the Rapid Assembly of Grid Applications Trustworthy GRID Resource Management A Grid-based approach to the validation and testing of lubrication models Self-Organising GRID Resource Management Jigsaw: Distributed and dynamic visualisation generation FutureGRID: a program for long-term research into GRID systems architecture Total of 13 projects OGSA – DAI Project • Design Specification completed – Papers for GGF WG on Database Access and Integration Services • Three Prototypes delivered: – Distributed Query Service – XML Database Interface – Relational Database Interface • Alpha versions delivered January 2003 – Integrate with Globus GT3 Open Grid Services Architecture • Development of Web Services • OGSA will provide Naming /Authorization / Security / Privacy/… Projects looking at higher level services: Workflow, Transactions, DataMining, Knowledge Discovery… Exploit Synergy: Commercial Internet with Grid Services IRC ‘Grand Challenge’ Projects • Equator: Technological innovation in physical and digital life • AKT: Advanced Knowledge Technologies • DIRC: Dependability of Computer-Based Systems • MIAS: From Medical Images and Signals to Clinical Information e-Health Grid ‘Grand Challenges’ • Grid-Enabled Knowledge Services for Medical Informatics - Triple Assessment in Breast Cancer: Clinical, Radiological and Cytological data fusion • Grid-based Medical Devices for Everyday Health - Patient sensors, mobile wireless communication • eDiamond Digital Mammography - Normalized archive of mammograms - Oxford, IBM (£2M), Mirada and Hospitals eDiamond Mammograms have different appearances, depending on image settings and acquisition systems SMF is a normalised representation independent of scanner settings eDiamond Training and Differential Diagnosis Applications of SMF Teleradiology and QC VirtualMammo “Find one like it” ? Advanced CAD SMF-CAD workstation Epidemiology SMFcomputed breast density International Involvement • Funding UK participation in the Global Grid Forum Research/Working Groups • Funding for International CS ‘Grid Fellowships’ – CERN DataGrid and USA iVDGL • International members on TAG • Participation in EU FP5 Grid Activities – e.g. EU DataGrid and DataTAG projects • Development of FP6 Grid Projects – First call closes April/May – EGEE, EU Open Middleware Infrastructure Institute? e-Science Demonstrators • • • • • • • • • Dynamic Brain Atlas Biodiversity Chemical Structures Mouse Genes Robotic Astronomy Collaborative Visualisation Climateprediction.com Medical Imaging/VR Seamless Access to Multiple Databases UK e-Science Funding First Phase: 2001 –2004 • Application Projects – £74M – All areas of science and engineering • Core Programme – £35M – Collaborative industrial projects Second Phase: 2003 –2006 • Application Projects – £96M – All areas of science and engineering • Core Programme – £16M + £25M (?) – Core Grid Middleware Core Programme 2 Overall Rationale: Four major functions of CP – Assist development of essential, wellengineered, generic, Grid middleware usable by both e-scientists and industry – Provide necessary infrastructure support for UK e-Science Research Council projects – Collaborate with the international e-Science and Grid communities – Work with UK industry to develop industrial-strength Grid middleware Core Programme 2 1. 2. 3. 4. 5. 6. 6 Key Activities for Second Phase UK e-Science Grid/Centres and e-Science Institute Grid Support Centre and Network Monitoring Core Middleware engineering National Data Curation Centre e-Science Exemplars/New Opportunities Outreach and International involvement Core Grid Middleware • Need to develop open source, open standard compliant, Grid Middleware stack that will integrate and federate with industrial solutions • Software Engineering focus as well as R&D Aim is to produce robust, well-documented, re-usable software that is maintainable and can evolve to embrace emerging Grid Service standards Major focus of Core Programme 2 National Data Curation Centre • In next 5 years e-Science projects will produce more scientific data than has been collected in the whole of human history • In 20 years can guarantee that the operating and spreadsheet program and the hardware used to store data will not exist Need to research and develop technologies and best practice for curating digital data Need to liaise closely with individual research communities and data archive centres Director General OST HPC Centres Research Council Pilots CCLRC Projects e-Science Operations Committee e-Science EPsrc/DTI Steering Finance Committee Grid Support Team 4 IRC +Projects DIRECTOR CORE PROGRAMME Deputy Director Technical Advisory Group International Grid Network Team Reports 9 Grid Demos National Centre Programme Open Call Projects NHSNet NERC DTI “HEFCE” Keyworth Dir Gen MRC Pilots Outreach OST SR2002 Pilots Hinxton BBsrc Web sites Bid HPC Pilots Publicity e-Science Esrc Centres 4 IRCs EPsrc/DTI Information Steering Pilots 5 Projects Finance Committee CCLRC 9 Grid e-Science e-Science Projects Demos Institute Operations EPsrc 8 Regional Director Committee Pilots Centres Core Programme National £20M of Centre PPARC Deputy Director Grid 50 Projects Pilots Technical Advisory CCLRC Support CCLRC Open Open Group RAL & DL RAL & DL Team CERN CallProjects Projects Call ICT ICT Grid Grid Suppliers Suppliers GEANT GEANT Reports Grid Reports International Network Grid Network USERS EU Gridnet Team Team Security USERS Gridnet Security Framework Grid Grid Taskforce UKERNA Taskforce UKERNA Projects Fellowships Fellowships Architecture Architecture JISC JISC Other Other Taskforce Taskforce Data Base Base International International Data Taskforce Projects Network Projects Network Taskforce Monitoring Global GlobalGrid Grid Monitoring Forum Forum USUS Players Players NHSNet NERC DTI “HEFCE” Keyworth Dir Gen MRC Pilots Outreach IBM Qinetiq OST Pilots Hinxton SR2002 BBsrc Web sites Microsoft Data Systs Bid HPC Pilots Sun Roche Publicity e-Science Esrc Centres 4 IRCs EPsrc/DTI Logica BMT Information Steering Pilots 5 Projects Finance SGI CCDC Committee CCLRC 9 Grid e-Science BAE Systems Fujitsu e-Science Projects Demos Institute Operations Rolls Royce Met Office EPsrc 8 Regional CFS Cons Committee Welcome Director Pilots Centres Compaq BP Core Programme National Oracle Pallas £20M of Centre PPARC Deputy Director AVS Grid Platform 50 Projects Pilots Technical Advisory CCLRC SupportAvaki RTZ Open Group RAL & DL Entropia Epistemics Team CERN Call Projects ICT HP Fluent Industry Grid Suppliers ABB BNFL GEANT Reports Grid & Commerce International Network Bayer Delta Dot USERS EU Gridnet Team Security Intel RVCO ltd Framework Grid Pfizer Taskforce Infosense UKERNA Projects Fellowships NAG Merck Architecture Avantium JISC AstraZeneca Other Taskforce GSK Unilever Data Base International Taskforce Network Projects Technical Monitoring Global Grid Advisory Group Forum US Players NHSNet NERC DTI “HEFCE” Keyworth Dir Gen MRC Pilots Outreach IBM Qinetiq OST IBM Pilots Hinxton SR2002 BBsrc Web sites Microsoft Data Systs Bid HPC Pilots Sun Roche Publicity e-Science Esrc Centres 4 IRCs EPsrc/DTI Logica BMT Information Steering Pilots 5 Projects Finance SGI CCDC Committee CCLRC 9 Grid e-Science BAE Systems Fujitsu e-Science Projects Demos Institute Operations Rolls Royce Met Office EPsrc 8 Regional CFS Cons Committee Welcome Director Pilots Centres Compaq BP Core Programme National Oracle Pallas £20M of Centre PPARC Deputy Director AVS Grid Platform 50 Projects Pilots Technical Advisory CCLRC SupportAvaki RTZ Open Group RAL & DL Entropia Epistemics Team CERN Call Projects ICT HP Fluent Industry Grid Suppliers ABB BNFL GEANT Reports Grid & Commerce International Network Bayer Delta Dot USERS EU Gridnet Team Security Intel RVCO ltd Framework Grid Pfizer Taskforce Infosense UKERNA Projects Fellowships NAG Merck Architecture Avantium JISC AstraZeneca Other Taskforce GSK Unilever Data Base International Taskforce Network Projects Technical Monitoring Global Grid Advisory Group Forum US Players NHSNet NERC DTI “HEFCE” Keyworth Dir Gen MRC Pilots Outreach OST SR2002 Pilots Hinxton USA BBsrc Web sites Bid France HPC Pilots Publicity e-Science Esrc Germany Centres 4 IRCs EPsrc/DTI Information Steering Pilots Brazil 5 Projects Finance Committee CCLRC 9 Grid e-Science Holland e-Science Projects Demos Institute Japan Operations China EPsrc 8 Regional Director Committee Pilots Centres Core Italy Programme National Scandinavia £20M of Centre PPARC Deputy Director Australia Grid 50 Projects Pilots Technical Advisory Switzerland CCLRC Support Open Group Austria RAL & DL Team CERN Call Projects Singapore ICT Grid Belgium Suppliers GEANT Reports Grid International Network Canada USERS EU Gridnet Team Ireland Security Framework Grid Poland Taskforce UKERNA Projects Fellowships Spain Architecture SouthJISC Other Taskforce America Data Base International Taskforce Network Projects Monitoring Global Grid Forum US Players NHSNet NERC DTI “HEFCE” Keyworth Dir Gen MRC Pilots Outreach OST SR2002 Pilots Hinxton BBsrc Web sites Bid HPC Pilots Publicity e-Science Esrc Centres 4 IRCs EPsrc/DTI Information Steering Pilots 5 Projects Finance Committee CCLRC 9 Grid e-Science e-Science Projects Demos Institute Operations EPsrc 8 Regional Director Committee Pilots Centres Core Programme National £20M of Centre PPARC Deputy Director Grid 50 Projects Pilots Technical Advisory CCLRC Support Open Group RAL & DL Team CERN Call Projects ICT Grid Suppliers GEANT Reports Grid International Network USERS EU Gridnet Team Security Framework Grid Taskforce UKERNA Projects Fellowships Architecture JISC Other Taskforce Data Base International Taskforce Network Projects Monitoring Global Grid Forum US Players A viable Core Programme must have this scope and an infrastructure to support it! e-Science and the Grid ‘e-Science will change the dynamic of the way science is undertaken.’ John Taylor, 2001 Need to convince university IT Directors! e-Government and the Grid ‘[The Grid] intends to make access to computing power, scientific data repositories and experimental facilities as easy as the Web makes access to information.’ Tony Blair, 2002