Database Taskforce and the OGSA-DAI Project Norman Paton University of Manchester Outline Database Taskforce (DBTF): Databases and the Grid: who, what and why. context and capabilities. OGSA Database Access and Integration Project: what, where and when. Relevant Web Pages http://www.cs.man.ac.uk/grid-db Membership Malcolm Atkinson (NESC) Vijay Dialani (Southampton University) Norman Paton (Manchester University) Dave Pearson (Oracle UK) Tony Storey (IBM Hursley) Paul Watson (Newcastle University) Aims/Actions: Requirements To identify the requirements of database and information management applications within a Grid setting. Requirements analysis underway, with visits to Grid projects. Preliminary report by Dave Pearson tabled at GGF4 and on Web. Series of meetings at NESC. Aims/Actions: Roadmap To develop a roadmap for research and development of database functionalities in a Grid setting. Proposal involves a collection of Grid Database services. Initial collection of Grid Database Services tabled at GGF4 and on Web. Aims/Actions: Standards To propose standards for Grid/Database links through the GGF. Birds of a Feather Session run as part of GGF4. Hope to establish a Working Group before GGF5, involving wider community. Aims/Actions: Implementation To foster the development of reference implementations of proposed Grid standards. OGSA-DAI (Database Access and Integration Project just funded through Core Programme). GGF WG to encourage wider involvement with reference implementations. Aims/Actions: Architecture To advise the Architecture Task Force on architectural requirements for database support. Meet periodically with ATF. In line with ATF in terms of services focus. Aims/Actions: Education To encourage greater awareness of needs and best practice in respect to databases and the Grid. Requirements analysis building links with users. Series of meetings at NESC on Database/Grid issues. Input on use of developed software, plus good practice. Outline Database Taskforce (DBTF): Databases and the Grid: who, what and why. context and capabilities. OGSA Database Access and Integration Project: what, where and when. Databases in the Grid Data Complexity Computational Complexity Grid Middleware Combining Grid and Web Services composition frameworks (e.g. XCAT) Job Submission / Control Grid ssh File Transfer CORBA GRAM Data Management Monitoring Events …… Credential Management Workflow Management other services: •visualization •interface builders •collaboration tools •numerical grid generators •etc. Python, Java, etc., JSPs CoG Kits implementing Web Services in servelets, servers, etc. Apache SOAP, .NET, etc. Apache Tomcat&WebSphere &Cold Fusion=JVM + servlet instantiation + routing Resources Condor-G SRB/ Metadata Catalogue Data Replica and Metadata Catalog GridFTP Grid Monitoring Architecture Grid X.509 Certification Authority Grid Information Service Grid Web Service Description (WSDL) & Discovery (UDDI) MPI Secure, Reliable Group Comm. Grid Protocols and Grid Security Infrastructure Environment Management (LaunchPad, HotPage) Grid Services: Collective and Resource Access Grid Protocols and Grid Security Infrastructure http, https. etc. Problem Solving Environments (AVS, SciRun, Cactus) PDA Web Browser X Windows Discipline / Application Specific Portals (e.g. SDSC TeleScience) Web Services XML / SOAP over Grid Security Infrastructure Clients Application Portals Compute (many) Storage (many) Communication Instruments (various) Scope of Database Services Discovery. Statements. Delivery. Metadata. Transactions. Distributed queries. Statement Service For queries involves: The The The The query notation. query to be evaluated. parameters for the query. result delivery mechanism. Invoking a query operation: Prepares and validates the query, reporting back success or failure. Provides access to the results collection. Query Operation Specified here even more abstractly than in WSDL. Several operations: synchronous, asynchronous, etc. Service descriptions start off data model and language independent, then instantiated. query( IN queryNotation, IN query, OPTIN values, OPTIN expires, INOUT resultHandle, OUT fail) Outline Database Taskforce (DBTF): Databases and the Grid: who, what and why. context and capabilities. OGSA Database Access and Integration Project: what, where and when. OGSA-DAI Project OGSA - Data Access and Integration Services project. IBM (Tony Storey, Brian Collins). Oracle (Dave Pearson). NESC/EPCC (Malcolm Atkinson, Rob Baxter). Manchester University (Norman Paton). Newcastle University (Paul Watson). £1.5 Million (plus matching industrial contributions) over 18 months. Project Plan Well defined deliverables for first 6 months: Core Relational Services. Core XML Services. Less well defined deliverables over following 12 months: Intercept emerging GT3 releases. Comprehensive delivery services. Metadata. Specialist data types. Distributed queries. Short Term Deliverables GGF5: DAI Framework Document. Instantiation for RDBs and XML repositories. Provisional dates for core services (given resolution of licensing/collaboration agreements): XML RDB Earlyadopters 1st May 1st July Public 19th July 1st Sept Short Term Tensions Multiple database paradigms/platforms. Multiple language interfaces. Users contexts: Web services. Globus 2. Globus 3. Relationship to GGF WG To be established, but: Contribute insights into design of DAI services. Contribute to documents that feed into the GGF standardisation activity. Provision of reference implementations of standard proposals. Contribute to the development of OGSA proposal as early adopters. Relationship to other UK eScience Projects To be established, but: Seeks to address requirements found in (UK) e-Science projects. Development activity designed to provide core functionality early. Formal relationship with some as early adopters (myGrid, AstroGrid). Could be possible to collaborate on certain development activities. Summary: DBTF Interfaces Working Group Recommendations Requirements Communitybuilding Database Service Development