OGSA-DAI Users’ Meeting Introduction Malcolm Atkinson Director www.nesc.ac.uk 7th April 2004 Overview Today’s Meeting Users meet each other We hear Users’ Complaints, Requirements & Priorities Users understand better the limits of one team’s work We jointly identify the way ahead OGSA-DAI Vision What we aspire to achieve Short, medium and long term Relationship with GGF DAIS WG Standardisation effort OGSA-DAI Today Current and Imminent Releases The Road Map Which you — the Users — should influence OGSA-DAI History Lesson DB & BiGUM meetings August 2001 — Paul Watson paper — prime mover Database Task Force 01Q3 – 02Q3 Norman Paton, Dave Pearson & Paul Watson http://www.cs.man.ac.uk/grid-db/dbtf.html Architecture Task Force 01Q3 – 02Q2 Urged UK engagement in OGSA OGSI Sep 01 to Sep 03 http://www.nesc.ac.uk/teams/atf/01-03/ OGSA-DAI project Start Feb 02 – end Sept 03 Produced releases R1 to R3.1 Produced Alpha Release of DQP http://www.ogsadai.org.uk/ DAIT project Start Nov 03 – end ~ March 06 Continues OGSA-DAI development • GGF4 Toronto March 02 • GGF5 July 02 •DAISEdinburgh BoF •DAIS launched •OGSI WG announced • GGF10 Berlin March 04 •WS-Resource Framework OGSA-DAI Partners IBM USA EPCC & NeSC Glasgow Newcastle Belfast Daresbury Lab Manchester Oxford Cambridge Oracle RAL Cardiff London IBM Hursley Southampton Hinxton $5 million, 20 months, started February 2002 Additional 24 months, starts October 2003 OGSA-DAI Vision 1 There are many sources of Data Structure is the essence of using data Structure supported by DBMS: Relational & XML Structure also recognised in files: BinX & GGF DFDL Sources of Data are Autonomously Managed Vary in model, structure, schema Vary in content Vary in Access & Use policies Vary in Platform Discovery & Design enabled by Combining Data From multiple sources Recorded in multiple data resources OGSA-DAI Vision 2 Developing applications using heterogeneous data sources is difficult: Dealing with all those variations Negotiating with data owners and data platforms Communication protocols and data movement Data translation Distributed operations Scale, reliability, coping with change, failure recovery, security, privacy … Currently solved incrementally on a per application basis OGSA-DAI to the rescue OGSA-DAI Vision 3 OGSA-DAI will provide a generic framework Standard components for dealing with common requirements Query, update, bulk load, schema edit, metadata access, transactions, … Agreed protocols and standard agreements with DBMS Established mechanisms for handling owner policy requirements Provides data translation Supports distributed operations Delivers Scale, reliability, coping with change, failure recovery, security, privacy … Costs of developing a robust OGSA-DAI amortised over many applications Development of Applications significantly accelerated Accepted de facto standard by DBMS vendors and data owners Efficiency from direct support and system or resource knowledge Collaborating International Community Extending the Framework E.G. the work of OGSA-WebDB OGSA-DAI Vision 4 Progressively develop the functionality Progressively raise level of operations Progressively improve quality Scale, Performance, Reliability, Flexibility, Usability Adapt to changing context Support user community Training, Documentation, Consultation, Collaboration Place High Value on Users’ & Application Developers’ Ingenuity They know about the data resources They invent new ways of combining and analysing data Therefore Extensibility and Flexibility MUST always be supported OGSA-DAI today 1919 Downloads so far from http://www.ogsadai.org.uk/ Release 4: April 04 from http://www.ogsadai.org.uk/ Release 5: October 04 from http://www.ogsadai.org.uk/ Releases 6&7: 2005: Many options — You must help us choose Download analysis Downloads by Domain gov, 13 at, 44 it, 12 br, 11 ru, 9 net, 73 de, 87 unresolved, 674 jp, 126 edu, 133 com, 170 uk, 487 And Now … Users’ Presentations What are you doing with our “baby” What do you like What do you want changed, extended or added What consensus is there Team Presentations More details of current and planned release The issues that face us Choosing Interoperation Platforms Choosing which requirements get priority Working Groups Collation of the Groups’ conclusions Ideas for Future User Group Activities URGENT Questions ? Reserve Slides Follow OGSA-DAI Behaviours Request to Registry for sources of data about “x” Registry responds with Factory handle Analyst SOAP/HTTP Registry GDSR service creation API interactions Request to Factory for access to database Factory returns handle of GDS to client Factory GDSF Factory creates GridDataService Client queries GDS with SQL, XPath, XQuery etc Query results returned XML OR delivered to consumer as XML Consumer Grid Data Service GDS Database (Xindice, MySQL Oracle, DB2) GDS interacts with database Delivery Patterns Retrieve 1. Update/Insert 4. Q A Q+U 7. A G Pipeline Q1 G1 = P G S S+R S1 A 2. G A S 6. Q A C Q1 + D P U S1 A Q U/R Q2 I A D R G1 = P 8. G S G2 = C G R C S2 Q+D S I I P A 3. U 5. Q+D U/R Q2 + D G S S2 G2 = C Statement Operations Receives statement Performs statement against a data resource Returns results Statement Statement Component and Database Connections Data Resource Results Statement Document <?xml version="1.0" encoding="UTF-8" ?> <GridDataServiceScript> <Header> … Name, versioning, configuration specification </Header> <Body> <Documentation>Update with data delivered with the script</Documentation> <DefineParameter name="table">myimages</DefineParameter> <DefineParameter name="id">id = 2</DefineParameter> <Statement name="xyz" dataResource="MyDataResource"> SELECT * FROM <UseParameter reference="table" /> WHERE <UseParameter reference="id" /> </Statement> <Delivery name="delivery1"> <Mechanism type="bulk" /> <Mode type="full" /> <From>xyz</From> <To>response</To> </Delivery> <Execute name="execute1">xyz</Execute> </Body> </GridDataServiceScript> Statement Architecture Engine/GDS-S Statement Statement Activity Statement Handlers JDBC Statement Handlers XMLDB Statement Handlers Relational DB JDBC Connection XMLDB Connection JDBC API XML:DB API XML DB OGSA-DAI Security Grid Security Infrastructure Authentication The role mapper takes credentials and a database name and returns a dbRole dbRole contains username and password Credentials Database Name Role Mapper username dbRole password Data Access & Integration Services 1a. Request to Registry for sources of data about “x” SOAP/HTTP Registry 1b. Registry responds with Factory handle service creation API interactions 2a. Request to Factory for access to database Factory Client 2c. Factory returns handle of GDS to client 3a. Client queries GDS with XPath, SQL, etc 3c. Results of query returned to client as XML 2b. Factory creates GridDataService to manage access Grid Data Service XML / Relationa l database 3b. GDS interacts with database Future DAI Services? 1a. Request to Registry for sources of data about “x” & “y” 1b. Registry responds with Factory handle Data Registry SOAP/HTTP service creation API interactions 2a. Request to Factory for access and integration from resources Sx and Sy Data Access & Integration master 2c. Factory returns handle of GDS to client 3b. Client Problem tells“scientific” Solving analyst Client Application Environment coding scientific insights Analyst 2b. Factory creates Semantic GridDataServices network Meta data 3a. Client submits sequence of scripts each has a set of queries to GDS with XPath, SQL, etc GDTS1 GDS GDTS XML database GDS2 Sx 3c. Sequences of result sets returned to analyst as formatted binary described in a standard XML notation Application Code GDS GDS1 Sy GDS3 GDS GDTS2 GDTS Relational database