Welcome!

http://www-conf.slac.stanford.edu/xldb07

One Day, Three Goals
1. Identify trends and major roadblocks related to building extremely large databases
2. Bridge the gap between users trying to build extremely large databases and database vendors
3. Understand if and how open source projects like the LSST Database can contribute to the previous two goals in the next few years

Things We Talked About
* Valuable data discarded due to scalability limits and cost
* Substantial commonalities between science & industry (pattern discovery, multi-dimensional aggregation, unpredictable query load, procedural language needs, …)
* Industry leading in scale, science leading in complexity of analytics
* Parallel, shared-nothing architectures on commodity clusters are becoming very popular
* Roadblocks: funding problems, vendor-user disconnect, science-academia disconnect
* Rebuilding, not reusing, software
* Gap between needs and what vendors offer is widening
* Structured and unstructured data coming together
* MapReduce popular, but lacks efficient joins

Things We Decided
* Conduct another workshop in ~1 year, 2-3 days, @SLAC
  – Don’t expand size much
  – By invitation only
  – Focus on experience sharing, commonalities that can be developed into community-wide requirements
* Try to set up smaller workshop and/or working group(s)
  – In particular science – db academics
    http://xldb.slac.stanford.edu/display/XLDB/SciDB
* Set up shared infrastructure
  – Initially wiki, possibly test-bed environments
* Try to define a standard benchmark focused on data-intensive queries

http://www-conf.slac.stanford.edu/xldb08

Two Days, Three Goals
1. Continue to understand major roadblocks related to extremely large databases with an emphasis on complex analytics
2. Continue bridging the gaps within the XLDB community including science, industry, database researchers and vendors
3. Build the open source SciDB community

It Is All About Ad-hoc Discussions
* You are expected to speak up too
  – But no sales speeches, please
* Discussions are not electronically recorded
* Detailed report will be released
  – Once OK’ed by workshop participants

Attendance – Rough Breakdown
xldb1  xldb2
   23     25   Data-intensive scientific users
   11     12   Data-intensive industrial users
   16     12   Vendors, incl. startups
    3     13   Academia, db research & programmers
   53     62   Total

Attendance – Rough Breakdown
If this group won’t make a difference, who will?
1. Big science
2. Big industries
3. All major DBMS vendors
4. Very promising startups
5. World-class DB researchers
6. Superstar DB programmers

Dinner
* Location
  – Sheraton Palo Alto
  – Driving directions available
* Reception
  – 7:00 pm – 7:30 pm
* Dinner
  – 7:30 pm – 10:00 pm
  – Buffet
* Cost
  – Free
  – Maybe except the valet parking
Make sure you wear your XLDB2 badge

BIG Thanks to Our Sponsors

Agenda
http://www-conf.slac.stanford.edu/xldb08/agenda.htm