Server 7 Launch Japan 1/21/99 gray 1

Microsoft SQL Server™ 7.0
  Jim Gray
  Senior Researcher, Microsoft Research
  Microsoft Corporation

Organizations Are Going Online
  Building a digital nervous system.
  Inexpensive hardware means huge databases are possible.
  But we are drowning in data.
  Databases help organize information.
  Microsoft’s goal: Information at your fingertips.
  Make it easy to capture, manage, and analyze information.

Microsoft SQL Server 7 Goals
  Reduce cost of ownership:
    Easy to build applications
    Easy to administer
  Scalable
    Easy to scale up applications
    Scale down to portables
    Scale to many nodes (manage, replicate)
    Scale to huge nodes
  Data warehousing
    Data capture
    Data analysis

Microsoft SQL Server
  Enterprise: Terabytes, 1,000s users; Large Memory; Clusters; Partitioned cubes
  Standard: Gigabytes, 100s users; Full featured
  DeskTop: Megabytes, < 5 users; simple replication
  + Microsoft Data Engine in Office 2000
  Need W2K logo

Easy
  [Sections: Easy, Scalability, Data Warehousing]
  Dynamic self management
  Multi-site management
  Operation Scripting
  Job scheduling and execution
  Alert/response management
  DBA profiling/tuning tools
  Installation and upgrade
  Unicode
  Integrated with NT Security
  Integrated with NT files

Made It (Even) Easier!
  Desktop, Workgroups
    Auto Configure
    Self-Configuring Engine / Dynamic Disk/memory
    Reduce Learning Curve, Increase Productivity
    Self-Managing
    SQLAgent, Wizards, “Task Pads”
  Large Organizations
    Deploy/manage “100’s” of SQL Servers
    Lower TCO for Large Environments
    Multi-Server Operations / Remote Operation

Multi-Site Management
  Common Console
  Multi-site management through groups, agents
  Route events from all servers to one master
  Script any set of operations for all servers
  Job Scheduling and Workflow
  Shared sub-set of UI for web administration
  Graphical tools for data Import/export

Distributed Management Framework
  [Diagram labels, in slide order]
  Administration shell; Database Profiler; SQL Namespace Object (SQL-NS); DTS; SQL-DMO (SQL Distributed Management Objects)
  In-Process COM Server; 40+ Objects, 1600+ props/methods; ActiveX™ Scripting
  Agents
  MMC - SQL Server Snap-in
  Navigation + UI for namespace shells
  COM administration interfaces
  Microsoft Management Console
  Namespace objects
  MMC snap-in
  Supports all operations
  Self-managing component
  Activity monitoring/filters: Events; Queries/traffic; State changes in engine
  SQL Server Agent: Replication; Task Execution; Event Mgt; Alerting/Paging
  SNMP; Events; Actions
  SQL Server Engine
  Trace Events / Perf Mon

Dynamic Self Management
  Dynamic memory management
    48M Memory, 100 locks
    1 GB Memory, 10K locks
  Dynamic File Space Management with Auto-Statistics
    1/97, Sales history table 128 MB on disk
    12/97, Sales history table 4 GB on disk, auto-grow
    1/98, Sales history table 128 MB on disk, auto-shrink
  Auto-Statistics
    2 a.m., 5 users
    Noon, 700 users

Wizards and GUIs
  Many Wizards - over 50 at last count
  MS Access as a query interface
  Query tool built into SQL, Visual Studio,...
  Graphical show plan

DBA and Developer Tools
  SQL Server Profiler
    Select server events and trace criteria
    “Capture” output to screen or replay
  Index Tuning Wizard
    Analyzes actual server usage history
    Makes recommendations to improve performance

Demo: 5 minutes
  Tour of the wizards wizard (touch on create index, DTS, maintenance plan)
  Show database layout (auto schema)
  Show index tuning wizard (using pre-canned trace file)
  Let it define the indices

Index Wizard is Good but we will make it better
  On a complex query set, the wizard is 90% of the best expert.
  Extending it to other aspects of DB design

Scalability
  Win9x/NT4/Win2000 version
  Dynamic row-level locking
  VLDB improvements
  Improved query optimizer
  Intra-query parallelism
  VLM support
  Replication improvements
  Distributed query
  High Availability Clusters

Query Processor Enhancements
  Focus on Complex Queries
  Parallelism
  Improved scan, fetch, & sort
  Smart hash & merge join
  Large joins & grouping
  Better query optimization
  Multi-index operations
  Automatic statistics maintenance
  Distributed Query
  Heterogeneous Query

Parallel Query
  SMP & Disk Parallelism
  [Diagram: 50,000 rows on disks feed four Local Agg. steps (4 x 50 rows), which feed one Global Agg. step; Result: 50 rows]
  Aggregates: # of emp. per group; total inc. per group
  Plus Distributed
  Plus Hash Join (fanciest on the planet)
  Plus Optimized Partitioned views

Distributed Heterogeneous Queries
  Data Fusion / Integration
  Join spreadsheets, databases, directories, Text DBs etc.
  Any source that exposes OLE DB interfaces
  SQL Server as gateway, even on the desktop
  [Diagram: SQL 7.0 Query Processor connected to Directory Service; Database (DB2, VSAM, Oracle, …); Spreadsheet; Photos; Mail; Maps; Documents and the Web]

Replication
  Transactional and Merge
  Remote update
  ODBC and OLE DB subscribers
  Wizards
  Performance
  [Diagram labels: OS 390 DB2; Publisher; 2PC, RPC; Distributor; DB2; VSAM; CICS; Subscribers; Updating Subscriber (immediate updates)]

Utilities: The Key to LARGE Databases
  Backup, Recovery
    Fuzzy
    Parallel
    Incremental
    Restartable
    Fast
    File granularity
  Reorganize
    shrinks file
    reclusters file
  Auto-Repair
  Index creation ~2x faster than 6.5
  DBCC
    not required, a good practice
    5x - 100x faster

SQL Server 7.0 Backup
  [Chart: Transaction Throughput during online backup; transactions vs time (secs)]
    NO backup avg = 2475 tpm
    Backup avg = 2349 tpm
    95% of throughput maintained during full on-line backup (8 drives)
  [Chart: Parallel Backup Throughput (MBps)]
    Scaling at device speed
    Max tape backup speed: ~600 GB/hr

Recovery Speed Comparison
  4x faster!
  [Chart: Recovery time (secs) vs # of indices (1 to 4); series: SQL Server 6.5, SQL Server 7.0]

SQL Server 7.0 Scalability
  [Bar chart: SQL Server 6.5 vs SQL Server 7.0; values shown: 5700, 3537, 2400, 1011, <1500, <300; annotation: 95% of customer needs]
  Application: SAP R/3; Benchmark: SD Users 3-Tier
  Application: BAAN IV; Benchmark: Baan Reference Users
  Application: PeopleSoft HRMS; Benchmark: Users

Best Single Node NT TPC-C
  Published TPC-C results as of 1/20/99, www.tpc.org
  [Bar chart, tpmC; values shown: 6,393; 6,842; 7,650; 16,273; 23,143]
  DBMS: Oracle, Informix, Sybase, 6.5, 7.0
  System: 4X Pent-Pro, 200 MHz; 4X PII, Xeon
  $/tpmC: $108, $100, $82, $56, $26

The TPC Report
  621,784 $ for 5 years
  1.7 Tera Byte Disk
  4 CPU
  4 GB RAM
  18,850 users
  Affordable & Manageable

Price Performance Leadership
  250%/year on Price, 100%/year performance
  Best performance: 23,143 tpmC, $26/tpmC
  Best price/performance: 17,715 tpmC, $22/tpmC
  [Chart: tpmC vs time] 100 %/year improvement lately!
  [Chart: $/tpmC vs time] 250 %/year improvement!
  [Chart time axes: Jan-94 to Dec-98]

VLDB Improvements
  Terabyte Support
  File Groups for easier management
  Exabyte Database Size (theoretical max)
  Tested to 2.5 Terabytes
  64-bit support (up to 32 GB with NT5)

Demo
  http://www.TerraServer.com
  Microsoft BackOffice
  SPIN-2

Site Configuration
  StorageTek 9710 TimberWolf: 10 x DLT 7000
  Enterprise Storage Array
    4 NTFS Stripe Sets (600 GB)
    28 11-Disk RAID 5 Stripe sets
    324 9 GB Seagate Disks
    7 HSZ70 Ultra-SCSI Dual redundant Controllers
  Alpha 8400: (8x440), 10 GB RAM
  6 x Compaq Proliant 5500, each 4x200 MHz, 512 MB RAM, 20 GB RAID 5
  To the Web

File System Configuration
  28 hardware RAID 5 sets
    Each RAID set has 11 disks (16 spare drives)
  4 595 GB NTFS volumes
    Each striped over 7 RAID sets on 7 controllers
  26 20 GB files on F:, G:, H:, I:
  DB is File Group of 104 files (2 TB)

TerraServer Statistics
  [Chart: TerraServer June 22 1998 to Jan 18 1999; Count vs Date; series: Sessions, Hit, Page View, DB Query, Image]
  Billion SQL queries served
  99.98% of scheduled availability
  New data loaded in background

High Availability
  Microsoft Clusters
  Automatic failover in less than a minute
  [Diagram labels: Browser; Server 1; Server 2; Web site (x2); Web site files; Database files]

Data Warehousing
  Warehousing Framework
  Visual data modeler
  Microsoft repository
  Data transformation services (DTS)
  Plato & Dcube - Multi Dimensional Data Cubes
  Integration with Office 2000

Microsoft Data Warehousing Framework
  [Diagram labels, in slide order]
  Building
    Operational Data (ODBC, OLE-DB)
    DB Schema*
    Data Transformation Services (DTS)
    Transformation
    Data Mart Design (Cubes/Star schema)
    Data Marts (RDBMS, OLAP)
    End-User Tools (Excel, Access, English Query, 3rd parties)
    OLE DB
  Managing
    Data Warehouse Design (logical/physical schema/data flow)
  Using
    Scheduling data**
    OLAP Metadata**
  Microsoft Repository (Persistent Shared Meta-Data)
  Data Warehouse Management (Console*, Scheduling, Events, Topology*)
  Legend: Data Flow; Meta-Data Flow
  * Only partially available in SQL 7
  ** Implemented after SQL 7

Data Warehouse / Data Analysis
  Data Transformation Services to get data into the warehouse
  CUBE (OLE/DB OLAP) to analyze data
  [Diagram: Operational Data; Extract & Load; Data Warehouse Storage; OLAP]

Data Transformation Services
  Job steps + Workflow & Scheduler + Repository Metadata
  Data Pump interfaces: IDTSDataPump, IUnknown
  Source: OLE DB, ODBC, Fixed Field, ASCII Delimited
    Examples: Oracle, DB/2, VSAM, flat files
  DTS Data Pump: In, Steps, Xforms, Out
  Destination: OLE DB, ODBC, Fixed Field, ASCII Delimited, HTML Page, Repl.
  Publication

Warehouses of Data Cubes
  Warehouse is a collection of cubes
  Cubes have Star and Snowflake Schemas
  [Diagram labels]
  Data: Sales
  Dimensions and attributes
    Geography: Regions, Districts, Dealers
    Time: Days, Quarters, Years
    Products: Colors, Models, Lines
  Color values: RED, WHITE, BLUE
  Aggregates: By Year; By Make; By Make & Year; By Color; By Color & Year; Sum

Relational OLAP + Multidimensional OLAP
  Hybrid OLAP
  [Diagram labels: Source table; Europe (Partition 1); USA (Partition 2); Asia (Partition 3); “Plato” Designer; “Plato” server; ROLAP; Dcube; SQL; MD SQL; Client app; User 1; User 2]

OLAP + Office 2000 demo

Readiness
  300 applications today; 3,000 within 18 months
  System Integrators and ISVs
  52,000 trained DBAs

Internet Platform
  Visual Studio and FrontPage
  Site Server
  IIS
  MTS
  MSMQ
  Windows NT Server
  Web client
  Office client

Microsoft SQL Server 7.0
  all these features in one product
  workstation, server, enterprise
  Easy
  Scalable
  Data Warehousing
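
Appendix sketch: local and global aggregation

The Parallel Query slide shows 50,000 rows reduced by four local aggregates to 4 x 50 rows, then by one global aggregate to a 50-row result holding the number of employees and total income per group. The Python below is a minimal sketch of that two-phase pattern, not SQL Server's implementation. It assumes that 50,000 is the total row count, that each partition sees every group, and that the rows are (group, income) pairs; the data and all function names are invented for illustration. The partitions are processed one after another here, whereas the slide has them running in parallel across CPUs and disks.

```python
from collections import defaultdict

NUM_GROUPS = 50        # groups in the final result, as on the slide
NUM_PARTITIONS = 4     # one local aggregate per partition, as on the slide
TOTAL_ROWS = 50_000    # assumed to be the total input size


def local_aggregate(rows):
    """Aggregate one partition: group -> (employee count, total income)."""
    acc = defaultdict(lambda: [0, 0])
    for group, income in rows:
        acc[group][0] += 1
        acc[group][1] += income
    return {group: (count, total) for group, (count, total) in acc.items()}


def global_aggregate(partials):
    """Merge the per-partition partial results into the final result."""
    acc = defaultdict(lambda: [0, 0])
    for partial in partials:
        for group, (count, total) in partial.items():
            acc[group][0] += count
            acc[group][1] += total
    return {group: (count, total) for group, (count, total) in acc.items()}


# Synthetic (group, income) rows; the values are invented for illustration.
rows = [(i % NUM_GROUPS, 1_000 + i % 7) for i in range(TOTAL_ROWS)]

# Contiguous chunks stand in for the four disks.
chunk = TOTAL_ROWS // NUM_PARTITIONS
partitions = [rows[p * chunk:(p + 1) * chunk] for p in range(NUM_PARTITIONS)]

partials = [local_aggregate(part) for part in partitions]  # 4 partial results
result = global_aggregate(partials)                        # one merged result
```

Counts and sums can be merged this way because adding partial counts and partial sums gives the same answer as a single pass over all rows. An average would have to be carried as a (sum, count) pair and divided only in the global step.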