Microsoft SQL Server™ 7.0 Jim Gray Senior Researcher Microsoft Research

advertisement
Server 7 Launch Japan 1/21/99 gray 1
Microsoft SQL Server™ 7.0
Jim Gray
Senior Researcher
Microsoft Research
Microsoft Corporation
Server 7 Launch Japan 1/21/99 gray 2
Organizations Are Going Online





Building a digital nervous system.
Inexpensive hardware means huge
databases are possible.
But, we are drowning in data.
Databases help organize information.
Microsoft’s goal:


Information at your fingertips.
Make it easy to capture,
manage, and
analyze information.
Server 7 Launch Japan 1/21/99 gray 3
Microsoft SQL Server 7 Goals

Reduce cost of ownership:




Scaleable




Easy to build applications
Easy to administer
Easy to scaleup applications
Scale down to portables
Scale to many nodes (manage, replicate)
Scale to huge nodes
Data warehousing


Data capture
Data analysis
Server 7 Launch Japan 1/21/99 gray 4
Microsoft SQL Server
Enterprise
Standard
DeskTop

Gigabytes
100s users
Megabytes


Terabytes
1,000s users



Large Memory
Clusters
Partitioned cubes
Full featured
< 5 users
simple replication
+
Microsoft Data
Engine in Office 2000
Need
W2K
logo
Easy


Easy



Scalability



Data Warehousing 
Server 7 Launch Japan 1/21/99 gray 5
Dynamic self management
Multi-site management
Operation Scripting
Job scheduling and execution
Alert/response management
DBA profiling/tuning tools
Installation and upgrade
Unicode
Integrated with NT Security
Integrated with NT files
Made It
Server 7 Launch Japan 1/21/99 gray 6
(Even)
Easier!

Desktop



Workgroups



Auto Configure
Self-Configuring Engine / Dynamic Disk/memory
Reduce Learning Curve, Increase Productivity
Self-Managing SQLAgent, Wizards, “Task Pads”
Large Organizations



Deploy/manage “100’s” of SQL Servers
Lower TOC for Large Environments
Multi-Server Operations/ Remote Operation
Server 7 Launch Japan 1/21/99 gray 7
Multi-Site Management
Common Console
 Multi-site management
through groups, agents
 Route events from all
servers to one master
 Script any set of
operations for all servers
 Job Scheduling and
Workflow
 Shared sub-set of UI for
web administration
 Graphical tools for data
 Import/export
Server 7 Launch Japan 1/21/99 gray 8
Distributed Management Framework

Administration shell




Database
Profiler
SQL Namespace Object (SQL-NS)
DTS
SQL-Distributed Management Objects)
In-Process COM Server
40+ Objects, 1600+props/methods
ActiveX™ Scripting
Agents



MMC - SQL Server Snap-in
Navigation + UI
For namespace shells
COM administration
interfaces


Microsoft Management Console
Namespace objects


MMC snap-in
Supports all
operations
Self-managing
component
Activity
monitoring/filters
Events


Queries/traffic
State changes
in engine
SQL Server Agent
Replication
Task Execution
Event Mgt
Alerting/Paging
SNMP
Events
Actions
SQL Server
Engine
Trace
Events/
Perf
Mon
Server 7 Launch Japan 1/21/99 gray 9
Dynamic Self Management
Dynamic memory management
48M Memory, 100 locks
1 GB Memory, 10K locks
Dynamic File Space Management with Auto-Statistics
1/97, Sales history table
128 MB on disk
12/97, Sales history table
4 GB on disk, auto-grow
1/98, Sales history table
128 MB on disk, auto-shrink
Auto-Statistics
2 a.m., 5 users
Noon, 700 users
Server 7 Launch Japan 1/21/99 gray 10
Wizards and GUIs




Many Wizards - over 50 at last count
MS Access as a query interface
Query tool built into SQL, Visual Studio,...
Graphical show plan
Server 7 Launch Japan 1/21/99 gray 11
DBA and Developer Tools

SQL Server Profiler



Select server events and trace criteria
“Capture” output to screen or replay
Index Tuning Wizard


Analyzes actual server usage
history
Makes recommendations to
improve performance
Server 7 Launch Japan 1/21/99 gray 12
Demo: 5 minutes

Tour of the wizards wizard




(touch on create index, DTS, maintenance plan)
Show database layout (auto schema)
Show index tuning wizard
(using pre-canned trace file)
Let it define the indices
Server 7 Launch Japan 1/21/99 gray 13
Index Wizard is Good
but we will make it better


On a complex query set
wizard is 90% of best expert.
Extending it to other aspects of DB
design
Scalability

Easy



Scalability



Data Warehousing 

Server 7 Launch Japan 1/21/99 gray 14
Win9x/NT4/Win2000 version
Dynamic row-level locking
VLDB improvements
Improved query optimizer
Intra-query parallelism
VLM support
Replication improvements
Distributed query
High Availability Clusters
Server 7 Launch Japan 1/21/99 gray 15
Query Processor Enhancements









Focus on Complex Queries
Parallelism
Improved scan, fetch, & sort
Smart hash & merge join
Large joins & grouping
Better query optimization
Multi-index operations
Automatic statistics maintenance
Distributed Query
Heterogeneous Query
Server 7 Launch Japan 1/21/99 gray 16
Parallel Query
SMP & Disk Parallelism
Global Agg.
Result 50 rows
+
4 x 50 rows
Local Agg.
Disks



+
+
+
+
50,000 rows
•# of emp. per group
•total inc. per group
Plus Distributed
Plus Hash Join (fanciest on the planet)
Plus Optimized Partitioned views
Server 7 Launch Japan 1/21/99 gray 17
Distributed Heterogeneous Queries
Data Fusion / Integration
Join spread sheets,
databases,
directories,
Text DBs
etc.
Any source that
exposes OLE DB
interfaces
SQL Server as
gateway,
even on the
desktop
Directory
Service
Database
(DB2, VSAM,
Oracle, …)
Spreadsheet
SQL 7.0
Query
Processor
Photos
Mail
Maps
Documents
and the Web
Server 7 Launch Japan 1/21/99 gray 18
Replication





Transactional and Merge
Remote update
ODBC and OLE DB subscribers
Wizards
Performance
OS 390
DB2
Publisher
2PC,
RPC
Distributor
DB2
VSAM
Subscriber CICS
Subscriber Subscriber Subscriber Updating Subscriber
(immediate updates)
Server 7 Launch Japan 1/21/99 gray 19
Utilities
The Key to LARGE Databases

Backup





Recovery



Fuzzy
Parallel
Incremental
Restartable
Fast
File granularity
Reorganize


shrinks file
reclusters file


Auto-Repair
Index creation
~2x faster than 6.5

DBCC



not required,
a good practice
5x - 100x faster
Server 7 Launch Japan 1/21/99 gray 20
SQL Server 7.0 Backup
Transaction Throughput
during online backup
Parallel Backup Throughput (MBps)
3000
Transactions
2500
2000
1500
NO backup
avg = 2475 tpm
Backup
avg = 2349 tpm
1000
500
0
0
Scaling at
device speed
500
1000
1500
Time (secs)
2000
2500
95% of throughput
maintained during
full on-line backup
(8 drives)
Max tape backup speed: ~ 600GB/hr
Server 7 Launch Japan 1/21/99 gray 21
Recovery Speed Comparison
4x faster!
60
Recovery time (secs)
50
40
SQL Server 6.5
30
SQL Server 7.0
20
10
0
1
2
3
# of indices
4
Server 7 Launch Japan 1/21/99 gray 22
SQL Server 7.0 Scalability
5700
SQL Server 6.5
3537
SQL Server 7.0
95% of
customer
needs
2400
1011
<1500
<300
Application:
Benchmark:
SAP R/3
SD Users
3-Tier
BAAN IV
Baan
Reference
Users
PeopleSoft
HRMS
Users
Server 7 Launch Japan 1/21/99 gray 23
Best Single Node NT TPC-C
25,000
20,000
15,000
23,143
Published
TPC-C results
as of 1/20/99
www.tpc.org
16,273
10,000
6,393
6,842
7,650
5,000
0
DBMS Oracle Informix Sybase
System:-- 4X Pent-Pro, 200 MHz -$/tpmC $108
$100
$82
6.5
7.0
4X PII, Xeon
$56
$26
Server 7 Launch Japan 1/21/99 gray 24
The TPC Report






621,784 $
for 5 years
1.7 Tera Byte Disk
4 CPU
4 GB RAM
18,850 users
Affordable &
Manageable
Server 7 Launch Japan 1/21/99 gray 25
Price Performance Leadership
250%/year on Price,
100%/year performance
Best performance:
Best price/performance:
$1,000
$/tpmC vs time
100,000
23,143 tpmC, $26/tpmC
17715,
$22tpmC
tpmC vs time
tpmC
$/tpmC
10,000
$100
1,000
1.5
2.755676
100 %/year
improvement lately!
250 %/year improvement!
$10
Jan-94 Jan-95 Jan-96 Jan-97 Jan-98 Dec-98
100
Jan-94
Jan-95
Jan-96
Jan-97
Jan-98
Dec-98
Server 7 Launch Japan 1/21/99 gray 26
VLDB Improvements
Terabyte Support




File Groups for easier management
Exabyte Database Size (theoretical max)
Tested to 2.5 Terabytes
64-bit support (up to 32 GB with NT5)
Server 7 Launch Japan 1/21/99 gray 27
Demo
http://www.TerraServer.com
Microsoft
BackOffice
SPIN-2
Server 7 Launch Japan 1/21/99 gray 28
Site Configuration
StorageTek
9710 TimberWolf
Enterprise Storage Array
Alpha
8400
4 NTFS Stripe Sets (600 gb)
28 11-Disk Raid 5 Stripe sets
324 9 GB Seagate Disks
10 x
DLT
7000
(8x440)
10GB
Ram
7 HSZ70 Ultra-SCSI
Dual redundant Controllers
Compaq
Compaq
Compaq
Compaq
Compaq
Compaq
Proliant 5500
Proliant 5500
Proliant 5500
Proliant 5500
Proliant 5500
Proliant 5500
4x200mhz
512mb RAM
20GB Raid5
4x200mhz
512mb RAM
20GB Raid5
4x200mhz
512mb RAM
20GB Raid5
4x200mhz
512mb RAM
20GB Raid5
4x200mhz
512mb RAM
20GB Raid5
4x200mhz
512mb RAM
20GB Raid5
To the Web
Server 7 Launch Japan 1/21/99 gray 29
File System Configuration
28 hardwareRAID5 sets
Each raid set has 11 disks (16 spare drives)
4 595 GB NTFS volumes
Each striped over 7 Raid sets on 7 controllers
26 20 GB files on F:, G:, H:, I:
DB is File Group of 104 files (2 TB)
F:
G:
H:
I:
Server 7 Launch Japan 1/21/99 gray 30
TerraServer Statistics
TerraServer June 22 1998 to Jan 18 1999
35000000
30000000
Count
25000000
20000000
Billion SQL queries served
99.98% of scheduled availability
New data loaded in background
Sessions
Hit
Page View
DB Query
Image
15000000
10000000
5000000
0
Date
Server 7 Launch Japan 1/21/99 gray 31
High Availability
Microsoft Clusters

Automatic failover in less than a minute
Browser
Server 1
Server 2
Web
site
Web
site
Web site files
Database files
Server 7 Launch Japan 1/21/99 gray 32
Data Warehousing




Warehousing Framework
Visual data modeler
Microsoft repository
Data
transformation
services
Scalability
(DTS)
 Plato & Dcube - Multi
Dimensional Data Cubes
Data Warehousing  Integration with Office2000
Easy
Server 7 Launch Japan 1/21/99 gray 33
Microsoft Data Warehousing Framework
Building
Operational
Data
(ODBC,
OLE-DB)
DB Schema*
Data
Transformation
Services
(DTS)
Transformation
Data Mart Design
(Cubes/Star schema)
Data
Marts
(RDBMS,
OLAP)
End-User Tools
(Excel,
Access,
English Query,
3rd parties)
OLE DB
Managing
Data Warehouse Design
(logical/physical schema/ data flow)
Using
Scheduling
data**
OLAP
Metadata**
Microsoft Repository (Persistent Shared Meta-Data)
Data Warehouse Management
(Console*, Scheduling, Events,Topology*)
Data Flow
* Only partially available in SQL 7
** Implemented after SQL 7
Meta-Data Flow
Server 7 Launch Japan 1/21/99 gray 34
Data Warehouse / Data Analysis


Data Transformation Services
to get data into the warehouse
CUBE (OLE/DB OLAP)
to analyze data
Operational
Data
Extact
& Load
Data Warehouse
Storage
OLAP
Server 7 Launch Japan 1/21/99 gray 35
Data Transformation Services
Job steps
+ Workflow &
Scheduler
+ Repository
Metadata
IDTSDataPump
IUnknown
Data Pump
Source
OLE DB
ODBC
Fixed Field
ASCII Delimited
Examples:
Oracle, DB/2,
VSAM, flat files
Steps
Xforms
DTS Data Pump
In
Out
Destination
OLE DB
ODBC
Fixed Field
ASCII Delimited
HTML Page
Repl. Publication
Server 7 Launch Japan 1/21/99 gray 36
Warehouses of Data Cubes


Warehouse is a collection of cubes
Cubes have
Star and Snow Flake Schemas
 Data Attributes
 Dimensions
Geography

Regions
Districts
Dealers
By Year
By Make
By Make & Year
Sales
Days
Quarters
Years
Time
RED
WHITE
Colors
Models
Lines
Products
BLUE
By Color & Year
Sum
By Color
Server 7 Launch Japan 1/21/99 gray 37
RelationalOLAP+MultidmensionalOLAP
HybridOLAP
Source table
Europe
Partition 1
“Plato”
User 1
ROLAP
Designer
USA
Partition 2
Dcube
SQL
MD SQL
Client
app
Asia
Partition 3
ROLAP
Dcube
“Plato”
server
Client
app
User 2
Server 7 Launch Japan 1/21/99 gray 38
OLAP + Office 2000 demo
Server 7 Launch Japan 1/21/99 gray 39
Readiness



300 applications now days;
3,000 within 18 months
System Integrators and ISVs
52,000 trained DBAs
Server 7 Launch Japan 1/21/99 gray 40
Internet Platform
Visual Studio
and FrontPage
Site Server
IIS
MTS
Web
client
MSMQ
Windows NT Server
Office
client
Server 7 Launch Japan 1/21/99 gray 41
Microsoft SQL Server 7.0
all these features in one product
workstation, server, enterprise
Easy
Scalable
Data
Warehousing
Server 7 Launch Japan 1/21/99 gray 42
Download