Oracle Database Machine

advertisement
<Insert Picture Here>
Oracle for Data Warehousing
Today’s Reality
Packaged
Applications
Analytics
Custom
Reporting
Data
Migration
Data Silos
Data
Replication
Data Marts
Business
Intelligence
Performance
Management
Data
Warehousing
Data
Federation
Data Hubs
Data Access
Batch
Scripts
Data
OLTP & ODS Warehouse
Systems
Data Mart
Custom
SQL
Java
SAP, Oracle
Files
PeopleSoft, Siebel,
Excel
Custom Apps
XML
OLAP
What’s the Impact on Your Business?
Fragmented Untrustworthy Inaccessible
data Silos
Data
Information
Limited
Scalability
• Higher Costs
Fragmented
SilosCosts
• data
Higher
• Increased Risk
• Increased Risk
• Poor Decisions
• Poor Decision Making
Difficult
to Manage
Moving toward a Clean Architecture
ORCHESTRATION
APPLICATIONS
MASTER DATA
ANALYTICS /
APPS
SERVERS
MDM
APPLICATIONS
CRM APPS
ETL
OPERATIONS
Master
Master
Data
Data
EPM
WEBSITES
BPEL
PM
BUSINESS INTELLIGENCE
SCM APPS
BI
ETL
FINANCIALS
...
DISTRIBUTION
ERP APPS
Data
Warehouse
CUSTOM APPS
ESB
CLUSTERS
Data Warehouse Reference Architecture
5
Oracle’s Integrated Stack
Customer
Service
Ad Hoc
Analysis
Interactive Performance Reporting &
Dashboards Management Publishing
Proactive
Detection
PM and BI Applications
PM Applications BI Applications
Business Intelligence Foundation
OLAP
Predictive
Data Integration & Management
ETL/ELT
Data Services
Data Federation Data Quality
Data Warehousing
Partitioning
Modeling
Data Mining
Compression
Database and Middle Tier Servers
Storage
Oracle: #1 in Data Warehousing
Oracle Confidential – Do Not Distribute
Source: IDC, August 2010 – “Worldwide Data Warehouse Platform Software 2009 Vendor Shares”
Oracle Data Warehouse Customers
Retail
Communications
Financial Services
Manufacturing
Manufacturing
CPG/Healthcare
Transportn/Energy
Oracle Exadata Database Machine
Extreme Performance
• Fastest growing new product in Oracle’s history
• Server & Storage Integrated Hardware &
Software Platform
• Data Warehousing
• OLTP
• Consolidation
“After carefully testing several data
warehouse platforms, we chose the Oracle
Database Machine. Oracle Exadata was able
to speed up one of our critical processes
from days to minutes.”
Brian Camp
SVP, Infrastructure Services Knowledge Base Marketing
Oracle for Data Warehousing
• Optimized for strategic warehousing
• 25 GB/sec IO bandwidth, with up to 50 GB/sec
with Flash
• Optimized for real-world data loading
• Read consistency with the ability to load at 5TB/hr
• Optimized for operational warehousing
• Advanced indexing capabilities running at 1M
IOPS
• Optimized for advanced analytics
• Integrated OLAP, data mining, spatial and
statistics
• Optimized for large data sets
• 10x user data compression
10
Exadata Hardware Architecture
Scaleable Grid of industry standard servers for Compute and Storage
• Eliminates long-standing tradeoff between Scalability, Availability, Cost
Database Grid
• 8 Dual-processor x64
database servers
Intelligent Storage Grid
• 14 High-performance low-cost
storage servers
OR
• 2 Eight-processor x64
database servers
InfiniBand Network
• Redundant 40Gb/s switches
• Unified server & storage
network
• 100 TB High Speed disk, or
336 TB High Capacity disk
• 5.3 TB PCI Flash
• Data mirrored across
storage servers
Complete Family Of Database Machines
For OLTP, Data Warehousing & Consolidated Workloads
Oracle Exadata X2-2
• Quarter, Half, Full and Multi-Racks
Oracle Exadata X2-8
• Full and Multi-Racks
Exadata is Smart Storage
Compute Intensive
Processing
Database Server
• Compute and memory intensive data
processing executes in database servers
• Fully-parallelized joins and aggregations
Bandwidth Intensive
Searches
Exadata Storage Server
• IO-bandwidth intensive database operations
executes in storage servers
• Exadata Smart Scans and Exadata Storage Indexes
filter out data that is not relevant to a query
• Database servers and Exadata storage work in conjunction to execute SQL
• Exadata cell is smart storage, not a complete database node
13
Exadata Database Processing in Storage
• Exadata storage servers implement data intensive
processing in storage
•
•
•
•
•
•
•
Row filtering based on “where” predicate
Column filtering
Join filtering
Incremental backup filtering
Scans on Hybrid Columnar Compressed data
Scans on encrypted data
Data Mining model scoring
• 10x reduction in data sent to DB servers
is common
• No application changes needed
•
•
Processing is automatic and transparent
Even if cell or disk fails during a query
Exadata Flash
Extreme Performance
• Oracle Database Machine has
5 TB of flash storage
• 4 high-performance flash cards in every Exadata
Storage Server
• Smart Flash Cache caches hot
data
Oracle is the First Flash
Optimized Database
• Not just simple LRU
• Knows when to avoid caching to avoid
flushing cache
• Allows optimization by application table
Oracle Database Machine:
Optimized for large scans
10 TB of user data
Requires 10 TB of IO
1 TB
with compression
100 GB
with partition pruning
Subsecond
On Database
Machine
20 GB
with Storage Indexes
5 GB
with Smart Scans
2000X less data needs to be processed
What does Extreme Performance mean
for your business?
• Massive data volumes
• More granular data
• Daily data instead of weekly
• Store data instead of account
• More history
• 5 years instead of 1 year
• New data sources
• Consumer-level data
• Entirely new analytics
• Queries that were never possible now run in minutes
• Near-real-time data loading
Oracle Database Machine
• Optimized for strategic warehousing
• 25 GB/sec IO bandwidth, with up to 50 GB/sec
with Flash
• Optimized for real-world data loading
• Read consistency with the ability to load at 5TB/hr
• Optimized for operational warehousing
• Advanced indexing capabilities running at 1M
IOPS
• Optimized for advanced analytics
• Integrated OLAP, data mining, spatial and
statistics
• Optimized for large data sets
• 10x user data compression
18
Fast data loading
• Performant data loading and transformation with in-database ETL
• Direct flat file access with external tables
• Transformation inside the database with SQL and PL/SQL
• Bulk and trickle load
• Up to 5 TB/hr of raw data loading on a Database Machine
Access
Transform
Load
Transform Data Where Data Resides
In-database ETL technology
Extract
Load
Transform
Insert
Data Pump
Transportable Tablespaces
Partition Exchange Loading
Change Data Capture
Distributed Queries
SQL*Loader
External Tables
Table Functions
Multi-Table Insert
MERGE
DML error logging
DBFS - Scalable Shared File System
• Database Machine comes with DBFS shared Linux file system
•
Shared storage for ETL staging, scripts, reports and other application files
• Files stored as SecureFile LOBs in database tables stored in Exadata
•
Protected like any DB data – mirroring, DataGuard, Flashback, etc.
• 5 to 7 GB/sec file system I/O throughput
Load into database
using External Tables
ETL Files in DBFS
ETL
More File Throughput than High-End NAS Filer
Real Time Data Integration with
Oracle GoldenGate
• Real time extracts from
transactional systems
• Non-invasive on sources
Source 1
EMP
DEPT
• Continuous streaming load
into ODS Schema of target
• Latency in seconds
On-Disk Logs
Oracle
GoldenGate
Source 2
EMP
DEPT
Oracle
GoldenGate
On-Disk Logs
22
Oracle is uniquely capable of
concurrent query and updates
report
• Concurrent small data loads and
queries
Budget table
•
Looks like... OLTP
• Oracle's read consistency
update
Rollback
Segment
update
accurate
report
•
•
•
•
•
•
Readers never block writers
Writers never block readers
Queries are always consistent and auditable
No deadlocks
Introduced in Oracle V4 (1982)
Teradata: lock rows for reads and writes to obtain
consistency
Oracle Database Machine
• Optimized for strategic warehousing
• 25 GB/sec IO bandwidth, with up to 50 GB/sec
with Flash
• Optimized for real-world data loading
• Read consistency with the ability to load at 5TB/hr
• Optimized for operational warehousing
• Advanced indexing capabilities running at 1M
IOPS
• Optimized for advanced analytics
• Integrated OLAP, data mining, spatial and
statistics
• Optimized for large data sets
• 10x user data compression
24
Data Warehouse Reference Architecture
Base data warehouse schema
Atomic-level data, 3nf design
Supports general end-user queries
Data feeds to all dependent systems
Application-specific performance structures
Summary data / materialized views
Dimensional view of data
Supports specific end-users, tools, and applications
25
Data Warehouse Reference Architecture
IO-bandwidth intensive
workloads
Random-IO intensive workloads
26
Oracle delivers complete capabilities
for tactical warehouse queries
• Fast lookups
• B-Tree indexes
• Dimensional queries
• Star query optimizations
• Bitmap indexes
• Bitmap join indexes
• Aggregate management
• Materialized Views
• Cube-organized materialized views
Workload Management for DW
Setting Up a Workload Management System
Workload
Management
Define Workload
Plans
Define Workloads
Filter Exceptions
Manage Resources
Execute
Workloads
Monitor Workloads
Adjust Plans
RAC
IORM
Adjust Workload
Plans
OEM
DBRM
Monitor
Workloads
© 2010 Oracle Corporation
Workload Management
Request
Each request:
• Executes on a RAC Service
• Which limits the physical resources
• Allows scalability across racks
Assign
Each consumer group has:
Each request
assigned to a
consumer group:
•OS or DB Username
•Application or Module
•Action within Module
•Administrative function
•Resource Allocation (example: 10% of CPU/IO resources)
•Directives (example: 20 active sessions)
•Thresholds (example: no jobs longer than 2 min)
Ad-hoc
Workload
Downgrade
Execute
Queue
Reject
© 2010 Oracle Corporation
Workload Management
Real-Time
ETL
Batch
ETL
Request
Analytic
Reports
Assign
Execute
OLTP
Requests
Ad-hoc
Workload
Downgrade
Queue
Reject
© 2010 Oracle Corporation
Workload Management
Real-Time
ETL
Queue
R-T 10%
Batch
ETL
Queue
Analytic
Reports
Queue
Analytic
Reports
50%
OLTP
Requests
Queue
OLTP 5%
Batch 10%
Request
Assign
Ad-hoc
25%
Ad-hoc
Workload
Downgrade
Queue
Reject
© 2010 Oracle Corporation
Oracle Database Machine
• Optimized for strategic warehousing
• 25 GB/sec IO bandwidth, with up to 50 GB/sec
with Flash
• Optimized for real-world data loading
• Read consistency with the ability to load at 5TB/hr
• Optimized for operational warehousing
• Advanced indexing capabilities running at 1M
IOPS
• Optimized for advanced analytics
• Integrated OLAP, data mining, spatial and
statistics
• Optimized for large data sets
• 10x user data compression
32
In-database Analytics
Bring Algorithms to the Data, Not Data to the Algorithms
• Analytic computations done in
the database
• Dimensional analysis
• Statistical analysis
• Data Mining
OLAP
Statistics
Data Mining
•
•
•
•
Scalability
Security
Backup & Recovery
Simplicity
Deeper insights
from Oracle Data Warehouses
Oracle Spatial: GIS data available for
analysis and displayed via MapViewer
Deeper insights
from Oracle Data Warehouses
Oracle OLAP: Hierarchically aware
rankings, shares, alerts and time series
calculations are easily defined in the cube
and queried by OBIEE using simple and
efficient SQL
Deeper insights
from Oracle Data Warehouses
Hierarchically aware rankings, shares,
alerts and Oracle
time series
are
Datacalculations
Mining: Predictions
&
easily defined
in the are
cube
and queried
by database
probabilities
calculated
within
OBIEE and
tools for
using
simpleusing
and OBIEE
andother
available
reporting
efficient SQL
Deeper insights
from Oracle Data Warehouses
In-Database
Analytics
OLAP
Data
Mining
Spatial
Deeper insights
for business users



Simple and
seamless
embedded
analytics
Extreme
performance and
scalability






Pervasive
benefits
across stack
Oracle Database Machine
• Optimized for strategic warehousing
• 25 GB/sec IO bandwidth, with up to 50 GB/sec
with Flash
• Optimized for real-world data loading
• Read consistency with the ability to load at 5TB/hr
• Optimized for operational warehousing
• Advanced indexing capabilities running at 1M
IOPS
• Optimized for advanced analytics
• Integrated OLAP, data mining, spatial and
statistics
• Optimized for large data sets
• 10x user data compression
38
Exadata Hybrid Columnar Compression
Highest Capacity, Lowest Cost
• Data is organized and compressed by column
Query
• Dramatically better compression
• Speed Optimized Query Mode for Data
Warehousing
• 10X compression typical
• Runs faster because of Exadata offload!
• Space Optimized Archival Mode for
infrequently accessed data
• 15X to 50X compression typical
Faster and Simpler
Backup, DR,
Caching, Reorg,
Clone
Benefits
Multiply
© 2010 Oracle Corporation
39
More Data Capacity
Exadata V2
10x Compression
Systems with
Equal User Data
Netezza TwinFin
2x to 4x Compression
EMC VMAX
3x Oracle Compression
Teradata 2580
1.3x Compression
All with Largest Disks,
Best Compression
SUMMARY
41
Enterprise Availability
RAC
Active
Data Guard
ASM
Secure
Backup
Flashback
Online
Redefinition
• Redundant Hardware
GoldenGate
Replication
• Real-Time Active Replica
• Servers, Storage, Network
• Database Level HA
• Tolerate failures and changes
© 2010 Oracle Corporation
42
First Secure Database Machine
• Moves decryption from software
to hardware
• Over 5x faster
• Near zero overhead for fully
encrypted database
• Queries decrypt data at hundreds
of Gigabytes/second
© 2010 Oracle Corporation
43
Standardized and Simple to Deploy
• All Database Machines are the same
• Delivered Tested and Ready-to-Run
• Highly Optimized
• Highly Supportable
• No unique configuration issues
• Identical to config used by Oracle Engineering
• Runs existing OLTP and DW applications
• Full 30 years of Oracle DB capabilities
• No Exadata certification required
Deploy in Days,
Not Months
• Leverages Oracle ecosystem
• Skills, knowledge base, people, partners
© 2010 Oracle Corporation
44
Faster than DW Appliances
50 GB/sec!
Flash
• Faster Throughput from Disk
Query Throughput
GB/sec Uncompressed Data
Single Rack
• Much Faster with Flash
Disk
• 50 TB of data fits in Flash
• Using 10x Query Compression
• Effective Query Throughput on
compressed data is even higher
• Hundreds of GB/sec
Teradata
Netezza
Exadata
2580
TwinFin 12
V2
Oracle Exadata Momentum
Rapid adoption in all geographies and industries
Oracle Exadata Database Machine:
Changes the Game
• Database intelligence and massively
parallel scaling in the storage tier
• Using state of the art industry standard
hardware
• Complete, Integrated Data Warehouse,
OLTP and consolidation solution
• High availability
• Enterprise Security
• Advanced analytics
• Innovative new technologies:
• Hybrid Columnar Compression
• In-memory parallel execution
• FlashFire hardware + flash-optimized
software
47
Download