Oracle Database 11g Performance Innovations Juan Loaiza Senior Vice President, Oracle Corporation The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle’s products remains at the sole discretion of Oracle. Agenda • High-Performance Today • Offloading and Caching for High Performance • High Performance with Large Data Volumes • End-to-end Performance Architecture <Insert Picture Here> High Performance Today Database Scaling SMP Scale-Up • Very mature – 20 years of experience • Many customers with largest SMPs on the market – 64 to 256 CPUs – Sun M9000, HP Superdome, IBM Regatta • Single System Image – Easy to manage – Easy to design applications • Works great, but high cost – Eventually hits a wall • Need at least two for availability RAC Scale-Out HR • • • • • Runs all Oracle database applications Highly available and scalable No Idle Resources Single System Image Thousands of production customers ERP Oracle Leads Performance Benchmarks Benchmark World Record Leadership TPC-C Performance Oracle TPC-C Price/Performance Oracle TPC-H @ 1,000 GB Oracle TPC-H @ 3,000 GB Non-Clustered Oracle TPC-H @ 10,000GB Non-Clustered Oracle TPC-H @ 30,000 GB Oracle SAP Sales and Distribution Parallel Oracle SAP Sales and Distribution 2-tier Oracle SAP (ATO) Assemble-To-Order 2 and 3 Tier Oracle As of October 2, 2009: Source: www.tpc.org & www.sap.com/benchmark.SAP TRBK Standard Application Benchmark: Sun Fire E6900 DB Server (8 1.5 GHz US-IV+ processors, 16 cores, 16 threads, 56 GB memory): 10,012,000 Day posts/hr, 6,664,000 Night bal accs/hr, Solaris 10, Oracle 10g, SAP Account Management 3.0 (64-bit) Cert #2006018.The two-tier SAP Business Information Warehouse 3.5 Standard Application Benchmark suite performed on 2/28/06, by Fujitsu Siemens Computers in Paderborn, Germany, was certified on 3/14/06 with the following data. The scenario for 32GB main memory which corresponds to 467,200,000 records in fact table was used. Load Phase - Average throughput total step 1+2 (rows/hour): 53,255,652. Query Navigation Steps: 377,280. The software configuration for all steps of the SAP BW Benchmark: Operating system central server: SUN Solaris 10. RDBMS: Oracle 10g. Platform Release: SAP NetWeaver '04. Configuration: Central server: Fujitsu Primepower 850, 16 processors/16 Cores/16 threads, SPARC64 V, 2.16 GHz, 128 KB (D) + 128 KB(I) L1 cache, 4 MB L2 cache, and 32 GB main memory, Cert # 2006014. Best OLTP Performance - TPC-C Record • • • • • 12 node Sun T5440 cluster – Sparc T2+ First World Record using Flash Technology 8X less hardware than previous record 16 times better response time 4X Less Power As of 10/10/2009: 12-Node Sun SPARC Enteprise T5440 server cluster 7,717,510 tmpC, $2.34/tpmC, available 12/14/09 Best Data Warehouse Performance - World Record 30 TB TPC-H QphH 150,960 Processors 128 Core Superdome Memory 1 Terabyte Disk Arrays 256 MSA1000’s Disk Storage 448 TB Read Throughput 40 GB/Sec Oracle Source: TPC, As of Nov 9, 2007: Oracle Database 10g on HP Superdome Server, 150,960 QphH $46.69/QphH, avail 6/18/07. Best Business Application Performance - World Record SAP SD 2-tier Benchmark First Benchmark on a 256 Core SMP – Sun M9000 39,100 40,000 35,400 35,000 SD Users 30,000 25,000 20,000 15,000 10,400 10,000 5,000 0 MSFT DB2 Oracle These results, as of October 2, 2009, have been certified by SAP AG, www.sap.com/benchmark. Please see notes page for benchmark certification details for the above results. The results above we achieved with SAP ERP 6.0 (Non-Univode)SAP ERP 200 Best Business Application Scaling World Record SAP SD Benchmark Results 5 Node Oracle RAC IBM P570 40,000 37,040 35,000 4 Node RAC 30,016 SD Users 30,000 3 Node RAC 25,000 22,416 20,000 2 Node RAC 15,520 15,000 10,000 Single Node SMP 4,010 5,000 2,035 0 4 8 12 16 20 24 28 32 36 40 44 48 52 56 60 64 68 72 76 80 # of CPU Cores Near Perfect Scaling across SMP and Cluster These results, as of October 2, 2009: have been certified by SAP AG, www.sap.com/benchmark. Please see notes page for benchmark certification details for the above results. Best Business Intelligence Performance World Record SAP BI Data Mart Benchmark 2 Node RAC Fujitsu RX300 609,349 Query Navigation Steps/Hour 700,000 600,000 500,000 Single Node SMP 320,363 400,000 300,000 182,112 200,000 100,000 IBM DB2 Oracle Oracle 0 8 8 16 # of CPU Cores These results, as of October 2, 2009: , have been certified by SAP AG, www.sap.com/benchmark. Please see notes page for benchmark certification details for the above results. Best OLTP Price-Performance Value Leadership Over Microsoft 1.00 0.91 0.84 0.90 0.80 0.68 $/tpmC 0.70 0.60 0.73 0.74 0.60 0.54 0.50 0.40 0.30 Linux Windows Linux HP Intel 0.20 0.10 0.00 Oracle Oracle Oracle Oracle Oracle SQL Server SQL Server As of 10/2/09::HP ProLiant ML350 G6 server, 232,002 tpmC, $0.54USD/tpmC, available May 21, 2009. Dell PowerEdge 2900 Server, 104,492 tpmC $.60USD/tpmC, available 2/20/2009. Dell PowerEdge 2900 97,083 tpmC, .68/tpmC, available 6/16/08. HP ProLiant ML350G5, 102,454 tpmC, .73/tpmC, available 12/31/07. HP ProLiant ML350G5, 100,926 tpmC, .74/tpmC, available 6/8/07. Microsoft SQL Server on HP ProLiant ML350G5, 82,774, .84/tpmC, available 03/27/07. Dell PowerEdge 2900, 69,564 tpmC, .91/tpmC, available 3/9/07. Source: Transaction Processing Performance Council (TPC), www.tpc.org What Oracle Runs Storage Example: Oracle Central e-Business DB Texas Colorado Sun E25K 36 CPU 2 Cores/CPU Total = 288 Cores 4 Node RAC Data Guard 76 TB Primary 76 TB Standby • Worldwide Central E-business database for Fortune 200 company • ERP, HR, and CRM – Payroll, orders, contracts, procurement, expense reports, hiring… • Consolidated 70 separate Applications databases – Estimated cost savings of over $1B Oracle Beehive OLTP using Exadata • Runs Oracle Email, Calendar, Contacts, Chat, Documents, Web Conferencing Each of 3 Configurations: 16 Node RAC Cluster 2 quad-core Intel CPUs per Node • 16-node production system – Remote standby, testing system • 1 PB of disk per system – 50 SAS cells, 48 SATA cells – 3 PB of total storage Infiniband 17 Switches • Complete Oracle Software Stack – RAC, Streams, Active Data Guard, Secure Backup, RMAN, Flashback Database, ASM, Partitioning – 2X space saved with compressed SecureFiles 98 HP Exadata Storage Cells 1 PB Raw Storage Offloading and Caching for High Performance Database, Client, Remote <Insert Picture Here> Server SQL Results Cache • Database caches results of queries, sub-queries, or pl/sql function calls – • • • • Cache is shared across statements and sessions on server – Full consistency and proper semantics 2x speedup on hit for worst case of trivial query 100x speedup on hit for complex queries Statement hints specify caching - /*+ result_cache +*/ Only for very read intensive tables In-Memory Parallel Execution QphH: 1 TB TPC-H 1,166,976 • Database release 11.2 introduces parallel query processing on memory cached data 1,018,321 – Queries run from tables in database buffer cache – Harnesses memory capacity of entire database cluster for queries – Foundation for world record 1TB TPC-H 315,842 ParAccel Exasol Oracle & HP Exadata Faster than specialized inmemory warehouse databases Memory has 100x more bandwidth than Disk Source: Transaction Processing Council, as of 9/14/2009: Oracle on HP Bladesystem c-Class 128P RAC, 1,166,976 QphH@1000GB, $5.42/QphH@1000GB, available 12/1/09. Exasol on PRIMERGY RX300 S4, 1,018,321 QphH@1000GB, $1.18/QphH@1000GB, available 08/01/08. ParAccel on SunFire X4100 315,842 QphH@1000GB, $4.57 /QphH@1000GB, available 10/29/07. Database Smart Flash Cache • Database Smart Flash Cache transparently extends buffer cache Buffer Cache Many I/O’s Enterprise Storage Multiple Cabinets Buffer Cache Few I/O’s Database Smart • Flash Cache Mid-Range Storage Few Shelves – 10x Larger – Uses flash disks or cards in database host – Cache eliminates most I/Os – Available on Solaris and OEL Benefits – – – – – – Fewer disks needed Less powerful array needed Better response time Big jobs run faster Lower Power High ROI Oracle is the First Flash Optimized Database OCI Consistent Client Cache Application Server Database Consistent Caching Simplest Queries can speedup: • 50x in elapsed time • 20x in CPU time • Caches query results on client • Primarily for caching small (10s or 100s of KB) read-intensive tables – – Queries where network overhead dominates e.g. lookup tables • Cache is fully consistent – Coherence messages bundled into responses to DB calls ensure cache remain consistent – Like Cache Fusion extended out to clients In-Memory Database Cache Grid – TimesTen Scaling with Business Growth On-demand loading of cached data and redistribution based on usage High availability Transactional consistency Peer-to-peer communication between grid nodes Application In-Memory Database Cache Application Application In-Memory Database Cache In-Memory Database Cache Incremental scalability Application Application In-Memory Database Cache In-Memory Database Cache Microseconds Lightning Fast Response Time Runs In-Application 16 14 14 millionths of a second 12 10 8 6 4 4 millionths of a second 2 0 Read a Record Update Transaction Oracle TimesTen In-Memory Database 11g - Intel Xeon 3.0 Ghz 64-bit Oracle Enterprise Linux Active Data Guard Query Offload • Users want to performance protect their production DBs – Active Data Guard offloads high risk reporting & backup from OLTP • Current approaches – Physical Copy Reporting DB (e.g. split mirror) • Solution is simple but data is stale (day old) – Logical Replica Reporting DB (e.g. replication) • Replication provides real-time updates but is complex Simple • Active Data Guard enables a unique real-time solution – Reporting using physical standby technology – Real-time, simple, and fast – also provides DR Continuous Redo Shipment and Apply Production Database RealTime Concurrent Real-time Query Queries Reporting Database Web Scale Highly Available Reader Farm Reporting, web content browsing • Reader farm implemented using Active Data Guard – – – – Reader Databases Updates Scale-out read queries Isolate faults to each DB High performance Supports all types & DDL • Automatic, zero loss failover Redo Shipping Redo Shipping Primary Database Designated Fast-Start Failover DB – Readers follow automatically • RAC can scale-out updater, or centralize storage of readers High-Performance with Large Data Volumes <Insert Picture Here> Data Growth Challenges • IT must support exponentially growing amounts of data – With improved performance – With lower cost • Powerful and efficient compression is key Advanced OLTP Compression • Compress large application tables – Transaction processing, data warehousing – Transparent to application • Compress all data types – Structured and unstructured data types • Improve query performance – Cascade storage savings throughout data center Up To 4X Compression Real World Compression Results - ERP Database 10 Largest Tables Storage Utilization Table Scan Performance 2500 0.4 2.5x Faster 2000 3x Smaller 0.3 1500 0.2 1000 0.1 500 DML Performance 0 40 30 20 10 0 Less than 3% Overhead 0 Exadata Hybrid Columnar Compression Two Modes Warehouse Compression • 10x average storage savings • 10x reduction in Scan IO Archive Compression • 15x average storage savings – Up to 50x on some data • Some access overhead • For cold or historical data Optimized for Speed Optimized for Space Smaller Warehouse Faster Performance Reclaim 93% of Disks Keep Data Online Can mix OLTP and Hybrid Columnar Compression by partition for ILM Real-World Compression Ratios Oracle Production E-Business Suite Tables Size Reduction Factor by Table 52 50 45 40 35 30 25 20 15 10 5 0 OLTP Compression (avg=3.3) 43 Query Compression (avg=14.6) Archive Compression (avg=22.6) 10 10 10 11 29 16 19 19 19 20 21 • Columnar compression ratios • Query = 14.6X • Archive = 22.6X • Vary by application and table © 2009 Oracle Corporationl 31 Files in the Database Reinvented • Best of Both Worlds • File Capabilities – – – – – – File System Interface High Performance Compression Encryption Deduplication HSM • Database Capabilities – Transactions – Query Consistency – Advanced Backup and Recovery – Powerful Security – Flashback – Scale up SMPs – Scale out Clusters • Files are an integral part of modern database applications – Product images, contracts, XML, ETL files, manuals, etc. • Applications developers want to store business data files in the database to benefit from transactional consistency, and unify HA and Security – Poor performance, limited functionality, and lack of access by existing file based tools have held them back • Oracle Database 11g reinvents files in the database • SecureFiles provides super fast and powerful file storage – Removes performance barrier to storing files in the database • DBFS provides a file system interface to files in the DB – Enables existing file based tools to easily access DB files SecureFiles Performance • Performance compared to Linux FS – Tests run using both SecureFiles and ext3 in metadata journaling only, no network File Reads File Writes (MB/second) (MB/second) SecureFiles 100 80 60 40 20 0 Linux Files LOBs 0.01 0.10 1 File Size (MB) 10 100 60 50 40 30 20 10 0 SecureFiles Linux Files LOBs 0.01 0.10 1 10 File Size (MB) 100 Database File System - DBFS • Shared Linux file system – Shared storage for ETL staging, scripts, reports and other application files • Files stored as SecureFiles in database tables – Protected like any DB data – mirroring, DataGuard, Flashback, etc. • 5 to 7 GB/sec file system I/O throughput on Database Machine • Example use case: Load into database using External Tables ETL Files in DBFS ETL More File Throughput than High-End NAS Filer End-to-end Performance Architecture Sun Oracle Database Machine • Grid is the architecture of the future • Highest performance, lowest cost, fault tolerant, scalable on demand • Database Machine is an engineered, optimized, standardized, and tested grid for Oracle database – with intelligent storage Oracle Database Server Grid • 8 compute servers • 64 Intel Cores • 576 GB DRAM InfiniBand Network • 40 Gb/sec unified server and storage network • Fault Tolerant © 2009 Oracle Corporation Exadata Storage Server Grid • 14 storage servers • 100 TB or 336 TB Disk Storage • 5TB flash storage! • Offload queries into storage 36 Scale Performance and Capacity • Scalable – Scales to 8 rack database machine by just adding wires • More with external InfiniBand switches – Scales to hundreds of storage servers • Multi-petabyte databases • Redundant and Fault Tolerant – Failure of any component is tolerated – Data is mirrored across storage servers Exadata Database Offload to Storage • Exadata storage servers implement data intensive processing in storage New – – – – – – – Row filtering based on “where” predicate Column filtering Join filtering Incremental backup filtering Storage Indexing Scans on encrypted data Data Mining model scoring • 10x reduction in data sent to DB servers is common • No application changes needed – Processing is automatic and transparent – Even if cell or disk fails during a query © 2009 Oracle Corporation 38 Exadata Smart Flash Cache • Smart Flash Cache holds hot data – Not just simple LRU • Knows when to avoid caching to avoid flushing cache – Allows optimization by application table • Performance of Flash combined with Cost of Disk Oracle is the First Flash Optimized Database Flash Cache for Scans * Note: This compares uncompressed data. Oracle’s • Flash storage more than doubles scan throughput – 50 GB/sec • Combined with Hybrid Columnar Compression – Up to 50 TB of data fits in flash – Queries on compressed data run up to 500 GB/sec GB/Second (uncompressed) advantage is much greater than this with compressed data. 50 45 40 35 30 25 20 15 10 5 0 50 Flash 2x – 5x better throughput* 21 Disk 7.5 Disk 10 Disk TERADATA NETEZZA 2550 TwinFin 12 SUN ORACLE Database Machine Flash Cache for Random I/O • Database Machine achieves: • 20x more random I/Os – Over 1 million per second • 10x better I/O response time – Sub-millisecond • Greatly Reduced Cost 5X More I/Os than 1000 Disk Enterprise Storage Array – 10x fewer disks needed for I/O – Lower Power Sun Oracle Database Machine Extreme Performance for all Data Management • Best for Data Warehousing – Parallel query on memory or Flash at more than 50 GB/sec – 10x compressed tables with storage offload • Best for OLTP – – – – Only database that scales real-world applications on grid Smart flash cache provides 1 million I/Os per second Up to 50x compression for archival data Secure, fault tolerant • Best for Database Consolidation – Only database machine that runs and scales all workloads – Predictable response times in multi-database, multiapplication, multi-user environments Oracle is Ready for the Future • High-Performance Today • Offloading and Caching for High Performance • High Performance with Large Data Volumes • End-to-end Performance Architecture Exadata Sessions Date Time Room Session Title Mon 10/12 5:30 PM Moscone South 307 S311436 - Implement Best Practices for Extreme Performance with Oracle Data Warehouses. Tue 10/13 11:30 AM Moscone South 307 S311385 - Extreme Backup and Recovery on the Oracle Database Machine. Tue 10/13 1:00 PM Moscone South 307 S311437 - Achieve Extreme Performance with Oracle Exadata and Oracle Database Machine. Tue 10/13 1:00 PM Moscone South Room 102 S311358 - Oracle's Hybrid Columnar Compression: The Next-Generation Compression Technology Tue 10/13 2:30 PM Moscone South 102 S311386 - Customer Panel 1: Exadata Storage and Oracle Database Machine Deployments. Tue 10/13 4:00 PM Moscone South 102 S311387 - Top 10 Lessons Learned Implementing Oracle and Oracle Database Machine. Tue 10/13 5:30 PM Moscone South 102 S307963 - Oracle Database Machine and Exadata Best Practices and Customer Considerations. Tue 10/13 5:30 PM Moscone South Room 104 S311239 - The Terabyte Hour with the Real-World Performance Group Tue 10/13 5:30 PM Moscone South 252 S310048 - Oracle Beehive and Oracle Exadata: The Perfect Match. Wed 10/14 4:00 PM Moscone South 102 S311387 - Top 10 Lessons Learned Implementing Oracle and Oracle Database Machine. Wed 10/14 5:00 PM Moscone South 104 S311383 - Next-Generation Oracle Exadata and Oracle Database Machine: The Thu 10/15 12:00 PM Moscone South 307 S311511 - Technical Deep Dive: Next-Generation Oracle Exadata Storage Server and Oracle Database Machine Future Is Now.