EMC Data Protection for Oracle Saving Time and Money EMC Data Domain Mel Shum Technical Marketing Engineer EMC Backup Recovery Systems © Copyright 2010 EMC Corporation. All rights reserved. 1 Agenda • Problems many database administrators face • Overview of Oracle Recovery Manager (RMAN) functional components • Key features of EMC Data Domain deduplication storage systems • Impact of Data Domain deduplication in an Oracle environment • Customer examples • Summary © Copyright 2010 EMC Corporation. All rights reserved. 2 Problems Many Database Administrators Face Too many for this talk so we’ll focus on just backup and recovery • Non-stop growth and proliferation • Backup windows remain approximately the same • Recovery is just as important and often overlooked • Limited online retention – Increased frequency of recovery for audits, troubleshooting, updates, etc. • Archive (aka, ILM) becoming increasingly important – Some data must be retained for 5 years, 7 years, forever…. © Copyright 2010 EMC Corporation. All rights reserved. 3 RMAN – What Is It? For those in the audience that don’t already know: • Oracle’s included backup / recovery utility – Supports full, incremental, and image backups – Retains backup information in the control file and (optionally) a separate database catalog – Many configuration options to tailor to users’ requirements – Automatically eliminates unused data blocks from most backups © Copyright 2010 EMC Corporation. All rights reserved. 4 Overview of RMAN Functional Components Target Database RMAN Repository (Control File) © Copyright 2010 EMC Corporation. All rights reserved. 5 Overview of RMAN Functional Components RMAN Client Target Database RMAN Repository (Control File) © Copyright 2010 EMC Corporation. All rights reserved. 6 Overview of RMAN Functional Components RMAN Client Target Database RMAN Repository (Control File) Backup Disk © Copyright 2010 EMC Corporation. All rights reserved. 7 Overview of RMAN Functional Components RMAN Client Target Database RMAN Repository (Control File) Backup Backup MML API Disk Third Party Media Managers © Copyright 2010 EMC Corporation. All rights reserved. 8 Overview of RMAN Functional Components RMAN Client Target Database RMAN Repository (Control File) Backup Backup Disk MML API Third Party Media Managers Tape © Copyright 2010 EMC Corporation. All rights reserved. 9 Overview of RMAN Functional Components RMAN Client Recovery Catalog Target Database RMAN Repository (Control File) Backup Backup Disk MML API Third Party Media Managers Tape © Copyright 2010 EMC Corporation. All rights reserved. 10 Overview of RMAN Functional Components RMAN Client Recovery Catalog FRA Target Database RMAN Repository (Control File) Backup Backup Disk MML API Third Party Media Managers Tape © Copyright 2010 EMC Corporation. All rights reserved. 11 Overview of RMAN Functional Components RMAN Client Recovery Catalog FRA Target Database RMAN Repository (Control File) Backup Backup Disk MML API Third Party Media Managers Tape © Copyright 2010 EMC Corporation. All rights reserved. 12 Weekly Full Backup Weekly: Full image backups (nightly would be preferred!) RMAN> ALLOCATE CHANNEL CH1 DEVICE TYPE DISK FORMAT ‘/backup/ora.weekly/%U’; RMAN> ALLOCATE CHANNEL CH2 DEVICE TYPE DISK FORMAT ‘/backup/ora.weekly/%U’; RMAN> BACKUP AS COPY TAG ‘MAY9’ DATABASE INCLUDE CURRENT CONTROLFILE; RMAN> BACKUP ARCHIVELOGS TAG ‘MAY9’ ALL NOT BACKED UP DELETE ALL INPUT; Target DB 1 TB © Copyright 2010 EMC Corporation. All rights reserved. 13 Weekly Full Backup Weekly: Full image backups (nightly would be preferred!) RMAN> ALLOCATE CHANNEL CH1 DEVICE TYPE DISK FORMAT ‘/backup/ora.weekly/%U’; RMAN> ALLOCATE CHANNEL CH2 DEVICE TYPE DISK FORMAT ‘/backup/ora.weekly/%U’; RMAN> BACKUP AS COPY TAG ‘MAY9’ DATABASE INCLUDE CURRENT CONTROLFILE; RMAN> BACKUP ARCHIVELOGS TAG ‘MAY9’ ALL NOT BACKED UP DELETE ALL INPUT; Full images 1 TB Disks 1 TB 1 TB Disk Disk Much more than 1 TB to keep a few days Limits Retention Target DB 1 TB © Copyright 2010 EMC Corporation. All rights reserved. 14 Daily Incremental Backup Daily: Incremental backup after weekly full RMAN> ALLOCATE CHANNEL CH1 DEVICE TYPE DISK FORMAT ‘/backup/ora.weekly/%U’; RMAN> ALLOCATE CHANNEL CH2 DEVICE TYPE DISK FORMAT ‘/backup/ora.weekly/%U’; RMAN> BACKUP INCREMENTAL DATABASE …. Weekly Fulls 1 TB 1 TB 1 TB Disks Disks Disks Target DB 1 TB © Copyright 2010 EMC Corporation. All rights reserved. 15 Daily Incremental Backup Daily: Incremental backup with update RMAN> ALLOCATE CHANNEL CH1 DEVICE TYPE DISK FORMAT ‘/backup/ora.weekly/%U’; RMAN> ALLOCATE CHANNEL CH2 DEVICE TYPE DISK FORMAT ‘/backup/ora.weekly/%U’; RMAN> BACKUP INCREMENTAL DATABASE …. Weekly Fulls + 1 TB Daily Incrementals 1 TB 1 TB Disks Disks1 2 3 . . Disks12 . . 3 1 2 3 . . Well beyond 1TB of backup storage needed Target DB 1 TB © Copyright 2010 EMC Corporation. All rights reserved. 16 Before Data Domain… 18 Cabinets of IBM Tape © Copyright 2010 EMC Corporation. All rights reserved. 17 After Data Domain… 1 DD690 and 2 Expansion Shelves © Copyright 2010 EMC Corporation. All rights reserved. 18 Data Deduplication: Under The Hood Store more backups in a smaller footprint Friday Full Backup BACKUP DATA LOGICAL ESTIMATED REDUCTION PHYSICAL A B C D A E F G © Copyright 2010 EMC Corporation. All rights reserved. 19 Data Deduplication: Under The Hood Store more backups in a smaller footprint Friday Full Backup A B C D A E F G BACKUP DATA WEEKEND FULL LOGICAL 1 TB ESTIMATED REDUCTION 2- 4x PHYSICAL 250 GB A BCDE FG © Copyright 2010 EMC Corporation. All rights reserved. 20 Data Deduplication: Under The Hood Store more backups in a smaller footprint Friday Full Backup A B C D A E F G Mon Incr A B H BACKUP DATA WEEKEND FULL Monday Incr LOGICAL ESTIMATED REDUCTION PHYSICAL 1 TB 2- 4x 250 GB 100 GB 7-10x 10 GB A BCDE FGH © Copyright 2010 EMC Corporation. All rights reserved. 21 Data Deduplication: Under The Hood Store more backups in a smaller footprint Friday Full Backup BACKUP DATA A B C D A E F G WEEKEND FULL LOGICAL ESTIMATED REDUCTION PHYSICAL 1 TB 2- 4x 250 GB Mon Incr A B H Monday Incr 100 GB 7-10x 10 GB Tues Incr C B Tuesday Incr 100 GB 7-10x 10 GB I A BCDE FGH I © Copyright 2010 EMC Corporation. All rights reserved. 22 Data Deduplication: Under The Hood Store more backups in a smaller footprint Friday Full Backup BACKUP DATA A B C D A E F G WEEKEND FULL LOGICAL ESTIMATED REDUCTION PHYSICAL 1 TB 2- 4x 250 GB Mon Incr A B H Monday Incr 100 GB 7-10x 10 GB Tues Incr C B I Tuesday Incr 100 GB 7-10x 10 GB Weds Incr E G J Wednesday Incr 100 GB 7-10x 10 GB A BCDE FGH I J © Copyright 2010 EMC Corporation. All rights reserved. 23 Data Deduplication: Under The Hood Store more backups in a smaller footprint Friday Full Backup BACKUP DATA A B C D A E F G WEEKEND FULL LOGICAL ESTIMATED REDUCTION PHYSICAL 1 TB 2- 4x 250 GB Mon Incr A B H Monday Incr 100 GB 7-10x 10 GB Tues Incr C B I Tuesday Incr 100 GB 7-10x 10 GB Weds Incr E G J Wednesday Incr 100 GB 7-10x 10 GB Thurs Incr A C K Thursday Incr 100 GB 7-10x 10 GB A BCDE FGH I J K © Copyright 2010 EMC Corporation. All rights reserved. 24 Data Deduplication: Under The Hood Store more backups in a smaller footprint Friday Full Backup BACKUP DATA A B C D A E F G WEEKEND FULL LOGICAL ESTIMATED REDUCTION PHYSICAL 1 TB 2- 4x 250 GB Mon Incr A B H Monday Incr 100 GB 7-10x 10 GB Tues Incr C B I Tuesday Incr 100 GB 7-10x 10 GB Weds Incr E G J Wednesday Incr 100 GB 7-10x 10 GB Thurs Incr A C K Thursday Incr 100 GB 7-10x 10 GB 1 TB 50-60x 18 GB 2.4 TB 7.8x 308 GB Second Friday Full Backup B C D E F 2nd WEEKEND FULL L G H TOTAL A BCDE FGH I J K L © Copyright 2010 EMC Corporation. All rights reserved. 25 Deduplication at Backup / Recovery Speeds • 102,400 lookups/sec.: is it already in the multi-TB store? – 800 MB/sec, for 8KB segment: >100K lookups per second – If random lookups: very slow – Each disk: 1 MB/sec • E.g. for a 7.2KB RPM Seagate 1TB SATA drive: <120 seeks/second • 120/second @ 8KB segment: 0.96 MB/sec/disk – Would need 830 disks to go 800 MB/sec (like most dedupers today) • This is 4 full height racks of nothing but SATA disks (16 drives / 3U rack) Same performance, in less than half a full height rack © Copyright 2010 EMC Corporation. All rights reserved. 26 SISL Scaling Architecture • Data Domain answer: SISL™ – Stream-informed Segment Layout includes: • Summary Vector in RAM says if segment is new • Segment Localities minimize seeks if answer is on disk • Check uniqueness with Summary Vector • Check in-memory fingerprint cache • Key results See: http://www.usenix.org/events/fast08/tech/full_papers/zhu/zhu.pdf © Copyright 2010 EMC Corporation. All rights reserved. 27 Data Integrity: Data Invulnerability Architecture Trust but verify – hope is not a strategy © Copyright 2010 EMC Corporation. All rights reserved. 28 Data Integrity: Data Invulnerability Architecture Trust but verify – hope is not a strategy • Data verification – – – CheckSum Dedupe, write to disk Verify © Copyright 2010 EMC Corporation. All rights reserved. 29 Data Integrity: Data Invulnerability Architecture Trust but verify – hope is not a strategy • Data verification – – – CheckSum Dedupe, write to disk Verify • Self-healing file system – – – – Cleaning Expired data Defrag Verify © Copyright 2010 EMC Corporation. All rights reserved. 30 Data Integrity: Data Invulnerability Architecture Trust but verify – hope is not a strategy • Data verification – – – CheckSum Dedupe, write to disk Verify • Self-healing file system – – – – Cleaning Expired data Defrag Verify • Other – – – RAID-6 NVRAM Snapshots © Copyright 2010 EMC Corporation. All rights reserved. 31 Weekly Full Backup – With Deduplication Weekly: Full image backups RMAN> ALLOCATE CHANNEL CH1 DEVICE TYPE DISK FORMAT ‘/dd/backup/ora.weekly/%U’; RMAN> ALLOCATE CHANNEL CH2 DEVICE TYPE DISK FORMAT ‘/dd/backup/ora.weekly/%U’; RMAN> BACKUP AS COPY TAG ‘MAY9’ DATABASE INCLUDE CURRENT CONTROLFILE; RMAN> BACKUP ARCHIVELOGS TAG ‘MAY9’ ALL NOT BACKED UP DELETE ALL INPUT; Before: Without deduplication 1 TB 1 TB 1 TB Disk Disk Disk Target DB 1 TB © Copyright 2010 EMC Corporation. All rights reserved. 32 Weekly Full Backup – With Deduplication Weekly: Full image backups RMAN> ALLOCATE CHANNEL CH1 DEVICE TYPE DISK FORMAT ‘/dd/backup/ora.weekly/%U’; RMAN> ALLOCATE CHANNEL CH2 DEVICE TYPE DISK FORMAT ‘/dd/backup/ora.weekly/%U’; RMAN> BACKUP AS COPY TAG ‘MAY9’ DATABASE INCLUDE CURRENT CONTROLFILE; RMAN> BACKUP ARCHIVELOGS TAG ‘MAY9’ ALL NOT BACKED UP DELETE ALL INPUT; After: With deduplication Full 500 GB Target DB 1 TB © Copyright 2010 EMC Corporation. All rights reserved. 33 Weekly Full Backup – With Deduplication Weekly: Full image backups RMAN> ALLOCATE CHANNEL CH1 DEVICE TYPE DISK FORMAT ‘/dd/backup/ora.weekly/%U’; RMAN> ALLOCATE CHANNEL CH2 DEVICE TYPE DISK FORMAT ‘/dd/backup/ora.weekly/%U’; RMAN> BACKUP AS COPY TAG ‘MAY9’ DATABASE INCLUDE CURRENT CONTROLFILE; RMAN> BACKUP ARCHIVELOGS TAG ‘MAY9’ ALL NOT BACKED UP DELETE ALL INPUT; After: With deduplication Full 500 GB Deduplication applied to fulls requiring much less disk Target DB 1 TB © Copyright 2010 EMC Corporation. All rights reserved. 34 Daily Incremental Backup – With Deduplication Daily: Incremental backup with update RMAN> ALLOCATE CHANNEL CH1 DEVICE TYPE DISK FORMAT ‘/dd/backup/ora.weekly/%U’; RMAN> ALLOCATE CHANNEL CH2 DEVICE TYPE DISK FORMAT ‘/dd/backup/ora.weekly/%U’; RMAN> BACKUP INCREMENTAL DATABASE …. Incrementals after deduplication 500 GB Weekly Fulls Target DB 1 TB © Copyright 2010 EMC Corporation. All rights reserved. 35 Daily Incremental Backup – With Deduplication Daily: Incremental backup with update RMAN> ALLOCATE CHANNEL CH1 DEVICE TYPE DISK FORMAT ‘/dd/backup/ora.weekly/%U’; RMAN> ALLOCATE CHANNEL CH2 DEVICE TYPE DISK FORMAT ‘/dd/backup/ora.weekly/%U’; RMAN> BACKUP INCREMENTAL DATABASE …. Incrementals after deduplication 500 GB Weekly Fulls Daily Incrementals Target DB 1 TB © Copyright 2010 EMC Corporation. All rights reserved. 36 Benefits of Using Inline Deduplication • Daily fulls instead of incrementals • Longer online retention of full backup images • No requirement for other backup software plug-ins • Non disruptive introduction into RMAN process • Simplified replication using EMC Data Domain Replicator • Ideal location for Flashback Recovery Area • Incrementally updated backups © Copyright 2010 EMC Corporation. All rights reserved. 37 Incremental Merge EMC Data Domain combines rapid cloning and deduplication • Nightly take cumulative incremental; apply to snapshot of weekly full First image backup 2X storage savings DB Full • Results in up-to-date nightly full images available for rapid recovery Snap Snap Snap Snap Snap Incremental Snap Sun © Copyright 2010 EMC Corporation. All rights reserved. Mon Tue Wed Thu Fri Sat 38 Alternatives • RMAN to 3rd Party Backup MML/SBT – Makes random access disk look like streaming tape. Not cost effective – Limits the options available to the DBA for backing up, cloning, merging, etc. – RMAN communicates through MML to agent on 3rd party backup application media server – Can front a Data Domain system as either disk-based backup or VTL – All RMAN I/O is via MML which presents only a DEVICE TYPE TAPE Data Domain can work with an enterprise backup application but does not require it. © Copyright 2010 EMC Corporation. All rights reserved. 39 Oracle Customers Leveraging Data Domain CN Rail – A Large Canadian Freight Railroad • RMAN and db2backup to NFS mounts on DD690 and DD880 systems replicating to DD690 and DD580 systems • 4 week retention schedule, 10% average daily change rate, archive logs backed up hourly • All database backups replicated again to SunGard • Sustained deduplication ratios of >6:1 (includes archive logs) © Copyright 2010 EMC Corporation. All rights reserved. 40 Oracle Customers Leveraging Data Domain Large California-based clothing retailer • Oracle tablespace archive for data warehouse • Large table spaces are written to Data Domain systems and flagged as read only and used for data mining • Sustained deduplication ratios of >7:1 © Copyright 2010 EMC Corporation. All rights reserved. 41 Oracle RMAN to Disk National supermarket chain testimonial “EMC Data Domain is just disk to me. Changing RMAN scripts to go straight to Data Domain disk was simple.” DBA Manager “We used to have to go through our backup team for recovery requests and 90% of our actual restore time was spent waiting on tape and administration. With Data Domain, I don't have to wait for someone else to satisfy a restore request or a tape recall.” DBA Manager © Copyright 2010 EMC Corporation. All rights reserved. 42 Summary • How deduplication benefits Oracle/RMAN backups • Simplification • Savings © Copyright 2010 EMC Corporation. All rights reserved. 43 More Information… • For more information on EMC Data Domain and Oracle, visit http://www.datadomain.com/solutions/oracle.html © Copyright 2010 EMC Corporation. All rights reserved. 44 © Copyright 2010 EMC Corporation. All rights reserved. 45