EMC BACKUP AND RECOVERY SOLUTIONS Backup to the future © Copyright 2011 EMC Corporation. All rights reserved. 1 Agenda • EMC backup and recovery solutions – Backup Recovery Systems (BRS) division profile – The transition from tape to disk – Backup and recovery in the enterprise • Enterprise backup and recovery – EMC NetWorker – EMC Disk Library and Disk Library for mainframe • Deduplication: Enabling next-generation backup – EMC Avamar – EMC Data Domain © Copyright 2011 EMC Corporation. All rights reserved. 2 EMC Backup Recovery Systems Division • Division HQ: Santa Clara, CA • 10 R&D locations – 2,000 employees • Data protection storage systems – More than 60,000 systems installed – More than 45,000 customers – More than 15,000 PB under protection worldwide • Global sales, support, and services – Approximately 6,000 channel partners © Copyright 2011 EMC Corporation. All rights reserved. 3 EMC Backup and Recovery Market Position • Avamar – #1 deduplication backup software worldwide – 8,000 installations – 4,400 customers • Data Domain – #1 deduplication storage worldwide – 12,000 installations – 5,100 customers • Disk Library – #1 virtual tape library (VTL) worldwide – >$1B in sales • NetWorker – Top three enterprise backup software – 30,000 customers © Copyright 2011 EMC Corporation. All rights reserved. 4 Backup and Recovery Architectures: In Transition from Tape to Disk Backup/Recovery Architecture Conventional (Tape-centric) Application Backup Clients Backup/Media Manager Onsite Backup Storage Disaster Recovery Storage Backup NetWorker software Tape Tape Backup software NetWorker Disk VTL Library VTL/Tape DB Home Backup software NetWorker Deduplication storage Data Domain Transformational (Disk-centric) Deduplication backup software and system Avamar Data Protection Data Protection Management Advisor Software on premise © Copyright 2011 EMC Corporation. All rights reserved. off premise 5 Backup and Recovery: Hot Spot in the Enterprise • Secular shift from tapecentric to disk/networkcentric approaches F1000: What are your top storage pain points? • Enabler: Massive data deduplication and compression techniques • Unabated growth of enterprise data overwhelms legacy infrastructure • Server virtualization is another catalyst Deduplication helps optimize many of these initiatives Source: TheInfoPro, Wave 14 Storage Study, Q2 2010, published August 19, 2010; n=166 (8/11/10 F1000 sample). Note that due to multiple responses per interview, total exceeds 100%. © Copyright 2011 EMC Corporation. All rights reserved. 6 EMC NETWORKER © Copyright 2011 EMC Corporation. All rights reserved. 7 NetWorker Backup/Recovery Architecture Conventional (Tape-centric) Application Backup Clients Backup/Media Manager Onsite Backup Storage Disaster Recovery Storage Backup NetWorker software Tape Tape Backup NetWorker software Disk VTL Library VTL/Tape DB Home Deduplication storage Data Domain Transformational (Disk-centric) NetWorker Deduplication backup Avamar software and system Data Protection Advisor Data Protection Management Software on premise © Copyright 2011 EMC Corporation. All rights reserved. off premise 8 NetWorker Backup and Recovery Software Unified backup software • Common platform – – – – Backup to disk Backup to tape Snapshot management Replication management • Integrated deduplication support – Integrated Avamar client services – Data Domain Boost integration • Simplified, centralized management • Broad, heterogeneous platform support • Enterprise-wide deployment experience – Mid-market to enterprise – Small to very, very large © Copyright 2011 EMC Corporation. All rights reserved. 9 Centralized Management DEDUPLICATION APPLICATION SUPPORT Oracle SAP Microsoft VIRTUALIZATION Data Domain Avamar NetWorker Cloud Tape FILE SYSTEMS EMC STORAGE PLATFORMS AND SERVER VNX Family RECOVERY Disk Library Family Centera © Copyright 2011 EMC Corporation. All rights reserved. REMOTE AND BRANCH OFFICES Symmetrix 10 NetWorker Differentiation • Centralized control for all backup requirements • Seamless integration with industry’s two leading deduplication solutions • Advanced application and virtual environment support • Reliable recoverability © Copyright 2011 EMC Corporation. All rights reserved. 11 EMC DISK LIBRARY © Copyright 2011 EMC Corporation. All rights reserved. 12 EMC Disk Library Backup/Recovery Architecture Conventional (Tape-centric) Application Backup Clients DB Backup/Media Manager Onsite Backup Storage Disaster Recovery Storage NetWorker Tape Tape Symantec Disk VTL Library VTL/Tape TSM Home Transformational (Disk-centric) Other 3rd Party Data Domain Deduplication backup Avamar software and system Data Protection Advisor Data Protection Management Software on premise © Copyright 2011 EMC Corporation. All rights reserved. off premise 13 Disk Library DL5200 EMC DISK LIBRARY • Based on proven CLARiiON CX4 array Industry’s most popular virtual tape library • Nearly 3 PB of logical capacity – 1 TB or 2 TB SATA drives – Up to 945 drives in a single array • Enhanced system throughput, up to 10.2 TB per hour – Faster hardware compression – Front-end 8 Gb/s Fibre Channel ports • Improved energy efficiency – High-density drives and Spin Down reduce per-terabyte disk drive energy requirements – New high-density racks—twice the capacity in same footprint • Easy integration into existing infrastructure – Central management with other Disk Library systems – Replication between Disk Library systems © Copyright 2011 EMC Corporation. All rights reserved. 14 DL5200 DL5200 Storage CLARiiON CX4960 Maximum capacity (usable) 1.4 PB Maximum performance (compressed) 10.2 TB per hour Fibre Channel connectivity 8 Gb/s Engines 2 Back-end arrays 1 Drive size 1 TB/2 TB Active engine failover Replication Consolidated media management Hardware compression Spin Down © Copyright 2011 EMC Corporation. All rights reserved. 15 Disk Library Differentiation • Industry-leading open systems virtual tape library – More than 500 PB deployed – More than 2,500 customers worldwide • Industry’s most energy-efficient virtual tape library – Drive Spin Down reduces power and cooling costs • Consolidated media management – EMC NetWorker and Symantec NetBackup integrated • Most qualified backup environments – More than three million supported configurations © Copyright 2011 EMC Corporation. All rights reserved. 16 EMC DISK LIBRARY FOR MAINFRAME © Copyright 2011 EMC Corporation. All rights reserved. 17 EMC Disk Library for Mainframe • True IBM tape emulation • Transparent to mainframe operations • Leverages low-cost SATA II technology IBM mainframe EMC Disk Library for mainframe • High-performance read and write • Unmatched remote replication capability • EMC-branded product – – – – © Copyright 2011 EMC Corporation. All rights reserved. QA/tested by EMC Manufactured by EMC Maintained by EMC Professional Services by EMC 18 EMC Disk Library for Mainframe Family DLm120 DLm960 Number of VTEs 1 or 2 1–6 Connectivity FICON FICON Number of channels to host 2 or 4 2–12 Number of virtual tape drives Up to 512 Up to 1,536 Maximum capacity (usable) 9.5 TB–96.5 TB 19.3 TB–1.2 PB Performance Up to 400 MB/s Up to 1.2 GB/s Number of cabinets 1 2–13 with 1 TB 2–9 with 2 TB Replication Hardware compression © Copyright 2011 EMC Corporation. All rights reserved. 19 DLm960 with Deduplication Storage Expansion Option EMC DISK LIBRARY FOR MAINFRAME and industry’s most popular deduplication system • Based on proven Data Domain DD880 • Nearly 3.5 PBs of logical capacity • System throughput up to 4.3 TB per hour – Hardware compression – Deduplication • Reliability designed for the data center DLm960 Deduplication Storage Expansion Option – Multipath for access to all tapes – Data Domain Data Invulnerability Architecture – Call home for support • Easy integration into existing infrastructure – Behaves like a tape library to the application – Low bandwidth replication between disk library systems – No changes to current management process © Copyright 2011 EMC Corporation. All rights reserved. 20 Disk Library for Mainframe Differentiation • Eliminates all issues related to traditional tape handling – Eliminates manual intervention, physical movement of tape cartridges, robotic issues, and single points of failure • Works seamlessly with existing applications – Uses existing tape management processes to automate tape vaulting • Significantly improves performance – Reallocates all of the data to disk and uses smart I/O buffering, allowing potentially significant reductions in batch windows • Extends disaster recovery capabilities to the tape workload – Utilizes array-based replication process over IP to seamlessly move tapes offsite • Provides deduplication for backup and archive workloads – Gain longer onsite retention, optimize replication, and lower overall disk storage costs • Easily scales as the workload increases – No need for additional subsystems, libraries, network connections, etc. © Copyright 2011 EMC Corporation. All rights reserved. 21 EMC AVAMAR AND EMC DATA DOMAIN Enabling next-generation data protection with deduplication © Copyright 2011 EMC Corporation. All rights reserved. 22 EMC Avamar and EMC Data Domain Retain, replicate, recover Deduplicate everything without changing anything Simplify backup, archiving, and disaster recovery with easy integration across workloads, infrastructures, and backup software Data Domain Deduplication Storage Systems © Copyright 2011 EMC Corporation. All rights reserved. Never back up the same data twice Revolutionize your backup by moving less data to solve your toughest VMware, NAS, remote office, and desktop/laptop backup challenges Avamar Deduplication Backup Software 23 Data Reduction/Deduplication: F1000 The “in-use” rating for EMC is now over threetimes that of its nearest competitor Source: TheInfoPro, Wave 14 Storage Study – Q2 2010, published August 19, 2010; n=146 (7/6/10 F1000 sample) © Copyright 2011 EMC Corporation. All rights reserved. 24 Deduplication Impact on Data Size Deduplication 10–30 times less data stored versus fulls plus incrementals with typical retention policies Data Stored 30 20 10 0 1 5 10 15 20 Weeks in Use Deduplication storage Traditional storage © Copyright 2011 EMC Corporation. All rights reserved. 25 Data Deduplication: Technology Overview Store more backups in a smaller footprint Friday Full Backup A B C D A E F G Mon Incremental Tues Incremental Weds Incremental Thurs Incremental A C E A B H B G C Backup Data Logical Estimated Reduction Physical FRIDAY FULL 1 TB 2–4x 250 GB Monday Incremental 100 GB 7–10x 10 GB Tuesday Incremental 100 GB 7–10x 10 GB Wednesday Incremental 100 GB 7–10x 10 GB Thursday Incremental 100 GB 7–10x 10 GB Second FRIDAY FULL 1 TB 50–60x 18 GB 2.4 TB 7.8x 308 GB I J K Second Friday Full Backup B C D E F L G H TOTAL A BCDE FGH I J K L © Copyright 2011 EMC Corporation. All rights reserved. 26 It’s Not All Deduplication Out There Regular storage array 1:1 Whitespace reduction File level Fixed blocks, snapshots Backup target, variable segment LZ compression ~ 2:1 Single instance storage ~ 3:1 Fixed block ~ 3:1 Variable segment ~20:1 Deduplication significantly reduces: • Replication WAN bandwidth • Power • Heat • Cooling • Management © Copyright 2011 EMC Corporation. All rights reserved. 27 Deduplication Enables Next-Generation Storage Architectures Storage 1.0 When did you implement this? What made you evolve? Tape Primary Disk Storage 2.0 Why did you add SATA? What did you learn? Primary Disk Tape SATA Storage 3.0 Backup/recover plus archive from disk (shrink primary) Tape: monthly Primary Deduplicate SATA Before After Tape Storage 4.0 Flash for primary Everything else to deduplicate © Copyright 2011 EMC Corporation. All rights reserved. Flash Deduplicate SATA Before After 28 EMC AVAMAR © Copyright 2011 EMC Corporation. All rights reserved. 29 Backup and Recovery Architectures: In Transition from Tape to Disk Backup/Recovery Architecture Conventional (Tape-centric) Application Backup Clients Backup/Media Manager Onsite Backup Storage Disaster Recovery Storage Backup NetWorker software Tape Tape Backup NetWorker software Disk VTL Library VTL/Tape DB Home Deduplication storage Data Domain Transformational (Disk-centric) NetWorker Deduplication backup Avamar software and system Data Protection Advisor Data Protection Management Software on premise © Copyright 2011 EMC Corporation. All rights reserved. off premise 30 Avamar Deduplication backup software and system Avamar VM • End-to-end, software/hardware solution – Integrated system for simple, predictable results – Client-side, global deduplication; within and across clients • Improves backup window, less network load – Backup process minimizes data sent and stored – Reduces network and virtual infrastructure stress • Integrated high availability and reliability Full backups, every time: one-step recovery Higher backup success rate and reliability Increased ROI, lower TCO, less risk © Copyright 2011 EMC Corporation. All rights reserved. – RAIN (redundant array of independent nodes) architecture for high availability and fault tolerance – Recoverability verified daily – Disaster recovery through replication • Flexible deployment options – Avamar Data Store – Avamar Virtual Edition – Agent-only for remote office/branch office (ROBO) 31 Avamar Family UNIFIED MANAGEMEN T EXAMPLE USE CASES VMware Remote/Branch Offices EMC Data Protection Advisor NAS/NDMP Desktop/Laptop CLIENTS Lotus Notes IBM DB2 CORE PLATFORMS EMC NetWorker Avamar VM EMC Avamar © Copyright 2011 EMC Corporation. All rights reserved. EMC Avamar Data Store EMC Avamar Virtual Edition for VMware 32 Avamar Differentiation • Shorter backup windows – Less data moved reduces daily full backup times • Reduces required daily network bandwidth and client stress – Scalable VMware backup for greater server consolidation • Simple management – System deployment is easy, pre-configured, with predictable performance – Streamlined, centralized administration and management of remote backups • Single-step restore – Single-step restore for full backups; no need for full and incrementals • Recoverability guaranteed – Daily integrity checks, RAIN, and replication ensure recoverability, high availability © Copyright 2011 EMC Corporation. All rights reserved. 33 EMC DATA DOMAIN © Copyright 2011 EMC Corporation. All rights reserved. 34 EMC Data Domain Backup/Recovery Architecture Conventional (Tape-centric) Application Backup Clients DB Backup/Media Manager Onsite Backup Storage Disaster Recovery Storage NetWorker Tape Tape Symantec Disk VTL Library VTL/Tape TSM Home Transformational (Disk-centric) Other 3rd Party Data Domain Deduplication backup Avamar software and system Data Protection Advisor Data Protection Management Software on premise © Copyright 2011 EMC Corporation. All rights reserved. off premise 35 Data Domain Basics Easy integration with existing environment Control Tier Target Tier Backup and archive applications CIFS, NFS, NDMP, DD Boost EMC Ethernet Symantec Virtual Tape Library (VTL) over Fibre Channel CommVault Disaster Recovery Tier Replication Tivoli Software BakBone Software Vizioncore © Copyright 2011 EMC Corporation. All rights reserved. DD890 appliance DD890 appliance 2U 2 to 10 ports 10 and 1 Gigabit Ethernet; 8 Gb/s Fibre Channel RAID 6 Up to 285 TB usable capacity with shelves 2 TB or 1 TB 7.2K rpm SATA hard disk drives in shelf File system NVRAM N+1 fans and redundant, hot-plug power supplies 36 Industry’s Most Scalable Inline Deduplication Systems Global Deduplication Array DD800 Appliance Series DD Archiver DD600 Appliance Series Software options: DD Boost, DD Virtual Tape Library, DD Replicator, DD Retention Lock, and DD Encryption DD140 Remote Office Appliance DD140 DD610 DD630 DD670 DD860 DD890 Global Deduplication Array DD Archiver Speed (DD Boost) 490 GB/hr 1.3 TB/hr 2.1 TB/hr 5.4 TB/hr 9.8 TB/hr 14.7 TB/hr 26.3 TB/hr 9.8 TB/hr Speed (other) 450 GB/hr 675 GB/hr 1.1 TB/hr 3.6 TB/hr 5.1 TB/hr 8.1 TB/hr 10.7 TB/hr 4.3 TB/hr Logical capacity 9–43 TB 40–195 TB 84–420 TB 0.6–2.7 PB 1.4–7.1 PB 2.9–14.2 PB 5.7–28.5 PB 5.7–28.5 PB Raw capacity 1.5 TB Up to 6 TB Up to 12 TB Up to 76 TB Up to 192 TB Up to 384 TB Up to 768 TB Up to 768 TB Usable capacity 0.86 TB Up to 3.98 TB Up to 8.4 TB Up to 55.9 TB Up to 142 TB Up to 285 TB Up to 570 TB Up to 570 TB © Copyright 2011 EMC Corporation. All rights reserved. 37 Methodology: Inline versus Post-Process Deduplication INLINE POST- PROCESS Deduplication Before Storing Deduplication After Storing Deduplication Store Deduplication 3x disk accesses to shared store Other activities unimpeded − Predictable − Simpler The more processes, the more resource contention − − − − Copy to tape: Too slow to stream tape Recovery: Service level agreement predictability Replication: Poor time-to-disaster-recovery Deduplication: If interleaved with backup or restore More administration to fight these issues © Copyright 2011 EMC Corporation. All rights reserved. 38 Performance: CPU-Centric versus Spindle-Bound Data Domain Throughput MB/s 1,500 Improvement since 2004: Throughput: 175x Capacity: 450x Fibre Channel SATA Most deduplication vendors 50 50 100 150 200 Number of Disk Spindles © Copyright 2011 EMC Corporation. All rights reserved. 39 Data Domain Differentiation • Maturity – Simple – Consistent – Robust (e.g., policy-driven deduplication replication) • Product concept: purpose-built storage – Inline and simple appliance – System infrastructure – Application independent: backup, archive, and more • Architecture: fast, small, storage of last resort – CPU-centric for price/performance – Data protection from the ground up © Copyright 2011 EMC Corporation. All rights reserved. 40 THANK YOU © Copyright 2011 EMC Corporation. All rights reserved. 41