Backup and Recovery Module 4.2 © 2006 EMC Corporation. All rights reserved. Backup and Recovery Upon completion of this module, you will be able to: Describe best practices for planning Backup and Recovery. Describe the common media and types of data that are part of a Backup and Recovery strategy. Describe the common Backup and Recovery topologies. Describe the Backup and Recovery Process. Describe Management considerations for Backup and Recovery. © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 2 In this module … This module contains the following lessons: Planning for Backup and Recovery Backup and Recovery Methods Backup Architecture Topologies Managing the Backup Process © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 3 Lesson: Planning for Backup and Recovery Upon completion of this lesson, you be able to: Define Backup and Recovery. Describe common reasons for a Backup and Recovery plan. Describe the business considerations for Backup and Recovery. Define RPO and RTO. Describe the data considerations for Backup and Recovery Describe the planning for Backup and Recovery. © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 4 What is a Backup? Backup is an additional copy of data that can be used for restore and recovery purposes. The Backup copy is used when the primary copy is lost or corrupted. This Backup copy can be created as a: – Simple copy (there can be one or more copies) – Mirrored copy (the copy is always updated with whatever is written to the primary copy.) © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 5 Backup and Recovery Strategies Several choices are available to get the data to the backup media such as: Copy the data. Mirror (or snapshot) then copy. Remote backup. Copy then duplicate or remote copy. © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 6 It’s All About Recovery! Businesses back up their data to enable its recovery in case of potential loss. Businesses also back up their data to comply with regulatory requirements. Types of backup derivatives: – Disaster Recovery – Archival – Operational © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 7 Reasons for a Backup Plan Hardware Failures Human Factors Application Failures Security Breaches Disasters Regulatory and Business Requirements © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 8 How does Backup Work? Client/Server Relationship Server – Directs Operation – Maintains the Backup Catalog Client – Gathers Data for Backup (a backup client sends backup data to a backup server or storage node). Storage Node © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 9 How does Backup Work, continued Clients Servers Backup Clients Backup Server & Storage Node Metadata Catalog Disk Storage © 2006 EMC Corporation. All rights reserved. Data Set Tape Backup Backup and Recovery - 10 Business Considerations Customer business needs determine: – What are the restore requirements – RPO & RTO? – Where and when will the restores occur? – What are the most frequent restore requests? – Which data needs to be backed up? – How frequently should data be backed up? hourly, daily, weekly, monthly – How long will it take to backup? – How many copies to create? – How long to retain backup copies? © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 11 Data Considerations: File Characteristics Location Size Number © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 12 Data Considerations: Data Compression Compressibility depends on the data type, for example: Application binaries – do not compress well. Text – compresses well. JPEG/ZIP files – are already compressed and expand if compressed again. © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 13 Data Considerations: Retention Periods Operational – Data sets on primary media (disk) up to the point where most restore requests are satisfied, then moved to secondary storage (tape). Disaster Recovery – Driven by the organization’s disaster recovery policy Portable media (tapes) sent to an offsite location / vault. Replicated over to an offsite location (disk). Backed up directly to the offsite location (disk, tape or emulated tape). Archiving – Driven by the organization’s policy. – Dictated by regulatory requirements. © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 14 Lesson: Summary Topics in this lesson included: Backup and Recovery definitions and examples. Common reasons for Backup and Recovery. The business considerations for Backup and Recovery. Recovery Point Objectives and Recovery Time Objectives. The data considerations for Backup and Recovery The planning for Backup and Recovery. © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 15 Lesson: Backup and Recovery Methods Upon completion of this lesson, you be able to: Describe Hot and Cold Backups. Describe the levels of Backup Granularity. © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 16 Database Backup Methods Hot Backup: production is not interrupted. Cold Backup: production is interrupted. Backup Agents manage the backup of different data types such as: – Structured (such as databases) – Semi-structured (such as email) – Unstructured (file systems) © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 17 Backup Granularity and Levels Full Backup Cumulative (Differential) Incremental Full © 2006 EMC Corporation. All rights reserved. Cumulative Incremental Backup and Recovery - 18 Restoring an Incremental Backup Monday Tuesday Wednesday Thursday File 4 File 3 File 5 Incremental Incremental Incremental Files 1, 2, 3 Full Backup Files 1, 2, 3, 4, 5 Production Key Features – Files that have changed since the last full or incremental backup are backed up. – Fewest amount of files to be backed up, therefore faster backup and less storage space. – Longer restore because last full and all subsequent incremental backups must be applied. © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 19 Restoring a Cumulative Backup Monday Tuesday Wednesday Thursday File 4 Files 4, 5 Files 4, 5, 6 Cumulative Cumulative Cumulative Files 1, 2, 3 Full Backup Files 1, 2, 3, 4, 5, 6 Production Key Features – More files to be backed up, therefore it takes more time to backup and uses more storage space. – Much faster restore because only the last full and the last cumulative backup must be applied. © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 20 Lesson: Summary Topics in this lesson included: Hot and Cold Backups. The levels of Backup Granularity. © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 21 Lesson: Backup Architecture Topologies Upon completion of this lesson, you be able to: Describe DAS, LAN, SAN, Mixed topologies. Describe backup media considerations. © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 22 Backup Architecture Topologies There are 3 basic backup topologies: – Direct Attached Based Backup – LAN Based Backup – SAN Based Backup These topologies can be integrated, forming a “mixed” topology © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 23 Direct Attached Based Backups LAN Metadata Catalog Backup Server © 2006 EMC Corporation. All rights reserved. Data Storage Node Backup Client Media Backup Backup and Recovery - 24 LAN Based Backups Database Server Backup Client Mail Server Backup Client Metadata Data LAN Metadata Data Backup Server Storage Node Storage Node © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 25 SAN Based Backups (LAN Free) Mail Server Storage Node Backup Client LAN Data SAN Backup Device Metadata Data Backup Server © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 26 SAN/LAN Mixed Based Backups Storage Node Database Server Backup Client Mail Server Backup Client Data LAN Data SAN Metadata Data Backup Device Backup Server © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 27 Backup Media Tape – Traditional destination for backups – Sequential access – No protection Disk – Random access – Protected by the storage array (RAID, hot spare, etc) © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 28 Multiple Streams on Tape Media Data from Stream 1 Data from Stream 2 Data from Stream 3 Tape Multiple streams interleaved to achieve higher throughput on tape – Keeps the tape streaming, for maximum write performance – Helps prevent tape mechanical failure – Greatly increases time to restore © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 29 Backup to Disk Backup to disk minimizes tape in backup environments by using disk as the primary destination device – Cost benefits – No processes changes needed – Better service levels Backup to disk aligns backup strategy to RTO and RPO © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 30 Tape versus Disk – Restore Comparison 24 Minutes Disk Backup / Restore 108 Minutes Tape Backup / Restore 0 10 20 30 40 50 60 70 80 90 100 110 120 Recovery Time in Minutes* *Total time from point of failure to return of service to e-mail users Typical Scenario: 800 users, 75 MB mailbox 60 GB database Source: EMC Engineering and EMC IT © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 31 31 Three Backup / Restore Solutions based on RTO 2 Min. Local Replica / Clone Restore time 17 Min. 19 Minutes Log playback Backup on ATA 24 Min. 17 Min. 41 Minutes Backup on tape 108 Min. 17 Min. 125 Minutes 0 10 20 30 40 50 60 70 80 90 100 110 120 130 Recovery Time in Minutes* *Total time from point of failure to return of service to e-mail users Typical Scenario: 800 users, 75 MB mailbox 60 GB DB – restore time 500 MB logs – log playback © 2006 EMC Corporation. All rights reserved. Time of last image dictates the log playback time Larger data sets extend the recovery time (ATA and tape) Backup and Recovery - 32 Traditional Backup, Recovery and Archive Approach Production environment grows – Requires constant tuning and data placement to maintain performance Production – Need to add more tier-1 storage Backup environment grows – Backup windows get longer and jobs do not complete Backup Process – Restores take longer – Requires more tape drives and silos to keep up with service levels Archive environment grows Archive Process – Impact flexibility to retrieve content when requested – Requires more media, adding management cost – No investment protection for long term retention requirements © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 33 Differences Between Backup / Recovery & Archive Backup / Recovery Archive A secondary copy of information Primary copy of information Used for recovery operations Available for information retrieval Improves availability by enabling application to be restored to a specific point in time Adds operational efficiencies by moving fixed / unstructured content out of operational environment Typically short-term (weeks or months) Typically long-term (months, years, or decades) Data typically overwritten on periodic basis (e.g., monthly) Data typically maintained for analysis, value generation, or compliance Not for regulatory compliance— though some are forced to use Useful for compliance and should take into account informationretention policy © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 34 New Architecture for Backup, Recovery & Archive 1 Backup Process 3 4 2 Production 4 Archive Process Understand the environment Actively archive valuable information to tiered storage Back up active production information to disk Retrieve from archive or recover from backup © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 35 Lesson: Summary Topics in this lesson included: The DAS, LAN, SAN, and Mixed topologies. Backup media considerations. © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 36 Lesson: Managing the Backup Process Upon completion of this lesson, you be able to: Describe features and functions of common Backup/Recovery applications. Describe the Backup/Recovery process management considerations. Describe the importance of the information found in Backup Reports and in the Backup Catalog. © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 37 How a Typical Backup Application Works Backup clients are grouped and associated with a Backup schedule that determines when and which backup type will occur. Groups are associated with Pools, which determine which backup media will be used. Each backup media has a unique label. Information about the backup is written to the Backup Catalog during and after it completes. The Catalog shows: – when the Backup was performed, and – which media was used (label). Errors and other information is also written to a log. © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 38 Backup Application User Interfaces There are typically two types of user interfaces: Command Line Interface – CLI Graphical User Interfaces – GUI © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 39 Managing the Backup and Restore Process Running the B/R Application: Backup – The backup administrator configures it to be started, most (if not all) of the times, automatically – Most backup products offer the ability for the backup client to initiate their own backup (usually disabled) Running the B/R Application: Restore – There is usually a separate GUI to manage the restore process – Information is pulled from the backup catalog when the user is selecting the files to be restored – Once the selection is finished, the backup server starts reading from the required backup media, and the files are sent to the backup client © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 40 Backup Reports Backup products also offer reporting features. These features rely on the backup catalog and log files. Reports are meant to be easy to read and provide important information such as: – Amount of data backed up – Number of completed backups – Number of incomplete backups (failed) – Types of errors that may have occurred Additional reports may be available, depending on the backup software product used. © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 41 Importance of the Backup Catalog As you can see, backup operations strongly rely on the backup catalog If the catalog is lost, the backup software alone has no means to determine where to find a specific file backed up two months ago, for example It can be reconstructed, but this usually means that all of the backup media (i.e. tapes) have to be read It’s a good practice to protect the catalog – By replicating the file system where it resides to a remote location – By backing it up Some backup products have built-in mechanisms to protect their catalog (such as automatic backup) © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 42 Lesson: Summary Topics in this lesson included: The features and functions of common Backup/Recovery applications. The Backup/Recovery process management considerations. The importance of the information found in Backup Reports and in the Backup Catalog. © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 43 Module Summary Key points covered in this module: The best practices for planning Backup and Recovery. The common media and types of data that are part of a Backup and Recovery strategy. The common Backup and Recovery topologies. The Backup and Recovery Process. Management considerations for Backup and Recovery. © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 44 Check Your Knowledge What are three reasons for doing a Backup plan? What are the three topologies that support creating a Backup set? What are the advantages and disadvantages of using tape as the Backup media? What are the three levels of granularity found in Backups? © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 45 Apply Your Knowledge… Upon completion of this topic, you will be able to: Describe EMC’s product implementation of a Backup and Recovery solution. © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 46 EMC NetWorker Tiered Protection and Recovery Management Remove risk Faster and more consistent data backup Improve reliability Keep recovery copies fresh and reduce process errors Lower total cost of ownership Basic Tape backup and recovery Low Backup to disk Advanced backup Disk-backup option Snapshot management SERVICE-LEVEL REQUIREMENTS Centralization and ease of use © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 47 High NetWorker Backup and Recovery Solution Features Enterprise protection Basic Architecture Heterogeneous clients – Critical applications – Heterogeneous platforms and storage – Scalable architecture – 256-bit AES encryption and secure authentication Key applications LAN Backup server NAS (NDMP) SAN Storage Node Centralized management – Graphical user interface – Customizable reporting – Wizard-driven configuration Performance Tape library © 2006 EMC Corporation. All rights reserved. – Data multiplexing – Advanced indexing – Efficient media management Backup and Recovery - 48 Critical Application and Database Protection Backup without Application Modules Backup with NetWorker Application Modules Offline (Cold) Restart application SAVE 24x7 OPERATIONS Application NetWorker MODULE Back up application Application DOWNTIME Shut down application Application Integration with application APIs for backup and recovery © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 49 Media-Management Advantages Open Tape Format – Datastream multiplexing – Self-contained indexing – Cross-platform format UNIX Windows Linux – Minimize impact of tape corruption Dynamic drive sharing – Cross-platform tape-drive sharing – On-demand device usage – Reduce hardware total cost of ownership © 2006 EMC Corporation. All rights reserved. NetWorker UNIX/Linux NetWorker Windows Backup and Recovery - 50 NetWorker DiskBackup Option Backup-to-Disk Architecture Heterogeneous clients Key applications Backup server SAN – Simultaneous-access operations – No penalty on restore versus tape Policy-based migration of data from disk to tape LAN NAS High performance Storage Node – Automated staging and cloning – Up to 50% faster – Clone backups jobs as they complete – Reduce wear and tear on tape drives and cartridges Superior capability Tape library © 2006 EMC Corporation. All rights reserved. Diskbackup target – Operational backup and recovery for all clients, including NAS with NDMP – Direct file access for fast recovery Backup and Recovery - 51 Advanced Backup - Snapshots and CDP Integration of backup with snapshots, full-volume mirrors, Production and Continuous Data Protection (CDP) server Instant restore Off-host backups Production information Achieve stringent recovery-time objectives (RTOs), recovery-point objectives (RPOs) Recover It is expected that snapshot technology for data protection will surpass backup to tape as the trend in data protection as organizations continue to focus on recovery times © 2006 EMC Corporation. All rights reserved. Backup Snapshot 5:00 p.m. Snapshot 11:00 a.m. Backup snap 10:00 p.m. Backup server Backup and Recovery - 52 NetWorker PowerSnap Module Policy-based management Advanced Backup Heterogeneous clients – Administer snapshots in NetWorker – Schedule, create, retain, and delete snapshots by policy Key applications Third-party integration LAN Backup server NAS SAN Tape library Storage Node – Leverage third-party replication technology Array-based (Symmetrix DMX, CLARiiON CX, etc.) Software-based (RecoverPoint) Application recovery CLARiiON with SnapView – Integration with Application Modules to ensure consistent state Exchange / SQL / Oracle / SAP © 2006 EMC Corporation. All rights reserved. Backup and Recovery - 53 NetWorker SnapImage Module Advanced Backup Block-level backups – Host-based snapshot – Targeted at high-density file systems – Single-file restore – Sparse backups High performance – Significant backup-and-restore performance impact—up to 10 times faster 10,000,000+ files 1,000,000+ directories © 2006 EMC Corporation. All rights reserved. – Drive tape at rated speeds – Optional network-accelerated serverless backup with Cisco intelligent switch Backup and Recovery - 54 Solution Example: Major Telecom Company Enterprise-Information Protection Solution: Business Challenge: NetWorker PowerSnap with Symmetrix and TimeFinder/Snap Complex application environment – Server-free backup No backup window Recovery-time objective: Restore 24 TB in two hours NetWorker DiskBackup Option with CLARiiON with ATA disks – Rapid primary-site protection NetWorker and SRDF/S Disaster-Recovery Site – Disaster recovery – Offsite protection Production Site NetWorker Disasterrecovery host Tape library Storage Node Application host Storage Node PowerSnap SAN SAN Tape library SRDF/S Symmetrix DMX © 2006 EMC Corporation. All rights reserved. Symmetrix DMX CLARiiON CX Value Proposition Zero backup window for applications Eliminated data-loss risk Reduced management overhead Backup and Recovery - 55 55