Backup and Recovery
Module 4.2
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery
Upon completion of this module, you will be able to:
 Describe best practices for planning Backup and
Recovery.
 Describe the common media and types of data that are
part of a Backup and Recovery strategy.
 Describe the common Backup and Recovery topologies.
 Describe the Backup and Recovery Process.
 Describe Management considerations for Backup and
Recovery.
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 2
In this module …
This module contains the following lessons:
 Planning for Backup and Recovery
 Backup and Recovery Methods
 Backup Architecture Topologies
 Managing the Backup Process
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 3
Lesson: Planning for Backup and Recovery
Upon completion of this lesson, you be able to:
 Define Backup and Recovery.
 Describe common reasons for a Backup and Recovery
plan.
 Describe the business considerations for Backup and
Recovery.
 Define RPO and RTO.
 Describe the data considerations for Backup and
Recovery
 Describe the planning for Backup and Recovery.
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 4
What is a Backup?
 Backup is an additional copy of data that can be used for
restore and recovery purposes.
 The Backup copy is used when the primary copy is lost
or corrupted.
 This Backup copy can be created as a:
– Simple copy (there can be one or more copies)
– Mirrored copy (the copy is always updated with whatever is written
to the primary copy.)
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 5
Backup and Recovery Strategies
Several choices are available to get the data to the backup
media such as:
 Copy the data.
 Mirror (or snapshot) then copy.
 Remote backup.
 Copy then duplicate or remote copy.
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 6
It’s All About Recovery!
 Businesses back up their data to enable its recovery in
case of potential loss.
 Businesses also back up their data to comply with
regulatory requirements.
 Types of backup derivatives:
– Disaster Recovery
– Archival
– Operational
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 7
Reasons for a Backup Plan
 Hardware Failures
 Human Factors
 Application Failures
 Security Breaches
 Disasters
 Regulatory and Business Requirements
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 8
How does Backup Work?
 Client/Server Relationship
 Server
– Directs Operation
– Maintains the Backup Catalog
 Client
– Gathers Data for Backup (a backup client sends backup data to a
backup server or storage node).
 Storage Node
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 9
How does Backup Work, continued
Clients
Servers
Backup Clients
Backup Server
& Storage Node
Metadata
Catalog
Disk
Storage
© 2006 EMC Corporation. All rights reserved.
Data Set
Tape
Backup
Backup and Recovery - 10
Business Considerations
 Customer business needs determine:
– What are the restore requirements – RPO & RTO?
– Where and when will the restores occur?
– What are the most frequent restore requests?
– Which data needs to be backed up?
– How frequently should data be backed up?
 hourly, daily, weekly, monthly
– How long will it take to backup?
– How many copies to create?
– How long to retain backup copies?
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 11
Data Considerations: File Characteristics
 Location
 Size
 Number
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 12
Data Considerations: Data Compression
Compressibility depends on the data type, for example:
 Application binaries – do not compress well.
 Text – compresses well.
 JPEG/ZIP files – are already compressed and expand if
compressed again.
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 13
Data Considerations: Retention Periods
 Operational
– Data sets on primary media (disk) up to the point where most restore
requests are satisfied, then moved to secondary storage (tape).
 Disaster Recovery
– Driven by the organization’s disaster recovery policy
 Portable media (tapes) sent to an offsite location / vault.
 Replicated over to an offsite location (disk).
 Backed up directly to the offsite location (disk, tape or emulated tape).
 Archiving
– Driven by the organization’s policy.
– Dictated by regulatory requirements.
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 14
Lesson: Summary
Topics in this lesson included:
 Backup and Recovery definitions and examples.
 Common reasons for Backup and Recovery.
 The business considerations for Backup and Recovery.
 Recovery Point Objectives and Recovery Time
Objectives.
 The data considerations for Backup and Recovery
 The planning for Backup and Recovery.
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 15
Lesson: Backup and Recovery Methods
Upon completion of this lesson, you be able to:
 Describe Hot and Cold Backups.
 Describe the levels of Backup Granularity.
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 16
Database Backup Methods
 Hot Backup: production is not interrupted.
 Cold Backup: production is interrupted.
 Backup Agents manage the backup of different data
types such as:
– Structured (such as databases)
– Semi-structured (such as email)
– Unstructured (file systems)
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 17
Backup Granularity and Levels
Full Backup
Cumulative (Differential)
Incremental
Full
© 2006 EMC Corporation. All rights reserved.
Cumulative
Incremental
Backup and Recovery - 18
Restoring an Incremental Backup
Monday
Tuesday
Wednesday
Thursday
File 4
File 3
File 5
Incremental
Incremental
Incremental
Files 1, 2, 3
Full Backup
Files 1, 2, 3, 4, 5
Production
 Key Features
– Files that have changed since the last full or incremental backup are
backed up.
– Fewest amount of files to be backed up, therefore faster backup and less
storage space.
– Longer restore because last full and all subsequent incremental backups
must be applied.
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 19
Restoring a Cumulative Backup
Monday
Tuesday
Wednesday
Thursday
File 4
Files 4, 5
Files 4, 5, 6
Cumulative
Cumulative
Cumulative
Files 1, 2, 3
Full Backup
Files 1, 2, 3, 4, 5, 6
Production
 Key Features
– More files to be backed up, therefore it takes more time to backup
and uses more storage space.
– Much faster restore because only the last full and the last cumulative
backup must be applied.
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 20
Lesson: Summary
Topics in this lesson included:
 Hot and Cold Backups.
 The levels of Backup Granularity.
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 21
Lesson: Backup Architecture Topologies
Upon completion of this lesson, you be able to:
 Describe DAS, LAN, SAN, Mixed topologies.
 Describe backup media considerations.
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 22
Backup Architecture Topologies
 There are 3 basic backup topologies:
– Direct Attached Based Backup
– LAN Based Backup
– SAN Based Backup
 These topologies can be integrated, forming a “mixed”
topology
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 23
Direct Attached Based Backups
LAN
Metadata
Catalog
Backup Server
© 2006 EMC Corporation. All rights reserved.
Data
Storage Node
Backup Client
Media
Backup
Backup and Recovery - 24
LAN Based Backups
Database Server
Backup Client
Mail Server
Backup Client
Metadata
Data
LAN
Metadata
Data
Backup Server
Storage Node
Storage Node
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 25
SAN Based Backups (LAN Free)
Mail Server
Storage Node
Backup Client
LAN
Data
SAN
Backup
Device
Metadata
Data
Backup Server
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 26
SAN/LAN Mixed Based Backups
Storage Node
Database Server
Backup Client
Mail Server
Backup Client
Data
LAN
Data
SAN
Metadata
Data
Backup
Device
Backup Server
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 27
Backup Media
 Tape
– Traditional destination for backups
– Sequential access
– No protection
 Disk
– Random access
– Protected by the storage array (RAID, hot spare, etc)
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 28
Multiple Streams on Tape Media
Data from
Stream 1
Data from
Stream 2
Data from
Stream 3
Tape
 Multiple streams interleaved to achieve higher
throughput on tape
– Keeps the tape streaming, for maximum write performance
– Helps prevent tape mechanical failure
– Greatly increases time to restore
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 29
Backup to Disk
 Backup to disk minimizes tape in backup environments
by using disk as the primary destination device
– Cost benefits
– No processes changes needed
– Better service levels
 Backup to disk aligns backup strategy to RTO and
RPO
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 30
Tape versus Disk – Restore Comparison
24
Minutes
Disk
Backup / Restore
108
Minutes
Tape
Backup / Restore
0
10
20
30
40
50
60
70
80
90
100 110 120
Recovery Time in Minutes*
*Total time from point of failure to return of service to e-mail users
Typical Scenario:
 800 users, 75 MB mailbox
 60 GB database
Source: EMC Engineering and EMC IT
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 31
31
Three Backup / Restore Solutions based on RTO
2 Min.
Local Replica /
Clone
Restore time
17 Min. 19 Minutes
Log playback
Backup on ATA
24 Min. 17 Min. 41 Minutes
Backup on tape
108 Min. 17 Min. 125 Minutes
0
10
20
30
40
50
60
70
80
90 100 110 120 130
Recovery Time in Minutes*
*Total time from point of failure to return of service to e-mail users
Typical Scenario:
 800 users, 75 MB mailbox
 60 GB DB – restore time
 500 MB logs – log playback
© 2006 EMC Corporation. All rights reserved.
 Time of last image dictates
the log playback time
 Larger data sets extend the
recovery time (ATA and tape)
Backup and Recovery - 32
Traditional Backup, Recovery and Archive Approach
 Production environment grows
– Requires constant tuning and data placement to
maintain performance
Production
– Need to add more tier-1 storage
 Backup environment grows
– Backup windows get longer and jobs do not complete
Backup
Process
– Restores take longer
– Requires more tape drives and silos to keep up with
service levels
 Archive environment grows
Archive
Process
– Impact flexibility to retrieve content when requested
– Requires more media, adding management cost
– No investment protection for long term retention
requirements
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 33
Differences Between Backup / Recovery & Archive
Backup / Recovery
Archive
A secondary copy of information
Primary copy of information
Used for recovery operations
Available for information retrieval
Improves availability by enabling
application to be restored to a
specific point in time
Adds operational efficiencies by
moving fixed / unstructured content
out of operational environment
Typically short-term (weeks or
months)
Typically long-term (months, years,
or decades)
Data typically overwritten on
periodic basis (e.g., monthly)
Data typically maintained for
analysis, value generation, or
compliance
Not for regulatory compliance—
though some are forced to use
Useful for compliance and should
take into account informationretention policy
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 34
New Architecture for Backup, Recovery & Archive
1
Backup
Process
3
4
2
Production
4
Archive
Process
 Understand the environment
 Actively archive valuable information to tiered
storage
 Back up active production information to disk
 Retrieve from archive or recover from backup
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 35
Lesson: Summary
Topics in this lesson included:
 The DAS, LAN, SAN, and Mixed topologies.
 Backup media considerations.
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 36
Lesson: Managing the Backup Process
Upon completion of this lesson, you be able to:
 Describe features and functions of common
Backup/Recovery applications.
 Describe the Backup/Recovery process management
considerations.
 Describe the importance of the information found in
Backup Reports and in the Backup Catalog.
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 37
How a Typical Backup Application Works
 Backup clients are grouped and associated with a Backup
schedule that determines when and which backup type will
occur.
 Groups are associated with Pools, which determine which
backup media will be used.
 Each backup media has a unique label.
 Information about the backup is written to the Backup Catalog
during and after it completes. The Catalog shows:
– when the Backup was performed, and
– which media was used (label).
 Errors and other information is also written to a log.
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 38
Backup Application User Interfaces
There are typically two types of user interfaces:
 Command Line Interface – CLI
 Graphical User Interfaces – GUI
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 39
Managing the Backup and Restore Process
 Running the B/R Application: Backup
– The backup administrator configures it to be started, most (if not all)
of the times, automatically
– Most backup products offer the ability for the backup client to initiate
their own backup (usually disabled)
 Running the B/R Application: Restore
– There is usually a separate GUI to manage the restore process
– Information is pulled from the backup catalog when the user is
selecting the files to be restored
– Once the selection is finished, the backup server starts reading from
the required backup media, and the files are sent to the backup
client
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 40
Backup Reports
 Backup products also offer reporting features.
 These features rely on the backup catalog and log files.
 Reports are meant to be easy to read and provide
important information such as:
– Amount of data backed up
– Number of completed backups
– Number of incomplete backups (failed)
– Types of errors that may have occurred
 Additional reports may be available, depending on the
backup software product used.
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 41
Importance of the Backup Catalog
 As you can see, backup operations strongly rely on the
backup catalog
 If the catalog is lost, the backup software alone has no
means to determine where to find a specific file backed
up two months ago, for example
 It can be reconstructed, but this usually means that all of
the backup media (i.e. tapes) have to be read
 It’s a good practice to protect the catalog
– By replicating the file system where it resides to a remote location
– By backing it up
 Some backup products have built-in mechanisms to
protect their catalog (such as automatic backup)
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 42
Lesson: Summary
Topics in this lesson included:
 The features and functions of common Backup/Recovery
applications.
 The Backup/Recovery process management
considerations.
 The importance of the information found in Backup
Reports and in the Backup Catalog.
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 43
Module Summary
Key points covered in this module:
 The best practices for planning Backup and Recovery.
 The common media and types of data that are part of a
Backup and Recovery strategy.
 The common Backup and Recovery topologies.
 The Backup and Recovery Process.
 Management considerations for Backup and Recovery.
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 44
 Check Your Knowledge
 What are three reasons for doing a Backup plan?
 What are the three topologies that support creating a
Backup set?
 What are the advantages and disadvantages of using
tape as the Backup media?
 What are the three levels of granularity found in
Backups?
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 45
Apply Your Knowledge…
Upon completion of this topic, you will be able to:
 Describe EMC’s product implementation of a Backup
and Recovery solution.
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 46
EMC NetWorker
Tiered Protection and Recovery Management
Remove risk
Faster and more consistent data
backup
Improve reliability
Keep recovery copies fresh and
reduce process errors
Lower total cost of
ownership
Basic
Tape backup
and recovery
Low
Backup
to disk
Advanced
backup
Disk-backup
option
Snapshot
management
SERVICE-LEVEL REQUIREMENTS
Centralization and ease of use
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 47
High
NetWorker Backup and Recovery
Solution Features
 Enterprise protection
Basic Architecture
Heterogeneous
clients
– Critical applications
– Heterogeneous platforms and
storage
– Scalable architecture
– 256-bit AES encryption and secure
authentication
Key
applications
LAN
Backup
server
NAS
(NDMP)
SAN
Storage
Node
 Centralized management
– Graphical user interface
– Customizable reporting
– Wizard-driven configuration
 Performance
Tape
library
© 2006 EMC Corporation. All rights reserved.
– Data multiplexing
– Advanced indexing
– Efficient media management
Backup and Recovery - 48
Critical Application and Database Protection
Backup without Application
Modules
Backup with NetWorker
Application Modules
Offline (Cold)
Restart
application
SAVE
24x7 OPERATIONS
Application
NetWorker MODULE
Back up
application
Application
DOWNTIME
Shut down
application
Application
Integration with application APIs
for backup and recovery
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 49
Media-Management Advantages
 Open Tape Format
– Datastream multiplexing
– Self-contained indexing
– Cross-platform format
 UNIX  Windows  Linux
– Minimize impact of tape corruption
 Dynamic drive sharing
– Cross-platform tape-drive sharing
– On-demand device usage
– Reduce hardware total cost of
ownership
© 2006 EMC Corporation. All rights reserved.
NetWorker
UNIX/Linux
NetWorker
Windows
Backup and Recovery - 50
NetWorker DiskBackup Option
Backup-to-Disk Architecture
Heterogeneous
clients
Key
applications
Backup
server
SAN
– Simultaneous-access operations
– No penalty on restore versus tape
 Policy-based migration of data
from disk to tape
LAN
NAS
 High performance
Storage
Node
– Automated staging and cloning
– Up to 50% faster
– Clone backups jobs as they
complete
– Reduce wear and tear on tape
drives and cartridges
 Superior capability
Tape
library
© 2006 EMC Corporation. All rights reserved.
Diskbackup
target
– Operational backup and recovery
for all clients, including NAS with
NDMP
– Direct file access for fast recovery
Backup and Recovery - 51
Advanced Backup - Snapshots and CDP
 Integration of backup with snapshots, full-volume mirrors,
Production
and Continuous Data Protection (CDP)
server
 Instant restore
 Off-host backups
Production
information
 Achieve stringent recovery-time objectives (RTOs),
recovery-point objectives (RPOs)
Recover
It is expected that snapshot
technology for data protection will
surpass backup to tape as the trend
in data protection as organizations
continue to focus on recovery times
© 2006 EMC Corporation. All rights reserved.
Backup
Snapshot
5:00 p.m.
Snapshot
11:00 a.m.
Backup snap
10:00 p.m.
Backup
server
Backup and Recovery - 52
NetWorker PowerSnap Module
 Policy-based management
Advanced Backup
Heterogeneous
clients
– Administer snapshots in
NetWorker
– Schedule, create, retain, and
delete snapshots by policy
Key
applications
 Third-party integration
LAN
Backup
server
NAS
SAN
Tape
library
Storage
Node
– Leverage third-party replication
technology
 Array-based (Symmetrix DMX,
CLARiiON CX, etc.)
 Software-based (RecoverPoint)
 Application recovery
CLARiiON
with
SnapView
– Integration with Application
Modules to ensure consistent
state
 Exchange / SQL / Oracle / SAP
© 2006 EMC Corporation. All rights reserved.
Backup and Recovery - 53
NetWorker SnapImage Module
Advanced Backup
 Block-level backups
– Host-based snapshot
– Targeted at high-density file
systems
– Single-file restore
– Sparse backups
 High performance
– Significant backup-and-restore
performance impact—up to 10
times faster
10,000,000+ files
1,000,000+ directories
© 2006 EMC Corporation. All rights reserved.
– Drive tape at rated speeds
– Optional network-accelerated
serverless backup with Cisco
intelligent switch
Backup and Recovery - 54
Solution Example: Major Telecom Company
Enterprise-Information Protection
Solution:
Business Challenge:
 NetWorker PowerSnap with
Symmetrix and TimeFinder/Snap
 Complex application
environment
– Server-free backup
 No backup window
 Recovery-time objective:
Restore 24 TB in two
hours
 NetWorker DiskBackup Option with
CLARiiON with ATA disks
– Rapid primary-site protection
 NetWorker and SRDF/S
Disaster-Recovery Site
– Disaster recovery
– Offsite protection
Production Site
NetWorker
Disasterrecovery host
Tape
library
Storage Node
Application
host
Storage Node
PowerSnap
SAN
SAN
Tape
library
SRDF/S
Symmetrix
DMX
© 2006 EMC Corporation. All rights reserved.
Symmetrix
DMX
CLARiiON
CX
Value Proposition
Zero backup window for
applications
Eliminated data-loss risk
Reduced management
overhead
Backup and Recovery - 55
55