EMC BACKUP AND RECOVERY SOLUTIONS
Backup to the future
© Copyright 2011 EMC Corporation. All rights reserved.
Agenda
• EMC backup and recovery solutions
– Backup Recovery Systems (BRS) division profile
– The transition from tape to disk
– Backup and recovery in the enterprise
• Enterprise backup and recovery
– EMC NetWorker
– EMC Disk Library and Disk Library for mainframe
• Deduplication: Enabling next-generation backup
– EMC Avamar
– EMC Data Domain
EMC Backup Recovery Systems Division
• Division HQ: Santa Clara, CA
• 10 R&D locations
– 2,000 employees
• Data protection storage systems
– More than 60,000 systems installed
– More than 45,000 customers
– More than 15,000 PB under protection
worldwide
• Global sales, support, and services
– Approximately 6,000 channel partners
EMC Backup and Recovery Market
Position
• Avamar
– #1 deduplication backup software worldwide
– 8,000 installations
– 4,400 customers
• Data Domain
– #1 deduplication storage worldwide
– 12,000 installations
– 5,100 customers
• Disk Library
– #1 virtual tape library (VTL) worldwide
– >$1B in sales
• NetWorker
– Top three enterprise backup software
– 30,000 customers
Backup and Recovery Architectures:
In Transition from Tape to Disk
[Figure: Backup/recovery architecture, from application backup clients (DB, Home) through the backup/media manager to onsite backup storage and disaster recovery storage. Conventional (tape-centric): NetWorker backup software with tape, or with Disk Library (VTL) and VTL/tape. Transformational (disk-centric): NetWorker backup software with Data Domain deduplication storage, or Avamar deduplication backup software and system. Data Protection Advisor (data protection management software) spans on-premise and off-premise storage.]
Backup and Recovery:
Hot Spot in the Enterprise
• Secular shift from tape-centric to disk/network-centric approaches
• Enabler: Massive data deduplication and compression techniques
• Unabated growth of enterprise data overwhelms legacy infrastructure
• Server virtualization is another catalyst
Deduplication helps optimize many of these initiatives
[Chart: F1000: What are your top storage pain points?]
Source: TheInfoPro, Wave 14 Storage Study, Q2 2010, published August 19, 2010; n=166 (8/11/10 F1000 sample).
Note that due to multiple responses per interview, total exceeds 100%.
EMC NETWORKER
NetWorker
[Figure: Backup/recovery architecture diagram (conventional tape-centric versus transformational disk-centric), showing NetWorker as the backup software in front of tape, Disk Library (VTL), and Data Domain deduplication storage, alongside Avamar deduplication backup software and system and Data Protection Advisor management software, on premise and off premise]
NetWorker Backup and Recovery
Software
Unified backup software
• Common platform
– Backup to disk
– Backup to tape
– Snapshot management
– Replication management
• Integrated deduplication support
– Integrated Avamar client services
– Data Domain Boost integration
• Simplified, centralized
management
• Broad, heterogeneous platform
support
• Enterprise-wide deployment
experience
– Mid-market to enterprise
– Small to very, very large
Centralized Management
[Figure: NetWorker at the center of: deduplication (Data Domain, Avamar); application support (Oracle, SAP, Microsoft); virtualization; file systems and server recovery; remote and branch offices; cloud; tape; and EMC storage platforms (VNX Family, Symmetrix, Disk Library Family, Centera)]
NetWorker Differentiation
• Centralized control for all backup
requirements
• Seamless integration with industry’s
two leading deduplication solutions
• Advanced application and virtual
environment support
• Reliable recoverability
EMC DISK LIBRARY
EMC Disk Library
[Figure: Backup/recovery architecture diagram (conventional tape-centric versus transformational disk-centric), showing Disk Library (VTL) as backup storage behind NetWorker, Symantec, TSM, and other third-party backup software, alongside Data Domain, Avamar deduplication backup software and system, and Data Protection Advisor management software, on premise and off premise]
Disk Library DL5200
EMC Disk Library: industry’s most popular virtual tape library
• Based on proven CLARiiON CX4 array
• Nearly 3 PB of logical capacity
– 1 TB or 2 TB SATA drives
– Up to 945 drives in a single array
• Enhanced system throughput, up to 10.2 TB per
hour
– Faster hardware compression
– Front-end 8 Gb/s Fibre Channel ports
• Improved energy efficiency
– High-density drives and Spin Down reduce per-terabyte
disk drive energy requirements
– New high-density racks—twice the capacity in same
footprint
• Easy integration into existing infrastructure
– Central management with other Disk Library systems
– Replication between Disk Library systems
DL5200 specifications
Storage                             CLARiiON CX4-960
Maximum capacity (usable)           1.4 PB
Maximum performance (compressed)    10.2 TB per hour
Fibre Channel connectivity          8 Gb/s
Engines                             2
Back-end arrays                     1
Drive size                          1 TB/2 TB
Features: active engine failover, replication, consolidated media management, hardware compression, Spin Down
Disk Library Differentiation
• Industry-leading open systems virtual tape
library
– More than 500 PB deployed
– More than 2,500 customers worldwide
• Industry’s most energy-efficient virtual tape
library
– Drive Spin Down reduces power and cooling costs
• Consolidated media management
– EMC NetWorker and Symantec NetBackup
integrated
• Most qualified backup environments
– More than three million supported configurations
EMC DISK LIBRARY
FOR MAINFRAME
EMC Disk Library for Mainframe
• True IBM tape emulation
• Transparent to mainframe
operations
• Leverages low-cost SATA II technology
• High-performance read and write
• Unmatched remote replication capability
• EMC-branded product
– QA/tested by EMC
– Manufactured by EMC
– Maintained by EMC
– Professional Services by EMC
[Figure: IBM mainframe and EMC Disk Library for mainframe]
EMC Disk Library for Mainframe Family
                                 DLm120            DLm960
Number of VTEs                   1 or 2            1–6
Connectivity                     FICON             FICON
Number of channels to host       2 or 4            2–12
Number of virtual tape drives    Up to 512         Up to 1,536
Maximum capacity (usable)        9.5 TB–96.5 TB    19.3 TB–1.2 PB
Performance                      Up to 400 MB/s    Up to 1.2 GB/s
Number of cabinets               1                 2–13 with 1 TB, 2–9 with 2 TB
Features: replication, hardware compression
DLm960 with Deduplication Storage Expansion Option
EMC Disk Library for mainframe and industry’s most popular deduplication system
• Based on proven Data Domain DD880
• Nearly 3.5 PB of logical capacity
• System throughput up to 4.3 TB per hour
– Hardware compression
– Deduplication
• Reliability designed for the data center
– Multipath for access to all tapes
– Data Domain Data Invulnerability Architecture
– Call home for support
• Easy integration into existing infrastructure
– Behaves like a tape library to the application
– Low-bandwidth replication between disk library systems
– No changes to current management process
[Figure: DLm960 with Deduplication Storage Expansion Option]
Disk Library for Mainframe Differentiation
• Eliminates all issues related to traditional tape handling
– Eliminates manual intervention, physical movement of tape
cartridges, robotic issues, and single points of failure
• Works seamlessly with existing applications
– Uses existing tape management processes to automate tape
vaulting
• Significantly improves performance
– Reallocates all of the data to disk and uses smart I/O buffering,
allowing potentially significant reductions in batch windows
• Extends disaster recovery capabilities to the tape
workload
– Utilizes array-based replication process over IP to seamlessly move
tapes offsite
• Provides deduplication for backup and archive workloads
– Gain longer onsite retention, optimize replication, and lower overall
disk storage costs
• Easily scales as the workload increases
– No need for additional subsystems, libraries, network connections,
etc.
EMC AVAMAR AND
EMC DATA DOMAIN
Enabling next-generation data
protection with deduplication
EMC Avamar and EMC Data Domain
Retain, replicate, recover
Data Domain Deduplication Storage Systems
• Deduplicate everything without changing anything
• Simplify backup, archiving, and disaster recovery with easy integration across workloads, infrastructures, and backup software
Avamar Deduplication Backup Software
• Never back up the same data twice
• Revolutionize your backup by moving less data to solve your toughest VMware, NAS, remote office, and desktop/laptop backup challenges
Data Reduction/Deduplication: F1000
The “in-use” rating for EMC is now over three times that of its nearest competitor
Source: TheInfoPro, Wave 14 Storage Study – Q2 2010, published August 19, 2010; n=146 (7/6/10 F1000 sample)
Deduplication Impact on Data Size
10–30 times less data stored versus fulls plus incrementals with typical retention policies
[Chart: Data stored (0–30) versus weeks in use (1–20), comparing deduplication storage with traditional storage]
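To make the shape of this curve concrete, here is a minimal sketch that projects stored data week by week. All sizes are assumptions borrowed from the worked example in the "Data Deduplication: Technology Overview" slide of this deck (one 1 TB weekly full, four 100 GB incrementals); the function names are hypothetical, and real results depend on change rate and retention policy.

```python
# Illustrative per-backup sizes in GB (assumptions, not measurements).
FULL_LOGICAL, INCR_LOGICAL = 1000, 100   # one weekly full, daily incrementals
FIRST_FULL_PHYSICAL = 250                # first full after deduplication
LATER_FULL_PHYSICAL = 18                 # each later full after deduplication
INCR_PHYSICAL = 10                       # each incremental after deduplication
INCRS_PER_WEEK = 4

def traditional_gb(weeks):
    """Fulls plus incrementals, all retained at their logical size."""
    return weeks * (FULL_LOGICAL + INCRS_PER_WEEK * INCR_LOGICAL)

def deduplicated_gb(weeks):
    """After the first full, only new (and compressed) data is stored."""
    fulls = FIRST_FULL_PHYSICAL + (weeks - 1) * LATER_FULL_PHYSICAL
    return fulls + weeks * INCRS_PER_WEEK * INCR_PHYSICAL

for weeks in (1, 5, 10, 15, 20):
    trad, dedup = traditional_gb(weeks), deduplicated_gb(weeks)
    print(f"week {weeks:2d}: {trad:6d} GB vs {dedup:5d} GB ({trad / dedup:.1f}x)")
```

Under these assumed numbers the gap widens with retention: roughly 5x after the first week and about 20x by week 20, which falls inside the 10–30x range quoted above.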
Data Deduplication: Technology
Overview
Store more backups in a smaller footprint
[Figure: Segment-level view of the backups. The Friday full backup contains segments A B C D A E F G; the Monday through Thursday incrementals and the second Friday full backup mostly repeat existing segments; in total, only the unique segments A B C D E F G H I J K L are stored.]

Backup                   Logical data   Estimated reduction   Physical data
Friday full              1 TB           2–4x                  250 GB
Monday incremental       100 GB         7–10x                 10 GB
Tuesday incremental      100 GB         7–10x                 10 GB
Wednesday incremental    100 GB         7–10x                 10 GB
Thursday incremental     100 GB         7–10x                 10 GB
Second Friday full       1 TB           50–60x                18 GB
TOTAL                    2.4 TB         7.8x                  308 GB
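The totals in this example can be checked with a few lines of arithmetic. The sketch below uses only the per-backup figures from the slide; treating 1 TB as 1,000 GB is an assumption.

```python
# (backup, logical GB, physical GB) as given on the slide.
# Assumption: decimal units, 1 TB = 1,000 GB.
backups = [
    ("Friday full", 1000, 250),
    ("Monday incremental", 100, 10),
    ("Tuesday incremental", 100, 10),
    ("Wednesday incremental", 100, 10),
    ("Thursday incremental", 100, 10),
    ("Second Friday full", 1000, 18),
]

logical_gb = sum(logical for _, logical, _ in backups)
physical_gb = sum(physical for _, _, physical in backups)
overall_reduction = logical_gb / physical_gb

for name, logical, physical in backups:
    print(f"{name:22s} {logical:5d} GB -> {physical:4d} GB ({logical / physical:.1f}x)")
print(f"{'Total':22s} {logical_gb:5d} GB -> {physical_gb:4d} GB ({overall_reduction:.1f}x)")
```

The second full backup is where most of the gain comes from: nearly all of its segments were already stored by the first full and the incrementals.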
It’s Not All Deduplication Out There
Approach                            Technique                  Typical reduction
Regular storage array               -                          1:1
Whitespace reduction                LZ compression             ~2:1
File level                          Single instance storage    ~3:1
Fixed blocks, snapshots             Fixed block                ~3:1
Backup target, variable segment     Variable segment           ~20:1
Deduplication significantly reduces:
• Replication WAN bandwidth
• Power
• Heat
• Cooling
• Management
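One reason variable-segment deduplication finds more duplicates than fixed-block approaches is that its segment boundaries follow the content rather than the byte offset, so a small insertion does not shift every later segment. The sketch below illustrates the idea only: the CRC-over-a-window boundary rule, the window and divisor values, and all function names are assumptions chosen for brevity, not the algorithm used by any EMC product.

```python
import hashlib
import random
import zlib

def fixed_blocks(data, size=1024):
    """Cut at fixed offsets; an inserted byte shifts every later block."""
    return [data[i:i + size] for i in range(0, len(data), size)]

def variable_segments(data, window=16, divisor=1024):
    """Cut wherever a checksum of the last `window` bytes hits a chosen value,
    so boundaries depend on content, not on position."""
    segments, start = [], 0
    for end in range(window, len(data) + 1):
        if zlib.crc32(data[end - window:end]) % divisor == 0:
            segments.append(data[start:end])
            start = end
    if start < len(data):
        segments.append(data[start:])
    return segments

def shared_fraction(old, new):
    """Fraction of the new segments whose fingerprint is already stored."""
    stored = {hashlib.sha256(s).digest() for s in old}
    return sum(hashlib.sha256(s).digest() in stored for s in new) / len(new)

rng = random.Random(0)
original = bytes(rng.getrandbits(8) for _ in range(128 * 1024))
edited = b"X" + original  # the same data with one byte inserted at the front

fixed_shared = shared_fraction(fixed_blocks(original), fixed_blocks(edited))
variable_shared = shared_fraction(variable_segments(original), variable_segments(edited))
print(f"fixed block:      {fixed_shared:.0%} of segments already stored")
print(f"variable segment: {variable_shared:.0%} of segments already stored")
```

With fixed blocks, the single inserted byte changes every block; with content-defined boundaries, only the segments near the insertion change and the rest match what is already stored.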
Deduplication Enables Next-Generation
Storage Architectures
Storage 1.0 (primary disk, tape)
• When did you implement this?
• What made you evolve?
Storage 2.0 (primary disk, SATA, tape)
• Why did you add SATA?
• What did you learn?
Storage 3.0 (primary, deduplicated SATA, tape)
• Backup/recover plus archive from disk (shrink primary)
• Tape: monthly
Storage 4.0 (flash, deduplicated SATA)
• Flash for primary
• Everything else to deduplicate
[Figure: Before and after views of the Storage 3.0 and Storage 4.0 tiers]
EMC AVAMAR
Backup and Recovery Architectures:
In Transition from Tape to Disk
[Figure: Backup/recovery architecture diagram (conventional tape-centric versus transformational disk-centric), showing Avamar as deduplication backup software and system alongside NetWorker with tape, Disk Library (VTL), and Data Domain deduplication storage, with Data Protection Advisor management software, on premise and off premise]
Avamar
Deduplication backup software and system
Full backups, every time: one-step recovery
Higher backup success rate and reliability
Increased ROI, lower TCO, less risk
• End-to-end, software/hardware solution
– Integrated system for simple, predictable results
– Client-side, global deduplication; within and across clients
• Improves backup window, less network load
– Backup process minimizes data sent and stored
– Reduces network and virtual infrastructure stress
• Integrated high availability and reliability
– RAIN (redundant array of independent nodes) architecture for high availability and fault tolerance
– Recoverability verified daily
– Disaster recovery through replication
• Flexible deployment options
– Avamar Data Store
– Avamar Virtual Edition
– Agent-only for remote office/branch office (ROBO)
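The idea behind client-side, global deduplication can be sketched in a few lines: the client fingerprints its segments, asks the server which fingerprints it lacks, and sends only those, while each backup is still recorded as a complete list of fingerprints so it restores as a full. This is an illustration under stated assumptions, not Avamar's actual protocol; `DedupServer`, `backup`, and `restore` are hypothetical names.

```python
import hashlib

class DedupServer:
    """Hypothetical server with one global, content-addressed segment store."""
    def __init__(self):
        self.segments = {}

    def missing(self, digests):
        """Return the fingerprints the server has not stored yet."""
        return [d for d in digests if d not in self.segments]

    def store(self, digest, segment):
        self.segments[digest] = segment

def backup(server, client_segments):
    """Send only segments the server lacks; return (recipe, bytes sent)."""
    by_digest = {hashlib.sha256(s).hexdigest(): s for s in client_segments}
    sent = 0
    for digest in server.missing(list(by_digest)):
        server.store(digest, by_digest[digest])
        sent += len(by_digest[digest])
    # The recipe lists every segment of the backup, so each backup is a full.
    recipe = [hashlib.sha256(s).hexdigest() for s in client_segments]
    return recipe, sent

def restore(server, recipe):
    """One-step restore: look up each fingerprint in the recipe."""
    return b"".join(server.segments[d] for d in recipe)

server = DedupServer()
os_image = [b"kernel" * 100, b"libs" * 100]       # data common to both clients
client_a = os_image + [b"data-a" * 100]
client_b = os_image + [b"data-b" * 100]

recipe_a, sent_a = backup(server, client_a)        # first backup: everything is new
recipe_b, sent_b = backup(server, client_b)        # shares the OS segments with A
recipe_a2, sent_a2 = backup(server, client_a)      # unchanged client: nothing to send
print(sent_a, sent_b, sent_a2)
```

In this toy run the second client sends only its own data, and a repeat backup of an unchanged client sends nothing, which is the effect behind the reduced network load described above.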
Avamar Family
[Figure: Avamar family overview]
Unified management: EMC Data Protection Advisor, EMC NetWorker
Example use cases: VMware, remote/branch offices, NAS/NDMP, desktop/laptop
Clients: Lotus Notes, IBM DB2
Core platforms: EMC Avamar, EMC Avamar Data Store, EMC Avamar Virtual Edition for VMware
Avamar Differentiation
• Shorter backup windows
– Less data moved reduces daily full backup times
• Reduces required daily network bandwidth and client stress
– Scalable VMware backup for greater server consolidation
• Simple management
– System deployment is easy, pre-configured, with predictable performance
– Streamlined, centralized administration and management of remote
backups
• Single-step restore
– Single-step restore for full backups; no need for full and incrementals
• Recoverability guaranteed
– Daily integrity checks, RAIN, and replication ensure recoverability, high
availability
EMC DATA DOMAIN
EMC Data Domain
[Figure: Backup/recovery architecture diagram (conventional tape-centric versus transformational disk-centric), showing Data Domain deduplication storage behind NetWorker, Symantec, TSM, and other third-party backup software, alongside tape, Disk Library (VTL), Avamar deduplication backup software and system, and Data Protection Advisor management software, on premise and off premise]
Data Domain Basics
Easy integration with existing environment
[Figure: Three tiers. Control tier: backup and archive applications (EMC, Symantec, CommVault, Tivoli Software, BakBone Software, Vizioncore). Target tier: DD890 appliance, reached over Ethernet (CIFS, NFS, NDMP, DD Boost) or as a virtual tape library (VTL) over Fibre Channel. Disaster recovery tier: DD890 appliance, reached by replication.]
DD890 appliance
• 2U
• 2 to 10 ports
• 10 and 1 Gigabit Ethernet; 8 Gb/s Fibre Channel
• RAID 6
• Up to 285 TB usable capacity with shelves
• 2 TB or 1 TB 7.2K rpm SATA hard disk drives in shelf
• File system
• NVRAM
• N+1 fans and redundant, hot-plug power supplies
Industry’s Most Scalable Inline
Deduplication Systems
Product families: DD140 Remote Office Appliance; DD600 Appliance Series (DD610, DD630, DD670); DD800 Appliance Series (DD860, DD890); Global Deduplication Array; DD Archiver
Software options: DD Boost, DD Virtual Tape Library, DD Replicator, DD Retention Lock, and DD Encryption

Model                        Speed (DD Boost)   Speed (other)   Logical capacity   Raw capacity    Usable capacity
DD140                        490 GB/hr          450 GB/hr       9–43 TB            1.5 TB          0.86 TB
DD610                        1.3 TB/hr          675 GB/hr       40–195 TB          Up to 6 TB      Up to 3.98 TB
DD630                        2.1 TB/hr          1.1 TB/hr       84–420 TB          Up to 12 TB     Up to 8.4 TB
DD670                        5.4 TB/hr          3.6 TB/hr       0.6–2.7 PB         Up to 76 TB     Up to 55.9 TB
DD860                        9.8 TB/hr          5.1 TB/hr       1.4–7.1 PB         Up to 192 TB    Up to 142 TB
DD890                        14.7 TB/hr         8.1 TB/hr       2.9–14.2 PB        Up to 384 TB    Up to 285 TB
Global Deduplication Array   26.3 TB/hr         10.7 TB/hr      5.7–28.5 PB        Up to 768 TB    Up to 570 TB
DD Archiver                  9.8 TB/hr          4.3 TB/hr       5.7–28.5 PB        Up to 768 TB    Up to 570 TB
Methodology:
Inline versus Post-Process Deduplication
INLINE: Deduplication before storing
• Other activities unimpeded
– Predictable
– Simpler
POST-PROCESS: Deduplication after storing
• 3x disk accesses to shared store
• The more processes, the more resource contention
– Copy to tape: Too slow to stream tape
– Recovery: Service level agreement predictability
– Replication: Poor time-to-disaster-recovery
– Deduplication: If interleaved with backup or restore
• More administration to fight these issues
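The difference in disk traffic between the two methods can be sketched with a toy model that counts bytes written and read. This is a simplified illustration under stated assumptions (an in-memory dictionary as the segment store, no index or metadata I/O); the function names are hypothetical and the model is not a description of any vendor's implementation.

```python
import hashlib

def inline_dedup(segments, store):
    """Deduplicate before storing: only new segments ever reach the disk."""
    io = {"written": 0, "read": 0}
    for seg in segments:
        digest = hashlib.sha256(seg).digest()
        if digest not in store:
            store[digest] = seg
            io["written"] += len(seg)
    return io

def post_process_dedup(segments, store):
    """Deduplicate after storing: land the raw backup, read it back,
    then write the new segments into the deduplicated store."""
    io = {"written": 0, "read": 0}
    staging = []
    for seg in segments:                  # access 1: write the raw backup
        staging.append(seg)
        io["written"] += len(seg)
    for seg in staging:                   # access 2: read the raw backup back
        io["read"] += len(seg)
        digest = hashlib.sha256(seg).digest()
        if digest not in store:           # access 3: write the new segments
            store[digest] = seg
            io["written"] += len(seg)
    return io

backup_stream = [b"A" * 1000, b"B" * 1000, b"A" * 1000, b"C" * 1000]
inline_io = inline_dedup(backup_stream, {})
post_io = post_process_dedup(backup_stream, {})
print("inline:      ", inline_io)
print("post-process:", post_io)
```

Both paths end with the same three unique segments stored, but the post-process path touches the disk in three passes, which is the extra load that competes with copy-to-tape, recovery, and replication.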
Performance:
CPU-Centric versus Spindle-Bound
[Chart: Throughput (MB/s, up to 1,500) versus number of disk spindles (50–200; Fibre Channel and SATA), comparing Data Domain with most deduplication vendors]
Improvement since 2004:
• Throughput: 175x
• Capacity: 450x
Data Domain Differentiation
• Maturity
– Simple
– Consistent
– Robust (e.g., policy-driven deduplication replication)
• Product concept: purpose-built storage
– Inline and simple appliance
– System infrastructure
– Application independent: backup, archive, and more
• Architecture: fast, small, storage of last resort
– CPU-centric for price/performance
– Data protection from the ground up
THANK YOU