Symantec De-Duplication Solutions
Complete Protection for your Information Driven Enterprise
Richard Hobkirk
Sr. Pre-Sales Consultant
1
Symantec’s Prescription For Better Data Protection
Deduplicate
Everywhere To
Rationalize
Infrastructure
Archive for Long
Term Storage &
Improve Backup
Performance
Centralize
Management To
Reduce
Complexity
2
Deduplication, Store Data More Efficient
Deduplication Eliminates Redundant Data
• Places to deduplicate:
– Source
– Target
– At the media server
– With appliances
• Benefits include:
– Backup Less data = faster
– Dramatically reduced storage costs
and backup load
3
Target-Side vs Source-Side Deduplication
• Target Deduplication
– Can be a non-disruptive way to add dedupe
to your environment
Application Servers
– Expensive dedicated appliances ingest data
at the end of the backup process
– Doesn’t do anything to help your backup
infrastructure scale
Backup Infrastructure
• Source Deduplication
– Reduces the amount of data sent over the
network, freeing up bandwidth
– Reduces data through backup
infrastructure
– Suitable when application/client server can
handle the dedupe processing load
Target Deduplication
Appliance
Backup Disk Pool
4
Built-In with NetBackup 7 & Backup
Exec 2010
Deduplicate Everywhere, Closer to the Source
Built-In with NetBackup 7 & BE 2010
1
Client
Servers
Built-In Dedupe
at the client
2
Media
Server
Built-In Dedupe
at the media
server
3
Backup
Appliance
Integrate with
deduplication
appliances
1 Start at the Source - Faster backups for 50-70% of customer data
2 Not all Data is Equal – Dedupe for CPU sensitive applications
3 Turbo-Charge Appliances – More control & performance
6
Deduplication at the Source
• Data is deduplicated at the
source/client before being sent
across the network
• Benefits include:
– Built-In to NetBackup, simple and easy
to deploy
– Reduced WAN/LAN bandwidth impact
– Reduced backend storage requirements
– Transparent support for applications
Client/Source
Media server/Target
• Ideal for:
– Remote offices
– Protecting virtual machines
– File/folder & Database backups with
low change rate
= Deduplication engine
Dedupe appliance
Client Deduplication Leads to Higher Backup Throughput
Client Deduplication vs. Media Server Deduplication
Backup Speed (MB/s) vs. Number of Clients (1-12)
300
282
282MB/s
3 x faster backups
with only 12 clients!
250
229
200
MB/s
80 % Deduplication
147
150
1 Gb/s Network
100
88
86
94
94MB/s
72
58
50
47
47
0
1 client 2 clients 4 clients 8 clients 12 clients 1 client 2 clients 4 clients 8 clients 12 clients
Client Deduplication
Media Server Deduplication
1 Stream / Client
Client Deduplication vs. Standard Backup to Disk
Example with 50 clients
FULL Backup
without deduplication
10
15
5
20
3
1 Client
25
30
4
2 Clients
FULL Backups with
95% deduplication
40
710
424
MB/s
1190
580
820
MB/s
192
235
915
1070
110
100
50 MB/s
50
1 Gb/s network
Standard Clients with
maximum read speed of
50MB/s
1Gb/s is saturated
with 2-3 clients
Higher # of clients, deliver
higher aggregated backup
throughput
Media Server
Deduplication Pool
9
Up till 10x faster backups *
9
Media Server Deduplication
• Data is deduplicated inline at the
media server before being stored
on disk
Client/Source
• Benefits include:
– Built-In to NetBackup, simple and
easy to deploy
– No client impact
– Leverage commodity hardware
– Reduced backend storage
requirements (1Gb/s vs. 10Gb/s)
– Highly scalable
• Ideal for:
– Data center environments
= Deduplication engine
Media server/Target
Dedupe appliance
Deduplication Architecture
Meta Data
Communication
NetBackup Client
NetBackup Master
Deduplication
Plug-in
NetBackup Media server
• Deduplication plug-in embedded in
standard NBU client
Local Disk
(SAN/DAS)
PureDisk Pool
• Data moves directly between client and
Deduplication Store (Media Server
Deduplication Pool)
Media Server
Dedupe Pool
32TB
100TB
Deduplication platform
• Deduplication storage embedded into
the NetBackup Media Server
• Master/ Media servers required for
Control Communication, Catalogue meta
data repository and Policy management
• Transparent client deduplication support
for applications (VMware, SQL ,...)
• Support for existing PureDisk pool
11
Increase Performance with Media Server Load
Balancing
Media Server Deduplication Pool - MSDP
NetBackup Clients
Load balancing option
• Deduplication load sharing
Additional Media servers can be
enabled to contribute to the
deduplication process, all writing
to the same Media server
deduplication pool
Media Server
Deduplication
Pool
• Global deduplication is
maintained within each
deduplication pool
NetBackup Media
Server
Deduped stream
Standard stream
Deduplication process
• A simple check in the box to
enable deduplication
PureDisk
Deduplication
Pool
• Support for PureDisk
Deduplication Pool
12
Media Server Deduplication Performance Example
• Performance depends on a lot of
factors
– network, deduplication rate, disk
write speed, CPU power …
1000
– or 3TB/hour
• Add more media servers
to achieve higher throughput
while maintaining global dedupe!
Backup Throughput in MB/s
• Example for 10Gb/s network and
multiple backup jobs
– 16 jobs deliver a total of 900MB/s
Aggregated Backup Speed (MB/s) for 1 Media Server
Deduplication Pool with 90% deduplication*
~ 900 MB/s
900
800
700
600
500
Aggregated
Backup Speed
(MB/s)
400
300
200
100
0
1
2
4
8
16
32
Total # of concurrent backup jobs/streams
* Media Server has 10GB/s NIC card and deduplication rates are
90 % (conservative for full backups)
13
Integration with Deduplication Appliances
• Data is deduplicated at the
appliance, yet centrally managed
by NetBackup via OpenStorage API
Client/Source
• Benefits include:
– It allows NetBackup to see the disk, as
disk, giving NetBackup visibility and
control of advanced features
Media server/Target
– Centralized policy management and
replication control
– Improved performance
• Supported appliances include:
= Deduplication engine
Dedupe appliance
Deduplicate Everywhere, Closer to the Source
Primary
Storage
Archive data off
primary storage &
dedupe data
Client
Servers
Media
Server
Deduplicate
at the client
Deduplicate
at the media
server
Backup
Appliance
Integrate with
appliances via
OpenStorage
New in NetBackup 7 & BE 2010
RESULTS
Faster backup & recovery
Reduced bandwidth
Less storage requirements
Optimized search/discovery
15
Better DR with Global Data Protection
16
Better Disaster Recovery for Global Data Protection
• One Console To View
Distributed Backup
Information
• Replicate & Store 80%
Less Data Duplication
• Recover terabytes of data
in seconds from anywhere
17
Move Up to 80% Less Data With Optimized Duplication
One Policy
for Backup &
DR Copies
Site 1
Site 2
SAN/
DAS
Optimized Duplication
Copy 1: 14 days
Copy 2: 21 days
Copy 3: 60 days
Optimized Duplication
OST
Copy 1: 14 days
OST=OpenStorage Appliance (e.g., Data Domain)
OST
Copy 2: 21 days
18
Thank you!
Copyright © 2010 Symantec Corporation. All rights reserved. Symantec and the Symantec Logo are trademarks or registered trademarks of Symantec Corporation or its affiliates in
the U.S. and other countries. Other names may be trademarks of their respective owners.
This document is provided for informational purposes only and is not intended as advertising. All warranties relating to the information in this document, either express or implied,
are disclaimed to the maximum extent allowed by law. The information in this document is subject to change without notice.
19