Title: Emerging Technologies
1Understanding Deduplication
- Kevin Carpenter
- Account Manager Upstate NY
- Phil Benincasa
- System Engineer Upstate NY
Emerging Technologies
2Agenda
Emerging Technologies
3What is De-duplication?
Deduplication is the process of removing
redundant data from any storage medium based upon
identifying repeating components either prior to
or shortly after writing to the media.
Deduplication is used primarily on Backup and
Archived data as this data contains the most
redundancy, is the least performance sensitive
data, and consumes the most media capacity in a
data center.
Current deduplication technology focuses on block
level redundancy of data because it provides high
redundancy with a manageable deduplication
process.
Emerging Technologies
4Why Deduplicate Data?
- Backup and Archive copies by nature have lots of
redundant contents common block patterns inside
DBs, images, file versions, etc. - Eliminate redundant block patterns across file,
databases, images and your archive retain more
backup copies on disk - 90 reduction in disk usage
- Data cant spin for everextend and preserve that
reduction benefit to offline storage to support
retention, vaulting or offsite DR needs - 90 reduction in tape usage
Normal Disk Store
Emerging Technologies
5Why Deduplicate Data?
- Data is growing at an alarming rate. Faster
than hardware capacity is growing - Less Disk and Tape to consume and manage
- Less power and cooling in the data center
- Preserve current infrastructure (space and
hardware) - Less people required to manage more data
-
Emerging Technologies
6Types of Dedupe Solutions
All Deduplication works using the same
process!!!! The only difference is where the
steps occur.
Emerging Technologies
7Types of Dedupe Solutions
Appliance Based
Advantages
- All processing occurs within the system.
- The dedupe database and storage are contained
within the unit - Data is deduplicated either in-line or lands in a
common storage area and is then deduped
post-write.
- Easy to acquire
- Dedicated Hardware and to the processing
- Flexible deployment options
Disadvantages
- Do not scale easily
- Limited scope of de-duplication
- Requires more data to flow through backup and
archive process - Ties you into a specific solution
- Can become a performance bottleneck
Emerging Technologies
8Types of Dedupe Solutions
Software Based
Advantages
- Lower cost of acquisition
- More efficient in limiting data flow
- Easier to manage
- Balances performance and processing over entire
process - Free to choose whatever hardware makes sense
Disadvantages
- More complex to license
- Tied to a specific software package
- Requires hardware for performance that is not
all-in-one
Emerging Technologies
9Types of Dedupe Solutions
Client Side Dedupe
Advantages
- Most efficient in limiting data flow
- Best overall performance in backup environments
- Free to choose whatever hardware makes sense
- Easier to manage than Appliance
Disadvantages
- More complex to license
- Tied to a specific software package
- Requires hardware for performance that is not
all-in-one - Heavier processing requirements on clients
Emerging Technologies
10Questions. ??????
Emerging Technologies
11Thank You For Your Time
Emerging Technologies