Why tapes are still the most efficient backup method?

For businesses it is obvious that the vast amount of data (at the time of writing this, tens and hundreds of gigabytes, if not terabytes) make high capacity tapes the medium of choice for backup.

Large data centers cannot live without tape libraries AND offsite tape storage and rotation, to cover against the case of a disaster.

Believe it or not, tape drives are also the best backup for a home system as well.

Of course, some people will jump in right now and object to tape drives with various arguments. These arguments are mostly flawed as per the discussion below. Most of them come from people who only have experience around PCs or PC operating systems (e.g. Windows), and no exposure to how mainframes/mini-computers used to be, nor how data centers are run.

Let us address them one by one:

Objection Comments
Who needs backup when you have RAID RAID is local to one machine. If that machine catches fire, or gets stolen, the data is no where to be found. Moreover, RAID does not protect you from user error. If you delete file or a directory by mistake, RAID will not help you one bit. Only a backup on a separate media / directory will help with these "Oops!" mishaps.
Tape is old and obsolete technology! Tapes are definitely old as a concept. But the technology keeps evolving, with more capacity and speed all the time. There are also robotic tape libraries that handle the insertion / removal of tapes from drives, although these are beyond home use.
Optical media such as CD-Rs and DVDs are better! Optical media now are either CD-Rs (approx 700 MB each?) or DVDs (4 GB each?). These capacities are not enough to backup an entire disk today. Moreover, the media is mostly write once, so it is throwaway, unless you use CD-RW, which are too small, or DVD RW, which is still nascent technology with high cost drives and media, as well as low capacity compared to tape.
IDE disks are so cheap. Just buy another disk, and put in the computer and use it only for backup. While that helps a bit, it is still flawed. If the original hard disk fails, this will protect you. However, if the power supply blows up and fries all components and drives, you are out of luck. You are also out of luck if you get your basement flooded or have a fire.
Firewire external drives or IDE hard drives in a caddy are better and have more capacity These are certainly a step above the RAID argument or the optical media one. They will help you with the scenario where the server power supply catches fire. They will not help with other scenarios though. If you keep the disk at home, then a fire at home destroys your data AND the backup of that data. Moreover, you only have one copy of the data, so if you delete a file by mistake, then do a backup, that backup does not have the file on it.
rsync mirroring does the job for me It depends on how you set it up. If you set it up to a remote machine (offsite, at work, university or at a friends place), then it is much better than the above options. However, do you keep multiple versions of the backup for a month or more? If so, then congratulations, and you are lucky to have all that space and resources! If not, then you have the same drawbacks as above. A local rsync over the LAN is relatively fast, but again, it is in the same home, subject to theft, flooding or fire.

You can read more on how I use tapes on Linux to backup a home network, where are there are lots of details on the hardware, configuration, and commands used, as well as lots of links to various information on backup.

Contents: 

Comments

Media longevity is one thing, hardware compatibility another

When considering a tape format be aware that some technologies do not maintain an indefinite compatibility to older formats. We use the LTO format for our 48 slot tape library. LTO is a great tape format but each generation of drives have a built-in limitation of only being able to read the last 2 tape generations, meaning LTO1 tapes are unusable in LTO4 or newer drives. If your LTO3 system is being returned on lease or dies completely you will be unable to touch that archival data. Your only option is to transfer older backups to the newer format when your backup system is being upgraded or retired. At least you will be using fewer tapes with the new format, your data pool gets refreshed and the time it takes to move that old data is reduced with each generation.

Tape shelf life is not decades!

In my experience about 40% of our tapes were unreadable after just 5 years of being kept in normal room conditions and not regularly being run through a tape drive. Also build up on the tapes meant that in order to go through our collection and save the data that was left, cleaning tapes were required far more often than normal and the drives needed to be opened up and manually cleaned several times.

You have to bear in mind that the manufacturers claims are based on simulated aging so may not be accurate in the first place, and if your storage conditions are even slightly worse than 'optimal' it could make a huge difference.

So if you are planning to archive on any media, especially tape it is important to monitor the state of your data otherwise you are going to be in for a shock when you attempt to go back and use it.

Pages