remote support
 
Disaster Recovery
 











"Our server hard disks failed and our server was unavailable for days"

Disk drives are mechanical devices with platters that typically spin at between 7,000 - 10,000rpm - over time it is inevitable that the disks in your server (or workstations) will eventually fail. With proper planning however, the consequences of these hardware failures can be minimised.

Disk drive failures are more complex to resolve than other hardware issues (e.g power supply, memory). For example, if the power supply in your server were to fail, while there would be some "down-time", once the power supply had been replaced the server would be ready to resume operation as before. In contrast, once a failed disk drive has been replaced, all the information that was previously stored on the original disk drive must be restored.

For this reason a single-disk volume on a server, is referred to as not being "fault-tolerant" or "redundant". A failure of the disk drive in a 'single-disk' configuration will result in a catastrophic "crash". Time to recover from such a failure will depend on the amount of data to be restored and the type of backups held. The best-case scenario is 1/2 day and if an "offline backup" is not held 2-3 days is not uncommon.

Given the amount of disruption that can be caused by the failure of a disk drive in a single-disk server, a fault-tolerant disk configuration is highly recommended. There are a number of options available depending upon capacity, budget and performance requirements. RAID1 (mirroring) and RAID5 (array) are popular options. In both cases, the failure of a single-disk within the server does not disrupt the operation of the server.

A fault-tolerant disk configuration on it's own though is not enough. Say there was a virus strike that could not be repaired or say someone was logged onto the server with administrator rights and inadvertently damaged the system volume preventing the server from being able to restart. There is no substitute for having good backups of the operating system and user data.

One way that backups can be classified is whether they're "on-line" or "off-line". An "on-line" backup is recommended for regular backup of user data and other important data on the server (like e-mail). While an "off-line" backup is recommended for backup of the server operating system and associated system files (registry, directory etc).



 
privacy legal ©2004