enhanced availability with raid cc5493/7493. raid redundant array of independent disks raid is...
TRANSCRIPT
Enhanced Availability With RAID
CC5493/7493
RAID
• Redundant Array of Independent Disks
• RAID is implemented to improve:– IO throughput (speed) and – Availability of a file system.
RAID Implementation
• Software – often criticized as not being a true RAID implementation.
• Hardware – A special RAID controller is required.
RAID: Stripe
• The stripe takes on two meanings within the context of a RAID system:– Stripe width (number of independent drives)– Stripe size (storage block size)
Both stripe width and stripe size are adjusted to enhance IO throughput.
RAID Stripe Width
• Stripe width refers to the number of disks used in parallel for IO transfers to and from the array.
Raid Stripe Size
• Stripe size refers to the size of the storage units organized on the disk surface.
• The stripe size is adjusted to optimize the speed of the IO transfers.
Common RAID Types
• RAID-0
• RAID-1
• RAID-1+0, RAID-0+1
• RAID-5
• RAID-6
RAID-0
• AKA disk striping
• Does not provide redundancy
• Degrades data availability, reduces MTF
• Improves IO throughput (average IO transfer rate improves)
RAID-0
• Ideal for temporary storage requiring fast data access.-Engineering/Scientific calculations on large
data volumes. However, the data is a redundant temporary copy.
RAID-1
• AKA mirroring
• Requires two independent disk devices– The first disk stores the data– The second disk is an image of the first– Can double the overall read throughput
RAID-1
• width = 1
RAID-1 Advantages
• Improves data availability.
• Dual-channel controller allows for two simultaneous read operations.
• Allows for error detection on read.
• Administrative advantages for service on one drive while the other remains available.
• Fault tolerance is one drive.
RAID-1 Disadvantages
• Writes have a slight performance penalty compared to no RAID.
• Doubles the cost of storage.
• Storage efficiency = 50%
RAID-1
• Ideal for data that is read more often than written:– Some database information that is not
updated often.– Web Server information (lots of reads, few
writes)
RAID-1+0
• Enhances IO throughput and data availability.
• Requires 2(n+1) separate disk devices, where n = 1, 2, 3, 4, …– Minimum of 4 disks required (n=1)
RAID-1+0
Width=2
RAID-1+0
• Width = 4
RAID-1+0
• RAID-1+0 has a higher fault tolerance compared to RAID-0,1, & 5.
• Storage efficiency is 50%
RAID-0+1
• Requires the same hardware as RAID-1+0, but less fault tolerant.
• However, there is better read throuthput from RAID-0+1 compared to RAID-1+0.
RAID-0+1
• Duplicate RAID-0 arrays. Allows simultaneous reads
RAID-5
• RAID-5 enhances – IO data throughput– Data availability
• Parity information enhances availability
• Requires a minimum of 3 independent disk devices.
Parity Information
• Based on the logical exclusive-or operation.
RAID-5 Configuration
• Stripe Width = 4
RAID-5
• The most common implementation of RAID.
• Ideal for a disk-server providing general storage.
• A good balance between reliability and speed.
• Often implemented using high quality disk drives (SCSI, 15k-rpm, high MTF)
RAID-5 Limitations
• Overhead occurs during writes due to the parity calculation and parity write.
• Storage efficiency is not 100% due to the parity storage requirements.
storage efficiency = (n-1)/n, where n = number of drives.
RAID-5 (S)ATA Limitations
• Large capacity (S)ATA drives are more likely to contain bad blocks.
• After a disk failure, the bad blocks make it impossible to rebuild the array from the remaining drives.
RAID-6
• Contains two sets of parity.
• Tolerates two simultaneous disk failures.
• A better solution for (S)ATA arrays where each disk has a large capacity (multiple TB).
• Stripe Width = 6
RAID-6
• Higher availability at the cost of greater IO overhead due to complex parity calculations and storage.
• Storage efficiency = (n-2)/n
• Becoming more popular for large storage capacity (S)ATA arrays
RAID-6 Disadvantages
• More expensive to implement due to extra parity information
• Slower write operations compared to other RAID-5
RAID Disk Swapping
• Hot Swap
• Warm Swap
• Cold Swap
Hot Swap
• The ability to swap out a failed disk from a RAID array without an interruption of service from the array.
• Performance will be slower due to the operations required to rebuild the new replacement disk.
Warm Swap
• The array is not accessible while a drive is being serviced, but the system does not need to be shut down.
Cold Swap
• System must be shutdown to service the array.
Spare Disk: Hot Spare
• Some RAID controllers can be configured to immediately recover from a disk failure if a hot-spare disk is connected to the controller at all times.
RAID Disk Failure and Performance
• When a failed disk is replaced in an array, there is a performance hit as the new disk must be re-populated with the required data for the complete array.
RAID Summary
• RAID-0 : for temporary storage only
• RAID-1 : ideal for disk services that provide mostly read operations like data base services and web services.
• RAID-5 : general purpose disk-server
• RAID-6 : for very large data requirement environments (multiple T-Bytes).
RAID Summary
• RAID 1+0 : general purpose disk server where RAID-5 & 6 are not adequate.– Better fault tolerance– More IO throughput
Other?
• RAID 1+1, mirror a mirrored RAID-1– Triples the cost of storage– Excellent fault tolerance.– Excellent read throughput.– Writes will suffer