enhanced availability with raid cc5493/7493. raid redundant array of independent disks raid is...

39
Enhanced Availability With RAID CC5493/7493

Upload: melvin-short

Post on 16-Dec-2015

232 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

Enhanced Availability With RAID

CC5493/7493

Page 2: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID

• Redundant Array of Independent Disks

• RAID is implemented to improve:– IO throughput (speed) and – Availability of a file system.

Page 3: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID Implementation

• Software – often criticized as not being a true RAID implementation.

• Hardware – A special RAID controller is required.

Page 4: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID: Stripe

• The stripe takes on two meanings within the context of a RAID system:– Stripe width (number of independent drives)– Stripe size (storage block size)

Both stripe width and stripe size are adjusted to enhance IO throughput.

Page 5: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID Stripe Width

• Stripe width refers to the number of disks used in parallel for IO transfers to and from the array.

Page 6: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

Raid Stripe Size

• Stripe size refers to the size of the storage units organized on the disk surface.

• The stripe size is adjusted to optimize the speed of the IO transfers.

Page 7: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

Common RAID Types

• RAID-0

• RAID-1

• RAID-1+0, RAID-0+1

• RAID-5

• RAID-6

Page 8: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID-0

• AKA disk striping

• Does not provide redundancy

• Degrades data availability, reduces MTF

• Improves IO throughput (average IO transfer rate improves)

Page 9: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID-0

• Ideal for temporary storage requiring fast data access.-Engineering/Scientific calculations on large

data volumes. However, the data is a redundant temporary copy.

Page 10: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID-1

• AKA mirroring

• Requires two independent disk devices– The first disk stores the data– The second disk is an image of the first– Can double the overall read throughput

Page 11: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID-1

• width = 1

Page 12: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID-1 Advantages

• Improves data availability.

• Dual-channel controller allows for two simultaneous read operations.

• Allows for error detection on read.

• Administrative advantages for service on one drive while the other remains available.

• Fault tolerance is one drive.

Page 13: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID-1 Disadvantages

• Writes have a slight performance penalty compared to no RAID.

• Doubles the cost of storage.

• Storage efficiency = 50%

Page 14: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID-1

• Ideal for data that is read more often than written:– Some database information that is not

updated often.– Web Server information (lots of reads, few

writes)

Page 15: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID-1+0

• Enhances IO throughput and data availability.

• Requires 2(n+1) separate disk devices, where n = 1, 2, 3, 4, …– Minimum of 4 disks required (n=1)

Page 16: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID-1+0

Width=2

Page 17: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID-1+0

• Width = 4

Page 18: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID-1+0

• RAID-1+0 has a higher fault tolerance compared to RAID-0,1, & 5.

• Storage efficiency is 50%

Page 19: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID-0+1

• Requires the same hardware as RAID-1+0, but less fault tolerant.

• However, there is better read throuthput from RAID-0+1 compared to RAID-1+0.

Page 20: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID-0+1

• Duplicate RAID-0 arrays. Allows simultaneous reads

Page 21: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID-5

• RAID-5 enhances – IO data throughput– Data availability

• Parity information enhances availability

• Requires a minimum of 3 independent disk devices.

Page 22: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

Parity Information

• Based on the logical exclusive-or operation.

Page 23: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID-5 Configuration

• Stripe Width = 4

Page 24: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID-5

• The most common implementation of RAID.

• Ideal for a disk-server providing general storage.

• A good balance between reliability and speed.

• Often implemented using high quality disk drives (SCSI, 15k-rpm, high MTF)

Page 25: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID-5 Limitations

• Overhead occurs during writes due to the parity calculation and parity write.

• Storage efficiency is not 100% due to the parity storage requirements.

storage efficiency = (n-1)/n, where n = number of drives.

Page 26: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID-5 (S)ATA Limitations

• Large capacity (S)ATA drives are more likely to contain bad blocks.

• After a disk failure, the bad blocks make it impossible to rebuild the array from the remaining drives.

Page 27: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID-6

• Contains two sets of parity.

• Tolerates two simultaneous disk failures.

• A better solution for (S)ATA arrays where each disk has a large capacity (multiple TB).

Page 28: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

• Stripe Width = 6

Page 29: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID-6

• Higher availability at the cost of greater IO overhead due to complex parity calculations and storage.

• Storage efficiency = (n-2)/n

• Becoming more popular for large storage capacity (S)ATA arrays

Page 30: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID-6 Disadvantages

• More expensive to implement due to extra parity information

• Slower write operations compared to other RAID-5

Page 31: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID Disk Swapping

• Hot Swap

• Warm Swap

• Cold Swap

Page 32: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

Hot Swap

• The ability to swap out a failed disk from a RAID array without an interruption of service from the array.

• Performance will be slower due to the operations required to rebuild the new replacement disk.

Page 33: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

Warm Swap

• The array is not accessible while a drive is being serviced, but the system does not need to be shut down.

Page 34: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

Cold Swap

• System must be shutdown to service the array.

Page 35: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

Spare Disk: Hot Spare

• Some RAID controllers can be configured to immediately recover from a disk failure if a hot-spare disk is connected to the controller at all times.

Page 36: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID Disk Failure and Performance

• When a failed disk is replaced in an array, there is a performance hit as the new disk must be re-populated with the required data for the complete array.

Page 37: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID Summary

• RAID-0 : for temporary storage only

• RAID-1 : ideal for disk services that provide mostly read operations like data base services and web services.

• RAID-5 : general purpose disk-server

• RAID-6 : for very large data requirement environments (multiple T-Bytes).

Page 38: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

RAID Summary

• RAID 1+0 : general purpose disk server where RAID-5 & 6 are not adequate.– Better fault tolerance– More IO throughput

Page 39: Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability

Other?

• RAID 1+1, mirror a mirrored RAID-1– Triples the cost of storage– Excellent fault tolerance.– Excellent read throughput.– Writes will suffer