Redundant Array of Independent Disks RAID Mass Storage

  • Slides: 18
Download presentation
Redundant Array of Independent Disks RAID

Redundant Array of Independent Disks RAID

Mass Storage �Many systems today need to store many terabytes of data. �Don’t want

Mass Storage �Many systems today need to store many terabytes of data. �Don’t want to use single, large disk too expensive failures could be catastrophic �Would prefer to use many smaller disks.

What is RAID? �is a storage technology. �was first defined by David Patterson, Garth

What is RAID? �is a storage technology. �was first defined by David Patterson, Garth A. Gibson, and Randy Katz at the University of California, Berkeley in 1987. �is the organization of multiple disks into a large, high performance logical disk.

The Need for RAID �An array of multiple disks accessed in parallel will give

The Need for RAID �An array of multiple disks accessed in parallel will give greater throughput than a single disk. �Redundant data on multiple disks provides fault tolerance.

Two Important Concepts �Striping �Redundancy

Two Important Concepts �Striping �Redundancy

Striping �Take file data and map it to different disks �Allows for reading data

Striping �Take file data and map it to different disks �Allows for reading data in parallel file data block 0 Disk 0 block 1 Disk 1 block 2 Disk 2 block 3 Disk 3

Redundancy �In engineering, redundancy is the duplication of critical components or functions of a

Redundancy �In engineering, redundancy is the duplication of critical components or functions of a system with the intention of increasing reliability of the system, usually in the case of a backup or fail-safe. �Data redundancy occurs in database systems which have a data that is repeated in two or more disks.

Standard Levels �A number of standard schemes have evolved which are referred to as

Standard Levels �A number of standard schemes have evolved which are referred to as levels. �There were five RAID levels originally conceived �Other kinds have been proposed in literature �Level 2 and 4 are not commercially available

RAID 0 �Break a file into blocks of data �Stripe the blocks across disks

RAID 0 �Break a file into blocks of data �Stripe the blocks across disks in the system �provides no redundancy or error detection important to consider because lots of disks means low Mean Time To Failure (MTTF)

Data Mapping for RAID 0

Data Mapping for RAID 0

RAID 1 � A complete file is stored on a single disk � A

RAID 1 � A complete file is stored on a single disk � A second disk contains an exact copy of the file � Provides complete redundancy of data � Most expensive RAID implementation requires twice as much storage space

RAID 2 �RAID 2 implements bit striping with ECC �Error correction code (Hamming code)

RAID 2 �RAID 2 implements bit striping with ECC �Error correction code (Hamming code) allows for correction of a single bit error �is not as efficient as other RAID levels and is not generally used.

RAID 3 �Data is striped so each sequential byte is on a different drive

RAID 3 �Data is striped so each sequential byte is on a different drive �Parity is calculated across corresponding bytes and stored on a dedicated parity drive. �It requires only one disk for parity data. �RAID 3 suffers from a write bottleneck.

RAID 4 �Similar to RAID 3. �It employs striped data in much larger blocks

RAID 4 �Similar to RAID 3. �It employs striped data in much larger blocks or segments. �Not used commercially.

RAID 5 �Distribution of the parity strip to avoid the bottle neck. � Best

RAID 5 �Distribution of the parity strip to avoid the bottle neck. � Best of all worlds read and write performance close to that of RAID Level-1 requires as much disk space as Levels-3, 4

Combinations of different levels �Combine two levels and get the advantages from both. �Examples:

Combinations of different levels �Combine two levels and get the advantages from both. �Examples: 0+1, 1+0, 0+3, 3+0, 0+5, 5+0, 1+5, and 5+1.

Uses �Today, RAID is found everywhere--- �In operating system software. �A stand-alone controller providing

Uses �Today, RAID is found everywhere--- �In operating system software. �A stand-alone controller providing advanced data integrity in high-end storage area networks. �Laptops, as well as desktops, workstations, servers, and external enclosures with a larger number of hard disk drives. �RAID is even included in TV set top boxes or personal storage devices.

THANK YOU!

THANK YOU!