Lecture 8: Secondary-Storage Structure 1

Disk Architecture
[Figure: disk platter labeled with track, sector, disk head, and cylinder; spindle speed 3500-7000 rpm]

Disk Structure
• Disk drives are addressed as large 1-D arrays of logical blocks, where the logical block is the smallest unit of transfer.
• The 1-D array of logical blocks is mapped onto the sectors of the disk sequentially (sketched below):
  - Sector 0 is the 1st sector of the 1st track on the outermost cylinder.
  - The mapping proceeds in order through that track, then through the rest of the tracks in that cylinder, and then through the rest of the cylinders from outermost to innermost.
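
A minimal sketch of inverting this sequential mapping, assuming an idealized geometry with a fixed number of sectors per track (real drives use zoned recording, so the controller's actual mapping differs; all names here are illustrative):

    def logical_to_chs(lba, tracks_per_cylinder, sectors_per_track):
        """Invert the sequential mapping: logical block number -> (cylinder, track, sector)."""
        cylinder, rest = divmod(lba, tracks_per_cylinder * sectors_per_track)
        track, sector = divmod(rest, sectors_per_track)
        return cylinder, track, sector

    # Logical block 0 lands on sector 0 of track 0 of the outermost cylinder (cylinder 0).
    assert logical_to_chs(0, 16, 63) == (0, 0, 0)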

Disk Structure
[Figure: logical view as a 1-D array of logical blocks, mapped onto the physical view of the disk: cylinders 0, 1, 2, ... and sectors 0, 1, ...]

Disk Scheduling
• The OS is responsible for using the hardware efficiently; for disk drives, this means a fast access time and high disk I/O bandwidth.
• Access time has two major components:
  1. Seek time: the time to move the head to the destination cylinder.
  2. Rotational latency: the time for the disk to rotate the desired sector under the disk head.

Disk Architecture
• Seek time ("seek to the cylinder"): ~10^-2 second
• Rotational latency ("rotate to the sector"): 0.5 x (60/6000) ≈ 5 x 10^-3 second at 6000 rpm
• Transmission time (transfer the bytes into memory): ~10^-4 second for a 1 KB block at a 10 MB/s DMA rate
[Figure: the same platter diagram (track, sector, disk head, cylinder); 3500-7000 rpm]
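
A quick back-of-the-envelope check of these magnitudes, using the figures above (6000 rpm, a 1 KB block, a 10 MB/s DMA rate):

    seek = 1e-2                           # average seek: ~10 ms
    latency = 0.5 * (60 / 6000)           # half a rotation at 6000 rpm: ~5 ms
    transmission = 1e3 / 10e6             # 1 KB at 10 MB/s: ~0.1 ms
    print(seek + latency + transmission)  # ~0.0151 s: the mechanical delays dominate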

Disk Scheduling
• Minimize seek time; seek time is roughly proportional to seek distance.
• Disk bandwidth = total number of bytes transferred / (time from the first request to completion of the last request).
• Some algorithms (illustrated with code sketches below):
  1. First-Come-First-Served (FCFS)
  2. Shortest-Seek-Time-First (SSTF)
  3. SCAN (elevator algorithm)
  4. Circular-SCAN (C-SCAN)
  5. C-LOOK (LOOK)
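
The sketches that follow share a small helper (hypothetical, for illustration) that totals the head movement for a given service order:

    def total_movement(start, order):
        """Sum the absolute head movements when cylinders are visited in the given order."""
        total, head = 0, start
        for cylinder in order:
            total += abs(cylinder - head)
            head = cylinder
        return total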

1. First-Come-First-Served (FCFS)
Queue = 98, 183, 37, 122, 14, 124, 65, 67; head starts at cylinder 53 (cylinders 0-199).
[Figure: head movement in queue order: 53→98 (45), 98→183 (85), 183→37 (146), 37→122 (85), 122→14 (108), 14→124 (110), 124→65 (59), 65→67 (2)]
Total head movement = 640 tracks.
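
With the helper above, FCFS is just the queue in arrival order:

    queue = [98, 183, 37, 122, 14, 124, 65, 67]
    print(total_movement(53, queue))  # 640 tracks, matching the slide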

1. First-Come-First-Served (FCFS)
• Easy to program and intrinsically fair.
• Ignores the positional relationships among pending requests.
• Acceptable when the load on a disk is light and requests are uniformly distributed; not good for medium and heavy loads.

2. Shortest-Seek-Time-First (SSTF)
• Selects the request with the minimum seek time from the current head position.
• A form of SJF scheduling.
• Better than FCFS in general.
• May cause starvation of some requests.

2. Shortest-Seek-Time-First (SSTF)
Queue = 98, 183, 37, 122, 14, 124, 65, 67; head starts at cylinder 53.
[Figure: head movement in SSTF order: 53→65 (12), 65→67 (2), 67→37 (30), 37→14 (23), 14→98 (84), 98→122 (24), 122→124 (2), 124→183 (59)]
Total head movement = 236 tracks.
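
A greedy SSTF sketch (illustrative names; reuses total_movement from above):

    def sstf(start, requests):
        """Repeatedly pick the pending request closest to the current head position."""
        pending, head, order = list(requests), start, []
        while pending:
            nearest = min(pending, key=lambda r: abs(r - head))
            pending.remove(nearest)
            order.append(nearest)
            head = nearest
        return order

    order = sstf(53, [98, 183, 37, 122, 14, 124, 65, 67])
    print(order, total_movement(53, order))  # [65, 67, 37, 14, 98, 122, 124, 183] 236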

SSTF
• It is not optimal:
  - Consider servicing the requests (from the left) in the order 53, 37, 14, 65, 67, 98, 122, 124, 183. The total head movement is 208 tracks, 28 tracks less than that of SSTF.

SSTF Is Not Optimal
Queue = 98, 183, 37, 122, 14, 124, 65, 67; head starts at cylinder 53.
[Figure: the SSTF order gives 12 + 2 + 30 + 23 + 84 + 24 + 2 + 59 = 236 tracks; the order 37, 14, 65, 67, 98, 122, 124, 183 gives 16 + 23 + 51 + 2 + 31 + 24 + 2 + 59 = 208 tracks]

3. SCAN (Elevator Algorithm)
• The disk arm starts at one end of the disk and moves toward the other end, servicing requests until it gets to the other end of the disk, where the head movement is reversed and servicing continues.
• Most disk-scheduling strategies actually implemented are based on SCAN.
• Improves throughput and mean response time.

3. SCAN (Elevator Algorithm)
Queue = 98, 183, 37, 122, 14, 124, 65, 67; head starts at cylinder 53, moving toward 0.
[Figure: 53→37 (16), 37→14 (23), 14→0 (14), reverse, 0→65 (65), 65→67 (2), 67→98 (31), 98→122 (24), 122→124 (2), 124→183 (59)]
Total head movement = 236 tracks.
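
A SCAN sketch for the case shown (head initially moving toward cylinder 0 and traveling all the way to the edge before reversing; names are illustrative):

    def scan_down(start, requests, low_end=0):
        """Service requests below the head going down, hit the edge, then sweep up."""
        down = sorted((r for r in requests if r < start), reverse=True)
        up = sorted(r for r in requests if r >= start)
        return down + [low_end] + up

    order = scan_down(53, [98, 183, 37, 122, 14, 124, 65, 67])
    print(total_movement(53, order))  # 236 tracks, matching the slide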

3. SCAN (Elevator Algorithm)
• Eliminates much of the discrimination in SSTF:
  - Much lower variance in response time.
  - Requests at the other end of the disk wait the longest.
  - But the upper bound on head movement for servicing a disk request is just twice the number of disk tracks.

4. C-SCAN
• A variant of SCAN that provides a more uniform wait time than SCAN.
• The head moves from one end of the disk to the other, servicing requests as it goes. When it reaches the other end, it immediately returns to the beginning of the disk without servicing any requests on the return trip.
• Very small variance in response time; that is, it maintains a more uniform wait-time distribution among the requests.

4. Circular-SCAN (C-SCAN)
Queue = 98, 183, 37, 122, 14, 124, 65, 67; head starts at cylinder 53, moving toward 199.
[Figure: 53→65 (12), 65→67 (2), 67→98 (31), 98→122 (24), 122→124 (2), 124→183 (59), 183→199 (16), return 199→0 (199), 0→14 (14), 14→37 (23)]
Total head movement = 382 tracks.
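
A C-SCAN sketch (the return sweep is counted as head movement here, as the slide's 382-track total does; names are illustrative):

    def c_scan(start, requests, low_end=0, high_end=199):
        """Sweep up to the high edge, jump back to the low edge, then sweep up again."""
        up = sorted(r for r in requests if r >= start)
        wrapped = sorted(r for r in requests if r < start)
        return up + [high_end, low_end] + wrapped

    order = c_scan(53, [98, 183, 37, 122, 14, 124, 65, 67])
    print(total_movement(53, order))  # 382 tracks, matching the slide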

5. C-LOOK
• C-LOOK is the practical variant of C-SCAN:
  - The head is moved only as far as the last request in each direction.
  - When there are no requests in the current direction, the head movement is reversed.
• C-SCAN, by contrast, always moves the head from one end of the disk to the other.

5. C-LOOK (Modified C-SCAN)
Queue = 98, 183, 37, 122, 14, 124, 65, 67; head starts at cylinder 53.
[Figure: 53→65 (12), 65→67 (2), 67→98 (31), 98→122 (24), 122→124 (2), 124→183 (59), jump 183→14 (169), 14→37 (23)]
Total head movement = 322 tracks.
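
A C-LOOK sketch: the same as C-SCAN, but stopping at the last request and jumping to the lowest pending one (names are illustrative):

    def c_look(start, requests):
        """Sweep up through the pending requests, then wrap to the lowest pending request."""
        up = sorted(r for r in requests if r >= start)
        wrapped = sorted(r for r in requests if r < start)
        return up + wrapped

    order = c_look(53, [98, 183, 37, 122, 14, 124, 65, 67])
    print(total_movement(53, order))  # 322 tracks, matching the slide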

Selecting a Disk-Scheduling Algorithm
• It is possible to develop an optimal algorithm, but the computation needed may not justify the savings over the SSTF or SCAN scheduling algorithms.
• The SCAN and C-SCAN (or LOOK and C-LOOK) algorithms are more appropriate for systems that place a heavy load on the disk.

Selecting a Disk-Scheduling Algorithm
• If the queue seldom has more than one outstanding request, then all disk-scheduling algorithms degrade to FCFS and are thus effectively equivalent.
• Some disk-controller manufacturers have moved disk-scheduling algorithms into the hardware itself:
  - The OS sends requests to the controller in FCFS order; the controller queues them and executes them in some more nearly optimal order.

Disk Management
• Low-level formatting, or physical formatting: dividing a disk into sectors that the disk controller can read and write.
• To use a disk to hold files, the OS still needs to record its own data structures on the disk:
  - Partition the disk into one or more groups of cylinders.
  - Logical formatting: "making a file system".

Swap-Space Management
• Swap space: virtual memory uses disk space as an extension of main memory.
• Goal: provide the best throughput for the virtual-memory system.

Swap-Space
[Figure: a user program in main memory; the file system and swap space on disk]

Swap-Space Use
• Systems that implement swapping may use swap space to hold the entire process image, including the code and data segments. (Paging systems may simply store pages that have been pushed out of main memory.)
• Size of swap space: a few MB to hundreds of MB.
• UNIX: multiple swap spaces, usually placed on separate disks. UNIX copies entire processes between contiguous disk regions and memory.

Swap-Space Location
1. In the normal file system:
  - Simply a large file within the file system.
  - Easy to implement, but inefficient due to the cost of traversing the file-system data structures.
2. In a separate disk partition (more common):
  - No file system or directory structure is placed on this space.
  - A separate swap-space storage manager is used to allocate and deallocate the blocks (optimized for speed, not storage utilization).

Swap-Space Management
• BSD 4.3:
  - Preallocation: allocates swap space when the process starts; holds the text segment (the program) and the data segment.
  - The kernel uses swap maps to track swap-space use.
  - The file system is consulted only once for each text segment.
  - Pages from the data segment are read in from the file system, or created, and are written to swap space and paged back in as needed.

Swap-Space Management
• Solaris 2:
  - Allocates swap space only when a page is forced out of physical memory, not when the virtual-memory page is first created (modern computers have larger main memories).

Disk Reliability
Disk failure causes a loss of data and significant downtime while the disk is replaced and the data restored.

Disk Striping (Interleaving)
• A group of disks is treated as one storage unit.
• Each data block is divided into several sub-blocks, and each sub-block is stored on a separate disk (see the sketch below).
• This reduces disk-block access time and can fully utilize the disk I/O bandwidth.
• Performance improvement: all disks transfer their sub-blocks in parallel.
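
A minimal sketch of where a striped byte lands, assuming round-robin placement of fixed-size sub-blocks across the array (an illustrative layout, not a specific product's):

    def locate(offset, unit_size, num_disks):
        """Map a byte offset in the logical volume to (disk index, offset on that disk)."""
        stripe = offset // unit_size    # which sub-block the byte falls in
        disk = stripe % num_disks       # sub-blocks rotate across the disks
        disk_offset = (stripe // num_disks) * unit_size + offset % unit_size
        return disk, disk_offset

    print(locate(2048, 512, 4))  # (0, 512): the 5th sub-block wraps back to disk 0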

RAID
• RAID: Redundant Arrays of Inexpensive Disks.
• Improves performance (especially the price/performance ratio) and reliability (with duplication of data).

RAID Level 1
• Known as mirroring or shadowing: makes a duplicate of all data files onto a second disk.
• 50% disk-space utilization.
• The simplest RAID organization.

RAID 1: Mirroring (Shadowing)
[Figure: a WRITE to the file system is applied to both the primary disk and its mirror]

RAID 3: Block-Interleaved Parity
• An extra block of parity data is written to a separate disk.
• Example: if there are 9 disks in the array, then sector 0 of disks 1 to 8 have their parity computed and stored on disk 9.
• The operation takes place at the bit level within each byte.

Block-Interleaved Parity
[Figure: XOR truth table (1 XOR 1 = 0, 0 XOR 1 = 1, 1 XOR 0 = 1, 0 XOR 0 = 0); the parity disk stores the bitwise XOR of the corresponding bits on Disk 1 and Disk 2]

Block-Interleaved Parity
[Figure: the same XOR truth table; a lost bit on one data disk (shown as "?") is recovered by XORing the corresponding bits of the surviving data disk and the parity disk (Disk 3)]
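
A sketch of parity computation and recovery over whole blocks (the controller works bit by bit within each byte, which is exactly what bytewise XOR does; names are illustrative):

    def parity(blocks):
        """Bitwise XOR of equal-length blocks: data blocks in, the parity block out."""
        out = bytearray(len(blocks[0]))
        for block in blocks:
            for i, byte in enumerate(block):
                out[i] ^= byte
        return bytes(out)

    disk1, disk2 = b"\xa5\x0f", b"\x3c\xf0"
    p = parity([disk1, disk2])
    # If disk2 crashes, XORing the survivors reconstructs its contents:
    assert parity([disk1, p]) == disk2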

Block-Interleaved Parity
• If one disk crashes, we can recompute the original data from the other data bits plus the parity.
• It has been shown that with a RAID of 100 inexpensive disks and 10 parity disks, the mean time to data loss (MTDL) is 90 years.
• The MTDL of a standard large expensive disk is 2 or 3 years.

RAID 3
• Utilization issue:
  - Stripes data across a minimum of two drives while using a third drive to store each byte's parity bit: 2/3 disk utilization (with 3 disks).
• Performance issues:
  - Writing is slow, because updating any single data sub-block forces the corresponding parity sub-block to be recomputed and rewritten.
  - Can manage only one data transfer at a time per array (for example, a single read or write).

RAID Level 5
• Similar to RAID 3, but all devices are used for data storage, with the parity recording distributed across all drives (one possible layout is sketched below).
• RAID 5 provides the best combination of overall data availability and fault-tolerant protection.
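
One way the rotating parity placement can be pictured (a sketch of a simple rotating layout; real controllers differ in the exact rotation scheme):

    def raid5_stripe(stripe, num_disks):
        """Return (parity disk, data disks) for one stripe, rotating the parity disk."""
        parity_disk = (num_disks - 1) - stripe % num_disks
        data_disks = [d for d in range(num_disks) if d != parity_disk]
        return parity_disk, data_disks

    for s in range(4):
        print(s, raid5_stripe(s, 4))  # parity rotates: disk 3, 2, 1, 0, ...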

RAID Info
• Read the article about RAID on the course webpage: SunExpert, March 1996, Vol. 7, No. 3, "RAID: Wasted Days, Wasted Nights".