Carnegie Mellon Bryant and OHallaron Computer Systems A
Carnegie Mellon Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 1
Carnegie Mellon Synchronization: Basics 15 -213: Introduction to Computer Systems 24 th Lecture, November 16, 2017 Instructor: Randy Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 2
Carnegie Mellon Today ¢ ¢ Threads review Sharing Mutual exclusion Semaphores Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 3
Carnegie Mellon Traditional View of a Process ¢ Process = process context + code, data, and stack Process context Program context: Data registers Condition codes Stack pointer (SP) Program counter (PC) Kernel context: VM structures Descriptor table brk pointer Code, data, and stack SP Shared libraries brk Run-time heap Read/write data Read-only code/data PC Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 0 4
Carnegie Mellon Alternate View of a Process ¢ Process = thread + code, data, and kernel context Thread (main thread) Code, data, and kernel context Shared libraries SP Stack Thread context: Data registers Condition codes Stack pointer (SP) Program counter (PC) Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition brk Run-time heap Read/write data Read-only code/data PC 0 Kernel context: VM structures Descriptor table brk pointer 5
Carnegie Mellon A Process With Multiple Threads ¢ Multiple threads can be associated with a process § Each thread has its own logical control flow § Each thread shares the same code, data, and kernel context § Each thread has its own stack for local variables but not protected from other threads § Each thread has its own thread id (TID) § Thread 1 (main thread) Thread 2 (peer thread) stack 1 Thread 1 context: Data registers Condition codes SP 1 PC 1 Shared code and data shared libraries stack 2 run-time heap read/write data read-only code/data Thread 2 context: Data registers Condition codes SP 2 PC 2 Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 0 Kernel context: VM structures Descriptor table brk pointer 6
Carnegie Mellon Shared Variables in Threaded C Programs ¢ Question: Which variables in a threaded C program are shared? § The answer is not as simple as “global variables are shared” and “stack variables are private” ¢ ¢ Def: A variable x is shared if and only if multiple threads reference some instance of x. Requires answers to the following questions: § What is the memory model for threads? § How are instances of variables mapped to memory? § How many threads might reference each of these instances? Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 7
Carnegie Mellon Threads Memory Model: Conceptual ¢ ¢ Multiple threads run within the context of a single process Each thread has its own separate thread context § Thread ID, stack pointer, PC, condition codes, and GP registers ¢ All threads share the remaining process context § Code, data, heap, and shared library segments of the process virtual address space § Open files and installed handlers Thread 1 (private) Thread 2 (private) stack 1 stack 2 Shared code and data shared libraries Thread 1 context: Data registers Condition codes SP 1 PC 1 Thread 2 context: Data registers Condition codes SP 2 PC 2 Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition run-time heap read/write data read-only code/data 8
Carnegie Mellon Threads Memory Model: Actual ¢ Separation of data is not strictly enforced: § Register values are truly separate and protected, but… § Any thread can read and write the stack of any other thread stack 1 Thread 1 (private) Thread 1 context: Data registers Condition codes SP 1 PC 1 stack 2 Thread 2 (private) Thread 2 context: Data registers Condition codes SP 2 PC 2 Virtual Address Space Shared code and data shared libraries run-time heap read/write data read-only code/data The mismatch between the conceptual and operation model is a source of confusion and errors Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 9
Carnegie Mellon Example Program to Illustrate Sharing char **ptr; /* global var */ int main(int argc, char *argv[]) { long i; pthread_t tid; char *msgs[2] = { "Hello from foo", "Hello from bar" }; ptr = msgs; for (i = 0; i < 2; i++) Pthread_create(&tid, NULL, thread, (void *)i); Pthread_exit(NULL); } sharing. c Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition void *thread(void *vargp) { long myid = (long)vargp; static int cnt = 0; printf("[%ld]: %s (cnt=%d)n", myid, ptr[myid], ++cnt); return NULL; } Peer threads reference main thread’s stack indirectly through global ptr variable A common, but inelegant way to pass a single argument to a thread routine 10
Carnegie Mellon Mapping Variable Instances to Memory ¢ Global variables § Def: Variable declared outside of a function § Virtual memory contains exactly one instance of any global variable ¢ Local variables § Def: Variable declared inside function without static attribute § Each thread stack contains one instance of each local variable ¢ Local static variables § Def: Variable declared inside function with the static attribute § Virtual memory contains exactly one instance of any local static variable. Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 11
Carnegie Mellon Mapping Variable Instances to Memory Global var: 1 instance (ptr [data]) char **ptr; Local vars: 1 instance (i. m, msgs. m) /* global var */ int main(int main, char *argv[]) { long i; pthread_t tid; char *msgs[2] = { "Hello from foo", "Hello from bar" }; ptr = msgs; for (i = 0; i < 2; i++) Pthread_create(&tid, NULL, thread, (void *)i); Pthread_exit(NULL); } sharing. c Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition Local var: 2 instances ( myid. p 0 [peer thread 0’s stack], myid. p 1 [peer thread 1’s stack] ) void *thread(void *vargp) { long myid = (long)vargp; static int cnt = 0; printf("[%ld]: %s (cnt=%d)n", myid, ptr[myid], ++cnt); return NULL; } Local static var: 1 instance (cnt [data]) 12
Carnegie Mellon Shared Variable Analysis ¢ Which variables are shared? Variable Referenced by instance main thread? ptr cnt i. m msgs. m myid. p 0 myid. p 1 yes no no Referenced by peer thread 0? Referenced by peer thread 1? yes yes no yes char **ptr; /* global var */ void *thread(void *vargp) int main(int main, char *argv[]) { long i; pthread_t tid; x is shared iff {multiple threads ¢ Answer: A variable long myid = (long)vargp; char *msgs[2] = {"Hello from foo", reference at least one instance Thus: int cnt = 0; static "Hello from bar" }; of x. ptr msgs; n = ptr, cnt, and msgs are shared printf("[%ld]: %s (cnt=%d)n", for (i = 0; i < 2; i++) myid, ptr[myid], ++cnt); n Pthread_create(&tid, i and myid are not shared return NULL; NULL, thread, (void *)i); } Pthread_exit(NULL); } Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 13
Carnegie Mellon Shared Variable Analysis ¢ Which variables are shared? Variable Referenced by instance main thread? ptr cnt i. m msgs. m myid. p 0 myid. p 1 ¢ yes no no Referenced by peer thread 0? Referenced by peer thread 1? yes yes no yes Answer: A variable x is shared iff multiple threads reference at least one instance of x. Thus: n n ptr, cnt, and msgs are shared i and myid are not shared Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 14
Carnegie Mellon Synchronizing Threads ¢ ¢ Shared variables are handy. . . …but introduce the possibility of nasty synchronization errors. Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 15
Carnegie Mellon badcnt. c: Improper Synchronization /* Global shared variable */ volatile long cnt = 0; /* Counter */ int main(int argc, char **argv) { long niters; pthread_t tid 1, tid 2; for (i = 0; i < niters; i++) cnt++; niters = atoi(argv[1]); Pthread_create(&tid 1, NULL, thread, &niters); Pthread_create(&tid 2, NULL, thread, &niters); Pthread_join(tid 1, NULL); Pthread_join(tid 2, NULL); return NULL; } /* Check result */ if (cnt != (2 * niters)) printf("BOOM! cnt=%ldn", cnt); else printf("OK cnt=%ldn", cnt); exit(0); } /* Thread routine */ void *thread(void *vargp) { long i, niters = *((long *)vargp); badcnt. c Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition linux>. /badcnt 10000 OK cnt=20000 linux>. /badcnt 10000 BOOM! cnt=13051 linux> cnt should equal 20, 000. What went wrong? 16
Carnegie Mellon Assembly Code for Counter Loop C code for counter loop in thread i for (i = 0; i < niters; i++) cnt++; Asm code for thread i movq testq jle movl. L 3: movq addq cmpq jne. L 2: (%rdi), %rcx, %rcx. L 2 $0, %eax cnt(%rip), %rdx $1, %rdx, cnt(%rip) $1, %rax %rcx, %rax. L 3 Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition Hi : Head Li : Load cnt Ui : Update cnt Si : Store cnt Ti : Tail 17
Carnegie Mellon Concurrent Execution ¢ Key idea: In general, any sequentially consistent interleaving is possible, but some give an unexpected result! § Ii denotes that thread i executes instruction I § %rdxi is the content of %rdx in thread i’s context i (thread) instri %rdx 1 %rdx 2 cnt 1 1 2 2 2 1 H 1 L 1 U 1 S 1 H 2 L 2 U 2 S 2 T 1 0 1 1 2 2 2 - 0 0 0 1 1 2 2 2 Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition OK 18
Carnegie Mellon Concurrent Execution ¢ Key idea: In general, any sequentially consistent interleaving is possible, but some give an unexpected result! § Ii denotes that thread i executes instruction I § %rdxi is the content of %rdx in thread i’s context i (thread) instri %rdx 1 %rdx 2 cnt 1 1 2 2 2 1 H 1 L 1 U 1 S 1 H 2 L 2 U 2 S 2 T 1 0 1 1 2 2 2 - 0 0 0 1 1 2 2 2 Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition Thread 1 critical section Thread 2 critical section OK 19
Carnegie Mellon Concurrent Execution (cont) ¢ Incorrect ordering: two threads increment the counter, but the result is 1 instead of 2 i (thread) 1 1 1 2 2 2 instri H 1 L 1 U 1 H 2 L 2 S 1 T 1 U 2 S 2 T 2 %rdx 1 0 1 1 1 - %rdx 2 0 1 1 1 Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition cnt 0 0 0 1 1 1 Oops! 20
Carnegie Mellon Concurrent Execution (cont) ¢ How about this ordering? i (thread) 1 1 2 2 1 1 1 2 ¢ instri H 1 L 1 H 2 L 2 U 2 S 2 U 1 S 1 T 2 %rdx 1 %rdx 2 0 1 1 cnt 0 1 1 Oops! We can analyze the behavior using a progress graph Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 21
Carnegie Mellon Progress Graphs A progress graph depicts the discrete execution state space of concurrent threads. Thread 2 T 2 (L 1, S 2) Each axis corresponds to the sequential order of instructions in a thread. S 2 U 2 Each point corresponds to a possible execution state (Inst 1, Inst 2). L 2 H 1 L 1 U 1 S 1 T 1 Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition Thread 1 E. g. , (L 1, S 2) denotes state where thread 1 has completed L 1 and thread 2 has completed S 2. 22
Carnegie Mellon Trajectories in Progress Graphs Thread 2 A trajectory is a sequence of legal state transitions that describes one possible concurrent execution of the threads. T 2 S 2 Example: H 1, L 1, U 1, H 2, L 2, S 1, T 1, U 2, S 2, T 2 U 2 L 2 H 1 L 1 U 1 S 1 T 1 Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition Thread 1 23
Carnegie Mellon Trajectories in Progress Graphs Thread 2 A trajectory is a sequence of legal state transitions that describes one possible concurrent execution of the threads. T 2 S 2 Example: H 1, L 1, U 1, H 2, L 2, S 1, T 1, U 2, S 2, T 2 U 2 L 2 H 1 L 1 U 1 S 1 T 1 Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition Thread 1 24
Carnegie Mellon Critical Sections and Unsafe Regions Thread 2 L, U, and S form a critical section with respect to the shared variable cnt T 2 critical section wrt cnt Instructions in critical sections (wrt some shared variable) should not be interleaved S 2 Unsafe region Sets of states where such interleaving occurs form unsafe regions L 2 H 1 L 1 U 1 S 1 Thread 1 critical section wrt cnt Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 25
Carnegie Mellon Critical Sections and Unsafe Regions Thread 2 safe critical section wrt cnt T 2 Def: A trajectory is safe iff it does not enter any unsafe region S 2 Claim: A trajectory is correct (wrt cnt) iff it is safe U 2 Unsafe region unsafe L 2 H 1 L 1 U 1 S 1 Thread 1 critical section wrt cnt Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 26
Carnegie Mellon badcnt. c: Improper Synchronization /* Global shared variable */ volatile long cnt = 0; /* Counter */ int main(int argc, char **argv) { long niters; pthread_t tid 1, tid 2; for (i = 0; i < niters; i++) cnt++; niters = atoi(argv[1]); Pthread_create(&tid 1, NULL, thread, &niters); Pthread_create(&tid 2, NULL, thread, &niters); Pthread_join(tid 1, NULL); Pthread_join(tid 2, NULL); return NULL; } /* Check result */ if (cnt != (2 * niters)) printf("BOOM! cnt=%ldn", cnt); else printf("OK cnt=%ldn", cnt); exit(0); } /* Thread routine */ void *thread(void *vargp) { long i, niters = *((long *)vargp); badcnt. c Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition Variable main thread 1 thread 2 cnt yes* yes niters. m yes no no tid 1. m yes no no i. 1 no yes no i. 2 no no yes niters. 1 no yes no niters. 2 no no yes 27
Carnegie Mellon Quiz Time! Check out: https: //canvas. cmu. edu/courses/1221 Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 28
Carnegie Mellon Enforcing Mutual Exclusion ¢ ¢ Question: How can we guarantee a safe trajectory? Answer: We must synchronize the execution of the threads so that they can never have an unsafe trajectory. § i. e. , need to guarantee mutually exclusive access for each critical section. ¢ Classic solution: § Semaphores (Edsger Dijkstra) Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 29
Carnegie Mellon Semaphores ¢ ¢ Semaphore: non-negative global integer synchronization variable. Manipulated by P and V operations. P(s) § If s is nonzero, then decrement s by 1 and return immediately. Test and decrement operations occur atomically (indivisibly) § If s is zero, then suspend thread until s becomes nonzero and the thread is restarted by a V operation. § After restarting, the P operation decrements s and returns control to the caller. § ¢ V(s): § Increment s by 1. Increment operation occurs atomically § If there any threads blocked in a P operation waiting for s to become nonzero, then restart exactly one of those threads, which then completes its P operation by decrementing s. § ¢ Semaphore invariant: (s >= 0) Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 30
Carnegie Mellon Semaphores ¢ ¢ Semaphore: non-negative global integer synchronization variable Manipulated by P and V operations: § P(s): [ while (s == 0) wait(); s--; ] Dutch for “Proberen” (test) § V(s): [ s++; ] § Dutch for “Verhogen” (increment) § ¢ OS kernel guarantees that operations between brackets [ ] are executed indivisibly Only one P or V operation at a time can modify s. § When while loop in P terminates, only that P can decrement s § ¢ Semaphore invariant: (s >= 0) Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 31
Carnegie Mellon C Semaphore Operations Pthreads functions: #include <semaphore. h> int sem_init(sem_t *s, 0, unsigned int val); } /* s = val */ int sem_wait(sem_t *s); int sem_post(sem_t *s); /* P(s) */ /* V(s) */ CS: APP wrapper functions: #include "csapp. h” void P(sem_t *s); /* Wrapper function for sem_wait */ void V(sem_t *s); /* Wrapper function for sem_post */ Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 32
Carnegie Mellon badcnt. c: Improper Synchronization /* Global shared variable */ volatile long cnt = 0; /* Counter */ int main(int argc, char **argv) { long niters; pthread_t tid 1, tid 2; for (i = 0; i < niters; i++) cnt++; niters = atoi(argv[1]); Pthread_create(&tid 1, NULL, thread, &niters); Pthread_create(&tid 2, NULL, thread, &niters); Pthread_join(tid 1, NULL); Pthread_join(tid 2, NULL); return NULL; } /* Check result */ if (cnt != (2 * niters)) printf("BOOM! cnt=%ldn", cnt); else printf("OK cnt=%ldn", cnt); exit(0); } /* Thread routine */ void *thread(void *vargp) { long i, niters = *((long *)vargp); How can we fix this using semaphores? badcnt. c Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 33
Carnegie Mellon Using Semaphores for Mutual Exclusion ¢ Basic idea: § Associate a unique semaphore mutex, initially 1, with each shared variable (or related set of shared variables). § Surround corresponding critical sections with P(mutex) and V(mutex) operations. ¢ Terminology: § Binary semaphore: semaphore whose value is always 0 or 1 § Mutex: binary semaphore used for mutual exclusion P operation: “locking” the mutex § V operation: “unlocking” or “releasing” the mutex § “Holding” a mutex: locked and not yet unlocked. § Counting semaphore: used as a counter for set of available resources. § Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 34
Carnegie Mellon goodcnt. c: Proper Synchronization ¢ Define and initialize a mutex for the shared variable cnt: volatile long cnt = 0; sem_t mutex; /* Counter */ /* Semaphore that protects cnt */ sem_init(&mutex, 0, 1); /* mutex = 1 */ ¢ Surround critical section with P and V: for (i = 0; i < niters; i++) { P(&mutex); cnt++; V(&mutex); } goodcnt. c Function linux>. /goodcnt 10000 OK cnt=20000 linux> badcnt goodcnt Time (ms) 12 450 Warning: It’s orders of magnitude slower niters than = 106 badcnt. c. Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition Slowdown 1. 0 37. 5 35
Carnegie Mellon Why Mutexes Work Thread 2 Provide mutually exclusive access to shared variable by surrounding critical section with P and V operations on semaphore s (initially set to 1) T 2 V(s) S 2 Unsafe region U 2 L 2 -1 P(s) 0 H 2 1 H 1 0 P(s) 0 L 1 0 U 1 S 1 V(s) Initially s=1 Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition T 1 Thread 1 36
Carnegie Mellon Why Mutexes Work Thread 2 Provide mutually exclusive access to shared variable by surrounding critical section with P and V operations on semaphore s (initially set to 1) T 2 V(s) Semaphore invariant creates a forbidden region that encloses unsafe region and that cannot be entered by any trajectory. S 2 Unsafe region U 2 L 2 -1 P(s) 0 H 2 1 H 1 0 P(s) 0 L 1 0 U 1 S 1 V(s) Initially s=1 Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition T 1 Thread 1 37
Carnegie Mellon Why Mutexes Work Thread 2 Provide mutually exclusive access to shared variable by surrounding critical section with P and V operations on semaphore s (initially set to 1) T 2 V(s) Semaphore invariant creates a forbidden region that encloses unsafe region and that cannot be entered by any trajectory. S 2 Unsafe region U 2 L 2 0 P(s) 0 0 H 2 1 H 1 0 P(s) 0 L 1 1 0 U 1 S 1 V(s) Initially s=1 Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition T 1 Thread 1 38
Carnegie Mellon Why Mutexes Work Thread 2 T 2 V(s) S 2 U 2 L 2 P(s) H 2 Initially s=1 1 1 0 0 0 0 0 Forbidden region -1 -1 Unsafe region 0 0 0 -1 -1 1 1 0 0 0 0 1 1 H 1 P(s) L 1 U 1 S 1 V(s) Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition T 1 Provide mutually exclusive access to shared variable by surrounding critical section with P and V operations on semaphore s (initially set to 1) Semaphore invariant creates a forbidden region that encloses unsafe region and that cannot be entered by any trajectory. Thread 1 39
Carnegie Mellon Binary Semaphores ¢ Mutex is special case of semaphore § Value either 0 or 1 ¢ Pthreads provides pthread_mutex_t § Operations: lock, unlock ¢ Recommended over general semaphores when appropriate Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 40
Carnegie Mellon goodmcnt. c: Mutex Synchronization ¢ Define and initialize a mutex for the shared variable cnt: volatile long cnt = 0; /* Counter */ pthread_mutex_t mutex; pthread_mutex_init(&mutex, NULL); // No special attributes ¢ Surround critical section with lock and unlock: for (i = 0; i < niters; i++) { pthread_mutex_lock(&mutex); cnt++; pthread_mutex_unlock(&mutex); } goodcnt. c Function linux>. /goodmcnt 10000 OK cnt=20000 linux> badcnt goodmcnt Time (ms) niters = 106 12 450 214 Slowdown 1. 0 37. 5 17. 8 Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 41
Carnegie Mellon Summary ¢ ¢ ¢ Programmers need a clear model of how variables are shared by threads. Variables shared by multiple threads must be protected to ensure mutually exclusive access. Semaphores are a fundamental mechanism for enforcing mutual exclusion. Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition 42
- Slides: 42