Automated Tools for Software Reliability Suhabe Bugrara suhabestanford

Automated Tools for Software Reliability Suhabe Bugrara suhabe@stanford. edu Stanford University

Problem • • 80% of development cost on identifying and correcting defects Software errors cost US economy $60 billion annually (0. 6% of GDP)

Manual Testing • • Traditional approach to quality assurance Expensive Time consuming Not systematic Difficult to quantify effectiveness of test suite Cannot make any guarantees about reliability Insufficient for safety critical systems

Automated Tools • • Programs to find defects in programs Automated Systematic Easy to quantify effectiveness Provide guarantees about reliability Sometimes expensive (for now…) Sometimes time consuming (for now…)

Program Analyzers Complete Undecidable Sound • Reports all errors • Reports no false alarms Decidable Unsound • May not report all errors • Reports no false alarms Incomplete Decidable • Reports all errors • May report false alarms Decidable • May not report all errors • May report false alarms

Static Driver Verifier • • • Program analyzer for API usage rules Developed by Microsoft Research Applied to device drivers in Windows Sound: reports all possible errors Incomplete: may report false alarms

SDV: Overview 1. 2. 3. 4. 5. 6. 7. Write API usage rule specification Instrument program with usage checks Abstract program Check abstraction for errors If error found, see if error is false alarm If false alarm, refine abstraction If not false alarm, report error as bug

API Usage Rules • Ex. locks are alternatingly acquired and released

API Usage Rules • Ex. locks are alternatingly acquired and released • Expressed as finite state machine – States = {locked, unlocked, error} – Transitions = {acquire(), release()}

API Usage Rules • Ex. locks are alternatingly acquired and released • Expressed as finite state machine – States = {locked, unlocked, error} – Transitions = {acquire(), release()} unlocked release(); acquire(); release(); error locked acquire();

state { enum { Unlocked=0; Locked=1} state = Unlocked; } Ke. Acquire. Spin. Lock. return { if (state == Locked) error(); else state = Locked; } Ke. Release. Spin. Lock. return { if (!(state == Locked)) error(); else state = Unlocked; }

$enum {Unlocked=0, Locked=1} state = Unlocked void Ke. Acquire. Spin. Lock_return() { if (state$

enum {Unlocked=0, Locked=1} state = Unlocked void Ke. Acquire. Spin. Lock_return() { if (state == Locked) error(); else state = Locked; } void Ke. Release. Spin. Lock_return() { if (!(state == Locked)) error(); else state = Unlocked; }

1: void example() { 2: do { 3: Ke. Acquire. Spin. Lock(); 4: 5: n. Packets. Old = n. Packets; 6: req = dev. Ext->WLHV 7: if (req && req->status) { 8: dev. Ext->WLHV = req->Next 9: Ke. Release. Spin. Lock(); 10: 11: irp = req->irp; 12: if (req->status > 0) { 13: irp->Io. S. Status = SUCCCESS; 14: irp->Io. S. Info = req->Status; 15: } else { 16: irp->Io. S. Status = FAIL; 17: irp->Io. S. Info = req->Status; 18: } 19: Smart. Dev. Free. Block(req); 20: Io. Complete. Request(irp); 21: n. Packets++; 22: } 23: } while (n. Packets!=n. Packets. Old); 24: Ke. Release. Spin. Lock(); 25: 26: }

$enum {Unlocked=0, Locked=1} state = Unlocked void Ke. Acquire. Spin. Lock_return() { if (state$

enum {Unlocked=0, Locked=1} state = Unlocked void Ke. Acquire. Spin. Lock_return() { if (state == Locked) error(); else state = Locked; } void Ke. Release. Spin. Lock_return() { if (!(state == Locked)) error(); else state = Unlocked; }

1: void example() { Program 2: do { 3: Ke. Acquire. Spin. Lock(); 4: Ke. Acquire. Spin. Lock_return(); 5: n. Packets. Old = n. Packets; 6: req = dev. Ext->WLHV 7: if (req && req->status) { 8: dev. Ext->WLHV = req->Next 9: Ke. Release. Spin. Lock(); 10: Ke. Release. Spin. Lock_return(); 11: irp = req->irp; 12: if (req->status > 0) { 13: irp->Io. S. Status = SUCCCESS; 14: irp->Io. S. Info = req->Status; 15: } else { 16: irp->Io. S. Status = FAIL; 17: irp->Io. S. Info = req->Status; 18: } 19: Smart. Dev. Free. Block(req); 20: Io. Complete. Request(irp); 21: n. Packets++; 22: } 23: } while (n. Packets!=n. Packets. Old); 24: Ke. Release. Spin. Lock(); 25: Ke. Release. Spin. Lock_return(); 26: } A

SDV: Abstraction • Construct abstraction B of original program A – Over-approximates reachability • If error() is reachable in A, then it is also reachable in B – This characteristic makes SDV sound • If error() is reachable in B, then it may not be reachable in A – This characteristic makes SDV incomplete • Check abstraction B for any errors

Reachable States real bug! Abstraction B error Original A Sound: If A has error, then B has error

false alarm! Reachable States Abstraction B error Original A Incomplete: If B has error, then A may not have error

bool b 1; Abstract state == Locked with b 1 = false; void Ke. Acquire. Spin. Lock_return() { if (b 1) error(); else b 1 = true; } void Ke. Release. Spin. Lock_return() { if (!(b 1)) error(); else b 1 = false; }

$1: void example() { 2: do { 3: ; 4: Ke. Acquire. Spin. Lock_return();$

1: void example() { 2: do { 3: ; 4: Ke. Acquire. Spin. Lock_return(); 5: ; 6: ; 7: if (Sdv. Make. Choice()) { 8: ; 9: ; 10: Ke. Release. Spin. Lock_return(); 11: ; 12: if (Sdv. Make. Choice()) { 13: ; 14: ; 15: } else { 16: ; 17: ; 18: } 19: ; 20: ; 21: ; 22: } 23: } while (Sdv. Make. Choice()); 24: ; 25: Ke. Release. Spin. Lock_return(); 26: } Program B

$1: void example() { 2: do { 3: ; 4: Ke. Acquire. Spin. Lock_return();$

1: void example() { 2: do { 3: ; 4: Ke. Acquire. Spin. Lock_return(); 5: ; 6: ; 7: if (Sdv. Make. Choice()) { 8: ; 9: ; 10: Ke. Release. Spin. Lock_return(); 11: ; 12: if (Sdv. Make. Choice()) { 13: ; 14: ; 15: } else { 16: ; 17: ; 18: } 19: ; 20: ; 21: ; 22: } Error trace 23: } while (Sdv. Make. Choice()); found! 24: ; 25: Ke. Release. Spin. Lock_return(); 26: }

1: void example() { 2: do { 3: Ke. Acquire. Spin. Lock(); 4: Ke. Acquire. Spin. Lock_return(); 5: n. Packets. Old = n. Packets; 6: req = dev. Ext->WLHV 7: if (req && req->status) { 8: dev. Ext->WLHV = req->Next 9: Ke. Release. Spin. Lock(); 10: Ke. Release. Spin. Lock_return(); 11: irp = req->irp; 12: if (req->status > 0) { 13: irp->Io. S. Status = SUCCCESS; 14: irp->Io. S. Info = req->Status; 15: } else { 16: irp->Io. S. Status = FAIL; 17: irp->Io. S. Info = req->Status; 18: } 19: Smart. Dev. Free. Block(req); 20: Io. Complete. Request(irp); 21: n. Packets++; 22: } But, no bug 23: } while (n. Packets!=n. Packets. Old); original 24: Ke. Release. Spin. Lock(); 25: Ke. Release. Spin. Lock_return(); program! 26: } in

1: void example() { Program 2: do { 3: ; 4: Ke. Acquire. Spin. Lock_return(); 5: b 2 = false; 6: ; 7: if (Sdv. Make. Choice()) { 8: ; 9: ; 10: Ke. Release. Spin. Lock_return(); 11: ; 12: if (Sdv. Make. Choice()) { 13: ; 14: ; 15: } else { 16: ; 17: ; 18: } 19: ; 20: ; 21: b 2 = !b 2 ? true : Sdv. Make. Choice(); 22: } 23: } while (b 2); 24: ; 25: Ke. Release. Spin. Lock_return(); 26: } C

Reachable States error Abstraction B Refined C Original false alarm no longer reported! A

SDV: Summary 1. 2. 3. 4. 5. 6. 7. Write API usage rule specification Instrument program with usage checks Abstract program Check abstraction for errors If error found, see if error is false alarm If false alarm, refine abstraction If not false alarm, report error as bug

Soundness • Assume memory safety – No buffer/integer overflows – Safe memory management – No null pointer dereferences • Oversimplified harness – Use stubs to model calls into OS procedures – Stubs may not represent all behavior

Research Challenges in Verification • • Eliminate assumption of memory safety Eliminate false alarms Scale to the entire operating system Verify more complicated properties – prove consistency of file system data structures

Program Analyzers Complete Undecidable Sound • Reports all errors • Reports no false alarms Decidable Unsound • May not report all errors • Reports no false alarms Incomplete Decidable • Reports all errors • May report false alarms Decidable • May not report all errors • May report false alarms

EXE • Automatically generate test cases that explore important program paths • Developed by Dawson Engler’s group • Bug finding tool • Unsound: may not report all errors • Complete: never reports false alarms