Lectures on Recursive Algorithms COMP 523 Advanced Algorithmic

  • Slides: 48
Download presentation
Lectures on Recursive Algorithms COMP 523: Advanced Algorithmic Techniques Lecturer: Dariusz Kowalski Lectures on

Lectures on Recursive Algorithms COMP 523: Advanced Algorithmic Techniques Lecturer: Dariusz Kowalski Lectures on Recursive Algorithms 1

Announcements • Web page: www. csc. liv. ac. uk/~darek/comp 523. html • Slides constantly

Announcements • Web page: www. csc. liv. ac. uk/~darek/comp 523. html • Slides constantly updated based on your feedback – check for a new version a few hours after the lecture(s) Lectures on Recursive Algorithms 2

Overview Previous lectures: • Algorithms: correctness, termination, performance • Asymptotic notation • Popular asymptotic

Overview Previous lectures: • Algorithms: correctness, termination, performance • Asymptotic notation • Popular asymptotic complexities: log n, n, n log n, n 2, . . . • Graphs - basic definitions and examples These lectures: • Recursive algorithms - basic recursion • Searching algorithm in time O(log n) • Finding majority in time O(n) • Sorting in time O(n log n) • Other examples Lectures on Recursive Algorithms 3

Algorithms based on recurrence Algorithmic scheme: • Reduce the input to smaller sub-input(s) •

Algorithms based on recurrence Algorithmic scheme: • Reduce the input to smaller sub-input(s) • Solve the problem for (some) sub-inputs • Merge the obtained solution(s) into one global solution Lectures on Recursive Algorithms 4

Complexity analysis of recursive process Let T(m) denote time (or other complexity measure) of

Complexity analysis of recursive process Let T(m) denote time (or other complexity measure) of solving a given problem by a given algorithm working on input length m Simple recursive formula: T(n) q. T(n/2) + f(n) Solve the problem for half of the input q times, and spend at most f(n) time for partitioning and/or merging Lectures on Recursive Algorithms 5

Example 1: Searching in time O(log n) Lectures on Recursive Algorithms 6

Example 1: Searching in time O(log n) Lectures on Recursive Algorithms 6

Searching: problem and algorithm • • Input: sorted array A of n numbers and

Searching: problem and algorithm • • Input: sorted array A of n numbers and number x Problem: find if x is in array A Solution: Algorithm SEARCH(x, 1, n) Procedure SEARCH(x, i, j): – If i = j then • if x = A[i] then answer YES • if x A[i] then answer NO – Else compare x with element A[(i+j)/2]: • If x = A[(i+j)/2] then answer YES • If x < A[(i+j)/2] then SEARCH(x, i, (i+j)/2 -1) • If x > A[(i+j)/2] then SEARCH(x, (i+j)/2+1, j) Lectures on Recursive Algorithms 7

Tree structure of searching root 4 find if value 5 is there 2 1

Tree structure of searching root 4 find if value 5 is there 2 1 6 3 5 height = 3 7 1 2 3 4 5 6 7 8 8 leaves Lectures on Recursive Algorithms 8

Time and memory consumption Each recursive call – needs additional constant memory, and –

Time and memory consumption Each recursive call – needs additional constant memory, and – the size of input decreases by factor 2 Formula for time (# comparisons): T(n) T(n/2) + 4 Additional memory size: log n = O(log n) In other words: recursive algorithm must store a path from the root to the searched leaf, and it has logarithmic length Lectures on Recursive Algorithms 9

Case q = 1, f(n) = c T(n) T(n/2) + c , T(2) c

Case q = 1, f(n) = c T(n) T(n/2) + c , T(2) c T(n) T(n/2) + c (T((n/2)/2) + c = T(n/4) + 2 c (T((n/4)/2) + c) + 2 c = T(n/8) + 3 c … (T(n/2 i) + c) + (i-1). c = T(n/2 i) + i. c … (T(n/2 log n – 1) + c) + (log n - 2). c = T(n/(n/2)) + (log n - 2). c = T(2) + (log n – 1). c c + (log n – 1). c = c log n Lectures on Recursive Algorithms 10

Case q = 1, f(n) = c formal analysis T(n) T(n/2) + c T(2)

Case q = 1, f(n) = c formal analysis T(n) T(n/2) + c T(2) c Solution: T(n) c log n Proof: by induction. – For n = 2 straightforward: T(2) c c log 2 – Suppose T(n/2) c log (n/2); then T(n) T(n/2) + c c log n - c + c c log n Lectures on Recursive Algorithms 11

Example 2: Finding a majority in time O(n) Lectures on Recursive Algorithms 12

Example 2: Finding a majority in time O(n) Lectures on Recursive Algorithms 12

Finding a majority • Input: array A of n elements • Problem: find if

Finding a majority • Input: array A of n elements • Problem: find if there is an element x that occurs in array A more than n/2 times (a majority value) • Solution: Algorithm Majority(A) returning a majority element in A, if it exists, or null otherwise Lectures on Recursive Algorithms 13

Finding a majority: algorithm Algorithm Majority(A[1. . n]): If |A| = 0 then output

Finding a majority: algorithm Algorithm Majority(A[1. . n]): If |A| = 0 then output null, else if |A| = 1 then output A[1] ; else: • If n = |A| is odd then – check whether A[n] is a majority in A by counting the number of occurrences of value A[n]; if yes then output A[n], otherwise decrease n by one • • • Initialize additional array B of size |A|/2 Set j to 0 For i = 1, 2, …, n/2 do: – if A[2 i-1] = A[2 i] then • • increase j by one set B[j] to A[2 i] Find if there is a majority in B[1. . j] by executing Majority(B[1. . j]) If a majority value x in B[1. . j] is returned then check whether x is a majority in A, by going through array A and counting the number of occurrences of value x in A; if successful output x; otherwise null Lectures on Recursive Algorithms 14

Tree structure of finding majority root 2 2 2 1 odd, no majority height

Tree structure of finding majority root 2 2 2 1 odd, no majority height = 2 2 2 1 1 8 leaves Lectures on Recursive Algorithms 15

Correctness of Majority Algorithm Lemma: (to be proved later) If x is a majority

Correctness of Majority Algorithm Lemma: (to be proved later) If x is a majority in A then x is a majority in B. Correctness: (proof by induction, based on the Lemma) Inductive assumption: – Majority(B) works correctly for all B’s of size smaller than |A| Case 1: There is a majority in A. – Then it is a majority in B, by Lemma, therefore it will be found in the recursive stage of the algorithm Majority(B) (by inductive argument), and will be checked (successfully) in the last point of the algorithm Majority(A). Case 2: There is no majority in A. – Then even if there is a candidate value found in the recursive stage Majority(B) of the algorithm, it will be rejected anyway in the last point of the algorithm Majority(A). Lectures on Recursive Algorithms 16

Proof of the Lemma Suppose, to the contrary, that x is a majority in

Proof of the Lemma Suppose, to the contrary, that x is a majority in A but is not a majority in B. Let • m be the number of occurrences of x in A, and • k be the number of occurrences of x in B. It follows that other values appear at least k times in B. Consequently, the other values appear in A: • at least 2 k times: those pairs that are represented in B by a value different than x; plus • m-2 k times: each occurrence of x in A that is not paired by another x (there are k pairs xx in A, thus m-2 k of other occurrences of x in A) is paired by some other value (different than x) which gives at least 2 k+(m-2 k) = m occurrences in total, and contradicts the fact that x is a majority in A. Contradiction! Therefore x must be also a majority in B. Lectures on Recursive Algorithms 17

Memory consumption Each call to the recursive procedure • needs additional memory of half

Memory consumption Each call to the recursive procedure • needs additional memory of half of the current input size • reduces the size of the input by factor at least two Formula for time (# comparisons): T(n) n -1 + 3 n/2 + T(n/2) + n = T(n/2) + 7 n/2 Additional memory used (for smaller arrays B): M(n) n/2 + M(n/2) n/2 + n/4 + M(n/4) n/2 + n/4 + … + 1 = n - 1 = O(n) Intuition about memory size: the algorithm needs a binary tree of linear size, in the worst case, to store “winners” Lectures on Recursive Algorithms 18

Case q = 1, f(n) = cn T(n) T(n/2) + cn , T(1) c

Case q = 1, f(n) = cn T(n) T(n/2) + cn , T(1) c T(n) T(n/2) + cn (T(n/4) + cn/2) + cn = T(n/4) + (3/2)cn (T(n/8) + cn/4) + (3/2)cn = T(n/8) + (7/4)cn … (T(n/2 i) + cn/2 i-1) + ((2 i-1 -1)/2 i-2). cn = T(n/2 i) + ((2 i-1)/2 i-1). cn … T(n/2 log n) + ((2 log n-1)/2 log n-1). cn = T(1) + (n – 1)/(n/2). cn c + 2 cn – 2 c 2 cn Lectures on Recursive Algorithms 19

Case q = 1, f(n) = c n formal analysis T(n) T(n/2) + c

Case q = 1, f(n) = c n formal analysis T(n) T(n/2) + c n T(1) c Solution: T(n) 2 c n Proof: by induction. – For n = 1 straightforward: T(1) c 2 c 1 – Suppose T(n/2) 2 c (n/2); then T(n) T(n/2) + c n 2 c (n/2) + c n = 2 c n Lectures on Recursive Algorithms 20

Textbook and exercises READING: Chapter 5, up to Section 5. 2 EXERCISES: • Solve

Textbook and exercises READING: Chapter 5, up to Section 5. 2 EXERCISES: • Solve the following recursive formulas the best you can: – – T(n) T(n/2) + 4 T(n) T(n/2) + 5 n T(n) T(n/2) + 3 n 2 T(n) (3/2) T(n/2) + 1 Compare and sort the obtained (asymptotic) formulas according to the asymptotic order. For each of them try to find an algorithmic example with performance giving by this formula. • Sort the following formulas according to big/small Oh order: log (n 1/2), log (9 n), log (n 3), 2 log n, 23 log n, 2 log (9 n), n 2, n log n • Propose and analyse an algorithm for checking if there is a majority in a given sorted array A. Lectures on Recursive Algorithms 21

Example 3: Sorting in time O(n log n) Lectures on Recursive Algorithms 22

Example 3: Sorting in time O(n log n) Lectures on Recursive Algorithms 22

Divide and conquer Algorithm: • Partition input into two halves in (at most) linear

Divide and conquer Algorithm: • Partition input into two halves in (at most) linear time • Solve the problem for each half of the input separately • Merge solutions into one in (at most) linear time Recursive formula on time complexity: T(n) 2 T(n/2) + c n Lectures on Recursive Algorithms 23

Case q = 2, f(n) = cn T(n) 2 T(n/2) + cn , T(2)

Case q = 2, f(n) = cn T(n) 2 T(n/2) + cn , T(2) c T(n) 2 T(n/2) + cn 2(2 T(n/4) + cn/2) + cn = 4 T(n/4) + 2 cn 4(2 T(n/8) + cn/4) + 2 cn = 8 T(n/8) + 3 cn … 2 i-1 (2 T(n/2 i) + cn/2 i-1) + (i-1). cn = 2 i T(n/2 i) + i. cn … 2 log n - 1 T(2) + (log n - 1). cn log n Lectures on Recursive Algorithms 24

Solution for time formal analysis T(n) 2 T(n/2) + c n T(2) c Solution:

Solution for time formal analysis T(n) 2 T(n/2) + c n T(2) c Solution: T(n) c n log n Proof: by induction. – For n = 2 straightforward: T(2) c c 2 log 2 – Suppose T(n/2) c (n/2) log (n/2); then T(n) 2 T(n/2) + c n 2 c (n/2) log (n/2) + c n = 2 c (n/2) log n – 2 c n/2 + c n = c n log n Lectures on Recursive Algorithms 25

Example - sorting by merging • Input: list L of n numbers • Problem:

Example - sorting by merging • Input: list L of n numbers • Problem: Sort list L • Algorithm Merge. Sort(L): If L has at most two elements then sort them by comparison and exit; Else: – Split L into two lists prefix and suffix, each of size n/2 – Sort, by merging, the prefix and the suffix separately: Merge. Sort(prefix) and Merge. Sort(suffix) – Merge sorted prefix with sorted suffix as follows: • Initialize final list as empty • Repeat until either prefix or suffix is empty: – Compare the first elements on the lists prefix and suffix and then move the smaller one to the end of the final list • Concatenate final list with the remaining non-empty list (prefix or suffix) Lectures on Recursive Algorithms 26

Merge. Sort procedure 1 2 4 7 8 3 5 6 9 10 1

Merge. Sort procedure 1 2 4 7 8 3 5 6 9 10 1 2 3 4 5 6 7 Lectures on Recursive Algorithms 8 9 10 27

Tree structure of sorting root merging operations height = 3 2 4 1 7

Tree structure of sorting root merging operations height = 3 2 4 1 7 8 3 6 5 8 leaves Lectures on Recursive Algorithms 28

Time and memory consumption Each recursive call – needs additional constant memory, and –

Time and memory consumption Each recursive call – needs additional constant memory, and – the number of inputs increases by factor 2, and – the size of each input decreases by factor 2 Formula for time (# comparisons and list operations): T(n) n/2 + 2 T(n/2) + 2 n = 2 T(n/2) + 5 n/2 Additional memory size: M(n) 2 M(n/2) + 2 4 M(n/4) + 4 + 2 8 M(n/8) + 8 + 4 + 2 … n/2 +n/4 + … + 4 + 2 = n - 2 = O(n) In other words: divide and conquer algorithm must store the binary tree of pointers during the computation Lectures on Recursive Algorithms 29

Quick sort - algorithmic scheme Generic Quick Sort: • Select one element x from

Quick sort - algorithmic scheme Generic Quick Sort: • Select one element x from the input • Partition the input into part containing elements not greater than x and part containing elements bigger than x • Sort each part separately • Concatenate these two sorted parts Problem: how to choose element x to balance the sizes of these two parts? (to get the similar recursive equations as for Merge. Sort) Lectures on Recursive Algorithms 30

Why parts should be balanced? Suppose they are not balanced, but, say, the smallest

Why parts should be balanced? Suppose they are not balanced, but, say, the smallest element is chosen: T(n) T(1) + T(n-1) + c n T(1) c Solution: T(n) (c/2). n 2 Proof: by induction. – For n = 1 straightforward: T(1) c (c/2). 12 – Suppose T(n-1) (c/2). (n-1)2; then T(n) T(n-1) + cn (c/2). (n-1)2 + c (n+1) (c/2). (n-1)2 + (c/2). 2 n (c/2). n 2 Lectures on Recursive Algorithms 31

Two approaches to be fast Deterministic: • Additional tree structure of size O(n) needed

Two approaches to be fast Deterministic: • Additional tree structure of size O(n) needed for identifying the median value • Time: O(n log n), additional memory O(n) Randomized: • Select separation element x uniformly at random • Time (expected): O(n log n), additional memory O(n) Lectures on Recursive Algorithms 32

Randomized approach - analysis Let T(n) denote the expected time: sum of all possible

Randomized approach - analysis Let T(n) denote the expected time: sum of all possible values of time weighted by the probabilities of these values T(n) 1/n ([T(n-1)+T(1)] + [T(n-2)+T(2)] + … +[T(0)+T(n)]) + cn T(0) = T(1) = 1, T(2) c Solution: T(n) d n log n, for some constant d 8 c Proof: by induction. – For n = 2 straightforward: T(2) c d. 2. log 2 – Suppose T(m) d m log m, for every m < n; then (1 -1/n) T(n) (2/n) (T(0) + … + T(n-1)) + c n (2 d/n) (1 + 2 log 2 + … + (n-1)log(n-1)) + c n (2 d/n) ((n 2/2) log n – n 2/8) + c n d n log n - d (n/4) + c n d n log n - d n/2 T(n) (n/(n-1)) (d n log n - d n/2) d n log n Lectures on Recursive Algorithms 33

Tree structure of random execution root 1 6 5 7 height = 5 3

Tree structure of random execution root 1 6 5 7 height = 5 3 2 4 1 2 3 4 5 6 7 8 8 leaves Lectures on Recursive Algorithms 34

Textbook and Exercises READING: Section 5. 3 EXERCISES: • Solved exercises 1, 2 from

Textbook and Exercises READING: Section 5. 3 EXERCISES: • Solved exercises 1, 2 from the textbook, chapter 5 “Divide and Conquer” • Prove that 1 + 2 log 2 + … + (n-1)log(n-1) n(n/2) log(4 n) , and (n/(n-1)) (d n log n - d n/2) d n log n , for n > 16 Lectures on Recursive Algorithms 35

Example 4: Finding closest pair in time O(n log n) Lectures on Recursive Algorithms

Example 4: Finding closest pair in time O(n log n) Lectures on Recursive Algorithms 36

Problem Closest pair of points: • Input: n points on the plane • Output:

Problem Closest pair of points: • Input: n points on the plane • Output: two points having the closest distance Remarks: • Exhaustive search algorithm gives time O(n 2) • Similar approach gives ALL pairs of closest points Lectures on Recursive Algorithms 37

Solution Preprocessing: • Sort points according to the first coordinate (obtain horizontal list H)

Solution Preprocessing: • Sort points according to the first coordinate (obtain horizontal list H) and according to the second coordinate (obtain vertical list V) Main Algorithm: • Partition input list H into halves (H 1, H 2) in linear time (draw vertical line L containing medium point); split list V accordingly into V 1, V 2, where Vi contains the same elements as Hi and inherits the initial order from V (for i = 1, 2) • Solve the problem for each half of the input separately, obtaining two pairs of closest points; let be the smallest distance from the obtained ones • Find if there is a pair which has distance smaller than - if yes then find the smallest distance pair Lectures on Recursive Algorithms 38

Main difficulty Find if there is a pair which has distance smaller than :

Main difficulty Find if there is a pair which has distance smaller than : if yes then find the smallest distance pair How to do it in linear time? L Lectures on Recursive Algorithms 39

Closest pair in the strip 1. Find a sub-list P of V containing all

Closest pair in the strip 1. Find a sub-list P of V containing all points with a distance at most from the line L: – go through V and remove elements not –close to L 2. For each element in P check its distance to the next 8 elements and remember the closest pair ever found P Lectures on Recursive Algorithms L 40

Why to check only 8 points ahead? 1. Partition the strip into squares of

Why to check only 8 points ahead? 1. Partition the strip into squares of length /2 each, as shown in the picture 2. Each square contains at most 1 point - by definition of 3. If there is at least 2 full squares between points then they can not be the closest points 4. There at most 8 squares to check /2 /2 Lectures on Recursive Algorithms /2 /2 /2 L 41

Time complexity Preprocessing: sorting in time O(n log n) Main Algorithm: recursion in time

Time complexity Preprocessing: sorting in time O(n log n) Main Algorithm: recursion in time O(n log n) T(n) 3 n/2 + 2 T(n/2) + 8 n = 2 T(n/2) + 9. 5 n T(2) = 1 Lectures on Recursive Algorithms 42

Example 5: Integer multiplication in time Lectures on Recursive Algorithms 1. 59 O(n )

Example 5: Integer multiplication in time Lectures on Recursive Algorithms 1. 59 O(n ) 43

Beyond (n log n) : integer’s multiplication • Input: two integers x and y,

Beyond (n log n) : integer’s multiplication • Input: two integers x and y, each consisting of at most n bits • Output: multiply these integers • Naïve Algorithm: – Multiply each bit i of x by y, add (i-1) zeroes at the end – Add the obtained at most n numbers • Time: (n 2) of bit operations Lectures on Recursive Algorithms 44

Divide and Conquer approach Let x = x 1 + 2 n/2 x 2

Divide and Conquer approach Let x = x 1 + 2 n/2 x 2 and y = y 1 + 2 n/2 y 2 Let p = x 1 + x 2 and q = y 1 + y 2. Then x y = (x 1 + 2 n/2 x 2) (y 1 + 2 n/2 y 2) = x 1 y 1 + 2 n/2 (x 2 y 1+ x 1 y 2) + 2 n x 2 y 2 = x 1 y 1 + 2 n/2 (pq - x 1 y 1 - x 2 y 2) + 2 n x 2 y 2 Algorithm: – Compute p and q (in linear time) – Compute recursively: x 1 y 1 , x 2 y 2 , pq – Perform x 1 y 1 + 2 n/2 (pq - x 1 y 1 - x 2 y 2) + 2 n x 2 y 2 in linear time Lectures on Recursive Algorithms 45

Time complexity T(n) 3 T(n/2) + c n T(1) c Solution: T(n) d nlog

Time complexity T(n) 3 T(n/2) + c n T(1) c Solution: T(n) d nlog 3, for some constant d 3 c Proof: For n = 1 straightforward: T(1) c d nlog 3 General case: T(n) 3 T(n/2) + c n 9 T(n/4) + 3 c(n/2) + c n … c n (1 + 3/2 + (3/2)2 + … + (3/2)log n) c n 2(3/2)log n + 1 3 c n nlog 3 - 1 = 3 c nlog 3 d nlog 3 = O(n 1. 59) Lectures on Recursive Algorithms 46

Textbook and Exercises READING: Chapter 5 from Section 5. 4 OBLIGATORY EXERCISES: • How

Textbook and Exercises READING: Chapter 5 from Section 5. 4 OBLIGATORY EXERCISES: • How to add two n-bit integers in linear number of bit operations? ADDITIONAL EXERCISES: • Solved Exercise 1 from the textbook, chapter 5 “Divide and Conquer” • Exercises 1, 6, 7 from the textbook, chapter 5 “Divide and Conquer” Lectures on Recursive Algorithms 47

Conclusions • Searching in time O(log n) • Finding majority value in time O(n)

Conclusions • Searching in time O(log n) • Finding majority value in time O(n) • Divide and conquer in time O(n log n) – sorting algorithms (Merge. Sort, Rand. Quick. Sort) – closest points algorithm • Beyond (n log n) : O(nlog 3) time algorithm for integer multiplication Lectures on Recursive Algorithms 48