randomized algorithms csc 4520/6520 design & analysis of algorithms fall 2013 slides adopted...

22
Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej Ciesielski, University of Massachusetts

Upload: margaretmargaret-sutton

Post on 05-Jan-2016

219 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

Randomized Algorithms

CSc 4520/6520

Design & Analysis of Algorithms

Fall 2013

Slides adopted from Dmitri Kaznachey, George Mason University

and Maciej Ciesielski, University of Massachusetts

Page 2: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

The Hiring Problem

• The goal is to hire a new assistant through an employment agency.– The agency sends one candidate each day.

– The commitment is to have the best person to do the job.

– When the interviewed person is better than the current assistant, he/she is hired in place of the current one.

– There is a small cost to pay for the interview.

– There is usually a larger cost associated with the fire/hire process.

Page 3: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

The Hiring Problem: Algorithm

Hire-Assistant (n)1 best = 0 // candidate 0 is least qualified2 for i = 1 to n3 <interview candidate i>4 if i is better than best5 best = i6 <hire candidate i>

Assume interviewing has cost ci, whereas more expensive hiring has cost ch. Let m be the number of people hired. Then the cost of the above algorithm is:

O(nci + mch)

The quantity m varies with each run and determines the overall cost of the algorithm. It is estimated using probabilistic analysis.

Page 4: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

Indicator Random Variables

Assume sample space S and an event A. The indicator random variable I{A} is defined as

1, if A occursI{A} = 0, otherwise

Given a sample space S and an event A, denote XA a random variable associated with an event being A, i.e. XA = I{A}. The the expected value of XA is:

E[XA] = Pr{A}

Proof.

E[XA] = E[I{A}] = 1Pr{A} + 0Pr{A} = Pr{A}

Page 5: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

The Hiring Problem: Analysis

Let Xi be the indicator random variable associated with the event that the candidate i is hired:

Xi = I{candidate i is hired}

Let X be the random variable whose value equals the number of time we hire a new candidate:

X = X1 + ... + Xn

Note that E[Xi] = Pr{Xi} = Pr{candidate i is hired}.

We now need to compute Pr{candidate i is hired}.

Page 6: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

The Hiring Problem: Analysis (cont.)

Candidate i is hired (line 5) when it is better than any of the previous (i-1) candidates. Since all candidates arrive in random order, each of them have the same probability of being the best so far. Therefore:

E[Xi] = Pr{Xi} = 1/i

We can now compute E[X]:

E[X] = E[X1 + ... + Xn] = 1 + ½ + ... + 1/n = ln(n) + O(1)

Hence, when candidates are presented in random order, the algorithm Hire-Assistant has a total hiring cost:

O(ch ln(n))

Page 7: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

Randomized Algorithms

In a randomized algorithm the distribution of inputs is imposed. In particular, in the randomized version of the Hire-Assistant algorithm we randomly permute the candidates:

Randomized-Hire-Assistant (n)1 <randomly permute the list of candidates>2 best = 0 // candidate 0 is least qualified3 for i = 1 to n4 <interview candidate i>5 if i is better than best6 best = i7 <hire candidate i>

According to the earlier computations, the expected cost of the above algorithm is O(nci+chln(n)).

Page 8: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

Randomized Version

Want to make running time independent of input ordering.

Randomized-Partition(A, p, r)i := Random(p, r);A[r] A[i];Partition(A, p, r)

Randomized-Partition(A, p, r)i := Random(p, r);A[r] A[i];Partition(A, p, r)

Randomized-Quicksort(A, p, r)if p < r then

q := Randomized-Partition(A, p, r);Randomized-Quicksort(A, p, q – 1);Randomized-Quicksort(A, q + 1, r)

fi

Randomized-Quicksort(A, p, r)if p < r then

q := Randomized-Partition(A, p, r);Randomized-Quicksort(A, p, q – 1);Randomized-Quicksort(A, q + 1, r)

fi

Page 9: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

Quick-Sort

7 4 9 6 2 2 4 6 7 9

4 2 2 4 7 9 7 9

2 2 9 9

Page 10: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

Quick-SortQuick-sort is a randomized sorting algorithm based on the divide-and-conquer paradigm:

Divide: pick a random element x (called pivot) and partition S into

L elements less than x E elements equal x G elements greater than

x Recur: sort L and G Conquer: join L, E and G

x

x

L GE

x

Page 11: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

PartitionWe partition an input sequence as follows:

We remove, in turn, each element y from S and

We insert y into L, E or G, depending on the result of the comparison with the pivot x

Each insertion and removal is at the beginning or at the end of a sequence, and hence takes O(1) timeThus, the partition step of quick-sort takes O(n) time

Algorithm partition(S, p)Input sequence S, position p of pivot Output subsequences L, E, G of the

elements of S less than, equal to,or greater than the pivot, resp.

L, E, G empty sequencesx S.remove(p) while S.isEmpty()

y S.remove(S.first())if y < x

L.insertLast(y)else if y = x

E.insertLast(y)else { y > x }

G.insertLast(y)return L, E, G

Page 12: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

Quick-Sort TreeAn execution of quick-sort is depicted by a binary tree

Each node represents a recursive call of quick-sort and stores Unsorted sequence before the execution and its pivot Sorted sequence at the end of the execution

The root is the initial call The leaves are calls on subsequences of size 0 or 1

7 4 9 6 2 2 4 6 7 9

4 2 2 4 7 9 7 9

2 2 9 9

Page 13: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

Execution Example

Pivot selection

7 2 9 4 2 4 7 9

2 2

7 2 9 4 3 7 6 1 1 2 3 4 6 7 8 9

3 8 6 1 1 3 8 6

3 3 8 89 4 4 9

9 9 4 4

Page 14: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

Execution Example (cont.)Partition, recursive call, pivot selection

2 4 3 1 2 4 7 9

9 4 4 9

9 9 4 4

7 2 9 4 3 7 6 1 1 2 3 4 6 7 8 9

3 8 6 1 1 3 8 6

3 3 8 82 2

Page 15: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

Execution Example (cont.)

Partition, recursive call, base case

2 4 3 1 2 4 7

1 1 9 4 4 9

9 9 4 4

7 2 9 4 3 7 6 1 1 2 3 4 6 7 8 9

3 8 6 1 1 3 8 6

3 3 8 8

Page 16: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

Execution Example (cont.)

Recursive call, …, base case, join

3 8 6 1 1 3 8 6

3 3 8 8

7 2 9 4 3 7 6 1 1 2 3 4 6 7 8 9

2 4 3 1 1 2 3 4

1 1 4 3 3 4

9 9 4 4

Page 17: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

Execution Example (cont.)

Recursive call, pivot selection

7 9 7 1 1 3 8 6

8 8

7 2 9 4 3 7 6 1 1 2 3 4 6 7 8 9

2 4 3 1 1 2 3 4

1 1 4 3 3 4

9 9 4 4

9 9

Page 18: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

Execution Example (cont.)Partition, …, recursive call, base case

7 9 7 1 1 3 8 6

8 8

7 2 9 4 3 7 6 1 1 2 3 4 6 7 8 9

2 4 3 1 1 2 3 4

1 1 4 3 3 4

9 9 4 4

9 9

Page 19: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

Execution Example (cont.)

Join, join

7 9 7 17 7 9

8 8

7 2 9 4 3 7 6 1 1 2 3 4 6 7 7 9

2 4 3 1 1 2 3 4

1 1 4 3 3 4

9 9 4 4

9 9

Page 20: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

Worst-case Running TimeThe worst case for quick-sort occurs when the pivot is the unique minimum or maximum elementOne of L and G has size n 1 and the other has size 0The running time is proportional to the sum

n (n 1) … 2 Thus, the worst-case running time of quick-sort is O(n2)

depth time

0 n

1 n 1

… …

n 1 1

Page 21: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

Expected Running Time

7 9 7 1 1

7 2 9 4 3 7 6 1 9

2 4 3 1 7 2 9 4 3 7 61

7 2 9 4 3 7 6 1

Good call Bad call

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16

Good pivotsBad pivots Bad pivots

Page 22: Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej

Expected Running Time, Part 2

Probabilistic Fact: The expected number of coin tosses required in order to get k heads is 2kFor a node of depth i, we expect

i2 ancestors are good calls The size of the input sequence for the current call is at most

(34)i2ns(r)

s(a) s(b)

s(c) s(d) s(f)s(e)

time per levelexpected height

O(log n)

O(n)

O(n)

O(n)

total expected time: O(n log n)

Therefore, we have For a node of depth 2log43n,

the expected input size is one

The expected height of the quick-sort tree is O(log n)

The amount or work done at the nodes of the same depth is O(n)Thus, the expected running time of quick-sort is O(n log n)