Tirgul 11 - PowerPoint PPT Presentation

About This Presentation
Title:

Tirgul 11

Description:

... SUCCESSOR once for each node, we go down each right edge once, at most ... For each time we go down an edge, we have to go back up that edge, and vice versa ... – PowerPoint PPT presentation

Number of Views:23
Avg rating:3.0/5.0
Slides: 25
Provided by: doronf
Category:
Tags: tirgul

less

Transcript and Presenter's Notes

Title: Tirgul 11


1
Tirgul 11
  • Solutions for questions from T2, T3
  • DFS BFS - reminder
  • Some Hashing

Reminder dont forget to run dast/bin/testEx3.cs
h on your jar file before submitting ex3!!!
2
T2 Question 3
  • Q Prove that there is no comparison-based sort
    whose running time is linear for at least half of
    the n! inputs of length n.
  • A A comparison based sort can be described by a
    decision tree
  • Each node is a comparison
  • Each edge is an outcome of a comparison
  • The running time is the depth of the correct leaf.

3
T2 Question 3
  • A tree of height h has less than 2h1 nodes
  • If there are n!/2 leaves at depth cn, then
  • 2cn1 n!/2 ? cn 1 log(n!) - 1
  • ? n ?(n logn) ? contradiction!
  • What about 1/n of the input? And 1/2n of the
    input?
  • 2cn1 n!/n ? cn 1 log((n-1)!) ?(n logn)
  • 2cn1 n!/2n ? cn 1 log(n!) - n ?(n
    logn)
  • ? Both impossible as well

4
T3 Question 1
  • Q Suppose that another data structure contains a
    pointer to a node y in a binary search tree, and
    suppose that ys predecessor z is deleted from
    the tree by the procedure TREE-DELETE. What
    problem can arise? How can TREE-DELETE be
    rewritten to solve the problem?

delete
z
y
external reference
5
T3 Question 1
  • If z has two children, then the deletion of z
    consists of
  • Replacing the contents of z with the contents of
    the successor of z
  • Deleting the successor of z
  • In this case, we will delete the node where y
    used to be.
  • So the reference to y will become invalid, even
    though we did not delete y itself

y
?
external reference
6
T3 Question 1
  • Solution instead of copying the contents of y
    into z, place the node y in the tree where z used
    to be, after deleting it
  • yz.successor
  • make y.right the child of y.parent instead of y
  • connect y to zs parent and children instead of z

y
external reference
7
T3 Question 5
  • An in-order tree walk can be implemented by
    finding the minimum element and then making n-1
    calls to TREE-SUCCESSOR
  • Q How many times at most do we pass through each
    edge?

8
T3 Question 5
  • TREE-SUCCESSOR(x)
  • if x.rightnull
  • yx.parent
  • while y!null xy.right
  • xy
  • yy.parent
  • else
  • yx.right
  • while y.left!null
  • yy.left
  • return y

going up (1)
going up (2)
going down (3)
going down (4)
9
T3 Question 5
  • Right edges
  • A right edge n?n.right is passed downwards only
    at (3), which happens when we call
    TREE-SUCCESSOR(n)
  • Since we call TREE-SUCCESSOR once for each node,
    we go down each right edge once, at most
  • Left edges
  • After we pass a left edge n?n.left (at (1) or
    (2)), TREE-SUCCESSOR returns n
  • Since TREE-SUCCESSOR returns each node once, we
    go up each left edge once, at most
  • For each time we go down an edge, we have to go
    back up that edge, and vice versa
  • Therefore, we pass each edge at most twice
  • In-order walk takes O(n) steps

10
T4 Question 2
  • Q You are in a square maze of n?n cells and
    youve got loads of coins in your pocket. How do
    you get out?
  • A The maze is a graph where
  • Each cell is a node
  • Each passage between cells is an edge
  • Solve the maze by running DFS until the exit is
    found

11
DFS - Reminder
DFS-VISIT(u) u.colorgray u.dtime for
each v?adju if v.colorwhite
v.prevu DFS-VISIT(v) u.colorblack
u.ftime
DFS(G) for each u?VG u.colorwhite
u.prevnil time0 for each u?VG if
u.colorwhite DFS-VISIT(u)
12
BFS - Reminder
BFS(G,s) for each u?VG u.dist8
s.dist0 Q.enqueue(s) while Q not empty
uQ.dequeue() for each v?adju if
v.dist8 v.distu.dist1
v.prevu Q.enqueue(v)
13
T4 Question 2
  • A white node is a cell without any coins
  • A gray node is a cell with a coin lying with its
    head side up
  • A black node is a cell with a coin lying with its
    tail side up
  • An edge connecting a node to its parent is marked
    by a coin
  • When visiting a cell, we color it gray
  • If it has a white cell adjacent to it visit it
  • If there are no such cells,
  • Color the cell black by flipping the coin
  • backtrack by going to the cell marked as parent

14
T4 Question 2
  • Each node has one parent
  • When backtracking, the parent will be the only
    adjacent gray cell that has a coin leading to
    it
  • Can we solve it using BFS?
  • Answer No! In DFS we go between adjacent cells
    in BFS, the nodes are in a queue, so the next
    cell could be anywhere

15
Dynamic Hash Tables
  • Since performance depends on the load factor, we
    would like to expand the table when the load
    factor is too high
  • Expansion involves allocating a new table and
    rehashing the keys into it from the old table
  • Most insertions would be O(1), but some
    insertions will cause the table to expand, which
    would take O(n)
  • So, table operations take O(n) in the worst case,
    but what is the average time for a table
    operation in the worst case? What is the worst
    case for n operations? We use amortized analysis
    to answer these questions

16
Example - Stack with multi-pop
  • As a first example we analyze the performance of
    a stack with one additional operation -
    multipop(k)- that pops the top k elements of the
    stack.
  • Since multipop can take O(n) time in the
    worst-case, we might conclude that n stack
    operations can take O(n2) in the worst case.
  • We will analyze this simple D.S. using two
    methods
  • The aggregate method
  • The accounting method
  • and see that the average time per operation is
    still O(1).

17
The aggregate method
  • In this method we find T(n), the total time to
    perform n operations in the worst-case. The
    amortized cost of each operation is then defined
    as T(n)/n.
  • In our stack example, T(n) O(n) If we
    performed a total of k push operations, then the
    total time for the pop and multipop operations is
    also at most k (the number of elements we can pop
    is at most the number of elements we pushed).
    Since k is at most n, then T(n)O(n).
  • Thus the amortized cost for all operations is
    O(1).

18
The accounting method
  • In this method, we receive money for each
    operation.
  • We pay for the actual cost of the operation
  • With whats left we may pay for future
    operations.
  • The total cost for n operations is the total
    money we got, since with it we covered the
    entire actual cost.
  • The amortized cost of each operation is the money
    we got for this operation.
  • In our stack example, we define the following
  • The amortized cost (the money we get) for a
    push operation is 2, and for the pop and multipop
    operation is 0.
  • When pushing an item, we pay 2 1 goes to the
    push operation, and 1 is left to its credit this
    will pay for its pop or multipop.

19
The accounting method (continued)
Operation
Actual cost
Average (amortized) cost
push
1
2
pop
1
0
multipop
k
0
  • So we see that the average cost of each operation
    is constant (in contrast to the actual cost). In
    other words, since the total payment is at most
    2n, the total time is at most O(n), and the
    average time is per operation is O(1).

20
Dynamic Tables
  • A Dynamic table is an array that expands
    dynamically when it becomes overloaded, to fit
    itself to a variable demand. What is it good for?
  • Javas Vector
  • Heaps
  • Hash tables
  • In a dynamic table, besides the regular insert
    and delete, there is also expansion when the
    table is overloaded and we want to insert a new
    element.

21
Expansion
  • Consider the following expansion scheme
  • Table starts with a size of 1.
  • Upon an insert, if the table is full, create a
    table twice the old size and copy the old table
    to the beginning of the new one.
  • Actual cost 1 for regular insertion, size(T) for
    expansion.

22
Expansion - Aggregate
  • Expansion doubles the table size
  • Therefore, for every i, after inserting the 2i1
    item, a new table of size 2i1 will be allocated
    and 2i values will be copied
  • So, the number of operations required for n
    insertions is
  • Therefore, the amortized cost of each insertion
    is O(1).

23
Expansion - Accounting
  • Amortized cost, the accounting method
  • Suppose we pay 3 for every regular insertion 1
    for the actual cost, and 2 remains as credit for
    this element.
  • How do we pay for an expansion? Suppose the table
    is doubled from size x to 2x. Then we have x/2
    elements that didnt pay for an expansion yet
    (have credit of 2). Thus each one of them pays
    for itself and for one other element out of the
    x/2 with credit 0.

24
Expansion Hash Tables
  • When the load factor is too high, we would like
    to expand the table
  • Since the hash codes depend on the table size, we
    need to rehash compute the new hash code for
    every item in the old table and use it to insert
    it to the new table
  • If the maximal load factor is constant, the
    amortized cost of insertion remains O(1)
Write a Comment
User Comments (0)
About PowerShow.com