Advances in Pattern Databases - PowerPoint PPT Presentation

About This Presentation
Title:

Advances in Pattern Databases

Description:

1. Advances in Pattern Databases. Ariel Felner, Ben-Gurion University. Israel ... sorts all generated nodes in an OPEN-LIST and chooses the node with the best ... – PowerPoint PPT presentation

Number of Views:190
Avg rating:3.0/5.0
Slides: 58
Provided by: iseB8
Category:

less

Transcript and Presenter's Notes

Title: Advances in Pattern Databases


1
Advances in Pattern Databases
  • Ariel Felner,
  • Ben-Gurion University
  • Israel
  • email felner_at_bgu.ac.il

2
Overview
  • Heuristic search and pattern databases
  • Disjoint pattern databases
  • Compressed pattern databases
  • Dual lookups in pattern databases
  • Current and future work

3
optimal path search algorithms
  • For small graphs provided explicitly, algorithm
    such as Dijkstras shortest path, Bellman-Ford or
    Floyd-Warshal. Complexity O(n2).
  • For very large graphs , which are implicitly
    defined, the A algorithm which is a best-first
    search algorithm.

4
Best-first search schema
  • sorts all generated nodes in an OPEN-LIST and
    chooses the node with the best cost value for
    expansion.
  • generate(x) insert x into OPEN_LIST.
  • expand(x) delete x from OPEN_LIST and generate
    its children.
  • BFS depends on its cost (heuristic) function.
    Different functions cause BFS to expand different
    nodes..

20
25
30
35
30
35
35
40
Open-List
5
Best-first search Cost functions
  • g(x) Real distance from the initial state to x
  • h(x) The estimated remained distance from x to
    the goal state.
  • ExamplesAir distance
  • Manhattan Dinstance
  • Different cost combinations of g and h
  • f(x)level(x) Breadth-First Search.
  • f(x)g(x) Dijkstras algorithms.
  • f(x)h(x) Pure Heuristic Search (PHS).
  • f(x)g(x)h(x) The A algorithm (1968).

6
A (and IDA)
  • A is a best-first search algorithm that uses
    f(n)g(n)h(n) as its cost function.
  • f(x) in A is an estimation of the shortest path
    to the goal via x.
  • A is admissible, complete and optimally
    effective. Pearl 84
  • Result any other optimal search algorithm will
    expand at least all the nodes expanded by A

Breadth First Search
A
7
Domains
  • 15 puzzle
  • 1013 states
  • First solved by Korf 85 with IDA and Manhattan
    distance
  • Takes 53 seconds
  • 24 puzzle
  • 1024 states
  • First solved by Korf 96
  • Takes 2 days

8
Domains
  • Rubiks cube
  • 1019 states
  • First solved by Korf 97
  • Takes 2 days to solve

9
(n,k) Top Spin Puzzle
  • n tokens arranged in a ring
  • States any possible permutation of the tokens
  • Operators Any k consecutive tokens can be
    reversed
  • The (17,4) version has 1013 states
  • The (20,4) version has 1018 states

10
4-peg Towers of Hanoi (TOH4)
  • There is a conjecture about the length of optimal
    path but it was not proven.
  • Size 4k

11
How to improve search?
  • Enhanced algorithms
  • Perimeter-search Delinberg and Nilson 95
  • RBFS Korf 93
  • Frontier-search Korf and Zang 2003
  • Breadth-first heuristic search Zhou and Hansen
    04
  • ? They all try to better explore the search tree.
  • Better heuristics more parts of the search tree
    will be pruned.

12
Better heuristics
  • In the 3rd Millennium we have very large
    memories.
  • We can build large tables.
  • For enhanced algorithms large open-lists or
    transposition tables. They store nodes
    explicitly.
  • A more intelligent way is to store general
    knowledge. We can do this with heuristics

13
Subproblems-Abstractions
  • Many problems can be abstracted into subproblems
    that must be also solved.
  • A solution to the subproblem is a lower bound on
    the entire problem.
  • Example Rubiks cube Korf 97
  • Problem ? 3x3x3 Rubiks cube
  • Subproblem ? 2x2x2 Corner cubies.

14
Pattern Databases heuristics
  • A pattern database Culbreson Schaeffer 96 is
    a lookup table that stores solutions to all
    configurations of the subproblem / abstraction /
    pattern.
  • This table is used as a heuristic during the
    search.
  • Example Rubiks cube.
  • Has 1019 States.
  • The corner cubies subproblem has 88 Million
    states
  • A table with 88 Million entries fits in memory
    Korf 97.

Search space
Pattern space
Mapping/Projection
15
Non-additive pattern databases
  • Fringe pattern database Culberson Schaeffer
    1996.
  • Has only 259 Million states.
  • Improvement of a factor of 100 over Manhattan
    Distance


16
Example - 15 puzzle
  • How many moves do we need to move tiles 2,3,6,7
    from locations 8,12,13,14 to their goal locations
  • The solution to this is located in
  • PDB812131418

17
Disjoint Additive PDBs (DADB)
  • If you have many PDBS, take their maximum
  • Values of disjoint databases can be added and are
    still admissible Korf Felner AIJ-02,
  • Felner, Korf
    Hanan JAIR-04
  • Additivity can be applied if the cost of a
    subproblem is composed from costs of objects from
    corresponding pattern only

18
DADBTile puzzles
5-5-5
6-6-3
7-8
6-6-6-6
Korf, AAAI 2005
Memory Time Nodes Value Heuristic Puzzle
3-tera-bytes 28 days 1013 Breadth-FS 15
0 53.424 401,189,630 36.942 Manhattan 15
3,145 0.541 3,090,405 41.562 5-5-5 15
33,554 0.163 617,555 42.924 6-6-3 15
576,575 0.034 36,710 45.632 7-8 15
242,000 2 days 360,892,479,671 6-6-6-6 24
19
Heuristics for the TOH
  • Infinite peg heuristic (INP) Each disk moves to
    its own temporary peg.
  • Additive pattern databases
  • Felner, Korf Hanan, JAIR-04

20
Additive PDBS for TOH4
  • Partition the disks into disjoint sets
  • Store the cost of the complete pattern space of
    each set in a pattern database.
  • Add values from these PDBs for the heuristic
    value.
  • The n-disk problem contains 4n states
  • The largest database that we stored was of 14
    disks which needed 414256MB.

6
10
21
TOH4 results
16 disks
seconds Nodes Avg h h(s) solution Heuristic
memory full Infinite peg
48 134,653,232 75.78 102 161 Static 13-3
14 36,479,151 89.10 114 161 Static 14-2
21 12,872,732 95.52 114 161 Dynamic 14-2
17 disks
2,501 238,561,590 97.05 116 183 Dynamic 14-3
  • The difference between static and dynamic is
    covered in Felner, Korf Hanan JAIR-04

22
Best Usage of Memory
  • Given 1 giga byte of memory, how do we best use
    it with pattern databases?
  • Holte, Newton, Felner, Meshulam and Furcy,
    ICAPS-2004 showed that it is better to use many
    small databases and take their maximum instead of
    one large database.
  • We will present a different (orthogonal) method
    Felner, Mushlam Holte AAAI-04.

23
Compressing pattern database Felner et al
AAAI-04
  • Traditionally, each configuration of the pattern
    had a unique entry in the PDB.
  • Our main claim ?
  • Nearby entries in PDBs are highly correlated
    !!
  • We propose to compress nearby entries by storing
    their minimum in one entry.
  • We show that ?
  • most of the knowledge is preserved
  • Consequences Memory is saved, larger patterns
    can be used ? speedup in search is obtained.

24
Cliques in the pattern space
  • The values in a PDB for a clique are d or d1
  • In permutation puzzles cliques exist when only
    one object moves to another location.

d
G
d1
d
  • Usually they have nearby entries in the PDB
  • A44444

A clique in TOH4
25
Compressing cliques
  • Assume a clique of size K with values d or d1
  • Store only one entry (instead of K) for the
    clique with the minimum d. Lose at most 1.
  • A44444 A44441
  • Instead of 4p we need only 4(p-1) entries.
  • This can be generalized to a set of nodes with
    diameter D. (for cliques D1)
  • A44444 A44411
  • In general compressing by k disks reduces memory
    requirements from 4p to 4(p-k)

26
TOH4 results 16 disks (142)
Mem MB Time Nodes D Avg H H(s) PDB
256 14.34 36,479,151 0 87.03 116 14/0 2
64 14.69 37,964,227 1 86.48 115 14/1 2
16 15.41 40,055,436 3 85.67 113 14/2 2
4 16.94 44,996,743 5 84.44 111 14/3 2
1 17.36 45,808,328 9 82.73 107 14/4 2
0.256 23.78 61,132,726 13 80.84 103 14/5 2
  • Memory was reduced by a factor of 1000!!! at a
    cost of only a factor of 2 in the search effort.


27
TOH4 larger versions
Memory was reduced by a factor of 1000!!! At a
cost of only a factor of 2 in the search
effort. Lossless compressing is noe efficient in
this domain.
Mem Time Nodes Avg H Type PDB size
256 gt421 gt393,887,923 81.5 static 14/0 3 17
256 2,501 238,561,590 87.0 dynamic 14/0 3 17
256 83 155,737,832 103.7 static 15/1 2 17
256 7 17,293,603 123.8 static 16/2 1 17
256 463 380,117,836 123.8 static 16/2 2 18
  • For the 17 disks problem a speed up of 3 orders
    of magnitude is obtained!!!
  • The 18 disks problem can be solved in 5 minutes!!

28
Tile Puzzles
Goal State
Clique
  • Storing PDBs for the tile puzzle
  • (Simple mapping) A multi dimensional array ?
  • A1616161616 size1.04Mb
  • (Packed mapping) One dimensional array ?
    A1615141312 size 0.52Mb.
  • Time versus memory tradeoff !!

29
15 puzzle results
  • A clique in the tile puzzle is of size 2.
  • We compressed the last index by two ?
  • A161616168

Avg H Mem Time Nodes compress Type PDB
44.75 576,575 0.081 136,288 No packed 1 7-8
45.63 576,575 0.034 36,710 No packed 1 7-8
43.64 57,657 0.232 464,977 No packed 1 7-7-1
43.64 536,870 0.058 464,977 No simple 1 7-7-1
43.02 268,435 0.069 565,881 Yes simple 1 7-7-1
43.98 536,870 0.021 147,336 Yes simple 2 7-7-1
44.92 536,870 0.016 66,692 Yes simple 2 7-7-1

30
  • Dual lookups in pattern databases Felner et al,
    IJCAI-04

31
Symmetries in PDBs
  • Symmetric lookups were already performed by the
    first PDB paper of Culberson Schaeffer 96
  • examples
  • Tile puzzles reflect the tiles
  • about the main diagonal.
  • Rubiks cube rotate the cube
  • We can take the maximum among the different
    lookups
  • These are all geometrical symmetries
  • We suggest a new type of symmetry!!

7
8
8
7
32
Regular and dual representation
  • Regular representation of a problem
  • Variables objects (tiles, cubies etc,)
  • Values locations
  • Dual representation
  • Variables locations
  • Values objects

33
Regular vs. Dual lookups in PDBs
  • Regular question
  • Where are tiles 2,3,6,7 and how many moves
    are needed to gather them to their goal
    locations?
  • Dual question
  • Who are the tiles in locations 2,3,6,7 and
    how many moves
  • are needed to distribute them to their goal
    locations?

34
Regular and dual lookups
  • Regular lookup PDB8,12,13,14
  • Dual lookup PDB9,5,12,15

35
Regular and dual in TopSpin
  • Regular lookup for C PDB1,2,3,7,6
  • Dual lookup for C PDB1,2,3,8,9

36
Dual lookups
  • Dual lookups are possible when there is a
    symmetry between locations and objects
  • Each object is in only one location and each
    location occupies only one object.
  • Good examples TopSpin, Rubiks cube
  • Bad example Towers of Hanoi
  • Problematic example Tile Puzzles

37
Inconsistency of Dual lookups
Consistency of heuristics h(a)-h(b) lt
c(a,b)
  • Example Top-Spin
  • c(b,c)1
  • Both lookups for B
  • PDB1,2,3,4,50
  • Regular lookup for C PDB1,2,3,7,61
  • Dual lookup for C PDB1,2,3,8,92

Regular Dual
b 0 0
c 1 2
38
Traditional Pathmax
  • children inherit f-value from their parents if
    it makes them larger

g1 h4 f5
Inconsistency
g2 h2 f4
g2 h3 f5
Pathmax
39
Bidirectional pathmax (BPMX)
h-values
h-values
2
4
BPMX
5
1
5
3
  • Bidirectional pathmax h-values are propagated in
    both directions decreasing by 1 in each edge.
  • If the IDA threshold is 2 then with BPMX the
    right child will not even be generated!!

40
Results (17,4) TopSpin puzzle
regular dual BPMX nodes time
1 0 ---- 40,019,429 67.76
0 1 no 7,618,805 15.72
0 1 yes 1,397,614 2.93
4 4 yes 82,606 0.94
17 17 yes 27,575 1.34
  • Nodes improvement (17r17d) 1451
  • Time improvement (4r4d) 72
  • We also solved the (20,4) TopSpin version.

41
Results Rubiks cube
  • Data on 1000 states with 14 random moves
  • PDB of 7-edges cubies

regular dual BPMX nodes time
1 0 ---- 90,930,662 28.18
0 1 no 19,653,386 7.38
0 1 yes 8,315,116 3.24
4 4 yes 615,563 0.51
24 24 yes 362,927 0.90
  • Nodes improvement (24r24d) 250
  • Time improvement (4r4d) 55

42
Results Rubiks cube
  • With duals we improved Korfs results on random
    instances by a factor of 1.5 using exactly the
    same PDBs.

43
Results tile puzzles
Heuristic BPMX Value nodes time
Manhattan ---- 36.94 401,189,630 53.424
R ---- 44.75 136,289 0.081
RR ---- 45.63 36,710 0.034
RRDD yes 46.12 18,601 0.022
  • With duals, the time for the 24 puzzle drops
    from 2 days to 1 day.

44
Discussion
  • Results for the TopSpin and Rubiks cube are
    better than those of the tile puzzles
  • Dual PDB lookups and BPMX cutoffs are more
    effective if each operators changes larger part
    of the states.
  • This is because the identity of the objects being
    queried in consecutive states are dramatically
    changed

45
Summary
  • Dual PDB lookups
  • BPMX cutoffs for inconsistent heuristics
  • State of the art solvers.

46
Future work
  • More compression
  • Duality in search spaces
  • Which and how many symmetries to use
  • Other sources of inconsistencies
  • Better ways for propagating inconsistencies

47
Ongoing and future work compressing PDBs
  • An item for the PDB of tiles (a,b,c,d) is in the
    form ltLa, Lb, Lc, Ldgtd
  • Store the PDBs in a Trie
  • A PDB of 5 tiles will have a level in the trie
    for each tile. The values will be in the leaves
    of the trie.
  • This data-structure will enable flexibility and
    will save memory as subtrees of the trie can be
    pruned

48
Trie pruninig
Simple (lossless) pruning Fold leaves with
exactly the same values.
No data will be lost.
2
2
2
2
2
49
Trie pruninig
  • Intelligent (lossy)pruning
  • Fold leaves/subtrees with are correlated to each
    other (many option for this!!)
  • Some data will be lost.
  • Admissibility is still kept.

2
2
2
2
4
50
Trie Initial Results
A 5-5-5 partitioning stored in a trie with simple
folding
Mem Nodes/sec Time Nodes H(s) MD PDB
3,145,728 5,150,676 0.6 3,090,405 41.56 36.94 Simple
1,572,480 988,613 3.126 3,090,405 41.56 36.94 Packed
765,778 1,191,826 2.593 3,090,405 41.56 36.94 Trie
51
Neural Networks (NN)
  • We can feed a PDB into a neural network engine.
    Especially, Addition above MD
  • For each tile we focus on its dx and dy from its
    goal position. (i.e. MD)
  • Linear conflict
  • dx1 dx2 0
  • dy1 gt dy21
  • A NN can learn
  • these rules

2
1
dy1 2 dy20
52
Neural network
  • We train the NN by feeding the entire (or part of
    the) pattern space.
  • For example for a pattern of 5 tiles we have 10
    features, 2 for each tile.
  • During the search, given the locations of the
    tiles we look them up in the NN.

53
Neural network example
dx4
Layout for the pattern of the tiles 4, 5 and 6
dy4
dx5
4
dy5
dx6
dy6
54
Neural Network problems
  • We face the problem of overestimating and will
    have to bias the results towards underestimating.
  • We keep the overestimating values in a separate
    hash table
  • Results are encouraging!!

Mem Time Nodes H(s) PDB
1,572,480 0.49 243,290 31.00 Regular
33,611d472w 69.75 454,262 29.67 Neural Network
55
Ongoing and Future WorkDuality
  • Definition 1 of a dual state
  • For a state S we flip the roles of variables and
    objects
  • A vector lt3,1,4,2gt
  • Regular state S 3, 1 , 4 , 2
  • Dual state Sd 2, 4 , 1 , 4

56
Future of Duality
  • S ? O ? G
  • G ? O ? Sd
  • Sd ? O-1 ? G

57
Workshop
  • You are all welcome to the workshop on
  • Heuristic Search, Memory-based Heuristics and
    Their application
  • To be held in AAAI-06
  • See www.ise.bgu.ac.il/faculty/felner
Write a Comment
User Comments (0)
About PowerShow.com