Handling%20non-determinism%20and%20incompleteness - PowerPoint PPT Presentation

About This Presentation

Title:

Handling%20non-determinism%20and%20incompleteness

Description:

Handling non-determinism and incompleteness. Problems, ... Paths to Perdition. Complexity of finding probability 1.0 success plans. Conformant Planning ... – PowerPoint PPT presentation

Number of Views:16

Avg rating:3.0/5.0

Slides: 22

Provided by: min63

Learn more at: https://rakaposhi.eas.asu.edu

Category:

more less

Transcript and Presenter's Notes

Title: Handling%20non-determinism%20and%20incompleteness

1
Handling non-determinism and incompleteness
2
Problems, Solutions, Success Measures3
orthogonal dimensions

Conformant Plans Dont lookjust do
Sequences
Contingent/Conditional Plans Look, and based on
what you see, Do look again
Directed acyclic graphs
Policies If in (belief) state S, do action a
(belief) state?action tables

Incompleteness in the initial state
Un (partial) observability of states
Non-deterministic actions
Uncertainty in state or effects
Complex reward functions (allowing degrees of
satisfaction)

Deterministic Success Must reach goal-state with
probability 1
Probabilistic Success Must succeed with
probability gt k (0ltklt1)
Maximal Expected Reward Maximize the expected
reward (an optimization problem)

3
11/8
4
Some specific cases

1.0 success conformant planning for domains with
incomplete initial states
1.0 success conformant planning for domains with
non-deterministic actions
1.0 success conditional plans for fully
observable domains with incompletely specified
init states, and deterministic actions
1.0 success conditional plans for fully
observable domains with non-deterministic actions
1.0 success conditional plans for parially
observable domains with non-deterministic actions

Probabilistic variants of all the ones on the
left (where we want success probability to be gt
k).

5
Paths to Perdition
Complexity of finding probability 1.0 success
plans
6
Conformant Planning

Given an incomplete initial state, and a goal
state, find a sequence of actions that when
executed in any of the states consistent with the
initial state, takes you to a goal state.
Belief State is a set of states 2S
I as well as G are belief states
(in classical planning, we already support
partial goal state)
Issues
Representation of Belief States
Generalizing progression, regression etc to
belief states
Generating effective heuristics for estimating
reachability in the space of belief states

7
Action Applicability Issue

Action applicability issue (what if a belief
state has 100 states and an action is applicable
to 90 of them?)
Consider actions that are always applicable in
any state, but can leave many states unchanged.
This involves modeling actions without
executability preconditions (they can have
conditional effects). This ensures that the
action is applicable everywhere

8
Generality of Belief State Rep
9
State Uncertainty and Actions

The size of a belief state B is the number of
states in it.
For a world with k fluents, the size of a belief
state can be between 1 (no uncertainty) and 2k
(complete uncertainty).
Actions applied to a belief state can both
increase and reduce the size of a belief state
A non-deterministic action applied to a singleton
belief state will lead to a larger (more
uncertain) belief state
A deterministic action applied to a belief state
can reduce its uncertainty
E.g. B(pen-standing-on-table) (pen-on-ground)
Action A is sweep the table. Effect is
B(pen-on-ground)
Often, a good heuristic in solving problems with
large belief-state uncertainty is to do actions
that reduce uncertainty
E.g. when you are blind-folded and left in the
middle of a room, you try to reach the wall and
then follow it to the door. Reaching the wall is
a way of reducing your positional uncertainty

10
Progression and Regression with Belief States

Given a belief state B, and an action a,
progression of B over a is defined as long as a
is applicable in every state s in B
Progress(B,a) ? progress(s,a) s in B
Given a belief state B, and an action a,
regression of B over a is defined as long as a is
regressable from every state s in B.
Regress(B,a) ? regress(s,a) s in B
Non-deterministic actions complicate regression.
Suppose an action a, when applied to state s can
take us to s1 or s2 non-deterministically. Then,
what is the regression of s1 over a?
Strong and Weak pre-images We consider B to be
the strong pre-image of B w.r.t action a, if
Progress(B,a) is equal to B. We consider B to
be a weak pre-image if Progress(B,a) is a
superset of B

11
(No Transcript)
12
Belief State Search

Planning problem initial belief state BI and
goal state BG and a set of actions ai the
objective is to find a sequence of actions
a1ak that when executed in the initial belief
state takes the agent to some state in BG
The plan is strong if every execution leads to a
state in BG probability of success is 1
The plan is weak if some of the executions lead
to a state in BG probability of success gt 0
If we have stochastic actions, we can also talk
about the degree of strength of the plan 0 lt
p lt 1
We will focus on STRONG plans
Search Start with the initial belief state, BI
and do progression or regression until you find a
belief state B s.t. B is a subset of BG

13
Representing Belief States
14
Belief State Rep (cont)

Belief space planners have to search in the space
of full propositional formulas!!
In contrast, classical state-space planners
search in the space of interpretations (since
states for classical planning were
interpretations).
Several headaches
Progression/Regression will have to be done over
all states consistent with the formula (could be
exponential number).
Checking for repeated search states will now
involve checking the equivalence of logical
formulas (aaugh..!)
To handle this problem, we have to convert the
belief states into some canonical representation.
We already know the CNF and DNF representations.
There is another one, called Ordered Binary
Decision Diagrams that is both canonical and
compact
OBDD can be thought of as a compact
representation of the DNF version of the logical
formula

15
Doing Progression/Regresssion Efficiently

Progression/Regression will have to be done over
all states consistent with the formula (could be
exponential number).
One way of handling this is to restrict the type
of uncertainty allowed. For example, we may
insist that every fluent must either be true,
false or unknown. This will give us just the
space of conjunctive logical formulas (only 3n
space).
Flip side is that we may not be able to represent
all forms of uncertainty (e.g. how do we say that
either P or Q is true in the initial state?)
Another idea is to directly manipulate the
logical formulas during progression/regression
(without expanding them into states)
Tricky connected to Symbolic model checking

16
Effective representations of logical formulas

Checking for repeated search states will now
involve checking the equivalence of logical
formulas (aaugh..!)
To handle this problem, we have to convert the
belief states into some canonical representation.
We already know the CNF and DNF representations.
These are normal forms but are not canonical
Same formula may have multiple equivalent CNF/DNF
representations
There is another one, called Reduced Ordered
Binary Decision Diagrams that is both canonical
and compact
ROBDD can be thought of as a compact
representation of the DNF version of the logical
formula

17
(No Transcript)
18
Symbolic model checking The birds eye view

Belief states can be represented as logical
formulas (and implemented as BDDs )
Transition functions can be represented as
2-stage logical formulas (and implemented as
BDDs)
The operation of progressing a belief state
through a transition function can be done
entirely (and efficiently) in terms of operations
on BDDs

Read Appendix C before next class (emphasize
C.5 C.6)
19
Conformant Planning Efficiency Issues

Graphplan (CGP) and SAT-compilation approaches
have also been tried for conformant planning
Idea is to make plan in one world, and try to
extend it as needed to make it work in other
worlds
Planning graph based heuristics for conformant
planning have been investigated.
Interesting issues involving multiple planning
graphs
Deriving Heuristics? relaxed plans that work in
multiple graphs
Compact representation? Label graphs