Silvia Rossi!

(1)

Lezione n. !

Corso di Laurea: ! Insegnamento: !

Email: !

A.A. 2014-2015 !

Silvia Rossi !

Planning

Lezione n. 4

Informatica !

Insegnamento:

Sistemi !

multi-agente !

Email:

silrossi@unina.it !

(2)

Agent Planning !

(R&N:11.1,11.2)

Additional reading:

Malik Ghallab, Dana Nau, Paolo Traverso, "Automated Task Planning. Theory & Practice", Morgan Kaufmann, (2004).

2

(3)

Block words !

• !A set of blocks, a table and a robot arm.

• !All the blocks are equal in size, form and color, distinguished only by name.

• !The table has an unlimited extension.

• !A block can be on the table, on top of another block or held by the robot.

• !The robot arm can only hold one block at a time.

• !Solve problems by changing the initial configuration (state) into a state that meets the goals.

3

(4)

Planning vs. problem solving !

• ! Planning and problem solving methods can often solve the same sorts of problems

• ! Planning is more powerful because of the representations and methods used

• ! States, goals, and actions are decomposed into sets of sentences (usually in first-order logic)

• ! Search often proceeds through plan space

rather than state space (though there are also state-space planners)

• ! Subgoals can be planned independently,

reducing the complexity of the planning

problem

(5)

• ! Choose actions to achieve a certain goal

• ! But isn’t it exactly the same goal as for problem solving?

• ! Some difficulties with problem solving:

– ! The successor function is a black box: it must be

“applied” to a state to know which actions are possible in that state and what are the effects of each one

Goal of Planning !

• ! Suppose that the goal is HAVE(MILK).

• ! From some initial state where HAVE(MILK) is not satisfied, the successor function must be repeatedly applied to eventually generate a state where

HAVE(MILK) is satisfied.

• ! An explicit representation of the possible actions and their effects would help the problem solver

select the relevant actions

(6)

Otherwise, in the real world an agent would

be overwhelmed by irrelevant actions

(7)

Goal of Planning !

•! Choose actions to achieve a certain goal

•! But isn’t it exactly the same goal as for problem solving?

•! Some difficulties with problem solving:

– ! The goal test is another black-box function, states are domain-specific data structures, and heuristics must be supplied for each new problem

Suppose that the goal is HAVE(MILK)!HAVE(BOOK) Without an explicit representation of the goal, the problem solver cannot know that a state where

HAVE(MILK) is already achieved is more promising than a state where neither HAVE(MILK) nor

HAVE(BOOK) is achieved

(8)

Goal of Planning !

• ! Choose actions to achieve a certain goal

• ! But isn’t it exactly the same goal as for problem solving?

• ! Some difficulties with problem solving:

– ! The goal may consist of several nearly

independent subgoals, but there is no way for the problem solver to know it

HAVE(MILK)and HAVE(BOOK) may be achieved by

two nearly independent sequences of actions

(9)

Planning opens up the black- boxes by using logic to

represent:

– !Actions – !States – !Goals

Problem solving Logic representation

Planning

(10)

STRIPS (Fikes and Nilsson 1971) !

Stanford Research Institute Problem Solver ! Control System for the Shakey robot !

http://www.ai.sri.com/shakey/ !

(11)

Knowledge Representation in Strips !

A state is represented as a conjunction of instantiated predicates !

at(object1,airport1), at(plane1,airport1), !

at(truck1,airport1), at(truck2,post-o "ce2), ...!

Normally, it is assumed that the facts not present in a state are

false: closed world assumption. !

(12)

States and Goals in Block world !

The following predicates can be used: !

!on(x,y): block x is on top of block y!

!on-table(x): block x is on the table!

!clear(x): block x has no block above it and is not being held by the robot arm !

!holding(x): the robot arm is holding block x!

!arm-empty: the robot arm is not holding a block!

(13)

Basic representations for planning !

Classic approach first used in STRIPS planner circa 1970

States represented as a conjunction of ground literals

at(Home)

Goals are conjunctions of literals, but may have existentially quantified variables

at(x) ^ have(Milk) ^ have(bananas) ...

Do not need to fully specify state

Non-specified either don’t-care or assumed false Represent many cases in small storage

Often only represent changes in state rather than entire situation

Unlike theorem prover, not seeking whether the goal is

true, but is there a sequence of actions to attain it

(14)

Operator/action representation !

Operators contain three components:

Action description

Precondition - conjunction of positive literals

Effect - conjunction of positive or negative literals which describe how situation changes when operator is applied

Example:

Op[Action: Go(there),

Precond: At(here) ^ Path(here,there), Effect: At(there) ^ ~At(here)]

All variables are universally quantified

Situation variables are implicit

preconditions must be true in the state immediately before operator is

applied; effects are true immediately after

(15)

Blocks world !

The blocks world is a micro-world that consists of a table, a set of blocks and a robot hand.

Some domain constraints:

Only one block can be on another block Any number of blocks can be on the table The hand can only hold one block

Typical representation:

ontable(a) ontable(c) on(b,a)

handempty clear(b) clear(c)

A B

C

TABLE

(16)

State Representation !

Conjunction of propositions:

BLOCK(A), BLOCK(B), BLOCK(C),

ON(A,TABLE), ON(B,TABLE), ON(C,A), CLEAR(B), CLEAR(C), HANDEMPTY

A B C

TABLE

(17)

Goal Representation !

A B C

Conjunction of propositions:

ON(A,TABLE), ON(B,A), ON(C,B)

The goal G is achieved in a state S if all

the propositions in G are also in S

(18)

Action Representation !

Unstack(x,y)

• ! P =HANDEMPTY, BLOCK(x), BLOCK(y), CLEAR(x), ON(x,y)

• !E =¬HANDEMPTY, ¬CLEAR(x), HOLDING(x),

¬ON(x,y), CLEAR(y)

Precondition: Effect: list of literals conjunction of propositions

Means: Remove HANDEMPTY

from state Means: Add

HOLDING(x) to state

(19)

Example !

A B C

Unstack(C,A)

• ! P =HANDEMPTY, BLOCK(C), BLOCK(A), CLEAR(C), ON(C,A)

• !E =¬HANDEMPTY, ¬CLEAR(C), HOLDING(C),

¬ON(C,A), CLEAR(A)

BLOCK(A), BLOCK(B), BLOCK(C),

ON(A,TABLE), ON(B,TABLE), ON(C,A),

CLEAR(B), CLEAR(C), HANDEMPTY

(20)

Example !

A B

C BLOCK(A), BLOCK(B), BLOCK(C),

ON(A,TABLE), ON(B,TABLE), ON(C,A), CLEAR(B), CLEAR(C), HANDEMPTY,

HOLDING(C), CLEAR(A) Unstack(C,A)

• ! P =HANDEMPTY, BLOCK(C), BLOCK(A), CLEAR(C), ON(C,A)

• !E =¬HANDEMPTY, ¬CLEAR(C), HOLDING(C),

¬ON(C,A), CLEAR(A)

(21)

Action Representation !

Unstack(x,y)

• ! P = HANDEMPTY, BLOCK(x), BLOCK(y), CLEAR(x), ON(x,y)

• !E = ¬HANDEMPTY, ¬CLEAR(x), HOLDING(x), ¬ ON(x,y), CLEAR(y)

Stack(x,y)

• ! P = HOLDING(x), BLOCK(x), BLOCK(y), CLEAR(y)

• ! E = ON(x,y), ¬CLEAR(y), ¬HOLDING(x), CLEAR(x), HANDEMPTY

PutDown(x)

• ! P = HOLDING(x)

• ! E = ON(x,TABLE), ¬HOLDING(x), CLEAR(x), HANDEMPTY

Pickup(x)

• ! P = HANDEMPTY, BLOCK(x), CLEAR(x), ON(x,TABLE)

• ! E = ¬HANDEMPTY, ¬CLEAR(x), HOLDING(x), ¬ ON(x,TABLE)

(22)

Typical BW planning problem !

Initial state:

clear(a) clear(b) clear(c) ontable(a) ontable(b) ontable(c) handempty

Goal:

on(b,c) on(a,b) ontable(c)

A C B

A B C

A plan:

pickup(b)

stack(b,c)

pickup(a)

stack(a,b)

(23)

Another BW planning problem !

Initial state:

clear(a) clear(b) clear(c) ontable(a) ontable(b) ontable(c) handempty

Goal:

on(a,b) on(b,c) ontable(c)

A C B

A B C

A plan:

pickup(a) stack(a,b)

unstack(a,b)

putdown(a)

pickup(b)

stack(b,c)

pickup(a)

stack(a,b)

(24)

Forward Planning !

A B C

A B C A C

B

A C B

A

C B

A C B A

B C

A B C

Unstack(C,A)) Pickup(B)

Forward planning searches a state space

In general, many actions are applicable

to a state ! huge branching factor

(25)

Forward Search !

Operators are applied until the goals are reached ProgWS(state, goals, actions, path)

If state satisfies goals, then return path else a = choose(actions), s.t.

preconditions(a) satisfied in state if no such a, then return failure

else return

ProgWS(apply(a, state), goals, actions, concatenate(path, a))

First call: ProgWS(IS, G, Actions, ())

25

(26)

26 " ! Forward planning still suffers from an

excessive branching factor

" ! In general, there are much fewer actions

that are relevant to achieving a goal than actions that are applicable to a state

" ! How to determine which actions are

relevant? How to use them?

" ! ! Backward planning

(27)

Relevant Action ! An action is relevant to a goal if one of its effects matched a goal proposition

Stack(B,A)

P = HOLDING(B), BLOCK(A), CLEAR(A)

E = ON(B,A), ¬CLEAR(A), ¬HOLDING(B), CLEAR(B), HANDEMPTY

is relevant to ON(B,A), ON(C,B)

(28)

Regression of a Goal ! The regression of a goal G through an

action A is the weakest precondition R[G,A] such that:

If a state S satisfies R[G,A] then:

1. !The precondition of A is satisfied in S 2. !Applying A to S yields a state that

satisfies G

(29)

Example !

• ! G = ON(B,A), ON(C,B)

• ! Stack(C,B)

– ! P = HOLDING(C), BLOCK(C), BLOCK(B), CLEAR(B)

– ! E = ON(C,B), ¬CLEAR(B), ¬HOLDING(C), CLEAR(C), HANDEMPTY

• ! R[G,Stack(C,B)] =

ON(B,A), HOLDING(C), BLOCK(C),

BLOCK(B), CLEAR(B)

(30)

Example !

• ! G = CLEAR(B), ON(C,B)

• ! Stack(C,B)

– ! P = HOLDING(C), BLOCK(C), BLOCK(B), CLEAR(B)

– ! E = ON(C,B), ¬CLEAR(B), ¬HOLDING(C), CLEAR(C), HANDEMPTY

• ! R[G,Stack(C,B)] = False

(31)

Computation of R[G,A] !

1. ! If any element of G is deleted by A then return False

2. ! G’ # Precondition of A

3. ! For every element SG of G do

If SG is not added by A then add SG to G’

4. ! Return G’

(32)

Computation of R[G,A] !

1. ! If any element of G is deleted by A then return False

2. ! G’ # Precondition of A

3. ! For every element SG of G do

If SG is not added by A then add SG to G’

4. ! If an inconsistency rule applies to G’ then return False

5. ! Return G’

(33)

Backward Chaining !

ON(B,A), ON(C,B)

Stack(C,B)

ON(B,A), HOLDING(C), CLEAR(B)

A B C

In general, there are much fewer actions relevant to a goal than there are actions

applicable ! smaller branching factor

than with forward planning

(34)

Backward Chaining !

ON(B,A), ON(C,B)

Stack(C,B)

ON(B,A), HOLDING(C), CLEAR(B) A B C

CLEAR(C), ON(C,TABLE), CLEAR(A), HANDEMPTY, CLEAR(B), ON(B,TABLE) CLEAR(C), ON(C,TABLE), HOLDING(B), CLEAR(A)

Stack(B,A)

Pickup(B) Putdown(C)

CLEAR(A), CLEAR(B), ON(B,TABLE), HOLDING(C)

Unstack(C,A)

CLEAR(B), ON(B,TABLE), CLEAR(C), HANDEMPTY, ON(C,A)

Pickup(C)

ON(B,A), CLEAR(B), HANDEMPTY, CLEAR(C), ON(C,TABLE)

Backward planning searches a goal space

(35)

Progression vs. Regression !

Both algorithms are:

• !Sound: the result plan is valid

• !Complete: if valid plan exists, they find one Non-deterministic choice => search!

• !Brute force: DFS, BFS, Iterative Deepening, ..,

• !Heuristic: A, IDA, …

Complexity: O(b ⁿ ) worst-case

b = branching factor, n = |“choose”|

Regression: often smaller b, focused by goals

Progression: full state to compute heuristics ₃₅

(36)

STRIPS !

• !Symbolic representations of knowledge about the problem domain.

• !Goal decomposition.

• !Incorporate relevant operators to the plan even when they can not be applied to the current state.

• !Mixed of regression and progression.

36

(37)

• !Search:

• ! Nodes: current state and a stack of goals-operators

• ! Root node: initial state and goal configuration

• !Idea:

• ! Push onto the stack the goals to achieve and the operators that achieve these goals

• ! Pop from the stack goals that are present in the current state, and operators that are executable

37

(38)

STRIPS planning !

• ! STRIPS maintains two additional data structures:

– ! State List - all currently true predicates.

– ! Goal Stack - a push down stack of goals to be solved, with current goal on top of stack.

• ! If current goal is not satisfied by present state,

examine add lists of operators, and push operator and preconditions list on stack. (Subgoals)

• ! When a current goal is satisfied, POP it from stack.

• ! When an operator is on top stack, record the

application of that operator on the plan sequence

and use the operator’s add and delete lists to update

the current state.

(39)

1. Match: if the objective of the top of the stack

matches with the current state, this objective is

deleted from the stack. !

(40)

2. Decompose: If the goal in the top of the stack is a

conjunction, then add the literal components at the top

of the stack in a certain order. !

(41)

3. Solve: When the goal at the top of the stack is one

unresolved literal, then find an operator whose adds

list contains a literal that matches with the objective. !

(42)

4. Apply: When the object on top of the stack is an operator, then delete it from the stack and

apply it on the current state. !

(43)

The process is repeated until the stack is

empty. !

(44)

• !Can Strips always find a plan if there is one?!

• !When Strips returns a plan, is it always

correct? e "cient?!

(45)

(46)

(47)

(48)

(49)

(50)

(51)

(52)

(53)

(54)

(55)

(56)

(57)

(58)

(59)

(60)

(61)

(62)

(63)

(64)

Example 2 !

(65)

(66)

(67)

Goal interaction !

• ! Simple planning algorithms assume that the goals to be achieved are independent

– !Each can be solved separately and then the solutions concatenated

• ! This planning problem, called the “Sussman Anomaly,” is the classic example of the goal interaction problem:

– ! Solving on(A,B) first (by doing unstack(C,A), stack(A,B) will be undone when solving the second goal on(B,C) (by doing

unstack(A,B), stack(B,C)).

– !Solving on(B,C) first will be undone when solving on(A,B)

• ! Classic STRIPS could not handle this, although minor modifications can get it to do simple cases

A B

C

Initial state

A

B

C

Goal state

(68)

State-space planning !

• ! We initially have a space of situations (where you are, what you have, etc.)

• ! The plan is a solution found by “searching”

through the situations to get to the goal

• ! A progression planner searches forward from initial state to goal state

• ! A regression planner searches backward from the goal

– ! Ideally this leads to reduced branching –you are only

considering things that are relevant to the goal

(69)

Plan-space planning !

An alternative is to search through the space of plans, rather than situations.

Start from a partial plan which is expanded and refined until a complete plan that solves the problem is generated.

Refinement operators add constraints to the partial plan and modification operators for other changes.

We can still use STRIPS-style operators:

Op(ACTION: RightShoe, PRECOND: RightSockOn, EFFECT: RightShoeOn) Op(ACTION: RightSock, EFFECT: RightSockOn)

Op(ACTION: LeftShoe, PRECOND: LeftSockOn, EFFECT: LeftShoeOn) Op(ACTION: LeftSock, EFFECT: leftSockOn)

could result in a partial plan of

[RightShoe, LeftShoe]

(70)

Partial-order planning !

A linear planner builds a plan as a totally ordered sequence of plan steps

A non-linear planner (aka partial-order planner) builds up a plan as a set of steps with some temporal constraints

constraints of the form S1<S2 if step S1 must comes before S2.

One refines a partially ordered plan (POP) by either:

adding a new plan step, or

adding a new constraint to the steps already in the plan.

A POP can be linearized (converted to a totally ordered

plan) by topological sorting

(71)

Least commitment !

• ! Non-linear planners embody the principle of least commitment

– ! only choose actions, orderings, and variable

bindings that are absolutely necessary, leaving other decisions till later

– ! avoids early commitment to decisions that don’t really matter

• ! A linear planner always chooses to add a plan step in a particular place in the

sequence

• ! A non-linear planner chooses to add a step

and possibly some temporal constraints

(72)

Non-linear plan !

A non-linear plan consists of

(1) A set of steps {S

₁

, S

₂

, S

₃

, S

₄

…}

Each step has an operator description, preconditions and post- conditions

(2) A set of causal links { … (S

_i

,C,S

_j

) …}

Meaning a purpose of step S

_i

is to achieve precondition C of step S

_j

(3) A set of ordering constraints { … S

_i

<S

_j

… }

if step S

_i

must come before step S

_j

(73)

Causal Link:

Action Ac (consumer) has precondition Q that is established in the plan by Ap(producer).

73

(74)

Step At threatens link (Ap, Q, Ac) if:

1. At has (not Q) as an effect, and

2. At could come between Ap and Ac, i.e.

O u (Ap< At< Ac) is consistent

74

(75)

Soluzione !

Una soluzione è un piano completo e consistente In un piano completo ogni precondizione di un passo viene raggiunta da qualche altro passo. Un passo

raggiunge una condizione se la condizione è uno degli

effetti del passo e se nessun altro passo può eliminare la condizione.

In un piano consistente non c’è alcuna contraddizione nell’ordinamento o nei vincoli sui legami (se validi

entrambi c’è contraddizione)

(76)

The initial plan !

Every plan starts the same way

S1:Start

S2:Finish

Initial State

Goal State

(77)

Initial Plan !

For uniformity, represent initial state and goal with two special actions: !

Start: !

• ! no preconditions,

• ! initial state as effects,

• ! must be the first step in the plan.

Finish: !

• ! no effects

• ! goals as preconditions

• ! must be the last step in the plan.

(78)

Trivial example !

Operators:

Op(ACTION: RightShoe, PRECOND: RightSockOn, EFFECT:

RightShoeOn)

Op(ACTION: RightSock, EFFECT: RightSockOn)

Op(ACTION: LeftShoe, PRECOND: LeftSockOn, EFFECT: LeftShoeOn) Op(ACTION: LeftSock, EFFECT: leftSockOn)

S1:Start

S2:Finish

RightShoeOn ^ LeftShoeOn

Steps: {S1:[Op(Action:Start)], S2:[Op(Action:Finish, Pre: RightShoeOn^LeftShoeOn)]}

Links: {}

Orderings: {S1<S2}

(79)

Solution !

Start Left

Sock

Right Sock Right

Shoe Left

Shoe

Finish

(80)

Linearizzazioni !

(81)

POP !

POP is sound and complete POP Plan is a solution if:

!All preconditions are supported (by causal links), i.e., no open conditions.

!No threats

!Consistent temporal ordering

By construction, the POP algorithm reaches a solution plan

81

(82)

Example !

Op(ACTION: START, PRECONDITION: nil, EFFECT: AT(HOME) and SELLS(STORE,DRILL) and SELLS(SUPER,MILK) and SELLS(SUPER, BREAD))

Op(ACTION: FINISH, PRECONDITION: HAVE(DRILL) and

HAVE(MILK) and HAVE(BREAD) and AT(HOME), EFFECT: nil)

Op(ACTION: GO(x), PRECONDITION: AT(x), EFFECT: AT(y) and not AT(x))

Op(ACTION: BUY(x), PRECONDITION: AT(SHOP) and SELLS(SHOP,x), EFFECT: HAVE(x))

82

(83)

Initial Plan !

83

(84)

84

(85)

85

(86)

86

(87)

Corretto? !

87

(88)

Piani con condizioni !

88

(89)

Demotion !

89

(90)

Promotion !

90

(91)

91

(92)

92

(93)

Piano Totale !

93

(94)

POP !

94

(95)

Heuristics !

Heuristics for selecting plans:

!Number of steps N

!Unsolved preconditions OP

Often uses analgorithm A * with heuristics:

f(P)=N(P) + OP(P)

95

(96)

Least commitment !

Non-linear planners embody the principle of least commitment

only choose actions, orderings, and variable bindings that are absolutely necessary, leaving other

decisions till later

avoids early commitment to decisions that don’t really matter

A linear planner always chooses to add a plan step in a particular place in the sequence

A non-linear planner chooses to add a step and

possibly some temporal constraints

(97)

Non-linear plan !

A non-linear plan consists of

(1) A set of steps {S

₁

, S

₂

, S

₃

, S

₄

…}

Each step has an operator description, preconditions and post- conditions

(2) A set of causal links { … (S

_i

,C,S

_j

) …}

Meaning a purpose of step S

_i

is to achieve precondition C of step S

_j

(3) A set of ordering constraints { … S

_i

<S

_j

… }

if step S

_i

must come before step S

_j

A non-linear plan is complete iff

Every step mentioned in (2) and (3) is in (1)

If S

_j

has prerequisite C, then there exists a causal link in (2) of the form (S

_i

,C,S

_j

) for some S

_i

If (S

_i

,C,S

_j

) is in (2) and step S

_k

is in (1), and S

_k

threatens

(S

_i

,C,S

_j

) (makes C false), then (3) contains either S

_k

<S

_i

or

S

_j

>S

_k

(98)

Consistent Plan !

Consistent plan:

• ! No cycles in the ordering constraint;

• ! No conflicts with the causal links.

A consistent plan with no open precondition is a solution

98

(99)

1) The initial plan contains Start and Finish

• ! the ordering constraint Start < Finish,

• ! no causal links

• ! all the preconditions in Finish as open preconditions.

99

(100)

2) The successor function arbitrarily picks one open precondition p on an action B and

generates a successor plan for every possible consistent way of choosing an action A that achieves p.

Consistency is enforced as follows:

1. The causal link A ->p B and the ordering constraint A

< B are added to the plan.

Action A may be an existing action in the plan or a new one. If it is new, add it to the plan and also add Start <

A and A < Finish. ₁₀₀

(101)

2) The successor function arbitrarily picks one open precondition p on an action B and

generates a successor plan for every possible consistent way of choosing an action A that achieves p.

2. Resolve conflicts between the new causal link and all existing actions.

A conflict between A ->p B and C is resolved by making C occur at some time outside the protection interval

B < C C < A

We add successor states for either or both if they result

in consistent plans ₁₀₁

(102)

3) The goal test checks whether a plan is a solution to the original planning problem.

Because only consistent plans are generated, the goal test just needs to check that there are no open

preconditions.

102

(103)

Nonlinear Planning !

P: -:

+: ON(A,TABLE) ON(B,TABLE) ON(C,A) CLEAR(B) CLEAR(C) HANDEMPTY

P: ON(B,A) ON(C,B) -:

+:

(104)

P: -:

+: ON(A,TABLE) ON(B,TABLE) ON(C,A) CLEAR(B) CLEAR(C) HANDEMPTY

P: ON(B,A) ON(C,B) -:

+:

P: HOLDING(B) CLEAR(A) -: CLEAR(A) HOLDING(B) +: ON(B,A) CLEAR(B) HANDEMPTY

Stack(B,A)

The plan is incomplete

Open preconditions

(105)

P: -:

+: ON(A,TABLE) ON(B,TABLE) ON(C,A) CLEAR(B) CLEAR(C) HANDEMPTY

P: ON(B,A) ON(C,B) -:

+:

P: HOLDING(B) CLEAR(A) -: CLEAR(A) HOLDING(B) +: ON(B,A) CLEAR(B) HANDEMPTY

Stack(B,A)

P: HOLDING(C) CLEAR(B) -: CLEAR(B) HOLDING(C) +: ON(C,B) CLEAR(C) HANDEMPTY

Stack(C,B)

(106)

P: -:

+: ON(A,TABLE) ON(B,TABLE) ON(C,A) CLEAR(B) CLEAR(C) HANDEMPTY

P: ON(B,A) ON(C,B) -:

+:

Stack(B,A)

Stack(C,B)

Pickup(C)

P: HOLDING(B) CLEAR(A) -: CLEAR(A) HOLDING(B) +: ON(B,A) CLEAR(B) HANDEMPTY

P: HOLDING(C)

CLEAR(B)

-: CLEAR(B)

HOLDING(C)

+: ON(C,B)

CLEAR(C)

HANDEMPTY

(107)

P: -:

+: ON(A,TABLE) ON(B,TABLE) ON(C,A) CLEAR(B) CLEAR(C) HANDEMPTY

P: ON(B,A) ON(C,B) -:

+:

Stack(B,A)

Stack(C,B)

Pickup(C) Pickup(B)

P: CLEAR(B)

Achievers

Threat Other possible

achiever

(108)

P: -:

+: ON(A,TABLE) ON(B,TABLE) ON(C,A) CLEAR(B) CLEAR(C) HANDEMPTY

P: ON(B,A) ON(C,B) -:

+:

Stack(B,A)

Stack(C,B)

Pickup(C) Pickup(B)

P: CLEAR(B)

(109)

P: -:

+: ON(A,TABLE) ON(B,TABLE) ON(C,A) CLEAR(B) CLEAR(C) HANDEMPTY

P: ON(B,A) ON(C,B) -:

+:

Stack(B,A)

Stack(C,B)

Pickup(C) Pickup(B)

Note the similarity with constraint propagation

A consistent plan is one in which there is no cycle and no conflict between achievers and threats

A conflict can be eliminated by constraining the ordering among the actions or by

adding new actions

(110)

P: -:

+: ON(A,TABLE) ON(B,TABLE) ON(C,A) CLEAR(B) CLEAR(C) HANDEMPTY

P: ON(B,A) ON(C,B) -:

+:

Stack(B,A)

Stack(C,B)

Pickup(C) Pickup(B)

P: HANDEMPTY

(111)

P: -:

+: ON(A,TABLE) ON(B,TABLE) ON(C,A) CLEAR(B) CLEAR(C) HANDEMPTY

P: ON(B,A) ON(C,B) -:

+:

Stack(B,A)

Stack(C,B)

Pickup(C) Pickup(B)

P: HANDEMPTY

(112)

P: -:

+: ON(A,TABLE) ON(B,TABLE) ON(C,A) CLEAR(B) CLEAR(C) HANDEMPTY

P: ON(B,A) ON(C,B) -:

+:

Stack(B,A)

Stack(C,B)

Pickup(C) Pickup(B)

P: HANDEMPTY CLEAR(C) ON(C,TABLE)

P: HOLDING(C) CLEAR(B) P: HOLDING(B) CLEAR(A) P: HANDEMPTY

CLEAR(B) ON(B,TABLE)

~ Most-constrained-variable heuristic in CSP

! choose the unachieved precondition that can be satisfied in the fewest number of ways

! ON(C,TABLE)

(113)

P: -:

+: ON(A,TABLE) ON(B,TABLE) ON(C,A) CLEAR(B) CLEAR(C) HANDEMPTY

P: ON(B,A) ON(C,B) -:

+:

Stack(B,A)

Stack(C,B)

Pickup(C) Pickup(B)

Putdown(C)

(114)

P: -:

+: ON(A,TABLE) ON(B,TABLE) ON(C,A) CLEAR(B) CLEAR(C) HANDEMPTY

P: ON(B,A) ON(C,B) -:

+:

Stack(B,A)

Stack(C,B)

Pickup(C) Pickup(B)

Putdown(C)

Unstack(C,A)

(115)

P: -:

+: ON(A,TABLE) ON(B,TABLE) ON(C,A) CLEAR(B) CLEAR(C) HANDEMPTY

P: ON(B,A) ON(C,B) -:

+:

Stack(B,A)

Stack(C,B)

Pickup(C) Pickup(B)

Putdown(C) Unstack(C,A)

P: HANDEMPTY

(116)

P: -:

+: ON(A,TABLE) ON(B,TABLE) ON(C,A) CLEAR(B) CLEAR(C) HANDEMPTY

P: ON(B,A) ON(C,B) -:

+:

Stack(B,A)

Stack(C,B)

Pickup(C) Pickup(B)

Putdown(C)

Unstack(C,A)

(117)

P: -:

+: ON(A,TABLE) ON(B,TABLE) ON(C,A) CLEAR(B) CLEAR(C) HANDEMPTY

P: ON(B,A) ON(C,B) -:

+:

Stack(B,A)

Stack(C,B)

Pickup(C) Pickup(B)

Putdown(C) Unstack(C,A)

P: HANDEMPTY CLEAR(C) ON(C,TABLE)

P: HOLDING(C) CLEAR(B) P: HOLDING(B) CLEAR(A) P: HANDEMPTY

CLEAR(B) ON(B,TABLE)

P: HOLDING(C) P: HANDEMPTY

CLEAR(C) ON(C,A)

The plan is complete because every precondition of every step is added

by some previous step, and no

intermediate step deletes it

(118)

Hierarchical Planning !

(R&N:12.2)

(119)

Hierarchical Planning Brief History !

Originally developed about 25 years ago Knowledge-based ! Scalable

Task Hierarchy is a form of domain-specific knowledge

Practical, applied to real world problems (linear time vs. exponential)

Lack of theoretical understanding until early 1990’s

Formal semantics, sound/complete algorithm,

(120)

Hierarchical Decomposition !

(121)

HTN Planning !

• ! Capture hierarchical structure of the planning domain

• ! Planning domain contains non-primitive actions and schemas for reducing them

• ! Reduction schemas:

– !given by the designer

– !express preferred ways to accomplish a

task

(122)

Hierarchical decomposition !

• !Hierarchical decomposition, or hierarchical task

network (HTN) planning, uses abstract operators to incrementally decompose a planning problem from a high-level goal statement to a primitive plan network

• !Primitive operators represent actions that are executable, and can appear in the final plan

• !Non-primitive operators represent goals

(equivalently, abstract actions) that require further decomposition (or operationalization) to be executed

• !There is no “right” set of primitive actions: One

agent’s goals are another agent’s actions!

(123)

HTN planning: example !

(124)

Hierarchical Decomposition !

From a high-level

action to a partially

ordered set of lower-

level actions.

(125)

HTN Planning !

A plan library contains both primitive and non- primitive actions.

Non-primitive actions have external

preconditions, as well as external effects.

Sometimes useful to distinguish between

primary effects and secondary effects.

(126)

HTN Formalization (1) !

State: list of ground atoms Tasks:

Primitive tasks: do[f(x ₁ , …, x _n )]

Non-primitive tasks:

Goal task: achieve(l) (l is a literal) Compound task: perform[t(x ₁ , …, x _n )]

Plan Library

Operator:

[operator f(x ₁ , …, x _n ) (pre: l ₁ , …, l _n ) (post: l’ ₁ , …, l’ _n )]

Method: Decompose( ! ^{, d)}

" is a non-primitive task and d is a task network

Plan: sequence of ground primitive tasks

(operators)

(127)

Task Reduction !

(128)

Example action descriptions !

Action(BuyLand, PRECOND:money, EFFECT: Land Money)

Action(GetLoan, PRECOND: GoodCredit, EFFECT: Money Mortage) Action(BuildHouse, PRECOND:Land, EFFECT: House)

Action(GetPermit, PRECOND: Land, EFFECT: Permit) Action(HireBuilder, EFFECT: Contract)

Action(Construction, PRECOND: Permit Contract, EFFECT: HouseBuild Permit)

Action(PayBuilder, PRECOND:Money HouseBuilt,

EFFECT: Money House Contract) 128

(129)

Example action descriptions !

Decompose(BuildHouse,

Plan(STEPS:{S1: GetPermit, S2: HireBuilder, S3: Construction, S4: PayBuilder}

ORDERINGS: {Start S1S3 S4 Finish, Start S2 S3,

LINKS: {Start Land S1, Start Money S4, S1 Permit S3, S2 Contract S3, S3 HouseBuilt S4,

S4 House Finish, S4 Money Finish}))

129

(130)

Correctness !

• ! A decomposition should be a correct implementation of the action.

• ! A plan d implements an action a correctly if d is a complete and consistent partial-order plan for the problem of achieving the

effects of a given the preconditions of a (result of a sound POP).

• ! The plan library contains several decompositions for any high- level action.

• ! Each decomposition might have different preconditions and effects. The preconditions of the high-level action should be the intersection of the preconditions of the decompositions (similarly for the external effects.)

130

(131)

Information hiding !

The high-level description hides all the internal effects of decompositions (e.g., Permit and

Contract).

It also hides the duration the internal preconditions and effects hold.

Advantage: reduces complexity by hiding details Disadvantage: conflicts are hidden too

131

(132)

Example !

132

(133)

HTN Planning Algorithm (intuition) !

Problem reduction:

Decompose tasks into subtasks Handle constraints

Resolve interactions

If necessary, backtrack and try other

decompositions

(134)

Basic HTN Procedure !

1. ! Input a planning problem P

2. ! If P contains only primitive tasks, then

resolve the conflicts and return the result. If the conflicts cannot be resolved, return

failure

3. ! Choose a non-primitive task t in P 4. ! Choose an expansion for t

5. ! Replace t with the expansion

6. ! Find interactions among tasks in P and

suggest ways to handle them. Choose one.

7. ! Go to 2

(135)

For each decomposition d of an action a !

Remove the high level action, and insert/reuse actions for each action in d

Reuse -> subtask sharing

Merge the ordering constraints (If there is an ordering constraint of the form B should every step of d come after B?

Merge the causal links

135

(136)

Comments on HTN planning !

The major idea is to gain efficiency by using the library of preconstructed plans.

When there is recursion, it is undecidable even if the underlying state space is finite.

recursion can be ruled out

the length of solutions can be bound

can use a hybrid POP and HTN approach

136

(137)

SHOP (Simple Hierarchical Ordered Planner) !

• ! Domain-independent algorithm for Ordered Task Decomposition

– ! Sound/complete

• ! Input:

– ! State: a set of ground atoms – ! Task List: a linear list of tasks

– ! Domain: methods, operators, axioms

• ! Output: one or more plans, it can return:

– ! the first plan it finds – ! all possible plans

– ! a least-cost plan

– ! all least-cost plans

(138)

• ! Initial task list: ((travel home park))

• ! Initial state: ((at home) (cash 20) (distance home park 8))

• ! Methods (task, preconditions, subtasks):

– ! (:method (travel ?x ?y)

((at ?x) (walking-distance ?x ?y)) ' ((!walk ?x ?y)) 1) – ! (:method (travel ?x ?y)

((at ?x) (have-taxi-fare ?x ?y))

' ((!call-taxi ?x) (!ride ?x ?y) (!pay-driver ?x ?y)) 1)

• ! Axioms:

– ! (:- (walking-dist ?x ?y) ((distance ?x ?y ?d) (eval (<= ?d 5)))) – ! (:- (have-taxi-fare ?x ?y)

((have-cash ?c) (distance ?x ?y ?d) (eval (>= ?c (+ 1.50 ?d))))

• ! Primitive operators (task, delete list, add list) – ! (:operator (!walk ?x ?y) ((at ?x)) ((at ?y)))

– ! …

Simple Example !

Optional cost;

default is 1

(139)

Precond: Precond:

(travel home park)

(!walk home park)

(!call-taxi home) (!ride home park) (!pay-driver home park) Fail (distance > 5)

Succeed (we have $20, and the fare is only $9.50) Succeed

Succeed (at home) (walking-distance

Home park)

(have-taxi-fare home park)

(at park) (cash 10.50) (distance home park 8)

Simple Example (Continued) !

(at home)

Initial state:

Final state:

(at home)

(cash 20)

(distance home park

8)

(140)

The SHOP Algorithm !

procedure SHOP (state S, task-list T, domain D) 1. if T = nil then return nil

2. t

₁

= the first task in T

3. U = the remaining tasks in T

4. if t is primitive & an operator instance o matches t

₁

then 5. P = SHOP (o(S), U, D)

6. if P = FAIL then return FAIL 7. return cons(o,P)

8. else if t is non-primitive

& a method instance m matches t

₁

in S

& m’s preconditions can be inferred from S then 9. return SHOP (S, append (m(t

₁

), U), D)

10. else

11. return FAIL 12. end if

end SHOP

state S; task list T=( t

₁

,t

₂

,…) operator instance o

state o(S) ; task list T=(t

₂

, …)

task list T=( t

₁

,t

₂

,…) method instance m

task list T=( u

₁

,…,u

_k

,t

₂

,…) nondeterministic

choice among all

methods m whose

preconditions can be

inferred from S

(141)

Introduction to Distributed Planning !

(142)

142 Distributed problem solving !

(143)

143 Distributed planning !

What can be distributed:

$ ! The process of coming out with a plan is distributed among agents

$ ! Execution is distributed among agents

(144)

Centralized planning for distributed plans _!

Operators

" !move(b,x,y)

^!

movetotable(b,x)

Precond: on(b,x) ! clear(b) ! clear(y) Precond: on(b,x) ! clear(b)

Postcond: on(b,y) ! clear(x) ! Postcond: on(b,T ) ! clear(x) ! ¬on(b,x)

¬on(b,x) ! ¬clear(y)

144 A

B D

C E

F

Si

C A

E

B F

D

Sf

I'm Bill Agent1 I'm Tom

Agent2

on(A,B) on(C,D) on(E,F) on(B,T) on(D,T) on(F,T)

on(B,A) on(F,D) on(A,E) on(D,C) on(E,T) on(C,T) on(B,A)

S1: move(B,T,A)

on(B,T) clear(B) clear(A) movetotable(A,B) move(A,B,y)

S2: move(A,B,E) clear(A) clear(E) on(A,B)

on(E,T)

S3: movetotable(E,F)

1. Given a goal description, a set of operators,

and an initial state description

generate a partial order plan

(145)

S1: move(B,T,A) To satisfy the preconditions, we have:

S2: move(A,B,E) S2 < S1, S3 < S4 S3:movetotable(E,F) S6 < S4, S6 < S5 S4: move(F,T,D) Also

S5: move(D,T,C) S2 threat to S3 # S3 < S2 S6: movetotable(C,D) S4 threat to S5 # S5 < S4 Then the partial ordering is: S3 < S2 < S1

S6 < S5 < S4

S3 < S4

S3: movetotable(E,F) S2: move(A,B,E) S1: move(B,T,A) S6: movetotable(C,D) S5: move(D,T,C) S4: move(F,T,D)

Any total ordering that satisfies this partial ordering is a good plan for Agent1 What if we have 2 agents?

DECOMP1

Subplan1 S3 < S2 < S1 Subplan2 S6 < S5 < S4 and S3 < S4 Agent1 S3 < send(clear(F)) < S2 < S1

Agent2 S6 < S5 < wait(clear(F)) < S4 145

< <

2. Decompose the plan into

subproblems so as to minimize order relations across plans 3. Insert synchronization

4. Allocate subplans to agents

(146)

S3: movetotable(E,F) S2: move(A,B,E) S1: move(B,T,A) S6: movetotable(C,D) S5: move(D,T,C) S4: move(F,T,D) DECOMP2

Subplan1 S3 < S5 < S4 Subplan2 S6 < S2 < S1

and S3 < S2 and S6 < S5

Agent1 S3 < send(don't_care(E)) < wait(clear(D)) < S5 < S4 Agent2 S6 < wait(don't_care(E)) < wait(clear(D)) < S2 < S1

"!Obviously, DECOMP2 has more order relations among subplans than DECOMP1

"!Therefore, we choose DECOMP1

S3 < send(clear(F)) < S2 < S1 S6 < S5 < wait(clear(F)) < S4

But

then back to DECOMP2

146 then back to DECOMP2

< <

4. If failure to allocate subplans then redo decomposition (2)

If failure to allocate subplans with any decomposition

then redo generate plan (1) 5. Execute and monitor subplans

I know how to move only

D, E, F

I know how to move only

A, B, C

S3 <

S6 < S5 <

(147)

Distributed planning for centralized plans _!

Each of the planning agents generate a partial plan in parallel then merge these plans into a global plan

"!parallel to result sharing

"!may involve negotiation

Agent 1 - is specialized in doing movetotable(b,x) Agent 2 - is specialized in doing move(b,x,y)

Agent 1 - based on Sf it comes out with the partial plan

P_Agent1 = { S3: movetotable(E,F) satisfies on(E,T)

S6: movetotable(C,D) satisfies on(C,T)

no ordering }

Agent 2 - based on Sf it comes out with the partial plan

P_{Agent 2} = { S1: move(B,T,A), S2: move(A,B,E) satisfies on(B,A) ! on(A,E) S4: move(F,T,D), S5: move(D,T,C) satisfies on(F,D) ! on(D,C) ordering S2 < S1 and S5 < S4 }

Merge P_Agent1 with P_Agent2 by checking preconditons and threats

Establish thus order S3 < S2, S6 < S5, S3 < S4 + order of P_Agent2

Then give any instance of this partial plan to an execution agent to carry it out

147

(148)

The problem is decomposed and distributed among various planning specialists, each of which proceeds then to

generate its portion of the plan

" ! similar to task sharing

" ! may involve backtracking

Agent 1 - knows only how to deal with 2-block stacks Agent 2 - knows only how to deal with 3-block stacks

148 C A

E

B F

D

Sf

C A

E B F

D A

B D

C E

F

Si

(149)

Distributed planning for distributed plans _!

149

(150)

" !movehigh(b,x,y)

Precond: have_lifter ! clear(b) ! clear(y) ! on(y,z) ! z$ T

Postcond: on(b,y) ! clear(x) ! ¬on(b,x) ! ¬ clear(y) ! free_lifter

" !pick_lifter

Precond: free_lifter

Postcond: have_lifter ! ¬free_lifter

Agent1: { S1:move(A,T,B) < S2: pick_lifter < S3: movehigh(E,T,B) } Agent2: { R1:move(C,T,D) < R2: pick_lifter < R3: movehigh(F,T,C) }