Search Control of Plan Generation in Decision-Theoretic Planners
نویسندگان
چکیده
This paper addresses the search control problem of selecting which plan to refine next for decision-theoretic planners, a choice point common to the decision theoretic planners created to date. Such planners can make use of a utility function to calculate bounds on the expected utility of an abstract plan. Three strategies for using these bounds to select the next plan to refine have been proposed in the literature. We examine the rationale for each strategy and prove that the optimistic strategy of always selecting a plan with the highest upper-bound on expected utility expands the fewest number of plans, when looking for all plans with the highest expected utility. When looking for a single plan with the highest expected utility, we prove that the optimistic strategy has the best possible worst case performance and that other strategies can fail to terminate. To demonstrate the effect of plan selection strategies on performance, we give results using the DRWS planner that show that the optimistic strategy can produce exponential improvements in time and space.
منابع مشابه
Using Loops in Decision-Theoretic Refinement Planners
Classical AI planners use loops over subgoals to move a stack of blocks by repeatedly moving the top block. Probabilistic planners and reactive systems repeatedly try to pick up a block to increase the probability of success in an uncertain environment. These planners terminate a loop only when the goal is achieved or when the probability of success has reached some threshold. The tradeoff betw...
متن کاملAn Easy “Hard Problem” for Decision-Theoretic Planning
This paper presents a challenge problem for decision-theoretic planners. State-space planners reason globally, building a map of the parts of the world relevant to the planning problem, and then attempt to distill a plan out of the map. A planning problem is constructed that humans find trivial, but no state-space planner can solve. Existing POCL planners cannot solve the problem either, but fo...
متن کاملExploiting Belief Locality in Run-Time Decision-Theoretic Planners
While Partially-Observable Markov Decision Processes have become a popular means of representing realistic planning problems, exact approaches to finding POMDP policies are extremely computationally complex. An alternative approach for control in POMDP domains is to use run-time optimization over action sequences in a dynamic decision network. While exact algorithms have to generate a policy ov...
متن کاملFlaw Selection Strategies for Value-Directed Planning
A central issue faced by partial-order, cansal-link (POCL) planning systems is how to select which flaw to resolve when generating the refinements of a partial plan. Domain-independent flaw selection strategies have been discussed extensively in the recent literature (Peot ~ Smith 1993; Joslin & Pollack 1994; Schubert & Gerevini 1995). The PYRRHUS planning system is a decision-theoretic extensi...
متن کاملDecision-Theoretic Subgoaling for Planning with External Events
I describe a planning methodology for domains with uncertainty in the form of external events that are not completely predictable. Under certain conditions, these events can be modelled as continuous-time Markov chains whose states are characterised by the planner’s domain predicates. Planning is goal-directed, but the subgoals are suggested by analysing the utility of the partial plan rather t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998