POMDP Homomorphisms
Abstract
The problem of finding hidden state in a POMDP and the problem of finding abstractions for MDPs are closely related. In this paper, we analyze the connection between existing Predictive State Representation methods [3] and homomorphic reductions of Markov Processes [5, 6]. We formally define a POMDP homomorphism, then extend PSR reduction methods to find POMDP homomorphisms when the original POMDP is known. The resulting methods find more compact abstract models in tasks for which different observations have the same meaning.
Related articles
A POMDP Framework to Find Optimal Inspection and Maintenance Policies via Availability and Profit Maximization for Manufacturing Systems
Maintenance can either increase or decrease a system's availability, so it is valuable to evaluate a maintenance policy from the cost and availability points of view simultaneously, according to the decision maker's priorities. This study proposes a Partially Observable Markov Decision Process (POMDP) framework for a partially observable and stochastically deteriorating syste...
Hyers-Ulam-Rassias stability of n-Jordan *-homomorphisms on C*-algebras
In this paper, we introduce n-Jordan homomorphisms and n-Jordan *-homomorphisms, and we investigate the Hyers-Ulam-Rassias stability of n-Jordan *-homomorphisms on C*-algebras.
Module Homomorphisms Associated with Hypergroup Algebras
Let X be a hypergroup. In this paper, we study the homomorphisms on certain subspaces of L(X)* which are weak*-weak* continuous.
Homomorphisms on Topological Groups from the Perspective of Bourbaki-boundedness
In this note we study some topological properties of bounded sets and Bourbaki-bounded sets. We also introduce two types of Bourbaki-bounded homomorphisms on topological groups: n-Bourbaki-bounded homomorphisms and B-Bourbaki-bounded homomorphisms. We compare them to each other and with the class of continuous homomorphisms. So, two topologies are presented on them a...
Dialogue POMDP components (Part II): learning the reward function
The partially observable Markov decision process (POMDP) framework has been applied in dialogue systems as a formal framework to represent uncertainty explicitly while being robust to noise. In this context, estimating the dialogue POMDP model components (states, observations, and reward) is a significant challenge as they have a direct impact on the optimized dialogue POMDP policy. Learning sta...
Publication date: 2007