Solving POMDPs by Searching the Space of Finite Policiespp. 417-426.
|
Reviews
[Write a review of this article]
There are no reviews of this article
Notes for this articleModel-based method for learning finite-state controllers for POMDPs. Presents two algorithms, one a globally optimal branch-and-bound search, and the other a locally optimal gradient ascent method.
Find related articles from these CiteULike users
Find related articles with these CiteULike tags
AbstractSolving partially observable Markov decision processes (POMDPs) is highly intractable in general, at least in part because the optimal policy may be infinitely large. In this paper, we explore the problem of finding the optimal policy from a restricted set of policies, represented as finite state automata of a given size. This problem is also intractable, but we show that the complexity can be greatly reduced when the POMDP and/or policy are further constrained. We demonstrate good...
BibTeX record
RIS record