POMDP

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2025 Feb 9 11:44
Editor
Edited
Edited
2025 Sep 7 7:2
Refs
Refs

Partially observable Markov decision process

Generalization of
Markov Decision Process
.
POMDP models an agent decision process where system dynamics are assumed to be determined by MDP, but the agent cannot directly observe the underlying state.
 
 
 
www.cs.cmu.edu
Partially observable Markov decision process
A partially observable Markov decision process (POMDP) is a generalization of a Markov decision process (MDP). A POMDP models an agent decision process in which it is assumed that the system dynamics are determined by an MDP, but the agent cannot directly observe the underlying state. Instead, it must maintain a sensor model (the probability distribution of different observations given the underlying state) and the underlying MDP. Unlike the policy function in MDP which maps the underlying states to the actions, POMDP's policy is a mapping from the history of observations (or belief states) to the actions.

Belief MDP bound (
Token Entropy
)

This represents how much we believe in the observation by adding another multiplication factor.
Simplifying Complex Observation Models in Continuous POMDP Planning...
Solving partially observable Markov decision processes (POMDPs) with high dimensional and continuous observations, such as camera images, is required for many real life robotics and planning...
Online POMDP Planning with Anytime Deterministic Guarantees
Autonomous agents operating in real-world scenarios frequently encounter uncertainty and make decisions based on incomplete information. Planning under uncertainty can be mathematically formalized...
Multi-agent observation sharing for POMDP to approximation
Partially Observable Multi-agent RL with (Quasi-)Efficiency: The...
We study provable multi-agent reinforcement learning (MARL) in the general framework of partially observable stochastic games (POSGs). To circumvent the known hardness results and the use of...
 
 

Recommendations