Partially observable Markov decision process
Generalization of Markov Decision Process.
POMDP models an agent decision process where system dynamics are assumed to be determined by MDP, but the agent cannot directly observe the underlying state.
Partially observable Markov decision process
A partially observable Markov decision process (POMDP) is a generalization of a Markov decision process (MDP). A POMDP models an agent decision process in which it is assumed that the system dynamics are determined by an MDP, but the agent cannot directly observe the underlying state. Instead, it must maintain a sensor model (the probability distribution of different observations given the underlying state) and the underlying MDP. Unlike the policy function in MDP which maps the underlying states to the actions, POMDP's policy is a mapping from the history of observations (or belief states) to the actions.
https://en.wikipedia.org/wiki/Partially_observable_Markov_decision_process
Belief MDP bound (Token Entropy)
This represents how much we believe in the observation by adding another multiplication factor.
Simplifying Complex Observation Models in Continuous POMDP Planning...
Solving partially observable Markov decision processes (POMDPs) with high dimensional and continuous observations, such as camera images, is required for many real life robotics and planning...
https://openreview.net/forum?id=xztVRhFY3c
Online POMDP Planning with Anytime Deterministic Guarantees
Autonomous agents operating in real-world scenarios frequently encounter uncertainty and make decisions based on incomplete information. Planning under uncertainty can be mathematically formalized...
https://openreview.net/forum?id=Cupr2yTFSx¬eId=dGej9Dk4Q3
Multi-agent observation sharing for POMDP to approximation
Partially Observable Multi-agent RL with (Quasi-)Efficiency: The...
We study provable multi-agent reinforcement learning (MARL) in the general framework of partially observable stochastic games (POSGs). To circumvent the known hardness results and the use of...
https://openreview.net/forum?id=6dnHrArK48

Seonglae Cho