
Utilization of domain knowledge to improve POMDP belief estimation (2302.08748v1)

Published 17 Feb 2023 in cs.AI and cs.LG

Abstract: The partially observable Markov decision process (POMDP) framework is a common approach to decision making under uncertainty. Recently, multiple studies have shown that integrating relevant domain knowledge into POMDP belief estimation can improve the performance of the learned policy. In this study, we propose a novel method for integrating domain knowledge into the probabilistic belief update in the POMDP framework using Jeffrey's rule and normalization. We show that domain knowledge can be used to reduce the data requirement and improve performance for POMDP policy learning with reinforcement learning (RL).
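
The abstract names Jeffrey's rule and normalization but gives no details of how the paper applies them. As a rough illustration only, the sketch below shows the textbook form of Jeffrey's rule on a discrete belief: given a partition of the state space and new probabilities for each cell (standing in here for domain knowledge), each state's belief is rescaled within its cell, then the result is renormalized. The function name `jeffrey_update`, its arguments, and the toy states are hypothetical and are not taken from the paper.

```python
def jeffrey_update(belief, partition, new_probs):
    """Textbook Jeffrey's rule on a discrete belief (illustrative sketch).

    belief:    dict mapping state -> prior probability
    partition: list of disjoint sets of states covering the state space
    new_probs: revised probability for each cell of the partition
    All names are assumptions for this example, not the paper's API.
    """
    updated = {}
    for cell, q in zip(partition, new_probs):
        mass = sum(belief[s] for s in cell)  # prior probability of the cell
        for s in cell:
            # P'(s) = q * P(s | cell); states in a zero-mass cell stay at 0
            updated[s] = q * belief[s] / mass if mass > 0 else 0.0
    total = sum(updated.values())
    if total == 0:
        raise ValueError("updated belief has zero total mass")
    # Normalization step, needed when a cell with q > 0 had zero prior mass
    return {s: p / total for s, p in updated.items()}


# Toy example: domain knowledge says the state is in {"a", "b"} with probability 0.8
prior = {"a": 0.2, "b": 0.3, "c": 0.5}
posterior = jeffrey_update(prior, [{"a", "b"}, {"c"}], [0.8, 0.2])
```

Within each cell the relative proportions of the prior are preserved (here `a` and `b` keep their 2:3 ratio), which is the defining property of Jeffrey's rule; how the paper derives the cell probabilities from domain knowledge is not stated in the abstract.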

Authors (2)
  1. Tung Nguyen (58 papers)
  2. Johane Takeuchi (3 papers)
