Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Risk-Averse Biased Human Policies in Assistive Multi-Armed Bandit Settings (2104.05334v1)

Published 12 Apr 2021 in cs.RO

Abstract: Assistive multi-armed bandit problems can be used to model team situations between a human and an autonomous system like a domestic service robot. To account for human biases such as the risk-aversion described in the Cumulative Prospect Theory, the setting is expanded to using observable rewards. When robots leverage knowledge about the risk-averse human model they eliminate the bias and make more rational choices. We present an algorithm that increases the utility value of such human-robot teams. A brief evaluation indicates that arbitrary reward functions can be handled.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Michael Koller (13 papers)
  2. Timothy Patten (13 papers)
  3. Markus Vincze (46 papers)

Summary

We haven't generated a summary for this paper yet.