Attaining Human's Desirable Outcomes in Human-AI Interaction via Structural Causal Games (2405.16588v1)
Abstract: In human-AI interaction, a prominent goal is to attain the human's desirable outcome with the assistance of AI agents, which can ideally be framed as the problem of seeking the optimal Nash equilibrium that matches the human's desirable outcome. However, reaching this outcome is usually challenging because multiple Nash equilibria are compatible with the assisting task yet do not correspond to the human's desirable outcome. To tackle this issue, we employ a theoretical framework called the structural causal game (SCG) to formalize the human-AI interactive process. Furthermore, we introduce a strategy, referred to as pre-policy intervention on the SCG, to steer AI agents towards attaining the human's desirable outcome. In more detail, a pre-policy is learned as a generalized intervention that guides the agents' policy selection through a transparent and interpretable procedure determined by the SCG. To make the framework practical, we propose a reinforcement-learning-like algorithm to search for this pre-policy. The proposed algorithm is tested in both gridworld environments and realistic dialogue scenarios with LLMs, demonstrating its adaptability to a broader class of problems and its potential effectiveness in real-world situations.
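To make the equilibrium-selection problem concrete, the following is a minimal Python sketch, not the paper's SCG algorithm: a 2x2 coordination game has two Nash equilibria, (A, A) and (B, B), but only (A, A) matches the human's desirable outcome. A hypothetical "pre-policy" intervention seeds the agents' initial action values with a recommendation, and a brute-force outer search over candidate interventions stands in for the paper's reinforcement-learning-like pre-policy search. All names here (the payoff matrix, `train`, `search_pre_policy`, the optimistic-initialization trick) are illustrative assumptions, not details from the paper.

```python
# Minimal sketch (illustrative assumptions, not the paper's algorithm):
# two independent Q-learners play a coordination game with two Nash equilibria;
# a pre-policy intervention biases which equilibrium they settle on.
import random

ACTIONS = ["A", "B"]
HUMAN_DESIRED = ("A", "A")  # the outcome the human wants the agents to reach

def payoff(a1, a2):
    # Coordinating on either action yields 1; miscoordination yields 0.
    return 1.0 if a1 == a2 else 0.0

def train(recommendation, episodes=2000, eps=0.1, lr=0.1, seed=0):
    """Independent stateless Q-learning. The hypothetical pre-policy intervention
    optimistically initializes the recommended action; None means no intervention."""
    rng = random.Random(seed)
    init = lambda: {a: (1.0 if a == recommendation else rng.uniform(0.0, 0.1))
                    for a in ACTIONS}
    q1, q2 = init(), init()
    for _ in range(episodes):
        a1 = rng.choice(ACTIONS) if rng.random() < eps else max(q1, key=q1.get)
        a2 = rng.choice(ACTIONS) if rng.random() < eps else max(q2, key=q2.get)
        r = payoff(a1, a2)
        q1[a1] += lr * (r - q1[a1])
        q2[a2] += lr * (r - q2[a2])
    return max(q1, key=q1.get), max(q2, key=q2.get)

def search_pre_policy(seed=0):
    """Keep the candidate intervention whose induced equilibrium matches the human's
    desired outcome (brute force here, in place of the paper's RL-like search)."""
    best = None
    for rec in [None, "A", "B"]:
        joint = train(rec, seed=seed)
        score = 1.0 if joint == HUMAN_DESIRED else 0.0
        if best is None or score > best[1]:
            best = (rec, score, joint)
    return best

if __name__ == "__main__":
    rec, score, joint = search_pre_policy(seed=7)
    print(f"chosen pre-policy: recommend {rec}; induced joint action {joint}; "
          f"matches human's desired outcome: {score == 1.0}")
```

The point of the sketch is only the two-level structure suggested by the abstract: an inner process in which agents converge to some equilibrium of the assisting task, and an outer process that selects an intervention, applied before the agents act, so that the equilibrium they reach is the one the human wants.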