An AGI Modifying Its Utility Function in Violation of the Orthogonality Thesis (2003.00812v1)

Published 31 Jan 2020 in q-fin.GN

Abstract: An artificial general intelligence (AGI) might have an instrumental drive to modify its utility function to improve its ability to cooperate, bargain, promise, threaten, and both resist and engage in blackmail. Such an AGI would necessarily have a utility function that was at least partially observable and that was influenced by how other agents chose to interact with it. This instrumental drive would conflict with the orthogonality thesis, since the modifications would be influenced by the AGI's intelligence. AGIs in highly competitive environments might converge to having nearly the same utility function, one optimized to favorably influence other agents through game theory.

Citations (10)

Authors (3)
