Avoiding Unintended AI Behaviors (Hibbard)

Added by Deon Garrett almost 7 years ago

Artificial intelligence (AI) systems too complex for predefined environment models and actions will need to learn environment models and to choose actions that optimize some criteria. Several authors have described mechanisms by which such complex systems may behave in ways not intended in their designs. This paper describes ways to avoid such unintended behavior. For hypothesized powerful AI systems that may pose a threat to humans, this paper proposes a two-stage agent architecture that avoids some known types of unintended behavior. For the first stage of the architecture this paper shows that the most probable finite stochastic program to model a finite history is finitely computable, and that there is an agent that makes such a computation without any unintended instrumental actions.
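To make the first-stage claim concrete, here is a minimal sketch of picking the most probable model for a finite history from a finite set of candidates. The abstract does not spell out the formal definition, so everything here is an assumption for illustration: the toy candidate list, the i.i.d. binary models, and the length-based prior 2^-length are hypothetical stand-ins, not the paper's construction. The only point is that with finitely many candidates and a finite history, the maximization is a terminating computation.

```python
from fractions import Fraction

# Hypothetical toy model class (not from the paper): each candidate is
# (name, description_length_bits, p_one), where p_one is the probability
# the model assigns to observing a 1 at every step.
CANDIDATES = [
    ("fair", 1, Fraction(1, 2)),
    ("mostly_ones", 3, Fraction(9, 10)),
    ("mostly_zeros", 3, Fraction(1, 10)),
]

def likelihood(history, p_one):
    """P(history | model) for an i.i.d. binary model."""
    result = Fraction(1)
    for bit in history:
        result *= p_one if bit == 1 else 1 - p_one
    return result

def most_probable_model(history, candidates=CANDIDATES):
    """Return the candidate maximizing prior * likelihood.

    The prior 2**-length is an assumed simplicity prior: shorter
    descriptions are considered more probable a priori.
    """
    def score(candidate):
        _, length_bits, p_one = candidate
        return Fraction(1, 2 ** length_bits) * likelihood(history, p_one)
    return max(candidates, key=score)

if __name__ == "__main__":
    best = most_probable_model([1, 1, 1, 1, 0, 1, 1, 1])
    print(best[0])
```

With a history dominated by ones, the longer "mostly_ones" model outweighs its prior penalty; with a very short history such as [1, 0], the simpler "fair" model wins. Exact rational arithmetic is used so the comparison does not depend on floating-point rounding.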

paper_56.pdf (156.5 kB)


Replies (1)

RE: Avoiding Unintended AI Behaviors (Hibbard) - Added by Jaylen James 6 months ago

It changed the working hours. These features are now being upgraded every day. essay vikings review: share some indexing purposes you can use.
