value-loading-techniques
Methods explored to instill human values into artificial intelligences, including explicit representation, evolutionary selection, reinforcement learning, value accretion, motivational scaffolding, emulation modulation, and institution design.
1 chapter across 1 book
Superintelligence: Paths, Dangers, Strategies (2014)Nick Bostrom
This chapter discusses the challenges and potential costs associated with designing institutions composed of intelligent subagents, including emulations and artificial intelligences, to control superintelligent systems. It explores the ethical concerns such as mind crimes, the unpredictability of social structures among artificial agents, and the complexity added by institution design. The chapter concludes with a summary of various value-loading techniques, highlighting their strengths and weaknesses, and emphasizes the unresolved philosophical problem of determining which values to instill in a superintelligence.