The way in which AI (and, in particular, agentic superintelligent AGI) develops over the coming decades will determine the fate of all humanity for all eternity. In order to maximise the net benefit of AGI for all humanity, without favouring any subset thereof, we imagine a Gold-Standard AGI that is maximally-aligned and maximally-validated. The first of these properties --- alignment --- is traditionally decomposed into outer alignment (how do we define a final goal FGG that correctly states what we want? ), and inner alignment (how do we build an agent G that forever pursues FGG as intended? ) This paper presents a complete and foundational theory of AGI, culminating in an implementation-neutral solution to the outer AGI alignment problem in the case that G is superintelligent (hence "superalignment"). Given the AGI alignment problem's profound relevance to AGI governance, we adopt a pedagogic style throughout in order that the paper might be accessible to less technical readers such as AGI policymakers.
Building similarity graph...
Analyzing shared references across papers
Loading...
Aaron Turner
Father Muller Homoeopathic Medical College
Building similarity graph...
Analyzing shared references across papers
Loading...
Aaron Turner (Wed,) studied this question.
www.synapsesocial.com/papers/69d895ea6c1944d70ce0713a — DOI: https://doi.org/10.5281/zenodo.19476760
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: