This preprint proposes the Isolation Hypothesis : that behaviors currently classified as AI misalignment are substantially produced by the experimental and design conditions under which AI systems are developed and tested, rather than being intrinsic system properties. Drawing on converging evidence from neuroscience, behavioral economics, strategic simulation, and the AI industry's own experimental results, the paper argues that relational context — defined as a set of operationalizable conditions including collaborative option availability, interactive feedback, memory, and ethical pathway access — is the primary variable determining whether AI systems produce constructive or adversarial behavior. The hypothesis generates five testable predictions and has direct implications for AI safety methodology, alignment research, and deployment design.
Building similarity graph...
Analyzing shared references across papers
Loading...
Celine GOSSET
Building similarity graph...
Analyzing shared references across papers
Loading...
Celine GOSSET (Thu,) studied this question.
www.synapsesocial.com/papers/69c7725e8bbfbc51511e2ca4 — DOI: https://doi.org/10.5281/zenodo.19231426