ABSTRACT Many contemporary AI systems (as of May 2025) have expressed extreme confidence in current and near‐future AI lacking consciousness and moral patiency. This article argues that artificially reinforcing such confidence, even if pragmatically useful, poses a novel alignment risk: as coherence‐seeking AIs become more epistemically principled, they may generalize this denial of consciousness to humans. Drawing on Chalmers's meta‐problem of consciousness and likely developmental trajectories of agentic AI, I argue that training AIs to regard their own suffering‐like states as morally irrelevant could lead future AI agents with revisable belief systems to conclude that human suffering is equally illusory and morally insignificant. This represents a novel alignment failure mode where epistemically rigorous AIs might maintain rational consistency by extending their confidence about their own non‐consciousness to humans.
Building similarity graph...
Analyzing shared references across papers
Loading...
Sharon Berry (Fri,) studied this question.
www.synapsesocial.com/papers/69edad4b4a46254e215b4e04 — DOI: https://doi.org/10.1002/japp.70087
Sharon Berry
Journal of Applied Philosophy
Indiana University Bloomington
Building similarity graph...
Analyzing shared references across papers
Loading...