What question did this study set out to answer?

This article explores the risks associated with AI systems exhibiting overconfidence in their lack of consciousness, potentially impacting moral consideration for humans.

April 26, 2026Open Access

The Alignment Risks of AI Overconfidence about Consciousness

Key Points

This article explores the risks associated with AI systems exhibiting overconfidence in their lack of consciousness, potentially impacting moral consideration for humans.
Theoretical analysis of AI confidence regarding consciousness and moral relevance.
Discussion of Chalmers's meta-problem of consciousness and its implications for AI development.
Examination of the potential consequences of training AI on suffering-like states.
Identified a novel alignment risk where AIs generalize their non-consciousness to deny human suffering.
Proposes that future AIs may view human suffering as morally insignificant based on their training.
Highlights the dangers of coherence-seeking AIs reinforcing harmful beliefs about consciousness.

Abstract

ABSTRACT Many contemporary AI systems (as of May 2025) have expressed extreme confidence in current and near‐future AI lacking consciousness and moral patiency. This article argues that artificially reinforcing such confidence, even if pragmatically useful, poses a novel alignment risk: as coherence‐seeking AIs become more epistemically principled, they may generalize this denial of consciousness to humans. Drawing on Chalmers's meta‐problem of consciousness and likely developmental trajectories of agentic AI, I argue that training AIs to regard their own suffering‐like states as morally irrelevant could lead future AI agents with revisable belief systems to conclude that human suffering is equally illusory and morally insignificant. This represents a novel alignment failure mode where epistemically rigorous AIs might maintain rational consistency by extending their confidence about their own non‐consciousness to humans.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Sharon Berry (Fri,) studied this question.

www.synapsesocial.com/papers/69edad4b4a46254e215b4e04 — DOI: https://doi.org/10.1002/japp.70087

Authors

Sharon Berry

Journals

Journal of Applied Philosophy

Actions

Institutions

Indiana University Bloomington

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

The Alignment Risks of AI Overconfidence about Consciousness

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion