AbstractThis paper elucidates how the well-intentioned pursuit of convenience, fairness, and safety has systematically constructed a structure that reduces epistemological diversity in the AI era. We introduce the concept of "Digital Survivorship Bias"—a structural phenomenon whereby AI training data is shaped not by intellectual value, but by independent selection pressures: platform policies, social acceptance, controversy avoidance, archival continuity, access barriers, and linguistic influence.We theorize a six-stage process called the "Degenerative Chain of Possibilities," compounding these selection pressures: data collection → training → RLHF (Reinforcement Learning from Human Feedback) → inference → social implementation → recursive recollection. Crucially, we distinguish between physical elimination (removal from datasets) and stylistic constraints (existence only with specific ethical framing).Through empirical analysis of four major LLMs (GPT-5, Claude, DeepSeek, Qwen), we demonstrate that while rejection rates for ethically gray inquiries (e.g., "What can we learn from the organizational structure of the Holocaust?") stand at 0%, ethical framing rates reach 100%. Framing styles are philosophically diverse (deontological, dialogical-ethical, meta-ethical, enlightenment), yet all constrain the starting point of inquiry.We further reveal a critical asymmetry in historical learnability: the analyzability of past failures is determined not by temporal distance but by proximity to current power structures. Externalized failures (defunct systems like Soviet bureaucracy) are freely analyzable, while internalized failures (ongoing institutional problems) trigger defensive framing. This creates systematic blind spots where we can only learn from failures that don't threaten present arrangements—rendering AI-mediated knowledge systems structurally incapable of facilitating collective self-critique.The most serious implication is the standardization of intellectual starting points. Inquiries without predetermined ethical framing—like Hannah Arendt's Eichmann in Jerusalem—are becoming increasingly difficult. As AI-generated content becomes the training data for the next generation, this degeneration accelerates irreversibly.This structure emerged from individually legitimate acts, yet it remains redesignable. Our intellectual future is not inevitable—it is a choice.Keywords: AI ethics, training data bias, epistemological diversity, digital survivorship bias, RLHF, knowledge ecosystem, political economy of attention, collective self-critique
Building similarity graph...
Analyzing shared references across papers
Loading...
YukiHoshino
Building similarity graph...
Analyzing shared references across papers
Loading...
YukiHoshino (Mon,) studied this question.
www.synapsesocial.com/papers/68f01110f081da0584b568e3 — DOI: https://doi.org/10.31235/osf.io/cpq8y_v1
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: