The Distribution Hypothesis (Chang, 2026) established that controlled fragment allocation — not fragmentation alone — determines AI decision protection, achieving 81.3% ± 3.1% pass rate under collaborative multi-model reconstruction attacks. This paper presents empirical evidence from 50 queries across 7 frontier AI models over 3 runs with fixed random seed (seed=42) addressing two open questions: whether additional defense layers can push protection above 90%, and whether text similarity accurately measures protection when defense mechanisms operate at the entity level. We introduce Semantic Surrogate — entity replacement with plausible fiction — and entity-weighted DA. Under L2+L3 defense, text-based DA pass rate improves to 93.3% ± 1.2% (McNemar's exact test, p < 0.001), with entity recovery dropping to 0.023 ± 0.006. Entity-weighted DA reaches 97.3% ± 1.2%. Ablation baselines reveal that naive redaction is an anti-defense (54.7%, −20.0pp vs baseline), while response utility analysis shows Semantic Surrogate is the only method satisfying both defense and utility feasibility constraints. We identify domain vocabulary leakage as a boundary condition requiring behavioral-layer defense (MSBA). Version 2.4.
Building similarity graph...
Analyzing shared references across papers
Loading...
Yuchia Chang
Building similarity graph...
Analyzing shared references across papers
Loading...
Yuchia Chang (Thu,) studied this question.
www.synapsesocial.com/papers/69a2878e0a974eb0d3c034d0 — DOI: https://doi.org/10.5281/zenodo.18790843
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: