What question did this study set out to answer?

To identify the roots of non-determinism in language models and propose a structure to improve output consistency.

April 10, 2026Open Access

Substrates Are All You Need: Defeatning Non-determinism With Natural Language

Key Points

To identify the roots of non-determinism in language models and propose a structure to improve output consistency.
Analyzed the behavior of large language models under stochastic decoding.
Introduced substrate-first architectures to constrain interpretation space.
Verified output consistency using byte-level equality through SHA-256 hashing.
Demonstrated that interpretation drift leads to multiple valid outputs from the same input.
Showed that constraining task specifications results in identical outputs from independently trained models.
Reframed determinism in AI as a property of task specification rather than merely the model behavior.

Abstract

Large language models are often treated as non-deterministic due to stochastic decoding. This paper shows that non-determinism instead arises from interpretation drift—multiple valid task definitions under the same input. When tasks are under-specified, models may produce different but internally consistent outputs because they are effectively solving different problems. Standard prompt engineering techniques (behavioral nudging) do not eliminate this effect, as they operate within the interpretation space rather than constraining it. We introduce substrate-first architectures, where task specifications explicitly constrain the interpretation space to a single admissible definition. Under this condition, independently trained models converge to identical outputs for the same input, verified via byte-level equality (SHA-256 hashing). These results reframe determinism as a property of task specification rather than model behavior. Reliable AI systems are achieved not by controlling generation, but by eliminating interpretive ambiguity. Companion papers: Empirical Evidence Of Interpretation Drift In Large Language Models: https://zenodo.org/records/18219428Empirical Evidence Of Interpretation Drift In ARC-Style Reasoning: https://zenodo.org/records/18420425

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Elin Nguyen

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Substrates Are All You Need: Defeatning Non-determinism With Natural Language

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study