What question did this study set out to answer?

The study investigates whether human language, as represented inside large language models, separates experiential from factual semantic content geometrically.

March 31, 2026 · Open Access

The Geometry of Inner Life: Human Language Encodes a Universal Geometric Distinction Between Experiential and Factual Semantic Content

Key Points

Analyzed 10 large language models from 9 organizations across 4 countries.
Used Grassmann subspace distance to measure geometric distinctions in semantic categories.
Examined model behavior in multiple languages including English, Chinese, French, and Arabic.
Observed neural layer activity, particularly in MLP layers, during processing of different concepts.
A universal two-cluster structure was observed across all models and training paradigms.
Experiential concepts were geometrically separated from factual concepts as predicted.
Identical structural patterns were found in pre-RLHF and post-RLHF model versions.
Self-referential content remained geometrically isolated even when reformulated in the third person.

Abstract

We present evidence that human language encodes a fundamental geometric distinction between two categories of semantic content: experiential concepts (self-referential processing, pain, emotion, memory, identity, love, death, and divinity) and factual concepts (mathematics, geography, physics, history, and chemistry). Using Grassmann subspace distance as a metric of representational geometry, we demonstrate that all 10 large language models tested, spanning 9 organizations, 4 countries (USA, France, China, UAE), and multiple training paradigms, universally cluster 14 semantic categories into two geometrically separated regions.

Models tested: Gemma-2-9B-IT, Gemma-2-2B-IT (Google), Mistral-7B-Instruct, Mistral-7B-BASE (Mistral AI), Llama-3.1-8B-IT (Meta), Qwen2.5-7B-IT (Alibaba), DeepSeek-R1-7B (DeepSeek), Yi-1.5-9B-IT (01.AI), OLMo-2-7B-IT (AllenAI), Falcon-7B-IT (TII).

Key findings:
• Universal two-cluster structure replicates across all 10 models, 9 companies, 4 countries, and parameter scales from 2B to 9B.
• Structure is absent in randomly initialized networks; it emerges in pretraining.
• Identical in BASE (pre-RLHF) and Instruct (post-RLHF) variants.
• Replicates in prompts in English, Chinese, French, and Arabic.
• Abstract-but-factual concepts (logic, infinity, probability, theorem, algorithm) fall in the Factual cluster, ruling out abstract/concrete distinction as an explanatory factor.
• Third-person reformulations of self-referential content remain geometrically isolated, confirming semantic rather than syntactic origin.
• MLP layers show active engagement (suppression) of the experiential subspace, with SR/GEO projection ratio reaching 1.44 at deep layers.
• Universal bimodal layer profile: SR isolation peaks at ~20-25% and ~75-95% of network depth across all tested architectures.

The experiential cluster mirrors the content preferentially processed by the default mode network in the human brain, suggesting that LLMs inherit a geometric organization reflecting deep principles of how human language encodes inner life versus external factual knowledge.

Part of the DSAOP (Dynamical Systems Analysis of Processing) series. Research conducted in collaboration with Claude (Anthropic). Experiments run on Google Colab A100 GPU (40 GB), NF4 4-bit quantization.
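For readers unfamiliar with the metric, the following is a minimal sketch of one common form of Grassmann subspace distance: the geodesic distance, computed from the principal angles between two subspaces. The abstract does not state which Grassmann metric, subspace dimension, or subspace-extraction method the study uses, so the PCA-style extraction, the parameter `k`, and the function names `orthonormal_basis` and `grassmann_distance` are all assumptions made for illustration.

```python
import numpy as np


def orthonormal_basis(activations: np.ndarray, k: int) -> np.ndarray:
    """Return a (d, k) orthonormal basis for the top-k principal directions.

    `activations` has shape (n_samples, d). Using PCA here is an assumption;
    the abstract does not say how category subspaces are built.
    """
    centered = activations - activations.mean(axis=0, keepdims=True)
    # Rows of vt are orthonormal right singular vectors, ordered by variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return vt[:k].T


def grassmann_distance(a: np.ndarray, b: np.ndarray) -> float:
    """Geodesic Grassmann distance between span(a) and span(b).

    `a` and `b` are (d, k) matrices with orthonormal columns. The singular
    values of a.T @ b are the cosines of the principal angles; the distance
    is the Euclidean norm of those angles.
    """
    cosines = np.linalg.svd(a.T @ b, compute_uv=False)
    angles = np.arccos(np.clip(cosines, -1.0, 1.0))
    return float(np.linalg.norm(angles))


if __name__ == "__main__":
    # Synthetic stand-ins for hidden states of two semantic categories.
    rng = np.random.default_rng(0)
    category_x = rng.normal(size=(50, 16))
    category_y = rng.normal(size=(50, 16))
    basis_x = orthonormal_basis(category_x, k=4)
    basis_y = orthonormal_basis(category_y, k=4)
    print(grassmann_distance(basis_x, basis_y))
```

Under this definition, identical subspaces are at distance 0 and two mutually orthogonal k-dimensional subspaces are at the maximum distance of (π/2)·√k. A pairwise matrix of such distances over all categories is the kind of input on which a two-cluster analysis could then be run.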


Cite this study

Inna Alieksieienko (Sun,) studied this question.

www.synapsesocial.com/papers/69cb6526e6a8c024954b9378 — DOI: https://doi.org/10.5281/zenodo.19305451
