What question did this study set out to answer?

The research aims to show that smaller, structured neural architectures can outperform larger models in terms of accuracy and interpretability.

April 29, 2026Open Access

Project GlassBox: Structure Over Scale in Neural Reasoning — Breaking the Black Box Through Architectural Transparency

Key Points

The research aims to show that smaller, structured neural architectures can outperform larger models in terms of accuracy and interpretability.
Conducted a 33-phase experimental campaign comparing 77K-parameter Graph Neural Network with a 1.45M-parameter Transformer.
Utilized test-time gradient adaptation with geometric data augmentation for model performance enhancement.
Employed full causal path tracing for attributed predictions.
The 77K structured model achieved 56.8% accuracy compared to 43.9% for the larger model.
Gradient adaptation improved accuracy from 85.1% to 87.4%, surpassing previous limits.
Achieved 82.8% attribution coverage, significantly exceeding large language models' 25% coverage.

Abstract

Project GlassBox is a systematic 33-phase experimental campaign demonstrating that small, structurally constrained neural architectures can simultaneously achieve superior task performance and unprecedented interpretability compared to large unconstrained models. Using ARC-AGI as a benchmark for abstract visual reasoning, a 77K-parameter Graph Neural Network with Pointer attention (the "GlassBox Agent") outperforms a 1.45M-parameter Transformer baseline (56.8% vs 43.9% full match accuracy). Through test-time gradient adaptation with geometric data augmentation, accuracy reaches 87.4%, breaking through a previously observed 85% performance ceiling. Key Results: Structure > Scale: 77K structured parameters outperform 1.45M unstructured parameters (19× smaller, higher accuracy) Hydra Self-Repair: First quantitative characterization of neural self-repair — after destroying 50% of model neurons, few-shot adaptation recovers 95.8% of original performance 82.8% Attribution: Full causal path tracing for 82.8% of predictions, exceeding by 3.3× the 25% attribution coverage reported for large language models Adaptation Supremacy: Test-time gradient adaptation is strictly superior to symbolic program search (+34.5pp improvement) 85% Ceiling Breakthrough: D8 geometric augmentation during adaptation pushes accuracy from 85.1% to 87.4% Source code: https://github.com/hafufu-stack/glassbox Acknowledgments This research was conducted entirely independently, without institutional affiliation or corporate funding. The author currently faces financial constraints that make it increasingly difficult to maintain subscriptions to AI services essential for this line of research. To sustain and improve the quality of future work, the author is actively seeking community sponsorship. Details are available at https://github.com/sponsors/hafufu-stack.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Hiroto Funasaki

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Project GlassBox: Structure Over Scale in Neural Reasoning — Breaking the Black Box Through Architectural Transparency

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study