What does this research mean for the field?

A neural network's architecture and inherent symmetries induce Structural Invariant Manifolds (SIMs) that strictly confine gradient flow trajectories during training, independent of the training data and loss function. Novelty: ClaimNovelty.NOVEL_FINDING. Consensus alignment: ConsensusAlignment.ESTABLISHES_NEW_DIRECTION.

April 1, 2026Open Access

Architecture induces structural invariant manifolds of neural network training dynamics

Key Points

Key points are not available for this paper at this time.

Abstract

While architecture is recognized as key to the performance of deep neural networks, its precise effect on training dynamics has been unclear due to the confounding influence of data and loss functions. This paper proposed an analytic framework based on the geometric control theory to characterize the dynamical properties intrinsic to a model’s parameterization. We prove that the Structural Invariant Manifolds (SIMs) of an analytic model Formula: see text — submanifolds that confine gradient flow trajectories independent of data and loss — are unions of orbits of the vector field family Formula: see text. We then prove that a model’s symmetry, e.g. permutation symmetry for neural networks, induces SIMs. By applying this, we characterize the hierarchy of symmetry-induced SIMs in fully-connected networks, where dynamics exhibit neuron condensation and equivalence to reduced-width networks. For two-layer networks, we prove all SIMs are symmetry-induced, closing the gap between known symmetries and all possible invariants. Overall, by establishing the framework for analyzing SIMs induced by architecture, our work paves the way for a deeper analysis of neural network training dynamics and generalization in the near future.

Bookmark

View Full Paper

Cite This Study

Zhao et al. (Wed,) studied this question.

synapsesocial.com/papers/6a1af1fa7ff99bba06465fd2 https://doi.org/https://doi.org/10.1142/s0218202526420078

Bookmark

View Full Paper