March 3, 2026

Dual Natural Gradient Descent for Scalable Training of Physics-Informed Neural Networks

Key Points

D-NGD achieves a final $L^2$ error one to three orders of magnitude lower than first-order (Adam, SGD) and quasi-Newton methods.
The method computes the Gauss-Newton step in the generally smaller residual space rather than in parameter space, avoiding the $O(n^3)$ cost of the parameter-space solve.
D-NGD provides a dense direct solver for modest residual counts and a Nyström-preconditioned conjugate-gradient solver for larger ones, and scales to networks with up to 12.8 million parameters.
Full natural-gradient training of PINNs at this scale runs on a single GPU.

Abstract

Natural-gradient methods markedly accelerate the training of Physics-Informed Neural Networks (PINNs), yet their Gauss-Newton update must be solved in the parameter space, incurring a prohibitive $O(n^3)$ time complexity, where $n$ is the number of trainable network weights. We show that exactly the same step can instead be formulated in a generally smaller residual space of size $m = \sum_\gamma N_\gamma d_\gamma$, where each residual class $\gamma$ (e.g. PDE interior, boundary, initial data) contributes $N_\gamma$ collocation points of output dimension $d_\gamma$. Building on this insight, we introduce Dual Natural Gradient Descent (D-NGD). D-NGD computes the Gauss-Newton step in residual space, augments it with a geodesic-acceleration correction at negligible extra cost, and provides both a dense direct solver for modest $m$ and a Nyström-preconditioned conjugate-gradient solver for larger $m$. Experimentally, D-NGD scales second-order PINN optimization to networks with up to 12.8 million parameters, delivers a final $L^2$ error one to three orders of magnitude lower than first-order (Adam, SGD) and quasi-Newton methods, and, crucially, enables full natural-gradient training of PINNs at this scale on a single GPU.
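
The equivalence between the parameter-space and residual-space solves can be illustrated with a small NumPy sketch. This is not the authors' implementation: the random Jacobian `J`, the residual vector `r`, the function names, and the Levenberg-Marquardt-style `damping` term are all assumptions made for illustration. It relies only on the standard identity (J^T J + λI)^{-1} J^T r = J^T (J J^T + λI)^{-1} r, which lets an n x n system be replaced by an m x m one.

```python
import numpy as np

def primal_step(J, r, damping):
    """Damped Gauss-Newton step solved in parameter space (n x n system)."""
    n = J.shape[1]
    return np.linalg.solve(J.T @ J + damping * np.eye(n), J.T @ r)

def dual_step(J, r, damping):
    """Same step solved in residual space (m x m system), then mapped back."""
    m = J.shape[0]
    y = np.linalg.solve(J @ J.T + damping * np.eye(m), r)
    return J.T @ y

# Hypothetical toy problem: m residuals, n parameters, with m much smaller than n.
rng = np.random.default_rng(0)
m, n = 20, 500
J = rng.standard_normal((m, n))   # stand-in for the residual Jacobian
r = rng.standard_normal(m)        # stand-in for the stacked residual vector
damping = 1e-3                    # assumed regularization, not from the source

step_primal = primal_step(J, r, damping)
step_dual = dual_step(J, r, damping)

# The two steps agree up to floating-point error.
max_abs_diff = float(np.max(np.abs(step_primal - step_dual)))
print(max_abs_diff)
```

In this toy setting the dual solve factorizes a 20 x 20 matrix instead of a 500 x 500 one. For larger m, the dense solve in `dual_step` could be swapped for an iterative method such as conjugate gradient, which is the regime the abstract assigns to its Nyström-preconditioned solver.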

Authors

Jnini A.

Vella F.
