What question did this study set out to answer?

The research aims to introduce a framework that improves reasoning in large language models by integrating causal inference and logical principles.

March 22, 2026

Open Access

Causal Transformer: Toward Deductive Reasoning in Large Language Models

Key Points

Proposes the Causal Transformer framework, which combines latent-space causal inference with variational free-energy minimisation.
Uses category-theoretic logical constraints to promote logical consistency.
Adds Bayesian-conformal uncertainty quantification to calibrate model predictions.
Targets limitations of current language models, such as hallucinations, spurious correlations, and overconfident predictions.
Offers a practically applicable uncertainty module that provides calibrated abstention mechanisms.
Outlines a vision for more robust, interpretable, and deductive AI systems.

Abstract

This work introduces the Causal Transformer (CT), a theoretical framework aimed at advancing the reasoning capabilities of large language models beyond purely statistical prediction. While current Transformer-based architectures excel at approximating conditional probabilities, they lack explicit representations of causality, logical consistency, and calibrated uncertainty. The proposed framework integrates four complementary components: latent-space causal inference, variational free-energy minimisation, category-theoretic logical constraints, and Bayesian-conformal uncertainty quantification. Together, these elements are designed to address key limitations of modern LLMs, including hallucinations, spurious correlations, and overconfident predictions. The contribution is intentionally twofold: on one hand, a practically applicable uncertainty module providing calibrated abstention mechanisms; on the other, a broader architectural vision outlining a path toward more robust, interpretable, and deductive AI systems. This work is not presented as a ready-to-deploy solution, but as a structured theoretical proposal for future research in causal and reasoning-aware language models.
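
The abstract does not describe how the uncertainty module computes its abstention decision. As a rough illustration of what "calibrated abstention" can mean, the sketch below uses standard split conformal prediction for classification, which is an assumption here and not necessarily the author's method. The function names `conformal_threshold` and `predict_or_abstain`, the nonconformity score (one minus the probability of the true label), and the rule "answer only when the prediction set holds exactly one label" are all hypothetical choices made for this example.

```python
import math


def conformal_threshold(cal_probs, cal_labels, alpha):
    """Return the split-conformal score threshold for miscoverage level alpha.

    cal_probs: list of per-class probability lists on held-out calibration data.
    cal_labels: list of true class indices for the same examples.
    The nonconformity score is 1 - p(true label) (an illustrative choice).
    """
    scores = sorted(1.0 - probs[label] for probs, label in zip(cal_probs, cal_labels))
    n = len(scores)
    # Finite-sample corrected rank: ceil((n + 1) * (1 - alpha)).
    rank = math.ceil((n + 1) * (1.0 - alpha))
    if rank > n:
        # Too few calibration points for this alpha: keep every label.
        return float("inf")
    return scores[rank - 1]


def predict_or_abstain(probs, threshold):
    """Return the predicted class index, or None to abstain.

    The prediction set holds every label whose score is within the threshold.
    Hypothetical abstention rule: answer only if the set is a single label.
    """
    prediction_set = [k for k, p in enumerate(probs) if 1.0 - p <= threshold]
    if len(prediction_set) == 1:
        return prediction_set[0]
    return None


# Toy calibration data (made up for illustration): five examples, true label 0.
cal_probs = [[0.875, 0.125], [0.75, 0.25], [0.5, 0.5], [0.625, 0.375], [0.9375, 0.0625]]
cal_labels = [0, 0, 0, 0, 0]
threshold = conformal_threshold(cal_probs, cal_labels, alpha=0.5)

confident = predict_or_abstain([0.875, 0.125], threshold)  # one label in the set
uncertain = predict_or_abstain([0.5, 0.5], threshold)      # empty set, so abstain
```

Under the usual exchangeability assumption, the prediction set built this way contains the true label with probability at least 1 - alpha on average over calibration and test data. That guarantee is what makes the abstention rule calibrated rather than a heuristic confidence cutoff.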

Authors

Marco Galli
