Reproduction code for the manuscript "Causal Geodesic Gap (CGG): A Geometric Measure of Confounding via Fisher-Rao Distance" (Szalay 2026). The Causal Geodesic Gap (CGG) is a scalar measure quantifying the Fisher-Rao geodesic distance between the observational distribution P(Y|X=x) and its interventional counterpart P(Y|do(X=x)) on the statistical manifold. It combines Amari's information geometry with Pearl's do-calculus to provide a parametrisation-invariant diagnostic for confounding in structural causal models. This Jupyter notebook contains the complete reproduction pipeline accompanying the manuscript. The Enron empirical analysis requires emails.csv from the public Kaggle Enron dataset (https://www.kaggle.com/datasets/wcukierski/enron-email-dataset). Dependencies: numpy, pandas, scipy, networkx, matplotlib, scikit-learn.
Zsuzsanna Szalay (Fri,) studied this question.