We study how convolutional neural networks reorganize information during learning in natural image classification tasks by tracking mutual information (MI) between inputs, intermediate representations, and labels. Across VGG-16, ResNet-18, and ResNet-50, we find that label-relevant MI grows reliably with depth while input MI depends strongly on architecture and activation, indicating that “compression” is not a universal phenomenon. Within convolutional layers, label information becomes increasingly concentrated in a small subset of channels; inference-time knockouts, shuffles, and perturbations confirm that these high-MI channels are functionally necessary for accuracy. This behavior suggests a view of representation learning driven by selective concentration and decorrelation rather than global information reduction. Finally, we show that a simple dependence-aware regularizer based on the Hilbert–Schmidt Independence Criterion can encourage these same patterns during training, yielding small accuracy gains and consistently faster convergence.
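To make the dependence measure concrete, the following is a minimal sketch of the standard biased empirical estimator of the Hilbert–Schmidt Independence Criterion, HSIC = tr(KHLH) / (n - 1)^2, with Gaussian kernels. The abstract does not state which estimator, kernel, or bandwidth the authors use, nor how the term enters the training loss, so the function names (`gaussian_kernel`, `hsic_biased`) and the parameters `sigma_x` and `sigma_y` are illustrative assumptions, not the paper's implementation.

```python
import numpy as np


def gaussian_kernel(x, sigma=1.0):
    """Gaussian (RBF) Gram matrix for the rows of x, shape (n, d)."""
    sq_norms = np.sum(x * x, axis=1)
    # Pairwise squared Euclidean distances; clip tiny negatives from round-off.
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * (x @ x.T)
    sq_dists = np.maximum(sq_dists, 0.0)
    return np.exp(-sq_dists / (2.0 * sigma ** 2))


def hsic_biased(x, y, sigma_x=1.0, sigma_y=1.0):
    """Biased empirical HSIC between paired samples x (n, dx) and y (n, dy).

    Assumed form: tr(K H L H) / (n - 1)^2, where H centers the Gram matrices.
    The bandwidths are fixed here for simplicity (an assumption).
    """
    n = x.shape[0]
    k = gaussian_kernel(x, sigma_x)
    l = gaussian_kernel(y, sigma_y)
    h = np.eye(n) - np.ones((n, n)) / n  # centering matrix
    return float(np.trace(k @ h @ l @ h)) / (n - 1) ** 2


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.normal(size=(200, 1))
    y_dependent = x + 0.1 * rng.normal(size=(200, 1))
    y_independent = rng.normal(size=(200, 1))
    print("dependent:  ", hsic_biased(x, y_dependent))
    print("independent:", hsic_biased(x, y_independent))
```

In a training setting, a term of this kind would typically be computed on mini-batches of activations and labels and added to the task loss with a weighting coefficient; the specific weighting and the layers it is applied to are not given in the abstract.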
Arianna Issitt
Alex Merino
Lamine Deen
Entropy
Florida Institute of Technology
Issitt et al. studied this question.
www.synapsesocial.com/papers/69706c87b6488063ad5c1941 — DOI: https://doi.org/10.3390/e28010118