Mitigating Spurious Features in Contrastive Learning with Spectral Regularization

Citations: 0

#1334 of 5858 papers in NeurIPS 2025

Authors

Naghmeh Ghanooni, Waleed Mustafa, Dennis Wagner, Sophie Fellenz, Anthony Lin, Marius Kloft

Topics

contrastive learning, spurious features, spectral regularization, self-supervised learning, feature covariance, representation learning, transfer performance

Abstract

Neural networks generally prefer simple and easy-to-learn features. When these features are spuriously correlated with the labels, the network's performance can suffer, particularly for underrepresented classes or concepts. Self-supervised representation learning methods, such as contrastive learning, are especially prone to this issue, often resulting in worse performance on downstream tasks. We identify a key spectral signature of this failure: early reliance on dominant singular modes of the learned feature matrix. To mitigate this, we propose a novel framework that promotes a uniform eigenspectrum of the feature covariance matrix, encouraging diverse and semantically rich representations. Our method operates in a fully self-supervised setting, without relying on ground-truth labels or any additional information. Empirical results on SimCLR and SimSiam demonstrate consistent gains in robustness and transfer performance, suggesting broad applicability across self-supervised learning paradigms. Code: https://github.com/NaghmehGh/SpuriousCorrelation_SSRL
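
The abstract does not state the regularizer itself, so the following is only a minimal sketch of what "promoting a uniform eigenspectrum of the feature covariance matrix" could look like, not the authors' loss. It assumes one simple surrogate: the squared Frobenius distance between the trace-normalized covariance and I/d, which equals the sum over eigenvalues of (p_i - 1/d)^2, where p_i is the i-th eigenvalue divided by the trace. The function names `feature_covariance` and `spectral_uniformity_penalty` are hypothetical, and plain Python lists stand in for a batch of embeddings.

```python
def feature_covariance(features):
    """Sample covariance (d x d) of an n x d list of feature rows."""
    n, d = len(features), len(features[0])
    means = [sum(row[j] for row in features) / n for j in range(d)]
    centered = [[row[j] - means[j] for j in range(d)] for row in features]
    return [
        [sum(r[i] * r[j] for r in centered) / (n - 1) for j in range(d)]
        for i in range(d)
    ]


def spectral_uniformity_penalty(features):
    """Assumed surrogate: ||C / tr(C) - I / d||_F^2.

    This is zero exactly when all eigenvalues of the covariance C are
    equal, and it grows as variance concentrates in a few directions.
    No eigendecomposition is needed.
    """
    cov = feature_covariance(features)
    d = len(cov)
    trace = sum(cov[i][i] for i in range(d))
    if trace == 0:
        raise ValueError("features have zero variance")
    penalty = 0.0
    for i in range(d):
        for j in range(d):
            target = 1.0 / d if i == j else 0.0
            penalty += (cov[i][j] / trace - target) ** 2
    return penalty


# Variance spread evenly over both dimensions: flat spectrum.
isotropic = [[1.0, 0.0], [-1.0, 0.0], [0.0, 1.0], [0.0, -1.0]]
# Both dimensions carry the same signal: one dominant mode.
collapsed = [[1.0, 1.0], [-1.0, -1.0], [2.0, 2.0], [-2.0, -2.0]]

penalty_isotropic = spectral_uniformity_penalty(isotropic)
penalty_collapsed = spectral_uniformity_penalty(collapsed)
```

In a training loop, a term of this kind would typically be added to the contrastive loss with a small weight and computed on the batch of embeddings with a differentiable array library; the weight and the placement in the network are choices the abstract does not specify.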
