Mutual Information Estimators for Optimal Joint Embeddings
ORAL
Abstract
Contrastive objectives used in modern joint embedding architectures can be reinterpreted as variational estimators of mutual information, which reveals an underlying information-bottleneck framework for representation learning. Leveraging this framework, we show that under a broad range of conditions, embeddings of high-dimensional datasets exhibit jointly Gaussian statistics, and the existing methods are optimal and accurately capture the mutual information. For datasets with structured non-Gaussian latent variables, we design optimal architectures which use substantially fewer samples than existing methods. Our framework naturally generalizes to multi-view datasets with more than two modalities, offering a path toward faster, data-efficient model discovery in physical and biological systems from limited and noisy experimental data.
–
Presenters
-
Paarth Gulati
- University of California, Santa Barbara
- Emory University