Modeling Systematic Uncertainty Propagation in Contrastive Learning for Anomaly Detection

ORAL

Abstract

Despite the Standard Model’s success, outstanding questions still motivate searches for physics beyond the Standard Model (BSM) at the Large Hadron Collider. However, the lack of a dominant direction makes it challenging to explore theory models with traditional, signal-driven analysis.

Machine-learning-based anomaly detection (AD) offers a promising, model-agnostic alternative to search for many signals. Recent AI advances in representation learning motivate the use of neural embeddings to map detector data into low-dimensional latent spaces, preserving critical features. However, the propagation of systematic uncertainty in embedded spaces remains poorly understood, severely limiting BSM signal detection in AD.

We address this gap with a machine learning strategy that uses a likelihood-based objective to structure the embedded space. Building on previous work using supervised contrastive learning for physics-aware embeddings, our approach further fine-tunes them with a downstream parameterized classification task that incorporates continuous uncertainties. This enables the latent space to covary predictably with systematic distortions.

Using simulated CMS level-1 trigger data, we show that our method successfully models continuous uncertainties in a 4D latent space, improving both interpretability and robustness for AD. This framework offers a general way to study and control systematic uncertainties in latent spaces and can be applied to other scientific domains and problems.

Publication: [1] K. Metzger et al., "Anomaly Preserving Contrastive Neural Embeddings for End-to-End Model-Independent Searches at the LHC," arXiv:2502.15926 (2025). https://arxiv.org/abs/2502.15926

[2] P. Harris et al., "Re-Simulation-based Self-Supervised Learning for Pre-Training Foundation Models," arXiv:2403.07066 (2024). https://arxiv.org/abs/2403.07066

[3] B. M. Dillon et al., "Physics-Inspired Data Augmentations for High-Energy Physics," arXiv:2301.04660 (2023). https://arxiv.org/abs/2301.04660

[4] R. Dangovski et al., "Equivariant Contrastive Learning," arXiv:2111.00899 (2021). https://arxiv.org/abs/2111.00899

[5] R. T. d'Agnolo et al., "Learning New Physics from an Imperfect Machine," arXiv:2111.13633 (2021). https://arxiv.org/abs/2111.13633

[6] P. Khosla et al., "Supervised Contrastive Learning," arXiv:2004.11362 (2020). https://arxiv.org/abs/2004.11362

[7] K. Metzger et al., "CL4AD: Contrastive Learning for Anomaly Detection," GitHub repository (2025). https://github.com/ksmetzger/cl4ad

Presenters

  • Shelley Tong

    • Massachusetts Institute of Technology (MIT)

Authors

  • Shelley Tong

    • Massachusetts Institute of Technology (MIT)
  • Gaia Grosso

    • Massachusetts Institute of Technology
  • Philip C Harris

    • MIT
    • Massachusetts Institute of Technology