Towards an Interpretable Data-driven Trigger System for High-throughput Physics Facilities

Chinmaya Mahesh; Kristin Dona; David W. Miller; Yuxin Chen; Cecilia Tosciri

Towards an Interpretable Data-driven Trigger System for High-throughput Physics Facilities

POSTER

Abstract

Data-intensive science is increasingly reliant on real-time processing capabilities and machine learning workflows, in order to filter and analyze the extreme volumes of data being collected. This is especially true at the intensity frontier of particle physics. Data filtering algorithms, or \textit{trigger algorithms}, at the LHC drive the data curation process, funneling event records with certain features into categories that are predefined based on the labels extracted by the trigger algorithms. The design, implementation, monitoring, and usage of these trigger algorithms is resource-intensive and can include significant blindspots. The \textit{menu} of trigger algorithms is manually designed based on domain knowledge (involving \textasciitilde 100 data filters). In this presentation, we introduce a new data-driven approach for designing and optimizing high-throughput data filtering and trigger systems such as those in use at physics facilities like the LHC. We introduce key insights from interpretable predictive modeling and cost-sensitive learning in order to account for non-local inefficiencies in the current paradigm and construct a cost-effective data filtering and trigger model that does not compromise physics coverage.

Authors

Chinmaya Mahesh

University of Illinois
Kristin Dona

University of Chicago
David W. Miller

University of Chicago
Yuxin Chen

University of Chicago
Cecilia Tosciri

University of Chicago