Accurate, uncertainty-aware classification of molecular chemical motifs from multi-modal X-ray absorption spectroscopy

ORAL

Abstract

Accurate classification of molecular chemical motifs from experimental measurement is an important problem in molecular physics, chemistry and biology. In this work, we present neural network ensemble classifiers for predicting the presence (or lack thereof) of 41 different chemical motifs on small molecules from simulated C, N and O K-edge X-ray absorption near-edge structure (XANES) spectra. Our classifiers not only reach a maximum average class-balanced accuracy of 0.99 but also accurately quantify uncertainty. We also show that including multiple XANES modalities improves predictions notably on average, demonstrating a "multi-modal advantage" over any single modality. In addition to structure refinement, our approach can be generalized for broad applications with molecular design pipelines.

* This material is based upon work supported by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences, under Award Numbers FWP PS-030 and DE-SC-0012704. This research used the Theory and Computation resources of the Center for Functional Nanomaterials, which is a U.S. DOE Office of Science Facility, and the Scientific Data and Computing Center at Brookhaven National Laboratory under Contract No. DE-SC0012704.

Presenters

  • Deyu Lu

    Brookhaven National Laboratory

Authors

  • Deyu Lu

    Brookhaven National Laboratory

  • Matthew R Carbone

    Brookhaven National Lab, Brookhaven National Laboratory

  • Phillip M Maffettone

    Brookhaven National Laboratory

  • Xiaohui Qu

    Brookhaven National Laboratory, Brookhaven National Lab

  • Shinjae Yoo

    Brookhaven National Laboratory