Prediction of Molecular Properties Using Graph Kernel and Active Learning

Yu-Hang Tang; Wibe A De Jong

Prediction of Molecular Properties Using Graph Kernel and Active Learning

ORAL

Abstract

In this talk, we present a new machine learning method for training predictive models of extensive molecular properties through the application of a similarity kernel on graphical representations of molecules, which is intuitive and can adapt to molecules of arbitrary size and topology. The pairwise similarity matrices between molecules as computed by the graph kernel can be used to construct Gaussian process regression models that can predict extensive properties with provable size scaling. Using an active learning procedure, we demonstrate that models created by our method can achieve a state-of-the-art accuracy of less than 1 kcal/mol on predicting atomization energies for molecules in the QM7 dataset without using any explicit energy decomposition/localization scheme. The method also uses a much smaller number of training samples as compared to other methods to achieve the same level of accuracy.

March 6, 2019, 9:12 AM – March 6, 2019, 9:24 AM

Presenters

Yu-Hang Tang

Lawrence Berkeley National Laboratory

Authors

Yu-Hang Tang

Lawrence Berkeley National Laboratory
Wibe A De Jong

Lawrence Berkeley National Laboratory, Computational Research Division, Lawrence Berkeley National Laboratory