Multi-task learning for molecular electronic structure approaching coupled-cluster accuracy

ORAL

Abstract

Machine learning (ML) plays an important role in quantum chemistry, providing fast-to-evaluate predictive models for various properties of molecules. However, most existing ML models for molecular electronic properties use density functional theory (DFT) databases as ground truth in training, and their prediction accuracy cannot surpass that of DFT. In this work, we developed a unified ML method for electronic structures of organic molecules using the gold-standard CCSD(T) calculations as training data. Tested on hydrocarbon molecules and the QM9 quantum chemistry dataset, our model outperforms DFT with several widely-used hybrid and double hybrid functionals in both computational costs and prediction accuracy of various quantum chemical properties. As case studies, we apply the model to aromatic compounds and semiconducting polymers on both ground state and excited state properties, demonstrating its accuracy and generalization capability to complex systems that are hard to calculate using CCSD(T)-level methods.

*This work was supported by Honda Research Institute (HRI-USA) and SES AI Corp. H.T. acknowledges support from the Mathworks Engineering Fellowship. The calculations in this work were performed in part on the Matlantis high-speed universal atomistic simulator, the Texas Advanced Computing Center (TACC), the MIT SuperCloud, and the National Energy Research Scientific Computing (NERSC).

Publication: arXiv:2405.12229

Presenters

  • Hao Tang

    • Massachusetts Institute of Technology

Authors

  • Hao Tang

    • Massachusetts Institute of Technology
  • Brian Xiao

    • UC Berkeley
  • Wenhao He

    • Massachusetts Institute of Technology
  • Yao Wang

    • Clemson University
    • Emory University
  • Fang Liu

    • Emory University
  • Haowei Xu

    • Massachusetts Institute of Technology
  • Ju Li

    • Massachusetts Institute of Technology