2023 JCP-DCP Future of Chemical Physics Lecture. Talk Title: Towards a big-data ecosystem for quantum chemistry research of solvated molecular systems

ORAL · Invited

Abstract

Machine learning (ML) and big data play increasingly important roles in both experimental and theoretical studies of chemical physics. Although numerous critical chemical processes occur in the solution phase, datasets (computational or experimental) and machine-learning models for solution-phase molecular systems are still scarce. My research group’s objective is to overcome these challenges by building a big data ecosystem for quantum chemistry research of solvated molecular systems.



To enable the efficient generation of computational datasets of solvated molecular systems, we developed strategies to accelerate both the implicit and explicit solvent models for quantum chemistry calculations. For the implicit conductor-like polarization model (C-PCM), we developed algorithms on the graphical processing units (GPUs) to accelerate the calculation. For the explicit solvent model, we developed AutoSolvate, an open-source toolkit to streamline the QC calculation workflow of explicitly solvated molecules. To improve the accuracy of the generated datasets, we develop ML models to reduce the discrepancy between experimental measurements and computationally predicted molecular properties in both implicit and explicit solvent models. This ML correction technique has been applied to predict redox potential and absorption/fluorescence wavelength in the solution phase.

To make these tools more accessible, we are developing a web-based platform to offer automated simulations of solvated molecules on cloud computing resources and publicly disseminate the datasets to the computational molecular science communities.

Publication: A. Gale, E. Hruska, and F. Liu*, Quantum Chemistry for Molecules at Extreme Pressure on Graphical
Processing Units: Implementation of Extreme Pressure Polarizable Continuum Model, J. Chem. Phys.
154, 244103 (2021)

E. Hruska, A. Gale, X. Huang,U F. Liu*, AutoSolvate: A Toolkit for Automating Quantum Chemistry
Design and Discovery of Solvated Molecules, J. Chem. Phys. 156, 12801 (2022)

E. Hruska, A. Gale, F. Liu*, Bridging the experiment-calculation divide: machine learning corrections
to redox potential calculations in implicit and explicit solvent models, J. Chem. Theory Comput. 18,
1096 (2022)

F. Ren, F. Liu* Impacts of Polarizable Continuum Models on the SCF Convergence and DFT Delocalization
Error of Large Molecules J. Chem. Phys. 57, 184106 (2022)

X. Chen, P. Li, E. Hruska, F. Liu* Δ-Machine Learning for Quantum Chemistry Prediction of Solutionphase
Molecular Properties at the Ground and Excited States. Phys. Chem. Chem. Phys. 25, 13417
(2023)

F. Ren, F. Liu* Data-Driven Insights into the Fluorescence of Asphaltene Aggregates Using Extended
Frenkel Exciton Model ChemRxiv Preprint https://doi.org/10.26434/chemrxiv-2023-sbbhc (2023)

Presenters

  • Fang Liu

    Emory University

Authors

  • Fang Liu

    Emory University