MBFormer: A General Transformer-based Learning Paradigm for Many-body Interactions in Real Materials

ORAL

Abstract

Recently, progress in machine learning (ML) has revolutionized computational materials science, enabling rapid materials discovery and property prediction, but the quantum many-body problem, which is the key to understanding excited-state properties, ranging from transport to optics, remains challenging due to the complexity of the nonlocal and energy-dependent interactions. Here, we propose a symmetry-aware, grid-free, transformer-based model, MBFormer, that is designed to learn the entire many-body hierarchy directly from mean-field inputs, exploiting the attention mechanism to accurately capture many-body correlations between mean-field states. As proof of principle, we demonstrate the capability of MBFormer in predicting results based on the GW plus Bethe Salpeter equation (GW-BSE) formalism, including quasiparticle energies, exciton energies, exciton oscillator strengths, and exciton wavefunction distribution. Our model is trained on a dataset of 721 two-dimensional materials from the C2DB database, achieving state-of-the-art performance with a low prediction mean absolute error (MAE) on the order of 0.1-0.2 eV for state-level quasiparticle and exciton energies across different materials. Moreover, we show explicitly that the attention mechanism plays a crucial role in capturing many-body correlations. Our framework provides an end-to-end platform from ground states to general many-body prediction in real materials.

*This work was primarily supported by the U.S. Department of Energy, Office of Science, Basic Energy Sciences under Early Career Award No. DE-SC0021965. Development of the BerkeleyGW code was supported by Center for Computational Study of Excited-State Phenomena in Energy Materials (C2SEPEM) at the Lawrence Berkeley National Laboratory, funded by the U.S. Department of Energy, Office of Science, Basic Energy Sciences, Materials Sciences and Engineering Division, under Contract No. DE-C02-05CH11231. Calculations on benzene and black phosphorus were supported by the National Science Foundation Division of Chemistry under award number CHE-2412412. The calculations used resources of the National Energy Research Scientific Computing (NERSC), a DOE Office of Science User Facility operated under contract no. DE-AC02-05CH11231; the Advanced Cyberinfrastructure Coordination Ecosystem: Services & Support (ACCESS), which is supported by National Science Foundation grant number ACI-1548562; and the Texas Advanced Computing Center (TACC) at The University of Texas at Austin.

Publication: https://arxiv.org/abs/2507.05480

Presenters

  • Bowen Hou

    • Yale University

Authors

  • Bowen Hou

    • Yale University
  • Xian Xu

    • Yale University
  • Jinyuan Wu

    • Yale University
  • Diana Y Qiu

    • Yale University