Data-Driven Discovery of Superconductivity within the Inorganic Crystals Graphlet Bank

ORAL

Abstract

Machine learning offers powerful opportunities for materials discovery, although realizing this potential depends sensitively on how materials are represented. Approaches that incorporate experimentally derived chemical and structural information provide a more physically meaningful basis for connecting atomic-scale organization with emergent quantum phenomena [1].

The Inorganic Crystals Graphlet Bank (ICGB) provides a large, experimentally grounded dataset that encodes local chemistry, geometry, and symmetry for nearly 200,000 materials. Building on previous work [2] where Gaussian Process (GP) models were trained to capture superconducting behavior, we applied these models in a high-throughput search across ICGB to predict new materials with potential superconductivity. This large-scale screening identifies new candidate superconductors and demonstrates how data-driven approaches can leverage ICGB to explore vast materials spaces. The superconductivity example highlights the broader potential of ICGB as a phenomenological, data-driven platform for predicting target properties, leveraging existing measurements and reported properties.

[1] Sommer et al., Sci. Data 10, 816 (2023).

[2] Lesser et al., arXiv:2510.07373 (2025).

*Y.L., A.P. and E.-A.K. are supported by the U.S. Department of Energy, Office of Science, Basic Energy Sciences, Materials Sciences and Engineering Division. OL and E.-A.K. are supported by the U.S. Department of Energy through Award Number: DE-SC0023905. E.-A.K. is supported by the NSF through the AI Research Institutes program Award No. DMR-2433348 and by the grant OAC-2118310.

Presenters

  • YANJUN LIU

    • Cornell University

Authors

  • YANJUN LIU

    • Cornell University
  • Aaditya Panigrahi

    • Cornell University
  • Omri Lesser

    • Cornell University
  • Eun-Ah Kim

    • Cornell University