Subgroup Discovery for Finding Interpretable Local Patterns in Data from Materials-Science

ORAL

Abstract

We demonstrate that subgroup discovery (SGD) can help find physically meaningful descriptors from materials-science data obtained by first-principles calculations. In contrast to global modelling algorithms, SGD finds descriptions of subpopulations in which, locally, the target property takes on an interesting distribution. First, the SGD algorithm is formulated for materials applications. Next, SGD is applied to gas-phase gold clusters (having 5 to 14 atoms) to discern relationships between their geometrical and physicochemical properties. Additionally, SGD is shown to identify subgroups that classify 79 of the 82 octet binary materials as either rock salt or zincblende using only information about their chemical composition. SGD is also used to find descriptors that predict both the formation energies and the bandgap energies of transparent conducting oxides. Lastly, an efficient optimal solver using branch-and-bound is developed for dispersion-corrected objective functions to help find improved subgroups.
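The idea of searching for a locally interesting subpopulation can be illustrated with a minimal sketch. It is not the authors' implementation: the toy data, the median-threshold propositions, the helper names (`make_propositions`, `quality`, `discover`), and the coverage-weighted mean-shift quality function are all assumptions chosen for illustration. It also uses exhaustive enumeration rather than the branch-and-bound solver mentioned above.

```python
from itertools import combinations
from statistics import mean, median

# Toy data, made up for illustration: two features and a target property y.
rows = [
    {"x1": 1.0, "x2": 1.0, "y": 0.0},
    {"x1": 1.0, "x2": 4.0, "y": 0.1},
    {"x1": 2.0, "x2": 2.0, "y": 0.0},
    {"x1": 2.0, "x2": 5.0, "y": 0.2},
    {"x1": 3.0, "x2": 1.0, "y": 1.0},
    {"x1": 3.0, "x2": 4.0, "y": 0.1},
    {"x1": 4.0, "x2": 2.0, "y": 0.9},
    {"x1": 4.0, "x2": 5.0, "y": 0.2},
]


def make_propositions(rows, features):
    """Build 'f <= t' and 'f > t' tests, with t the median of feature f."""
    props = []
    for f in features:
        t = median(r[f] for r in rows)
        # Default arguments bind f and t at definition time.
        props.append((f"{f} <= {t}", lambda r, f=f, t=t: r[f] <= t))
        props.append((f"{f} > {t}", lambda r, f=f, t=t: r[f] > t))
    return props


def quality(subgroup, population, alpha=0.5):
    """Assumed quality function: coverage**alpha times the shift in mean y."""
    coverage = len(subgroup) / len(population)
    shift = mean(r["y"] for r in subgroup) - mean(r["y"] for r in population)
    return coverage ** alpha * shift


def discover(rows, features, max_depth=2):
    """Exhaustively score every conjunction of up to max_depth propositions."""
    props = make_propositions(rows, features)
    best = (float("-inf"), None, None)
    for depth in range(1, max_depth + 1):
        for combo in combinations(props, depth):
            members = [r for r in rows if all(test(r) for _, test in combo)]
            if not members:
                continue  # contradictory or empty selector
            q = quality(members, rows)
            if q > best[0]:
                best = (q, [name for name, _ in combo], members)
    return best


best_quality, best_description, best_members = discover(rows, ["x1", "x2"])
print(" AND ".join(best_description), round(best_quality, 5), len(best_members))
```

On this toy data the search selects the conjunction `x1 > 2.5 AND x2 <= 3.0`, which covers two of the eight rows and has a mean target well above the global mean. A global model fitted to all rows would average over this local pattern, whereas the subgroup description states it directly as an interpretable rule.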

Presenters

  • Bryan Goldsmith

    Chemical Engineering, University of Michigan

Authors

  • Bryan Goldsmith

    Chemical Engineering, University of Michigan

  • Mario Boley

    Max Planck Institute for Informatics

  • Christopher Sutton

    Theory Department, Fritz Haber Institute of the Max Planck Society; Chemistry, Duke University

  • Jilles Vreeken

    Max Planck Institute for Informatics

  • Matthias Scheffler

    Theory Department, Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, 14195 Berlin-Dahlem, Germany

  • Luca Ghiringhelli

    Theory Department, Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, 14195 Berlin-Dahlem, Germany