Predicting the superconducting transition temperature (T_c) of materials remains a major challenge in condensed matter physics due to the lack of a comprehensive and quantitative theory. We present a data-driven approach that combines chemistry-informed feature extraction with interpretable machine learning to predict T_c and classify superconducting materials. We develop a systematic featurization scheme that integrates structural and elemental information through graphlet histograms and symmetry vectors. Using experimentally validated structural data from the 3DSC database, we construct a curated, featurized dataset and design a new kernel to incorporate histogram features into Gaussian-process (GP) regression and classification. This framework yields an interpretable T_c predictor with an R^2 value of 0.93 and a superconductor classifier with quantified uncertainties. Feature-significance analysis further reveals that GP T_c predictor can achieve near-optimal performance only using four second-order graphlet features. In particular, we discovered a previously overlooked feature of electron affinity difference between neighboring atoms as a universally predictive descriptor. Our graphlet-histogram approach not only highlights bonding-related elemental descriptors as unexpectedly powerful predictors of superconductivity but also provides a broadly applicable framework for predictive modeling of diverse material properties.
*E.-A.K. and L.S. are supported by the NSF through the AI Research Institutes program Award No.~DMR-2433348 and by the grant OAC-2118310.Y.L. and E.-A.K. were supported in part by the MURI grant FA9550-21-1-0429. K.M. was supported by Eric and Wendy Schmidt AI in Science Postdoctoral Fellowship: a Schmidt Futures program.The computation was done using a high-powered computing cluster that was established through the support of the Gordon and Betty Moore Foundation’s EPiQS Initiative, Grant GBMF10436 to E.-A.K., and through the support of the MURI grant FA9550-21-1-0429.K.M., Y.L. and E.-A.K. are supported by the U.S. Department of Energy, Office of Science, Basic Energy Sciences, Materials Sciences and Engineering Division.O.L. and E.-A.K. are supported by the U.S. Department of Energy through Award Number DE-SC0023905.O.L. is also supported by a Bethe-KIC postdoctoral fellowship at Cornell University.J.G. was supported by NSF grants DBI-2400135 and IIS-2145644. N.M. was supported by an NSF Graduate Research Fellowship.