Assessing the current state of computational models for protein-protein interfaces

ORAL

Abstract



Computational prediction and design of protein-protein interfaces (PPI) is a difficult task that results in models with a wide range of quality. Thus, PPI scoring algorithms have been designed to classify computational models as high- or low-quality without direct knowledge of the experimentally determined structures. To assess PPI scoring algorithms, we scored models of heterodimeric proteins generated with a rigid-body docking algorithm (ZDOCK) and compared these scores to a measure of structural similarity (i.e. DockQ score) to the associated heterodimeric x-ray crystal structures. We generated hundreds of thousands of computational models, sampled uniformly over the DockQ score, to create a balanced dataset of redocked models. We scored this dataset using seven state-of-the-art PPI model scoring algorithms. We show that only a small fraction of PPI targets possesses strong correlations (ρ > 0.8) between the model scores and DockQ. We use three physical features to differentiate easy- and hard-to-score targets: interface flatness, fractional interface surface area, and number of interfacial contacts. We show that there are strong correlations between model and DockQ scores for PPI targets with large, intertwined interfaces, yet weak correlations for targets with small, flat interfaces. We create a support vector regression score using these physical features that matches or exceeds the performance of current state-of-the-art PPI scoring functions.

* NIH Training Grant No: 1T32GM145452

Presenters

  • Jake Sumner

    Yale University

Authors

  • Jake Sumner

    Yale University

  • Grace Meng

    Yale University

  • Naomi Brandt

    Yale University

  • Andrés Córdoba

    Duke University

  • Alex T Grigas

    Yale University

  • Lynne Regan

    University of Edinburgh

  • Corey S O'Hern

    Yale University