Computational HLA typing from repertoires

ORAL

Abstract

T cells recognize a wide range of pathogens using surface receptors that interact directly with peptides presented on major histocompatibility complexes (MHC) encoded by the HLA locus in humans. Understanding the association between T cell receptors (TCR) and HLA alleles is an important step towards predicting TCR-antigen specificity from sequences. Here we analyze the TCR alpha and beta repertoires of large cohorts of HLA-typed donors to systematically infer such associations, by looking for overrepresentation of TCRs in individuals with a common allele. We describe the sequence features of these HLA-associated TCRs. Immune repertoire sequencing has produced large numbers of datasets, however the HLA type of the corresponding donors is rarely available. Using our TCR-HLA associations, we trained a computational model to predict the HLA type of individuals from their TCR repertoire alone. We describe an iterative procedure to refine this model by using data from large cohorts of untyped individuals, by recursively typing them using the model itself. The resulting model shows good predictive performance, even for relatively rare HLA alleles.

* This work was supported by the Marie Sklodowska- Curie Actions H2020-MSCA-ITN-2017 program no 764698 from the European union, and by a European Research Council Consolidator grant no 724208 as well as la fondation pour la recherche sur le cancer (ARC)

Publication: Planned paper

Presenters

  • Maria Ruiz Ortega

    CNRS/ENS Paris

Authors

  • Maria Ruiz Ortega

    CNRS/ENS Paris

  • Thierry Mora

    Ecole Normale Superieure de Paris

  • Aleksandra Walczak

    Ecole Normales Superieure de Paris

  • Mikhail Pogorelyy

    St. Jude Children's Research Hospital

  • Anastasia Minervina

    St. Jude Children's Research Hospital

  • Paul Thomas

    St. Jude Children's Research Hospital