TY - JOUR
T1 - ZEPPI
T2 - Proteome-scale sequence-based evaluation of protein–protein interaction models
AU - Zhao, Haiqing
AU - Petrey, Donald
AU - Murray, Diana
AU - Honig, Barry
N1 - Publisher Copyright:
© 2024 the Author(s).
PY - 2024/5/21
Y1 - 2024/5/21
N2 - We introduce ZEPPI (Z-score Evaluation of Protein–Protein Interfaces), a framework to evaluate structural models of a complex based on sequence coevolution and conservation involving residues in protein–protein interfaces. The ZEPPI score is calculated by comparing metrics for an interface to those obtained from randomly chosen residues. Since contacting residues are defined by the structural model, this obviates the need to account for indirect interactions. Further, although ZEPPI relies on species-paired multiple sequence alignments, its focus on interfacial residues allows it to leverage quite shallow alignments. ZEPPI can be implemented on a proteome-wide scale and is applied here to millions of structural models of dimeric complexes in the Escherichia coli and human interactomes found in the PrePPI database. PrePPI’s scoring function is based primarily on the evaluation of protein–protein interfaces, and ZEPPI adds a new feature to this analysis through the incorporation of evolutionary information. ZEPPI performance is evaluated through applications to experimentally determined complexes and to decoys from the CASP-CAPRI experiment. As we discuss, the standard CAPRI scores used to evaluate docking models are based on model quality and not on the ability to give yes/no answers as to whether two proteins interact. ZEPPI is able to detect weak signals from PPI models that the CAPRI scores define as incorrect and, similarly, to identify potential PPIs defined as low confidence by the current PrePPI scoring function. A number of examples that illustrate how the combination of PrePPI and ZEPPI can yield functional hypotheses are provided.
AB - We introduce ZEPPI (Z-score Evaluation of Protein–Protein Interfaces), a framework to evaluate structural models of a complex based on sequence coevolution and conservation involving residues in protein–protein interfaces. The ZEPPI score is calculated by comparing metrics for an interface to those obtained from randomly chosen residues. Since contacting residues are defined by the structural model, this obviates the need to account for indirect interactions. Further, although ZEPPI relies on species-paired multiple sequence alignments, its focus on interfacial residues allows it to leverage quite shallow alignments. ZEPPI can be implemented on a proteome-wide scale and is applied here to millions of structural models of dimeric complexes in the Escherichia coli and human interactomes found in the PrePPI database. PrePPI’s scoring function is based primarily on the evaluation of protein–protein interfaces, and ZEPPI adds a new feature to this analysis through the incorporation of evolutionary information. ZEPPI performance is evaluated through applications to experimentally determined complexes and to decoys from the CASP-CAPRI experiment. As we discuss, the standard CAPRI scores used to evaluate docking models are based on model quality and not on the ability to give yes/no answers as to whether two proteins interact. ZEPPI is able to detect weak signals from PPI models that the CAPRI scores define as incorrect and, similarly, to identify potential PPIs defined as low confidence by the current PrePPI scoring function. A number of examples that illustrate how the combination of PrePPI and ZEPPI can yield functional hypotheses are provided.
UR - http://www.scopus.com/inward/record.url?scp=85193206787&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85193206787&partnerID=8YFLogxK
U2 - 10.1073/pnas.2400260121
DO - 10.1073/pnas.2400260121
M3 - Article
C2 - 38743624
AN - SCOPUS:85193206787
SN - 0027-8424
VL - 121
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 21
M1 - e2400260121
ER -