Sequence specificity, statistical potentials, and three-dimensional structure prediction with self-correcting distance geometry calculations of β-sheet formation in proteins

Hongyao Zhu, Werner Braun

Research output: Contribution to journalArticle

57 Citations (Scopus)

Abstract

A statistical analysis of a representative data set of 169 known protein structures was used to analyze the specificity of residue interactions between spatial neighboring strands in β-sheets. Pairwise potentials were derived from the frequency of residue pairs in nearest contact, second nearest and third nearest contacts across neighboring β-strands compared to the expected frequency of residue pairs in a random model. A pseudo-energy function based on these statistical pairwise potentials recognized native β- sheets among possible alternative pairings. The native pairing was found within the three lowest energies in 73% of the cases in the training data set and in 63% of β-sheets in a test data set of 67 proteins, which were not part of the training set. The energy function was also used to detect tripeptides, which occur frequently in β-sheets of native proteins. The majority of native partners of tripeptides were distributed in a low energy range. Self-correcting distance geometry (SECODG) calculations using distance constraints sets derived from possible low energy pairing of β-strands uniquely identified the native pairing of the β-sheet in pancreatic trypsin inhibitor (BPTI). These results will be useful for predicting the structure of proteins from their amino acid sequence as well as for the design of proteins containing β-sheets.

Original languageEnglish (US)
Pages (from-to)326-342
Number of pages17
JournalProtein Science
Volume8
Issue number2
StatePublished - 1999

Fingerprint

Geometry
Proteins
Trypsin Inhibitors
Amino Acid Sequence
Statistical methods
Amino Acids
Datasets

Keywords

  • β-strand alignment recognition
  • β-strand pairing
  • Self-correcting distance geometry calculations
  • Statistical potentials
  • Tertiary structure

ASJC Scopus subject areas

  • Biochemistry

Cite this

@article{ee2457ca1c43439ab444339882f6ad24,
title = "Sequence specificity, statistical potentials, and three-dimensional structure prediction with self-correcting distance geometry calculations of β-sheet formation in proteins",
abstract = "A statistical analysis of a representative data set of 169 known protein structures was used to analyze the specificity of residue interactions between spatial neighboring strands in β-sheets. Pairwise potentials were derived from the frequency of residue pairs in nearest contact, second nearest and third nearest contacts across neighboring β-strands compared to the expected frequency of residue pairs in a random model. A pseudo-energy function based on these statistical pairwise potentials recognized native β- sheets among possible alternative pairings. The native pairing was found within the three lowest energies in 73{\%} of the cases in the training data set and in 63{\%} of β-sheets in a test data set of 67 proteins, which were not part of the training set. The energy function was also used to detect tripeptides, which occur frequently in β-sheets of native proteins. The majority of native partners of tripeptides were distributed in a low energy range. Self-correcting distance geometry (SECODG) calculations using distance constraints sets derived from possible low energy pairing of β-strands uniquely identified the native pairing of the β-sheet in pancreatic trypsin inhibitor (BPTI). These results will be useful for predicting the structure of proteins from their amino acid sequence as well as for the design of proteins containing β-sheets.",
keywords = "β-strand alignment recognition, β-strand pairing, Self-correcting distance geometry calculations, Statistical potentials, Tertiary structure",
author = "Hongyao Zhu and Werner Braun",
year = "1999",
language = "English (US)",
volume = "8",
pages = "326--342",
journal = "Protein Science",
issn = "0961-8368",
publisher = "Cold Spring Harbor Laboratory Press",
number = "2",

}

TY - JOUR

T1 - Sequence specificity, statistical potentials, and three-dimensional structure prediction with self-correcting distance geometry calculations of β-sheet formation in proteins

AU - Zhu, Hongyao

AU - Braun, Werner

PY - 1999

Y1 - 1999

N2 - A statistical analysis of a representative data set of 169 known protein structures was used to analyze the specificity of residue interactions between spatial neighboring strands in β-sheets. Pairwise potentials were derived from the frequency of residue pairs in nearest contact, second nearest and third nearest contacts across neighboring β-strands compared to the expected frequency of residue pairs in a random model. A pseudo-energy function based on these statistical pairwise potentials recognized native β- sheets among possible alternative pairings. The native pairing was found within the three lowest energies in 73% of the cases in the training data set and in 63% of β-sheets in a test data set of 67 proteins, which were not part of the training set. The energy function was also used to detect tripeptides, which occur frequently in β-sheets of native proteins. The majority of native partners of tripeptides were distributed in a low energy range. Self-correcting distance geometry (SECODG) calculations using distance constraints sets derived from possible low energy pairing of β-strands uniquely identified the native pairing of the β-sheet in pancreatic trypsin inhibitor (BPTI). These results will be useful for predicting the structure of proteins from their amino acid sequence as well as for the design of proteins containing β-sheets.

AB - A statistical analysis of a representative data set of 169 known protein structures was used to analyze the specificity of residue interactions between spatial neighboring strands in β-sheets. Pairwise potentials were derived from the frequency of residue pairs in nearest contact, second nearest and third nearest contacts across neighboring β-strands compared to the expected frequency of residue pairs in a random model. A pseudo-energy function based on these statistical pairwise potentials recognized native β- sheets among possible alternative pairings. The native pairing was found within the three lowest energies in 73% of the cases in the training data set and in 63% of β-sheets in a test data set of 67 proteins, which were not part of the training set. The energy function was also used to detect tripeptides, which occur frequently in β-sheets of native proteins. The majority of native partners of tripeptides were distributed in a low energy range. Self-correcting distance geometry (SECODG) calculations using distance constraints sets derived from possible low energy pairing of β-strands uniquely identified the native pairing of the β-sheet in pancreatic trypsin inhibitor (BPTI). These results will be useful for predicting the structure of proteins from their amino acid sequence as well as for the design of proteins containing β-sheets.

KW - β-strand alignment recognition

KW - β-strand pairing

KW - Self-correcting distance geometry calculations

KW - Statistical potentials

KW - Tertiary structure

UR - http://www.scopus.com/inward/record.url?scp=0033003903&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0033003903&partnerID=8YFLogxK

M3 - Article

VL - 8

SP - 326

EP - 342

JO - Protein Science

JF - Protein Science

SN - 0961-8368

IS - 2

ER -