Determining the three-dimensional fold of a protein from approximate constraints

A simulation study

Kizhake V. Soman, Werner Braun

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

We propose a new approach for calculating the three-dimensional (3D) structure of a protein from distance and dihedral angle constraints derived from experimental data. We suggest that such constraints can be obtained from experiments such as tritium planigraphy, chemical or enzymatic cleavage of the polypeptide chain, paramagnetic perturbation of nuclear magnetic resonance (NMR) spectra, measurement of hydrogen-exchange rates, mutational studies, mass spectrometry, and electron paramagnetic resonance. These can be supplemented with constraints from theoretical prediction of secondary structures and of buried/exposed residues. We report here distance geometry calculations to generate the structures of a test protein Staphyiococcal nuclease (STN), and the HIV-1 rev protein (REV) of unknown structure. From the available 3D atomic coordinates of STN, we set up simulated data sets consisting of varying number and quality of constraints, and used our group's Self Correcting Distance Geometry (SECODG) program DIAMOD to generate structures. We could generate the correct tertiary fold from qualitative (approximate) as well as precise distance constraints. The root mean square deviations of backbone atoms from the native structure were in the range of 2.0 Å to 8.3 Å, depending on the number of constraints used. We could also generate the correct fold starting from a subset of atoms that are on the surface and those that are buried. When we used data sets containing a small fraction of incorrect distance constraints, the SECODG technique was able to detect and correct them. In the case of REV, we used a combination of constraints obtained from rnutagenic data and structure predictions. DIAMOD generated helix-loop-helix models, which, after four self-correcting cycles, populated one family exclusively. The features of the energy-minimized model are consistent with the available data on REV-RNA interaction. Our method could thus be an attractive alternative for calculating protein 3D structures, especially in cases where the traditional methods of X-ray crystallography and multidimensional NMR spectroscopy have been unsuccessful.

Original languageEnglish (US)
Pages (from-to)283-304
Number of pages22
JournalCell Biochemistry and Biophysics
Volume34
Issue number3
DOIs
StatePublished - 2001

Fingerprint

Geometry
Magnetic Resonance Spectroscopy
Atoms
Proteins
Tritium
X ray crystallography
X Ray Crystallography
Electron Spin Resonance Spectroscopy
Dihedral angle
Nuclear magnetic resonance spectroscopy
Mass spectrometry
Paramagnetic resonance
Hydrogen
Mass Spectrometry
Nuclear magnetic resonance
RNA
Peptides
Experiments
Datasets
Human Immunodeficiency Virus-1 rev protein

Keywords

  • Ab initio structure prediction
  • Distance geometry
  • HIV-1 rev protein
  • Protein structure
  • Staphylococcal nuclease

ASJC Scopus subject areas

  • Biophysics
  • Cell Biology
  • Clinical Biochemistry

Cite this

Determining the three-dimensional fold of a protein from approximate constraints : A simulation study. / Soman, Kizhake V.; Braun, Werner.

In: Cell Biochemistry and Biophysics, Vol. 34, No. 3, 2001, p. 283-304.

Research output: Contribution to journalArticle

@article{6bd03b9395e3404895adfc3e8ee97da5,
title = "Determining the three-dimensional fold of a protein from approximate constraints: A simulation study",
abstract = "We propose a new approach for calculating the three-dimensional (3D) structure of a protein from distance and dihedral angle constraints derived from experimental data. We suggest that such constraints can be obtained from experiments such as tritium planigraphy, chemical or enzymatic cleavage of the polypeptide chain, paramagnetic perturbation of nuclear magnetic resonance (NMR) spectra, measurement of hydrogen-exchange rates, mutational studies, mass spectrometry, and electron paramagnetic resonance. These can be supplemented with constraints from theoretical prediction of secondary structures and of buried/exposed residues. We report here distance geometry calculations to generate the structures of a test protein Staphyiococcal nuclease (STN), and the HIV-1 rev protein (REV) of unknown structure. From the available 3D atomic coordinates of STN, we set up simulated data sets consisting of varying number and quality of constraints, and used our group's Self Correcting Distance Geometry (SECODG) program DIAMOD to generate structures. We could generate the correct tertiary fold from qualitative (approximate) as well as precise distance constraints. The root mean square deviations of backbone atoms from the native structure were in the range of 2.0 {\AA} to 8.3 {\AA}, depending on the number of constraints used. We could also generate the correct fold starting from a subset of atoms that are on the surface and those that are buried. When we used data sets containing a small fraction of incorrect distance constraints, the SECODG technique was able to detect and correct them. In the case of REV, we used a combination of constraints obtained from rnutagenic data and structure predictions. DIAMOD generated helix-loop-helix models, which, after four self-correcting cycles, populated one family exclusively. The features of the energy-minimized model are consistent with the available data on REV-RNA interaction. Our method could thus be an attractive alternative for calculating protein 3D structures, especially in cases where the traditional methods of X-ray crystallography and multidimensional NMR spectroscopy have been unsuccessful.",
keywords = "Ab initio structure prediction, Distance geometry, HIV-1 rev protein, Protein structure, Staphylococcal nuclease",
author = "Soman, {Kizhake V.} and Werner Braun",
year = "2001",
doi = "10.1385/CBB:34:3:283",
language = "English (US)",
volume = "34",
pages = "283--304",
journal = "Cell Biochemistry and Biophysics",
issn = "1085-9195",
publisher = "Humana Press",
number = "3",

}

TY - JOUR

T1 - Determining the three-dimensional fold of a protein from approximate constraints

T2 - A simulation study

AU - Soman, Kizhake V.

AU - Braun, Werner

PY - 2001

Y1 - 2001

N2 - We propose a new approach for calculating the three-dimensional (3D) structure of a protein from distance and dihedral angle constraints derived from experimental data. We suggest that such constraints can be obtained from experiments such as tritium planigraphy, chemical or enzymatic cleavage of the polypeptide chain, paramagnetic perturbation of nuclear magnetic resonance (NMR) spectra, measurement of hydrogen-exchange rates, mutational studies, mass spectrometry, and electron paramagnetic resonance. These can be supplemented with constraints from theoretical prediction of secondary structures and of buried/exposed residues. We report here distance geometry calculations to generate the structures of a test protein Staphyiococcal nuclease (STN), and the HIV-1 rev protein (REV) of unknown structure. From the available 3D atomic coordinates of STN, we set up simulated data sets consisting of varying number and quality of constraints, and used our group's Self Correcting Distance Geometry (SECODG) program DIAMOD to generate structures. We could generate the correct tertiary fold from qualitative (approximate) as well as precise distance constraints. The root mean square deviations of backbone atoms from the native structure were in the range of 2.0 Å to 8.3 Å, depending on the number of constraints used. We could also generate the correct fold starting from a subset of atoms that are on the surface and those that are buried. When we used data sets containing a small fraction of incorrect distance constraints, the SECODG technique was able to detect and correct them. In the case of REV, we used a combination of constraints obtained from rnutagenic data and structure predictions. DIAMOD generated helix-loop-helix models, which, after four self-correcting cycles, populated one family exclusively. The features of the energy-minimized model are consistent with the available data on REV-RNA interaction. Our method could thus be an attractive alternative for calculating protein 3D structures, especially in cases where the traditional methods of X-ray crystallography and multidimensional NMR spectroscopy have been unsuccessful.

AB - We propose a new approach for calculating the three-dimensional (3D) structure of a protein from distance and dihedral angle constraints derived from experimental data. We suggest that such constraints can be obtained from experiments such as tritium planigraphy, chemical or enzymatic cleavage of the polypeptide chain, paramagnetic perturbation of nuclear magnetic resonance (NMR) spectra, measurement of hydrogen-exchange rates, mutational studies, mass spectrometry, and electron paramagnetic resonance. These can be supplemented with constraints from theoretical prediction of secondary structures and of buried/exposed residues. We report here distance geometry calculations to generate the structures of a test protein Staphyiococcal nuclease (STN), and the HIV-1 rev protein (REV) of unknown structure. From the available 3D atomic coordinates of STN, we set up simulated data sets consisting of varying number and quality of constraints, and used our group's Self Correcting Distance Geometry (SECODG) program DIAMOD to generate structures. We could generate the correct tertiary fold from qualitative (approximate) as well as precise distance constraints. The root mean square deviations of backbone atoms from the native structure were in the range of 2.0 Å to 8.3 Å, depending on the number of constraints used. We could also generate the correct fold starting from a subset of atoms that are on the surface and those that are buried. When we used data sets containing a small fraction of incorrect distance constraints, the SECODG technique was able to detect and correct them. In the case of REV, we used a combination of constraints obtained from rnutagenic data and structure predictions. DIAMOD generated helix-loop-helix models, which, after four self-correcting cycles, populated one family exclusively. The features of the energy-minimized model are consistent with the available data on REV-RNA interaction. Our method could thus be an attractive alternative for calculating protein 3D structures, especially in cases where the traditional methods of X-ray crystallography and multidimensional NMR spectroscopy have been unsuccessful.

KW - Ab initio structure prediction

KW - Distance geometry

KW - HIV-1 rev protein

KW - Protein structure

KW - Staphylococcal nuclease

UR - http://www.scopus.com/inward/record.url?scp=0035752810&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0035752810&partnerID=8YFLogxK

U2 - 10.1385/CBB:34:3:283

DO - 10.1385/CBB:34:3:283

M3 - Article

VL - 34

SP - 283

EP - 304

JO - Cell Biochemistry and Biophysics

JF - Cell Biochemistry and Biophysics

SN - 1085-9195

IS - 3

ER -