Abstract
An amino acid sequence, in the context of the solvent environment, contains all of the thermodynamic information necessary to encode a three-dimensional protein structure. To investigate the relationship between an amino acid sequence and its corresponding protein fold, a database of thermodynamic stability information was assembled that spanned 2951 residues from 44 nonhomologous proteins. This information was obtained using the COREX algorithm, which computes an ensemble-based description of the native state of a protein. It was observed that amino acid types partitioned unequally into high, medium, and low thermodynamic stability environments. Furthermore, these distributions were reproducible and were significantly different than those expected from random partitioning. To assess the structural importance of the distributions, simple fold-recognition experiments were performed based on a 3D-1D scoring matrix containing only COREX residue stability information. This procedure was able to recover amino acid sequences corresponding to correct target structures more effectively than scoring matrices derived from randomized data. High-scoring sequences were often aligned correctly with their corresponding target profiles, suggesting that calculated thermodynamic stability profiles have the potential to encode sequence information. As a control, identical fold-recognition experiments were performed on the same database of proteins using DSSP secondary structure information in the scoring matrix, instead of COREX residue stability information. The comparable performance of both approaches suggested that COREX residue stability information and secondary structure information could be of equivalent utility in more sophisticated fold-recognition techniques. The results of this work are a consequence of the idea that amino acid sequences fold not into single, rigidly stable structures but rather into thermodynamic ensembles best represented by a time-averaged structure.
Original language | English (US) |
---|---|
Pages (from-to) | 1032-1045 |
Number of pages | 14 |
Journal | Protein Science |
Volume | 10 |
Issue number | 5 |
DOIs | |
State | Published - 2001 |
Externally published | Yes |
Keywords
- Native state ensemble
- Protein stability
- Protein structure prediction
- Residue thermodynamics
- Threading and fold-recognition
ASJC Scopus subject areas
- Biochemistry
- Molecular Biology