New quantitative descriptors of amino acids based on multidimensional scaling of a large number of physical-chemical properties

Mathura Venkatarajan, Werner Braun

Research output: Contribution to journalArticle

111 Citations (Scopus)

Abstract

We derive new quantitative descriptors for the 20 naturally occurring amino acids based on multidimensional scaling of 237 physical-chemical properties. We show that a five-dimensional property space can be constructed such that the amino acids are in a similar spatial distribution to that in the original high-dimensional property space. Properties that correlate well with the five major components were hydrophobicity, size, preferences for amino acids to occur in α-helices, number of degenerate triplet codons and the frequency of occurrence of amino acid residues in β-strands. Distances computed for pairs of amino acids in the five-dimensional property space are highly correlated with corresponding scores from similarity matrices derived from sequence and 3D structure comparison. We used the five-dimensional property distances to cluster the amino acids in groups depending on a cutoff distance. These groups define a reduced amino acid alphabet for protein folding studies. Our descriptors should provide a quantitative means to identify property motifs in sequences of protein families. Electronic supplementary material to this paper can be obtained by using the Springer Link server located at http://dx.doi.org/10.1007/s00894-001-0058-5.

Original languageEnglish (US)
Pages (from-to)445-453
Number of pages9
JournalJournal of Molecular Modeling
Volume7
Issue number12
DOIs
StatePublished - 2001

Fingerprint

chemical properties
Chemical properties
amino acids
Amino acids
scaling
Amino Acids
alphabets
proteins
Protein folding
Protein Folding
Hydrophobicity
hydrophobicity
Hydrophobic and Hydrophilic Interactions
Codon
strands
helices
folding
Spatial distribution
spatial distribution
cut-off

Keywords

  • Amino acid
  • BLOSUM
  • Cluster analysis
  • Multidimensional scaling
  • PAM
  • Physical-chemical properties
  • Substitution matrices

ASJC Scopus subject areas

  • Chemistry(all)
  • Biochemistry, Genetics and Molecular Biology(all)
  • Biochemistry
  • Biophysics
  • Computer Science Applications
  • Computational Theory and Mathematics

Cite this

New quantitative descriptors of amino acids based on multidimensional scaling of a large number of physical-chemical properties. / Venkatarajan, Mathura; Braun, Werner.

In: Journal of Molecular Modeling, Vol. 7, No. 12, 2001, p. 445-453.

Research output: Contribution to journalArticle

@article{740ac44f50064f4da7e8abad93e9c677,
title = "New quantitative descriptors of amino acids based on multidimensional scaling of a large number of physical-chemical properties",
abstract = "We derive new quantitative descriptors for the 20 naturally occurring amino acids based on multidimensional scaling of 237 physical-chemical properties. We show that a five-dimensional property space can be constructed such that the amino acids are in a similar spatial distribution to that in the original high-dimensional property space. Properties that correlate well with the five major components were hydrophobicity, size, preferences for amino acids to occur in α-helices, number of degenerate triplet codons and the frequency of occurrence of amino acid residues in β-strands. Distances computed for pairs of amino acids in the five-dimensional property space are highly correlated with corresponding scores from similarity matrices derived from sequence and 3D structure comparison. We used the five-dimensional property distances to cluster the amino acids in groups depending on a cutoff distance. These groups define a reduced amino acid alphabet for protein folding studies. Our descriptors should provide a quantitative means to identify property motifs in sequences of protein families. Electronic supplementary material to this paper can be obtained by using the Springer Link server located at http://dx.doi.org/10.1007/s00894-001-0058-5.",
keywords = "Amino acid, BLOSUM, Cluster analysis, Multidimensional scaling, PAM, Physical-chemical properties, Substitution matrices",
author = "Mathura Venkatarajan and Werner Braun",
year = "2001",
doi = "10.1007/s00894-001-0058-5",
language = "English (US)",
volume = "7",
pages = "445--453",
journal = "Journal of Molecular Modeling",
issn = "1610-2940",
publisher = "Springer Verlag",
number = "12",

}

TY - JOUR

T1 - New quantitative descriptors of amino acids based on multidimensional scaling of a large number of physical-chemical properties

AU - Venkatarajan, Mathura

AU - Braun, Werner

PY - 2001

Y1 - 2001

N2 - We derive new quantitative descriptors for the 20 naturally occurring amino acids based on multidimensional scaling of 237 physical-chemical properties. We show that a five-dimensional property space can be constructed such that the amino acids are in a similar spatial distribution to that in the original high-dimensional property space. Properties that correlate well with the five major components were hydrophobicity, size, preferences for amino acids to occur in α-helices, number of degenerate triplet codons and the frequency of occurrence of amino acid residues in β-strands. Distances computed for pairs of amino acids in the five-dimensional property space are highly correlated with corresponding scores from similarity matrices derived from sequence and 3D structure comparison. We used the five-dimensional property distances to cluster the amino acids in groups depending on a cutoff distance. These groups define a reduced amino acid alphabet for protein folding studies. Our descriptors should provide a quantitative means to identify property motifs in sequences of protein families. Electronic supplementary material to this paper can be obtained by using the Springer Link server located at http://dx.doi.org/10.1007/s00894-001-0058-5.

AB - We derive new quantitative descriptors for the 20 naturally occurring amino acids based on multidimensional scaling of 237 physical-chemical properties. We show that a five-dimensional property space can be constructed such that the amino acids are in a similar spatial distribution to that in the original high-dimensional property space. Properties that correlate well with the five major components were hydrophobicity, size, preferences for amino acids to occur in α-helices, number of degenerate triplet codons and the frequency of occurrence of amino acid residues in β-strands. Distances computed for pairs of amino acids in the five-dimensional property space are highly correlated with corresponding scores from similarity matrices derived from sequence and 3D structure comparison. We used the five-dimensional property distances to cluster the amino acids in groups depending on a cutoff distance. These groups define a reduced amino acid alphabet for protein folding studies. Our descriptors should provide a quantitative means to identify property motifs in sequences of protein families. Electronic supplementary material to this paper can be obtained by using the Springer Link server located at http://dx.doi.org/10.1007/s00894-001-0058-5.

KW - Amino acid

KW - BLOSUM

KW - Cluster analysis

KW - Multidimensional scaling

KW - PAM

KW - Physical-chemical properties

KW - Substitution matrices

UR - http://www.scopus.com/inward/record.url?scp=0035789124&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0035789124&partnerID=8YFLogxK

U2 - 10.1007/s00894-001-0058-5

DO - 10.1007/s00894-001-0058-5

M3 - Article

VL - 7

SP - 445

EP - 453

JO - Journal of Molecular Modeling

JF - Journal of Molecular Modeling

SN - 1610-2940

IS - 12

ER -