Molego-based definition of the architecture and specificity of metal-binding sites

Catherine H. Schein, Bin Zhou, Numan Oezguen, Venkatarajan S. Mathura, Werner Braun

Research output: Contribution to journalArticle

23 Citations (Scopus)

Abstract

Decomposing proteins into "molegos," building blocks that are conserved in sequence and 3D-structure, can identify functional elements. To demonstrate the specificity of the decomposition method, the PCPMer program suite was used to numerically define physical chemical property motifs corresponding to the molegos that make up the metal-containing active sites of three distinct enzyme families, from the dimetallic phosphatases, DNase 1 related nucleases/phosphatases, and dioxygenases. All three superfamilies bind metal ions in a β-strand core region but differ in the number and type of ions needed for activity. The motifs were then used to automatically identify proteins in the ASTRAL40 database that contained similar motifs. The proteins with the highest PCPMer score in the database were primarily metal-binding enzymes that were related in function to those in the alignment used to generate the PCPMer motif lists. The proteins that contained motifs similar to the dioxygenases differed from those found with PCP-motifs for phosphatases and nucleases. Relatively few metal-binding enzymes were detected when the search was done with PCP-motifs defined for interleukin-1 related proteins, which have a O-strand core but do not bind metal ions. While the box architecture was constant in each superfamily, the specificity for the metal ion preferred for enzymatic activity is determined by the pattern of carbonyl, hydroxyl or imadazole groups in key positions in the molegos. These results have implications for the design of metal-binding enzymes, and illustrate the ability of the PCPMer approach to distinguish, at the sequence level, structural and functional elements.

Original languageEnglish (US)
Pages (from-to)200-210
Number of pages11
JournalProteins: Structure, Function and Genetics
Volume58
Issue number1
DOIs
StatePublished - Jan 1 2005

Fingerprint

Metals
Binding Sites
Phosphoric Monoester Hydrolases
Metal ions
Dioxygenases
Enzymes
Proteins
Ions
Deoxyribonucleases
Databases
Interleukin-1
Hydroxyl Radical
Chemical properties
Amino Acid Motifs
Conserved Sequence
Decomposition
Catalytic Domain

Keywords

  • Dimetallic phosphatases
  • Dioxygenases
  • DNase 1 superfamily
  • Identifying functional homologues
  • Interleukin-1 structural family
  • MASIA
  • Metal ion catalysis
  • PCPMer
  • Protein design
  • Total sequence decomposition

ASJC Scopus subject areas

  • Genetics
  • Structural Biology
  • Biochemistry

Cite this

Molego-based definition of the architecture and specificity of metal-binding sites. / Schein, Catherine H.; Zhou, Bin; Oezguen, Numan; Mathura, Venkatarajan S.; Braun, Werner.

In: Proteins: Structure, Function and Genetics, Vol. 58, No. 1, 01.01.2005, p. 200-210.

Research output: Contribution to journalArticle

Schein, Catherine H. ; Zhou, Bin ; Oezguen, Numan ; Mathura, Venkatarajan S. ; Braun, Werner. / Molego-based definition of the architecture and specificity of metal-binding sites. In: Proteins: Structure, Function and Genetics. 2005 ; Vol. 58, No. 1. pp. 200-210.
@article{1948651adf3046fba3a6b4f3c3369ca7,
title = "Molego-based definition of the architecture and specificity of metal-binding sites",
abstract = "Decomposing proteins into {"}molegos,{"} building blocks that are conserved in sequence and 3D-structure, can identify functional elements. To demonstrate the specificity of the decomposition method, the PCPMer program suite was used to numerically define physical chemical property motifs corresponding to the molegos that make up the metal-containing active sites of three distinct enzyme families, from the dimetallic phosphatases, DNase 1 related nucleases/phosphatases, and dioxygenases. All three superfamilies bind metal ions in a β-strand core region but differ in the number and type of ions needed for activity. The motifs were then used to automatically identify proteins in the ASTRAL40 database that contained similar motifs. The proteins with the highest PCPMer score in the database were primarily metal-binding enzymes that were related in function to those in the alignment used to generate the PCPMer motif lists. The proteins that contained motifs similar to the dioxygenases differed from those found with PCP-motifs for phosphatases and nucleases. Relatively few metal-binding enzymes were detected when the search was done with PCP-motifs defined for interleukin-1 related proteins, which have a O-strand core but do not bind metal ions. While the box architecture was constant in each superfamily, the specificity for the metal ion preferred for enzymatic activity is determined by the pattern of carbonyl, hydroxyl or imadazole groups in key positions in the molegos. These results have implications for the design of metal-binding enzymes, and illustrate the ability of the PCPMer approach to distinguish, at the sequence level, structural and functional elements.",
keywords = "Dimetallic phosphatases, Dioxygenases, DNase 1 superfamily, Identifying functional homologues, Interleukin-1 structural family, MASIA, Metal ion catalysis, PCPMer, Protein design, Total sequence decomposition",
author = "Schein, {Catherine H.} and Bin Zhou and Numan Oezguen and Mathura, {Venkatarajan S.} and Werner Braun",
year = "2005",
month = "1",
day = "1",
doi = "10.1002/prot.20253",
language = "English (US)",
volume = "58",
pages = "200--210",
journal = "Proteins: Structure, Function and Bioinformatics",
issn = "0887-3585",
publisher = "Wiley-Liss Inc.",
number = "1",

}

TY - JOUR

T1 - Molego-based definition of the architecture and specificity of metal-binding sites

AU - Schein, Catherine H.

AU - Zhou, Bin

AU - Oezguen, Numan

AU - Mathura, Venkatarajan S.

AU - Braun, Werner

PY - 2005/1/1

Y1 - 2005/1/1

N2 - Decomposing proteins into "molegos," building blocks that are conserved in sequence and 3D-structure, can identify functional elements. To demonstrate the specificity of the decomposition method, the PCPMer program suite was used to numerically define physical chemical property motifs corresponding to the molegos that make up the metal-containing active sites of three distinct enzyme families, from the dimetallic phosphatases, DNase 1 related nucleases/phosphatases, and dioxygenases. All three superfamilies bind metal ions in a β-strand core region but differ in the number and type of ions needed for activity. The motifs were then used to automatically identify proteins in the ASTRAL40 database that contained similar motifs. The proteins with the highest PCPMer score in the database were primarily metal-binding enzymes that were related in function to those in the alignment used to generate the PCPMer motif lists. The proteins that contained motifs similar to the dioxygenases differed from those found with PCP-motifs for phosphatases and nucleases. Relatively few metal-binding enzymes were detected when the search was done with PCP-motifs defined for interleukin-1 related proteins, which have a O-strand core but do not bind metal ions. While the box architecture was constant in each superfamily, the specificity for the metal ion preferred for enzymatic activity is determined by the pattern of carbonyl, hydroxyl or imadazole groups in key positions in the molegos. These results have implications for the design of metal-binding enzymes, and illustrate the ability of the PCPMer approach to distinguish, at the sequence level, structural and functional elements.

AB - Decomposing proteins into "molegos," building blocks that are conserved in sequence and 3D-structure, can identify functional elements. To demonstrate the specificity of the decomposition method, the PCPMer program suite was used to numerically define physical chemical property motifs corresponding to the molegos that make up the metal-containing active sites of three distinct enzyme families, from the dimetallic phosphatases, DNase 1 related nucleases/phosphatases, and dioxygenases. All three superfamilies bind metal ions in a β-strand core region but differ in the number and type of ions needed for activity. The motifs were then used to automatically identify proteins in the ASTRAL40 database that contained similar motifs. The proteins with the highest PCPMer score in the database were primarily metal-binding enzymes that were related in function to those in the alignment used to generate the PCPMer motif lists. The proteins that contained motifs similar to the dioxygenases differed from those found with PCP-motifs for phosphatases and nucleases. Relatively few metal-binding enzymes were detected when the search was done with PCP-motifs defined for interleukin-1 related proteins, which have a O-strand core but do not bind metal ions. While the box architecture was constant in each superfamily, the specificity for the metal ion preferred for enzymatic activity is determined by the pattern of carbonyl, hydroxyl or imadazole groups in key positions in the molegos. These results have implications for the design of metal-binding enzymes, and illustrate the ability of the PCPMer approach to distinguish, at the sequence level, structural and functional elements.

KW - Dimetallic phosphatases

KW - Dioxygenases

KW - DNase 1 superfamily

KW - Identifying functional homologues

KW - Interleukin-1 structural family

KW - MASIA

KW - Metal ion catalysis

KW - PCPMer

KW - Protein design

KW - Total sequence decomposition

UR - http://www.scopus.com/inward/record.url?scp=10844290876&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=10844290876&partnerID=8YFLogxK

U2 - 10.1002/prot.20253

DO - 10.1002/prot.20253

M3 - Article

VL - 58

SP - 200

EP - 210

JO - Proteins: Structure, Function and Bioinformatics

JF - Proteins: Structure, Function and Bioinformatics

SN - 0887-3585

IS - 1

ER -