Characteristic motifs for families of allergenic proteins

Ovidiu Ivanciuc, Tzintzuni Garcia, Miguel Torres, Catherine H. Schein, Werner Braun

Research output: Contribution to journalArticle

46 Citations (Scopus)

Abstract

The identification of potential allergenic proteins is usually done by scanning a database of allergenic proteins and locating known allergens with a high sequence similarity. However, there is no universally accepted cut-off value for sequence similarity to indicate potential IgE cross-reactivity. Further, overall sequence similarity may be less important than discrete areas of similarity in proteins with homologous structure. To identify such areas, we first classified all allergens and their subdomains in the Structural Database of Allergenic Proteins (SDAP, http://fermi.utmb.edu/SDAP/) to their closest protein families as defined in Pfam, and identified conserved physicochemical property motifs characteristic of each group of sequences. Allergens populate only a small subset of all known Pfam families, as all allergenic proteins in SDAP could be grouped to only 130 (of 9318 total) Pfams, and 31 families contain more than four allergens. Conserved physicochemical property motifs for the aligned sequences of the most populated Pfam families were identified with the PCPMer program suite and catalogued in the webserver MotifMate (http://born.utmb.edu/motifmate/summary.php). We also determined specific motifs for allergenic members of a family that could distinguish them from non-allergenic ones. These allergen specific motifs should be most useful in database searches for potential allergens. We found that sequence motifs unique to the allergens in three families (seed storage proteins, Bet v 1, and tropomyosin) overlap with known IgE epitopes, thus providing evidence that our motif based approach can be used to assess the potential allergenicity of novel proteins.

Original languageEnglish (US)
Pages (from-to)559-568
Number of pages10
JournalMolecular Immunology
Volume46
Issue number4
DOIs
StatePublished - Feb 2009

Fingerprint

Allergens
Proteins
Protein Databases
Immunoglobulin E
Seed Storage Proteins
Tropomyosin
Epitopes
Databases

Keywords

  • Allergen classification
  • Allergen motif
  • Allergy
  • Cross-reactivity

ASJC Scopus subject areas

  • Molecular Biology
  • Immunology

Cite this

Characteristic motifs for families of allergenic proteins. / Ivanciuc, Ovidiu; Garcia, Tzintzuni; Torres, Miguel; Schein, Catherine H.; Braun, Werner.

In: Molecular Immunology, Vol. 46, No. 4, 02.2009, p. 559-568.

Research output: Contribution to journalArticle

Ivanciuc, Ovidiu ; Garcia, Tzintzuni ; Torres, Miguel ; Schein, Catherine H. ; Braun, Werner. / Characteristic motifs for families of allergenic proteins. In: Molecular Immunology. 2009 ; Vol. 46, No. 4. pp. 559-568.
@article{fabdc1faf52146d49a1c4579b6f87368,
title = "Characteristic motifs for families of allergenic proteins",
abstract = "The identification of potential allergenic proteins is usually done by scanning a database of allergenic proteins and locating known allergens with a high sequence similarity. However, there is no universally accepted cut-off value for sequence similarity to indicate potential IgE cross-reactivity. Further, overall sequence similarity may be less important than discrete areas of similarity in proteins with homologous structure. To identify such areas, we first classified all allergens and their subdomains in the Structural Database of Allergenic Proteins (SDAP, http://fermi.utmb.edu/SDAP/) to their closest protein families as defined in Pfam, and identified conserved physicochemical property motifs characteristic of each group of sequences. Allergens populate only a small subset of all known Pfam families, as all allergenic proteins in SDAP could be grouped to only 130 (of 9318 total) Pfams, and 31 families contain more than four allergens. Conserved physicochemical property motifs for the aligned sequences of the most populated Pfam families were identified with the PCPMer program suite and catalogued in the webserver MotifMate (http://born.utmb.edu/motifmate/summary.php). We also determined specific motifs for allergenic members of a family that could distinguish them from non-allergenic ones. These allergen specific motifs should be most useful in database searches for potential allergens. We found that sequence motifs unique to the allergens in three families (seed storage proteins, Bet v 1, and tropomyosin) overlap with known IgE epitopes, thus providing evidence that our motif based approach can be used to assess the potential allergenicity of novel proteins.",
keywords = "Allergen classification, Allergen motif, Allergy, Cross-reactivity",
author = "Ovidiu Ivanciuc and Tzintzuni Garcia and Miguel Torres and Schein, {Catherine H.} and Werner Braun",
year = "2009",
month = "2",
doi = "10.1016/j.molimm.2008.07.034",
language = "English (US)",
volume = "46",
pages = "559--568",
journal = "Molecular Immunology",
issn = "0161-5890",
publisher = "Elsevier Limited",
number = "4",

}

TY - JOUR

T1 - Characteristic motifs for families of allergenic proteins

AU - Ivanciuc, Ovidiu

AU - Garcia, Tzintzuni

AU - Torres, Miguel

AU - Schein, Catherine H.

AU - Braun, Werner

PY - 2009/2

Y1 - 2009/2

N2 - The identification of potential allergenic proteins is usually done by scanning a database of allergenic proteins and locating known allergens with a high sequence similarity. However, there is no universally accepted cut-off value for sequence similarity to indicate potential IgE cross-reactivity. Further, overall sequence similarity may be less important than discrete areas of similarity in proteins with homologous structure. To identify such areas, we first classified all allergens and their subdomains in the Structural Database of Allergenic Proteins (SDAP, http://fermi.utmb.edu/SDAP/) to their closest protein families as defined in Pfam, and identified conserved physicochemical property motifs characteristic of each group of sequences. Allergens populate only a small subset of all known Pfam families, as all allergenic proteins in SDAP could be grouped to only 130 (of 9318 total) Pfams, and 31 families contain more than four allergens. Conserved physicochemical property motifs for the aligned sequences of the most populated Pfam families were identified with the PCPMer program suite and catalogued in the webserver MotifMate (http://born.utmb.edu/motifmate/summary.php). We also determined specific motifs for allergenic members of a family that could distinguish them from non-allergenic ones. These allergen specific motifs should be most useful in database searches for potential allergens. We found that sequence motifs unique to the allergens in three families (seed storage proteins, Bet v 1, and tropomyosin) overlap with known IgE epitopes, thus providing evidence that our motif based approach can be used to assess the potential allergenicity of novel proteins.

AB - The identification of potential allergenic proteins is usually done by scanning a database of allergenic proteins and locating known allergens with a high sequence similarity. However, there is no universally accepted cut-off value for sequence similarity to indicate potential IgE cross-reactivity. Further, overall sequence similarity may be less important than discrete areas of similarity in proteins with homologous structure. To identify such areas, we first classified all allergens and their subdomains in the Structural Database of Allergenic Proteins (SDAP, http://fermi.utmb.edu/SDAP/) to their closest protein families as defined in Pfam, and identified conserved physicochemical property motifs characteristic of each group of sequences. Allergens populate only a small subset of all known Pfam families, as all allergenic proteins in SDAP could be grouped to only 130 (of 9318 total) Pfams, and 31 families contain more than four allergens. Conserved physicochemical property motifs for the aligned sequences of the most populated Pfam families were identified with the PCPMer program suite and catalogued in the webserver MotifMate (http://born.utmb.edu/motifmate/summary.php). We also determined specific motifs for allergenic members of a family that could distinguish them from non-allergenic ones. These allergen specific motifs should be most useful in database searches for potential allergens. We found that sequence motifs unique to the allergens in three families (seed storage proteins, Bet v 1, and tropomyosin) overlap with known IgE epitopes, thus providing evidence that our motif based approach can be used to assess the potential allergenicity of novel proteins.

KW - Allergen classification

KW - Allergen motif

KW - Allergy

KW - Cross-reactivity

UR - http://www.scopus.com/inward/record.url?scp=58249118841&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=58249118841&partnerID=8YFLogxK

U2 - 10.1016/j.molimm.2008.07.034

DO - 10.1016/j.molimm.2008.07.034

M3 - Article

C2 - 18951633

AN - SCOPUS:58249118841

VL - 46

SP - 559

EP - 568

JO - Molecular Immunology

JF - Molecular Immunology

SN - 0161-5890

IS - 4

ER -