TY - JOUR
T1 - Classification of genes and putative biomarker identification using distribution metrics on expression profiles
AU - Huang, Hung Chung
AU - Jupiter, Daniel
AU - VanBuren, Vincent
PY - 2010/2/4
Y1 - 2010/2/4
N2 - Background: Identification of genes with switch-like properties will facilitate discovery of regulatory mechanisms that underlie these properties, and will provide knowledge for the appropriate application of Boolean networks in gene regulatory models. As switch-like behavior is likely associated with tissue-specific expression, these gene products are expected to be plausible candidates as tissue-specific biomarkers. Methodology/Principal Findings:In a systematic classification of genes and search for biomarkers, gene expression profiles (GEPs) of more than 16,000 genes from 2,145 mouse array samples were analyzed. Four distribution metrics (mean, standard deviation, kurtosis and skewness) were used to classify GEPs into four categories: predominantly-off, predominantly-on, graded (rheostatic), and switch-like genes. The arrays under study were also grouped and examined by tissue type. For example, arrays were categorized as 'brain group' and 'non-brain group'; the Kolmogorov-Smirnov distance and Pearson correlation coefficient were then used to compare GEPs between brain and non-brain for each gene. We were thus able to identify tissue-specific biomarker candidate genes. Conclusions/Significance: The methodology employed here may be used to facilitate disease-specific biomarker discovery.
AB - Background: Identification of genes with switch-like properties will facilitate discovery of regulatory mechanisms that underlie these properties, and will provide knowledge for the appropriate application of Boolean networks in gene regulatory models. As switch-like behavior is likely associated with tissue-specific expression, these gene products are expected to be plausible candidates as tissue-specific biomarkers. Methodology/Principal Findings:In a systematic classification of genes and search for biomarkers, gene expression profiles (GEPs) of more than 16,000 genes from 2,145 mouse array samples were analyzed. Four distribution metrics (mean, standard deviation, kurtosis and skewness) were used to classify GEPs into four categories: predominantly-off, predominantly-on, graded (rheostatic), and switch-like genes. The arrays under study were also grouped and examined by tissue type. For example, arrays were categorized as 'brain group' and 'non-brain group'; the Kolmogorov-Smirnov distance and Pearson correlation coefficient were then used to compare GEPs between brain and non-brain for each gene. We were thus able to identify tissue-specific biomarker candidate genes. Conclusions/Significance: The methodology employed here may be used to facilitate disease-specific biomarker discovery.
UR - http://www.scopus.com/inward/record.url?scp=77949350027&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77949350027&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0009056
DO - 10.1371/journal.pone.0009056
M3 - Article
C2 - 20140228
AN - SCOPUS:77949350027
SN - 1932-6203
VL - 5
JO - PloS one
JF - PloS one
IS - 2
M1 - e9056
ER -