A visual data mining tool that facilitates reconstruction of transcription regulatory networks

Daniel C. Jupiter, Vincent VanBuren

Research output: Contribution to journalArticlepeer-review

21 Scopus citations


Background: Although the use of microarray technology has seen exponential growth, analysis of microarray data remains a challenge to many investigations. One difficulty lies in the interpretation of a list of differentially expressed genes, or in how to plan new experiments given that knowledge. Clustering methods can be used to identify groups of genes with similar expression patterns, and genes with unknown function can be provisionally annotated based on the concept of "guilt by association", where function is tentatively inferred from the known functions of genes with similar expression patterns. These methods frequently suffer from two limitations: (1) visualization usually only gives access to group membership, rather than specific information about nearest neighbors, and (2) the resolution or quality of the relationship are not easily inferred. Methodology/Principal Findings: We have addressed these issues by improving the precision of similarity detection over that of a single experiment and by creating a tool to visualize tractable networks: we (1) performed meta-analysis computation of correlation coefficients for all gene pairs in a heterogeneous data set collection from 2,145 publicly available microarray samples in mouse, (2) filtered the resulting distribution of over 130 million correlation coefficients to build new, more tractable distribution from the strongest correlations, and (3) designed and implemented a new Web based tool (StarNet, http://vanburenlab.medicine.tamhsc.edu/stamet.html) for visualization of sub-networks of the correlation coefficients built according to user specified parameters. Conclusion/Significance: Correlations were calculated across a heterogeneous collection of publicly available microarray data. Users can access this analysis using a new freely available Web-based application for visualizing tractable correlation networks that are flexibly specified by the user. This new resource enables rapid hypothesis development for transcription regulatory relationships.

Original languageEnglish (US)
Article numbere1717
JournalPloS one
Issue number3
StatePublished - Mar 5 2008
Externally publishedYes

ASJC Scopus subject areas

  • General Biochemistry, Genetics and Molecular Biology
  • General Agricultural and Biological Sciences
  • General


Dive into the research topics of 'A visual data mining tool that facilitates reconstruction of transcription regulatory networks'. Together they form a unique fingerprint.

Cite this