Correcting positional correlations in Affymetrix® Genome Chips

Dirar Homouz, Gang Chen, Andrzej S. Kudlicki

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

We report and model a previously undescribed systematic error causing spurious excess correlations that depend on the distance between probes on Affymetrix® microarrays. The phenomenon affects pairs of features with large chip separations, up to over 100 probes apart. The effect may have a significant impact on analysis of correlations in large collections of expression data, where the systematic experimental errors are repeated in many data sets. Examples of such studies include analysis of functions and interactions in groups of genes, as well as global properties of genomes. We find that the average correlations between probes on Affymetrix microarrays are larger for smaller chip distances, which points out to a previously undescribed positional artifact. The magnitude of the artifact depends on the design of the chip, and we find it to be especially high for the yeast S98 microarray, where spurious excess correlations reach 0.1 at a distance of 50 probes. We have designed an algorithm to correct this bias and provide new data sets with the corrected expression values. This algorithm was successfully implemented to remove the positional artifact from the S98 chip data while preserving the integrity of the data.

Original languageEnglish (US)
Article number9078
JournalScientific Reports
Volume5
DOIs
StatePublished - 2015

Fingerprint

Artifacts
Genome
Yeasts
Genes
Datasets

ASJC Scopus subject areas

  • General

Cite this

Correcting positional correlations in Affymetrix® Genome Chips. / Homouz, Dirar; Chen, Gang; Kudlicki, Andrzej S.

In: Scientific Reports, Vol. 5, 9078, 2015.

Research output: Contribution to journalArticle

Homouz, Dirar ; Chen, Gang ; Kudlicki, Andrzej S. / Correcting positional correlations in Affymetrix® Genome Chips. In: Scientific Reports. 2015 ; Vol. 5.
@article{a2821f548e694610b220801e3d985e80,
title = "Correcting positional correlations in Affymetrix{\circledR} Genome Chips",
abstract = "We report and model a previously undescribed systematic error causing spurious excess correlations that depend on the distance between probes on Affymetrix{\circledR} microarrays. The phenomenon affects pairs of features with large chip separations, up to over 100 probes apart. The effect may have a significant impact on analysis of correlations in large collections of expression data, where the systematic experimental errors are repeated in many data sets. Examples of such studies include analysis of functions and interactions in groups of genes, as well as global properties of genomes. We find that the average correlations between probes on Affymetrix microarrays are larger for smaller chip distances, which points out to a previously undescribed positional artifact. The magnitude of the artifact depends on the design of the chip, and we find it to be especially high for the yeast S98 microarray, where spurious excess correlations reach 0.1 at a distance of 50 probes. We have designed an algorithm to correct this bias and provide new data sets with the corrected expression values. This algorithm was successfully implemented to remove the positional artifact from the S98 chip data while preserving the integrity of the data.",
author = "Dirar Homouz and Gang Chen and Kudlicki, {Andrzej S.}",
year = "2015",
doi = "10.1038/srep09078",
language = "English (US)",
volume = "5",
journal = "Scientific Reports",
issn = "2045-2322",
publisher = "Nature Publishing Group",

}

TY - JOUR

T1 - Correcting positional correlations in Affymetrix® Genome Chips

AU - Homouz, Dirar

AU - Chen, Gang

AU - Kudlicki, Andrzej S.

PY - 2015

Y1 - 2015

N2 - We report and model a previously undescribed systematic error causing spurious excess correlations that depend on the distance between probes on Affymetrix® microarrays. The phenomenon affects pairs of features with large chip separations, up to over 100 probes apart. The effect may have a significant impact on analysis of correlations in large collections of expression data, where the systematic experimental errors are repeated in many data sets. Examples of such studies include analysis of functions and interactions in groups of genes, as well as global properties of genomes. We find that the average correlations between probes on Affymetrix microarrays are larger for smaller chip distances, which points out to a previously undescribed positional artifact. The magnitude of the artifact depends on the design of the chip, and we find it to be especially high for the yeast S98 microarray, where spurious excess correlations reach 0.1 at a distance of 50 probes. We have designed an algorithm to correct this bias and provide new data sets with the corrected expression values. This algorithm was successfully implemented to remove the positional artifact from the S98 chip data while preserving the integrity of the data.

AB - We report and model a previously undescribed systematic error causing spurious excess correlations that depend on the distance between probes on Affymetrix® microarrays. The phenomenon affects pairs of features with large chip separations, up to over 100 probes apart. The effect may have a significant impact on analysis of correlations in large collections of expression data, where the systematic experimental errors are repeated in many data sets. Examples of such studies include analysis of functions and interactions in groups of genes, as well as global properties of genomes. We find that the average correlations between probes on Affymetrix microarrays are larger for smaller chip distances, which points out to a previously undescribed positional artifact. The magnitude of the artifact depends on the design of the chip, and we find it to be especially high for the yeast S98 microarray, where spurious excess correlations reach 0.1 at a distance of 50 probes. We have designed an algorithm to correct this bias and provide new data sets with the corrected expression values. This algorithm was successfully implemented to remove the positional artifact from the S98 chip data while preserving the integrity of the data.

UR - http://www.scopus.com/inward/record.url?scp=84924873520&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84924873520&partnerID=8YFLogxK

U2 - 10.1038/srep09078

DO - 10.1038/srep09078

M3 - Article

VL - 5

JO - Scientific Reports

JF - Scientific Reports

SN - 2045-2322

M1 - 9078

ER -