Generalized composite multi-sample tests for high-dimensional data

Xiaoli Kong, Alejandro Villasante-Tezanos, David W. Fardo, Solomon W. Harrar

Research output: Contribution to journalArticlepeer-review

Abstract

High-dimensional data is ubiquitous in studies involving omics, human movement, and imaging. A multivariate comparison method is proposed for such types of data when either the dimension or the replication size substantially exceeds the other. A testing procedure is introduced that centers and scales a composite measure of distance statistic among the samples to appropriately account for high dimensions and/or large sample sizes. The properties of the test statistic are examined both theoretically and empirically. The proposed procedure demonstrates superior performance in simulation studies and an application to confirm the involvement of previously identified genes in the stages of invasive breast cancer.

Original languageEnglish (US)
Article number108279
JournalComputational Statistics and Data Analysis
Volume214
DOIs
StatePublished - Feb 2026

Keywords

  • Asymptotic expansion
  • Composite test
  • F-test
  • High-dimension
  • MANOVA
  • Strong-mixing

ASJC Scopus subject areas

  • Statistics and Probability
  • Computational Theory and Mathematics
  • Computational Mathematics
  • Applied Mathematics

Fingerprint

Dive into the research topics of 'Generalized composite multi-sample tests for high-dimensional data'. Together they form a unique fingerprint.

Cite this