Broom: Application for non-redundant storage of High Throughput Sequencing data

Levent Albayrak, Kamil Khanipov, George Golovko, Yuriy Fofanov

Research output: Contribution to journalArticlepeer-review

Abstract

Motivation The data generation capabilities of High Throughput Sequencing (HTS) instruments have exponentially increased over the last few years, while the cost of sequencing has dramatically decreased allowing this technology to become widely used in biomedical studies. For small labs and individual researchers, however, storage and transfer of large amounts of HTS data present a significant challenge. The recent trends in increased sequencing quality and genome coverage can be used to reconsider HTS data storage strategies. Results We present Broom, a stand-alone application designed to select and store only high-quality sequencing reads at extremely high compression rates. Written in C++, the application accepts single and paired-end reads in FASTQ and FASTA formats and decompresses data in FASTA format. Availability C++ code available at https://scsb.utmb.edu/labgroups/fofanov/broom.asp Contact lealbayr@utmb.edu

Original languageEnglish (US)
JournalUnknown Journal
DOIs
StatePublished - May 2 2018

ASJC Scopus subject areas

  • Biochemistry, Genetics and Molecular Biology(all)
  • Agricultural and Biological Sciences(all)
  • Immunology and Microbiology(all)
  • Neuroscience(all)
  • Pharmacology, Toxicology and Pharmaceutics(all)

Fingerprint Dive into the research topics of 'Broom: Application for non-redundant storage of High Throughput Sequencing data'. Together they form a unique fingerprint.

Cite this