High-accuracy de novo assembly and SNP detection of chloroplast genomes using a SMRT circular consensus sequencing strategy

Qiushi Li, Ying Li, Jingyuan Song, Haibin Xu, Jiang Xu, Yingjie Zhu, Xiwen Li, Huanhuan Gao, Linlin Dong, Jun Qian, Chao Sun, Shilin Chen

Research output: Contribution to journal (Article)

48 Citations (Scopus)

Abstract

Summary: A circular consensus sequencing (CCS) strategy involving single molecule, real-time (SMRT) DNA sequencing technology was applied to de novo assembly and single nucleotide polymorphism (SNP) detection of chloroplast genomes. Chloroplast DNA was purified from enriched chloroplasts of pooled individuals to construct a shotgun library for each species. The sequencing reactions were performed on a PacBio RS platform. CCS sub-reads were generated from polymerase reads that passed the native dumbbell-shaped DNA templates multiple times. The complete chloroplast genome sequence was generated by mapping all reads to the draft sequence constructed in a step-by-step manner. The full-chain, PCR-free approach eliminates the possible context-specific biases in library construction and sequencing reaction. The chloroplast genome was easily and completely assembled using the data generated from one SMRT Cell without requiring a reference genome. Comparisons of the three assembled Fritillaria genomes to 34.1 kb of validation Sanger sequences revealed 100% concordance, and the detected intraspecies SNPs at a minimum variant frequency of 15% were all confirmed. This simple approach with potential for parallel sequencing yields high-quality chloroplast genomes for sensitive SNP detection and comparative analyses. We recommend this approach for its powerful applicability for evolutionary genetics and genomics studies in plants based on the sequences of chloroplast genomes.
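
The 15% minimum variant frequency mentioned in the abstract can be illustrated with a small sketch. This is not the authors' pipeline: the `call_snps` function, the simplified `pileup` input (a mapping from position to the bases observed across aligned reads), and the toy data are all hypothetical and only show how a frequency threshold separates reported variants from low-frequency noise.

```python
from collections import Counter

MIN_VARIANT_FREQUENCY = 0.15  # the 15% threshold quoted in the abstract


def call_snps(reference, pileup, min_freq=MIN_VARIANT_FREQUENCY):
    """Return (position, ref_base, alt_base, frequency) for each variant allele.

    `pileup` maps a 0-based position to the list of bases observed there
    across the aligned reads (a hypothetical, simplified input format).
    """
    snps = []
    for pos in sorted(pileup):
        bases = pileup[pos]
        depth = len(bases)
        if depth == 0:
            continue  # no coverage, nothing to call
        ref_base = reference[pos]
        for base, count in sorted(Counter(bases).items()):
            freq = count / depth
            # Report only non-reference alleles at or above the threshold.
            if base != ref_base and freq >= min_freq:
                snps.append((pos, ref_base, base, freq))
    return snps


# Toy data, invented for illustration only.
reference = "ACGT"
pileup = {
    0: ["A"] * 20,              # all reads match the reference
    1: ["C"] * 16 + ["T"] * 4,  # T observed in 4 of 20 reads (20%)
    2: ["G"] * 18 + ["A"] * 2,  # A observed in 2 of 20 reads (10%)
}
snps = call_snps(reference, pileup)
```

With the default threshold only the variant at position 1 is kept; lowering `min_freq` to 0.05 would also report the 10% allele at position 2. In pooled-individual samples like those described above, the threshold trades sensitivity to rare intraspecies variants against the risk of calling sequencing errors.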

Original language: English (US)
Pages (from-to): 1041-1049
Number of pages: 9
Journal: New Phytologist
Volume: 204
Issue number: 4
DOI: https://doi.org/10.1111/nph.12966
State: Published - Jan 1 2014
Externally published: Yes

Keywords

  • Fritillaria
  • Chloroplast genome
  • Circular consensus sequencing (CCS)
  • Single molecular real-time (SMRT) sequencing
  • Single nucleotide polymorphism (SNP)

ASJC Scopus subject areas

  • Physiology
  • Plant Science

Cite this

High-accuracy de novo assembly and SNP detection of chloroplast genomes using a SMRT circular consensus sequencing strategy. / Li, Qiushi; Li, Ying; Song, Jingyuan; Xu, Haibin; Xu, Jiang; Zhu, Yingjie; Li, Xiwen; Gao, Huanhuan; Dong, Linlin; Qian, Jun; Sun, Chao; Chen, Shilin.

In: New Phytologist, Vol. 204, No. 4, 01.01.2014, p. 1041-1049.

Li, Q, Li, Y, Song, J, Xu, H, Xu, J, Zhu, Y, Li, X, Gao, H, Dong, L, Qian, J, Sun, C & Chen, S 2014, 'High-accuracy de novo assembly and SNP detection of chloroplast genomes using a SMRT circular consensus sequencing strategy', New Phytologist, vol. 204, no. 4, pp. 1041-1049. https://doi.org/10.1111/nph.12966

Li, Qiushi ; Li, Ying ; Song, Jingyuan ; Xu, Haibin ; Xu, Jiang ; Zhu, Yingjie ; Li, Xiwen ; Gao, Huanhuan ; Dong, Linlin ; Qian, Jun ; Sun, Chao ; Chen, Shilin. / High-accuracy de novo assembly and SNP detection of chloroplast genomes using a SMRT circular consensus sequencing strategy. In: New Phytologist. 2014 ; Vol. 204, No. 4. pp. 1041-1049.

@article{92c7e72db30d4a6b80db5c66f57f97fb,
title = "High-accuracy de novo assembly and SNP detection of chloroplast genomes using a SMRT circular consensus sequencing strategy",
abstract = "Summary: A circular consensus sequencing (CCS) strategy involving single molecule, real-time (SMRT) DNA sequencing technology was applied to de novo assembly and single nucleotide polymorphism (SNP) detection of chloroplast genomes. Chloroplast DNA was purified from enriched chloroplasts of pooled individuals to construct a shotgun library for each species. The sequencing reactions were performed on a PacBio RS platform. CCS sub-reads were generated from polymerase reads that passed the native dumbbell-shaped DNA templates multiple times. The complete chloroplast genome sequence was generated by mapping all reads to the draft sequence constructed in a step-by-step manner. The full-chain, PCR-free approach eliminates the possible context-specific biases in library construction and sequencing reaction. The chloroplast genome was easily and completely assembled using the data generated from one SMRT Cell without requiring a reference genome. Comparisons of the three assembled Fritillaria genomes to 34.1 kb of validation Sanger sequences revealed 100{\%} concordance, and the detected intraspecies SNPs at a minimum variant frequency of 15{\%} were all confirmed. This simple approach with potential for parallel sequencing yields high-quality chloroplast genomes for sensitive SNP detection and comparative analyses. We recommend this approach for its powerful applicability for evolutionary genetics and genomics studies in plants based on the sequences of chloroplast genomes.",
keywords = "Fritillaria, Chloroplast genome, Circular consensus sequencing (CCS), Single molecular real-time (SMRT) sequencing, Single nucleotide polymorphism (SNP)",
author = "Qiushi Li and Ying Li and Jingyuan Song and Haibin Xu and Jiang Xu and Yingjie Zhu and Xiwen Li and Huanhuan Gao and Linlin Dong and Jun Qian and Chao Sun and Shilin Chen",
year = "2014",
month = "1",
day = "1",
doi = "10.1111/nph.12966",
language = "English (US)",
volume = "204",
pages = "1041--1049",
journal = "New Phytologist",
issn = "0028-646X",
publisher = "Wiley-Blackwell",
number = "4",
}

TY - JOUR

T1 - High-accuracy de novo assembly and SNP detection of chloroplast genomes using a SMRT circular consensus sequencing strategy

AU - Li, Qiushi

AU - Li, Ying

AU - Song, Jingyuan

AU - Xu, Haibin

AU - Xu, Jiang

AU - Zhu, Yingjie

AU - Li, Xiwen

AU - Gao, Huanhuan

AU - Dong, Linlin

AU - Qian, Jun

AU - Sun, Chao

AU - Chen, Shilin

PY - 2014/1/1

Y1 - 2014/1/1

AB - Summary: A circular consensus sequencing (CCS) strategy involving single molecule, real-time (SMRT) DNA sequencing technology was applied to de novo assembly and single nucleotide polymorphism (SNP) detection of chloroplast genomes. Chloroplast DNA was purified from enriched chloroplasts of pooled individuals to construct a shotgun library for each species. The sequencing reactions were performed on a PacBio RS platform. CCS sub-reads were generated from polymerase reads that passed the native dumbbell-shaped DNA templates multiple times. The complete chloroplast genome sequence was generated by mapping all reads to the draft sequence constructed in a step-by-step manner. The full-chain, PCR-free approach eliminates the possible context-specific biases in library construction and sequencing reaction. The chloroplast genome was easily and completely assembled using the data generated from one SMRT Cell without requiring a reference genome. Comparisons of the three assembled Fritillaria genomes to 34.1 kb of validation Sanger sequences revealed 100% concordance, and the detected intraspecies SNPs at a minimum variant frequency of 15% were all confirmed. This simple approach with potential for parallel sequencing yields high-quality chloroplast genomes for sensitive SNP detection and comparative analyses. We recommend this approach for its powerful applicability for evolutionary genetics and genomics studies in plants based on the sequences of chloroplast genomes.

KW - Fritillaria

KW - Chloroplast genome

KW - Circular consensus sequencing (CCS)

KW - Single molecular real-time (SMRT) sequencing

KW - Single nucleotide polymorphism (SNP)

UR - http://www.scopus.com/inward/record.url?scp=84911983013&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84911983013&partnerID=8YFLogxK

U2 - 10.1111/nph.12966

DO - 10.1111/nph.12966

M3 - Article

C2 - 25103547

AN - SCOPUS:84911983013

VL - 204

SP - 1041

EP - 1049

JO - New Phytologist

JF - New Phytologist

SN - 0028-646X

IS - 4

ER -