SNP Discovery Using Next Generation Transcriptomic Sequencing

De Wit, Pierre

doi:10.1007/978-1-4939-3774-5_5

Pierre De Wit³

Part of the book series: Methods in Molecular Biology ((MIMB,volume 1452))

2185 Accesses
1 Citations
2 Altmetric

Abstract

In this chapter, I will guide the user through methods to find new SNP markers from expressed sequence (RNA-Seq) data, focusing on the sample preparation and also on the bioinformatic analyses needed to sort through the immense flood of data from high-throughput sequencing machines. The general steps included are as follows: sample preparation, sequencing, quality control of data, assembly, map**, SNP discovery, filtering, validation. The first few steps are traditional laboratory protocols, whereas steps following the sequencing are of bioinformatic nature. The bioinformatics described herein are by no means exhaustive, rather they serve as one example of a simple way of analyzing high-throughput sequence data to find SNP markers. Ideally, one would like to run through this protocol several times with a new dataset, while varying software parameters slightly, in order to determine the robustness of the results. The final validation step, although not described in much detail here, is also quite critical as that will be the final test of the accuracy of the assumptions made in silico.

There is a plethora of downstream applications of a SNP dataset, not covered in this chapter. For an example of a more thorough protocol also including differential gene expression and functional enrichment analyses, BLAST annotation and downstream applications of SNP markers, a good starting point could be the “Simple Fool’s Guide to population genomics via RNA-Seq,” which is available at http://sfg.stanford.edu.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Protocol: USD 49.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 159.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

NGS-Trex: An Automatic Analysis Workflow for RNA-Seq Data

A survey of best practices for RNA-seq data analysis

Article Open access 26 January 2016

Bioinformatics Tools for Next-Generation RNA Sequencing Analysis

References

Barton NH, Keightley PD (2002) Understanding quantitative genetic variation. Nat Rev Genet 3(1):11–21
Article CAS PubMed Google Scholar
Vos P, Hogers R, Bleeker M, Reijans M, Vandelee T, Hornes M, Frijters A, Pot J, Peleman J, Kuiper M, Zabeau M (1995) AFLP – a new technique for DNA-fingerprinting. Nucleic Acids Res 23(21):4407–4414
Article CAS PubMed PubMed Central Google Scholar
Richardson BJ, Baverstock PR, Adams M (1986) Allozyme electrophoresis: a handbook for animal systematics and population studies. Academic, San Diego, CA
Google Scholar
Slatkin M (1995) A measure of population subdivision based on microsatellite allele frequencies. Genetics 139(1):457–462
CAS PubMed PubMed Central Google Scholar
Selkoe KA, Toonen RJ (2006) Microsatellites for ecologists: a practical guide to using and evaluating microsatellite markers. Ecol Lett 9(5):615–629
Article PubMed Google Scholar
Baird NA, Etter PD, Atwood TS, Currey MC, Shiver AL, Lewis ZA, Selker EU, Cresko WA, Johnson EA (2008) Rapid SNP discovery and genetic map** using sequenced RAD markers. PLoS One 3(10)
Google Scholar
Wang Z, Gerstein M, Snyder M (2009) RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet 10:57–63
Article CAS PubMed PubMed Central Google Scholar
Beaumont MA, Nichols RA (1996) Evaluating loci for use in the genetic analysis of population structure. Proc R Soc B Biol Sci 263(1377):1619–1626
Article Google Scholar
Charlesworth B, Nordborg M, Charlesworth D (1997) The effects of local selection, balanced polymorphism and background selection on equilibrium patterns of genetic diversity in subdivided populations. Genet Res 70(2):155–174
Article CAS PubMed Google Scholar
Martin JA, Wang Z (2011) Next-generation transcriptome assembly. Nat Rev Genet 12(10):671–682
Article CAS PubMed Google Scholar
Konczal M, Koteja P, Stuglik MT, Radwan J, Babik W (2013) Accuracy of allele frequency estimation using pooled RNA-Seq. Mol Ecol Resour 14:381–392
Article PubMed Google Scholar
Skelly DA, Johansson M, Madeoy J, Wakefield J, Akey JM (2011) A powerful and flexible statistical framework for testing hypotheses of allele-specific gene expression from RNA-seq data. Genome Res 21:1728–1737
Article CAS PubMed PubMed Central Google Scholar
De Wit P, Pespeni MH, Palumbi SR (2015) SNP genoty** and population genomics from expressed sequences - current advances and future possibilities. Mol Ecol 24(10):2310–2323
Article PubMed Google Scholar
Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L, Raychowdhury R, Zeng Q, Chen Z, Mauceli E, Hacohen N, Gnirke A, Rhind N, di Palma F, Birren BW, Nusbaum C, Lindblad-Toh K, Friedman N, Regev A (2011) Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol 29(7):644-U130
Article Google Scholar
Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25(14):1754
Article CAS PubMed PubMed Central Google Scholar
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R (2009) The sequence alignment/map format and SAMtools. Bioinformatics 25(16):2078
Article PubMed PubMed Central Google Scholar
DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, Philippakis AA, del Angel G, Rivas MA, Hanna M, McKenna A, Fennell TJ, Kernytsky AM, Sivachenko AY, Cibulskis K, Gabriel SB, Altshuler D, Daly MJ (2011) A framework for variation discovery and genoty** using next-generation DNA sequencing data. Nat Genet 43:491–498
Article CAS PubMed PubMed Central Google Scholar

Download references

Author information

Authors and Affiliations

Department of Marine Sciences, University of Gothenburg, Hättebäcksvägen 7, SE-452 96, Strömstad, Sweden
Pierre De Wit

Authors

Pierre De Wit
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Pierre De Wit .

Editor information

Editors and Affiliations

Department of Marine Sciences, University of Gothenburg, Gothenburg, Sweden
Sarah J. Bourlat

Rights and permissions

Reprints and permissions

Copyright information

About this protocol

Cite this protocol

De Wit, P. (2016). SNP Discovery Using Next Generation Transcriptomic Sequencing. In: Bourlat, S. (eds) Marine Genomics. Methods in Molecular Biology, vol 1452. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-3774-5_5

Download citation

DOI: https://doi.org/10.1007/978-1-4939-3774-5_5
Published: 27 July 2016
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-3772-1
Online ISBN: 978-1-4939-3774-5
eBook Packages: Springer Protocols

Publish with us

Policies and ethics

SNP Discovery Using Next Generation Transcriptomic Sequencing

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

NGS-Trex: An Automatic Analysis Workflow for RNA-Seq Data

A survey of best practices for RNA-seq data analysis

Bioinformatics Tools for Next-Generation RNA Sequencing Analysis

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this protocol

Cite this protocol

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

SNP Discovery Using Next Generation Transcriptomic Sequencing

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

NGS-Trex: An Automatic Analysis Workflow for RNA-Seq Data

A survey of best practices for RNA-seq data analysis

Bioinformatics Tools for Next-Generation RNA Sequencing Analysis

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this protocol

Cite this protocol

Download citation

Publish with us

Search

Navigation