Abstract
Characterization and profiling of the gene expression data or, more formally, called transcriptome are crucial steps in revealing the involvement of RNA in a variety of biological processes. RNA characteristics have been investigated in a number of researches employing whole transcriptome sequencing (RNA-seq) data in all the major crops. To date, a number of crop genomes have been sequenced, which allows researchers to understand the typical biological mechanism, which may further be utilized to discover and characterise candidate genes responsible. Gene expression profiling, that is, RNA-seq, is a useful tool for identifying and understanding biological processes/mechanisms, such as the coding, decoding, regulation, and expression of genes. A basic RNA-seq process may be divided into two broad categories, that is, reference based and de novo, as per the availability of the reference genome related to the species/crop under consideration. In this chapter, we introduce basic RNA-seq analysis approaches, pipelines and software, focusing particularly on de novo transcriptome assembly and identification of differentially expressed genes (DEGs)/transcripts by using finger millet as an example.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Saxena R, Vanga SK, Wang J, Orsat V, Raghavan V (2018) Millets for food security in the context of climate change: a review. Sustainability 10:2228
Avashthi H, Pathak RK, Gaur VS, Singh S, Gupta VK, Ramteke PW et al (2020) Comparative analysis of ROS-scavenging gene families in finger millet, rice, sorghum, and foxtail millet revealed potential targets for antioxidant activity and drought tolerance improvement. Netw Model Anal Health Inform Bioinform 9:1–23
Avashthi H, Pathak RK, Pandey N, Arora S, Mishra AK, Gupta VK et al (2018) Transcriptome-wide identification of genes involved in ascorbate-glutathione cycle (Halliwell-Asada pathway) and related pathway for elucidating its role in antioxidative potential in finger millet (Eleusine coracana (L.)). 3 Biotech 8:1–8
Chinnusamy V, Schumaker K, Zhu JK (2004) Molecular genetics perspectives on cross-talk and specificity in abiotic stress signalling in plants. J Exp Bot 55:225–236
Wang Z, Gerstein M, Snyder M (2009) RNA-seq: a revolutionary tool for transcriptomics. Nat Rev Genet 10:57–63
Dillies MA, Rau A, Aubert J, Hennequet-Antier C, Jeanmougin M, Servant N et al (2012) A comprehensive evaluation of normalization methods for Illumina highthroughput RNA sequencing data analysis. Brief Bioinform 14:671–683
Costa-Silva J, Domingues D, Lopes FM (2017) RNA-Seq differential expression analysis: an extended review and a software tool. PLoS One 12:e0190152
Conesa A, Madrigal P, Tarazona S, Gomez-Cabrero D, Cervera A, McPherson A et al (2016) A survey of best practices for RNA-seq data analysis. Genome Biol 17:13
Seyednasrollah F, Laiho A, Elo LL (2015) Comparison of software packages for detecting differential expression in RNA-seq studies. Brief Bioinform 16:59–70
Kumar A, Gaur VS, Goel A, Gupta AK (2015) De novo assembly and characterization of develo** spikes transcriptome of finger millet (Eleusine coracana): a minor crop having nutraceutical properties. Plant Mol Biol Rep 33:905–922
Griffith M, Walker JR, Spies NC, Ainscough BJ, Griffith OL (2015) Informatics for RNA sequencing: a web resource for analysis on the cloud. PLoS Comput Biol 11:e1004393
Leinonen R, Sugawara H, Shumway M (2010) International nucleotide sequence database collaboration. The sequence read archive. Nucleic Acids Res 39(suppl_1):D19–D21
Trivedi UH, Cezard T, Bridgett S, Montazam A, Nichols J, Blaxter M et al (2014) Quality control of next-generation sequencing data without a reference. Front Genet 5:111
Bolger AM, Lohse M, Usadel B (2014) Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120
Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I et al (2011) Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data. Nat Biotechnol 29:644
Li B, Dewey CN (2011) RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinform 12:1–6
Robinson MD, McCarthy DJ, Smyth GK (2010) edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26:139–140
Green MR, Sambrook J (2014) Molecular cloning, a laboratory manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY
Patel RK, Jain M (2012) NGS QC toolkit: a toolkit for quality control of next generation sequencing data. PLoS One 7:e30619
Yang X, Liu D, Liu F, Wu J, Zou J, **ao X et al (2013) HTQC: a fast quality control toolkit for Illumina sequencing data. BMC Bioinform 14:1–4
Lindgreen S (2012) AdapterRemoval: easy cleaning of next-generation sequencing reads. BMC Res Notes 5:1–7
Martin M (2011) Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J 17:10–12
Liao X, Li M, Zou Y, Wu FX, Pan Y, Wang J (2019) An efficient trimming algorithm based on multi-feature fusion scoring model for NGS data. IEEE/ACM Trans Comput Biol Bioinform 17:728–738
**e Y, Wu G, Tang J, Luo R, Patterson J, Liu S et al (2014) SOAPdenovo-trans: de novo transcriptome assembly with short RNA-Seq reads. Bioinformatics 30:1660–1666
Schulz MH, Zerbino DR, Vingron M, Birney E (2012) Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels. Bioinformatics 28:1086–1092
Robertson G, Schein J, Chiu R, Corbett R, Field M, Jackman SD et al (2010) De novo assembly and analysis of RNA-seq data. Nat Methods 7:909–912
Li X, Brock GN, Rouchka EC, Cooper N, Wu D, O’Toole TE et al (2017) A comparison of per sample global scaling and per gene normalization methods for differential expression analysis of RNA-seq data. PLoS One 12:e0176185
Ozsolak F, Milos PM (2011) RNA sequencing: advances, challenges and opportunities. Nat Rev Genet 12:87–98
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature
About this protocol
Cite this protocol
Avashthi, H. et al. (2022). Transcriptome Data Analysis Using a De Novo Assembly Approach. In: Wani, S.H., Kumar, A. (eds) Genomics of Cereal Crops. Springer Protocols Handbooks. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-2533-0_8
Download citation
DOI: https://doi.org/10.1007/978-1-0716-2533-0_8
Published:
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-0716-2532-3
Online ISBN: 978-1-0716-2533-0
eBook Packages: Springer Protocols