Identifying mutations in sd1, Pi54 and Pi-ta, and positively selected genes of TN1, the first semidwarf rice in Green Revolution

Panibe, Jerome P.; Wang, Long; Lee, Yi-Chen; Wang, Chang-Sheng; Li, Wen-Hsiung

doi:10.1186/s40529-022-00336-x

Identifying mutations in sd1, Pi54 and Pi-ta, and positively selected genes of TN1, the first semidwarf rice in Green Revolution

Original Article
Open access
Published: 26 March 2022

Volume 63, article number 9, (2022)
Cite this article

Download PDF

You have full access to this open access article

Botanical Studies Submit manuscript

Identifying mutations in sd1, Pi54 and Pi-ta, and positively selected genes of TN1, the first semidwarf rice in Green Revolution

Download PDF

Jerome P. Panibe^1,2,3,
Long Wang⁴,
Yi-Chen Lee³,
Chang-Sheng Wang^5,6 &
…
Wen-Hsiung Li^1,3,7

3542 Accesses
Explore all metrics

Abstract

Background

Taichung Native 1 (TN1) is the first semidwarf rice cultivar that initiated the Green Revolution. As TN1 is a direct descendant of the Dee-geo-woo-gen cultivar, the source of the sd1 semidwarf gene, the sd1 gene can be defined through TN1. Also, TN1 is susceptible to the blast disease and is described as being drought-tolerant. However, genes related to these characteristics of TN1 are unknown. Our aim was to identify and characterize TN1 genes related to these traits.

Results

Aligning the sd1 of TN1 to Nipponbare sd1, we found a 382-bp deletion including a frameshift mutation. Sanger sequencing validated this deleted region in sd1, and we proposed a model of the sd1 gene that corrects errors in the literature. We also predicted the blast disease resistant (R) genes of TN1. Orthologues of the R genes in Tetep, a well-known resistant cultivar that is commonly used as a donor for breeding new blast resistant cultivars, were then sought in TN1, and if they were present, we looked for mutations. The absence of Pi54, a well-known R gene, in TN1 partially explains why TN1 is more susceptible to blast than Tetep. We also scanned the TN1 genome using the PosiGene software and identified 11 genes deemed to have undergone positive selection. Some of them are associated with drought-resistance and stress response.

Conclusions

We have redefined the deletion of the sd1 gene in TN1, a direct descendant of the Dee-geo-woo-gen cultivar, and have corrected some literature errors. Moreover, we have identified blast resistant genes and positively selected genes, including genes that characterize TN1’s blast susceptibility and abiotic stress response. These new findings increase the potential of using TN1 to breed new rice cultivars.

Analysis of a rice blast resistance gene Pita-Fuhui2663 and development of selection marker

Article Open access 01 September 2022

Genetic map** and molecular marker development for Pi65(t), a novel broad-spectrum resistance gene to rice blast using next-generation sequencing

Article 16 February 2016

Two genomic regions of a sodium azide induced rice mutant confer broad-spectrum and durable resistance to blast disease

Article Open access 10 January 2022

Background

The Green Revolution (GR) in rice production was attributed to the high-yielding semi-dwarf cultivars. In fact, the miracle rice, IR8, inherited the sd1 (semidwarf 1) gene from the Dee-geo-woo-gen (DGWG) cultivar (Hargrove et al. 1979). It conferred IR8 its short stature, making it lodging resistant, leading to high grain yield. Unknown to many, another cultivar also inherited the sd1 gene directly from DGWG. It is the Taichung Native 1 (TN1), which was popular in the 1960s (Chandler 1992). Recently, the genome of TN1 was sequenced, assembled and annotated, hel** to answer questions about the yield difference between TN1 and IR8 and why they both are photoperiod-insensitive (Panibe et al. 2021).

A fundamental characteristic of TN1 is its short height due to the sd1 gene from DGWG. The deletion of the semidwarf sd1 gene incurs a loss of function for the gibberellin (GA) 20-oxidase 2 (Os20ox2), which is involved in the synthesis of the growth hormone gibberellin (Spielmeyer et al. 2002). A reduction in GA results in a shorter plant height (Itoh et al. 2002). However, the sequence of the sd1 gene is not well studied. The current literature definition of the sd1 gene was based on the comparison of DGWG-type sd1 mutants (Habataki, Milyang 23, and IR24) with the sd1 of Nipponbare, Sasanishiki, and Calrose (Monna et al. 2002). It revealed a 383-bp deletion from the second half of Nipponbare’s exon 1 to the first half of exon 2, or in terms of the expressed sequence, a 278-bp deletion (Monna et al. 2002). Another definition of the sd1 deletion is a 280-bp deletion in the comparison of the semidwarf Doongara with the tall Kyeema, whose sd1 sequence is similar to Nipponbare (Spielmeyer et al. 2002). Those studies were done when the full Nipponbare genome was not yet available (until 2005) (International Rice Genome Sequencing Project and Sasaki 2005), and was later improved in 2013 (Kawahara et al. 2013). With the genomes of TN1 (Panibe et al. 2021) and IR8 (Stein et al. 2018) now available, we aim to compare the sd1 genes of these cultivars and redefine the semidwarf gene based on TN1 and IR8, the two direct descendants of DGWG.

If the greatest strength of TN1 is its high-yielding property due to its semi-dwarf stature from the sd1 gene, its weakness is its high susceptibility to the blast disease. Rice blast leads to a severe annual loss in rice production worldwide (Wang et al. 2014). However, plants have a natural defense against this and other pathogens, thanks to their resistance genes or R genes. Most R genes are composed of a nucleotide-binding site (NBS) domain and a leucine-rich repeat (LRR) domain (Takken and Joosten 2000). A combination of R genes in a plant may lead to a wide range of immunity response (Fukuoka et al. 2015). Unfortunately, TN1 is susceptible to major rice diseases like blast caused by the fungus Pyricularia oryzae (syn. Magnaporthe oryzae) (Sabbu et al. 2016) and the bacterial blight disease caused by the bacteria Xanthomonas oryzae pv. oryzae (Kumar et al. 2012). Predicting the R genes in the genome of TN1 will help understand the resistance profile of TN1, and why it is highly susceptible to blast. For factors that affect plant sensitivity to blast disease, see Chen et al. (2019), Liu et al. (2021), Nugroho et al. (2021) and Zhang et al. (2015).

There are in total 37,526 predicted genes in the TN1 genome (Panibe et al. 2021). Of these thousands of genes, some could be under the influence of positive selection (PS), conferring the cultivar certain advantages that could be related to TN1’s phenotypic characteristics like drought tolerance (Garg and Singh 1971; Garg et al. 2002). Mining the entire genome for genes that makes TN1 unique is no longer highly challenging, thanks to bioinformatics tools that automate the process of looking for positively selected (PS) genes such as PosiGene (Sahm et al. 2017). By using an input of coding sequences from the genomes of GR-related cultivars like IR8 (Stein et al. 2018), MH63 (Zhang et al. 2013) by using the protein alignments of sd1 and the information from their gff annotation (Nagano et al. 2005; Panibe et al. 2021). The range specified by the light blue arrow represents the sequences of sd1 in TN1 and IR8 that were validated by our Sanger sequencing. The 382 bp deletion in TN1 can be derived by computing the difference between 981 and 599, the latter of which represents the gene length of TN1 sd1 before its 2nd intron

Full size image

We further confirmed the sd1 gene sequence of TN by map** TN1 short reads used in the 3000 Rice Genomes Project (3 K RGP) (Wang et al. 2018b). There are actually two sets of TN1 reads in the 3000 Rice Genomes Project and they have the assay IDs, CX270 and CX162. The former has the name TAICHUNGNATIVE1, while the latter is designated as TN1. To determine which one better represents the sequencing reads from the 3000 Rice Genomes Project, we mapped the reads to the TN1 genome. CX162 has a 99.92% and 90.92%, for the overall map** rate and properly paired mapped reads, respectively. In contrast, CX270 has a map** rate of 99.40% and 81.91%. Based on the map** of reads, CX162 better represents the TN1 genome in the 3 K RGP.

We also checked the SNP-Seek database (Mansueto et al. 2017), if there are SNP loci inside the region corresponding to the sd1 deleted sequence in semi-dwarf cultivars. Of the two, TN1 (CX162) has missing SNP positions to deletion in japonica (Fig. 3a), whereas TAICHUNGNATIVE1 (CX270) has alleles on the same set of coordinates. (Additional file 1: Fig. S3). We further inspected the map** of the reads by viewing the sd1 region in Integrative Genomics Viewer (IGV) (Robinson et al. 2011), and they are shown in Fig. 3b (CX162) and Additional file 1: Fig. S4 (CX270). The nucleotide at chromosome 1 position 40,362,230 was supported by the TN1 reads of CX162 (Fig. 3c) and CX270 (Additional file 1: Fig. S5). The former’s reads better covered the position compared to the latter. In CX162, it is mapped by six reads, while in CX270 it is by only one read.

Predicted R genes in TN1

We annotated 383 NLR (nucleotide-binding domain leucine-rich repeat), 34 NB-ARC and 6 LRR (leucine-rich repeat) in the TN1 genome (Additional file 2: Dataset S1). For this purpose, we used the Tetep as a reference because Tetep is known to be highly resistant to blast disease and its genome and R genes have been well characterized (Wang et al. 2019b); indeed, it has been commonly used to breed for new blast resistant cultivars (Singh et al. 2012; Zarbafi and Ham, 2019; Ramalingam et al. 2020). The numbers of orthologues found between Tetep (Wang et al. 2019b) and TN1, MH63, R498 and Nipponbare did not show significant differences (Additional file 1: Table S1). Non-orthologous Tetep NLRs (R genes) were then blasted against the TN1 proteome using their NR-ARC domain protein sequences (Additional file 3: Dataset S2) and those hits with alignment identity < 50% were deemed missing in the TN1 assembly. One of the unfound R genes in TN1 is Pi54 (Pik-h), which is the gene chr11.fgenesh2107 in the assembled Tetep genome (Wang et al. 2019b). Pi54, originally cloned from Tetep, is known to confer broad-spectrum resistance to blast (Gupta et al. 2011; Rai et al. 2011; Thakur et al. 2015). Moreover, ~ 28 of the 90 NLR genes that were found to be resistant to one or more blast fungal strains (Wang et al. 2019b) were found to be missing or mutated in the TN1 genome (Additional file 3: Dataset S2).

By using the method of Mahesh et al. (2016), the set of 22 cloned blast R genes were searched in the TN1 genome. The results are given in Table 1 and those marked with an asterisk were the results different from Mahesh et al. (2016). These genes are confirmed to be present by Blastp in the Tetep genome with the same criteria used by Mahesh et al. (2016), i.e., e-value < 10e-10, identity ≥ 70% and query coverage ≥ 70% (Additional file 4: Dataset S3). The same set of R genes were also searched in the TN1 genome. Some of the R genes are present in TN1 but are mutated (Table 1), preventing the translation of the gene into the right protein. In the case of Tetep, both Wang et al. (2019b) and Mahesh et al. (2016) found the Pi-ta and Pi54 R genes in the blast resistant cultivar (see the Tetep column in Table 1).

Table 1 Distribution of cloned blast resistance genes in sequenced rice varieties

Full size table

Table 1. Distribution of cloned blast resistance genes in sequenced rice varieties. A + means present and a− means absent while M means mutated but with protein structure retained. *means a result different from Mahesh et al. (2016). These genes are confirmed to be present by blastp in the Tetep (Wang et al. 2019b) genome with e-value < 10e−10 and identity > = 70% (see Additional file 4: Dataset S3 for details). chr11.fgenesh2107.1 is a gene name from the Tetep genome annotation.

Haplotype analysis of the Pi-ta and Pi54 genes

To filter the missense variants, we compared each allele in the haplotypes of Pi54 and only obtained the SNPs that are heterozygous (see “Methods” section). Only three SNP positions were left and they are located on chromosome 11: 25,263,636; 25,264,119; and 25,264,164 (see Table 2). We checked their allele frequencies and found that the missense variants have a minimum allele frequency (MAF) of 36% to 39% (Table 2), suggesting that these missense variants are maintained across the rice populations, even though each causes a change in amino acid. Using the major/minor allele section in Table 2, we compared the three alleles of TN1 to the major and minor alleles. At SNP position 25,263,636, TN1 has two possibilities: either allele C or T (Table 2). If it is a T, it will be a minor allele across the 3,024 rice cultivars. The missense variant causes a Glu144Lys mutation (Additional file 6: Dataset S5), changing an acidic amino acid into a basic one. The change in charge of an amino acid could disrupt ionic interactions in the structure of the protein, which could affect its function, supporting our observation from Table 1 that the Pi54 gene in TN1 is missing when compared to the blast resistant Tetep cultivar.

Table 2 Pi54 alleles of blast susceptible and resistant cultivars at the same SNP position

Full size table

We also investigated the Pi-ta gene in SNP-Seek and it returned four haplotypes (Additional file 5: Dataset S4). This is the same number of haplotypes that Jia et al (2003) found. In that study, three of the haplotypes were related to susceptibility to blast and have five nucleotide positions that caused a non-synonymous mutation in Pi-ta. We checked the annotation of the SNPs in the 3 K RGP and found that the I6S mutation (Additional file 6: Dataset S5) was due to the replacement of the G nucleotide by a T at position 10,611,754 (Table 3). TN1 has the A allele at this position, which is the minor allele across the 3,024 cultivars in SNP-Seek. Consequently, TN1 is predicted to have the I6S mutation in its Pi-ta protein. From Table 3, the resistant cultivars Katy and Drew have the alleles T, G and C at positions 10,611,244; 10,611,297; 10,611,327, and an A at position 10,611,754.

Table 3 Pi-ta alleles of blast susceptible and resistant cultivars at the same SNP position

Full size table

However, the susceptible Nan**g 11 cultivar as well as TN1 has the pattern of alleles at the mentioned SNP positions similar to Katy and Drew. We did not see very clear difference between the haplotypes of the susceptible ones and the resistant ones for Pi-ta (Table 3) but for Pi54 the differences look like a bit clearer (Table 2). Pi54 is considered non-functional as the allele in TN1 (OsTN11t002257.1) lost the first 598 amino acids when compared to Tetep (Additional file 1: Fig. S6), resulted in complete loss of the NB-ARC domain. Of the 9 absent cloned NLRs (11 alleles shown in Table 1, which belong to 9 genes) in TN1, only 3 are from indica donors (other 6 might represent japonica/indica differences), including Pi54, Pid2 and Pi1-5. Pid2 is present in both susceptible and resistant cultivars, while Pid1-5 is absent in them all. Only Pi54 shows presence/absence polymorphism in resistant (Tetep and Tadukan) and susceptible (Co-39 and HR-12) cultivars (Mahesh et al. 2016). We further investigated the Pi-ta gene of TN1 by aligning it against its counterpart in Yashiro-mochi (a resistant cultivar). The protein sequence alignment of Pi-ta in TN1 is largely the same compared to the latter (Additional file 1: Fig. S7), suggesting that the function of the gene in TN1 is not largely altered.

Eleven genes in TN1 underwent positive selection

The aim of the genome-wide search for TN1 genes that underwent positive selection (PS) is to identify genes that might explain TN1’s phenotypic characteristics like high yield (Yoshida 1981), photoperiod insensitivity (Vergara and Chang 1985) and drought-tolerance (Garg and Singh 1971). The GO terms assigned to these PS genes would give insights into the biological processes involved as well as the enzymes that confer the function. We identified 11 TN1 genes that were likely subject to PS in TN1 in the past (Table 4).

Table 4 The 11 TN1 genes that underwent positive selection

Full size table

Using the Blast2GO annotation of the TN1 assembly (Panibe et al. 2021), a total of 35 GO terms (Additional file 1: Table S3) were assigned to six of the 11 PS genes (Table 4); see their representative GO terms in Fig. 4. For the Molecular Function (Fig. 4b), a correlation is observed between the protein names of the six genes and their GOs.

Discussion

sd1 has a 382-bp deletion in the semidwarf TN1

To redefine the sd1 gene, we first compared the sd1 genes of TN1, IR8, Nipponbare and the sequence by Monna et al. (2002) (Fig. 1). The alignment of TN1 and IR8 shows a 382 bp deletion, in contrast to Monna et al.’s (2002) 383 bp deletion (Fig. 1). The same observation was found in the sd1 gene of the parent DGWG (Nagano et al. 2005) cultivar, as well as two of its indirect descendants, MH63 (Wu et al. 2a. The italicized nucleotides (including the violet shaded adenine) is intron 1 of TN1 sd1. The colored codon tat (TN1 and IR8) is the synonymous codon of tac of Nipponbare, while codon cgg (TN1 and IR8) is a mutation for codon cag (Nipponbare), which changes amino acid Q (Nipponbare) to R (TN1 and IR8). The violet shaded adenine in codon 1 is the extra nucleotide that caused the 1 bp difference of the genomic deletion of TN1 and IR8 against the sd1 sequence of Monna et al. (2002)

Full size image

We validate the deletion in the sd1 gene sequences of TN1 and IR8 by Sanger sequencing. To get a fair comparison of the differentiated region, we sequence the region from the first nucleotide of exon 1 of TN1 and IR8 sd1 up to one-half the length of exon 1 of the semidwarf cultivars before the exon–intron boundary corresponding to position 981 of the Nipponbare gene structure; see blue arrows in Fig. 2. For TN1, the coordinates are chr 1:40,361,934–40,362,421 (Fig. 2a). For IR8, the region validated includes the 5′ untranslated region (UTR) (chr 1: 39,824,196–39,824,774) because that is part of IR8’s sd1 exon 1 as indicated in its gff annotation (Gramene 2020). In Fig. 2b, the 5′ UTR has become part of the exon gap. To validate that the differentiated regions really exist, we align via Clustal the Sanger sequences to the nucleotides extracted from the genome assemblies of TN1 and IR8. The resulting alignment shows that the sd1 sequence of TN1 is 100% identical to and 100% covered by the Sanger sequences (Additional file 1: Fig. S2). Likewise, the IR8 sd1 sequence from its genome matches its Sanger sequence. When the two Sanger sequences are compared, the TN1 has a perfect overlap with its IR8 counterpart, covering its entire 488 bp length. Because both TN1 and IR8 derived their semidwarf gene from their parent DGWG, the two cultivars should have the same form of the sd1 gene. This suggests that the untranslated region in the IR8 sd1 defined by its annotation is not really a UTR region, but an exon–intron-exon structure similar to TN1 (Fig. 2a). In lieu of this, we propose that the gene model of IR8 should follow that of TN1 and that the current annotation of the IR8 sd1 gene is in error.

We also compare the coding sequences of TN1, IR8 and Nipponbare. Spielmeyer et al. (2002) reported that the first 99 amino acids from the CDS of semidwarf Doongara, a descendant of DGWG, is similar to that of the Nipponbare, and that there is a 280 bp deletion in the coding sequence. Meanwhile, Monna et al. (2002) reported a 278 bp deletion in the expressed sequence of DGWG-type cultivars. The alignment in Fig. 5 indicates that there is 280 bp deletion in the coding sequence of TN1. We obtain this number by computing the difference between the length of the deletion (363 bp) in Fig. 5 and the length of first intron of TN1 sd1 (83 bp). There is also a frameshift mutation in the CDS of TN1 sd1 but this occurs at the junction of position 293 and position 294 (Fig. 5). However, the codon does not change because of the same guanine nucleotide at the start of exon 2, leading to the same valine amino acid.

The Pi54 resistance gene in TN1 is missing

TN1 is known to be highly susceptible to the blast fungus and the cultivar was used as a standard in searching for resistance genes (Sabbu et al. 2016). Using the SES (Standard Evaluation System) for Rice (International Rice Research Institute 2013), which designates a score of 0 to 9 with increments of 1 for the varying severity of the blast disease caused by Pyricularia oryzae. A score of 0 (no spots) to 1 (tiny dots) is considered highly resistant and score of 8 to 9 means highly susceptible (International Rice Research Institute 2013). The score is based on the size of the area damaged by the pathogen on the leaves. TN1 was given a score of 9, wherein 75% of the leaves succumb to P. oryzae, while Tetep was assigned a score of 1 against the blast fungus (Sabbu et al. 2016). Tetep harbors the R genes Pi-ta (Mahesh et al. 2016; Wang et al. 2019b), Pi54(Pik-h) (Sharma et al. 2005), and Pitp(t) (Barman et al. 2004). Thus, Tetep is a good reference in searching for blast R genes in TN1. From the list of predicted R genes in TN1, we looked for the orthologues of TN1 R genes in Tetep and catalogued any mutations between the orthologues. We narrowed down the list of blast R genes to check by using the set of resistance genes studied by Mahesh et al. (2016).

Of the two genes, we suspect the direct absence of Pi54 in TN1 (Table 1) to partly cause its blast susceptibility. The logic is simple: (1) we analyzed nearly all best functionally studied NLR genes in rice (i.e., the 22 genes), and only Pi54 shows presence/absence polymorphism between indica resistant (e.g., Tetep and Tadukan) and susceptible (e.g., HR-12 and Co-39) cultivars and is absent in TN1 (Table 1); (2) Pi54 confers broad spectrum resistance to blast disease, and is being used in some enhanced blast resistant breeding programs (Thakur et al. 2015). Haplotype analysis of 92 cultivars for the Pi54 gene revealed one haplotype out of 50, called H_3 that is composed of blast resistant indica cultivars (Thakur et al. 2015). We expanded the haplotype analysis for Pi54 by checking the SNP-Seek database (Mansueto et al. 2017), which contains data from pre-computed analysis of 3,024 rice cultivars aka the 3000 Rice Genomes Project (Wang et al. 2018b). However, instead of getting 50 haplotypes, the alleles of the 3 K RGP were grouped to only two haplotypes (Additional file 5: Dataset S4). Seventeen SNPS were missense variants (Additional file 6: Dataset S5).

Functions of the genes subjected to positive selection in TN1

For GA 3β-hydroxylase, it is gibberellin 3-beta-dioxygenase activity (GO:0016707). Probable TPP (trehalose-phosphate phosphatase) C has the function of trehalose-phosphatase activity (GO:0004805), while KARI, chloroplastic for ketol-acid reductoisomerase activity (GO:0004455), is involved in biosynthesis of branch chain amino acids valine (GO:0009099) and isoleucine (GO:0009097). For the transmembrane transporter activity (GO:0022857), it refers to the transmembrane protein 56 isoform X1 gene.

TPP (EC:3.1.3.12) and trehalose-6-phosphate synthase (TPS) (EC:2.4.1.15) are important enzymes in trehalose biosynthesis. TPP acts on the product of TPS, which is trehalose-6-phosphate (T6P), dephosphorylating it to produce the end-product trehalose, a disaccharide composed of two glucose molecules linked by an α(1 → 1) glycosidic bond.

Trehalose is a non-reducing sugar (Stick and Williams 2009), stable enough to become a natural anti-desiccant (Luyckx and Baudouin 2011). This property of trehalose was studied in a fusion gene of TPS and TPP in transgenic rice that led to an increase in trehalose, inducing the plants to become resistant to drought, sodicity and low temperatures (Garg et al. 2002). T6P has been associated with increased yield. In wheat, an increase in T6P led to an increase in yield through the inhibition of sucrose nonfermenting 1 (SNF1)-related protein kinase 1 (SnRK1), while in maize a decrease in T6P led to increased activity of SnRK1, leading to more sucrose transport and an increase in yield (Paul et al. 2018). TN1 is reported to be a drought-resistant cultivar (Garg and Singh 1971) as well as a high-yielding variety. This suggests that the OsTN8g001161 PS gene encoding for TPP could have played a role in this drought resistant, high-yield characteristic of TN1, either by an increase/decrease in T6P or through an enhanced production of trehalose.

The two GO terms protein serine/threonine kinase activity (GO:0004674) and transmembrane receptor protein serine/threonine kinase activity (GO:0004675) are synonymous to each other, and they refer to two different proteins, probable LRR receptor-like serine/threonine-protein kinase At3g47570 and L-type lectin-domain containing receptor kinase IX.1-like. The former is a type of leucine-rich repeat receptor like kinase (LRR-RLK), while the latter is commonly called LecRK or a lectin receptor kinase. The 309 LRR-RLKs in Nipponbare (Sun and Wang 2011) have a role in abiotic stress response (Dievart et al. 2016), while LecRKs are associated with plant immunity (Wang and Bouwmeester 2017). The BP GO terms (Additional file 1: Table S3) of defense response to oomycetes (GO:0002229) and defense response to bacterium (GO:0042742) support the notion of stress response to pathogens through the LecRK PS gene. However, the LecRK in TN1 could have other functions. In Nipponbare, OsLecRK is not only involved in immune response but also in seed germination (Cheng et al. 2013).

Although five of the PS genes have no assigned function (Additional file 1: Table S3), OsTN2g002903 and OsTN1g003572 were identified as a PLATZ transcription factor (TF) family protein and an armadillo/beta-catenin repeat protein-like, respectively. Previous studies have shown that PLATZ TF GL6 in rice affects grain size and number (Wang et al. 2013) was used. The input were the protein sequences of the genes (protein ids as fasta headers) aligned by Clustal (version 2.1, parameter: -output = FASTA -type = protein –align). The alignment file was uploaded to https://genepainter.motorprotein.de/genepainter, together with the segment of the gff annotation file containing the lines with the gene ids and transcript ids of the sd1 of Nipponbare, TN1, and IR8. For the Nipponbare vs TN1 gene structure models, two gff files were prepared and named as Os01t0883800-02.gff and OsTN1t004133.1.gff, the filenames matching the fasta headers in the alignment file. The same method was done for the Nipponbare vs IR8.

Sanger sequencing of the sd1 gene

Genomic DNA was extracted from young leaves of TN1 and IR8 with DNeasy Plant Mini Kit (Qiagen). Primers for amplifying sd1 gene were designed at the flanking regions about 150 bp upstream or downstream the target region. TN1 sd1 gene was amplified by forward primer (5′-ATGTCTGTCCAGTGGCAACC-3′) and reverse primer (5′-CTTGAATTACTTGTTCTGTTGCTTC-3′) and IR8 sd1 by forward primer (5′-ACCTTTAAACTTGGTCTAAAAGGATG-3′) and reverse primer (5′-GCTTGAATTACTTGTTCTGTTGC-3′) with ALLinTM Mega HiFi DNA polymerase (highQu). The result PCR products were purified with FB PCR Clean Up/ Gel Extraction Kit (Fair Biotech) and then sequenced by DNA Sequencing Core Facility of the Institute of Biomedical Sciences, Academia Sinica.

Comparison of the sd1 gene sequence against TN1 reads from the 3000 Rice Genomes Project

We first searched the SNP-Seek database (Mansueto et al. 2017) for any entry about the Taichung Native 1 cultivar by checking each results page and searching the page for key words like “Taichung” or “TN1”. We found two and they have the assay ids CX270 and CX162. The former has the entry “TAICHUNGNATIVE1”, while the latter is named as “TN1”. Alternatively, the search can be faster by doing this clicks in the SNP-Seek website: Home—> Download—> SNPs Analysis Files—> Variety drop down menu. We downloaded the Sequence Read Archive (SRA) reads associated with these entries in SNP-Seek and mapped them into the TN1 genome and checked their map** coverage via (IGV) (Robinson et al. 2011).

To download the SRA reads, we use fastq-dump of the SRA Toolkit (SRA Tools 2021) version v2.10.5 (command: fastq-dump –split-files < SRA Accession ID >) to retrieve them as paired-end reads. We trimmed the reads using Trimmomatic (Bolger et al. 2014) v0.39 (parameters: adapters.fa:2:30:10 LEADING:20 TRAILING:20 SLIDINGWINDOW:4:20 MINLEN:50 CROP:82). The trimmed paired-end reads were interleaved by BBTools (Bushnell 2021) v38.90 (command: reformat.sh in1 = read1.fastq in2 = read2.fastq out = out.fastq). The output reads were mapped via BWA (Li 2013) (version 0.7.17) mem (default options) into the TN1 genome. Each output sam file was converted into a bam file via Samtools (Li et al. 2009) version 1.9 (commands: samtools view -S -h -b -f 3 -F 12 -q 20; samtools sort -T tmp; samtools index), then the filtered bam files were combined as one file with the merge command (parameter: -f -h < sam file > –output-fmt BAM). The output bam file was sorted (command: samtools sort -T tmp) and indexed (default options). To get the map** in the sd1 region, the bam file was sliced (command: samtools view -b < input bam file > 'TN1_chr1:40,361,934–40,364,270'). The mentioned coordinates in chromosome 1 of TN1 represent the locus of the sd1 gene. The bam file was viewed in IGV (Eddy 1998) v2.10.0 through the Files tab and choosing Load from File. After loading the bam file, the TN1 genome fasta file was read in IGV through the Genomes tab and selecting Load Genome from File. Finally, the coordinates of the sd1 gene locus ('TN1_chr1:40,361,934–40,364,270) was inputted in the search box and Go was clicked.

To retrieve the SNP data of the sd1 gene, the Genotypes icon was clicked from the homepage of the SNP-Seek website. Os01g0883800 (RAPDB-ID of the sd1 gene) was inputted in the Gene locus section. A dropdown menu appeared and Os01g0883800 was selected and automatically the CHR1, 38382385, and 38385469 became the values for Chromosome, Start and End, respectively. Variety set was “3 k” and the SNP set was “3kfiltered”. In the options settings, Include Indels was checked. All other settings were default. The Search button was clicked. In the results table, the Subpopulation column title was clicked.

Prediction of NBS-LRR genes

To detect the NBS-LRR genes of TN1, the proteins extracted from the gff annotation file and genome of TN1, via gffread (Pertea and Pertea 2020) (default options) were used as input in hmmscan (Eddy 1998), (via HMMER, version 3.1b2) and NLR-parser (Steuernagel et al. 2015), version 1.0 (Additional file 1: Fig. S8). By using the Pfam 30.0 database (El-Gebali et al. 2018), domains of the TN1 proteins were predicted. The domain table output contained the list of predicted functional domain of each protein with their identified locations.

NLR-parser (Steuernagel et al. 2015) looked for R genes in the TN1 proteins based on a pre-defined set of NLR domains, classifying them either as an NB-ARC, LRR or NB-LRR. NLR-Parser needs an xml file as input and this was produced by the mast (Bailey and Gribskov 1998) [MEME Suite (Bailey et al. 2009), version 4.9.1] tool. The resulting output xml file became the input for the NLR-Parser run. The results of the hmmscan and NLR-parser runs were integrated and saved as TN1_genome.maker.pass2.maker_proteins.rename.NLRs.csv.

Search for Tetep NLR orthologues in TN1

Input data was prepared by extracting first the coding sequence (CDS) of TN1, followed by getting the CDS of NB-ARC domains. Finally, the protein sequences of the NB-ARC domains were finally obtained. The protein fasta file served as the input file in the OrthoFinder (Emms and Kelly 2019) run.

To investigate why TN1 is susceptible to blast disease, the NB-ARC fasta files of Tetep were blasted to TN1 via OrthoFinder (Emms and Kelly 2019), version 2.2.7, blastp (Camacho et al. 2009) (NCBI BLAST + , version 2.3.0) and blastn (Camacho et al. 2009) (NCBI BLAST + , version 2.3.0) (Additional file 1: Fig. S9). OrthoFinder searched for orthologues of the TN1 NB-ARC domains against the NB-ARC domains of Tetep, Nipponbare, MH63 and R498. Results were organized and saved as TN1_Orthologues.ortho_pairs.csv. Identified orthologues between TN1 and Tetep were considered found with respect to TN1. If no orthologue was found, Tetep NB-ARC was aligned via blastp (Camacho et al. 2009) (parameter: -evalue 1e-10; output filename: Tetep_NBS_domain_protein.blastp.TN1_marker_protein.csv) against the set of TN1 proteins. The best results were saved as Tetep_NBS_domain_protein.blastp.TN1_marker_protein.best.csv. An R gene was found if it had more than 50% alignment coverage. The process was repeated for the NB-ARC of Tetep against MH63, R498 and Nipponbare.

The results of the map** of the tested NLRs were saved as Tetep_NBS_domain_protein.blastp.TN1_marker_protein.best.tested.csv. The file Tetep_tested.csv was derived from Table S6 results of the Wang et al. (2019b) study and is available at https://doi.org/10.6084/m9.figshare.14546724. The file has two columns. Column 1 is Gene ID of the Tetep R gene. Column 2 follows this notation, Receptor:Total_Tested:Resistant:Susceptible. Receptor refers to either of TP309 or Shin2 as the receptor cultivars of the R gene. Total_Tested refers to the number of tested blast strains for the receptor cultivar, while Resistant and Susceptible are the counts of being resistant or susceptible to the pathogen. To confirm that whether those "absent" genes are deleted or possibly not coding anymore, we blasted Tetep NB-ARC domain CDS against TN1 genome via blastn. The results were organized and saved as Tetep_NBS_domain_cds.blastn.TN1_genome.best.csv.

To check the effect of sequence variation, the TN1 genome and the Tetep genome were aligned via MUMmer (Kurtz et al. 2004), version 3.23. Only unique alignments were used in variants calling. Effects of variants were predicted using snpEff (Cingolani et al. 2012), version 4.3o (parameter: -ud 2000, input: Tetep_v_TN1.nucmer1.filter.vcf.gz, output: Tetep_v_TN1.nucmer1.filter.snpEff.vcf): The output file was parsed by sum_snpEff.pl and map_records.pl script, and the results were saved as Tetep.NBS_genes.TN1_nucmer1.snpEff.csv.

The commands used in the search for Tetep NLR orthologues in TN1 are available at https://doi.org/10.6084/m9.figshare.14555598 as the Search_for_Tetep_NLR_orthologues_in_TN1_script.sh file. The code for sum_snpEff.v1.0.1.pl and for the prediction of NBS-LRR genes, and in the detection of presence or absence of blast R genes in TN1, is also available on the same Figshare link. The gff2fasta.pl, map_records.pl, and extract_split_seqs.pl scripts are available at https://github.com/wl13/BioScripts. For the mummer2Vcf.pl script, it is hosted at https://github.com/douglasgscofield/bioinfo/blob/master/scripts/.

A Chi-squared test was done using R (R Core Team 2021), to test whether the ratio of resistant/non-resistant NLR orthologues of TN1 were significantly different as compared to Tetep (Additional file 1: Table S2). R (version 3.6.1) command: chisq.test(c(69,170–69), p = c(90/219,1–90/219)). Result of the R command: X-squared = 0.018098, df = 1, p-value = 0.893.

Detection of presence or absence of blast R genes in TN1

The study of Mahesh et al. (2016) was repeated for TN1 to detect whether a specific blast R gene was present or absent. Twenty-two cloned blast NLR protein sequences (Pib, Pi-ta, Pi54(Pik-h), Pid2, Pi9, Piz-t, Pi37, Pi36, Pik-m, pi21, Pit, Pi5, Pid3, Pb1, Pish, Pi25, Pia(RGA4), Pik-p, Pik, Pi54rh, Pi1, Pi64) were aligned via blastp (Camacho et al. 2009), version (NCBI BLAST + , version 2.3.0, parameter: -evalue 1e−10) to the TN1 protein sequences, and also by tblastn (Camacho et al. 2009), (NCBI BLAST + , version 2.3.0, parameter: -evalue 1e-10) against the TN1 genome to detect similar protein sequences. We get the hits which have an e-value < 10e−10 and identity ≥ 70%. The same method was applied to the Tetep proteins and genome sequence.

Missing R genes were denoted by a− sign and those that are found are given by a + mark, provided that the alignment sequences showed high similarity. An R gene was classified as mutated if there was a disagreement with the alignment, or the blastn best hit was better than the blastp result.

To find the Nipponbare orthologs of the TN1 blast R genes, OrthoFinder (Emms and Kelly 2019), version 2.3.11 (parameter: -S blast) was executed against the Nipponbare proteome from RAP-DB (Rice Annotation Project Database) (Sakai et al. 2013).

Haplotype analysis using data from the 3000 Rice Genomes Project

Haplotype analysis of the Pi-ta and Pi54 genes were done in the SNP-Seek database (Mansueto et al. 2017) using the 3 k filtered dataset. The objective is to get the haplotypes of the two genes. Starting from homepage of SNP-Seek, Genotype was clicked. Inputs in the Gene locus were the RAP-DB IDs of Pi-ta and Pi54. These were Os12g0281300 and Os11g0639100, respectively. In the options, Include Indels was also selected, while all other settings were default before executing the search. For Pi-ta, it resulted in a set of 3024 varieties with 42 SNP and 127 INDEL positions, while for Pi54 it was 3024 varieties with 46 SNP and 24 INDEL positions. From the Table view of the results, the Haplotype tab was selected. The resulting haplotypes were regrouped using the autogroup and pamk options. Results about the variety order and grou** of the alleles were downloaded.

From the study of Jia et al. (2003), Wang et al. (2008) and Thakur et al. (2015), a list of resistant and susceptible cultivars to blast disease harboring the Pi-ta or Pi54 gene were gathered. Each of the cultivars was checked to see whether the SNPs were listed in the SNP-Seek Database. To know whether they are in SNP-Seek, these series of clicks were done: Home—> Download—> SNPs Analysis Files. Another way is to check the variety order tab of Additional file 5: Dataset S4. All possible combinations of naming the cultivar were tried for those containing numbers. For example, NANJING 11 was searched as NANJING11, NANJING-11 or NANJING 11. Keywords were also tried; e.g., for the cultivar PUSA BUSMATI 1, the query used was BASMATI and one of the hits was PUSA (BASMATI 1). The important/causal SNPs related to susceptibility (Jia et al. 2003; Wang et al. 2008; Thakur et al. 2015) were checked on the SNP effects data in Additional file 6: Dataset S5 to find any similarity.

To build Tables 2 and 3, the following series of steps were followed: (1) Get haplotypes of Pi54 and Pi-ta; (2) find the cultivars from (Jia et al. 2003; Wang et al. 2008; Thakur et al. 2015) in SNP-Seek; (3) from the haplotypes, get the nucleotide position in which the SNPs are different (heterozygous) across all haplotype group; (4) list the heterozygous alleles for each cultivar; (5) list the number of mismatch SNPs per cultivar from the variety order tab of Additional file 5: Dataset S4; (6) list the alleles in the heterozygous SNP positions for each haplotype group; (7) get the major and minor alleles and minimum allele frequency, from the graph portion of the tabular results of SNP-Seek, by clicking the line graph to find the right SNP position and see the information sought.

We were not able to find Tetep in the list of cultivars included in the 3 K RGP so to get the SNPs of Tetep, its chromosome 11 (containing Pi54) and tig00012489 (containing Pi-ta) were aligned against their equivalent chromosomes in Nipponbare containing the said R genes. This was done via nucmer (default options) of the MUMmer version 4. The output delta file was used as an input in the show-snps (parameter: -C, default options) command. To get the alleles of Tetep, those corresponding to the coordinates of Nipponbare indicated in Tables 2 and 3 were checked. If the coordinate was not found in the output show-snps, then the reference allele and the Tetep allele were assumed to be the same.

Clustal alignment was done for the Pi54 and Pi-ta protein sequences of TN1 against Tetep (Pi54) and Yashiro-mochi (Pi-ta). The alignment file was viewed in Jalview (Waterhouse et al. 2009), version 2.11.1.4. Protein identifiers/GenBank accession numbers of the input protein sequences were: OsTN11t002257.1 for TN1 Pi54; chr11.fgenesh2107.1 for Tetep Pi54; OsTN12t001092.1 for TN1 Pi-ta; ACY25067.1 for Yashiro-mochi Pita. Creation of the images for the Clustal alignment were similar to the method done by Panibe et al. (2021).

Detection of genes subjected to positive selection in TN1

Coding sequences from 24 plant genomes (Additional file 1: Table S4) were used as input in PosiGene (Sahm et al. 2017), version 0.1 (parameters: -as = TN1 -rs = TN1 -ts = TN1 -nhsbr) to detect positive selection. Fifteen rice varieties or species were: five indica cultivars (TN1, IR8, MH63, IR64 and 9311), the Nipponbare reference genome, two wild species of the indica cultivar (O. rufipogon and O. nivara) plus seven non-Oryza sativa species (O. barthii, O. brachyantha, O. glaberrima, O. glumipatula, O. punctata, O. meridionalis, and O. longistaminata). Nine members of the grass family (Brachypodium distachyon, Eragrostis tef, Leersia perrieri, Panicum hallii fil2, Panicum hallii hal2, Setaria italica, Sorghum bicolor, Triticum aestivumm, Zea mays) were used as outgroups. This was to prevent the TN1 genome from becoming the last common ancestor in the species tree that PosiGene would create. The CDSs of TN1 and IR64 were extracted from their gff file via gffread (Pertea and Pertea 2020) (default options). The CDSs of the other cultivars were downloaded directly; see Additional file 1: Table S4. Fasta headers were processed to follow an “isoform|gene” name format (ex. gene1.1|gene1) as required by PosiGene. This helped the software identify which isoforms were from the same gene. The tool was executed with TN1 as the as (anchor species) (most complete set of genes), rs (reference species) (basis for orthologue assignment), and ts (target species) (branch to test). This was to make sure that all the TN1 genes were tested for positive selection. The HomoloGene file for rice, which PosiGene recommends, was not used because it was based on Build 4.0 of Nipponbare, which is a japonica cultivar and outdated. The instructions in the PosiGene manual were followed to run the PosiGene.pl perl script. In the results output of PosiGene, those with FDR < 0.05 are PS genes.

PosiGene command.

The PosiGene command below is for testing the branch leading to TN1 only:

perl PosiGene.pl -o = TN1_GRgenes -as = TN1 -rs = TN1:folder/TN1_cds.fasta -tn = 32 -ts = TN1 \

-nhsbr = TN1:folder/TN1_cds.fasta, \

IR64:folder/IR64_cds.fasta, \

IR8:folder/oryza_indicair8_cds.fasta, \

O_rufipogon:folder/oryza_rufipogon_cds.fasta, \

O_nivara:folder/Oryza_nivara_cds.fasta, \

MH63:folder/MH63_cds.fasta, \

Nipponbare:folder/IRGSP_cds.fasta,\

O_barthii:folder/Oryza_barthii_cds.fasta, \

O_brachyantha:folder/Oryza_brachyantha_cds.fasta, \

O_glaberrima:folder/Oryza_glaberrima_cds.fasta, \

O_glumipatula:folder/Oryza_glumipatula_cds.fasta, \

9311:folder/Oryza_indica_cds.fasta, \

O_longistaminata:folder/Oryza_longistaminata_cds.fasta, \

O_meridionalis:folder/Oryza_meridionalis_cds.fasta, \

O_punctata:folder/Oryza_punctata_cds.fasta, \

Brachypodium_distachyon:folder/Brachypodium_distachyon_cds.fasta, \

Eragrostis_tef:folder/Eragrostis_tef_cds.fasta,\

Leersia_perrieri:folder/Leersia_perrieri_cds.fasta, \

Panicum_hallii_fil2:folder/Panicum_hallii_fil2_cds.fasta, \

Panicum_hallii_hal2:folder/Panicum_hallii_hal2_cds.fasta, \

Setaria_italica:folder/Setaria_italica_cds.fasta, \

Sorghum_bicolor:folder/Sorghum_bicolor_cds.fasta, \

Triticum_aestivum:folder/Triticum_aestivum_cds.fasta, \

Zea_mays:folder/Zea_mays_cds.fasta.

The IR8 genome was also scanned via PosiGene using the same set of CDSs to detect any PS genes in TN1 (parameters: -as = IR8 -rs = IR8 -ts = IR8 -nhsbr). Unfortunately, no IR8 gene got an FDR < 0.05.

REVIGO visualization of GO Terms

The GO terms of the TN1 genes under positive selection were visualized using the REVIGO (Supek et al. 2011) website (http://revigo.irb.hr/, accessed May 31, 2020). The inputs in REVIGO were the list of GO terms of all the proteins of the PS gene (Additional file 1: Table S3). The online tool clustered the GO terms and selected the representative terms based on the cut-off value of similarity (also called dispensability), which is based on semantic distance computed by the SimRel algorithm. Settings for the PosiGene result: database with GO term sizes: whole UniProt; semantic similarity measure: SimRel; similarity cut-off value: 0.7.

Because the output is online, clicking the scatterplot will reveal the actual value of uniqueness when the user hovers their mouse pointer on a specific sphere. The tabular output listing all the inputted GO terms, their grou** as well as their corresponding frequency, uniqueness and dispensability values were downloaded from the website.

Availability of data and materials

The genomes used in the study have the following GenBank accession numbers TN1 (GCA_018853525.1), IR8 (GCA_001889745.1), Tetep (GCA_004348155.2), MH63 (GCA_001623365.2), R498 (GCA_002151415.1), Nipponbare (GCA_001433935.1). The TN1 reads from the 3000 Rice Genome Sequencing Project assay CX270 have the SRA accession numbers: ERX576687, ERX576688, ERX576689, ERX576690, ERX576691, ERX576692, ERX576693, ERX576694, ERX576695, ERX576696, ERX576697, ERX576698, ERX576699, ERX576700. For assay CX162, the SRA accession numbers are ERX592032, ERX592033, ERX592034, ERX592035, ERX592036, ERX592037, ERX592038, ERX592039, ERX592040, ERX592041, ERX592042, ERX592043. Sources links of the coding sequences used in the PosiGene analysis are in Additional file 1: Table S4. The gff annotation file of IR8 is available at the Gramene, http://ftp.gramene.org/oge/release-3/gff3/oryza_indicair8/, while for R498 it is available at MBKBase, http://mbkbase.org/R498/. The Blast2GO annotation file of the genes of TN1 and IR8 are available at Figshare, https://doi.org/10.6084/m9.figshare.13010333. TN1’s gff annotation file can also be found at the Figshare link previously mentioned. The gff annotation file of Tetep is available at https://doi.org/10.6084/m9.figshare.7775810.v1. The datasets supporting the conclusions of this article are included within the article and its additional files.

References

Amador V, Monte E, Garcı́a-Martı́nez J-L, Prat S (2001) Gibberellins signal nuclear import of PHOR1, a photoperiod-responsive protein with homology to Drosophila armadillo. Cell 106:343–354. https://doi.org/10.1016/s0092-8674(01)00445-7
Article CAS PubMed Google Scholar
Bailey TL, Gribskov M (1998) Combining evidence using p-values: application to sequence homology searches. Bioinformatics 14:48–54. https://doi.org/10.1093/bioinformatics/14.1.48
Article CAS PubMed Google Scholar
Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, Ren J, Li WW, Noble WS (2009) MEME SUITE: tools for motif discovery and searching. Nucleic Acids Res 37:W202–W208. https://doi.org/10.1093/nar/gkp335
Article CAS PubMed PubMed Central Google Scholar
Barman SR, Gowda M, Venu RC, Chattoo BB (2004) Identification of a major blast resistance gene in the rice cultivar ‘Tetep.’ Plant Breed 123:300–302. https://doi.org/10.1111/j.1439-0523.2004.00982.x
Article CAS Google Scholar
Bolger AM, Lohse M, Usadel B (2014) Trimmomatic: a flexible trimmer for illumina sequence data. Bioinformatics 30:2114–2120. https://doi.org/10.1093/bioinformatics/btu170
Article CAS PubMed PubMed Central Google Scholar
Bushnell B (2021) BBMap. https://sourceforge.net/projects/bbmap/. Accessed 4 Feb 2021
Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL (2009) BLAST+: architecture and applications. BMC Bioinformatics 10:421. https://doi.org/10.1186/1471-2105-10-421
Article CAS PubMed PubMed Central Google Scholar
Chandler RF Jr (1992) An adventure in applied science: a history of the International Rice Research Institute. International Rice Research Institute, Los Baños, pp 51–116
Google Scholar
Chen X, Jia Y, Wu BM (2019) Evaluation of rice responses to the blast fungus Magnaporthe oryzae at different growth stages. Plant Dis 103:132–136. https://doi.org/10.1094/PDIS-12-17-1873-RE
Article CAS PubMed Google Scholar
Cheng X, Wu Y, Guo J, Du B, Chen R, Zhu L, He G (2013) A rice lectin receptor-like kinase that is involved in innate immune responses also contributes to seed germination. Plant J 76:687–698. https://doi.org/10.1111/tpj.12328
Article CAS PubMed PubMed Central Google Scholar
Cingolani P, Platts A, Wang LL, Coon M, Nguyen T, Wang L, Land SJ, Lu X, Ruden DM (2012) A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w¹¹¹⁸; iso-2; iso-3. Fly 6:80–92. https://doi.org/10.4161/fly.19695
Article CAS PubMed PubMed Central Google Scholar
Coates JC, Laplaze L, Haseloff J (2006) Armadillo-related proteins promote lateral root development in Arabidopsis. Proc Natl Acad Sci USA 103:1621–1626. https://doi.org/10.1073/pnas.0507575103
Article CAS PubMed PubMed Central Google Scholar
Dievart A, Perin C, Hirsch J, Bettembourg M, Lanau N, Artus F, Bureau C, Noel N, Droc G, Peyramard M, Pereira S, Courtois B, Morel J-B, Guiderdoni E (2016) The phenome analysis of mutant alleles in leucine-rich repeat receptor-like kinase genes in rice reveals new potential targets for stress tolerant cereals. Plant Sci 242:240–249. https://doi.org/10.1016/j.plantsci.2015.06.019
Article CAS PubMed Google Scholar
Eddy SR (1998) Profile hidden Markov models. Bioinformatics 14:755–763. https://doi.org/10.1093/bioinformatics/14.9.755
Article CAS PubMed Google Scholar
El-Gebali S, Mistry J, Bateman A, Eddy SR, Luciani A, Potter SC, Qureshi M, Richardson LJ, Salazar GA, Smart A, Sonnhammer ELL, Hirsh L, Paladin L, Piovesan D, Tosatto SCE, Finn RD (2018) The Pfam protein families database in 2019. Nucleic Acids Res 47:D427–D432. https://doi.org/10.1093/nar/gky995
Article CAS PubMed Central Google Scholar
Emms DM, Kelly S (2019) OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol 20:238. https://doi.org/10.1186/s13059-019-1832-y
Article PubMed PubMed Central Google Scholar
Fukuoka S, Saka N, Mizukami Y, Koga H, Yamanouchi U, Yoshioka Y, Hayashi N, Ebana K, Mizobuchi R, Yano M (2015) Gene pyramiding enhances durable blast disease resistance in rice. Sci Rep 5:7773. https://doi.org/10.1038/srep07773
Article CAS PubMed PubMed Central Google Scholar
García-Martinez JL, Gil J (2001) Light regulation of gibberellin biosynthesis and mode of action. J Plant Growth Regul 20:354–368. https://doi.org/10.1007/s003440010033
Article CAS PubMed Google Scholar
Garg OK, Singh BP (1971) Physiological significance of ascorbic acid in relation to drought resistance in rice (Oryza sativa L.). Plant Soil 34:219–223
Article CAS Google Scholar
Garg AK, Kim J-K, Owens TG, Ranwala AP, Choi YD, Kochian LV, Wu RJ (2002) Trehalose accumulation in rice plants confers high tolerance levels to different abiotic stresses. Proc Natl Acad Sci USA 99:15898–15903. https://doi.org/10.1073/pnas.252637799
Article CAS PubMed PubMed Central Google Scholar
Gramene (2020) http://www.gramene.org/. Accessed 10 May 2020
Gupta SK, Rai AK, Kanwar SS, Chand D, Singh NK, Sharma TR (2011) The single functional blast resistance gene Pi54 activates a complex defence mechanism in rice. J Exp Bot 63:757–772. https://doi.org/10.1093/jxb/err297
Article CAS PubMed Google Scholar
Hammesfahr B, Odronitz F, Mühlhausen S, Waack S, Kollmar M (2013) GenePainter: a fast tool for aligning gene structures of eukaryotic protein families, visualizing the alignments and map** gene structures onto protein structures. BMC Bioinformatics 14:77. https://doi.org/10.1186/1471-2105-14-77
Article PubMed PubMed Central Google Scholar
Hargrove TR, Coffman WR, Cabanilla VL (1979) Genetic interrelationships of improved rice varieties in Asia. International Rice Research Institute, Manila, pp 2–10
Google Scholar
International Rice Genome Sequencing Project, Sasaki T (2005) The map-based sequence of the rice genome. Nature 436:793–800. https://doi.org/10.1038/nature03895
Article CAS Google Scholar
International Rice Research Institute (IRRI) (2013) Standard evaluation system for rice, 5th edn. International Rice Research Institute, Manila, pp 2–18
Google Scholar
Itoh H, Ueguchi-Tanaka M, Sakamoto T, Kayano T, Tanaka H, Ashikari M, Matsuoka M (2002) Modification of rice plant height by suppressing the height-controlling gene, D18. Rice Breed Sci 52:215–218. https://doi.org/10.1270/jsbbs.52.215
Article CAS Google Scholar
Jia Y, Bryan GT, Farrall L, Valent B (2003) Natural variation at the Pi-ta rice blast resistance locus. Phytopathology 93:1452–1459. https://doi.org/10.1094/PHYTO.2003.93.11.1452
Article CAS PubMed Google Scholar
Jia X, Yu L, Tang M, Tian D, Yang S, Zhang X, Traw MB (2020) Pleiotropic changes revealed by in situ recovery of the semi-dwarf gene sd1 in rice. J Plant Physiol 248:153141. https://doi.org/10.1016/j.jplph.2020.153141
Article CAS PubMed Google Scholar
Kawahara Y, de la Bastide M, Hamilton JP, Kanamori H, McCombie WR, Ouyang S, Schwartz DC, Tanaka T, Wu J, Zhou S, Childs KL, Davidson RM, Lin H, Quesada-Ocampo L, Vaillancourt B, Sakai H, Lee SS, Kim J, Numa H, Itoh T, Buell CR, Matsumoto T (2013) Improvement of the Oryza sativa Nipponbare reference genome using next generation sequence and optical map data. Rice 6:4. https://doi.org/10.1186/1939-8433-6-4
Article PubMed PubMed Central Google Scholar
Kumar PN, Sujatha K, Laha GS, Rao KS, Mishra B, Viraktamath BC, Hari Y, Reddy CS, Balachandran SM, Ram T, Madhav MS, Rani NS, Neeraja CN, Reddy GA, Shaik H, Sundaram RM (2012) Identification and fine-map** of Xa33, a novel gene for resistance to Xanthomonas oryzae pv. oryzae. Phytopathology 102:222–228. https://doi.org/10.1094/PHYTO-03-11-0075
Article CAS PubMed Google Scholar
Kurtz S, Phillippy A, Delcher AL, Smoot M, Shumway M, Antonescu C, Salzberg SL (2004) Versatile and open software for comparing large genomes. Genome Biol 5:R12. https://doi.org/10.1186/gb-2004-5-2-r12
Article PubMed PubMed Central Google Scholar
Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, Thompson JD, Gibson TJ, Higgins DG (2007) Clustal W and Clustal X version 2.0. Bioinformatics 23:2947–2948. https://doi.org/10.1093/bioinformatics/btm404
Article CAS PubMed Google Scholar
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup (2009) The sequence alignment/map format and SAMtools. Bioinformatics 25:2078–2079. https://doi.org/10.1093/bioinformatics/btp352
Article CAS PubMed PubMed Central Google Scholar
Li H (2013) Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. https://arxiv.org/abs/1303.3997. Accessed 7 May 2021
Li Q, Wang J, Ye J, Zheng X, **ang X, Li C, Fu M, Wang Q, Zhang Z, Wu Y (2017) The maize imprinted gene Floury3 encodes a PLATZ protein required for tRNA and 5S rRNA transcription through interaction with RNA polymerase III. Plant Cell 29:2661–2675. https://doi.org/10.1105/tpc.17.00576
Article CAS PubMed PubMed Central Google Scholar
Liu L-W, Hsieh S-H, Lin S-J, Wang Y-M, Lin W-S (2021) Rice blast (Magnaporthe oryzae) occurrence prediction and the key factor sensitivity analysis by machine learning. Agronomy 11:771. https://doi.org/10.3390/agronomy11040771
Article CAS Google Scholar
Luyckx J, Baudouin C (2011) Trehalose: an intriguing disaccharide with potential for medical application in ophthalmology. Clin Ophthalmol 5:577–581. https://doi.org/10.2147/OPTH.S18827
Article CAS PubMed PubMed Central Google Scholar
Ma J, Lei C, Xu X, Hao K, Wang J, Cheng Z, Ma X, Ma J, Zhou K, Zhang X, Guo X, Wu F, Lin Q, Wang C, Zhai H, Wang H, Wan J (2015) Pi64, encoding a novel CC-NBS-LRR protein, confers resistance to Leaf and neck blast in rice. Mol Plant Microbe Interact 28:558–568. https://doi.org/10.1094/MPMI-11-14-0367-R
Article CAS PubMed Google Scholar
Mahesh HB, Shirke MD, Singh S, Rajamani A, Hittalmani S, Wang GL, Gowda M (2016) Indica rice genome assembly, annotation and mining of blast disease resistance genes. BMC Genomics 17:242. https://doi.org/10.1186/s12864-016-2523-7
Article CAS PubMed PubMed Central Google Scholar
Mansueto L, Fuentes RR, Borja FN, Detras J, Abriol-Santos JM, Chebotarov D, Sanciangco M, Palis K, Copetti D, Poliakov A, Dubchak I, Solovyev V, Wing RA, Hamilton RS, Mauleon R, McNally KL, Alexandrov N (2017) Rice SNP-seek database update: new SNPs, indels, and queries. Nucleic Acids Res 45:D1075–D1081. https://doi.org/10.1093/nar/gkw1135
Article CAS PubMed Google Scholar
Marçais G, Delcher AL, Phillippy AM, Coston R, Salzberg SL, Zimin A (2018) MUMmer4: a fast and versatile genome alignment system. PLoS Comput Biol 14:e1005944. https://doi.org/10.1371/journal.pcbi.1005944
Article CAS PubMed PubMed Central Google Scholar
Monna L, Kitazawa N, Yoshino R, Suzuki J, Masuda H, Maehara Y, Tanji M, Sato M, Nasu S, Minobe Y (2002) Positional cloning of rice semidwarfing gene, sd-1: rice “green revolution Gene” encodes a mutant enzyme involved in gibberellin synthesis. DNA Res 9:11–17. https://doi.org/10.1093/dnares/9.1.11
Article CAS PubMed Google Scholar
Nagano H, Onishi K, Ogasawara M, Horiuchi Y, Sano Y (2005) Genealogy of the “Green Revolution” gene in rice. Genes Genet Syst 80:351–356. https://doi.org/10.1266/ggs.80.351
Article CAS PubMed Google Scholar
Nugroho C, Raharjo D, Mustaha MA, Asaad M (2021) Assessing disease severity of rice blast under different rates of nitrogen fertilizer and planting system. E3S Web Conf 306:1034
Article CAS Google Scholar
Panibe JP, Wang L, Li J, Li M-Y, Lee Y-C, Wang C-S, Ku MSB, Lu M-YJ, Li W-H (2021) Chromosomal-level genome assembly of the semi-dwarf rice Taichung Native 1, an initiator of Green Revolution. Genomics 113:2656–2674. https://doi.org/10.1016/j.ygeno.2021.06.006
Article CAS PubMed Google Scholar
Paul MJ, Gonzalez-Uriarte A, Griffiths CA, Hassani-Pak K (2018) The role of trehalose 6-phosphate in crop yield and resilience. Plant Physiol 177:12–23. https://doi.org/10.1104/pp.17.01634
Article CAS PubMed PubMed Central Google Scholar
Pertea G, Pertea M (2020) GFF utilities GffRead and GffCompare. F1000Res 9:304. https://doi.org/10.12688/f1000research.23297.2
Article Google Scholar
R Core Team (2021) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/. Accessed 7 May 2021
Rai AK, Kumar SP, Gupta SK, Gautam N, Singh NK, Sharma TR (2011) Functional complementation of rice blast resistance gene Pi-k^h (Pi54) conferring resistance to diverse strains of Magnaporthe oryzae. J Plant Biochem Biotechnol 20:55–65. https://doi.org/10.1007/s13562-010-0026-1
Article CAS Google Scholar
Ramalingam J, Raveendra C, Savitha P, Vidya V, Chaithra TL, Velprabakaran S, Saraswathi R, Ramanathan A, Arumugam Pillai MP, Arumugachamy S, Vanniarajan C (2020) Gene pyramiding for achieving enhanced resistance to bacterial blight, blast, and sheath blight diseases in rice. Front Plant Sci 11:591457. https://doi.org/10.3389/fpls.2020.591457
Article PubMed PubMed Central Google Scholar
Reinecke DM, Wickramarathna AD, Ozga JA, Kurepin LV, ** AL, Good AG, Pharis RP (2013) Gibberellin 3-oxidase gene expression patterns influence gibberellin biosynthesis, growth, and development in pea. Plant Physiol 163:929–945. https://doi.org/10.1104/pp.113.225987
Article CAS PubMed PubMed Central Google Scholar
Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, Mesirov JP (2011) Integrative genomics viewer. Nat Biotechnol 29:24–26. https://doi.org/10.1038/nbt.1754
Article CAS PubMed PubMed Central Google Scholar
Sabbu S, Pandey MK, Reddy B, Shaik H, Kumar SV, Kousik MBVN, Bhadana VP, Madhav MS, Kota S, SubbaRao LV, Kumaraswamy M, Giri A, Narasu BL, Rani NS, Sundaram RM (2016) Introgression of major bacterial blight and blast resistant genes into Vallabh Basmati 22 an elite Basmati variety. Int J Dev Res 6:8366–8370
Sahm A, Bens M, Platzer M, Szafranski K (2017) PosiGene: automated and easy-to-use pipeline for genome-wide detection of positively selected genes. Nucleic Acids Res 45:e100. https://doi.org/10.1093/nar/gkx179
Article CAS PubMed PubMed Central Google Scholar
Sakai H, Lee SS, Tanaka T, Numa H, Kim J, Kawahara Y, Wakimoto H, Yang CC, Iwamoto M, Abe T, Yamada Y, Muto A, Inokuchi H, Ikemura T, Matsumoto T, Sasaki T, Itoh T (2013) Rice annotation project database (RAP-DB): an integrative and interactive database for rice genomics. Plant Cell Physiol 54:e6. https://doi.org/10.1093/pcp/pcs183
Article CAS PubMed PubMed Central Google Scholar
Sharma TR, Madhav MS, Singh BK, Shanker P, Jana TK, Dalal V, Pandit A, Singh A, Gaikwad K, Upreti HC, Singh NK (2005) High-resolution map**, cloning and molecular characterization of the Pi-kh gene of rice, which confers resistance to Magnaporthe grisea. Mol Genet Genomics 274:569–578. https://doi.org/10.1007/s00438-005-0035-2
Article CAS PubMed Google Scholar
Sharma M, Singh A, Shankar A, Pandey A, Baranwal V, Kapoor S, Tyagi AK, Pandey GK (2014) Comprehensive expression analysis of rice armadillo gene family during abiotic stress and development. DNA Res 21:267–283. https://doi.org/10.1093/dnares/dst056
Article CAS PubMed PubMed Central Google Scholar
Singh A, Singh VK, Singh SP, Pandian RT, Ellur RK, Singh D, Bhowmick PK, Gopala Krishnan S, Nagarajan M, Vinod KK, Singh UD, Prabhu KV, Sharma TR, Mohapatra T, Singh AK (2012) Molecular breeding for the development of multiple disease resistance in Basmati rice. AoB Plants 2012:pls029. https://doi.org/10.1093/aobpla/pls029
Article CAS PubMed PubMed Central Google Scholar
Spielmeyer W, Ellis MH, Chandler PM (2002) Semidwarf (sd-1), “green revolution” rice, contains a defective gibberellin 20-oxidase gene. Proc Natl Acad Sci USA 99:9043–9048. https://doi.org/10.1073/pnas.132266399
Article CAS PubMed PubMed Central Google Scholar
Stein JC, Yu Y, Copetti D, Zwickl DJ, Zhang L, Zhang C, Chougule K, Gao D, Iwata A, Goicoechea JL, Wei S, Wang J, Liao Y, Wang M, Jacquemin J, Becker C, Kudrna D, Zhang J, Londono CEM, Song X, Lee S, Sanchez P, Zuccolo A, Ammiraju JSS, Talag J, Danowitz A, Rivera LF, Gschwend AR, Noutsos C, Wu CC, Kao S-M, Zeng J-W, Wei F-J, Zhao Q, Feng Q, El Baidouri M, Carpentier M-C, Lasserre E, Cooke R, da Rosa FD, da Maia LC, Dos Santos RS, Nyberg KG, McNally KL, Mauleon R, Alexandrov N, Schmutz J, Flowers D, Fan C, Weigel D, Jena KK, Wicker T, Chen M, Han B, Henry R, Hsing Y-IC, Kurata N, de Oliveira AC, Panaud O, Jackson SA, Machado CA, Sanderson MJ, Long M, Ware D, Wing RA (2018) Genomes of 13 domesticated and wild rice relatives highlight genetic conservation, turnover and innovation across the genus Oryza. Nat Genet 50:285–296. https://doi.org/10.1038/s41588-018-0040-0
Article CAS PubMed Google Scholar
Steuernagel B, Jupe F, Witek K, Jones JDG, Wulff BBH (2015) NLR-parser: rapid annotation of plant NLR complements. Bioinformatics 31:1665–1667. https://doi.org/10.1093/bioinformatics/btv005
Article CAS PubMed PubMed Central Google Scholar
Stick RV, Williams SJ (2009) Disaccharides, oligosaccharides and polysaccharides. In: Stick RV, Williams SJ (eds) Carbohydrates: the essential molecules of life, 2nd edn. Elsevier, Amsterdam, p 335
Google Scholar
Sun X, Wang G-L (2011) Genome-wide identification, characterization and phylogenetic analysis of the rice LRR-kinases. PLoS ONE 6:e16079. https://doi.org/10.1371/journal.pone.0016079
Article CAS PubMed PubMed Central Google Scholar
Supek F, Bošnjak M, Škunca N, Šmuc T (2011) REVIGO summarizes and visualizes long lists of gene ontology terms. PLoS ONE 6:e21800. https://doi.org/10.1371/journal.pone.0021800
Article CAS PubMed PubMed Central Google Scholar
Takken FLW, Joosten MHAJ (2000) Plant resistance genes: their structure, function and evolution. Eur J Plant Pathol 106:699–713. https://doi.org/10.1023/A:1026571130477
Article CAS Google Scholar
Tanaka T, Nishijima R, Teramoto S, Kitomi Y, Hayashi T, Uga Y, Kawakatsu T (2020) De novo genome assembly of the Indica rice variety IR64 using linked-read sequencing and nanopore sequencing. G3 Bethesda 10:1495–1501. https://doi.org/10.1534/g3.119.400871
Article CAS PubMed PubMed Central Google Scholar
Thakur S, Singh PK, Das A, Rathour R, Variar M, Prashanthi SK, Singh AK, Singh UD, Chand D, Singh NK, Sharma TR (2015) Extensive sequence variation in rice blast resistance gene Pi54 makes it broad spectrum in nature. Front Plant Sci 6:345. https://doi.org/10.3389/fpls.2015.00345
Article PubMed PubMed Central Google Scholar
SRA Tools (2021) https://github.com/ncbi/sra-tools. Accessed 7 May 2021
Vergara BS, Chang TT (1985) The flowering response of the rice plant to photoperiod: a review of the literature, 4th edn. International Rice Research Institute, Los Baños, pp 5–35
Google Scholar
Wang Y, Bouwmeester K (2017) L-type lectin receptor kinases: new forces in plant immunity. PLoS Pathog 13:e1006433. https://doi.org/10.1371/journal.ppat.1006433
Article CAS PubMed PubMed Central Google Scholar
Wang X, Jia Y, Shu QY, Wu D (2008) Haplotype diversity at the Pi-ta locus in cultivated rice and its wild relatives. Phytopathology 98:1305–1311. https://doi.org/10.1094/PHYTO-98-12-1305
Article CAS PubMed Google Scholar
Wang X, Lee S, Wang J, Ma J, Bianco T, Jia Y (2014) Current advances on genetic resistance to rice blast disease. In: Yan W, Bao J (eds) Rice—germplasm, genetics and improvement. IntechOpen, London. https://doi.org/10.5772/56824
Chapter Google Scholar
Wang J, Ji C, Li Q, Zhou Y, Wu Y (2018a) Genome-wide analysis of the plant-specific PLATZ proteins in maize and identification of their general role in interaction with RNA polymerase III complex. BMC Plant Biol 18:221. https://doi.org/10.1186/s12870-018-1443-x
Article CAS PubMed PubMed Central Google Scholar
Wang W, Mauleon R, Hu Z, Chebotarov D, Tai S, Wu Z, Li M, Zheng T, Fuentes RR, Zhang F, Mansueto L, Copetti D, Sanciangco M, Palis KC, Xu J, Sun C, Fu B, Zhang H, Gao Y, Zhao X, Shen F, Cui X, Yu H, Li Z, Chen M, Detras J, Zhou Y, Zhang X, Zhao Y, Kudrna D, Wang C, Li R, Jia B, Lu J, He X, Dong Z, Xu J, Li Y, Wang M, Shi J, Li J, Zhang D, Lee S, Hu W, Poliakov A, Dubchak I, Ulat VJ, Borja FN, Mendoza JR, Ali J, Li J, Gao Q, Niu Y, Yue Z, Naredo MEB, Talag J, Wang X, Li J, Fang X, Yin Y, Glaszmann J-C, Zhang J, Li J, Hamilton RS, Wing RA, Ruan J, Zhang G, Wei C, Alexandrov N, McNally KL, Li Z, Leung H (2018b) Genomic variation in 3,010 diverse accessions of Asian cultivated rice. Nature 557:43–49. https://doi.org/10.1038/s41586-018-0063-9
Article CAS PubMed PubMed Central Google Scholar
Wang A, Hou Q, Si L, Huang X, Luo J, Lu D, Zhu J, Shangguan Y, Miao J, **e Y, Wang Y, Zhao Q, Feng Q, Zhou C, Li Y, Fan D, Lu Y, Tian Q, Wang Z, Han B (2019a) The PLATZ transcription factor GL6 affects grain length and number in rice. Plant Physiol 180:2077–2090. https://doi.org/10.1104/pp.18.01574
Article CAS PubMed PubMed Central Google Scholar
Wang L, Zhao L, Zhang X, Zhang Q, Jia Y, Wang G, Li S, Tian D, Li W-H, Yang S (2019b) Large-scale identification and functional analysis of NLR genes in blast resistance in the Tetep rice genome sequence. Proc Natl Acad Sci USA 116:18479–18487. https://doi.org/10.1073/pnas.1910229116
Article CAS PubMed PubMed Central Google Scholar
Waterhouse AM, Procter JB, Martin DMA, Clamp M, Barton GJ (2009) Jalview version 2–a multiple sequence alignment editor and analysis workbench. Bioinformatics 25:1189–1191. https://doi.org/10.1093/bioinformatics/btp033
Article CAS PubMed PubMed Central Google Scholar
Wu B, Hu W, Ayaad M, Liu H, **ng Y (2017) Intragenic recombination between two non-functional semi-dwarf 1 alleles produced a functional SD1 allele in a tall recombinant inbred line in rice. PLoS ONE 12:e0190116. https://doi.org/10.1371/journal.pone.0190116
Article CAS PubMed PubMed Central Google Scholar
Yoshida S (1981) Fundamental of rice crop science. International Rice Research Institute, Los Baños, p 215
Google Scholar
Zarbafi SS, Ham JH (2019) An overview of rice QTLs associated with disease resistance to three major rice diseases: blast, sheath blight, and bacterial panicle blight. Agronomy 9:177. https://doi.org/10.3390/agronomy9040177
Article CAS Google Scholar
Zeng L-R, Qu S, Bordeos A, Yang C, Baraoidan M, Yan H, **e Q, Nahm BH, Leung H, Wang G-L (2004) Spotted leaf11, a negative regulator of plant cell death and defense, encodes a U-Box/Armadillo repeat protein endowed with E3 ubiquitin ligase activity. Plant Cell 16:2795–2808. https://doi.org/10.1105/tpc.104.025171
Article CAS PubMed PubMed Central Google Scholar
Zhang X, Yang S, Wang J, Jia Y, Huang J, Tan S, Zhong Y, Wang L, Gu L, Chen JQ, Pan Q, Bergelson J, Tian D (2015) A genome-wide survey reveals abundant rice blast R genes in resistant cultivars. Plant J 84:20–28. https://doi.org/10.1111/tpj.12955
Article CAS PubMed PubMed Central Google Scholar
Zhang J, Chen LL, **ng F, Kudrna DA, Yao W, Copetti D, Mu T, Li W, Song J-M, **e W, Lee S, Talag J, Shao L, An Y, Zhang C-L, Ouyang Y, Sun S, Jiao W-B, Lv F, Du B, Luo M, Maldonado CE, Goicoechea JL, **ong L, Wu C, **ng Y, Zhou D-X, Yu S, Zhao Y, Wang G, Yu Y, Luo Y, Zhou Z-W, Hurtado BE, Danowitz A, Wing RA, Zhang Q (2016) Extensive sequence divergence between the reference genomes of two elite indica rice varieties Zhenshan 97 and Minghui 63. Proc Natl Acad Sci USA 113:E5163–E5171. https://doi.org/10.1073/pnas.1611012113
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This study was supported by Academia Sinica (AS-TP-109-L10 and AS-KPQ-109-ITAR-TD05).

Funding

This research was funded by Academia Sinica, Taiwan, grant number AS-TP-109-L10 and AS-KPQ-109-ITAR-TD05.

Author information

Authors and Affiliations

Institute of Molecular and Cellular Biology, National Tsing Hua University, Hsinchu, 300, Taiwan
Jerome P. Panibe & Wen-Hsiung Li
Bioinformatics Program, Taiwan International Graduate Program, Institute of Information Science, Academia Sinica, Taipei, 115, Taiwan
Jerome P. Panibe
Biodiversity Research Center, Academia Sinica, Taipei, 115, Taiwan
Jerome P. Panibe, Yi-Chen Lee & Wen-Hsiung Li
State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nan**g University, Nan**g, 210023, China
Long Wang
Department of Agronomy, National Chung-Hsing University, Taichung, 40227, Taiwan
Chang-Sheng Wang
Advanced Plant Biotechnology Center, National Chung Hsing University, Taichung, 40227, Taiwan
Chang-Sheng Wang
Department of Ecology and Evolution, University of Chicago, Chicago, IL, 60637, USA
Wen-Hsiung Li

Authors

Jerome P. Panibe
View author publications
You can also search for this author in PubMed Google Scholar
Long Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yi-Chen Lee
View author publications
You can also search for this author in PubMed Google Scholar
Chang-Sheng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Wen-Hsiung Li
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Analysis of sd1 genes, map** of TN1 reads from the 3000 Rice Genomes Project, genome-wide scan of genes under positive selection, REVIGO visualization, orthologue search between Nipponbare and TN1 proteins, haplotype analysis of Pi54 and Pi-ta, by JPP.; prediction of NBS-LRR genes, search for Tetep NLR orthologues in TN1, detection of presence or absence of blast R genes in TN1, by LW; experiments including polymerase chain reaction amplification of TN1 and IR8 DNA for Sanger sequencing, by Y-CL; advised the study, by C-SW; designed, advised, and supervised the study, by W-HL. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Wen-Hsiung Li.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

TN1 Botanical studies.

Additional file 2.

Predicted R genes in TN1, their Pfam domains, NLR-Parser result and R gene classification.

Additional file 3.

Results of finding Tetep NLRs in TN1.

Additional file 4.

Blastp and tblastn hits of the cloned R genes to the TN1 and Tetep genome.

Additional file 5.

Haplotype and variety order of Pi54 and Pi-ta from SNP-Seek.

Additional file 6.

SNP effect results from SNP-Seek.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Panibe, J.P., Wang, L., Lee, YC. et al. Identifying mutations in sd1, Pi54 and Pi-ta, and positively selected genes of TN1, the first semidwarf rice in Green Revolution. Bot Stud 63, 9 (2022). https://doi.org/10.1186/s40529-022-00336-x

Download citation

Received: 25 November 2021
Accepted: 17 February 2022
Published: 26 March 2022
DOI: https://doi.org/10.1186/s40529-022-00336-x

Identifying mutations in sd1, Pi54 and Pi-ta, and positively selected genes of TN1, the first semidwarf rice in Green Revolution

Abstract

Background

Results

Conclusions

Similar content being viewed by others

Background

Predicted R genes in TN1

Haplotype analysis of the Pi-ta and Pi54 genes

Eleven genes in TN1 underwent positive selection

Discussion

sd1 has a 382-bp deletion in the semidwarf TN1

The Pi54 resistance gene in TN1 is missing

Functions of the genes subjected to positive selection in TN1

Sanger sequencing of the sd1 gene

Comparison of the sd1 gene sequence against TN1 reads from the 3000 Rice Genomes Project

Prediction of NBS-LRR genes

Search for Tetep NLR orthologues in TN1

Detection of presence or absence of blast R genes in TN1

Haplotype analysis using data from the 3000 Rice Genomes Project

Detection of genes subjected to positive selection in TN1

REVIGO visualization of GO Terms

Availability of data and materials

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation