Identification of an Elite Core Panel as a Key Breeding Resource to Accelerate the Rate of Genetic Improvement for Irrigated Rice

Juma, Roselyne U.; Bartholomé, Jérôme; Thathapalli Prakash, Parthiban; Hussain, Waseem; Platten, John D.; Lopena, Vitaliano; Verdeprado, Holden; Murori, Rosemary; Ndayiragije, Alexis; Katiyar, Sanjay Kumar; Islam, Md Rafiqul; Biswas, Partha S.; Rutkoski, Jessica E.; Arbelaez, Juan D.; Mbute, Felister N.; Miano, Douglas W.; Cobb, Joshua N.

doi:10.1186/s12284-021-00533-5

Identification of an Elite Core Panel as a Key Breeding Resource to Accelerate the Rate of Genetic Improvement for Irrigated Rice

Original article
Open access
Published: 13 November 2021

Volume 14, article number 92, (2021)
Cite this article

Download PDF

You have full access to this open access article

Rice Submit manuscript

Identification of an Elite Core Panel as a Key Breeding Resource to Accelerate the Rate of Genetic Improvement for Irrigated Rice

Download PDF

Roselyne U. Juma¹^nAff2,
Jérôme Bartholomé ORCID: orcid.org/0000-0002-0855-3828^1,3,
Parthiban Thathapalli Prakash¹,
Waseem Hussain¹,
John D. Platten¹,
Vitaliano Lopena¹,
Holden Verdeprado¹,
Rosemary Murori^1,4,
Alexis Ndayiragije^1,5,
Sanjay Kumar Katiyar^1,6,
Md Rafiqul Islam^1,7,
Partha S. Biswas¹^nAff8,
Jessica E. Rutkoski¹^nAff9,
Juan D. Arbelaez¹^nAff9,
Felister N. Mbute¹⁰,
Douglas W. Miano¹⁰ &
…
Joshua N. Cobb¹^nAff11

4766 Accesses
17 Citations
6 Altmetric
Explore all metrics

Abstract

Rice genetic improvement is a key component of achieving and maintaining food security in Asia and Africa in the face of growing populations and climate change. In this effort, the International Rice Research Institute (IRRI) continues to play a critical role in creating and disseminating rice varieties with higher productivity. Due to increasing demand for rice, especially in Africa, there is a strong need to accelerate the rate of genetic improvement for grain yield. In an effort to identify and characterize the elite breeding pool of IRRI’s irrigated rice breeding program, we analyzed 102 historical yield trials conducted in the Philippines during the period 2012–2016 and representing 15,286 breeding lines (including released varieties). A mixed model approach based on the pedigree relationship matrix was used to estimate breeding values for grain yield, which ranged from 2.12 to 6.27 t·ha⁻¹. The rate of genetic gain for grain yield was estimated at 8.75 kg·ha⁻¹ year⁻¹ (0.23%) for crosses made in the period from 1964 to 2014. Reducing the data to only IRRI released varieties, the rate doubled to 17.36 kg·ha⁻¹ year⁻¹ (0.46%). Regressed against breeding cycle the rate of gain for grain yield was 185 kg·ha⁻¹ cycle⁻¹ (4.95%). We selected 72 top performing lines based on breeding values for grain yield to create an elite core panel (ECP) representing the genetic diversity in the breeding program with the highest heritable yield values from which new products can be derived. The ECP closely aligns with the indica 1B sub-group of Oryza sativa that includes most modern varieties for irrigated systems. Agronomic performance of the ECP under multiple environments in Asia and Africa confirmed its high yield potential. We found that the rate of genetic gain for grain yield found in this study was limited primarily by long cycle times and the direct introduction of non-improved material into the elite pool. Consequently, the current breeding scheme for irrigated rice at IRRI is based on rapid recurrent selection among highly elite lines. In this context, the ECP constitutes an important resource for IRRI and NAREs breeders to carefully characterize and manage that elite diversity.

Genetic gains in IRRI’s rice salinity breeding and elite panel development as a future breeding resource

Article Open access 31 January 2024

Effectiveness of early-generation testing applied to upland rice breeding

Article 05 March 2018

Unravelling genetic architecture and development of core set from elite rice lines using yield-related candidate gene markers

Article 01 June 2022

Introduction

Rice (Oryza sativa L.) is one of the world’s major staple crops feeding more than 3.5 billion people (Global Rice Science Partnership 2013). It is believed that by 2050 the global population will be approximately 10 billion (United Nations 2019) and much of this population increase will occur in the regions of Africa and Southern Asia, which are highly dependent on rice. As such, rice will be crucial to ensuring equitable food security for the foreseeable future (Peng et al. 2004; Godfray 2014; Li et al. 2018). Challenges posed by climate change as well as increasing consumer demand further highlight the importance of rice to global food security (Silvern and Young 2013). While agricultural intensification using modernized management practices (Garnett et al. 2013) can help boost productivity, the importance of rice genetic improvement in the context of these management systems is also an important driver of sustainable productivity (Guimaraes 2009; Atlin et al. 2017). The rate at which this genetic improvement occurs is often referred to as genetic gain and in order to deliver improved varieties to the farmers of the twenty-first century, the rate of genetic gain in rice must accelerate relative to twentieth century levels (Atlin et al. 2017).

With the acceleration of genoty** technologies through the early twenty-first century and the subsequent maturation of genomic selection-based breeding strategies, there has been a renewed interest in the application of quantitative genetics to plant breeding programs (Cobb et al. 2019b; Bernardo 2020). To this end the irrigated rice breeding program at the International Rice Research Institute (IRRI) has spent significant effort to develop a modernized approach to rice breeding to substantially and sustainably increase response to selection (Collard et al. 2019). In addition to implementing accelerated single seed descent strategies (Collard et al. 2017), another major pillar of IRRI’s effort to transform rice breeding is the deep characterization of the elite genetic base from which new products are derived. While the characterization and dissection of rice genetic diversity in public germplasm collections has advanced considerably (Li et al. 2014; McCouch et al. 2016; Sun et al. 2017), to be fully leveraged for varietal improvement, it needs to be paired with an equally in-depth characterization of the elite genetic diversity residing in breeding programs across the world.

The irrigated rice breeding program at IRRI has been a source of elite breeding germplasm for decades (Peng and Khush 2003; Mackill and Khush 2018; Collard et al. 2019). This genetic diversity has been utilized in combination with landraces and local varieties to contribute substantially to the yield improvement achieved in Asia to date. The breeding strategies used to achieve this post-Green Revolution yield improvement however, frequently varied according to funding priorities, available technology, and evolution of scientific thinking (see Fig. 1). IRRI’s early breeding effort culminated in the development of IR8, the first widely-adopted semi-dwarf variety of the Green Revolution (Chandler 1982; Peng et al. 1999; Peng and Khush 2003). Though this variety was high yielding, it lacked acceptable cooking and eating quality and therefore was quickly superseded by other varieties that excelled in both grain yield and marketability (Khush 2001). During this time, a focus on improved disease resistance and continued efforts to increase genetic variation led to many new varieties introgressed with genetics from wild species (Brar and Khush 2002, 2018) that were created using strategies such as backcrossing, top crossing, and pedigree breeding methods. IR 36, for instance, resulted from the combination of 13 landraces from six different countries (Khush 2005). This variety displayed good grain quality, early maturity, tolerance to abiotic stresses, and resistance to multiple pests and disease (Peng and Khush 2003). Further advances in grain quality (soft gel consistency, translucent and long slender grains, intermediate amylose content and intermediate gelatinization temperature) were made with the release of IR 64 which resulted from combining extant improved lines with 19 traditional varieties (Mackill and Khush 2018), but which was still heavily based on IR8. A renewed focus on yield improvement in the late 1980s and 1990s sparked the development of an ideotype breeding strategy known as the new plant type (NPT, Fig. 1) (Cassman 1994; Peng et al. 2004; Yadi et al. 2021). With the advent of the molecular marker technologies during the same period, this was quickly followed up by selection strategies based on marker-assisted backcrossing to introduce major genes for biotic or abiotic stress tolerance to produce enhanced versions of existing varieties. This effort was recently coupled with an enhanced focus on bio-fortification in order to couple high yield with high nutritional value. However, post-Green Revolution breeding for quality and disease resistance, while successful, has not brought about the realized genetic gain for yield that is needed to meet the projected demand. More recent approaches aim to integrate principles of quantitative genetics into the breeding strategy by focusing the molecular breeding strategy on well-known high-value haplotypes and using a genomics-enabled rapid recurrent selection strategy to improve quantitative traits mainly through accelerated breeding cycles (Fig. 1).

The objective of this study was twofold: (i) estimate gains in breeding value for yield over the entire history of IRRI’s breeding program for irrigated systems, and (ii) identify and characterize a panel of elite lines from among the available germplasm that balances high breeding value for yield with sufficient genetic variance to preserve long term gain from selection. To this end, we gathered historical data from 102 yield trials in the IRRI phenotypic database (Breeding4Results 2021) spanning the years from 2012 to 2016 and combined it with pedigree data from the International Rice Information System (McLaren et al. 2005; Collard et al. 2019) to estimate breeding values for grain yield for all extant or recently extant lines. These trials included most of the existing advanced material of the breeding program as well as many replicated observations of IRRI released varieties, allowing us to estimate the rate of genetic gain over five decades. The same data was then used to identify high yielding lines from the breeding program to form the IRRI irrigated elite core panel (ECP). Seventy-two lines were ultimately chosen to comprise the elite panel and were subjected to extensive genetic and phenotypic characterization to assess suitability for short-cycle recurrent selection.

Results

Estimation of the Genetic Gain for Grain Yield

Genetic gain for grain yield was estimated as a function of change in breeding value over time. Breeding values for 15,286 lines evaluated in 102 trials conducted between 2012 and 2016 were estimated using a two-stage mixed model analysis (Table 1, Additional file 1: Table S1). The majority of these lines were advanced lines from the breeding program that never achieved varietal status and released varieties from different decades. Eighty percent of the lines originated from crosses that were made after 2009 (Fig. 2A). As expected, the reliability of the breeding values of older lines (generated before 2000) were higher compared to more recent material, with an average value of 0.43 (σ = 0.23) and 0.1 (σ = 0.17), respectively. Breeding values for grain yield ranged from 2.12 to 6.27 t·ha⁻¹. The genetic trend as measured by this analysis of the IRRI irrigated rice breeding program since its initiation in 1960 to 2014 is presented in Fig. 2B. Over this period the linearized genetic trend was estimated to be 8.75 kg·ha⁻¹ year⁻¹ (0.23%). Despite the smaller sample size for the earlier historical periods, an upward trend from 1960 to 1980 is apparent followed by a period of variability in the average breeding value which eventually plateaus around 4.38 t·ha⁻¹ after 2008. In order to interrogate the drivers of this genetic trend further, the equivalent complete generation (EqG, see “Materials and Methods” section) for each line was calculated as an estimate of the number of effective breeding cycles that had taken place prior to the crossing event. EqG is a key indicator of the rate of introduction of new material and the extent at which improved material is recycled into the breeding program. A similar trend to breeding values was also observed for EqG for the same period. Values had exceeded two by the end of the sixties to reach its maximum average value of six in the eighties (Fig. 2C). This was followed by a marked decrease to an average value of four in the nineties the average maximal values after 2000 never exceeded six equivalent generations. In addition, a large variance in EqG was found across lines from the most recent decade with values ranging from 1 to 7.56 highlighting the extensive use of non-improved material in combination with more elite lines.

Table 1 Summary of yield trials used to estimate breeding values for grain yield

Full size table

Eighty-six released varieties included in the dataset were analyzed separately to better characterize the long-term trend in breeding values for yield and its relationship with EqG (Fig. 3). This includes material dating from the Green Revolution and post-Green revolution eras (IR8, IR36), mega-variety (IR64) and more recent high performing releases (IRRI 154, IRRI 156). Altogether, these lines covered a large period from 1962 to 2006. In this period genetic gain for grain yield was estimated based on released varieties to be 17.36 kg·ha⁻¹ year⁻¹ (0.46%; Fig. 3A). When regressing breeding values on EqG we observed significant correlation and estimated the rate of genetic gain per cycle to be 185 kg·ha⁻¹ cycle⁻¹ (4.95%; Fig. 3B).

Retrospective Analysis of Crosses

The pattern of parental selection among crosses made by IRRI’s irrigated breeding program was analyzed over a period of thirty years (1985–2014) to assess the evolution of the crossing strategy and its relationship to EqG. During this period, 13,190 crosses were made. The number of crosses and the proportion of the type of cross varied substantially from one year to the next (Fig. 4A). However, the total number of crosses has been on a downward trend. During this period, most crosses were single crosses (71.9%) or three-way crosses (24.9%) and a small proportion were backcrosses (2.7%), complex crosses (0.4%) or double crosses (0.1%). The proportion of single crosses varied from 42.9% in 2007 to a high of 99.6% in 1989. To further dissect the impact of parental selection and mating design on EqG, crosses from this period were classified as elite by elite (41.4%), elite by non-elite (34.2%) and non-elite by non-elite (24.4%) based on the EqG of the parents. Since an EqG of 4 represented lines from the most advanced available breeding cycle in 1985, any line with an EqG of four or greater was considered elite and any line with an EqG of less than four was considered non-elite. Similar to cross type, the three classes of cross varied substantially from one year to the next (Fig. 4B). Notably, from 1991 to 1997 the proportion of non-elite by non-elite crosses increased dramatically with up to 82% of the crosses falling in this category for that period. This corresponds to a decrease in EqG for the same period and is likely a function of the introduction of new material into the breeding program to achieve the objectives of the NPT initiative (Additional file 2: Fig. S1). During the period 1985–2014, 6,228 unique lines were used as parents and most of them were used only once (65.4%). On the other hand, a few lines were heavily used as parents with 90 lines being used more than 40 times each during the 29-year period. As expected, this list included well-known IRRI varieties such as IR36, IR64, IR72, IRRI 104, IRRI 105, IRRI 118 IRRI 123 and IRRI 154 but also included traditional varieties like Kalimonch, Basmati 370, Shen Nung 89–366 and MD-2 used as donors of alleles with particular value. Interestingly, some of the most used parental lines were crossed during several periods with sometimes more than 20 years between their first and last use (IR64, IR72, IRRI 104, and IRRI 105). This prominent reuse of old material serves to lengthen effective breeding cycles despite advancing the pedigree and is likely one of the primary limitations on the historical rate of genetic gain for grain yield.

Defining the Elite Core Panel (ECP)

The best performing lines in terms of breeding value for yield were selected and filtered based on the reliability of the breeding value estimate and their relatedness to other lines in the dataset based on pedigree (see “Materials and Methods” section). The final ECP was composed of 72 lines falling within the top 2% of breeding values, ranging between 4.93 and 6.01 t·ha⁻¹ with a mean value of 5.27 t·ha⁻¹ (Fig. 5). Most of the selected lines were of medium duration with breeding values for flowering time (days to 50% heading) averaging at 90 days. The average EqG was 5.7 with 90% of the lines having an EqG greater than four. The majority of the lines were developed after 2000 with 37.5% in 2010 and onward (Fig. 5). Interestingly one line (IR05N341) has a significantly lower EqG of 2.44, but has a breeding value of 5158 kg·ha⁻¹ and ranks as 45/72 among ECP breeding values. IR05N341 is an NPT inbred with a number of introduced lines in its pedigree, namely SHEN-NUNG 89–366, KETAN LUMBU, GUNDIL KUNING, and JHUM PADDY 7.

Genetic Characterization of the ECP

In order to quantify the genetic variation available to breeding in the ECP, the panel was genotyped with an amplicon panel of genome-wide markers specifically chosen to be informative among elite indica lines known as the 1k-RiCA (see “Materials and Methods” section). Using publicly available sequence data, the genome-wide SNPs assayed on the ECP lines were compared to the ** of ECP with ** the Elite Core Panel

The contemporary program has moved to a much more intensive recurrent selection strategy based on quantitative genetics principles to drive genetic gain for yield in the context of a disease resistant and high grain quality genetic background. This approach is a natural progression building on previous eras where the focus was on the identification and integration of genetic variation for yield potential traits (Peng et al. 2008). With that, it becomes necessary to systematically evaluate the existing genotypes in the program and select a number of high performing lines to form the basis of a gene pool upon which selection for high breeding value can, in combination with other innovations, drive improved rates of genetic gain (Xu et al. 2017). The 72 lines selected based on breeding value to be part of the ECP essentially represent the initial founders of the recurrent selection program moving forward. While the phenotypic value of this panel should be quickly eclipsed by successive generations of breeding, every new cohort represents an admixture of allelic variance of the panel. Thus, having clearly maintained seed sources for the original lines offers several distinct advantages, including as an elite source of genetic variation to be evaluated alongside the contemporary cohorts for new traits of interest. Such a panel is also helpful for validating trait markers for high-value haplotypes which reduce the occurrence of type I and type II errors when genoty** the progeny (Platten et al. 2019; Cobb et al. 2019a). Once sequenced, the panel also becomes a powerful resource for determining identity-by-descent (IBD) information among progeny cohorts and potentially reducing the need for routine use of high-density markers through the development of a breeding program specific imputation framework (Browning and Browning 2012; Nyine et al. 2019; Wang et al. 2020).

Genetic Diversity Captured by the Elite Core Panel

A natural concern to limiting the breeding program to crosses among such a small number of lines is the reduction in genetic variation that may occur due both to selection and genetic drift. As the ECP lines were selected based on the pedigree-estimated breeding values, the genetic characterization of the panel is a necessary next step to demonstrate its utility as a resource for breeding in a population improvement program (Warburton et al. 2005; Wen et al. 2012). While the panel itself was selected based on breeding value for yield, the mean flowering time compared to the entire dataset has not changed. This is largely a function of including flowering time as a covariate in the model, which factored out confounding effects due to the positive correlation between yield and flowering in rice.

We used the 3K reference genome panel (Wang et al. 2018) to better understand how well the ECP sampled the genetic space within rice genetic diversity at large. Unsurprisingly, it falls within the ** amplicon-based SNP assay for genetics and breeding applications in rice. Rice 12:55. https://doi.org/10.1186/s12284-019-0311-0 " href="/article/10.1186/s12284-021-00533-5#ref-CR4" id="ref-link-section-d192778788e3355">2019). This level of Ne is similar to what has been calculated in other rice breeding programs. For example, Grenier et al. (2015) showed a Ne in the range of 23–57 in four breeding populations of rice derived through recurrent selection programs. Morais Júnior et al. (2017) observed slightly higher values (40–60) in an irrigated rice breeding population using pedigree data. Values of Ne associate positively with additive genetic variation and the ability of a population to respond to selection for the trait under consideration (Falconer and Mackay 1996). Depletion in variability in population is proportional to Ne, and the time required to deplete the variability or fix one or other alternative allele in a population is a function of Ne, allele frequency (p) in the population, and the selection intensity (Walsh 2003). The theoretical limits of selection response as given by Robertson (1960) postulate that the total response to selection is equal to 2Ne times the initial gain in the first generation assuming genes with additive effects and relatively low selection intensity. This is to say that the Ne of any given generation is equal to the number of effective cycles before half the genetic variability is eroded by selection or drift. Therefore, holding the unlikely assumption that no new introductions were to be made into the program moving forward, the ECP could theoretically support at least 22 breeding cycles before half of the genetic variance is eroded. This could be further extended by the implementation of marker-optimized mating designs such as optimum contribution selection (Akdemir and Sánchez 2016; Akdemir et al. 2019) and targeted pre-breeding activities.

Expected Performance of the ECP Across Target Environments

Many traits in rice are governed by a number of high value, large effect alleles that affect patterns of phenotypic variance across environments (Wei et al. 2021). Some of these alleles (particularly disease and grain quality loci) are extremely valuable and deserve proactive management of their frequencies (Cobb et al. 2019a). Understanding these allele frequencies among the ECP lines is therefore essential for setting breeding strategy. As might be expected, the ECP displayed a wide range of frequencies for major pathogens and pests related genes. A few were essentially fixed for the positive allele; many of these represent indica/japonica differences, where the indica allele is favorable, such as Pi25 or Pid2. The value of these genes has already been captured by the breeding program, thus further improvement of these traits must rely on other genetic variation. Other genes are absent, such as xa13, Xa23, the rice yellow mottle virus resistance genes, among others. The lack of these genes in elite material necessitates some pre-breeding effort to introduce them to the elite pool before their value can be leveraged (Cobb et al. 2019a). Between these two extremes are the genes that can actually be selected in existing breeding material, and so are those contributing to variation in the elite pool. These include genes such as Xa7, Xa21 or TBV1 which although present are very rare and only available from particular lines, thus delaying their full deployment as the program generally cannot risk bottleneck a cohort through just a few lines. A few genes are at appreciable frequencies (but not fixed), and so represent diversity that is easily selectable in the existing breeding populations; these include Pita, Ptr, Pii, BPH32, sweet13, xa5 and TSV1.

Given the importance of bacterial blight and blast resistance specifically to germplasm exchange across Asia and Africa we decided to challenge the ECP lines against several common isolates of both pathogens to check the effectiveness of the gene combinations present in the ECP against blast and BLB disease in these specific genetic backgrounds. It has been reported by Shanti et al. (2010) that a four-gene combination (Xa4, xa5, xa13 and Xa21) is the most effective combination conferring broad-spectrum resistance for bacterial blight. While 30% of the ECP lines contained one or more bacterial blight resistance alleles, none of the ECP lines contained this specific combination as xa13 in particular is at a frequency of zero. However, it is clear the gene combinations that are present offer resistance to all tested isolates except PXO 99. PXO 99 is a common Xanthamonas isolate in the Philippines (Tu et al. 2000) indicating that while the ECP lines are directly useful in many geographies, a targeted backcrossing and MAS approach is needed to increase the frequency of high value alleles currently at low frequency among the breeding progeny of the ECP. Likewise, the blast resistance genes present in most of ECP lines also effectively controlled the manifestation of disease among the ECP lines for the five strains tested, likely due to the high frequency and broad spectrum resistance offered by the Pi-ta locus (Jia et al. 2016).

It is not unexpected that a subset of the breeding germplasm selected based on breeding value for yield would require pre-breeding/backcrossing and MAS to fully address the complexity of trait targets in the product concept. In order to avoid further erosion of the genetic variance (particularly in the form of selective sweeps around low frequency loci) some strategic cautions are warranted. Product development is the primary goal of a breeding program, however when product development is strongly emphasized, it is tempting for breeders to overuse specific high-value lines in a crossing block, at the expense of gene pool management. While every cross may not bring together the complete package necessary for a product release, specific crosses generated with the intention of creating progressively improved allelic combinations (and slowly increase in frequency of major genes) can generate useful lines that can be recycled into the crossing block as parents. Likewise, the use of haplotype-matched backcrossing donors of key genes can be a powerful tool for introducing novel alleles that are at zero or low frequency into the elite gene pool while preserving the availability of genetic variance immediately around them for recombination and the improvement of quantitative traits.

Managing the deployment of single genes within the breeding program due to limitations imposed by their frequency within the ECP is not the only consideration future breeding efforts based on this germplasm resource must consider. Since quantitative traits are not governed by single genes, location main effects and genotype by environment interactions must be routinely accounted for in phenotypic analysis strategies to factor out their strong influence on phenotypic outcomes. Because the primary source of data for determining the breeding values for yield that were used to identify the ECP were trials conducted in the Philippines, an understanding of yield performance in other target environments was also necessary. In order to evaluate the performance of the ECP lines relative to local and IRRI checks in relevant geographies outside the Philippines, six breeding trials were conducted in India, Kenya, and Tanzania. BLUP values centered on the mean performance of each location indicate strong performance of the ECP lines relative to the highest yielding local checks in each location. The performance relative to IRRI varieties and the local checks is a strong indication that the observed genetic variance is manifested as phenotypic variance within each location, indicating that crossing and selection among high performing lines within each breeding zone is likely to result in genetic gain for yield. Further analysis of genotype x environment interactions and the genetic correlations between target environments is warranted to help identify a global testing strategy that best leverages limited public resources available to the IRRI breeding program.

Conclusion

Achieving short- and medium-term genetic gains for yield is a key target for almost every breeding program. In the case of the IRRI’s breeding program for irrigated systems, the rate of genetic gain for grain yield was estimated at 17.36 kg·ha⁻¹ year⁻¹ (0.46%) for released varieties. This rate of gain appears to be largely limited by long cycle times and the re-introduction of old material or landraces into the elite pool. This observation highlights the need to optimize the breeding strategy for quantitative traits by using quantitative genetics principles to get closer to the annual 1.5% gain in grain yield needed to cover the expected increase in rice consumption. The elite core panel identified and characterized in this study is a key component of this optimization. Indeed, recurrent selection with short cycles based on elite-by-elite crosses implemented at IRRI to deliver a higher rate of genetic gain for grain yield requires careful management of the genetic diversity, which starts with a comprehensive characterization of the most elite germplasm.

Materials and Methods

Historical Yield Data

Experimental Studies and Pedigree Information

All the yield data from trials conducted by the irrigated breeding program during the period 2012–2016 across multiple locations were retrieved from the IRRI database. From these trials, the following phenotypic information was extracted: plant height, number of days to flowering, grain yield and number of hills per plot. The phenotypic information extracted were filtered based on the following quality criterion: presence of a randomized experimental design, percentage of missing data for grain yield and flowering time lower than 15%, and harvested area greater than 2 m². We considered an environment as the combination of location, year and season. The environments considered varied in experimental designs according to either a row-column, alpha-lattice, augmented randomized complete block, or ordinary randomized complete block design (RCBD). A total of 102 studies were conducted in 23 environments with a total of 17,216 lines from which 15,286 were sorted out as irrigated rice lines data after filtering (Additional file 1: Table S1). All the studies were conducted in the Philippines, 51 studies having been planted during the wet season and another 51 studies during the dry season. The pedigree information for the selected lines was extracted from the IRRI genealogy management system (McLaren et al. 2005) database using custom scripts. The date of the initial cross was also retrieved for all the breeding lines whose crossing year information was available in the database (16,317 lines). The pedigree information was also used to compute equivalent complete generations (EqG) (Boichard et al. 1997; Gutiérrez et al. 2008; Leroy et al. 2013) for each line. EqG for a given line was calculated as follow:

$$EqG=\sum _{i = 1}^{n}\left(\frac{1}{{2}^{{g}_{i}}}\right)$$

where g_i represents the number of generations between the line and its ancestor i (one for the parents, two for the grandparents, etc.).

Estimating Breeding Values for Lines

A two-stage mixed model analysis (Piepho et al. 2008; Smith and Cullis 2018) using grain yield data as response variable was used to estimate the breeding values of each line. The two-stage mixed model analysis was adopted to account for specific experimental design layouts across the environments (Damesa et al. 2017). In the first stage, each trial or environment (combination of location, year and season) was analyzed separately and best linear unbiased predictors (BLUPs) were extracted per environment using the following baseline mixed-model:

$$y_{ij} = \mu + g_{i} + \cdots + \varepsilon_{i}$$

(1)

where y_ij represents grain yield for ith observation, μ is the overall mean, g_i is the random effect of ith genotype with iid g_i ∼ N(0, Iσ²_g) where σ²_g is genetic variance and ε_ij is the residual error with iid ε_ij ∼ N(0, Rσ²_ε). To account for heterogeneous error variance caused by differences in the numbers of hills harvested from plot to plot and from trial to trial, the diagonal of R was set to h/h_max where, h is the number of hills harvested and h_max is the maximum number of hills harvested in the environment. The … in the model denotes the blocking factors and a covariate for missing hills which were conditional to the trial. These terms were included in the model because they were identified as improving model fit during analyses of individual trials. Blocking factors were considered random if they had more than five levels. The possible blocking factors were modelled to determine which factors led to the lowest Bayesian information criterion (Spilke et al. 2010; Piepho et al. 2015). For trials that followed a row-column design, the possible factors were row and column, for those following a partially replicated design, the possible factors were row, column, replicate, and block, for those following a RCBD or augmented RCBD, the possible factor was replicate, for those following an alpha-lattice design the possible factors were replicate, block nested within replicate, row, and column. The model with lowest Bayesian information criterion was selected and used to extract BLUP of each line and their prediction error variances (PEV) were obtained for each environment. Reliabilities of the BLUPs were estimated according to $r^{2} = 1 - \frac{PEV}{{\sigma_{g}^{2} }}$. The process for BLUP estimation per environment was repeated for days to flowering.

In the second stage model, the BLUPs obtained from the first stage model were de-regressed by dividing by the reliability as described in Garrick et al. (2009), and used as response variable in the second stage pedigree-based mixed model analysis. The de-regressed BLUPs for yield within each environment were modeled according to Bates et al. (2014). The model used is as follows:

$$y_{ij} = \mu + g_{i} + e_{j} + \varepsilon_{ijk}$$

(2)

where y_ij is the de-regressed BLUP of each line in environment j, μ is the overall mean, g_i is a random effect of line i with g_i ∼ N(0, Aσ²_g) where σ²_g is the genetic variance and A is the additive genetic relationship matrix based on pedigree, e_j is a fixed effect of the environment j, ε_ij is the residual error with ε_ij ∼ N(0, Rσ²_ε) where R is a matrix proportional to the residual error covariance matrix and σ²_ε is the error variance. To account for heterogeneous error variance, the diagonal of R was 1/r². In the above model yield was adjusted using days to flowering as covariate in the model. The R packages lme4 (Bates et al. 2015) and pedigreemm (Bates et al. 2014) were used to implement the models.

Assessment of Rate of Genetic Gain

Genetic gain was assessed using breeding values following the procedure reviewed by Garrick (2010). Briefly, for each year, the breeding values obtained were regressed on the year when the cross was made to get the genetic gain trends.

Retrospective Analysis of Crosses

The pedigree data of breeding lines developed by the IRRI’s irrigated breeding program was obtained from the genealogy management system. All crosses made were retrieved from the database and filtered based on the availability of the following information: whether or not it belonged to the irrigated program and the year when the cross was made. After those filters, only crosses made in 1985 and onward were available. This is related to the absence of clear boundaries between breeding programs before that date. Indeed, even though lots of material has been produced since IRRI inception in 1960, the information of the breeding program was not recorded in the database before 1985 making it difficult to extract relevant information. For each cross the EqGs of the F1 and of the parental lines were computed based on pedigree information. Each cross was then classified based on EqG of the parental lines. Parental lines with an EqG lower than four (mean EqG value of parents used in 1985) were classified as non-elite material and the remaining part as elite material.

Formulation of the Elite Core Panel

In order to identify the best breeding genotypes (hereafter called elite lines) the lines were filtered based on high ranking for their grain yield breeding values. Genetic relationship for the genotypes was measured by calculating the coefficient of parentage (CoP) based on the pedigree information using R package pedigreemm (Bates et al. 2014). Among the 15,286 evaluated lines, 1192 had reliability for the breeding value greater than or equal to 0.4. Then ten percent of these lines were selected to represent elite material. In total, 119 elite lines were selected among which only 80 elite lines were retained based on seed availability. Further based on seed-viability and intellectual property restrictions, some of the lines were removed and 72 elite lines were forwarded for the formulation of the elite core panel (ECP). The 72 ECP lines are reported in Additional file 1: Table S1.

Genetic Characterization of the ECP

Genetic diversity of ECP and its relationship with the 3000 rice genomes (Li et al. 2014) was assessed. Effective population size (Ne) of ECP was calculated using SNP data. Further, frequency of major trait genes was estimated in the ECP.

DNA Extraction, Genoty** and SNP Filtering

Leaf samples were obtained from the 72 ECP lines and 10 IRRI varieties (IR 6, IR 64, IR 68, IR 72, IR 74, IRRI 115, IRRI 116, IRRI 146, IRRI 151, IRRI 164) at vegetative growth stage (28–35 days old) plants. DNA was isolated and purified according to the modified Cetyltrimmethyl Ammonium Bromide (CTAB) protocol (Aboul-Maaty and Oraby 2019). Genoty** was done using the 1k-RiCA assay (Arbelaez et al. 2019). The 1k-RiCA SNPs were filtered in TASSEL v5.0 (Bradbury et al. 2007) using the following criteria: individuals with more than 15% of heterozygous loci where removed, markers with more than 15% of missing values and a minor allele frequency below 0.05 were removed. After filtering, 703 markers on 76 lines including 66 ECP lines were retained for downstream genetic analyses (Additional file 1: Table S2).

To enable a better characterization of key genes related to biotic stresses, resequencing data was generated on ECP lines. DNA was extracted from mature leaf tissue using the QIAgen DNeasy Plant maxi kit. Sequencing was performed on total genomic DNA on an Illumina Sequel II system (Macrogen, Korea) or a HiSeq 2000 system (Corteva, Hyderabad). Resequencing data was filtered and trimmed for low-quality base calls using standard pipelines, and mapped to the MSU7 build of the Nipponbare genome. Based on sequencing data quality, four lines were discarded. Base calls at specified informative positions were generated using SAMtools (Li et al. 2009) and analysed to generate a call for the allele present at each of 37 specified high-value genes related to abiotic stresses but only 33 had sufficient data to be called. The information related to the 37 genes is available in Additional file 1: Table S3.

Diversity of ECP and Its Relationship with 3K-RG, and Favorable Frequency Estimation

The relationship of ECP lines with the indica subpopulation of the 3K-RG was assessed using principal component analysis (PCA). First, the physical position coordinates of the 703 filtered 1k-RiCA SNPs were used to extract the filtered set of markers from the 3K-RG using the rice SNP-Seek database (Mansueto et al. 2017). Out of 703 filtered markers, 625 markers were common between the two data sets and used for downstream analysis. Modern varieties coming from IRRI are known to be within the ** method. Three leaves per plant were inoculated. Evaluation of resistance was done at 14 days after inoculation (DAI) by measuring the lesion starting from the point of inoculation to the end with visible symptoms. For a given isolate, the genotypes with less than four measurements (out of six) were removed from the analysis. The average lesion length was computed for all the ECP lines and the checks. The final dataset had 2.9% missing data and is available in Additional file 1: Table S6.

Field Based Evaluation and Data Analysis

Phenotypic characterization of ECP for grain yield was conducted in Kenya, Tanzania, India and the Philippines. Different experimental designs such as randomized complete block, alpha-lattice and partially replicated designs were used across these environments (Additional file 1: Table S7). Plot-level yields were normalized based on plot size and all the experimental data is stored in the breeding for results (B4R) data management system.

Grain yield (ton/ha) and days to 50% flowering time (days) from all the ECP trials were used in this study. The best linear unbiased predictor (BLUP) values were calculated using the predict function from the R package asreml in which the entry was used as a random effect in all the trials. For the trials that followed a partially replicated design, the possible factors were row, column, replicate, and block, for those following a randomized complete block design, the possible factor was replicate, for those following an alpha-lattice design the possible factors were replicate, block nested within replicate, row, and column. The BLUP values were used to rank the performance of the accessions along with the IRRI and local checks grown in each location. The three IRRI checks used in this study were IR 64, IR 72 and IRRI 154, which are high yielding varieties released in the years 1985, 1988 (Peng and Khush 2003) and 2010, respectively. The local checks are specific to the regions and are selected by the partners conducting the trial. BASMATI 370 and IRRI 215 in Kenya; SUPA, SUPA BC and TXD 306 in Tanzania; MTU 1010, IGKV-R1, CG-Deobhog, CR Dhan-304, CR Dhan-307, ARIZE 6444 Gold, SWARNA, IRRI 216 in India were the local check entries in the trials. For the Philippines, the global checks also served as the local checks.

Availability of Data and Materials

The datasets supporting the conclusions of this article are included within the article and its additional files.

Abbreviations

BLUP:: Best linear unbiased predictions
BV:: Breeding values
ECP:: Elite core panel
EqG:: Equivalent complete generation
MAS:: Marker assisted selection
Ne:: Effective population
NPT:: New plant type
RYMV:: Rice yellow mottle virus
SSD:: Single seed descent

References

Aboul-Maaty NA-F, Oraby HA-S (2019) Extraction of high-quality genomic DNA from different plant orders applying a modified CTAB-based method. Bull Natl Res Centre 43:25. https://doi.org/10.1186/s42269-019-0066-1
Article Google Scholar
Akdemir D, Sánchez JI (2016) Efficient breeding by genomic mating. Front Genet 7:210. https://doi.org/10.3389/fgene.2016.00210
Article PubMed PubMed Central CAS Google Scholar
Akdemir D, Beavis W, Fritsche-Neto R et al (2019) Multi-objective optimized genomic breeding strategies for sustainable food improvement. Heredity 122:672–683. https://doi.org/10.1038/s41437-018-0147-1
Article PubMed CAS Google Scholar
Arbelaez JD, Dwiyanti MS, Tandayu E et al (2019) 1k-RiCA (1K-Rice Custom Amplicon) a novel genoty** amplicon-based SNP assay for genetics and breeding applications in rice. Rice 12:55. https://doi.org/10.1186/s12284-019-0311-0
Article PubMed PubMed Central Google Scholar
Atlin GN, Cairns JE, Das B (2017) Rapid breeding and varietal replacement are critical to adaptation of crop** systems in the develo** world to climate change. Glob Food Secur 12:31–37. https://doi.org/10.1016/j.gfs.2017.01.008
Article Google Scholar
Bates D, Vazquez A (2014) Pedigreemm: pedigree-based mixed-effects models. R package version 0.3-3. https://CRAN.Rproject.org/package=pedigreemm
Bates D, Mächler M, Bolker B, Walker S (2015) Fitting linear mixed-effects models using lme4. J Stat Softw 67:1–48. https://doi.org/10.18637/jss.v067.i01
Article Google Scholar
Bernardo R (2020) Reinventing quantitative genetics for plant breeding: something old, something new, something borrowed, something BLUE. Heredity 125:375–385. https://doi.org/10.1038/s41437-020-0312-1
Article PubMed PubMed Central Google Scholar
Boichard D, Maignel L, Verrier E (1997) The value of using probabilities of gene origin to measure genetic variability in a population. Genet Sel Evol 29:5–23. https://doi.org/10.1186/1297-9686-29-1-5
Article PubMed Central Google Scholar
Bonman J, Vergel de Dios T, Khin M (1986) Physiologic specialization of Pyricularia oryzae in the Philippines. Plant Disease 70(8):767–769
Article Google Scholar
Bradbury PJ, Zhang Z, Kroon DE et al (2007) TASSEL: software for association map** of complex traits in diverse samples. Bioinformatics 23:2633–2635. https://doi.org/10.1093/bioinformatics/btm308
Article PubMed CAS Google Scholar
Brar D, Khush G (2002) Transferring genes from wild species into rice. In: Kang MS (ed) Quantitative genetics, genomics, and plant breeding. CABI, Boston, p 197
Chapter Google Scholar
Brar DS, Khush GS (2018) Wild relatives of rice: a valuable genetic resource for genomics and breeding research. In: Mondal TK, Henry RJ (eds) The wild Oryza genomes. Springer International Publishing, Cham, pp 1–25
Google Scholar
Breeding4Results (2021) Breeding4Results (B4R). https://riceinfo.atlassian.net/wiki/spaces/ABOUT/pages/326172737/Breeding4Results+B4R. Accessed 10 July 2021
Breseghello F, de Morais OP, Pinheiro PV et al (2011) Results of 25 years of upland rice breeding in Brazil. Crop Sci 51:914–923. https://doi.org/10.2135/cropsci2010.06.0325
Article Google Scholar
Browning SR, Browning BL (2012) Identity by descent between distant relatives: detection and applications. Annu Rev Genet 46:617–633. https://doi.org/10.1146/annurev-genet-110711-155534
Article PubMed CAS Google Scholar
Bruskiewich RM, Cosico AB, Eusebio W et al (2003) Linking genotype to phenotype: the International Rice Information System (IRIS). Bioinformatics 19:i63–i65. https://doi.org/10.1093/bioinformatics/btg1006
Article PubMed Google Scholar
Cassman KG (1994) Breaking the yield barrier: proceedings of a workshop on rice yield potential in favorable environments, IRRI, 29 November–4 December 1993. International Rice Research Institute, Los Baños
Google Scholar
Chandler RF (1982) An adventure in applied science: a history of the International Rice Research Institute. IRRI, Dhaka
Google Scholar
Cobb JN, Biswas PS, Platten JD (2019a) Back to the future: revisiting MAS as a tool for modern plant breeding. Theor Appl Genet 132:647–667. https://doi.org/10.1007/s00122-018-3266-4
Article PubMed CAS Google Scholar
Cobb JN, Juma RU, Biswas PS et al (2019b) Enhancing the rate of genetic gain in public-sector plant breeding programs: lessons from the breeder’s equation. Theor Appl Genet. https://doi.org/10.1007/s00122-019-03317-0
Article PubMed PubMed Central Google Scholar
Collard B, Beredo J, Lenaerts B et al (2017) Revisiting rice breeding methods—evaluating the use of rapid generation advance (RGA) for routine rice breeding. Plant Prod Sci 20:1–16. https://doi.org/10.1080/1343943X.2017.1391705
Article Google Scholar
Collard B, Gregorio B, Thomson M, Islam M (2019) Transforming rice breeding: re-designing the irrigated breeding pipeline at the International Rice Research Institute (IRRI). Crop Breed Genet Genom. https://doi.org/10.20900/cbgg20190008
Article Google Scholar
Covarrubias-Pazaran G (2016) Genome-assisted prediction of quantitative traits using the R package sommer. PLoS ONE 11:e0156744. https://doi.org/10.1371/journal.pone.0156744
Article PubMed PubMed Central CAS Google Scholar
Damesa TM, Möhring J, Worku M, Piepho H-P (2017) One step at a time: stage-wise analysis of a series of experiments. Agron J 109:845–857. https://doi.org/10.2134/agronj2016.07.0395
Article Google Scholar
Falconer DS, Mackay TFC (1996) Introduction to Quantitative Genetics. Pearson Education Limited-Prentice Hall, Essex
Google Scholar
Garnett T, Appleby MC, Balmford A et al (2013) Sustainable intensification in agriculture: premises and policies. Science 341:33–34. https://doi.org/10.1126/science.1234485
Article PubMed CAS Google Scholar
Garrick DJ (2010) An animal breeding approach to the estimation of genetic and environmental trends from field populations1. J Anim Sci 88:E3–E10. https://doi.org/10.2527/jas.2009-2329
Article PubMed CAS Google Scholar
Garrick DJ, Taylor JF, Fernando RL (2009) Deregressing estimated breeding values and weighting information for genomic regression analyses. Genet Sel Evol 41:55. https://doi.org/10.1186/1297-9686-41-55
Article PubMed PubMed Central Google Scholar
Global Rice Science Partnership (2013) Rice Almanac: source book for one of the most important economic activities on earth, 4th edn. International Rice Research Institute, Los Baños
Google Scholar
Godfray HCJ (2014) The challenge of feeding 9–10 billion people equitably and sustainably. J Agric Sci 152:2–8. https://doi.org/10.1017/S0021859613000774
Article Google Scholar
Goto M (2012) Fundamentals of bacterial plant pathology. Academic Press, London
Google Scholar
Grenier C, Cao T-V, Ospina Y et al (2015) Accuracy of genomic selection in a rice synthetic population developed for recurrent selection breeding. PLoS ONE 10:e0136594. https://doi.org/10.1371/journal.pone.0136594
Article PubMed PubMed Central CAS Google Scholar
Guimaraes EP (2009) Rice breeding. In: Carena MJ (ed) Cereals. Springer, Berlin, pp 99–126
Chapter Google Scholar
Gutiérrez JP, Cervantes I, Molina A et al (2008) Individual increase in inbreeding allows estimating effective sizes from pedigrees. Genet Sel Evol 40:359–378. https://doi.org/10.1186/1297-9686-40-4-359
Article PubMed PubMed Central Google Scholar
Hill WG (1981) Estimation of effective population size from data on linkage disequilibrium. Genet Res 38:209–216. https://doi.org/10.1017/S0016672300020553
Article Google Scholar
IRRI (2013) Standard evaluation system for rice. International Rice Research Institute, Philippines
Google Scholar
Jia Y, Zhou E, Lee S, Bianco T (2016) Coevolutionary dynamics of rice blast resistance gene Pi-ta and Magnaporthe oryzae Avirulence gene AVR-Pita 1. Phytopathology 106:676–683. https://doi.org/10.1094/PHYTO-02-16-0057-RVW
Article PubMed CAS Google Scholar
Khush GS (2001) Green revolution: the way forward. Nat Rev Genet 2:815–822. https://doi.org/10.1038/35093585
Article CAS PubMed Google Scholar
Khush GS (2005) What it will take to Feed 5.0 billion rice consumers in 2030. Plant Mol Biol 59:1–6. https://doi.org/10.1007/s11103-005-2159-5
Article PubMed CAS Google Scholar
Kumar A, Raman A, Yadav S et al (2021) Genetic gain for rice yield in rainfed environments in India. Field Crop Res 260:107977. https://doi.org/10.1016/j.fcr.2020.107977
Article Google Scholar
Leroy G, Mary-Huard T, Verrier E et al (2013) Methods to estimate effective population size using pedigree data: examples in dog, sheep, cattle and horse. Genet Sel Evol 45:1. https://doi.org/10.1186/1297-9686-45-1
Article PubMed PubMed Central Google Scholar
Li H, Handsaker B, Wysoker A et al (2009) The sequence alignment/map format and SAMtools. Bioinformatics 25:2078–2079. https://doi.org/10.1093/bioinformatics/btp352
Article PubMed PubMed Central CAS Google Scholar
Li J-Y, Wang J, Zeigler RS (2014) The 3000 rice genomes project: new opportunities and challenges for future rice research. GigaScience 3:2047–217X. https://doi.org/10.1186/2047-217X-3-8
Article CAS Google Scholar
Li H, Rasheed A, Hickey LT, He Z (2018) Fast-forwarding genetic gain. Trends Plant Sci 23:184–186. https://doi.org/10.1016/j.tplants.2018.01.007
Article PubMed CAS Google Scholar
Mackill DJ, Khush GS (2018) IR64: a high-quality and high-yielding mega variety. Rice 11:18. https://doi.org/10.1186/s12284-018-0208-3
Article PubMed PubMed Central Google Scholar
Mansueto L, Fuentes RR, Borja FN et al (2017) Rice SNP-seek database update: new SNPs, indels, and queries. Nucleic Acids Res 45:D1075–D1081. https://doi.org/10.1093/nar/gkw1135
Article PubMed CAS Google Scholar
McCouch SR, Wright MH, Tung C-W et al (2016) Open access resources for genome-wide association map** in rice. Nat Commun 7:10532. https://doi.org/10.1038/ncomms10532
Article PubMed PubMed Central CAS Google Scholar
McLaren CG, Bruskiewich RM, Portugal AM, Cosico AB (2005) The International Rice Information System. A platform for meta-analysis of rice crop data. Plant Physiol 139:637–642. https://doi.org/10.1104/pp.105.063438
Article PubMed PubMed Central CAS Google Scholar
Money D, Gardner K, Migicovsky Z et al (2015) LinkImpute: fast and accurate genotype imputation for nonmodel organisms. G3 (bethesda) 5:2383–2390. https://doi.org/10.1534/g3.115.021667
Article Google Scholar
Morais Júnior OP, Breseghello F, Duarte JB et al (2017) Effectiveness of recurrent selection in irrigated rice breeding. Crop Sci 57:3043–3058. https://doi.org/10.2135/cropsci2017.05.0276
Article Google Scholar
Nyine M, Wang S, Kiani K et al (2019) Genotype imputation in winter wheat using first-generation haplotype map SNPs improves genome-wide association map** and genomic prediction of traits. G3 Genes Genomes Genet 9:125–133. https://doi.org/10.1534/g3.118.200664
Article CAS Google Scholar
Peng S, Khush G (2003) Four decades of breeding for varietal improvement of irrigated lowland rice in the International Rice Research Institute. Plant Prod Sci 6:157–164. https://doi.org/10.1626/pps.6.157
Article Google Scholar
Peng S, Cassman KG, Virmani SS et al (1999) Yield potential trends of tropical rice since the release of IR8 and the challenge of increasing rice yield potential. Crop Sci 39:1552–1559. https://doi.org/10.2135/cropsci1999.3961552x
Article Google Scholar
Peng S, Laza RC, Visperas RM et al (2000) Grain yield of rice cultivars and lines developed in the Philippines since 1966. Crop Sci 40:307–314. https://doi.org/10.2135/cropsci2000.402307x
Article Google Scholar
Peng S, Huang J, Sheehy JE et al (2004) Rice yields decline with higher night temperature from global warming. Proc Natl Acad Sci U S A 101:9971–9975. https://doi.org/10.1073/pnas.0403720101
Article PubMed PubMed Central CAS Google Scholar
Peng S, Khush GS, Virk P et al (2008) Progress in ideotype breeding to increase rice yield potential. Field Crop Res 108:32–38
Article Google Scholar
Piepho HP, Möhring J, Melchinger AE, Büchse A (2008) BLUP for phenotypic selection in plant breeding and variety testing. Euphytica 161:209–228. https://doi.org/10.1007/s10681-007-9449-8
Article Google Scholar
Piepho H-P, Möhring J, Pflugfelder M et al (2015) Problems in parameter estimation for power and AR(1) models of spatial correlation in designed field experiments. Commun Biom Crop Sci 10:3–16
Google Scholar
Platten JD, Cobb JN, Zantua RE (2019) Criteria for evaluating molecular markers: comprehensive quality metrics to improve marker-assisted selection. PLoS ONE 14:e0210529. https://doi.org/10.1371/journal.pone.0210529
Article PubMed PubMed Central CAS Google Scholar
Portugal A, Balachandra R, Metz T et al (2007) International crop information system for germplasm data management. Methods Mol Biol 406:459–471. https://doi.org/10.1007/978-1-59745-535-0_22
Article PubMed CAS Google Scholar
R Core Team (2018) R: A Language and Environment for Statistical Computing. URL http://www.R-project.org/, R Foundation for Statistical Computing, Vienna, Austria
Robertson A (1960) A theory of limits in artificial selection. Proceedings of the Royal Society of London. Ser B. Biol Sci 153(951):234–249
Rutkoski JE (2018) Estimation of realized rates of genetic gain and indicators for breeding program assessment. bioRxiv 409342. https://doi.org/10.1101/409342
Shanti ML, Shenoy VV, Devi GL et al (2010) Marker-assisted breeding for resistance to bacterial leaf blight in popular cultivar and parental lines of hybrid rice. J Plant Pathol 92:495–501
CAS Google Scholar
Silvern S, Young S (2013) Environmental change and sustainability. BoD—Books on Demand, Norderstedt
Book Google Scholar
Smith AB, Cullis BR (2018) Plant breeding selection tools built on factor analytic mixed models for multi-environment trial data. Euphytica 214:143. https://doi.org/10.1007/s10681-018-2220-5
Article Google Scholar
Spilke J, Richter C, Piepho HP (2010) Model selection and its consequences for different split-plot designs with spatial covariance and trend. Plant Breed 129:590–598. https://doi.org/10.1111/j.1439-0523.2010.01795.x
Article Google Scholar
Sun C, Hu Z, Zheng T et al (2017) RPAN: rice pan-genome browser for ∼ 3000 rice genomes. Nucleic Acids Res 45:597–605. https://doi.org/10.1093/nar/gkw958
Article PubMed CAS Google Scholar
Sved JA (1971) Linkage disequilibrium and homozygosity of chromosome segments in finite populations. Theor Popul Biol 2:125–141. https://doi.org/10.1016/0040-5809(71)90011-6
Article PubMed CAS Google Scholar
Telebanco-Yanoria MJ, Ohsawa R, Senoo S et al (2008) Diversity analysis for resistance of rice (Oryza sativa L.) to blast disease [Magnaporthe grisea (Hebert) Barr.] using differential isolates from the Philippines. Plant Breed 127:355–363. https://doi.org/10.1111/j.1439-0523.2008.01497.x
Article Google Scholar
Tu J, Datta K, Khush GS, Zhang Q, Datta SK (2000) Field performance of Xa21 transgenic indica rice (Oryza sativa L.), IR72. Theor Appl Genet 101(1):15–20
Article CAS Google Scholar
United Nations UN (2019) Growing at a slower pace, world population is expected to reach 9.7 billion in 2050 and could peak at nearly 11 billion around 2100. United Nations Department of Economic and Social Affairs, New York
Google Scholar
Walsh B (2003) Population- and quantitative-genetic models of selection limits. Plant Breed Rev 24:177–225
Google Scholar
Wang W, Mauleon R, Hu Z et al (2018) Genomic variation in 3010 diverse accessions of Asian cultivated rice. Nature 557:43–49. https://doi.org/10.1038/s41586-018-0063-9
Article PubMed PubMed Central CAS Google Scholar
Wang X, Su G, Hao D et al (2020) Comparisons of improved genomic predictions generated by different imputation methods for genoty** by sequencing data in livestock populations. J Anim Sci Biotechnol 11:3. https://doi.org/10.1186/s40104-019-0407-9
Article PubMed PubMed Central CAS Google Scholar
Warburton ML, Ribaut JM, Franco J et al (2005) Genetic characterization of 218 elite CIMMYT maize inbred lines using RFLP markers. Euphytica 142:97–106. https://doi.org/10.1007/s10681-005-0817-y
Article CAS Google Scholar
Wei X, Qiu J, Yong K et al (2021) A quantitative genomics map of rice provides genetic insights and guides breeding. Nat Genet 53:243–253. https://doi.org/10.1038/s41588-020-00769-9
Article PubMed CAS Google Scholar
Wen W, Franco J, Chavez-Tovar VH et al (2012) Genetic characterization of a core set of a tropical maize race Tuxpeño for further use in maize improvement. PLoS ONE 7:e32626. https://doi.org/10.1371/journal.pone.0032626
Article PubMed PubMed Central CAS Google Scholar
Wickham H (2016) ggplot2: elegant graphics for data analysis, 2nd edn. Springer International Publishing, Cham
Book Google Scholar
Wing RA, Purugganan MD, Zhang Q (2018) The rice genome revolution: from an ancient grain to Green Super Rice. Nat Rev Genet 19:505–517. https://doi.org/10.1038/s41576-018-0024-z
Article PubMed CAS Google Scholar
**e W, Wang G, Yuan M et al (2015) Breeding signatures of rice improvement revealed by a genomic variation map from a large germplasm collection. PNAS 112:E5411–E5419. https://doi.org/10.1073/pnas.1515919112
Article PubMed PubMed Central CAS Google Scholar
Xu Y, Li P, Zou C et al (2017) Enhancing genetic gain in the era of molecular breeding. J Exp Bot 68:2641–2666. https://doi.org/10.1093/jxb/erx135
Article PubMed CAS Google Scholar
Yadi R, Heravan IM, Heidari Sharifabad H (2021) Identifying the superior traits for selecting the ideotype of rice cultivars. Cereal Res Commun. https://doi.org/10.1007/s42976-020-00088-z
Article Google Scholar

Download references

Acknowledgements

Special thanks goes to the irrigated rice team at IRRI: Rose Imee Zhella, Romnick Javier, Joel Evangelista and Cesar Manalang for their support in conducting and obtaining data reported in this paper. Special gratitude to various groups at IRRI HQ including Pathology team led by Ricardo Oliva; the database team led by Marko Karkkainen and members of the Genoty** service laboratory at IRRI who were supportive and instrumental in obtaining disease stress data, data management and storage; and genoty** service procurement, respectively. Thanks to the IRRI Breeding Zones teams and Collaborators in South and South East Asia (Md Ariful Islam, M Mobarak Hossain, Rameswar Prasad Sah, Ramlakhan Verma, Sunil Nair, Girish Chandel, Narne Chamundeswari, Revathi Ponnusamy); Est and Southern Africa (Oliver Nyongesa, Simon Njau Kariuki, Rehema Kwayu, Mohamed Mkuya and Ruth Musila) for their support in obtaining ECP data from breeding zones. We thank Hans Bhardwaj (present Global Lead—Rice Breeding) for his support and team building that contributed to the success of the wider trials conducted and development of this paper.

Funding

The Bill and Melinda Gates Foundation: Stress Tolerant Rice for Africa and South Asia (STRASA), Transformation of Rice Breeding (TRB) in South Asia and Sub-Saharan Africa and Accelerated Genetic Gain in Rice (AGGRi) Alliance projects sponsored and funded this work (Grant Nos. OPP1194925, OPP1088843). The funders of the study had no role in study design, data collection, data analysis, data interpretation, or writing of the article. The corresponding authors had full access to all the data in the study and had final responsibility for the decision to submit for publication.

Author information

Roselyne U. Juma
Present address: Kenya Agricultural and Livestock Research Organization, 50100-169, Kakamega, Kenya
Partha S. Biswas
Present address: Plant Breeding Division, Bangladesh Rice Research Institute (BRRI), Gazipur, Bangladesh
Jessica E. Rutkoski & Juan D. Arbelaez
Present address: University of Illinois at Urbana-Champaign, Urbana, USA, Illinois
Joshua N. Cobb
Present address: RiceTec. Inc, PO Box 1305, Alvin, TX, 77512, USA

Authors and Affiliations

Rice Breeding Innovations Platform, International Rice Research Institute, 1301 Los Baños, Metro, DAPO Box 7777, Manila, Philippines
Roselyne U. Juma, Jérôme Bartholomé, Parthiban Thathapalli Prakash, Waseem Hussain, John D. Platten, Vitaliano Lopena, Holden Verdeprado, Rosemary Murori, Alexis Ndayiragije, Sanjay Kumar Katiyar, Md Rafiqul Islam, Partha S. Biswas, Jessica E. Rutkoski, Juan D. Arbelaez & Joshua N. Cobb
AGAP Institut, CIRAD, INRA, Montpellier SupAgro, Univ Montpellier, Montpellier, France
Jérôme Bartholomé
International Rice Research Institute (IRRI) C/O ILRI, Old Naivasha Road, PO Box 30709, 00100, Nairobi, Kenya
Rosemary Murori
Institiuto de Investigação de Moçambique (IIAM), Av. das FPLM nr 2698, Recinto do IIAM, Maputo, Mozambique
Alexis Ndayiragije
International Rice Research Institute, South Asia Hub, ICRISAT, Hyderabad, 502324, India
Sanjay Kumar Katiyar
Bangladesh Office, International Rice Research Institute (IRRI), Dhaka, Bangladesh
Md Rafiqul Islam
Department of Plant Science and Crop Protection, University of Nairobi, PO Box 29053, 00625, Kangemi, Kenya
Felister N. Mbute & Douglas W. Miano

Authors

Roselyne U. Juma
View author publications
You can also search for this author in PubMed Google Scholar
Jérôme Bartholomé
View author publications
You can also search for this author in PubMed Google Scholar
Parthiban Thathapalli Prakash
View author publications
You can also search for this author in PubMed Google Scholar
Waseem Hussain
View author publications
You can also search for this author in PubMed Google Scholar
John D. Platten
View author publications
You can also search for this author in PubMed Google Scholar
Vitaliano Lopena
View author publications
You can also search for this author in PubMed Google Scholar
Holden Verdeprado
View author publications
You can also search for this author in PubMed Google Scholar
Rosemary Murori
View author publications
You can also search for this author in PubMed Google Scholar
Alexis Ndayiragije
View author publications
You can also search for this author in PubMed Google Scholar
Sanjay Kumar Katiyar
View author publications
You can also search for this author in PubMed Google Scholar
Md Rafiqul Islam
View author publications
You can also search for this author in PubMed Google Scholar
Partha S. Biswas
View author publications
You can also search for this author in PubMed Google Scholar
Jessica E. Rutkoski
View author publications
You can also search for this author in PubMed Google Scholar
Juan D. Arbelaez
View author publications
You can also search for this author in PubMed Google Scholar
Felister N. Mbute
View author publications
You can also search for this author in PubMed Google Scholar
Douglas W. Miano
View author publications
You can also search for this author in PubMed Google Scholar
Joshua N. Cobb
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

RUJ, JB and JNC designed the study. RUJ, RM, AN, SJ, RMI, VL and HV supervise screen house and field experiments. RUJ, JB, PTP, WH, PSB, JDP, JER and JDA performed the analyses. RUJ, JB and JNC wrote the manuscript with the help from PTP, WH, and JER. FNB, DWM, JB and JNC supervised and provided intellectual guidance. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Jérôme Bartholomé or Joshua N. Cobb.

Ethics declarations

Ethics Approval and Consent to Participate

Not applicable.

Consent for Publication

Not applicable.

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1.

List of all lines 15,286 of the irrigated breeding program selected for the analysis of genetic gain. The following information are available: designation (name of the lines), GID (identification number), parentage, year of the initial cross, type of entry (breeding lines or IRRI released varieties), ECP (belong or not to the elite core panel), equivalent complete generation (EqG), breeding value for yield corrected from flowering time. Table S2. Genotypic data for 66 ECP and 10 IRRI varieties from the breeding program on 703 SNP from the 1k-RiCA platform. The data are in hapmap format. Table S3. Information on the 37 high-value genes related to abiotic stresses. The favorable allele frequency is reported for 33 genes which had sufficient data. Table S4. Combined genotypic data of ECP and 3K-RG indica accessions formatted as a dosage matrix with SNP genotypes coded as 0, 0.5, 1. Table S5. Results of the evaluation of the elite core panel against blast disease. The average infection score to five highly virulent Magnaporthe oryzae isolates measured in controlled conditions are reported along with the favorable alleles (if any) for each line. The resistant and susceptible checks are also included. Table S6. Results of the evaluation of the elite core panel against bacterial leaf blight. The average average lesion length (based on six plants) to fourteen isolates of Xanthomonas oryzae pv. oryzae measured in controlled conditions are reported along with the favorable alleles (if any) for each line. The resistant and susceptible checks are also included. Table S7. Studies conducted in the target regions for IRRI's irrigated rice breeding program to evaluate the performance of the elite core panel (ECP) lines in 2019.

Additional file 2: Fig. S1.

Evolution of equivalent complete generation (EqG) of parental lines used between 1985 and 2014 for IRRI’s breeding program for irrigated systems. Fig. S2. Distribution of the infection score (0 = highly resistant to 5 = hihgly susceptible) for five isolates of blast (Magnaporthe oryzae). The elite core panel (ECP) lines are in grey, the susceptible checks (Lijiangxintuanheigu and CO 39) are in orange and the resistant checks (IRBLta2-Pi, IRBLSH-B, IRBLkm-Ts and IRBLKh-K3) in green. Fig. S3. Scatter plots and rank correlations between blast isolates using infection scores. Fig. S4. Distribution of the lesion length of elite core panel (ECP) lines after controlled inoculation with Xanthomonas oryzae pv. Oryzae. Fourteen different isolates were used to assess the level of resistance of the ECP lines. The ECP is in grey, the susceptible check (IR 24) is in orange and the resistant checks (IRBB23, IRBB60 and IRBB62) in green. Fig. S5. Scatter plots and rank correlations between bacterial leaf blight isolates using the average lesion length for each genotype. Fig. S6. Correlation matrix for grain yield (A) and time to flowering (B) for all the environments where the ICP lines have been evaluated.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Juma, R.U., Bartholomé, J., Thathapalli Prakash, P. et al. Identification of an Elite Core Panel as a Key Breeding Resource to Accelerate the Rate of Genetic Improvement for Irrigated Rice. Rice 14, 92 (2021). https://doi.org/10.1186/s12284-021-00533-5

Download citation

Received: 22 August 2021
Accepted: 29 October 2021
Published: 13 November 2021
DOI: https://doi.org/10.1186/s12284-021-00533-5

Identification of an Elite Core Panel as a Key Breeding Resource to Accelerate the Rate of Genetic Improvement for Irrigated Rice

Abstract

Similar content being viewed by others

Genetic gains in IRRI’s rice salinity breeding and elite panel development as a future breeding resource

Effectiveness of early-generation testing applied to upland rice breeding

Unravelling genetic architecture and development of core set from elite rice lines using yield-related candidate gene markers

Introduction

Results

Estimation of the Genetic Gain for Grain Yield

Retrospective Analysis of Crosses

Defining the Elite Core Panel (ECP)

Genetic Characterization of the ECP

Genetic Diversity Captured by the Elite Core Panel

Expected Performance of the ECP Across Target Environments

Conclusion

Materials and Methods

Historical Yield Data

Experimental Studies and Pedigree Information

Estimating Breeding Values for Lines

Assessment of Rate of Genetic Gain

Retrospective Analysis of Crosses

Formulation of the Elite Core Panel

Genetic Characterization of the ECP

DNA Extraction, Genoty** and SNP Filtering

Diversity of ECP and Its Relationship with 3K-RG, and Favorable Frequency Estimation

Field Based Evaluation and Data Analysis

Availability of Data and Materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics Approval and Consent to Participate

Consent for Publication

Competing Interests

Additional information

Publisher's Note

Supplementary Information

Additional file 1: Table S1.

Additional file 2: Fig. S1.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation