A new chicken 55K SNP genoty** array

Liu, Ranran; **ng, Siyuan; Wang, Jie; Zheng, Maiqing; Cui, Huanxian; Crooijmans, Richard P. M. A.; Li, Qinghe; Zhao, Gui**; Wen, Jie

doi:10.1186/s12864-019-5736-8

A new chicken 55K SNP genoty** array

Methodology article
Open access
Published: 22 May 2019

Volume 20, article number 410, (2019)
Cite this article

Download PDF

You have full access to this open access article

BMC Genomics Aims and scope Submit manuscript

A new chicken 55K SNP genoty** array

Download PDF

Ranran Liu¹^na1,
Gui** Zhao^1,3,4 &
…
Jie Wen ORCID: orcid.org/0000-0002-8842-6107^1,3,4

5341 Accesses
31 Citations
6 Altmetric
Explore all metrics

Abstract

Background

China has the richest local chicken breeding resources in the world and is the world’s second largest producer of meat-type chickens. Development of a moderate-density SNP array for genetic analysis of chickens and breeding of meat-type chickens taking utility of those resources is urgently needed for conventional farms, breeding industry, and research areas.

Results

Eight representative local breeds or commercial broiler lines with 3 pools of 48 individuals within each breed/line were sequenced and supplied the major SNPs resource. There were 7.09 million - 9.41 million SNPs detected in each breed/line. After filtering using multiple criteria such as preferred incorporation of trait-related SNPs and uniformity of distribution across the genome, 52.18 K SNPs were selected in the final array. It consists of: (i) 19.22 K SNPs from the genomes of yellow-feathered, cyan-shank partridge and white-feathered chickens; (ii) 5.98 K SNPs related to economic traits from the Illumina 60 K SNP Bead Chip, which were found as significant associated SNPs with 15 traits in a Bei**g-You crossed Cobb F2 resource population by genome-wide association study analysis; (iii) 7.63 K SNPs from 861 candidate genes of economic traits; (iv) the 0.94 K SNPs related to residual feed intake; and (v) 18.41 K from chicken SNPdb. The polymorphisms of 9 extra local breeds and 3 commercial lines were examined with this array, and 40 K - 47 K SNPs were polymorphic (with minor allele frequency > 0.05) in those breeds. The MDS result showed that those breeds can be clearly distinguished by this newly developed genoty** array.

Conclusions

We successfully developed a 55K genoty** array by using SNPs segregated from typical local breeds and commercial lines. Compared to the existing Affy 600 K and Illumina 60 K arrays, there were 21,41 K new SNPs included on our Affy 55K array. The results of the 55K genoty** data can therefore be imputed to high-density SNPs genoty** data. The array offers a wide range of potential applications such as genomic selection breeding, GWAS of interested traits, and investigation of diversity of different chicken breeds.

Genome-wide association study on chicken carcass traits using sequence data imputed from SNP array

Article 23 June 2018

High-throughput and Cost-effective Chicken Genoty** Using Next-Generation Sequencing

Article Open access 25 May 2016

Genome-wide association study of body weight in Wenshang Barred chicken based on the SLAF-seq technology

Article 27 June 2018

Background

With a total of 107 chicken breeds, China has one of the richest local breed resources [1]. This diverse chicken genetic resource is a vital part of the diversity of biological genetic resources around the world and provides excellent material for breeding new varieties or to genetically improve breed.

China is the second-largest broiler producer and consumer all over the world, which accounts for approximately 11% of the chicken production across the globe (FAOSTAT, 2017). In China, chicken is the second largest meat product after pork, making up to 17% of the total meat production. Chicken meat is mainly obtained from the introduced white feather broilers and domestic yellow-feathered meat-type chickens (meat-type local chicken breed, meat-type bred variety and a relevant strain containing the consanguinity of Chinese indigenous chicken), each accounting for half of the consumption. However, the current challenge is how to effectively protect and maintain the existing local varieties. On the other hand, if breeding efficiency is promoted, new chicken lines breeding would be accelerated. The genome-wide SNP chip, also known as SNP array, arranges up to 25 million of DNA marker flanks on glass or special silicon chip to form the SNP probe array. It functions by means of the reaction of base pairing between the chip fixed DNA marker flanks with the target genome, so as to accurately identify the genetic information.

The genoty** arrays have been developed for pig [2], cow [3], dairy cattle [4], sheep [5], salmon [6], and buffalo [7] et al. In chicken, the first 3 K genoty** array was developed in 2005 with 3072 SNPs [8]. After that, in 2008, Groenen et al. did develop a 60 K bead chip for chicken which evenly covered the whole genome [9]. To date, the only available commercial arrays for chicken is Chicken the Affy 600 K SNP Array (Axiom Genome-Wide Chicken Genoty** Array), which was developed by Kranis et al [10]. The other arrays are privately owned by commercial companies. The array supplied an important tool for the genetic diversity analysis, breeds relationship analysis, GWAS, quantitative character positioning analysis of QTL, selective evolution investigation, and Genomic Selection [11]. Up till now the most efficient ways for SNP genoty**, biodiversity measuring, QTL map** and genomic selection is using SNP arrays. These applications provide improved technical support for the conservation of indigenous breeds and development of new genetic lines/breeds.

One pitfall of all current chicken SNP arrays is the bias towards western commercial lines. The current chicken arrays, however, lack the genomic variation information of Chinese indigenous breeds. Therefore, it is imperative to develop a new type of genome-wide SNP chip with moderate flux in the chicken breeding industry, and also contains the genetic variation information specific to Chinese indigenous breeds. Overlap with the current arrays of the different platforms (Axiom and Illumina) is essential to link the commercial SNP arrays.

Through whole genome re-sequencing of a variety of Chinese native breeds and commercial chicken lines, integrating SNPs associated with economic traits detected in a crossing breed (either indigenous and commercial), a new public available moderate density (55 K) chicken array (IASCHICK) has been developed.

Results

The SNPs selection was performed in four groups. The roadmap is shown in Fig. 1, and the establishment of the four groups are indicated in the following paragraphs.

Genome re-sequencing of chickens supplying the first SNP group

Eight Chinese local chicken breeds or inbred lines were selected for whole genome sequencing. Each breed/line holds 3 pools of 16 individuals per library without individual barcodes (Table 1). The data summary of each library is provided in the Additional file 1. The number of SNPs per breed/line varied from 7.09 million to 9.41 million SNPs. The average number of detected SNPs was 8.61 M in the local lines, and 7.73 M in the commercial broilers. The total number of SNPs detected overall 8 breeds/lines was 15.2 M. The SNPs with minor allele frequency (MAF) < 0.05 and with low ΔF were excluded for further steps. The 140 K SNPs, which allelic frequencies distinct to the control breeds, were subsequently used as the first group of candidate SNPs.

Table 1 Sequenced chickens and the number of SNPs detected from different breeds

Full size table

Selection of the second group of candidate SNPs based on the GWAS of 15 traits

The 7.42 K SNPs were demonstrated to have the top 1% genome-wide significance in 15 traits and were selected as the second group of SNPs. The details are shown in Additional file 2.

Selection of the third group of candidate SNPs based on the genes associated with economic traits

SNPs in the regions of 861 candidate genes related to economic traits were used according to previous studies of gene/protein expression profiles. A total of 66.37 K SNPs in 383 genes for breast muscle and intramuscular fat development in embryonic and post-hatching periods [The fourth group of candidate SNPs are derived from whole genome sequences of low- and high-RFI chickens

Whole genome sequencing of low- and high-RFI chickens were performed to locate the genomic variants for RFI based on differences in allelic frequency between high- and low-RFI chickens as described in our previous study [Designing the Affy 55K genoty** array

Based on the above four groups of candidate SNPs, a custom-made algorithm was used to fix the final array. Finally, 52,184 SNPs were selected for the final array. The mean physical distance of SNPs in each involved chromosome shows in Table 2. The priority 1 SNPs (the SNPs in group 2, 3 and 4) and 25 INDELs were first placed on the final SNP panel. The next step was addition of the priority 2 SNPs (the SNPs in group 1). The remaining 18.41 K SNPs was selected for the blank windows in the whole chicken genome which the SNPs in the four groups cannot be covered.

Table 2 The number of SNPs of the 55K array on each chromosome and their distance^a

Full size table

The SNPs positions of 55K array were given in Additional file 5. The selected SNPs were derived from the following five groups (Table 3): (i) 19.2 K SNPs from whole genome sequencing of the eight chicken breeds/lines; (ii) 7.42 K trait-related SNPs from the Illumina 60 K SNP Bead Chip, which were found as SNPs significantly associated with 15 economic traits; (iii) 15.98 K SNPs from 861 candidate genes of target traits and high IgY level related region; (iv) 4.32 K SNPs related to chicken RFI; and (v) 18.41 K from chicken SNPdb. In the final genoty** array, 99.85% of SNPs could be annotated (Table 4). The distribution of SNPs on the chromosomes is shown in Fig. 2.

Table 3 The number of SNPs from five candidate groups in the final 55K array

Full size table

Table 4 Summary of the SNPs effect prediction in 55K array

Full size table

The comparisons of the Affy 55K array with the existing chicken arrays (Affy 600 K array, and Illumina 60 K)

All the SNPs of this 55K array, Affy 600 K array [10], and Illumina 60 K array [9] were mapped to the latest chicken genome (GRCg6a). The overlap of the 3 arrays is shown in Fig. 3. There are 6740 SNPs (13%) which overlap between the Affy 55K array and the Illumina 60 K array. When comparing to the Affy 600 K array, there are 24,227 SNPs that overlap between the 55K array which accounts for 46%. There were 21,412 new SNPs included in 55K array compared to the existing arrays.

Validation of the 55K array in 13 chicken breeds/lines

All samples from 10 Chinese local breeds (Chahua, Dagu, Liyang, Luhua, Qingyuan, Silkie, Wenchang, Bai’er, ** data to the high-density SNPs genoty** data is possible. In the new 55K genoty** array, 69% of SNPs are within genes (non-intergenic variant), the proportion is higher than the proportion in the Affy 600 K array (54%), and lower than the proportion in Illumina 60 K array (86%).

To investigate the ability of our 55K panel to detect polymorphisms and population structure in local or commercial breeds/lines. Nine Chinese local breeds (Chahua, Dagu, Liyang, Luhua, Qingyuan, Silkie, Wenchang, Bai’er, and ** array can be used to determine genetic variation both in various local Chinese breeds and in commercial meat-type and egg-type breeds.

According to the results of MDS analysis (Fig. 4), individuals originating from the commercial broilers, Hubbard and Cobb clustered together tightly and the two Chinese indigenous egg-type breeds, ** array. The 55K array has a medium SNPs density, cost-efficient, and optimal for Chinese local breeds compared with the existing 600 K commercial array. Furthermore, the 55K genoty** array incorporated known SNPs loci that possess a high potential for association with economic traits and traits that are expensive and difficult to measure, which will be interesting for both GWAS and genomic selection (GS) projects.

With the rapid development of next-generation sequencing technologies and reduction of the costs, genoty** with re-sequencing (IBS) will be the focus of future research. In the current phase, however, the IBS system is more complex and not as solid as the SNP array. The array genotyped data can be easily analyzed and standardized according to constant array SNP positions. The batch effect can be excluded by different laboratories and companies.

Conclusions

In conclusion, we developed Affy 55K genoty** array that was designed to use SNPs that are segregated in Chines local chicken breeds and commercial lines/breeds, and where large number of SNPs are associated with economic traits. Compared to the existing Affy 600 K and Illumina 60 K arrays, 21,41 K new SNPs were included in the 55K SNP array. The results from the our Affy 55K genoty** array can be imputed to the high-density SNPs genoty** data. This array offers wide range of potential applications, such as the evaluation of germplasm resources of chicken breeds, investigation of diversity of different chicken breeds, implementation of genome-wide association studies and genomic selection.

Methods

Animals

For whole genome sequencing, the 384 chickens were sampled from eight local breeds or inbred lines (Table 1). Chickens were supplied by Institute of Animal Sciences in CAAS (local breed Bei**g-You, inbred **gxing-Huang line), Jiangsu Lihua Co. Ltd. (Cyan-shank Partridge lines with fast and mediate growth rates, respectively), Institute of Poultry Sciences of CAAS (Sanhuang chicken and Recessive White chicken), **nguang Nongmu Co. Ltd. (paternal and maternal line of Cobb in parental generation). In addition, a set of 15 to 21 chickens in each breed/line were used for SNP array evaluation, which were sampled from 9 local breeds and 3 commercial lines. Chickens were supplied by the Institute of Poultry Sciences of CAAS (Bai’er chicken, Chahua chicken, Dagu chicken, Liyang chicken, Qingyuan chicken, Silkie, Wenchang chicken, Luhua chicken and **anju chicken), **nguang Nongmu Co. Ltd. (paternal lines in parent generation from Cobb and Hubbard), the Institute of Animal Sciences of CAAS (White Leghorn). Two groups with 87 and 100 chickens from **gxing-Huang and Cobb were also used for SNP array evaluation. The blood samples used in this study were all collected from chickens under the veterinary supervision and the Guidelines for Experimental Animals established by the Ministry of Science and Technology (Bei**g, China), and with the approval of Animal Ethics Committee of the Institute of Animal Sciences. No anaesthesia or euthanasia methods were used. There was no evidence at health examination that any of the involved chickens had clinical diseases caused by the sampling.

Whole genome re-sequencing

Genomic DNA was isolated from blood samples by the phenol-chloroform method. Samples DNA quality were validated by gel electrophoresis and Nanophotometer. The individual DNA samples (48 from each breed/line) were pooled to construct three libraries, with each library containing 8 males and 8 females. The libraries were constructed using the Nextera DNA Library Preparation Kit (Illumina Inc., San Diego, CA) according to the manufacturer’s standard protocol. All libraries were sequenced on the Illumina Hiseq2500 (2 × 125 bp).

Genome sequence alignment and detection of the first group of candidate SNPs

Reads were filtered for low quality (> 10 consecutive nucleotides with Phred scores < 10), adaptor sequences, and sequences without a quality control-passed paired read using NGSQC toolkit (v2.3.3) [22]. Each trimmed pool sequencing coverage are shown in Table S5. Filtered sequenced reads were mapped to the reference genome (Gallus_gallus_4.0) by BWA software (v0.7.10) [23]. PCR duplications were removed with -rmdup argument in Samtools (version 0.1.1.18) [24]. SNPs were identified and genotyped for each data set with mpileup function in Samtools, then called by VarScan [25]. Only those highly confident variants supported by both methods were kept for downstream analyses. The SNPs calling details parameter were described by Liu et al [16]. The SNPs with MAF < 0.05 and the INDELs in each breed/line were filtered by vcftools [26]. In Bei**g-you chicken, **gxing-Huang chicken, Sanhuang chicken, and the two lines of cyan-shank partridges minus the MAFs of Cobb paternal line, as well as the MAFs of Recessive White chicken, and the paternal and maternal generation of Cobb minus the MAFs of Bei**g-You chicken, respectively. The SNPs with low ΔF were excluded. The value of ΔF was adjusted for 140 K SNPs reserved in local breeds and commercial lines to generate the first group of candidate SNPs. The threshold of △F in local breeds and commercial lines are 0.609 and 0.731, respectively. The SNPs acquired through genome re-sequencing of eight breeds/lines supplied the major data for the first group of SNPs in the array. SNPs specific for chromosome W were removed and were not considered in current designing. There are also 25 INDELs for special interest, which were defined as priory 1.

Selection of the second group of candidate SNPs based on GWAS analysis of 15 traits

The second group of candidate SNPs was selected according to a GWAS analysis of 15 traits. Phenotype and genotype data were generated from the CAAS chicken F2 resource population as described in Sun’s report [27]. Briefly, the population was derived from a cross between local Bei**g-You chickens and commercial Cobb broilers (Cobb-Vantress, Inc.). The weight, carcass, immune and meat quality traits were measured from 367 F2 chickens. The 15 traits were as follows, (a.) body weight of day 28 and day 42, (b.) carcass traits including total weight percentage after slaughtering, breast muscle weight percentage, leg muscle weight percentage, abdominal fat percentage, (c.) meat quality traits including the breast muscle intramuscular fat ratio, ultimate pH (24 h), meat lightness, redness value and yellowness value of breast muscle, (d.) immune traits including IgY level to sheep red blood cell, the heterophil and lymphocyte ratio, IgY level in serum, and the average red blood cell backlog.

SNPs were genotyped by using Illumina 60 K SNP Bead chip for chicken [9]. All description of the phenotypes had been reported by Sun et al. in 2013 [27]. To maximize the polymorphism resources for SNP array, the GLM procedures were used for the GWAS analysis and was performed by PLINK software (version 1.07) [28] with 42,585 SNPs passed quality control. The details were described by Sun et al. [27]. The SNPs with top 1% lowest p-values were used in the following procedures.

Selection of the third group of candidate SNP based on the associated genes for target traits

Known candidate genes for economic traits were collected and used for the SNP array design. All genes were identified through previous researches by our group [12, 13, 29, 30]. We retrieved total 861 genes related to skeletal muscle and intramuscular fat development, chicken fat metabolism, salmonella enteritidis resistance etc. (Additional file 2). The SNPs were annotated by the Ensembl tool VEP [31]. Mutations and the SNPs in the exons, splicing region, and UTRs were firstly selected out. A maximum of 5 candidate SNPs were selected out for each gene.

In addition, the SNPs in this group also included a batch SNPs detected from a set of capture sequencing of Chr. 11, Chr. 16, and Chr. 19 of White Leghorns and Bei**g-You chickens with low or high serum IgY (Liu et al., unpublished, Supplement Table S3).

Selection of the fourth group of candidate SNPs for RFI

The fourth group candidate SNPs were selected from a whole genomic re-sequencing research of low- and high- RFI Cobb and Bei**g-You chickens. SNPs calling results showed that 8,505,214 and 8,479,041 single nucleotide polymorphisms (SNPs) were detected in low- and high-RFI Bei**g-You chickens, respectively; 8,352,008 and 8,372,769 SNPs were detected in low- and high-RFI Cobb chickens, respectively. The SNPs with Fst value < 5% in each breed were excluded followed by SNPs with mean ΔF < 0.35 between low- and high-RFI chickens. Through the above filtering processes, 3.74 K SNPs assigned to 1137 candidate genes in Bei**g-You chickens and 0.58 K SNPs (448 genes) in Cobb chickens were reserved [16].

Selection of the SNPs from chicken SNPs database

The first four groups cannot cover the whole genome evenly. In the fifth group, SNPs were selected from chicken SNPs database from NCBI (ftp://ftp.ncbi.nih.gov/snp/organisms/archive/chicken_9031/).

SNP screening according to the scoring of probes

All the SNPs’ positions were transformed from WASHUC2.1 (Illumina 60 K), and Gallus_gallus-4.0 (Affy 600 K) to Gallus_gallus-5.0 (Affy 55 K) by the LiftOver tool on UCSC Genome Browser. Take utility of all SNPs from the five candidate groups above, in silico validation, was performed using the AxiomGTv1 algorithm of APT, which generated an output score file containing p-convert values, signifying the SNP array quality and list of recommended and non-recommended SNP probes. For a high-quality SNP array design, non-recommended SNP probes were all excluded in the following procedure.

SNPs selection procedure for the final 55K array

The final SNPs selection was done in multiple steps using several criteria. The roadmap is shown in Fig. 1.

A custom-made algorithm was applied as described below. According to the Gallus_gallus-5.0, the chicken genome length is about 1.2 Gb. To ensure the probe position evenly distributed in the chicken genome, the whole genome was distributed by windows with 22 Kb length. The backward window started from the probe position of the forward probe position. The selection of the final array was performed on each chromosome separately. The first four groups SNPs were divided as 2 priorities. The SNPs in group 2, group 3, group 4, and the INDELs in group 1 were defined as priority 1, and the SNPs in group 1 were defined as priority 2.

1.
a) The selection of the SNPs in priority 1. If there is no SNP in a 22 kb window, the window will be reserved. b) If there are one or two SNPs in the window, the SNP(s) was reserved. c) If there are 3 or more SNPs in a window, only 2 SNPs in this window will be reserved, which can make the SNPs even distributed in this window according to the following formula. SD²= \( \frac{{\left(\mathrm{S}-\overline{\mathrm{x}}\right)}^2+{\left({N}_i-\overline{\mathrm{x}}\right)}^2+{\left({\mathrm{N}}_{\mathrm{j}}-\overline{\mathrm{x}}\right)}^2+{\left(E-\overline{\mathrm{x}}\right)}^2}{4} \). In the formula above, the S and E are the start position and the end position of the window respectively; and N_i and N_j are the target SNPs position in the window. The SNPs N_i and N_j which can minimum the SD², will be reserved.
2.
The selection of priority 2 SNPs. The windows reserved 1 or 2 SNPs will be skipped. The windows without SNP will be filled by one SNP of priority 2 according to the formula described above.
3.
The windows without any SNP will be filled by 1 SNP from the NCBI SNPdb of chicken, while the validated SNPs will have a priority for filling.

The final array contains 55K probes for 52 K SNPs, which were manufactured by Affymetrix® using photolithography. The redundant probes are used for interrogating each SNPs [32, 33]. The final 52 K SNPs were annotated by the online tool Ensembl VEP [34].

The comparisons of the 55K Affy array with the existing arrays (Affy 600 K array, and Illumina 60 K)

All the SNPs’ positions were transformed from WASHUC2.1 (Illumina 60 K), Gallus_gallus-4.0 (Affy 600 K) and Gallus_gallus-5.0 (Affy 55 K) to GRCg6a by the LiftOver tool on UCSC Genome Browser. All the SNP positions of the three genoty** arrays were compared. The SNPs on 600 K array and 60 K array were also performed by Ensembl VEP [31]. Overlap** Venn plot was performed by the Calculate and draw custom Venn diagrams website (http://bioinformatics.psb.ugent.be/webtools/Venn/).

Validation of the 55K array in 13 chicken breeds/lines

The genomic DNA from 12 breeds/lines (Chahua, Dagu, Liyang, Luhua, Qingyuan, Silkie, Wenchang, Bai’er, and ** was done on Axiom® arrays using the Affymetrix® GeneTitan® system according to the procedure described by Affymetrix (https://assets.thermofisher.com/TFS-Assets/LSG/manuals/702899_PI.pdf) in the Bei**g Compass Biotechnology Co., Ltd. (Bei**g, China).

Basic genotype statistics for each marker, including call rate, MAF, Hardy-Weinberg Equilibrium (HWE), allele and genotype counts were calculated using the Quality Assurance Module from the SNP Variation Suite version 7 (SVS; Golden Helix Inc., Bozeman, Montana: www.goldenhelix.com). The following quality control criteria (filtering) were used to remove SNPs with less than 95% call rate for further analysis. The SNPs with less than 0.05 MAF. SNPs were tested for HWE (P < 0.001) to identify possible ty** error. Samples with more than 10% missing genotypes were removed from the study.

The MDS was performed using the genotype data of the SNPs from the 55K panel on all the breeds samples (n = 226) to assess the utility of the panel in detecting population structure. Population structure between 12 breeds was carried out using PLINK software (version 1.90b3) [28] with the MDS method on, and the plot was performed by ggplot2 [35]. The linkage disequilibrium in 2 populations were performed by the GAPIT [36]. The LD decay plot performed by PopLDdecay software are presented as whole genome levels and as chromosome levels with the parameter of smaller break point size of 5 Kb and bigger break point size of 40 Kb [37].

Abbreviations

CAAS:: Chinese Academy of Agricultural Sciences
Chr:: Chromosome
Da:: inter-population net nucleotide divergence
GLM:: general liner model
GWAS:: Genome-Wide Association Study
HWE:: Hardy-Weinberg Equilibrium
INDEL:: insert/deletion.
LD:: Linkage Disequilibrium
MAF:: Minor Allele Frequency
MDS:: Multidimensional Scaling
QTL:: Quantitative Traits Loci
RFI:: Residual Feed Intake
SNP:: Single Nucleotide Polymorphism
UTRs:: Untranslated Regions
VEP:: Variant Effect Predictor

References

Resources CNCoAG: animal genetic resources in China: poultry: China agriculture press; 2011.
Ramos AM, Crooijmans RP, Affara NA, Amaral AJ, Archibald AL, Beever JE, Bendixen C, Churcher C, Clark R, Dehais P. Design of a high density SNP genoty** assay in the pig using SNPs identified and characterized by next generation sequencing technology. PLoS One. 2009;4(8):e6524.
Article PubMed PubMed Central CAS Google Scholar
Matukumalli LK, Lawley CT, Schnabel RD, Taylor JF, Allan MF, Heaton MP, O'Connell J, Moore SS, Smith TP, Sonstegard TS. Development and characterization of a high density SNP genoty** assay for cattle. PLoS One. 2009;4(4):e5350.
Article PubMed PubMed Central CAS Google Scholar
Dash S, Singh A, Bhatia A, Jayakumar S, Sharma A, Singh S, Ganguly I, Dixit S. Evaluation of bovine high-density SNP genoty** Array in indigenous dairy cattle breeds. Anim Biotechnol. 2017:1–7.
Anderson R. Development of a high density (600K) Illumina ovine SNP Chip and its use to fine map the yellow fat locus. Plant & Animal Genome. 2014.
Houston RD, Taggart JB, Cézard T, Bekaert M, Lowe NR, Downing A, Talbot R, Bishop SC, Archibald AL, Bron JE. Development and validation of a high density SNP genoty** array for Atlantic salmon ( Salmo salar ). BMC Genomics,15,1(2014-02-14). 2014;15(1):90.
Article PubMed PubMed Central CAS Google Scholar
Iamartino D, Nicolazzi EL, Van Tassell CP, Reecy JM, Fritz-Waters ER, Koltes JE, Biffani S, Sonstegard TS, Schroeder SG, Ajmone-Marsan P. Design and validation of a 90K SNP genoty** assay for the water buffalo (Bubalus bubalis). PLoS One. 2017;12(10):e0185220.
Article PubMed PubMed Central CAS Google Scholar
Muir WM, Wong GK, Zhang Y, Wang J, Groenen MAM, Crooijmans RPMA, Megens HJ, Zhang HM, Mckay JC, Mcleod S. Review of the initial validation and characterization of a 3K chicken SNP array. Worlds Poultry Science Journal. 2008;64(2):219–26.
Article Google Scholar
Groenen MA, Megens HJ, Zare Y, Warren WC, Hillier LW, Crooijmans RP, Vereijken A, Okimoto R, Muir WM, Cheng HH. The development and characterization of a 60K SNP chip for chicken. BMC Genomics. 2011;12(1):274.
Article PubMed PubMed Central Google Scholar
Kranis A, Gheyas AA, Boschiero C, Turner F, Le Y, Smith S, Talbot R, Pirani A, Brew F, Kaiser P. Development of a high density 600K SNP genoty** array for chicken. BMC Genomics. 2013;14(1):59.
Article CAS PubMed PubMed Central Google Scholar
Derks MFL, Megens HJ, Bosse M, Visscher J, Peeters K, Bink MCAM, Vereijken A, Gross C, Ridder DD, Reinders MJT. A survey of functional genomic variation in domesticated chickens. Genet Sel Evol. 2018;50(1):17.
Article PubMed PubMed Central CAS Google Scholar
Liu R, Wang H, Jie L, Jie W, Zheng M, Tan X, **ng S, Cui H, Li Q, Zhao G. Uncovering the embryonic development-related proteome and metabolome signatures in breast muscle and intramuscular fat of fast-and slow-growing chickens. BMC Genomics. 2017;18(1):816.
Article PubMed PubMed Central CAS Google Scholar
Huang HY, Zhao GP, Liu RR, Li SF, Zhao ZH, Li QH, Zheng MQ, Wen J: Expression profiles of novel genes and microRNAs involved in lipid deposition in chicken’s adipocyte. 2017:1–6.
Li P, Fan W, Everaert N, Liu R, Li Q, Zheng M, Cui H, Zhao G, Wen J. Messenger RNA sequencing and pathway analysis provide novel insights into the susceptibility to Salmonella enteritidis infection in chickens. Front Genet. 2018;9:256.
Article PubMed PubMed Central CAS Google Scholar
Fan W, Liu H, Zheng M, Liu R, Li Q, Wen J, Zhao G. Association of BMP15 gene polymorphisms with egg laying in Wuxing-yellow chicken. Chinese Journal of Animal Science. 2015;51(11):13–8.
CAS Google Scholar
Liu J, Liu R, Wang J, Zhang Y, **ng S, Zheng M, Cui H, Li Q, Li P, Cui X, et al. Exploring genomic variants related to residual feed intake in local and commercial chickens by whole genomic resequencing. Genes (Basel). 2018;9(2).
Qi KK, Chen JL, Zhao GP, Zheng MQ, † JW: Effect of dietary ω6/ω3 on growth performance, carcass traits, meat quality and fatty acid profiles of Bei**g-you chicken. Journal of Animal Physiology & Animal Nutrition 2010, 94(4):474–485.
Merat P. The sex-linked dwarf gene in the broiler chicken industry. Worlds Poultry Science Journal. 1984;40(1):10–8.
Article Google Scholar
Aggrey SE, Karnuah AB, Sebastian B, Anthony NB. Genetic properties of feed efficiency parameters in meat-type chickens. Genet Sel Evol. 2010;42(1):1–5.
Article CAS Google Scholar
Wen-bin SJ-t BAO, Cun-bo WANG, Hong-xia ZHANG, Weigend S, Guo-hong CHEN. Investigation on genetic diversity and systematic Evolut ion in Chinese domestic fowls and red jungle fowls by analyzing the mtDNA control region. Acta veterinaria et zootechnica Sinica. 2008;39(11):1449–59.
Google Scholar
Fu W, Dekkers JC, Lee WR, Abasht B. Linkage disequilibrium in crossbred and pure line chickens. Genet Sel Evol. 2015;47(1):11.
Article PubMed PubMed Central Google Scholar
Patel RK, Jain M. NGS QC toolkit: a toolkit for quality control of next generation sequencing data. PLoS One. 2012;7(2):e30619.
Article CAS PubMed PubMed Central Google Scholar
Li H, Durbin R. Fast and accurate short read alignment with burrows-wheeler transform. Bioinformatics. 2009;25(14):1754–60.
Article CAS PubMed PubMed Central Google Scholar
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R. The sequence alignment/map (SAM) format and SAMtools. Transplant Proc. 2009;25(1 Pt 2):1653–4.
Google Scholar
Koboldt DC, Chen K, Wylie T, Larson DE, Mclellan MD, Mardis ER, Weinstock GM, Wilson RK, Li D. VarScan: variant detection in massively parallel sequencing of individual and pooled samples. Bioinformatics. 2009;25(17):2283.
Article CAS PubMed PubMed Central Google Scholar
Petr D, Adam A, Goncalo A, Albers CA, Eric B, Depristo MA, Handsaker RE, Gerton L, Marth GT, Sherry ST. The variant call format and VCFtools. Bioinformatics. 2011;27(15):2156–8.
Article CAS Google Scholar
Sun Y, Zhao G, Liu R, Zheng M, Hu Y, Wu D, Zhang L, Li P, Wen J. The identification of 14 new genes for meat quality traits in chicken using a genome-wide association study. BMC Genomics. 2013;14(1):458.
Article CAS PubMed PubMed Central Google Scholar
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, Maller J, Sklar P, Bakker PIWD, Daly MJ. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75.
Article CAS PubMed PubMed Central Google Scholar
Liu J, Fu R, Liu R, Zhao G, Zheng M, Cui H, Li Q, Song J, Wang J, Wen J. Protein profiles for muscle development and intramuscular fat accumulation at different post-hatching ages in chickens. PLoS One. 2016;11(8):e0159722.
Article PubMed PubMed Central CAS Google Scholar
Cui HX, Liu RR, Zhao GP, Zheng MQ, Chen JL, Wen J. Identification of differentially expressed genes and pathways for intramuscular fat deposition in pectoralis major tissues of fast-and slow-growing chickens. BMC Genomics. 2012;13(1):213.
Article CAS PubMed PubMed Central Google Scholar
Mclaren W, Gil L, Hunt SE, Riat HS, Ritchie GRS, Thormann A, Flicek P, Cunningham F. The Ensembl variant effect predictor. Genome Biol. 2016;17(1):122.
Article PubMed PubMed Central CAS Google Scholar
Gunderson K, Steemers F, G, Mendoza L, Chee M: A genome-wide scalable SNP genoty** assay using microarray technology. Nat Genet 2005, 37(5):549–554.
Syv, Auml AC, nen. Toward genome-wide SNP genoty**. Nat Genet. 2005;37 Suppl(37 Suppl:S5.
Article CAS Google Scholar
Zerbino DR, Achuthan P, Akanni W, Amode MR, Barrell D, Bhai J, Billis K, Cummins C, Gall A, Giron CG, et al. Ensembl 2018. Nucleic Acids Res. 2018;46(D1):D754–61.
Article CAS PubMed Google Scholar
Wickham H. ggplot2: Elegant Graphics for Data Analysis: Springer Publishing Company, Incorporated; 2009.
Google Scholar
Alexander EL, Feng T, Qishan W, Jason P, Meng L, Peter JB, Michael AG, Edward SB, Zhiwu Z. GAPIT: genome association and prediction integrated tool. Bioinformatics. 2012;28(18):2397.
Article CAS Google Scholar
Zhang C, Dong SS, Xu JY, He WM, Yang TL. PopLDdecay: a fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics. .
Wang Y, Song F, Zhu J, Zhang S, Yang Y, Chen T, Tang B, Dong L, Nan D, Qian Z. GSA: genome sequence archive. Genomics Proteomics & Bioinformatics. 2017;15(1):14–8.
Article Google Scholar
Members BDC. Database resources of the BIG data center in 2018. Nucleic Acids Res. 2018;46(Database issue):D14–20.
Google Scholar

Download references

Acknowledgments

We would like to thank the Institute of Poultry Sciences of Chinese Academy of Agriculture Sciences, Jiangsu Lihua Co. Ltd. and **nguang Nongmu Co. Ltd. for the provision of chicken samples. We would like to thank Prof. Jianfeng Liu (China Agricultural University) and Dr. Huamiao Liu (Institute of Special Animal and Plant Sciences of CAAS) for their help on the SNP selection work, and Prof. Martien AM Groenen (Wageningen University & Research) for his good comments and suggestions on the manuscript.

Funding

The design of the study and collection of data were supported by the National Key Technology R&D Program (2015BAD03B03); the collection, analysis and interpretation of data were supported by the earmarked fund for the modern agro-industry technology research system (CARS-41) and Agricultural Science and Technology Innovation Program (ASTIP-IAS04; ASTIP-IAS-TS-15); the interpretation of data and writing the manuscript were supported by the National Nonprofit Institute Research Grant (2017ywf-zd-2).

Availability of data and materials

The whole genome sequencing clean data reported in this paper have been deposited in the Genome Sequence Archive [38] in BIG Data Center [39] under accession number CRA001289 which can be publicly accessed at http://bigd.big.ac.cn/gsa.

Author information

Ranran Liu and Siyuan ** Zhao & Jie Wen
Animal Breeding and Genomics, Wageningen University & Research, Wageningen, The Netherlands
Siyuan ** Zhao & Jie Wen
Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Bei**g, 100193, People’s Republic of China
Gui** Zhao & Jie Wen

Authors

Ranran Liu
View author publications
You can also search for this author in PubMed Google Scholar
Siyuan **ng
View author publications
You can also search for this author in PubMed Google Scholar
Jie Wang
View author publications
You can also search for this author in PubMed Google Scholar
Maiqing Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Huanxian Cui
View author publications
You can also search for this author in PubMed Google Scholar
Richard P. M. A. Crooijmans
View author publications
You can also search for this author in PubMed Google Scholar
Qinghe Li
View author publications
You can also search for this author in PubMed Google Scholar
Gui** Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Jie Wen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

RRL contributed to the design and performing of the study, the analysis and interpretation of data and writing of the manuscript. SX contributed to the data analysis, SNPs selection, and manuscript writing. JW contributed to the data analysis. RC contributed to the manuscript writing. MQZ, HXC, and QHL contributed to the sample and data collection. GPZ and JW contributed to the design of the study and interpretation of data. All authors submitted comments on the draft, read, and approved the final manuscript.

Corresponding authors

Correspondence to Gui** Zhao or Jie Wen.

Ethics declarations

Ethics approval and consent to participate

All experimental procedures with chickens were performed according to the Guidelines for Experimental Animals established by the Ministry of Science and Technology (Bei**g, China). Ethical approval on animal survival was given by the animal ethics committee of the Institute of Animal Sciences (IAS), Chinese Academy of Agricultural Sciences (CAAS, Bei**g, China) with the following reference number: IASCAAS-AE-03.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

The summary of each sequencing pools. The raw reads number, clean reads number, sequencing depth, Q30 percentage, and coverage et al. for each sequencing library were provided. (XLSX 14 kb)

Additional file 2:

The second group of SNPs which related to the economic traits. The locus and the p-value of SNPs which related to 15 economic traits were provided. (XLSX 998 kb)

Additional file 3:

The third group of SNPs which related to 861 candidate genes. The information of 861 candidate genes and 118.4 K SNPs selected were provided. (XLSX 6645 kb)

Additional file 4:

The third group of SNPs which related to serum IgY. The loci and allele information of 0.8 K SNPs related to serum IgY were provided. (XLSX 32 kb)

Additional file 5:

The loci information for the 55K array. The loci, allele information, SNPs frequencies in each breed/line, and overlap information were provided. (XLSX 7761 kb)

Additional file 6:

The LD decay in whole genome level in Cobb population. (JPEG 653 kb)

Additional file 7:

The LD decay in whole genome level in **gxing-Huang population. (JPEG 578 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Liu, R., ** array. BMC Genomics 20, 410 (2019). https://doi.org/10.1186/s12864-019-5736-8

Download citation

Received: 21 July 2018
Accepted: 25 April 2019
Published: 22 May 2019
DOI: https://doi.org/10.1186/s12864-019-5736-8

A new chicken 55K SNP genoty** array

Abstract

Background

Results

Conclusions

Similar content being viewed by others

Background

Results

Genome re-sequencing of chickens supplying the first SNP group

Selection of the second group of candidate SNPs based on the GWAS of 15 traits

Selection of the third group of candidate SNPs based on the genes associated with economic traits

The comparisons of the Affy 55K array with the existing chicken arrays (Affy 600 K array, and Illumina 60 K)

Validation of the 55K array in 13 chicken breeds/lines

Conclusions

Methods

Animals

Whole genome re-sequencing

Genome sequence alignment and detection of the first group of candidate SNPs

Selection of the second group of candidate SNPs based on GWAS analysis of 15 traits

Selection of the third group of candidate SNP based on the associated genes for target traits

Selection of the fourth group of candidate SNPs for RFI

Selection of the SNPs from chicken SNPs database

SNP screening according to the scoring of probes

SNPs selection procedure for the final 55K array

The comparisons of the 55K Affy array with the existing arrays (Affy 600 K array, and Illumina 60 K)

Validation of the 55K array in 13 chicken breeds/lines

Abbreviations

References

Acknowledgments

Funding

Availability of data and materials

Author information

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Publisher’s Note

Additional files

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation