Introduction

Natural killer (NK) cells, associated with the innate immune response, are considered as the first line of defense against both infected and malignantly transformed cells1. As a kind of bone marrow-derived lymphocyte, NK cells use specific cell surface receptors to distinguish healthy cells and diseased cells2. Like T cells, NK cells possess the qualified characteristics of the adaptive immune system, including the production of memory cells that persist following antigen invasion and the ability to create a secondary recall response3. Whether NK cells produce activation or inhibitory function depends on the varieties of their surface receptors. NK cells express three main families of receptors: (i) the highly polymorphic killer cell immunoglobulin-like receptors (KIRs) which specifically interact with classical MHC Class I molecules, (ii) the non-polymorphic C-type lectin (CD94/NKGs) receptors which bind to the non-classical MHC molecule HLA-E, and (iii) the immunoglobulin-like transcripts4. Killer cell immunoglobulin-like receptors (KIRs), defined as specific cell surface receptors, are a group of glycoproteins expressed on the surface of both NK cells and a few subsets of T cells. Upon the interaction with polymorphic human leukocyte antigen (HLA) class I molecules on the surface of target cells and other ligands, the KIRs participate in various immune responses to different infectious agents. Because of the high diversity of KIR genes, it is reasonable to hypothesize that the polymorphism of the KIRs in combination with HLA genes might affect predisposition to autoimmune disease. Recent genetic experiments demonstrated the associations between KIR and HLA genes with susceptibility to autoimmune diseases including Systemic Lupus Erythematosus (SLE), rheumatoid arthritis, systemic sclerosis and multiple sclerosis5,6,7,8,9,10. NK cells derived from human pulmonary artery hypertension (PAH) patients exhibited a lower level of the KIRs 2DL1/S1 and 3DL1 expression as well as a great disruption of 3DL1 receptor associated cytolytic function, suggesting a novel and substantive role for KIRs in the occurrence and development of immune associated vascular disease11.

The KIRs are encoded by a family of highly polymorphic genes clustered within the leukocyte receptor complex region on human chromosome 19q13.4. All of the KIR genes are encoded within a range of 160 kb genomic sequence, and they cluster together with a genetic distance shorter than 3 kb. As well, the KIR clusters have shown a good deal of gene duplications and unequal crossing over, which lead to a wide range of KIR gene combinations. KIR genes contain a tandem array of highly homologous genes, the number and type of which have the ability to show high diversity in different NK clone cells and individuals12. KIR gene nomenclature as defined by the World Health Organization (WHO) subcommittee is based on the structure of the encoded molecules. Accordingly, the number of extracellular immunoglobulin domain (D) could be double (2D) or triple (3D) and therefore the length of the intracytoplasmic tail would be short (S) or long (L). In terms of the order of KIR genes and the gene contents of 15 loci, KIR genes are divided into two haplotypes, A and B. The A haplotype contains at least six encoding inhibitory receptors (KIR3DL3, 2DL3, 2DL1, 2DL4, 3DL1 and 3DL2), one pseudogene (KIR3DP1) and one activating receptor gene (KIR2DS4)13. The haplotype B consists of a great variety of subtypes that differ from each other in the combination of stimulatory receptors (KIR2DS1, 2DS2, 2DS3, 2DS5, 3DS1, 2DL2 and 2DL5)14. On condition that the presence of KIR2DS1, 2DS2, 2DS3, 2DS5, 3DS1, 2DL2 and 2DL5 are observed, the genotype would be determined as including B. If all of the above-mentioned genes are absent, the genotype would be defined as AA. When one or more of KIR genes belonging to A group are absent, the genotype would be taken as BB. Otherwise, all the remaining genotypes are defined to be AB. Seven KIR genes belonging to B group are located centromeric or telomeric half in B cluster, B group is classified into 2 half groups: C4 half (KIR2DS2, 2DL2, 2DS3 and 2DL5) and T4 half (KIR3DS1, 2DL5, 2DS1 and 2DS5). Herein, the B group is divided into 4 subgroups: C4T4, C4Tx, CxT4, and CxTx. Seven KIRs (KIR2DL1, 2DL2, 2DL3, 2DL5, 3DL1, 3DL2 and 3DL3) have inhibitory functions while five KIRs (KIR2DS1, 2DS2, 2DS3, 2DS5 and 3DS1) exhibit active functions. KIR2DL4 has both inhibitory and active functions12. Among all KIR genes, four genes KIR2DL4, KIR3DL2, KIR3DL3 and KIR3DP1 are described as framework genes, and they are present in nearly all individuals15.

In the previous research, studies have shown the KIR gene diversity in different geographical populations16,17,18,19,20,21,22,23,24,25,

Methods

Study samples

Blood samples were collected from a total of 123 randomly selected healthy donors of Han ancestry living in Nan**g, Jiangsu province of China, membership of Eastern China. All participants provided their written informed consent and completed a basic health screen. Also, each participant was interviewed to ensure that no individuals have common ancestry going back at least three generations and their three generations are native of eastern coastal area of China. The whole-blood samples anti-coagulated with ethylene diamine tetraacetic acid (EDTA) were frozen at −80 °C until use. The study was conducted in accordance with the human and ethical research principles of Nan**g Medical University and approved by the ethics committee of Nan**g Medical University.

DNA isolation

According to the manufacturer’s instructions, we extracted genomic DNA from 300 μl peripheral blood containing ethylene diamine tetraacetic acid (EDTA), using TIANamp Genomic DNA Kit (TIANGEN, Bei**g, China). The quality and quantity of extracted DNA samples were determined by NanoDrop 2000 (Thermo Fisher Scientific, Waltham, USA). The optimal density values used to evaluate the concentration and purity of extracted DNA reflected by the A260/280 values (1.7 to 1.9). The concentration was adjusted to 20–40 ng/μl.

KIR PCR-SSP genoty**

KIR genes were genotyped for the presence or absence of the 17 KIR genes, including KIR2DL1, 2DL2, 2DL3, 2DL4, 2DL5, 3DL1, 3DL2, 3DL3, 2DS1, 2DS2, 2DS3, 2DS4, 2DS5, 3DS1 (in the full-length form), 1D (in the deleted form) and two pseudogenes, 3DP1(putative protein product) and 2DP1(no protein expression) using PCR-SSP method with a commercially available KIR GENOTYPING SSP KIT (Invitrogen, California, USA). The PCR amplification was performed with the PCR system in a reaction mixture volume of 10 μl consisting of 4 μl pre-aliquoted PCR buffer, 0.06 μl Taq polymerase, and 30 to 50 ng of template DNA. Temperature cycling conditions for PCR reactions were as follows: denaturation for 1 minute at 95 °C, followed by 30 cycles for 20 seconds at 94 °C, 20 seconds at 63 °C, 1.5 minutes at 72 °C, a elongation step for 10 minutes at 72 °C and finally hold in 4 °C. PCR products were visualized under ultraviolet light after electrophoresis in 1.5% agarose gel well mixed with 10% v/v ethidium bromide. Negative controls were performed for each gene while positive internal controls were performed for each lane. False reactions that yielded no internal control bands were repeated.

Statistical analysis

The observed carrier frequencies (OFs) of the KIR genes were determined by the number of positive ty** reactions. Based on the assumption of Hardy-Weinberg equilibrium, we calculated the estimated gene frequencies (GFs) using the formula, GF = 1−(1−OF)1/2. The GF is determined by the OF of the KIR gene in all individuals. Package “pheatmap” (https://cran.r-project.org/web/packages/pheatmap/index.html) based on statistical software R version 3.2.5 (https://www.r-project.org/) was used to draw a Heatmap containing Eastern Chinese Han and 58 other populations with complete KIR genoty** files of 16 KIR genes exclusive of KIR1D which are Jilin Han27, Shaanxi Han18, Shenzhen Han28, Sichuan Han29, ** profiles observed on Killer cell immunoglobulin-like receptors (KIRs) in the Eastern Chinese Han(n = 123).

Figure 1
figure 1

The classification and relative proportions of haplotypes, genotypes and linkage groups based on the genoty** profiles observed on 16 Killer cell immunoglobulin-like receptors (KIRs) in the Eastern Han population (n = 123).

Linkage disequilibrium (LD) analysis

Since KIR genes are close to each other by approximately 3 kb genomic DNA region, linkage disequilibrium should be taken into consideration, especially for those two nearest-neighbor KIRs (present or absent simultaneously). Therefore, we conducted the LD analysis, in which D was the parameter representing the test statistics linkage disequilibrium coefficient. P-value < 0.05 signified a strong genetic relation between two KIRs. Based on the genoty** profiles of 123 Eastern Chinese Hans, Table 3 and Fig. 2 listed the D coefficients and conspicuous differences of 45 pairs among 10 KIR genes except for 7 KIR genes which appeared in all individuals. The Linkage Disequilibrium comparisons were listed in an inverted triangle in Fig. 2.

Table 3 Linkage disequilibrium coefficients of 45 KIR comparison pairs in Eastern Chinese Han.
Figure 2: The LD analysis among 10 KIR genes.
figure 2

As shown by the red arrow, the comparisons of the locus KIR2DS3 and the rest 9 loci were arranged in left bottom (9 square grids). The other comparisons were listed as well. The red area encircled by thick black line represented strong linkage relationship.

The results of LD analysis showed 35 strong relations in total, among which 6 were negative, all associated with KIR1D (1D-3DL1, 1D-2DS4, 1D-2DL5, 1D-3DS1, 1D-2DS1, and 1D-2DS3). The rest 29 relations were significantly positive on different levels. In terms of group classification, 1 relation was from A group (3DL1-2DS4) while 10 pairs linked A and B groups (3DL1-2DL5, 3DL1-3DS1, 3DL1-2DS1, 3DL1-2DS5, 3DL1-2DS3, 2DS4-2DL5, 2DS4-3DS1, 2DS4-2DS1, 2DS4-2DS5, and 2DS4-2DS3) and 18 linkage relations existed within B group. After further demarcation, 6 linkage relations (2DL5-2DL2, 2DS2-2DL2, 2DS3-2DL2, 2DS2-2DL5, 2DS3-2DL5, and 2DS2-2DS3) and 5 (3DS1-2DL5, 2DS1-2DL5, 2DS5-2DL5, 3DS1-2DS5, and 2DS1-2DS5) were found within C4 and T4 subgroups from B family respectively, consistent to the genomic location of the genes. As for 1D, a known 2DS4 variant, we illustrated a very close linkage relationship between 1D and KIR2DS4 which showed the linkage coefficient was −0.0245 (p < 0.001). Thus, the results supported that KIR1D is the variant of KIR2DS447. These findings are consistent to other published studies48,49,50,51.

Evolution significance of KIR cluster and Phylogenetic restructure

In order to investigate the potential role of KIRs in population genetics, we compared the KIR frequencies of Eastern Han and 58 other populations and performed population differentiation analysis. Because KIR1D was excluded from the database, we utilized the rest 16 genes for the Heatmap analysis. The 59 selected populations are from East Europe, West Europe, Africa, North America, Latin America, Asia and Pacific Ocean, which are almost on behalf of human populations all over the earth.

In Fig. 3, the deeper color represented higher observed frequency. We found all KIR genes are present in Eastern Han population. However, KIR3DS1 was not detected in African populations such as Bantu NE, Bantu S, Mbuti Pygmies and San groups. Similarly, 2DS3 was missing in some populations from East Asia (Hezhen, Lahu, Mongola, and Tujia) and America (Karitiana, Pima, Surui, and Colombian). With the knowledge that EDAR variant emerged with ameliorative phenotypes in East Asian, especially Central Chinese by natural selection52, we speculated that KIR3DS1 was beginning to surface and KIR2DS3 disappeared gradually due to evolutionary pressure such as microbial infection, communicable disease or natural environment on the route of “out of Africa”. However, the potential causes remained to be discovered. The Hierarchical Clustering method was employed to reconstruct the cluster trees and molecular evolutionary structures based on Euclidean distance. The populations and KIRs were listed corresponding to the population genetic and homologue analysis, respectively (Fig. 3). Firstly, we found that molecular evolutionary structure was divided into two clusters. The left was composed of KIR genes defined as B group, while the right consisted of A group and KIR2DP1. The results were consistent with the deduced classification clones from original research14. Interestingly, KIR2DS1 clustered with KIR3DS1, KIR2DL2 and 2DS2 were from the same branch, matching up with the classification of linkage subgroups (C4, T4). Secondly, from the phylogenetic structure, two main clusters existed according to the relative OFs of B group loci. The lower cluster differed from the top by higher OFs for most B haplotype KIRs (especially the lowest 5 populations from Oceania and Africa including Melanesian, Papuan, Mbuti Pygmies, Biaka Pygmies and San). The studied Eastern Han labelled with red arrow resided in the top cluster. Obviously, Han populations from HGDP-CEPH database including ** files of 16 KIR genes from a mathematics angle.

Figure 3: A Heatmap illustrated containing Eastern Han and 58 other populations distributed worldwide, as well as 16 KIR loci in molecular evolutionary structure.
figure 3

The deeper color indicated higher observed carrier frequency from 0 to 1.

Phylogenetic structure and MDS analysis

In the validation study, we constructed another Neighbor-Joining tree (Fig. 4) by utilizing the observed frequency data of the Eastern Han, 52 populations from HGDP-CEPH, and reported 9 Han groups at 13 KIR loci (KIR2DL1, 2DL2, 2DL3, 2DL4, 2DL5, 3DL1, 3DL2, 3DL3, 2DS1, 2DS2, 2DS3, 2DS5 and 3DS1) with the purpose of revealing genetic relationships among these 62 populations. The results of interior branch test were shown in Figure S1 with the sum of branch length = 0.41784153. The values of DA and significant level were listed in Supplementary Table S1. Without correction, the p-value greater than 0.05 appeared 330 times and the rest 1,561 comparisons were found to be significantly different. The number of significances ranged from 13 in Pacific Melanesian to 60 in Bedouin, Balochi, Brahui, ** files, phylogenetic construction, evolutionary molecular analysis and sub-Han group comparison provided valuable resources for both enriching information pool of forensic and population genetics and benefiting in uncovering underlying genetic mechanisms underlying immune diseases in the future. The comprehensive and comparative analysis on KIRs revealed a unique genetic background of Eastern Chinese Han through phylogenetic construction, evolutionary molecular analysis, and sub-Han group comparison which provided valuable resources for both enriching information pool of forensic and population genetics.

Additional Information

How to cite this article: Yin, C. et al. Genetic polymorphism and evolutionary differentiation of Eastern Chinese Han: a comprehensive and comparative analysis on KIRs. Sci. Rep. 7, 42486; doi: 10.1038/srep42486 (2017).

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.