Background

Polycystic ovary syndrome (PCOS) is a prevalent endocrine and metabolic disorder in women of childbearing age, with a prevalence of 7.8% [1]. PCOS is characterized by a series of reproductive abnormalities, including of oligo-/ano-ovulation, hyperandrogenism (HA) and polycystic ovarian morphology (PCOM) [2]. As the key feature of PCOS, HA results from abnormal gonadotropin releasing hormone stimulating ovarian theca cell to produce androgen. The hormone environment impedes follicular maturation, resulting in a considerable amount of small antral follicles and ovulatory dysregulation [3, 4].

Androgen receptor (AR) is one of PCOS candidate genes [5, 6]. AR belongs to a nuclear receptor superfamily of transcription factors and locates on Xq11-12 [7]. It has three major domains: an N-terminal transactivation domain, a DNA-binding domain and a C-terminal hormone-binding domain [8]. The N-terminal transactivation domain is encoded by the exon one of AR gene and is composed of a polyglutamine tract, which is encoded by a variable length of CAG repeat polymorphism. Due to the two alleles of X chromosome in females, one allele with relatively longer CAG repeat length is termed as “long allele” and another allele is termed as “short allele”. “Biallelic average” is equal to the average length of “short allele” and “long allele”. In recent years, several studies concentrated on the relationship between CAG repeat polymorphism and PCOS [9,10,11].

Simultaneously, AR is highly expressed in granulosa cells of pre- and early antral follicles and decreases during follicular maturation, indicating that AR plays an important role in follicular development [12,13,14]. The new international PCOS guideline recommends antral follicle count (AFC) ≥ 20 as a new standard of PCOM [15]. Based on the new cutoff of PCOM, our previous study divided PCOS into low AFC (L-AFC) group and high AFC (H-AFC) group and reported that the AR expression decreased significantly in PCOS, especially for the H-AFC group [16].

However, the relationship between AR CAG polymorphism and PCOM based on the new PCOS guideline has not been investigated yet. Hence, we conducted this study to investigate the relationship between PCOM and CAG repeats length.

Methods

Study population

A total of 170 participants with PCOS cases and controls aged 18–45 years were recruited from the reproductive center of General Hospital of Ningxia Medical University. All available details (age, weight and height) were recorded. PCOS cases were included according to the Rotterdam Revised 2003 diagnosis criteria [17]: oligo-/ano-ovulatory, clinical or biochemical hyperandrogenism (HA) and PCOM diagnosed by transvaginal ultrasound. The diagnosis can be defined when two of three are fulfilled. Clinical HA included hirsutism, acne and so on. Individuals with total testosterone concentration above 48.1 ng/dL will be defined with biochemical HA. The exclusion criteria consisted of androgen tumor, congenital adrenal hyperplasia, Cushing’s syndrome, thyroid related disease and so on. And the controls were who visited the clinic due to oviduct dysfunction or male infertility and all of them had normal menstrual cycle and ovarian morphology.

AR CAG length measurement and analysis

DNA was isolated from peripheral blood by TIANamp Genomic DNA Kit (TIANGEN, China) protocol and quantified by spectrophotometry. Primers of AR were constructed in Shanghai Generay Biotechnology: fluorescent-labeled forward primer FAM-5’-TCCAGAATCTGTTCCAGAGCGTGC-3’ and reverse primer 5’-GCTGTGAAGGTTGCTGTTCCTCAT-3’. According to previous reported method [18], the genomic DNA was amplified by polymerase chain reaction (PCR) with AR primers. PCR products were sequenced by ABI 3730 DNA Sequencer (Applied Biosystems, USA) under standard conditions and analyzed by Peak Scanner software to determine the length genetic polymorphism.

As the AR gene is located on the X chromosome and two alleles exist in women, one allele with relatively longer CAG repeat length was termed as “long allele” and another allele was termed as “short allele”. We employed the conventional method to analyze the alleles: 1) the mean value of the two alleles (biallelic average), 2) the short allele alone, and 3) the long allele alone. The median values of CAG repeats length are 22.5 for biallelic average, 21 for short allele and 24 for long allele, which were used as the cutoff to divide CAG repeats length into lower and higher part for the frequency distribution analysis.

Clinical and biochemical measurement

For individuals with normal cycle, peripheral blood was collected on the day 3 of menstrual cycle; for women with oligo-/amenorrhea, peripheral blood was collected at any time. The serum follicle stimulating hormone, luteinizing hormone, estradiol and total testosterone were measured using chemiluminescence immunoassay and enzyme-linked immunosorbent assay in the clinical laboratory of Reproductive center of Ningxia Medical university. Transvaginal ultrasound was used to evaluate the AFC in the follicular stage.

Statistical analysis

Variables corresponding to normal distribution were compared with independent t-test between controls and PCOS. And the non-normal variables were analyzed with non-parametric Mann–Whitney U-test between controls and PCOS. In the subgroup analysis, one-factor analysis of variance test and least significance difference were used to compare different data. Chi-square distribution (χ2) test was used to compare the distribution frequency among different groups. Logistic regression model was constructed with the presence or absence of long allele < 24 and biallelic average < 22.5 as the independent variable and the presence or absence of PCOS/L-AFC/H-AFC as the dependent variable separately. All the statistical analysis were performed with the SPSS version 26. Statistical significance was defined as a two-side p value less than 0.05, and data was reported as mean ± SD and number (percent).

Results

The AR CAG repeats length in PCOS cases and controls

The baseline information of controls and PCOS cases were listed in Table 1.

Table 1 The clinical features and AR CAG frequency distribution of controls and PCOS cases

No difference was found in the mean CAG repeats length between PCOS cases and controls, whether using short allele, long allele or biallelic average of them (Fig. 1A).

Fig. 1
figure 1

A CAG repeats length of AR gene in controls and polycystic ovary syndrome (PCOS). B CAG allele distribution. Frequency distribution of CAG allele in exon one of AR gene in PCOS cases and controls for short allele, long allele and biallelic average. The blue band represented controls and the red band represented PCOS cases. C CAG repeats length of AR gene in controls, L-AFC and H-AFC PCOS cases. P represented p value given by the least significance difference. L-AFC, low AFC (AFC < 20); H-AFC, high AFC (AFC ≥ 20). D CAG repeats length in AR gene in control, NHA and HA PCOS cases. NHA, non-hyperandrogenism; HA, hyperandrogenism

Allele distribution was also similar between the two groups (Fig. 1B). AR CAG repeats length ranged from 14–33 among PCOS cases, and from 14–35 among controls. Frequency distribution analysis showed us no difference of frequency distribution between PCOS cases and controls (Table 1).

The AR gene CAG repeats length in low and high AFC group

To further investigate the relationship between PCOM and CAG repeat polymorphism, PCOS group was categorized into L-AFC and H-AFC group. The clinical manifestation of controls, L-AFC and H-AFC PCOS cases was presented in Table 2.

Table 2 The clinical features and AR CAG frequency distribution of controls, L-AFC and H-AFC PCOS cases

Examination of the mean CAG repeats length revealed that the AR CAG repeats length in the H-AFC group was lower than in the L-AFC group (Fig. 1C).

For the allele distribution, we found that most individuals in H-AFC group had lower CAG repeats lengths than those in L-AFC group (Table 2). Results showed that 58.82% of H-AFC group for long allele and 64.71% of H-AFC group for biallelic averages had lower CAG repeats lengths (less than 24 for long allele and 22.5 for biallelic average). Frequency comparison between L-AFC and H-AFC group showed us significant difference (p = 0.039 for long allele and p = 0.041 for biallelic averages).

The AR gene CAG repeats length in non-hyperandrogenism (NHA) PCOS and hyperandrogenism (HA) PCOS

As the AR gene CAG repeats length is correlated with the action of AR, we further divided the PCOS group into NHA and HA subgroup with clinical features listed in Supplemental Table 1. No difference of mean number for short allele, long allele and biallelic average was observed among controls, NHA-PCOS and HA-PCOS (Fig. 1D). In addition to that, there was also no difference on the frequency distribution in the PCOS subgroup (Supplemental Table 2).

Lower CAG repeat length contributes to the risk of H-AFC in PCOS

Given the significant difference of CAG allele distribution frequency between L-AFC and H-AFC, we constructed the binary logistic regression model to determine whether lower CAG repeat length contributes to the risk of PCOS or PCOM. The results showed us that there is no relationship between lower CAG repeats length and PCOS (Table 3).

Table 3 Predictive models of PCOS and subgroups with CAG biallelic average

Furthermore, we investigated whether CAG repeats length contributed to the risk of PCOS subgroup. The logistic regression analysis indicated that long allele < 24 was associated the incidence of H-AFC compared with L-AFC PCOS (Table 3). For PCOS women with CAG < 24, the risk to develop H-AFC was more than twice times larger than those with CAG ≥ 24. Additionally, the results showed that biallelic average < 22.5 contributed to the risk of PCOS with H-AFC compared with control, however, the significance disappeared after the adjustment of age and BMI. There was no relationship between CAG repeats length and the risk of PCOS with L-AFC, indicating that the CAG repeats length was primarily associated with the risk of develo** H-AFC in PCOS (Table 3).

Discussion

In the present study, the CAG length of our participants ranged from 14–35 in total, which is in the normal range [19]. According to our data, we found no significant difference in the mean CAG repeats length between PCOS cases and controls, which was consistent with previous studies [10, 19,20,21,22]. Apart from this, the frequency distribution of CAG repeats length was found no distinction between PCOS and controls. However, some studies found that PCOS exhibited a greater frequency of CAG repeats length longer than 22 repeats [9], while some studies presented that individuals with precocious pubarche had greater proportion of short allele less than 22 repeats [23]. For these inconsistent results, further studies involving a larger number of women are needed.

As for the AFC, we split PCOS group into L-AFC and H-AFC subgroup according to the new PCOS guideline. The H-AFC group had a lower CAG repeats length than the L-AFC group. Moreover, a significantly greater proportion of the H-AFC group had CAG repeats length less than 24 for long allele or less than 22.5 for biallelic averages. Logistic regression analysis suggested individuals with CAG < 24 for long allele were more likely to be affected by PCOM.

Disordered follicle development is regulated by the interaction of androgen and AR [13, 24, 25]. In our cohort, the total testosterone concentration in H-AFC group was lower than L-AFC group. It has been reported that the shorter CAG length in H-AFC can increase AR sensitivity to androgen [7, 26]. Therefore, lower CAG repeats length in H-AFC group contributes to increase transactivation of AR, resulting in the incidence of follicular arrest and an excessive number of small antral follicles.

Our previous study also investigated the relationship between AR and PCOM with cutoff of new guideline, which concentrated on the AR expression of granulosa cells and PCOM. Those results suggested decreased AR expression in PCOS group, especially in the H-AFC group [16]. The inconsistency of results may result from the different tissues, this means that peripheral blood was used in the present study, while granulocytes were used in the previous study. Despite differences, the commonality of them suggested that AR is different between H-AFC and L-AFC group in PCOS, no matter from its expression or its CAG polymorphism.

The study gave insight into the association between AR CAG polymorphism and PCOM diagnosed based on the new international guideline, demonstrating that CAG polymorphism had an influence on the risk of H-AFC in PCOS. Whereas, the sample size, particularly in the H-AFC subgroup, was relatively small. Further investigation is needed in a larger population and also for the specific mechanism.

In conclusion, we enrolled 61 controls and 109 PCOS cases in the General Hospital of Ningxia Medical University and tested the hormonal parameter and AR CAG repeats length. Our results showed no significant difference in mean CAG repeats length and distribution frequency between controls and PCOS cases or between HA and NHA PCOS. However, our results revealed that individuals in the H-AFC group had a shorter mean CAG repeats length and a larger fraction of H-AFC group tends to have shorter CAG length for biallelic averages and long allele. Logistic regression model suggested that CAG < 24 for long allele can increase the risk of H-AFC in PCOS.

The exact diagnosis of PCOS, a common reproductive-age disease with substantial health and economic burden, is important for our society [27]. The new international guideline for PCOS aims to reduce the overdiagnosis and provide more accurate diagnosis. In the direction of new guideline, our study adds to the accumulating evidence that AR signaling plays an important role in the follicular development and provides insight on the relationship between CAG polymorphism and follicular arrest.

Conclusions

In this study, we demonstrated that the portion of individuals with lower CAG repeats length in H-AFC group was significantly larger than those with higher CAG repeats length. Logistic model revealed that individuals with lower CAG length tended to develop H-AFC, suggesting that CAG repeats length contributed to the risk of PCOM in the setting of new international PCOS guideline.