Introduction

The menopause - defined as cessation of menstrual periods - is a gradual, naturally occurring process for women that typically occurs between 42 and 55 years of age [1]. Problems ‘below the waist’ are often unattractive topics for public or political discourse; the UK Government’s recent publication of the Women’s Health Strategy [2] and establishment of the Menopause Task Force [3] have increased attention in the UK.

Many women (around 80%) experience sometimes debilitating symptoms associated with oestrogen and testosterone depletion for an average of four (but up to 12) years. These include hot flushes, night sweats, sleep disruption, fatigue, difficulty concentrating, depression and anxiety, mood swings, irritability, and loss of confidence [1]. Women experiencing problematic menopausal symptoms report lower levels of health-related quality of life and greater use of healthcare than women without symptoms. Prolonged lack of oestrogen impacts the cardiovascular system and can increase risk of long-term conditions including osteoporosis [4]. Although many women experience problematic menopause symptoms, many fewer use Hormone Replacement Treatment (HRT) [5].

Emerging evidence suggests physical activity may be beneficial [6] for alleviating some menopausal symptoms and hel** prevent diseases with increased risk during menopause. Physical activity can be categorised into occupational, sports, conditioning, household, or other activities. Exercise is physical activity that is planned, structured, and repetitive and has improvement or maintenance of physical fitness as an objective [7]. There is strong evidence that, throughout life, physical activity is protective against chronic conditions including coronary heart disease, obesity, type 2 diabetes, and mental health problems [8]. UK recommendations are clear about the benefits of physical activity, importance of strength training, and amount of recommended weekly activity [9].

However, women may reduce physical activity during menopause transition [10]. In England 38% of women aged 45–54 years do less than the weekly recommended 150 min moderate activity, and 60% of adult women do less than twice weekly strength training [11]. The ‘Women in Sport’ report [10] found that, although physical activity levels were low, menopausal women’s desire to be more active was high, and increased when recommended by a health professional. Many women with menopause symptoms may seek medical advice from their GP. However, not all women want to, or are able to take, or access, HRT. Strengthening the evidence-base for all intervention options may improve agency over menopause symptom management for women [5].

Aim

This overview aims to assess the effectiveness of physical activity and exercise interventions targeting women experiencing menopause. There are multiple recent and ongoing systematic reviews in this area, but no existing overview [12,13,14].

Methods

The overview protocol was registered on PROSPERO (CRD42022298908) [15] which is an international prospective register of systematic reviews. This platform allows authors to view systematic reviews in progress and also improves accountability for authors to justify any decisions made to change their original published protocol. It was conducted using guidance from the Cochrane Handbook for Systematic Reviews of Interventions [16] and reported following the Preferred Reporting Items for Overviews of Reviews (PRIOR) [17].

Inclusion criteria

The PICO (population, intervention, comparator, outcomes) inclusion criteria are provided in Box 1.

Box 1 Inclusion criteria for systematic reviews

Types of studies

We included systematic reviews (with or without meta-analysis). Systematic reviews were defined using criteria from the Database of Abstracts of Reviews of Effects [29]. Reviews had to have: clear inclusion/exclusion criteria; adequate search strategy; synthesis of studies; and quality assessment of studies. Reviews reported only as conference abstracts, or not published in English, and review protocols were excluded.

Search strategy

In June 2023 we searched Medline, Embase (both OvidSP), CINAHL (EBSCO), Scopus (Elsevier), The Cochrane Database of Systematic Reviews and Social Science Citation Index (Clarivate); reference lists were also searched. These databases were chosen based on previous reviews on a similar topic. Date limitation (2000 onwards) was used because a Medline search produced no prior results and because validated tools and instruments were not previously routinely used for menopause symptoms [30]. The strategy combined keywords relating to physical activity, exercise, menopausal symptoms and study design (see Supplementary Table 3 for example search strategy). Only reviews written in the English language were included.

Selection criteria

Two independent reviewers (AM and DH/AMK) used the systematic review organisational web tool, Rayyan [31] to review both abstracts and full text articles of potentially relevant studies. This ensures robust screening of papers and reduces the risk of bias as papers are blinded until screening is complete. At each stage disagreements were resolved through discussion with a third reviewer (CET).

Data extraction and management

A data extraction table was created based on Cochrane guidance for overview of reviews [16]. Two reviewers (AM and AMK/DH) extracted data including: review methods and characteristics; participants and interventions; outcome data; and demographic characteristics in line with PROGRESSplus which is a validated list of characteristics known to affect health outcomes and equity [32]. PROGRESS refers to Place of residence; Race/ethnicity/culture/language; Occupation; Gender/sex; Religion; Education; Socioeconomic status; Social capital and PLUS refers to: (1) personal characteristics associated with discrimination (e.g. age, disability) (2) features of relationships (e.g. smoking parents, excluded from school 3) time-dependent relationships (e.g. leaving the hospital, respite care, other instances where a person may be temporarily at a disadvantage).

Methodological quality

Two reviewers (AM and AMK) independently assessed reviews using the AMSTAR-2 [33] (Assessing the Methodological Quality of Systematic Reviews) tool to generate overall ratings of ‘high’, ‘moderate’, ‘low’ or ‘critically low’ based on 16 questions/items, of which seven are deemed critical. The review team discussed critical domain weaknesses to produce consensus judgements. This tool was chosen as it is appropriate for systematic reviews that include both randomised and non-randomised trials.

Overlap

To assess overlap in included primary studies between reviews, the corrected covered area (CCA) [34] was calculated using GROOVE (Graphical Representation of Overlap for Overviews). This software calculates the percentage of studies that appear in two reviews. This was to ensure primary studies were not counted more than once which would potentially give false weighting to a result. A CCA score or > 15% indicates very high overlap.

Synthesis

We used an approach guided by the SWiM [35] (synthesis without meta-analysis) methodology, and key aspects of this approach are as follows. We considered two main types of comparison: physical activity versus no intervention and comparisons between two different types of activity. The synthesis was further grouped by activity type and then by symptoms. We used a best-evidence approach to focus on reviews with high-quality AMSTAR-2 ratings published within the past 10 years. Guided by GRADE domains, we also considered risk of bias, imprecision and inconsistency of evidence within each review and overall [36].

Results

We identified 370 records (Fig. 1). Following deduplication, we assessed 297 abstracts and 215 full texts, and included seventeen systematic reviews [6, 14, 37,38,39,40,41,42,43,44,45,46,47,48,49,50,51].

Fig. 1
figure 1

PRISMA flowchart of data selection

Description of included reviews

Main review characteristics are presented in Table 1. From these reviews we extracted 80 unique studies (71 RCTs) and 8983 participants once overlap has been accounted for; study overlap is shown in Fig. 2. Reporting on demographic characteristics in line with PROGRESSPlus [32] guidance was limited.

Table 1 Summary of included systematic reviews
Fig. 2
figure 2

Corrected Coverage Area (CCA) for included reviews

Interventions included: aerobic exercise (11 reviews); yoga (11 reviews); other forms of mind-body therapy, such as stretching and relaxation, Tai Chi and Qi Gong (eight reviews); resistance/strength exercises (seven reviews); and pelvic floor muscle training (two reviews). Yoga is categorised separately to mind-body therapy due to the large number of reviews focusing solely on yoga. Intervention durations varied from one week to 24 months. Session length ranged from 20 to 120 min and frequency from 0.5 to 7 times per week. Comparators included: no active treatment, wait-list controls, health education or alternative exercise. Two RCTs in the reviews used HRT as a comparator but this was not reported at review level. Table 2 shows assessed comparisons and symptoms reported.

Eight reviews were rated high quality [6, 14, 38, 45, 46, 48, 50, 51] via AMSTAR-2, the remainder were moderate [39, 42], low [37, 41, 44, 49] or critically low quality [40, 43, 47] (Supplementary Table 2). The most common issues were lack of protocol before conducting the search, not justifying exclusion decisions, and not considering bias assessment in discussion of individual results. We summarise only the high-quality effectiveness evidence here; detailed results including effect estimates are in Table 3. Detailed results of moderate, low, and critically low-quality evidence are summarised in Supplementary Table 1.

Table 2 Evidence map by symptom category and comparison
Table 3 Summary of findings (high quality reviews only)

Comparison 1 – Exercise vs. no intervention

Aerobic exercise vs. no intervention

Total menopause symptoms

Two recent, high-quality reviews [14, 46] assessed the effect of aerobic exercise on total menopause symptoms (seven RCTs, 542 participants). There is no primary study overlap between these high-quality reviews because of different inclusion criteria. One review [46] (two RCTs, 101 participants), showed a beneficial effect of aerobic exercise, the second review [14] (five RCTs, 441 participants) reported no effect of interventions.

Vasomotor symptoms

Four high-quality reviews reported vasomotor symptoms [6, 14, 46, 51] (21 RCTs, 2219 participants). Three reviews consistently reported no clear effect of aerobic exercise. Evidence from all three reviews [6, 14, 46] is impacted by imprecision, one review’s GRADE assessment was low certainty evidence [6]. One review separately analysed frequency and severity of vasomotor symptoms, finding no clear effect on frequency but a beneficial effect of aerobic exercise on severity [51].

Other symptoms

One high-quality review [14] found no clear effect for the four following symptom categories: psychological, physical, urogenital, sexual. For each outcome there was high heterogeneity, at least one study at high risk of bias, or both (Table 3).

Exercise intensity: low

Psychological symptoms

Three reviews assessed low intensity exercise (including walking and yoga) [43, 44, 48] one was high-quality [48]. The high-quality review found no clear effect based on four RCTs (460 participants). The evidence was impacted by imprecision.

Exercise intensity: moderate-high

Psychological symptoms

Three reviews assessed moderate to high intensity exercise for psychological symptoms [43, 44, 48] one was high-quality [48]. Moderate intensity exercise was consistently defined across all three reviews; aerobic and cardiovascular activity were considered moderately intense. The high-quality review found no clear effect based on seven RCTs (1250 participants).

Exercise duration

Both ‘short to medium term exercise’ and ‘long term exercise’ versus no intervention were assessed by four reviews [43, 44, 46, 48] two were high quality [46, 48].

Short to medium-term exercise (up to 6 months)

Total symptoms

One high-quality review [46] assessed between one and 24 weeks exercise and found a benefit based on five RCTs (282 participants).

Psychological symptoms

The second high-quality review [48] assessed up to 12 weeks of exercise for psychological symptoms and found a benefit based on seven RCTs (939 participants).

Long-term exercise (over 6 months)

Psychological symptoms

One high-quality review [48] assessed long-term exercise for psychological symptoms (five RCTs, 1013 participants) and found no effect. The evidence is impacted by high risk of bias.

Yoga versus no intervention

Total symptoms

One high-quality review [38] (eight RCTs, 671 participants) found a benefit on total menopause symptoms. This review included some studies at high risk of bias, but it was unclear how many such studies contributed to the effect.

Vasomotor symptoms

Three high-quality reviews [14, 38, 45] found differing results. One review [38] (eight RCTs, 548 participants), found a benefit of yoga. Two reviews [14, 45] (eleven RCTs and 383 participants) found no effect of interventions.

Psychological symptoms

Three high-quality reviews [14, 38, 45] included ten, four and six RCTs respectively when assessing the impact of yoga on psychological symptoms but the number of unique RCTs and participants is unclear. One review [45] showed a benefit of yoga; the other reviews found no clear effects [14, 38]. All reviews were impacted by high risk of bias in studies with substantial numbers of participants.

Physical symptoms

Two high-quality reviews [14, 38] (11 RCTs, unclear number of participants), found a benefit of yoga. Symptoms included were sleep, joint pain and fatigue. One review [38] included some high risk of bias studies but their relevance to this outcome is unclear.

Urogenital symptoms

One high-quality review [38] found a benefit of yoga from a meta-analysis of seven RCTs (661 participants). Review authors reported moderate heterogeneity (61%) and some high risk of bias.

Sexual symptoms

One high-quality review [14] found no effect of interventions from two RCTs (150 participants). This is strongly impacted by imprecision because of small numbers of participants.

Mind-body therapy versus no intervention

Total symptoms

One high-quality review [46] found a positive effect favouring mind-body therapy based on two RCTs (92 participants). This review is impacted by high levels of imprecision because of the small number of participants.

Resistance exercise versus no intervention

In both total and vasomotor symptoms evidence is very limited despite high-quality reviews.

Total symptoms

One high-quality review [46] included a single very small study (23 participants) which found no effect on total menopause symptoms. Very high levels of imprecision limit the certainty of this finding.

Vasomotor symptoms

One high-quality review [50] reported a benefit of resistance training on incidence of moderate or severe hot flash episodes. This is based on a single, very small RCT (58 participants) and the effect estimate reflects high levels of imprecision.

Pelvic floor training versus no intervention

Total symptoms

One high-quality review [14] found no effect based on two RCTs (59 participants). Again, small numbers of participants mean evidence is very imprecise.

Comparison 2: Different types of exercise compared with each other

Aerobic exercise versus yoga

Total symptoms

One high-quality review [38] found no difference between aerobic exercise and yoga based on four RCTs (386 participants). While the review included some studies with high risk of bias it is unclear how many of these contributed to this evidence.

Vasomotor symptoms

Four high-quality reviews [6, 38, 45, 51] had mixed findings. One review [38] (three RCTs, 373 participants) found an effect in favour of yoga. This review included some studies at high risk of bias, but it was unclear how many contributed to the effect. Two reviews [6, 51] (two RCTs, 279 participants and three RCTs, 432 participants) each found no clear differences between aerobic exercise and yoga. A fourth review did not pool the two included RCTs (432 participants); one of these showed a benefit of yoga [45].

Other symptoms

One high-quality review [38] assessed aerobic exercise versus yoga on psychological (five RCTs, 526 participants), physical (four RCTs, 483 participants) and urogenital symptoms (three RCTs, 376 participants). There were no clear differences between the groups for any symptoms. This review included some high risk of bias studies, but it was unclear how many contributed to the effect estimates.

Aerobic exercise versus resistance training

Total symptoms

One high-quality review [50] included one RCT (36 participants) comparing aerobic versus resistance training for 8 weeks. No differences were observed in total menopause quality-of-life domains. This result is subject to a high level of imprecision due to the small sample size.

Vasomotor symptoms

One high-quality [50] review reported an effect favouring resistance training over aerobic exercise on frequency of hot flash symptoms based on one RCT (36 participants). Due to the small sample size, this result is subject to a high level of imprecision.

Discussion

This overview identified seventeen systematic reviews which included 80 relevant primary studies (8983 unique participants). Eight high-quality reviews evaluated various forms of physical activity. Fifteen reviews compared an activity to no intervention, eleven reviews researched aerobic exercise and eleven reviews researched yoga interventions, with limited evidence for other interventions. Nine reviews related to total symptom measures, vasomotor and psychological symptoms, with very limited evidence for physical, sexual, urogenital symptoms. Key limitations of the evidence from high-quality reviews were high risk of bias in primary studies, differences between review findings and heterogeneity between studies within reviews, and in some instances very high levels of imprecision due to few studies with very small numbers of participants. All of these reduce our confidence in the evidence.

Yoga was the intervention with most evidence (11 reviews) in terms of numbers of primary studies. There is evidence showing improvement of physical, urogenital, and total symptoms following yoga interventions two to three times weekly. Evidence for vasomotor and psychological symptoms was inconclusive. Session lengths and intervention durations varied, ranging from 20 to 120 mins and from three weeks to 24 months respectively. Findings for aerobic exercise were inconclusive although there were some examples of beneficial effects on total and vasomotor symptoms. Findings directly comparing yoga with aerobic exercise varied; it is unclear whether their effects differ. Evidence for resistance training was very limited. Based on the evidence synthesised in this overview, there is insufficient good quality evidence to recommend one form of exercise over any other. There is clear need for further high quality RCTs in this area to identify efficacious exercise interventions.

Strengths and limitations

This overview systematically appraised the evidence for the effects of a wide range of physical activity on menopausal symptoms although interventions were predominantly structured forms of exercise rather than general activity. We used a comprehensive search and rigorous systematic review method; our synthesis focused on the highest-quality evidence. Identified reviews included primarily RCTs meaning most evidence originates from studies designed to answer questions of intervention efficacy. A risk with undertaking an overview is that included reviews may not be up to date with the most recent RCTs. An overview review also depends on existing systematic reviews including salient activities and outcomes. These limitations are to some degree mitigated by the relatively recent search dates and wide scope of included reviews, which should capture most interventions and outcomes. Limited impact was observed for our date and language restrictions: no reviews were excluded due to our post-2000 restriction; four reviews were excluded due to language limitations.

Overviews risk multiple counting of primary studies and participants due to overlap. Good reporting in included reviews allowed us to map overlap and determine the number of unique RCTs and participants contributing to each symptom assessment for most comparisons. This means we are confident we have not over-estimated the evidence base.

Overviews necessarily rely on the decisions and judgements of authors of included reviews which are sometimes poorly reported or unreliable. We have mitigated this by focusing on reviews assessed as high quality and by considering, wherever possible, the consistency and precision of their included evidence as well as risk of bias. We have noted where this was unclear. Activities were generally not well or systematically described (there was no use of TIDieR) [52] and many exercise interventions were simply classed as ‘aerobic’; further research should clearly delineate exercise categories to permit greater understanding of the impact of exercise type, intensity, and duration. A narrative review on walking as a potential benefit for menopause symptoms [53] previously highlighted this difficulty in navigating the variations in exercise duration and frequency as a challenge for researchers with respect to agreement on beneficial outcome.

Reporting of factors relating to equity was also limited, making the relevance of evidence to disadvantaged groups unclear. Current reviews categorise women on menopause status; broadening this to include ethnicity, income status, employment type and other PROGRESSPlus [32] factors will allow better understanding of the context in which an intervention is most effective.

Evaluation of HRT as a comparator was lacking; only seen in two reviews [6, 42]. Each review included a single RCT comparing this with exercise; findings were inconsistent with evidence strongly favouring HRT over physical activity for hot flashes [6] but a benefit of physical activity over HRT on menopause-specific quality of life was evidenced [42]. Reporting of complementary use of HRT was also limited. Since many women use exercise as an alternative to or complement of HRT, the comparison between and intersection of the two approaches could usefully be explored. HRT prescribing is unequal; in the UK women from lower socio-economic groups are around 30% less likely to be prescribed HRT than those from more affluent areas [54]. Communities with higher Black and minority ethnic populations are more likely to have lower socio-economic profiles, so multiple equity indicators are likely involved in prescribing [55, 56]. Strengthening evidence for all menopause management options available to women may support equity and accessibility of useful treatments.

Further research

Reviews frequently reported there was no evidence of a difference between exercise groups and controls, or between different types of exercise, but it should be noted that most included RCTs were underpowered. Thus findings of no difference between different types of exercise do not indicate that either exercise would have no effect relative to no intervention; but an assessment of appropriate transitivity would need to be undertaken in order proceed with a network meta-analysis of all available trials to fully utilise the data for this. Direct comparisons indicate that both yoga and aerobic exercise may sometimes be effective relative to no intervention. In many cases included trials were too small and too few to identify effects; optimal information sizes were not calculated but would likely not be met. It is therefore important that reviews, and our synthesis, are not interpreted as evidence of no effect; but rather as evidence that more adequately powered studies are needed.

We have identified several research gaps via this overview. More research on resistance training and high intensity aerobic exercise is needed, as is research on the impact of exercise on physical, sexual, and urogenital symptoms. Beyond this, a comprehensive review and network meta-analysis would enable full use of all the available RCT evidence to identify where new trials are most urgently needed. Any future RCTs should be adequately powered, rigorously designed, and clearly reported. This includes clear description of interventions using TIDieR domains [52] and clear reporting of PROGRESSPlus participant characteristics [32].

Conclusion

There is some evidence that both yoga, and to a lesser extent, aerobic exercise may be beneficial for some menopause symptoms, but there is insufficient evidence to recommend a particular form of exercise for specific symptom management. Exercise benefits for general health are well documented; both aerobic and resistance training are generally recommended [9]. These recommendations, together with the personal preferences of women, are currently likely to be most important in guiding exercise choice and clinical prescription.