Introduction

Making music is a multimodal task that requires not only auditory-sensory-motor integration and emotional processing, but also higher order cognitive functions1,2. These cognitive functions, which include processing speed, attention, executive functioning, as well as working memory are all known to naturally decline with age3.

Therefore, to fully grasp brain processes underlying the production of music and its potential effects on aging, it is important to study the interaction between the different brain areas involved in these processes. One way to quantify these interactions is using functional connectivity (FC) by calculating and comparing the temporal correlation between different parts of the brain. FC can be measured using resting-state functional Magnetic Resonance Imaging (rs-fMRI). Brain regions that are functionally related, share similar spontaneous low-frequency fluctuations during rest4. This way, distinct functionally connected neural networks have been identified in the human brain, such as primary processing networks, which include the sensorimotor network (SMN), and the auditory network (AN), but also higher-order networks, like the executive control network (ECN, involved in goal-directed behavior, attention and working memory) and the default mode network (DMN)5. The DMN is unique as it shows a high level of activity at rest, but lower levels of activity when performing a task6. Most interestingly, it is the most consistently found network to be affected by aging (for reviews see e.g. ref.5,7). Another higher-order network that has been associated with non-pathological cognitive ageing in a large scale study is the cingulo-opercular network (CON), involved in cognitive control and goal-directed behaviour8. Often the CON network is also referenced as the salience network. For other networks, especially the primary processing networks, findings are more diverse. However, most studies find decreases in the AN and no change or decreases in the SMN in older adults5.

When investigating the effects of playing an instrument, typically musicians and non-musicians are compared. Here, the most consistent finding among studies is a higher FC between auditory and motor regions in musicians9,10,11,12,13. Interestingly, the largest study to date14 found increased FC within the AN and between bilateral superior and middle temporal, inferior frontal, and inferior parietal regions, but not between auditory and motor regions in musicians compared to non-musicians. Thus, results are still too inconsistent to draw clear conclusions about the exact effects of musical training on functional brain plasticity. One main problem is that most studies used relatively small sample sizes, which not only leads to a reduced chance of finding a true effect but also increases the probability that significant results do not reflect a true effect15.

Few studies investigated the effect of musical training on FC in longitudinal settings and those who did also used relatively small sample sizes. One study16 investigated eight weeks of drum training in a group of 15 young (16–19 years old) participants compared to a passive control group and identified a significant increase in FC between auditory and motor regions as well as between auditory and parietal regions in the drum group but not in the control group over time. Another study conducted over 24 weeks of piano training found increased FC within the SMN and between auditory-motor regions in 29 young adults compared to a passive control group17. However, these effects receded after 12 weeks of no training17. While these studies used younger populations to study the effect of musical practice, learning to play an instrument has also become a target intervention to slow down or counteract age-related cognitive decline in recent years18,19,20. Although these studies show promising results, they lack statistical power (between 8 and 16 participants in the intervention groups) and they did not use neuroimaging methods to investigate the underlying neural adaptions induced by the training.

Yet, important factors in the aging process of the human brain are maladaptive changes of its FC, most often resulting in reduced cognitive performance5. These include reduced FC within and between networks, but also increased FC, which is usually interpreted as a compensatory mechanism21,22,23,24,25. Based on the above-mentioned studies, musical training might be a promising tool to promote successful aging, potentially altering connectivity between brain areas, which in consequence may lead to preservation or even improvement of cognitive abilities26. Especially since Rogenmoser et al.27 have suggested, based on structural data, that music as a leisure activity can have an age-decelerating effect on the brain.

A recent publication by our consortium28 investigated the effect of six months of piano practice (PP) on white matter structural connectivity in healthy older adults as part of a larger randomized controlled trial (RCT). Analyses revealed a stabilizing effect of PP on microstructure of the fornix (a main output tract of the hippocampus), but no significant (Bonferroni-corrected) effects in auditory and motor-related tracts. However, PP participants still advanced in their playing abilities. While these improvements could not be explained by structural connectivity changes, here we investigate whether FC changes may underlie these improvements, using the same study sample. In addition, we are interested in whether piano training also influences FC of the DMN, which is known to be affected by aging and other primary processing networks, where the exact impact of ageing is still unknown5. The current study aims to investigate FC changes after 6 and 12 months of PP compared to an active control group participating in music listening/musical culture (MC) lessons in a large older adults’ sample. Specifically, we use seed-to-whole-brain FC analysis with the following 8 seeds that have been implicated in previous studies on musical expertise and aging: Heschl’s gyrus (HG, belonging to the AN)29, motor hand area (M1H, belonging to the SMN)30, hippocampal formation (HPF, belonging to the DMN)31, and inferior frontal gyrus (IFG, belonging to the ECN)32.

The HG is affected by musicians’ status in a large number of studies33,34,35,36,37,38, with changes being related to music proficiency33,34. The HG, as part of the auditory cortex, is anatomically linked to the primary motor cortex and the IFG via the arcuate fasciculus39. The IFG has been shown to be involved in music processing40,41 and the generation and selection of motor sequences42. Due to its involvement in working memory processes43 the IFG represents an interesting seed region. Improvements in working memory have been shown after six months of piano training in healthy older adults18,20, although working memory is particularly affected by age3. As playing the piano specifically engages fine motor control of hands and fingers, we selected a seed located at the M1H44. In an earlier publication of this study we showed that in comparison with the MC group, first, practicing piano resulted in greater improvements in manual dexterity and second, that unimanual dexterity and grey matter volume of the contralateral M1 changed together during the second semester of training45. Further, fine motor control is known to deteriorate with advanced age46,47. Another region of interest is the HPF, not only because of its involvement in aging and cognitive decline48, but also because of its involvement with music-evoked emotions49 and higher-order pitch processing50,51,52.

Each seed was investigated separately for the left and right hemispheres. We hypothesize that learning to play the piano primarily leads to FC increase between auditory and motor regions as found in previous studies9,10,11,12,13,16,17, verifying the association of those FC changes with behavioural measures and piano practice progress, closely linked to piano training, so-called near-transfer, primarily expressed in auditory and motor tasks. In addition, we hypothesize that learning to play the piano has the potential to counteract or slow down age-related FC changes, including increased FC in and between the SMN and AN. Considering the limited literature on the effects of musical instrument playing on the ageing DMN and ECN, we included these networks for exploratory analyses, hypothesizing that PP could have a beneficial effect counteracting age-related changes in these networks.

Materials and methods

The study was reviewed and approved by the ethics committee of Hannover Medical School as well as the cantonal ethics committee Geneva. All participants gave written informed consent before participation and all experiments were compliant with the Declaration of Helsinki.

Participants and study design

A total of 155 subjects (62–78, mean age = 69.7 years, 92 females) were recruited for the current RCT in Hannover (Germany, 92 participants) and Geneva (Switzerland, 63 participants). Participants had to be in good overall health, right handed53, retired, and non-reliant on hearing-aids. Furthermore, included participants should not have received more than 6 months of formal musical training outside the school curriculum during their lifetime. The general sophistication scale of the Goldsmiths Musical Sophistication Index (Gold-MSI) was assessed to measure their musical sophistication at baseline (minimum/maximum achievable score = 18/128)54. The cognitive telephone screening instrument (COGTEL)55,56 was used to assess cognitive functioning and avoid the inclusion of participants with early-stage dementia. As this test was specifically developed for older adults, it provides a global measure of cognition, based on several memory and executive function subtests. A cut-off value of below 10 was set a priori based on the original publications55,56, however each participant in the current study reached a score of at least 15. The participants were randomly assigned to one of the two intervention groups: PP or MC. In order to obtain homogeneous groups, age, gender, education level, and COGTEL score were taken into account to achieve equal distributions of these factors across both groups. Participants only learned of their group assignment after baseline testing. Both groups received one 60 min lesson per week for one year and were instructed to do homework daily for at least 30 min. PP lessons were taught in dyads whereas MC lessons took place with 4–6 participants. The lessons were designed and regularly supervised by two music pedagogy professors. Twenty-four teachers were involved who all were professional musicians with at least a bachelor’s degree. Participants in the PP group learned to play the piano with both hands (separately at first), how to read a musical score, and played different styles of music. MC lessons focused on active and analytical listening to music of different epochs and genres, learning about music history and musical styles, and information on basic formal and structural properties. The course aimed to create general enjoyment and appreciation of music, however, it excluded any active music-making (e.g. singing), or other music-related movements (e.g. clap**). More details about the study design can be found in James et al.57 and in the supplementary material of Worschech et al.58. To monitor the participants’ adherence to the study protocol, daily homework logs were collected. Adherence was based on self-report from the participants on how much they practiced daily at home, which had to be put into a training diary and collected regularly. Participation was assessed by the teachers after 6 and 12 months. In addition, the teachers followed the participants’ progress to make sure they did indeed practice at home. We observed a solid adherence to the interventions in all participants.

Participants underwent testing before the start of the interventions (T0), after 6 months of lessons (T1), and after 12 months of lessons (T2). Due to the COVID-19 outbreak and local restrictions, there were delays in testing at T2 for the majority of participants. Although some participants had to finish their courses online, all included participants had at least 46 lessons when T2 testing took place. To factor in the delays in testing, all statistical models include the time between measurements as a covariate.

Of the 155 participants, 120 completed MRI scanning at all 3 time points. Of these, 4 had to be excluded from the analyses because of artefacts in the T1 image, one participant was excluded because of too much atrophy and myelin degeneration, 6 participants showed excessive head movement and had less than 150 of 460 remaining volumes in the functional scan after head-motion scrubbing. This resulted in a total of 109 participants who were included in the final analyses (53 in the MC group and 56 in the PP group).

Image acquisition

MR images were acquired on 3.0 T Siemens MRI scanners (Hannover: Magnetom Skyra, Geneva: Magnetom Tim Trio; Siemens, Erlangen, Germany) using 32-channel head coils. Both study sites used the same scanning parameters. For the resting-state fMRI images, the following echo planar imaging (EPI) sequence was used: voxel size = 2.5 mm isotropic, FoV = 210 × 210 × 135 mm3, multiband acceleration factor = 3, repetition time (TR) = 1350 ms, echo time (TE) = 31.6 ms, and 460 volumes. The scan lasted 10.31 min and participants were instructed to focus on a fixation cross in the middle of the screen during scanning. Additionally, a high-resolution T1-weighted structural image was acquired, using the following MP2RAGE sequence59: voxel size = 1 mm isotropic, FoV = 256 × 240 × 176 mm3, TR = 5000 ms, TE = 2.98 ms, flip-angle 1 = 4°, flip-angle 2 = 5°, and 176 slices. To account for potential site effects and different MRI scanners, this factor was included as a covariate of no interest in all statistical analyses. For noise reduction during scanning, subjects wore Comply™ Canal Tips with an average noise reduction of 29 dB.

Preprocessing

The data was analyzed using the Data Processing Assistant for Resting-State Toolbox (DPARSFA, http://rfmri.org/DPARSF)60 and SPM12 (Wellcome Trust Centre for Neuroimaging, London, England). Preprocessing followed the recommended standard DPARSFA protocol.

In a first step, the first five image volumes of the functional images were discarded to account for initial signal instability. The remaining images were then slice-timing corrected and realigned to a mean image. Voxel-specific head motion parameters were calculated. Then structural images were co-registered to the functional images and segmented into white matter (WM), grey matter (GM), and cerebrospinal fluid (CSF) maps using DARTEL61. The Friston 24-parameter model62 was used to regress out motion artefacts from the realigned images. In addition, WM, CSF, and global signal were regressed out. Images were then normalized to MNI space using DARTEL and temporally bandpass filtered (0.01–0.10 Hz). In the next step, head motion scrubbing was performed, which removed time points with a framewise displacement of more than 0.4 mm63. In the last step, images were smoothed with a Gaussian kernel of 7 mm3 (full width at half maximum).

Seed-based functional connectivity analysis

Calculation of FC was carried out with a seed-based approach, using the DPARSFA toolbox60. Here, the time course of each seed (i.e. the averaged time courses of all voxels within this region) was correlated pairwise with the time course of all other voxels within the brains’ GM (Pearson’s correlations). Then individual’s correlation (r) values were normalized by means of Fisher’s z transformation and evaluated using an SPM second level model. Difference images for T1-T0 and T2-T0 were calculated and entered into a General Linear Model to assess changes after 6 and 12 months respectively. Anatomical locations of the significant clusters were identified using the Automated anatomical labelling atlas (AAL3)64.

Seed regions

As outlined in the introduction, eight different seeds were selected based on a recent study by our consortium28, their relevance in music making/musical processing, aging brain literature, and their susceptibility to training-induced changes.

Peak coordinates for the seeds were selected based on existing literature and a mask for each seed was created using a 5 mm radius around the peak coordinates. Table 1 summarizes the selected seeds, their peak MNI coordinates and the reference the coordinates are based on. Locations of the selected seeds are visualized in Fig. 1.

Table 1 Seed regions.
Figure 1
figure 1

Seed regions visualized on MNI template. Coronal slices are displayed in radiological convention (left = right hemisphere, right = left hemisphere).

Behavioral tests

As this analysis is focusing on resting-state FC, only tests that are relevant for the statistically significant FC changes are explained in detail below. As explicated in the hypothesis, the focus of this analysis is on the acquisition of a motor skill. Thus, for the evaluation of near transfer effects of the acquisition only motor tasks were evaluated. For a full list of administered tests in the study, please refer to James et al.57.

MIDI-based scale analysis

An adaption of the MIDI-based scale analysis (MSA)66 was used in both groups to measure improvements in piano playing. This measure has previously been shown to be an indicator of pianistic expertise67. Participants were instructed to play sequences of 15 five-tone range scales (C-G) in both playing directions with the right hand, using the following fingering: 1-2-3-4-5-4-3-2-1-2-3-4-5-4-3-2-1 (1 = thumb, 5 = little finger). The outcome measure was the mean standard deviation of the time between the onsets of two subsequent notes (IOI = inter-onset interval). The desired IOI was set to 790 ms and paced by a metronome.

As we considered this task too difficult and thus frustrating to perform with the left hand for the MC group at T1 and T2, this test was performed with the right hand only.

Purdue pegboard

The Purdue Pegboard Test (PT)68 was used to measure fine motor dexterity and gross movements of the hands, fingers and arms. The PT consists of three sub tests that all measure gross hand and arm control: right hand (PT-RH), left hand (PT-LH), and both hands (PT-BH, also measuring bimanual coordination). These tests take 30 s each and consist of placing as many pins as possible in a vertical row with the specified hand (PT-RH and PT-LH) or both hands at the same time (PT-BH).

Moreover, there is another assembly subtest measuring fine finger/ fingertip dexterity and bimanual coordination (PT-A). In this test, participants have 60 s to assemble a small tower consisting of a pin, a washer, a collar, and another washer. Final scores consist of the total number of pins placed/objects assembled within the given time.

Statistical analysis

Group-level

In a first step, only baseline measurements were taken into account to assess whether the two groups differed in FC before the start of the interventions. Group differences at baseline were assessed with two-sample t tests entered into separate General Linear Models (GLM) for the eight seeds of interest in SPM12, including site, age at the start of the intervention (demeaned), and gender as nuisance covariates.

In a next step, to assess training-related changes, difference images were calculated between T0 and T1 as well as T0 and T2 and also entered into a GLM, with site, age at the start of the intervention (demeaned), gender, and time between scans (demeaned) as nuisance covariates. Separate GLMs were created for each seed for six and twelve months’ data. To measure FC changes over time, one-sample t tests including the full sample (both groups combined) were calculated (PP+  + MC+ for increased FC over time and PP + MC for decreased FC over time).

Group differences (over time) between the two intervention groups were assessed with a two-sample t test (MC > PP and PP > MC). To account for the amount of voxels in the whole brain analysis, we used FWE correction at cluster level, and lowered the significance threshold further, to account for the eight seed regions, resulting in a significance threshold for each seed region of pFWE < 0.05/8 = 0.00625.

In a last step, it was important to know whether the seeds and cluster regions were positively or negatively correlated (i.e. anticorrelated) at baseline to distinguish between different FC changes over time. A FC increase over time could for example mean a higher positive correlation or a decreased anticorrelation. For this purpose, we created binary masks from the significant clusters and used these to extract the baseline correlations from the T0 images for the respective seed regions.

Further statistical analyses were performed in IBM SPSS Statistics version 27 (IBM Corp., Armonk, NY, USA). As difference images were entered into the GLMs, significant effects do not quantify where this difference originates from. Therefore, the mean value over the significant cluster was extracted and compared against 0 for each group using a one-sample t test. A non-significant test result implicated no change over time for this group, while a significantly higher mean would quantify an FC increase and a lower mean a significant FC decrease over time. Due to the number of tests (12 in total), Bonferroni correction was used, setting p < 0.0042 as the significance threshold.

Behavioral and correlational analyses

To keep in line with the FC analysis, we performed separate analyses for 6 and 12 months data. Hence, MSA was analyzed using two 2 × 2 (Group × Time) mixed-design ANOVAs, with group as a between-subject variable and time as a within-subject variable. Outliers with more than two SD from the mean were excluded from this analysis. This was the case for four participants. Eight additional participants with at least one missing value were excluded. Therefore, the final analysis on MSA comprised 45 MC and 52 PP participants. PT was analyzed separately for the four sub-tests, each with a 2 × 2 (Group × Time) mixed-design ANOVA. For correlation analyses, difference scores were calculated and correlated with extracted cluster FC changes (T1–T0, or T2–T0, depending on the outcome of the FC analysis). Correlations were only calculated if post-hoc analyses on FC revealed a significant FC change over time.

In total eight correlations were computed, therefore, the significance threshold was set to p < 0.00625.

Results

Demographic data (N = 109)

Both groups did not differ in respect to age, gender, education level, COGTEL score and Gold-MSI score (Table 2).

Table 2 Demographics (N = 109).

FC group differences at baseline

No significant group differences were detected at baseline.

FC results—6 months

There were no significant effects after 6 months of training.

FC results—12 months (T2)

No significant effects were found for the left HG, left M1H, right HPF, and left IFG seeds. The right IFG was the only seed region that showed a main effect of time. Across all participants, there was a significantly higher FC (pFWE = 0.002, k (number of voxels in the significant cluster) = 310, T = 4.46) between the right IFG and the left angular gyrus after twelve months of interventions. For that seed, no significant effect of group could be detected. The right HG, right M1H and left HPF all revealed significant FC group differences over time. The only FC seed which showed a significant MC > PP effect of group after 12 months was the right HG. These significant group results are described in more detail below.

A summary of all significant effects after 12 months of interventions are shown in Table 3. Additionally, results for the PP group after 12 months are displayed in Fig. 2.

Table 3 Summary of significant effects after 12 months of interventions.
Figure 2
figure 2

Results for the PP group after 12 months of intervention. Significant FC increases or decreases that were found for the PP group in comparison to the MC group. Gray edges indicate increased FC and red edges indicate reduced FC between regions after piano training. Seed regions are printed in bold. Blue nodes = results for HG (Heschl’s gyrus), green node = results for HPF (Hippocampal formation), Black nodes = results for M1H (Motor hand area). IPL inferior parietal lobule, dPCC dorsal posterior cingulate cortex, dPMC dorsal premotor cortex, PCG precentral gyrus, SPL superior parietal lobule, R right, L left. Images are displayed in radiological convention (left = right hemisphere, right = left hemisphere). The results were visualized with BrainNet viewer104, thus the current findings might not be generalizable to the global population. This poses an interesting and relevant research gap that should be explored specifically in future research.

In conclusion, the current study demonstrated piano training-induced FC changes in specific sensorimotor and auditory-motor networks in healthy older adults after 12 months of training. Although evidence for the positive effect of learning to play the piano to counteract or slow down age-related decline is limited in the current study, the fact that the older adult brain shows neuroplastic adaption to a newly learned skill raises hope that it is possible to actively influence the course of neurological aging, potentially by increasing cognitive reserve. Therefore, future studies are needed to see whether learning to play the piano at an advanced age might be an effective tool to reduce the burdens related to age-related cognitive decline and sensorimotor deterioration in the long term. That music practice might be effective is plausible, as musical activities involve several functions and skills that are prone to decline with age, such as working memory, attention, processing speed, executive functions, and even abstract reasoning. The fact that a majority of individuals experiences music-making as pleasant would allow long lasting involvement after retirement. Overall, the current study poses an essential starting point showing that functional neuroplasticity in older adults is possible in response to musical training.