Introduction

Low back pain (LBP) is a leading cause of disability worldwide, placing a great burden on the global health care system [1, 2]. Intervertebral disc (IVD) degeneration (IVDD) is a significant contributor to nonspecific LBP, with a lifetime prevalence of over 80% [3, 4].

Early stages of IVDD are mainly in the form of biochemical changes, including proteoglycan (PG) reduction, dehydration, and collagen degeneration. It will lead to a decrease in hydrostatic pressure, resulting in nucleus pulposus (NP) dehydration and loss of the structural and mechanical properties of the IVDs. In advanced stages of IVDD, along with loss of hydration and the subsequent drop in disc pressure, IVD height decreases under load [5,6,7,8,9]. These degenerative changes are accompanied by structural lesions, such as disc herniation, causing LBP, neurogenic claudication, and even cauda equina syndrome. At this stage, treatment strategy is limited to conservative treatment alone or surgical excision [10]. Early detection of alterations in IVDD is important for develo** preventative strategies or reestablishing degenerated IVDs, such as gene therapy, stem cell therapy, and growth factor therapy [11,12,13].

Conventional magnetic resonance imaging (MRI) is widely used for morphologic, qualitative assessment of IVDD in the clinical workup. Lumbar IVDD is commonly scored using the Pfirrmann grading system, which is based on the assessment of structure and loss of the signal intensity on T2-weighted imaging (T2WI). This grading system provides a standardized and reliable assessment of MRI disc morphology, but cannot detect early degeneration of IVDs characterized by a loss of PG quantitatively [14, 15].

Several quantitative MRI techniques to evaluate IVD degeneration objectively have been reported, such as diffusion-weighted imaging, diffusion tensor imaging, glycosaminoglycan chemical exchange saturation transfer, sodium, delayed gadolinium-enhanced MRI, T2/T2*, and T1rho map**. Previous studies have demonstrated that T2* map** could be quantitative imaging biomarkers for evaluating the biochemical state of the discs and correlating that with histology, water content, and degeneration [16].

Ultrashort time-to-echo (UTE) imaging as a novel MRI technique has the capacity to catch very short T2* signals (0.008 ~ 0. 50 ms) [17,18,19,20], It has been confirmed to be sensitive to changes in the deep tissue matrix and to subtle and even preclinical degeneration [21]. To date, UTE-T2* imaging has been reported to be a reliable tool for quantitative assessment of the biochemical changes of short T2 tissues, including tendon, cartilage, and ligament, etc. [20,21,22,23,24,25]. However, to the best of our knowledge, studies on UTE-T2* quantitative technique in evaluating IVDD are scarce. We hypothesized that quantitative UTE-T2* map** is capable of revealing degenerative changes in the discs.

The present study aimed to assess whether lumbar IVDD can be evaluated using UTE-T2* map** and to compare the potential of UTE-T2*and T2* values in the diagnosis of early IVDD.

Materials and methods

Subjects

Ethics approval for this study was provided by the ethics commission of the Fudan University Affiliated Zhongshan Hospital **. T2WI images were used for Pfirrmann grading. Detailed acquisition parameters were assigned to T2WI, UTE-T2* map**, and T2* map**.

For T2WI, repetition time (TR)/echo time (TE) = 2500/120 ms, slice thickness/gap = 5/0.5 mm, field of view (FOV) = 300 × 300 mm, matrix = 256 × 192, and bandwidth = 41.7 kHz. For UTE-T2* map**, TR = 112.4 ms, TE = 0.032/3.8/8.8/13.2 ms, slice thickness/gap = 3/0.5 mm, flip angle = 15, FOV = 300 × 300 mm, matrix = 300 × 256, and bandwidth = 62.5 kHz. Maps of the T2* was calculated using R2*(R2* = 1/T2*) [26, 27].For R2* map**, TR = 12.5 ms, TE = 1.40 /2.29 /3.18/4.07/ 4.96/5.85/ 6.74 / 7.63 / 8.52 ms, slice thickness/gap = 5/0 mm, flip angle = 5, FOV = 300 × 300 mm, matrix = 256 × 192, and bandwidth = 111.1 kHz.

Image analysis

All lumbar IVDs were evaluated by 2 musculoskeletal radiologists each with more than six years of experience using T2WI and assigned a Pfirrmann grade (Table 1) [28]. Disc degenerations also were divided into three categories [11]: normal (Pfirrmann grade I), early disc degeneration (Pfirrmann grade II-III), and advanced disc degeneration (Pfirrmann grade IV-V). UTE-T2* values were calculated by mono-exponential fitting using custom code in MATLAB (MathWorks, Natick, MA). Five circular regions of interest (ROIs) of equal size were manually drawn on the midline slice of sagittal UTE-T2* and T2* map**s from anterior to posterior (Fig. 1), including the anterior annulus fibrosus (AAF; ROI 1), nucleus pulposus (NP; ROI 2–4), and posterior annulus fibrosus (PAF; ROI 5) [29].

Table 1 Pfirrmann grades of disc degeneration
Fig. 1
figure 1

MR images of the lumbar spine of a 39-year-old woman. Every lumbar IVD was cut into 5 uniform parts in each UTE-T2* and T2* map**. An ROI of 1 represented AAF, ROI 2–4 represented NP, and ROI 5 represented PAF. IVD, intervertebral disc; NP, nucleus pulposus; PAF, posterior annulus fibrosus; ROI, region of interest; UTE, ultrashort time-to-echo

Statistical analysis

Statistical analysis was conducted using SPSS 22.0 software (IBM, Armonk, NY) and Medcalc 20.022 (Mariakerke, Belgium). The Kruskal–Wallis test was performed to determine differences among the 5-level Pfirrmann grades. The differences between the two methods were expressed using the ± 95% confidence intervals (CIs) from the Bland–Altman analysis. Correlations of quantitative values with Pfirrmann grades were analyzed using Spearman's rank correlation. Receiver operating characteristic (ROC) analysis was performed and area under the curve (AUC), sensitivity, specificities, positive likelihood ratio (+ LR), and negative likelihood ratio (− LR) were obtained to assess the diagnostic efficacy of each quantitative parameter for differentiating normal IVDs from early disc degeneration and to differentiate early disc degeneration from advanced disc degeneration. AUCs were compared using the DeLong method [30]. A P value less than 0.05 was considered statistically significant.

Results

Clinical characteristics

Seventy-six subjects aged 54.4 ± 15.7 years (range, 19–85 years) were recruited into this study. They included 29 males (aged 51.8 ± 17.2 years; range, 19–82 years) and 47 females (aged 54.5 ± 14.9 years; range, 27–85 years). Five intervertebral discs (L1/L2–L5/S1) per subject were examined, of which 14 discs were excluded due to previous vertebral fusion operation (n = 2), collapsed disc space (Pfirrmann grade V) that made it impossible to measure the quantitative values (n = 9), and image quality problems (n = 3). A total of 366 IVDs were included in this study.

Using the Pfirrmann grading system, 73 discs were categorized as grade I; 110 discs, as grade II; 164 discs, as grade III; and 19 discs, as grade IV. The flowchart for the enrollment of the study population is presented in Fig. 2. The distribution of the UTE-T2*and T2* values with respect to Pfirrmann grades is provided in Table 2.

Fig. 2
figure 2

Flow diagram of the study enrolment discs

Table 2 UTE-T2* and T2* values of AAF, NP, and PAF in each Pfirrmann grades

Correlation of UTE-T2*and T2* values with Pfirrmann grades

The Kruskal–Wallis test demonstrated that all quantitative values for all segments were significantly different among different Pfirrmann grades (Table 2). Bland–Altman plots are shown in Fig. 3. There was no significant bias between UTE-T2*and T2* values in NP and PAF(P > 0.05).UTE-T2* values showed high correlations with Pfirrmann grade in NP (r =  − 0.733; P < 0.001). Moderate correlations with Pfirrmann grade were observed in T2* value of NP (r = -0.654; P < 0.001). Comparing the Spearman correlation coefficient, the highest correlation value was seen in NP and the lowest was seen in AAF. Among those, the UTE-T2* value of NP showed the highest correlation values with Pfirrmann grades. Results of Spearman's correlation analysis are summarized in Fig. 4.

Fig. 3
figure 3

Bland–Altman plots comparing both UTE-T2* and T2* values of AAF, NP, PAF. Bias (solid line) and limits of agreement (dashed line) are shown for each variable. The mean score is plotted on the x-axis, while the difference between the two methods is plotted on the y-axis (mean difference ± 1.96 SD). AAF, anterior annulus fibrosus; NP, nucleus pulposus; PAF, posterior annulus fibrosus

Fig. 4
figure 4

Scatter plots of the values in AAF, NP, and PAF according to the Pfirrmann grades. a, c, and e are respectively UTE-T2* relaxation time of AAF, NP, and PAF correlated with disc degeneration grading; b, d, and f are respectively T2* value of AAF, NP, and PAF correlated with disc degeneration grading

Post hoc multiple comparisons among each Pfirrmann grades

There were significant differences in UTE-T2* values of NP and PAF between each Pfirrmann grade. T2* values were found to be significantly different between Pfirrmann grade II and grade III in AAF, NP, and PAF (Fig. 5).

Fig. 5
figure 5

Post hoc multiple comparisons among 4 Pfirrmann grades. There were significant differences in UTE-T2* values of NP and PAF between each Pfirrmann grades. T2* values were found to be significantly different between Pfirrmann grades II and III in AAF, NP, and PAF. *P values of < 0.05 and **P values of < 0.001 were considered statistically significant. NP, nucleus pulposus; PAF, posterior annulus fibrosus; UTE, ultrashort time-to-echo

Diagnostic performance of UTE-T2* and T2* values in distinguishing each degeneration groups

ROC curves of UTE-T2* and T2* values for distinguishing each degeneration groups are plotted in Fig. 6 The corresponding diagnostic test characteristics are provided in Table 3. The AUC values of UTE-T2* map** in AAF, NP, and PAF were 0.715,0.876,0.787, respectively, for identification of the early disc degeneration, and 0.726,0.893,0.804, respectively, for identification of the advanced disc degeneration. The AUC values of T2* map** in AAF, NP, and PAF were 0.620, 0.763, 0.670, respectively, for identification of the early disc degeneration, and 0.570, 0.842, 0.720, respectively, for identification of the advanced disc degeneration.

Fig. 6
figure 6

Receiver operating characteristic (ROC) analysis. Graphs show ROC curves to distinguish each degeneration groups. Numbers are areas under the curves with 95% confidence intervals in parentheses

Table 3 Diagnostic performance of UTE-T2* and T2* values in distinguishing degeneration groups

For pairwise comparisons of ROC curves, UTE-T2* values in NP and PAF were better in identifying early degeneration IVDs than those in T2*. There were no significant differences among UTE-T2* and T2* map** in AAF. Comparing between different segments, diagnostic performance of NP was the highest in predicting the early degeneration IVDs, AAF and PAF performed similarly. For differentiating early and advanced disc degeneration, the UTE-T2* value of PAF was better than that of T2* and there were no significant differences among UTE-T2* and T2* map** in AAF and NP. Comparing between different segments, diagnostic performance of NP was better than AAF, AAF and PAF performed similarly in predicting the advanced disc degeneration.

Overall, the diagnostic efficacy of UTE T2* map** was better than that of T2* map** for evaluating IVDD, especially in NP. The results of this study demonstrated that the UTE-T2* value in NP showed high correlations with Pfirrmann grade (r =  − 0.733; P < 0.001) and that AUCs for the assessment of the early disc degeneration (AUC 0.876) were significantly higher than those for T2*(AUC 0.763).

Discussion

This study is the first to investigate and compare the diagnostic efficacies of UTE-T2* and T2* map** in detecting IVDD in humans. The results may help to confirm the feasibility and specificity of UTE-T2* as an objective and quantitative tool to identify early degenerative changes of the disc and show promise for clinicians to modify the diagnostics and therapeutic management strategies more accurately.

Conventional MRI, such as the Pfirrmann scale with T2WI, was limited in detecting ultrastructural alterations of early IVDD. Early stages of disc degeneration include biochemical changes, such as a loss or reduction of PG content, which can ultimately lead to dehydration. T2 relaxation reflected the integrated environment of the IVD, including water, protein, collagen, and other solutes [31], and was sensitive to water content and the composition of the collagen network structure. Researchers have reported that T2* relaxation time showed a good correlation with PG and collagen contents in 18 humans cadaveric IVDs [32]. Our findings confirmed an increase in the quantitative T2* values between the AAF and NP and a decrease between the NP and the PAF. In line with an earlier report [33, 34],T2 and T2* map** provided roughly similar results. The inverse correlation of the T2 relaxation time in the disc with Pfirrmann grade has been reported by Welsch et al. [35] and Noebauer et al. [36], the early study reported a low-to-moderate correlation between Pfirrmann grades and T2 relaxation times, which were consistent with the Spearman's correlation coefficient between Pfirrmann grades and T2*value in our results. Both T2 and T2* map** differ in the biochemical sensitivity of disc tissue, with T2 map** being sensitive to tissue hydration, while T2* map** being more sensitive to changes in tissue integrity [35].T2* map** provide more valuable biochemical information on the IVDs ultrastructure, together with three-dimensional acquisition capability and higher spatial resolution in a short scan time [37].

UTE-T2* map** was acquired using different echo times in the short (1–10 ms) and ultrashort echo time range [19, 22]. Because UTE-T2* map** can catch the short T2* relaxations from tissues, it is more sensitive to biochemical collagen matrix changes compared with conventional MRI techniques, based on histologic standards [19].

Multiple large general-population–based studies have proved that UTE-T2* map** can detect cartilage subsurface matrix changes, which can be indicative of reduced cartilage health from injury or early degeneration noninvasively [20, 23, 25]. Similar to the results reported by Detiger et al. [16], we observed a trend of decreasing UTE-T2* value with increasing degree of degeneration. The previous study revealed a significant correlation between T2* relaxation time and glycosaminoglycans (GAG) content in the nucleus pulposus, as well as histologic scoring with varying grades of degeneration [16]. During the aging process, the quantity and quality of PG and collagen contents diminish, along with a decrease in short T2* signal accordingly [21, 25, 34, 38]. T2* relaxometry appeared to be sensitive to water and PG contents. This may be the initial step in the degenerative cycle [25, 34, 38], which could be the underlying reasons for the decreased UTE-T2* and T2* value.

In NP and PAF, UTE-T2* map** showed significantly higher diagnostic accuracy in differentiating early disc degeneration from normal than did T2*. Theoretically, both UTE-T2* and T2* map** measured the T2* value of the tissue. UTE MRI technique mitigates the rapid signal loss from short T2* by reducing the TE to the scale of 0–200 microseconds to sample the free induction decay as early as possible. With considerably shorter TEs (0.032 ms in this study) than T2*, UTE-T2*map** allows signals from very short T2 components to be detected [23]. Thus, UTE-T2*map** is less sensitive to the magic angle effect and more sensitive to water protons and their local environment, making it a satisfactory method for evaluating disc generation. Because T2* relaxation time has been reported to reflect both the water content and PG content reduction [16], it is not hard to understand why the UTE-T2* value has better diagnostic accuracy than T2* for differentiating early disc degeneration.

Previous studies have reported that T2 relaxation time of Pfirrmann grades IV is significantly shorter than Pfirrmann grades III, and no significant differences were found between Pfirrmann grades IV and V, both of which show extremely low signal intensity [39]. The results of our study showed that compared to T2* values, UTE-T2* values conveyed significantly higher diagnostic performance in distinguishing the early disc degeneration from the advanced in PAF. Takashima et al. [40] reported that short T2* relaxation times with UTE are promising for assessing progressive IVD degeneration with poor water content, such as fibrosis change of IVDD with short T2 relaxation time. Our results are consistent with those findings quantitatively. Takashima et al. did not further discuss the quantitative evaluation of the early disc degeneration because their study population did not include grade I IVDs. Our results would appear to complement and refine their research.

A previous report on healthy ovine IVDs demonstrated that the T2 values show regional variation in discs and reported that high T2 values were observed in NP and low T2 values in the AAF and PAF when histologically evaluated [41]. Disc degeneration is believed to originates in NP with depletion of GAG, followed by a reduction in water content [5, 42]. Similar to previous reports, our study showed that correlations with Pfirrmann grade and UTE-T2* and T2* values were highest in NP and lowest in AAF. Our results also showed that NP had the highest diagnostic accuracy in predicting the early degeneration IVDs, while AFP and PFP were similar in predicting the early degeneration IVDs. These results suggest that the destruction of hydrophilic GAGs within NP was the main cause of the accumulation of cleaved extracellular matrix fragments with disc aging [16].

There were some limitations in this study. First, our study had no detailed histologic confirmation associated with IVDD changes. This is hard to achieve in humans. In addition, the relationship between T2* values and biochemical changes in IVDD has been previously established in human cadaveric lumbar discs [32]. Second, we were unable to compare the related clinical symptoms with the degree of degeneration in MRI quantitative parameters. Future research is warranted to explore these ideas. Third, no patient in this study showed grade V IVDs because grade V IVDs tend to have a collapsed disc space or a vacuum phenomenon, which is unable to measure the quantitative values due to susceptibility artifacts. If the complete grade V IVDs data set were available, quantitative evaluation of advanced IVD degeneration could be closer to reality. However, as our principal purpose was to detect early disc degeneration, the impact of incomplete grade V IVDs dataset on our results is within acceptable limits.

Conclusions

We demonstrated that UTE-T2*map** was more accurate than T2* map** in quantitatively diagnosing early intervertebral disc degeneration. In particular, UTE-T2* map** allowed for precisely distinguishing disc degeneration, potentially providing a promising imaging biomarker with potential applications in intervertebral disc degeneration for the emerging cell-based therapies.