Introduction

Solitary pulmonary lesions (SPLs) encompass single well-defined solid or sub-solid lung lesions surrounded by normal lung tissue and lack signs such as atelectasis or significant pleural effusion [1, 2]. In recent years, due to increasing environmental pollution, the prevalence of tobacco usage, and the increased awareness of health check-ups, the detection of SPLs has been on the rise [3]. Although most SPLs are eventually determined to be benign, early differentiation between benign and malignant lesions remains crucial for effective patient management [4]. For instance, accurate identification and prompt resection of malignant SPLs can improve the 5-year survival rate of patients with non-small cell lung cancer [5]. Similarly, accurately identifying benign SPLs before treatment could avoid unnecessary interventions, optimise the allocation of healthcare resources, and alleviate patient suffering [6]. Needle biopsy is currently an accepted method for distinguishing between benign and malignant SPLs prior to treatment. However, its limitations, such as small sample size and invasiveness, not only hinder its ability to accurately represent the characteristics of SPLs but also pose challenges for patients in poor physical condition or with lesions near vital structures such as the heart or large blood vessels [7, 8]. Therefore, finding a non-invasive method that can differentiate between benign and malignant SPLs prior to treatment holds importance for the patients involved.

In clinical practice, computed tomography (CT) is commonly used as the primary modality for evaluating patients with SPLs. However, it has inherent limitations, including exposure to ionising radiation and reliance solely on morphological criteria [4]. One of the most valuable diagnostic and evaluation tools for oncology is 18Fluorine-fluorodeoxyglucose positron emission tomography/magnetic resonance imaging (18F-FDG PET/MRI). This imaging technique not only provides information about glucose metabolism through 18F-FDG PET imaging but also enables the simultaneous acquisition of multiple quantitative MRI sequences during PET imaging, offering a more comprehensive assessment for clinical decision-making in these patients [9]. Amide proton transfer-weighted imaging (APTWI), mono-exponential model diffusion-weighted imaging (MEM-DWI), and stretched exponential model DWI (SEM-DWI) are quantitative MRI imaging sequences. APTWI allows the evaluation of mobile protein and peptide content in biological tissues without using exogenous contrast agents [10], while MEM-DWI and SEM-DWI provide insights into water molecule diffusion and tissue heterogeneity within the body [11]. Currently, 18F-FDG PET and MEM-DWI have shown promising results in differentiating between benign and malignant lung lesions and are widely used [12]. However, APTWI and SEM-DWI are still in the early stages of research regarding the differentiation between benign and malignant lung lesions, with limited studies available, often characterised by small sample sizes and the exclusion of SEM-DWI [13, 14]. Moreover, to the best of our knowledge, no systematic comparison has been made to assess the diagnostic performance of 18F-FDG PET, APTWI, MEM-DWI, and SEM-DWI in differentiating malignant from benign SPLs, and there is a lack of guidance on which parameters should be selected to help establish a clinical diagnosis.

This study aimed to use a hybrid 18F-FDG PET/MRI scanner to perform simultaneous chest 18F-FDG PET, APTWI, MEM-DWI, and SEM-DWI scans in patients with SPLs to compare the differences in each quantitative/semi-quantitative parameter between the benign and malignant groups, identify independent predictors, and establish a prediction model and validate it. The ultimate goal is to offer a novel reference for the clinical management of these patients.

Materials and methods

Patients

This study was approved by the ethics review committee at our hospital, and written informed consent was obtained from all patients. From August 2020 to April 2022, a series of 165 patients diagnosed with SPLs on CT underwent chest 18F-FDG PET/MRI. The following patients were excluded from the analysis: (i) those unable to complete all imaging sequences due to claustrophobia or other physical symptoms (n = 9); (ii) patients who had previously undergone radiotherapy, chemotherapy, or surgery prior to the 18F-FDG PET/MRI scan (n = 11); (iii) patients with poor image quality for 18F-FDG PET, APTWI, MEM-DWI, or SEM-DWI, making analysis challenging (n = 10); (iv) patients with missing clinical or histopathological information (n = 15). A total of 120 patients with SPLs were enrolled, and information on age, sex, smoking history, and maximum diameter of the lesion was collected. The study flow is presented in Fig. 1.

Fig. 1
figure 1

Flow diagram of the patient selection process

Image acquisition

A chest scan was conducted using a hybrid 3.0 T PET/MRI system (uPMR 790, United Imaging, Shanghai, China) equipped with a 12-channel phased-array body coil. The 18F-FDG used in this study was produced by FracerLab FX-FDG (GE Minitrac) with a purity of > 95% and a pH of 4.5–8.5. All patients fasted for a minimum of 6 h before the scan to ensure that their serum glucose levels were < 6.5 mmol/L while injecting 18F-FDG (0.11 mCi/kg). The PET scan began 60 min after administering 18F-FDG and lasted for 27 min. All patients were placed in the supine position headfirst and were scanned from the upper thoracic inlet to the lower lung margin. All patients underwent breathing training before scanning in order to maintain smooth breathing during scanning and reduce image artefacts. A breathing strap was attached to monitor respiration by attaching it to the patient’s abdomen. Magnetic resonance-based attenuation correction was performed using a three-dimensional T1-weighted spoiled gradient-echo sequence with Dixon-based water-fat separation imaging. This technique enabled the segmentation of corrected images into soft tissue, fat, lung, and air compartments [15, 16]. The PET images were reconstructed using ordered subsets expectation maximisation with two iterations, 20 subsets, and a voxel size of 2.6 × 2.6 × 2.0 mm3. Simultaneously, during the 18F-FDG PET scan, axial T1-weighted imaging, T2-weighted imaging (T2WI), SEM-DWI, and APTWI were sequentially performed. The APTWI parameters were as follows: B1 values of 1.3 μT and 2.5 μT, ETL of 39, Gaussian pulse, 10 repeats, 100 ms duration, with an additional S0 image without chemical exchange saturation transfer (CEST) saturation pulse for normalisation, and Δ ranging from -4.5 to 4.5 ppm in 31 steps. Additionally, 11 low-power B1 images (B1 = 0.13 μT) were acquired with Δ ranging from -1.0 to 1.0 ppm for wide-angle staring synthetic aperture radar images used in B0 map correction. Table 1 provides a comprehensive description of the protocol details.

Table 1 Details of scanning protocol

Parameter generation

All images were uploaded to the post-processing Workstation (uWS-MR005, United Imaging, Shanghai, China) for motion correction and analysis. Fused PET/MRI software was used to automatically extract the volume of interest (VOI), determine the maximum standardised uptake value (SUVmax), and quantify the metabolic tumour volume (MTV) and total lesion glycolysis (TLG) using a 40% SUVmax threshold [17]. Advanced analysis toolkit software, specifically diffusion analysis and CEST software, were used to process the MEM-DWI, SWM-DWI, and APTWI data. The parameters for MEM-DWI and SEM-DWI were calculated using Eqs. 1 and 2:

$${\mathrm S}_{\mathrm b}={\mathrm S}_0\,\exp\left(-\mathrm b\cdot\mathrm{ADC}\right)$$
(1)
$${S}_{b}/{S}_{0}=exp \left[{\left(-b\times DDC\right)}^{a}\right]$$
(2)

where b represents the diffusion sensitising factor, S0, and Sb represent the signal intensities (SIs) at a b-value of 0 or the b-value indicated by the subscript, respectively. ADC, DDC, and α represent the standard apparent diffusion coefficient, distributed diffusion coefficient, and water molecular diffusion heterogeneity index, respectively [11, 18]. The APTWI parameter was derived from the following formula:

$${\text{MTRasym}}\left(3.5{\text{ppm}}\right)=\left[{S}_{sat}\left(-3.5ppm\right)-{S}_{sat}\left(+3.5ppm\right)\right]/{S}_{0}$$
(3)

where S0 and Ssat were the SIs obtained without and with selective saturation, respectively, and MTRasym (3.5 ppm) was the magnetisation transfer ratio asymmetry at 3.5 ppm downfield from the water signal [10]. The regions of interest (ROIs) were manually drawn within the tumour margin layer by layer on the axial T2WI images with reference to the PET/MR fusion image. Areas with cystic degeneration, necrosis, apparent signs and haemorrhage artefacts, and blood vessels were avoided. Subsequently, all completed ROIs were copied to the pseudo colour maps of the MEM-DWI-, SWM-DWI-, and APTWI-derived parameters to calculate the mean values based on the VOI. An attending radiologist and an associate chief radiologist who had 8 and 15 years of experience, respectively, independently performed the above procedures. Both the radiologists were blinded to each other's results and the patient's clinicopathological data.

Histopathologic evaluation

Within 2 weeks after 18F-FDG PET/MRI, surgical resection or biopsy were performed to obtain specimens of all malignant SPLs and 30 benign SPLs. These specimens were sent to our pathology centre for histological analysis [19]. The remaining 11 cases of benign SPLs were followed up for 5–20 weeks to obtain a final diagnosis.

Statistical analysis

All data were analysed using R (version 3.5.3; R Foundation, Auckland, Zealand) and SPSS (version 15.0; MedCalc Software, Ostend, Belgium). Interobserver consistency for the 18F-FDG PET, MEM-DWI, SEM-DWI, and APTWI parameters was assessed using the interclass correlation coefficient (ICC), with interpretations as follows: < 0.40 for poor consistency, 0.40–0.60 for fair consistency, 0.60–0.75 for good consistency, and > 0.75 for excellent consistency [20]. Categorical variables are presented as counts and percentages. Continuous variables are presented as the median and upper and lower quartiles if non-normally distributed and as the mean ± standard deviation if normally distributed. The Mann–Whitney U test, independent samples t-test, and chi-square test were used to compare different variables between the benign and malignant groups. The diagnostic efficacy was evaluated using the area under the receiver operating characteristic curve (AUC), and differences in AUCs were assessed using the DeLong test. The logistic regression (LR) analysis (forward LR method) was used to identify independent predictors and establish a prediction model. Calibration curves and decision curve analysis (DCA) were used for evaluating the prediction model. Statistical significance was set at P < 0.05.

Results

Basic information

A total of 79 malignant SPLs (11 small cell lung cancer, 16 squamous cell carcinoma, and 52 adenocarcinoma cases) and 41 benign SPLs (eight common inflammation, eight mechanical pneumonia, six tuberculosis, three hamartoma, eight fungal infection, five lung abscess, and three inflammatory pseudotumour cases) were enrolled in this study. Based on the principle of randomisation, 70% of patients in the benign and malignant groups were selected to form a training set (n = 84), while the remaining 30% of patients formed a test set (n = 36). The patients’ clinical characteristics are summarised in Table 2.

Table 2 Summary of characteristics in training and testing sets

Consistency test

The SUVmax, MTV, TLG, ADC, DDC, α, and MTRasym(3.5 ppm) values measured by the 2 radiologists had excellent consistency. The ICC were 0.969 (95% CI: 0.957 ~ 0.979), 0.988 (95% CI: 0.983 ~ 0.992), 0.968 (95% CI: 0.954 ~ 0.978), 0.910 (95% CI: 0.871 ~ 0.938), 0.940 (95% CI: 0.915 ~ 0.958), 0.932 (95% CI: 0.902 ~ 0.953), and 0.897 (95% CI: 0.852 ~ 0.928), respectively. The average results were used for the ultimate analysis.

Parameter comparison

SUVmax, MTV, TLG, α, and MTRasym(3.5 ppm) values were significantly lower and ADC, DDC values were significantly higher in benign SPL than malignant SPL (P < 0.001, < 0.001, < 0.001, = 0.004, < 0.001, < 0.001, and < 0.001, respectively, Table 3, Figs. 2 and 3).

Table 3 Comparison of different characteristics between benign and malignant group in the training set
Fig. 2
figure 2

A 35-year-old woman with benign SPLs in the lower lobe of the left lung (arrowheads, size 15 mm × 25 mm × 21 mm, fibrous tissue hyperplasia with chronic inflammation). a Map of T2WI; b Map of 18F-FDG PET; (c) Map of MEM-DWI (b = 600 s/mm.2); (d) Pseudo colored map of ADC; (e) Pseudo colored map of DDC; (f) Pseudo colored map of α; (g) Map of APTWI; (h) Pseudo colored map of MTRasym(3.5 ppm); and (i) Pathological images (H&E staining,100 ×)

Fig. 3
figure 3

A 35-year-old man with malignant SPLs in the high lobe of the left lung (arrowheads, size 50 mm × 50 mm × 40 mm, mucinous adenocarcinoma). a Map of T2WI; (b) Map of 18F-FDG PET; (c) Map of MEM-DWI (b = 600 s/mm.2); (d) Pseudo colored map of ADC; (e) Pseudo colored map of DDC; (f) Pseudo colored map of α; (g) Map of APTWI; (h) Pseudo colored map of MTRasym(3.5 ppm); and (i) Pathological images (H&E staining,100 ×)

Regression analyses

The potential risk-related factors such as age, maximum diameter, sex, smoking, SUVmax, MTV, TLG, ADC, DDC, α, and MTRasym(3.5 ppm) were all enrolled in regression analysis. Univariate analysis demonstrated that maximum diameter, SUVmax, MTV, TLG, ADC, DDC, α, and MTRasym(3.5 ppm) were all risk predictors (P all < 0.05), while multivariate analysis showed that only SUVmax, ADC, and MTRasym(3.5 ppm) were independent predictors (P = 0.001, 0.001, and 0.024, respectively, Table 4).

Table 4 Univariate and multivariate analyses

Diagnostic performance

Within the training set, the prediction model based on these independent predictors demonstrated optimal diagnostic efficacy (AUC, 0.976; sensitivity, 94.64%; specificity, 92.86%), surpassing ADC, SUVmax, TLG, DDC, α, MTV, and MTRasym (3.5 ppm) with statistical significance (AUC = 0.888, 0.853, 0.809, 0.786, 0.786, 0.768, and 0.691, Z = 2.761, 3.080, 3.653, 3.844, 3.773, 4.030, and 4.938, respectively, P = 0.006, 0.002, < 0.001, < 0.001, < 0.001, < 0.001, and < 0.001, respectively). Similarly, within the test set, the prediction model exhibited optimal diagnostic efficacy (AUC, 0.957; sensitivity, 91.30%; specificity, 92.31%, Table 5, Fig. 4), outperforming TLG, MTV, α, ADC, MTRasym (3.5 ppm), and DDC with statistical significance (AUC = 0.833, 0.793, 0.776, 0.756, 0.732, and 0.729, Z = 2.317, 2.337, 2.514, 2.263, 2.562, and 2.697, respectively, P = 0.021, 0.020, 0.012, 0.024, 0.010, and 0.007, respectively).

Table 5 Predictive performance for identifying benign and malignant SPL
Fig. 4
figure 4

The area under receiver-operator characteristic (ROC) curves of different parameters and the prediction model. a, b ROC curve of the training set. c, d ROC curve of the test set

Validation

The calibration curves and DCA revealed that the prediction model exhibited not only good consistency but also provided greater clinical benefits to the relevant patients compared with any single parameter, as evident in the training and test sets (Fig. 5).

Fig. 5
figure 5

Calibration curves and decision curve analysis (DCA) curves. a, b Calibration curve and DCA of the training set. c, d Calibration curve curve and DCA of the test set

Discussion

Currently, 18F-FDG PET is widely employed as a molecular imaging technique in clinical practice. It provides valuable information regarding the metabolism of the target tissue, with derived parameters such as SUVmax, TLG, and MTV commonly used to quantify glucose metabolism. SUVmax primarily reflects the highest levels of glucose metabolism within the target region, while TLG and MTV reflect the overall glucose metabolism in the target region [21]. Erdoğan et al. comprehensively evaluated 18F-FDG PET data from 113 patients with lung lesions and observed that SUVmax, TLG, and MTV all played a positive role in distinguishing between benign and malignant lung lesions [22]. In a meta-analysis conducted by Li et al., the role of 18F-FDG PET in evaluating lung lesions was assessed. The findings demonstrated that although there was some overlap in 18F-FDG uptake between benign and malignant lesions, 18F-FDG PET remained a reliable diagnostic tool [23]. In this study, malignant SPLs exhibited elevated metabolic activity and higher 18F-FDG transport and uptake compared with benign SPLs. Consequently, there were significant increases in SUVmax, MTV, and TLG in the malignant group. Furthermore, the LR analysis confirmed that SUVmax served as an independent predictor for discriminating between benign and malignant SPLs, which is consistent with the above findings, further suggesting that 18F-FDG PET could help differentiate between benign and malignant SPLs.

The MEM-DWI represents the first DWI method to be used in clinical practice and assumes a uniform Gaussian distribution for the diffusion motion of water molecules in biological tissues. It uses the quantitative parameter ADC to capture variations in water molecule diffusion motion [24]. The value of ADC has been extensively investigated for its potential in differentiating between benign and malignant lung lesions. Numerous studies have consistently reported significantly lower ADC values in malignant lesions compared with benign lesions, primarily attributed to the increased cell proliferation, dense tissue structure, and more restricted diffusion of water molecules within the malignant lesions [25]. In our study, the ADC values in malignant SPLs were significantly lower compared with those in benign SPLs. Furthermore, ADC emerged as an independent predictor for differentiating between benign and malignant SPLs, which is consistent with the previous studies and further demonstrates the role of MEM-DWI in differentiating between benign and malignant SPLs.

In contrast to MEM-DWI, SEM-DWI assumes that the movement of water molecules in biological tissues occurs within a non-homogeneous environment. This approach yields two quantitative parameters, namely DDC, which reflects the distributed diffusion of water molecules, and α, which reflects tissue heterogeneity [11]. Similar to ADC, DDC values are primarily influenced by the tightness of the tissue structure, In this study, patients with malignant SPLs, characterised by increased cell proliferation and tighter structure, exhibited significantly lower DDC values compared to those with benign SPLs. On the other hand, the magnitude of α is closely associated with tissue heterogeneity. Previous studies on endometrial carcinoma [26], breast cancer [27], and renal cancer [28] have demonstrated that more malignant lesions generally exhibit significant tissue heterogeneity due to factors such as necrosis, haemorrhage, and cellular heterogeneity, resulting in reduced α values. Surprisingly, the findings of this study revealed an opposite trend, wherein the α value was increased in malignant SPLs compared with benign SPLs. It is speculated that this discrepancy may be attributed to the specific structure of the lung tissue. In benign SPLs, although there is relatively less necrosis, haemorrhage, and cellular heterogeneity, it contains more normal lung tissues such as alveoli and fine bronchi. These elements could contribute to increased tissue heterogeneity to some extent. Conversely, in malignant SPLs, despite the presence of more pronounced necrosis, haemorrhage, and cellular heterogeneity, tissues such as alveoli and fine bronchi are often replaced by cancer cells, thereby reducing the heterogeneity [29]. Nevertheless, the accuracy of this inference should be validated, given the limited application of SEM-DWI in lung lesions.

APTWI, a quantitative MRI sequence based on CEST, uses proton exchange to transfer variations in tissue-mobile protein/peptide concentrations to water molecules. It calculates the quantitative parameter MTRasym (3.5 ppm) to reflect changes in mobile protein and peptide concentrations within biological tissues [10]. This study aimed to compare the differences in MTRasym (3.5 ppm) values between patients with benign and malignant SPLs. The results indicated that patients in the malignant SPL group exhibited significantly higher MTRasym (3.5 ppm) values than those in the benign SPLs group, consistent with the findings of Ohno et al. [13, 14]. These findings further suggest the potential of APTWI in aiding the qualitative diagnosis of SPLs. One possible explanation for this outcome is that patients with benign SPLs tend to have more robust cell proliferation and a higher presence of necrotic and haemorrhagic components. Consequently, there is an increased concentration of mobile protein/peptide within the tissue, resulting in increased MTRasym (3.5 ppm) values [30,31,32].

Due to the inherent tissue heterogeneity in tumourous lesions, relying on a single parameter for a comprehensive and accurate assessment can be challenging [33]. Previous studies have highlighted the advantages of using multi-parameter combination diagnosis based on multiple quantitative or semi-quantitative parameters, particularly multiple independent predictors, in improving the diagnosis and evaluation of tumours compared with relying on single parameters [34, 35]. Accordingly, the study employed univariate and multivariate LR analysis to identify independent predictors (SUVmax, MTRasym [3.5 ppm], and ADC) for differentiating between benign and malignant SPLs among various clinical factors and quantitative and semi-quantitative parameters. Based on these predictors, a corresponding prediction model was developed. The results demonstrated that the prediction model not only exhibited varying degrees of improved diagnostic efficacy compared with individual parameters but also provided reliable benefits to the patients, as observed in both the training and test sets. These findings suggest that the combination of multiple parameters might more and accurately reflect the characteristics of the lesion, emphasising the importance of using as many imaging methods as possible to assess patients whenever feasible.

While this study yielded encouraging results, it is important to acknowledge several limitations. First, the study was conducted at a single institution, and although a training and test set were employed, the sample size remained relatively small. Additionally, the external validation of these findings across multiple institutions was not performed, which might affect the reliability of the results to some extent. Second, the exclusion of some microscopic lesions, particularly benign SPLs, from 18F-FDG PET/MRI scans due to poor display could limit the applicability of the study. Third, respiratory and cardiovascular pulsation artefacts in the lungs can be significant, despite attempts to mitigate their effects through various techniques. These artefacts might still influence the stability of the various quantitative/semi-quantitative parameters. Fourthly, the present study has not explored the value of APTWI, SEM-DWI, and 18F-FDG PET in the assessment of different histopathological features of SPL such as Ki-67, grade, etc., which may have led to an inadequate study. Future studies should aim to expand the sample size, conduct multi-center studies, asses a wider range of histopathological features, and explore technologies that can reduce artefacts, such as cardiovascular gating, rapid scanning and electrocardiogram triggering, to enhance imaging quality and obtain more stable and reliable experimental results.

Conclusion

Multiparametric PET/MRI based on 18 F-FDG PET, MEM-DWI, SEM-DWI, and APTWI can effectively evaluate the characteristics of SPLs. The prediction model comprising SUVmax, ADC, and MTRasym (3.5 ppm) demonstrated superior diagnostic efficacy compared with individual parameters. It holds promise as a reliable imaging marker for differentiating between benign and malignant SPLs.