Background

Glioblastoma multiforme (GBM) and solitary brain metastases (SBM) are the most common malignant brain tumors, and their correct identification is key for further diagnosis and treatment [1,2,3]. Although magnetic resonance imaging (MRI) is the main tool for differentiating between the two types of tumors, both GBM and SBM may show marked peritumoral edema and similar contrast-enhancement patterns on routine MRI, leading to great challenges in identification [4,5,6].

Previous studies reported that radiomics combined with routine MRI showed significant advantages in distinguishing GBM from SBM and suggested that specific imaging features are helpful in distinguishing between the two types of tumors [7, 8]. Currently, the acquisition of specific image features can be summarized into two trends: applying special MRI modalities or focusing on specific image feature types [9, 10].

Diffusion-weighted imaging (DWI) can provide a class of microscopic features related to the movement of water molecules in tissues, such as the current advanced diffusion imaging model and neurite orientation dispersion and density imaging (NODDI) [11, 12]. NODDI is a multi-spherical shell diffusion model based on the difference in the diffusion of water molecules inside and outside the cell and is more often used to characterize the difference in water diffusion between tumor infiltration and vasogenic edema [Full size image

Radiomic texture extraction and model construction

Feature extraction, feature selection, and model building were performed using the open-source software FeAture Explorer (FAE, version 0.5.2) [23]. Based on the automatically segmented ROIs, radiomic texture features were extracted using first-order statistical functions, gray-level co-occurrence matrix (GLCM) functions, and gray-level run-length matrix (GLRLM) functions on the original NODDI parametric maps, as well as eight sub-bands of its wavelet transformation. As controls, the features of routine MR images (T2WI, T2-dark-fluid, T1WI, and CE-T1 MPRAGE) were extracted in the same manner. Radiomic texture analysis for each ROI was based on a combination of three parametric map features from the NODDI or a combination of four routine MRI features. Finally, 234 features were extracted from each parameter map of NODDI (or each type of routine MRI). Details of the extracted features are presented in Supplementary Appendix E2.

Because of the imbalanced GBM-to-SBM sample ratio (2.5:1), we applied upsampling to the training dataset. After feature extraction, all radiomic texture feature values were normalized using the min-max or Z-score method. Four feature selection methods—Pearson’s correlation coefficient (PCC), analysis of variance, recursive feature elimination, and the Kruskal-Wallis test—as well as eight classifiers—support vector machine, linear discriminant analysis, auto-encoder, random forest, logistic regression, logistic regression via Lasso, ada-boost, and decision tree—were utilized to construct texture feature prediction models for each ROI. When the PCC value of a feature pair was greater than 0.90, only one of the features was randomly retained. Five-fold cross-validation was used to determine the hyperparameters of each model. After determining the hyperparameters, all training data were retrained for the final models. The maximum number of features included in the radiomic texture analysis model construction was four. Details of the sample size and feature number estimates are displayed in Supplementary Appendix E3. The final models were determined based on the highest area under the receiver operating characteristic (ROC) curve (AUC) value in the cross-validation, and a time-independent test dataset was used to evaluate the performance of the final model. The performance of the test dataset was determined through ROC curve analysis and evaluations of accuracy, AUC, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV).

Statistical analysis

Statistical analyses were performed using SPSS (version 21.0) and MedCalc (version 20.015) software. Differences in clinical characteristics between GBM and SBM were assessed using chi-square tests (or the Mann–Whitney U test, depending on the results of normality and homoscedasticity tests) and independent t-tests, as appropriate. DeLong’s test was used to assess differences in AUC values between models. Statistical significance was set at a two-sided p value < 0.05.