A deep learning model for brain segmentation across pediatric and adult populations

Simarro, Jaime; Meyer, Maria Ines; Van Eyndhoven, Simon; Phan, Thanh Vân; Billiet, Thibo; Sima, Diana M.; Ortibus, Els

doi:10.1038/s41598-024-61798-6

A deep learning model for brain segmentation across pediatric and adult populations

Article
Open access
Published: 22 May 2024

Volume 14, article number 11735, (2024)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

A deep learning model for brain segmentation across pediatric and adult populations

Download PDF

Jaime Simarro^1,2,
Maria Ines Meyer¹,
Simon Van Eyndhoven¹,
Thanh Vân Phan¹,
Thibo Billiet¹,
Diana M. Sima¹ &
…
Els Ortibus^2,3,4

2067 Accesses
2 Altmetric
Explore all metrics

Abstract

Automated quantification of brain tissues on MR images has greatly contributed to the diagnosis and follow-up of neurological pathologies across various life stages. However, existing solutions are specifically designed for certain age ranges, limiting their applicability in monitoring brain development from infancy to late adulthood. This retrospective study aims to develop and validate a brain segmentation model across pediatric and adult populations. First, we trained a deep learning model to segment tissues and brain structures using T1-weighted MR images from 390 patients (age range: 2–81 years) across four different datasets. Subsequently, the model was validated on a cohort of 280 patients from six distinct test datasets (age range: 4–90 years). In the initial experiment, the proposed deep learning-based pipeline, icobrain-dl, demonstrated segmentation accuracy comparable to both pediatric and adult-specific models across diverse age groups. Subsequently, we evaluated intra- and inter-scanner variability in measurements of various tissues and structures in both pediatric and adult populations computed by icobrain-dl. Results demonstrated significantly higher reproducibility compared to similar brain quantification tools, including childmetrix, FastSurfer, and the medical device icobrain v5.9 (p-value< 0.01). Finally, we explored the potential clinical applications of icobrain-dl measurements in diagnosing pediatric patients with Cerebral Visual Impairment and adult patients with Alzheimer’s Disease.

Deep learning-based, fully automated, pediatric brain segmentation

Article Open access 22 February 2024

An automatic and accurate deep learning-based neuroimaging pipeline for the neonatal brain

Article 08 March 2023

Effect of head motion-induced artefacts on the reliability of deep learning-based whole-brain segmentation

Article Open access 31 January 2022

Introduction

Neuroimaging techniques play a crucial role in advancing our understanding of the human brain, covering its structure, development, function, and pathologies¹. Magnetic Resonance Imaging (MRI) stands out as a non-invasive technology to obtain high-resolution, in vivo measurements of the human brain². Automated analysis of MR images contributes to the diagnosis of neurological pathologies across various life stages, from childhood (e.g., focal cortical dysplasia³) to late adulthood (e.g., Alzheimer’s disease⁴).

Quantitative assessment, exemplified by volumetric analysis, enhances the objectivity of brain interpretation compared to visual MRI scan inspection alone. Traditional techniques for brain MR image segmentation involve atlas-based methods and statistical models, such as FreeSurfer⁵, volBrain⁶, or the medical device software, icobrain v5.9^7,8. Nevertheless, recent progress in deep learning models, such as QuickNat⁹, AssemblyNet¹⁰, and FastSurfer¹¹, has demonstrated superior performance compared to traditional methodologies, as evidenced in a recent review¹².

Despite the growing role of quantitative analysis tools, additional technical and clinical validation is required⁴. Notably, there is a lack of validated models for robust and reliable brain quantification in multi-scanner settings, common in clinical data. Additionally, recent algorithms, including deep learning methods, are usually developed and validated using adult datasets. However, standard MRI processing methods designed for adult images may not be suitable for pediatric datasets¹³. Pediatric brain analysis poses unique challenges such as reduced tissue contrast, within-tissue intensity heterogeneities, and smaller regions of interest^13,14. Consequently, pediatric brain analysis commonly employs specialized analysis tools like childmetrix¹⁵.

In pediatric studies, a common dilemma arises regarding the use of age-appropriate methods for different developmental stages or maintaining a consistent method across all ages¹⁶. While age-specific models are optimized for specific age ranges, their use introduces the risk of attributing age-related differences to methodological inconsistencies rather than genuine brain development or change. Particularly when monitoring patients across different transitional phases, such as from the pediatric stage through adolescence and into adulthood, there is a significant need for a general, consistent, and reliable method, eliminating reliance on multiple age-specific methods.

In this work, we develop and validate a brain segmentation pipeline across pediatric and adult populations, emphasizing the impact of heterogeneous and representative training data rather than the optimization of the deep learning architecture employed. The primary objective of this study is to explore whether a single deep learning model can be optimized to consistently quantify structural MRI across the lifespan, reflecting the distinctive neuroanatomy of each developmental stage. We hypothesize that a single deep learning model trained on datasets covering a wide age range will perform comparably to age-specific models within their respective age groups. The secondary objective is to validate the proposed pipeline’s performance in terms of reproducibility, diagnostic accuracy, and computational time. We hypothesize that the proposed deep learning-based pipeline will produce results comparable to established methods such as childmetrix, icobrain v5.9, and FastSurfer, while ensuring accurate and reproducible brain quantification across pediatric and adult populations.

Materials and methods

Datasets

Four separate datasets collectively containing 390 patients, aged between 2 and 81 years, were utilized for training. Validation was performed on a separate cohort of 280 patients from six distinct test datasets, covering an age range from 4 to 90 years. These datasets consisted of 757 T1-weighted MRI scans acquired from various manufacturers (Philips, Siemens, GE, Fujifilm) with different magnetic field strengths (1.5T/3T ∼ 32%/68%) across 21 scanners. The patients represented a diverse pathological conditions, including developmental disorders, cerebral visual impairment, depression, bipolar disorder, schizophrenia, multiple sclerosis, and Alzheimer’s disease. Table 1 presents a summary of the diverse datasets employed in this retrospective study. Further details about these datasets can be found in Appendix A.

Table 1 The datasets utilized for model training and validation consisted of both pediatric (denoted with suffix p) and adult (denoted with suffix a) data.

Full size table

Training dataset

The training dataset comprises a wide age range, pathologies and acquisition protocols. T1-weighted images were sourced from pediatric datasets, including the Healthy Brain Network (HBN, dataset 1.1.p)¹⁷ and the Calgary Preschool MRI (dataset 1.2.p)¹⁸. Additionally, T1-weighted images of adult patients were obtained from a research cohort (dataset 1.3.a) focused on the relations between very-late-onset schizophrenia-like psychosis, hippocampal volume, early adversity, and memory function¹⁹ as well as another cohort from clinical practice (dataset 1.4.a).

Segmentation accuracy testing dataset

Two publicly available manually annotated datasets were used to validate the segmentation accuracy: the Child and Adolescent NeuroDevelopment Initiative (CANDI, dataset 2.p)²⁰ and the MICCAI 2012 Grand Challenge and Workshop on Multi-Atlas Labeling (MICCAI2012, dataset 2.a)²¹. We excluded 5 images from the latter due to repeated scans of the same patient.

Reproducibility testing dataset

The reproducibility of the measurements was evaluated by analyzing two images from the same individual acquired with re-positioning within a very short time interval, ensuring no anatomical change between the two images (i.e., test and retest images). Two test-retest datasets were used to validate the reproducibility. The first dataset is a pediatric intra-scanner dataset obtained from Nathan Kline Institute (NKI, dataset 3.p)²², while the second dataset comprises 10 adult individuals who underwent two scans, using three different types of scanners (Re3T, dataset 3.a)⁷. Using repeated scans in multiple scanner types enables analysis for intra-scanner and inter-scanner validation.

Diagnostic performance testing dataset

The diagnostic performance is assessed using two separate datasets. The first dataset comprises pediatric patients suspected of suffering from Cerebral Visual Impairment (CVI) (dataset 4.p), approved by the local Ethical Committee of UZ Leuven, Belgium (S65276). All methods were carried out in accordance with relevant guidelines and regulations. Informed consent was obtained from all subjects or their legal guardians. Secondly, we used the Minimal Interval Resonance Imaging in Alzheimer’s Disease (MIRIAD, dataset 4.a), which includes both patients with Alzheimer’s Disease (AD) and healthy elderly individuals²³.

icobrain-dl pipeline: design and development

icobrain-dl is a pipeline for brain quantification. The pipeline processes a 3D T1-weighted MR image as input and undergoes three main steps: preprocessing, brain segmentation using a deep learning model, and brain quantification. The output includes brain segmentation masks for various regions of interest (ROIs) and brain volumes.

Pre-processing

Prior to training, the images underwent several fully automated pre-processing steps. Firstly, bias-field correction was performed using the N4 inhomogeneity correction algorithm as implemented in the Advanced Normalization Tools (ANTs) toolkit²⁴. In pediatric cases, an age-specific atlas is used to obtain the brain mask for N4 correction. Secondly, the images were affinely registered to MNI space using the $\texttt {reg\_aladin}$ algorithm in NiftyReg²⁵. To minimize the effect of outliers, intensities were clipped at the 1^st and 99^th percentile. Finally, the intensities were normalized using a variation on z-scoring, this function was computed over values above the 10^th percentile, with preference given to the median over the mean. The standard deviation was then computed within the 90^th percentile.

Simultaneous segmentation of brain tissue and structures via a multi-head deep learning model

The proposed deep learning model is designed to perform two tasks, brain tissue segmentation and brain structural segmentation, whose labels are not mutually exclusive.

Task 1: Tissue segmentation. This task involves the segmentation of brain tissues into four distinct classes: background (i.e., not brain tissue), white matter (WM), gray matter (GM), and cerebrospinal fluid (CSF).
Task 2: Structural segmentation. This task involves the segmentation of 22 anatomical brain structures and background. A detailed list of the structures is provided in Appendix B.

The architecture utilizes a 3D U-net backbone²⁶, incorporating two segmentation heads. Each of both outputs is a softmax array of $N_k$ probability maps, where $N_k$ is the number of classes being predicted in task k. Moreover, certain modifications were made to the original architecture, including substituting batch normalization with weight normalization²⁷, using leaky ReLU as the primary activation function, and using strided convolutions instead of max pooling²⁸. Figure 1 illustrates our final architecture, while detailed information including justification for the multi-task architecture can be accessed in Appendix C.

The model was trained using a weighted sum of the per-task losses, each comprising of a soft Dice loss ($L_{Dice}$) and a weighted categorical cross-entropy loss ($L_{\text {w}CE}$), as shown in Eq. (1).

$$\begin{aligned} \mathscr {L}_{total} = ~ \alpha _1\left( \mathscr {L}^{(1)}_{\text {w}CE} + \mathscr {L}^{(1)}_{Dice}\right) +~ \alpha _2\left( \mathscr {L}^{(2)}_{\text {w}CE} + \mathscr {L}^{(2)}_{Dice}\right) \end{aligned}$$

(1)

We set $\alpha _1=1$ (tissue segmentation) and $\alpha _2=10$ (structural segmentation).

The proposed model is trained on patches of $128\times 128\times 128$ voxels from T1-weighted MR images acquired without contrast agent injection. To augment the variability in the training set, ensuring that the range of intensities and tissue contrasts is similar with those observed in multi-center, multi-scanner cohorts, we applied intensity-based data augmentation as described in Meyer et al.²⁹. This technique uses Gaussian Mixture Modeling to change the intensity of the individual tissue components within an MR image while preserving structural information. We utilized the predefined default parameters of the public implementation of this code, available at https://github.com/icometrix/gmm-augmentation.

The model was implemented using Tensorflow 2.6 and employed He weight initialization. The training process was stopped upon detecting convergence of the validation loss. The validation set, which constituted a randomly selected 15% of the training dataset, was not utilized for optimizing the network weights. Adam optimizer was deployed with an initial learning rate of $\lambda = 0.001$.

Efficient generation of high-quality training labels

To address the challenge of obtaining manual annotations for large datasets, we created ‘silver’ ground truth, starting from the labels predicted by icobrain v5.9 on the training datasets. Subsequently, minor manual corrections were made where necessary.

Models training scheme

We trained three deep learning models with identical architecture, each using a different set of data for training:

The icobrain-dl model was trained on both pediatric and adult data, providing the most comprehensive training dataset (i.e., datasets 1.1.p, 1.2.p, 1.3.a and 1.4.a ).
The pediatric-specific model, termed icobrain-dl-p, was exclusively trained on pediatric datasets (i.e., datasets 1.1.p and 1.2.p).
The adult-specific model, termed icobrain-dl-a, was solely trained on adult datasets (i.e., datasets 1.3.a and 1.4.a ).

Validating technical and diagnostic performance

Two sets of experiments were conducted to validate both technical and diagnostic performance, with a focus on segmentation accuracy, intra- and inter-scanner variability, and computational time.

Segmentation accuracy was evaluated through the Dice similarity coefficient (DSC) and Hausdorff distance (HD)³⁰. DSC is a metric quantifying the overlap between two segmentation masks, with values ranging from 0 (indicating no overlap) to 100 (indicating perfect agreement). The HD measures the maximal contour distance (in millimeters) between the two masks. A smaller HD indicates greater similarity between the masks. To address the high sensitivity of the HD to outliers³¹, we considered the 95th percentile of the HD, denoted as HD95. In the initial experiment, DSC and HD95 calculations were performed between ground truth segmentations and both icobrain-dl and the age-specific models (icobrain-dl-p or icobrain-dl-a). Subsequently, DSC and HD95 values were computed between the icobrain-dl model and the age-specific models on datasets 2.p and 2.a.

The reproducibility of icobrain-dl was assessed by comparing it with established non-deep learning algorithms, specifically the pediatric-focused childmetrix¹⁵ and the clinically-used adult-focused medical device software icobrain v5.9, referred to as icobrain-nondl^7,8. Additionally, the state-of-the-art deep learning model FastSurfer¹¹ was included. Test-retest relative differences were computed with respect to the mean volumes across methods (dataset 3.p and 3.a), and the Wilcoxon signed-rank test was employed to identify significant differences between methods at levels of 0.01 and 0.001.

The validation of diagnostic performance serves as a proof of concept for the clinical application of the segmentation algorithm. To demonstrate the icobrain-dl’s applicability across both pediatric and adult populations, two pathologies with distinct volumetric patterns were selected. In the first experiment, the objective was to differentiate patients with CVI from those without CVI using the whole brain white matter volume (dataset 4.p), motivated by the known association between periventricular white matter damage and CVI³². The second experiment aimed to distinguish patients with AD from cognitively healthy individuals using temporal lobe cortical gray matter volume (dataset 4.a). Previous research has established the reliability of this region in discerning between AD patients and healthy controls⁸. Volumes from the different pipelines were normalized for head size employing the determinant of the affine transformation to the MNI atlas as a scaling factor. Head size-normalized volumes of the regions of interest (i.e., whole brain white matter and temporal lobe cortical gray matter) were used to distinguish pathology and non-pathology. Model comparisons were conducted using the area under the receiver operating characteristic curve (AUC) and the DeLong test, with a significance level of 0.05³³. The assessment of accuracy, specificity, and sensitivity metrics was based on the maximum value of the Youden index.

Results

Accuracy

On the pediatric dataset 2.p, the deep learning models icobrain-dl and icobrain-dl-p exhibited comparable performance in accurately segmenting brain structures, achieving an average DSC of 82.2% and 80.8%, respectively. Their average HD95 were 3.26mm and 3.23mm. Additionally, there was a high overlap between the segmentations of icobrain-dl and the pediatric-oriented icobrain-dl-p, with an average DSC of 87.4% and HD95 1.76mm. Similar results were observed in the adult dataset 2.a, where icobrain-dl achieved an average DSC of 82.6% and HD95 of 2.27mm when compared to manual segmentations. For icobrain-dl-a, the metrics were 81.9% and 2.37mm, respectively. The average DSC between both segmentation models was 92.4% with an average HD95 of 1.02mm. Table 2 and Table 3 display the DSC and HD95 between manual ground truth segmentations and segmentations calculated by the three deep learning models.

These findings suggest that icobrain-dl is as effective as the age-specific models in accurately segmenting brain structures in both pediatric and adult populations.

Table 2 icobrain-dl consistently achieves high overlap in segmenting different brain structures across subject age ranges, while only minimally sacrificing accuracy and sometimes even outperforming models that are tailored for specific age ranges (icobrain-dl-p for pediatric data and icobrain-dl-a for adult data).

Full size table

Table 3 Summary of the Hausdorff distance 95th percentile (HD95) between ground truth (GT) and icobrain-dl or the age-specific models, and between the age-specific models and icobrain-dl.

Full size table

Reproducibility

The segmentations generated by icobrain-dl systematically had lower test-retest volume differences for the pediatric intra-scanner setting (dataset 3.p) than childmetrix and FastSurfer, as illustrated in Figure 2. For most structures, these test-retest differences from icobrain-dl were significantly lower than the comparable methods ($p < 0.01$).

A similar pattern of lower test-retest volume differences provided by icobrain-dl was observed in adults (dataset 3.a) for intra-scanner and inter-scanner settings (see Figure 3 and 4). Specifically, in the inter-scanner setting, icobrain-dl outperformed icobrain-nondl and FastSurfer, except in the right white matter and left cortical gray matter. Notably, icobrain-dl produced significantly lower inter-scanner test-retest errors ($p < 0.01$) across all substructures, including the caudate nucleus, hippocampus, globus pallidus, putamen, and thalamus.

Diagnostic performance

The performance of icobrain-dl in detecting pediatric patients with CVI surpassed childmetrix (AUC of 0.48) and FastSurfer (AUC of 0.60), with an AUC of 0.69, as shown in Table 4. There was no statistically significant difference between icobrain-dl and FastSurfer in terms of AUC. Nevertheless, icobrain-dl exhibited significantly superior performance compared to childmetrix ($p < 0.05$).

Table 4 The proposed method has superior performance in detecting pediatric patients with Cerebral Visual Impairment (CVI) from those without CVI using the white matter volume normalized for head size (dataset 4.p) and comparable high performance in detecting adult patients with Alzheimer’s Disease from age-matched controls using the cortical grey matter of the temporal lobe normalized for head size (MIRIAD, dataset 4.a).

Full size table

In supporting the classification of AD patients from age-matched controls, the icobrain-dl demonstrated comparable high performance in terms of accuracy, sensitivity, and specificity. The AUC for icobrain-dl was 0.99, icobrain-nondl was 0.98, and FastSurfer was 0.98, with no statistically significant difference.

Computational time

On average, the proposed method took approximately 5 minutes to complete the entire pipeline when running on a server without a GPU (amazon web services cloud environment c6i.2xlarge, 8vCPU and 16GiB of Memory RAM) while the pipeline based on FastSurfer requires nearly 6 minutes on a GPU server (cloud environment p2.xlarge, NVIDIA Tesla K80 (12 GiB), 4vCPU and 61GiB of Memory RAM). In contrast, the non-deep learning approaches childmetrix and icobrain v5.9 running on a server without a GPU (cloud environment c6i.2xlarge, 8vCPU and 16GiB of Memory RAM) required on average 24 minutes and 27 minutes.

Qualitative results

Figure 5 illustrates the segmentation results of icobrain-dl in test patients across the lifespan, with ages ranging from 4 to 85 years old. These qualitative results demonstrate the model’s robustness to diverse pathological conditions and scans with differing intensities and contrasts.

Discussion

This study introduces icobrain-dl, a deep learning-based pipeline capable of performing quantitative assessment of brain tissues and structures across pediatric and adult populations.

The pipeline was developed and validated using T1-weighted images obtained from various scan vendors with different magnetic field strengths. The dataset includes patients across a broad age range with various pathological conditions. Evaluation of the proposed pipeline included segmentation accuracy and reproducibility assessments, along with an exploration of its clinical application through diagnostic performance and computational efficiency.

In contrast to methods tailored for specific age ranges, such as childmetrix for children or icobrain-nondl and FastSurfer for adults, icobrain-dl provides quantitative brain measurements across the human lifespan, from early childhood (i.e., 4 years old) to maturation and older age, within a single deep learning model. Previous experiments have shown the accuracy performance of adult-trained models in pediatric data^9,10. However, in this study, we explicitly included pediatric data to train the model and observed that it does not compromise the performance on scans from adult subjects, and vice versa. Furthermore, the inclusion of a pediatric cohort allowed the deep learning model to learn and adapt to challenges associated with brain development, including reduced tissue contrast, within-tissue intensity heterogeneities, and smaller regions of interest. The proposed single deep learning model eliminates the need for multiple age-specific segmentation models, enabling consistent measurements across transitional phases, such as from the pediatric stage through adolescence to adulthood. This facilitates the creation of a reference standard for human brain development, essential for quantifying developmental changes, interpreting deviations, and identifying patterns of anatomical differences in neurological and psychiatric disorders that manifest during various stages of development and aging³⁴.

High reproducibility is crucial for accurately measuring brain changes and atrophy³⁵. The proposed icobrain-dl, was compared with state-of-the-art brain segmentation models, including childmetrix¹⁵, FastSurfer¹¹ and the medical device software icobrain-nondl (i.e., icobrain v5.9^7,8). The results demonstrated overall superior reproducibility assessed in pediatric intra-scanner and adult intra- and inter-scanner scenarios, particularly in the adult inter-scanner setting, with significantly lower variability observed in all brain substructures (p < 0.01). This improvement can be attributed to the diverse sources of T1-weighted images used in training, along with the integration of a data augmentation algorithm. This algorithm enhanced the variability of training data in terms of intensity and contrast, which has been proven to be particularly beneficial for repetitions in different scanners (i.e., inter-scanner)²⁹.

Volumetric imaging biomarkers provided by icobrain-dl required good accuracy, specificity and sensitivity to be used as a metric for diagnosis (e.g., distinguishing patients with Alzheimer’s vs. healthy controls). The proposed pipeline exhibited comparable diagnostic performance to state-of-the-art methods, achieving the highest AUC for both clinical conditions. It is important to note that the purpose of the diagnostic performance scenario was to compare different methods using the same measurement, rather than to identify clinically relevant imaging biomarkers for specific pathologies. Future studies will explore the potential of volumetric imaging biomarkers to enhance our understanding of the underlying mechanisms of diseases and improve their diagnosis, particularly in complex and partly understood conditions like CVI. This involves increasing sample sizes and considering factors such as sexual dimorphism³⁶ and age-dependent developmental trajectories¹³.

The proposed pipeline also analyses the images faster than traditional segmentation approaches, aligning with findings from previous studies employing deep learning models^9,11. However, in contrast with previous deep learning models, the proposed model deployed a lightweight deep learning architecture, consisting of relatively few layers. This design choice aimed to reduce the computational complexity, facilitating model inference on CPU-only platforms and ensuring efficient segmentation without incurring the elevated economical costs associated with GPU usage. The reduced processing time avoids creating additional bottlenecks in the radiological workflow.

The annotation protocols used to establish the ground truth of brain structures may vary across datasets, potentially differing from our definition of brain structure borders. This discrepancy could explain the higher overlap observed between models than the overlap between models and ground truth. Notably, icobrain-dl and the age-specific models are trained on datasets with overlap** patients and employ the same annotation protocol.

The icobrain-dl pipeline is designed to use T1-weighted images to analyse the structural anatomy of the brain. Currently, its application is limited to conditions characterized by non-mass effects due to the absence of multimodal data, such as fluid-attenuated inversion recovery (FLAIR) images. However, future iterations of icobrain-dl aim to integrate multimodal data, thereby expanding its utility to cover a broader spectrum of pathologies.

The proposed deep learning model covers the human lifespan, starting at 4 years of age. The period preceding this age is the most dynamic phase of postnatal human brain development³⁷. Maturation processes, including myelination, notably influence T1-weighted image contrasts, for instance, shifting from hypointense white matter in newborns to hyperintese in 2-year-old infants, making the development of a reliable segmentation model a very complex task. Hence, additional exploration is required to incorporate quantification of brain segmentation during this initial phase of brain development.

Conclusion

The proposed deep learning-based pipeline, icobrain-dl, is capable of quantifying brain tissues and structures across the human lifespan beginning at 4 years of age. Extensive validation in clinically relevant settings has demonstrated its ability to provide accurate and reproducible volume quantification of relevant brain anatomical structures from T1-weighted images.

By offering a unified solution from early childhood to maturation and older age, icobrain-dl has the potential to significantly enhance research and clinical applications in monitoring brain development and diagnosing neurological conditions.

Data and code availability

Further details regarding the publicly available datasets analyzed in the current study can be found in Appendix A. Additional datasets analyzed during the current study can be made available from the corresponding author with the permission of a third party upon reasonable request. The code employed in this study is not publicly accessible due to commercial restrictions but is available from the corresponding author upon reasonable request.

References

Smith, S. M. et al. Advances in functional and structural MR image analysis and implementation as FSL. Neuroimage 23, S208–S219 (2004).
Article PubMed Google Scholar
Mills, K. L. & Tamnes, C. K. Methods and considerations for longitudinal structural brain imaging analysis across development. Dev. Cogn. Neurosci. 9, 172–190 (2014).
Article PubMed PubMed Central Google Scholar
Urbach, H. et al. "within a minute" detection of focal cortical dysplasia. Neuroradiology 64, 715–726 (2022).
Article PubMed Google Scholar
Pemberton, H. G. et al. Technical and clinical validation of commercial automated volumetric MRI tools for dementia diagnosis-a systematic review. Neuroradiology 63, 1773–1789 (2021).
Article PubMed PubMed Central Google Scholar
Fischl, B. Freesurfer. Neuroimage 62, 774–781 (2012).
Article PubMed Google Scholar
Manjón, J. V. & Coupé, P. volBrain: An online MRI brain volumetry system. Front. Neuroinform. 10, 30 (2016).
Article PubMed PubMed Central Google Scholar
Jain, S. et al. Automatic segmentation and volumetry of multiple sclerosis brain lesions from MR images. NeuroImage Clin. 8, 367–375 (2015).
Article PubMed PubMed Central Google Scholar
Struyfs, H. et al. Automated MRI volumetry as a diagnostic tool for Alzheimer’s disease: Validation of icobrain dm. NeuroImage Clin. 26, 102243 (2020).
Article PubMed PubMed Central Google Scholar
Roy, A. G. et al. QuickNAT: A fully convolutional network for quick and accurate segmentation of neuroanatomy. Neuroimage 186, 713–727 (2019).
Article Google Scholar
Coupé, P. et al. AssemblyNet: A large ensemble of CNNs for 3D whole brain MRI segmentation. Neuroimage 219, 117026 (2020).
Article PubMed Google Scholar
Henschel, L. et al. Fastsurfer-a fast and accurate deep learning based neuroimaging pipeline. Neuroimage 219, 117012 (2020).
Article PubMed Google Scholar
Jyothi, P. & Singh, A. R. Deep learning models and traditional automated techniques for brain tumor segmentation in MRI: A review. Artif. Intell. Rev. 56, 2923–2969 (2023).
Article Google Scholar
Phan, T. V., Smeets, D., Talcott, J. B. & Vandermosten, M. Processing of structural neuroimaging data in young children: Bridging the gap between current practice and state-of-the-art methods. Dev. Cogn. Neurosci. 33, 206–223 (2018).
Article PubMed Google Scholar
Phan, T. V. et al. Evaluation of methods for volumetric analysis of pediatric brain data: the childmetrix pipeline versus adult-based approaches. NeuroImage Clin. 19, 734–744 (2018).
Article PubMed PubMed Central Google Scholar
Phan, T. V. et al. Structural brain dynamics across reading development: A longitudinal MRI study from kindergarten to grade 5. Hum. Brain Mapp. 42, 4497–4509 (2021).
Article PubMed PubMed Central Google Scholar
Turesky, T. K., Vanderauwera, J. & Gaab, N. Imaging the rapidly develo** brain: Current challenges for MRI studies in the first five years of life. Dev. Cogn. Neurosci. 47, 100893 (2021).
Article PubMed Google Scholar
Alexander, L. M. et al. Data descriptor: An open resource for transdiagnostic research in pediatric mental health and learning disorders. Sci. Datahttps://doi.org/10.1038/sdata.2017.181 (2017).
Article PubMed PubMed Central Google Scholar
Paniukov, D., Lebel, R. M., Giesbrecht, G. & Lebel, C. Calgary cerebral blood flow increases across early childhood. NeuroImagehttps://doi.org/10.1016/j.neuroimage.2019.116224 (2020).
Article PubMed Google Scholar
Van Assche, L. et al. Hippocampal volume as a vulnerability marker for late onset psychosis: Associations with memory function and childhood trauma. Schizophr. Res. 224, 201–202 (2020).
Article PubMed Google Scholar
Kennedy, D. N. et al. CANDIShare: A resource for pediatric neuroimaging data. Neuroinformatics 10, 319–322. https://doi.org/10.1007/s12021-011-9133-y (2012).
Article PubMed PubMed Central Google Scholar
Landman, B. & Warfield, S. Miccai 2012 workshop on multi-atlas labeling. In MICCAI Grand Challenge and Workshop on Multi-Atlas Labeling (CreateSpace Independent Publishing Platform, Nice, France, 2012).
Nooner, K. B. et al. The NKI-Rockland sample: A model for accelerating the pace of discovery science in psychiatry. Front. Neurosci. 6, 152 (2012).
Article PubMed PubMed Central Google Scholar
Malone, I. B. et al. Miriad-public release of a multiple time point Alzheimer’s MR imaging dataset. Neuroimage 70, 33–36 (2013).
Article PubMed Google Scholar
Tustison, N. J. et al. N4ITK: Improved N3 bias correction. IEEE Trans. Med. Imaging 29, 1310–1320. https://doi.org/10.1109/TMI.2010.2046908 (2010).
Article PubMed PubMed Central Google Scholar
Ourselin, S., Roche, A., Subsol, G., Pennec, X. & Ayache, N. Reconstructing a 3D structure from serial histological sections. Image Vis. Comput. 19, 25–31. https://doi.org/10.1016/S0262-8856(00)00052-4 (2001).
Article Google Scholar
Çiçek, Ö., Abdulkadir, A., Lienkamp, S. S., Brox, T. & Ronneberger, O. 3D U-Net: Learning dense volumetric segmentation from sparse annotation. In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2016, 424–432 (Springer International Publishing, Cham, 2016).
Salimans, T. & Kingma, D. P. Weight normalization: A simple reparameterization to accelerate training of deep neural networks. Adv. Neural Inf. Process. Syst., 901–909 (2016).
Isensee, F., Jaeger, P. F., Kohl, S. A., Petersen, J. & Maier-Hein, K. H. nnU-Net: A self-configuring method for deep learning-based biomedical image segmentation. Nat. Methods 18, 203–211. https://doi.org/10.1038/s41592-020-01008-z (2021).
Article CAS PubMed Google Scholar
Meyer, M. I. et al. A contrast augmentation approach to improve multi-scanner generalization in MRI. Front. Neurosci.https://doi.org/10.3389/FNINS.2021.708196 (2021).
Article PubMed PubMed Central Google Scholar
Taha, A. A. & Hanbury, A. Metrics for evaluating 3D medical image segmentation: Analysis, selection, and tool. BMC Med. Imaging 15, 1–28 (2015).
Article Google Scholar
Isensee, F. et al. Automated brain extraction of multisequence MRI using artificial neural networks. Hum. Brain Mapp. 40, 4952–4964 (2019).
Article PubMed PubMed Central Google Scholar
Ortibus, E., Fazzi, E. & Dale, N. Cerebral visual impairment and clinical assessment: The European perspective. In Seminars in Pediatric Neurology, 31, 15–24 (Elsevier, 2019).
Sun, X. & Xu, W. Fast implementation of Delong’s algorithm for comparing the areas under correlated receiver operating characteristic curves. IEEE Signal Process. Lett. 21, 1389–1393 (2014).
Article ADS Google Scholar
Bethlehem, R. A. et al. Brain charts for the human lifespan. Nature 604, 525–533 (2022).
Article CAS PubMed PubMed Central Google Scholar
Guo, C., Ferreira, D., Fink, K., Westman, E. & Granberg, T. Repeatability and reproducibility of freeSurfer, FSL-SIENAX and SPM brain volumetric measurements and the effect of lesion filling in multiple sclerosis. Eur. Radiol. 29, 1355–1364 (2019).
Article PubMed Google Scholar
López-Ojeda, W. & Hurley, R. A. Sexual dimorphism in brain development: Influence on affective disorders. J. Neuropsychiatry Clin. Neurosci. 33, A4-85 (2021).
Article PubMed Google Scholar
Li, G. et al. Map** region-specific longitudinal cortical surface expansion from birth to 2 years of age. Cereb. Cortex 23, 2724–2733 (2013).
Article PubMed Google Scholar

Download references

Acknowledgements

The PARENT project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Maria Skłodowska Curie-Innovative Training Network 2020, Grant Agreement No 956394. We wish to thank Elien Bollen for her contribution in curating the data. We also express our gratitude to all investigators involved in collecting and providing free access to the datasets. This manuscript reflects the views of the authors and may not reflect the opinions or views of the database providers.

Author information

Authors and Affiliations

icometrix, Leuven, Belgium
Jaime Simarro, Maria Ines Meyer, Simon Van Eyndhoven, Thanh Vân Phan, Thibo Billiet & Diana M. Sima
Department of Development and Regeneration, KU Leuven, Leuven, Belgium
Jaime Simarro & Els Ortibus
Department of Pediatric Neurology, UZ Leuven, Leuven, Belgium
Els Ortibus
Child and Youth Institute, KU Leuven, Leuven, Belgium
Els Ortibus

Authors

Jaime Simarro
View author publications
You can also search for this author in PubMed Google Scholar
Maria Ines Meyer
View author publications
You can also search for this author in PubMed Google Scholar
Simon Van Eyndhoven
View author publications
You can also search for this author in PubMed Google Scholar
Thanh Vân Phan
View author publications
You can also search for this author in PubMed Google Scholar
Thibo Billiet
View author publications
You can also search for this author in PubMed Google Scholar
Diana M. Sima
View author publications
You can also search for this author in PubMed Google Scholar
Els Ortibus
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.S. Investigation, Methodology, Validation, Formal analysis, Writing—original draft. M.I.M. Conceptualization, Methodology, Software. S.V.E Methodology, Software, Writing—review & editing. T.V.P. Conceptualization, Methodology, Writing—review & editing T.B. Supervision, Writing—review & editing, Funding acquisition. D.M.S. Supervision, Methodology, Conceptualization, Writing—review & editing. E.O. Supervision, Writing—review & editing, Funding acquisition.

Corresponding author

Correspondence to Jaime Simarro.

Ethics declarations

Competing interests

The following authors are employed (or have been employed at the time of performing the work relevant for this paper) by icometrix: J.S, M.I.M, S.V.E, T.V.P., T.B., D.M.S. E.O. declares no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Simarro, J., Meyer, M.I., Van Eyndhoven, S. et al. A deep learning model for brain segmentation across pediatric and adult populations. Sci Rep 14, 11735 (2024). https://doi.org/10.1038/s41598-024-61798-6

Download citation

Received: 08 January 2024
Accepted: 09 May 2024
Published: 22 May 2024
DOI: https://doi.org/10.1038/s41598-024-61798-6
Springer Nature Limited

A deep learning model for brain segmentation across pediatric and adult populations

Abstract

Similar content being viewed by others

Deep learning-based, fully automated, pediatric brain segmentation

An automatic and accurate deep learning-based neuroimaging pipeline for the neonatal brain

Effect of head motion-induced artefacts on the reliability of deep learning-based whole-brain segmentation

Introduction

Materials and methods

Datasets

Training dataset

Segmentation accuracy testing dataset

Reproducibility testing dataset

Diagnostic performance testing dataset

icobrain-dl pipeline: design and development

Pre-processing

Simultaneous segmentation of brain tissue and structures via a multi-head deep learning model

Efficient generation of high-quality training labels

Models training scheme

Validating technical and diagnostic performance

Results

Accuracy

Reproducibility

Diagnostic performance

Computational time

Qualitative results

Discussion

Conclusion

Data and code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation