Introduction

Treatment individualisation is a central objective for the improvement of radiotherapy outcomes1. In particular, patients diagnosed with locally advanced head and neck squamous cell carcinoma (HNSCC) might benefit from individualised treatment, since the five-year overall survival probability after primary radiochemotherapy is only approx. 50%2. Subgroups of patients may be identified that are currently under- or overtreated and might benefit from, e.g., escalated or de-escalated dose prescriptions. Individualisation of treatment may be based on statistical survival models that predict endpoints such as overall survival or loco-regional tumour control (LRC). Survival models are able to analyse time-to-event data, which frequently contain censored observations. The prognostic value of these models is based on biomarkers that are able to stratify patients into groups at different risk of treatment failure. Such biomarkers may result from clinical or tumour-related features such as age, gender or tumour stage, molecular analyses of tumour biopsies such as human papillomavirus (HPV) status or gene signatures, dosimetric information, clinical imaging data from computed tomography (CT), magnetic resonance imaging (MRI) or positron emission tomography (PET) scans, or combinations thereof3,4,5,6,7,8,9,10,11,12,13.

Imaging data are considered a valuable source of information for tailoring individual treatment due to their non-invasiveness, repeatability and ability to represent the entire tumour. Numerous radiomics models, in which traditional machine-learning (ML) methods were applied to hundreds or thousands of pre-defined, handcrafted image features, have been developed14,15,16,17,18, but have not yet surpassed the threshold for clinical acceptance and applicability19. Recently, Ger et al.20 found that radiomics features of CT and PET scans failed to improve upon clinical risk models in a large head and neck cancer dataset. With the recent advances that deep convolutional neural networks (CNNs) have brought to the fields of natural and medical image analysis, there is hope to elevate model performance for radiotherapy outcome modelling as well. This is mostly due to the fact that CNNs automatically learn abstract feature representations of the input data during training. However, so far most applications of deep learning to medical images revolve around tasks of segmentation21 or classification22,23,24. The same holds true for the field of radiotherapy, where most applications of deep learning focus on segmentation, computer-aided detection or motion management25. Only a few attempts have been published that combine deep learning on medical imaging data with survival analysis26,27,1.

We investigated four approaches to develop survival models based on the Cox proportional hazards model (CPHM) for the prediction of LRC hazards for patients diagnosed with locally advanced HNSCC. First, (i) a clinical model was trained on the exploratory cohort and evaluated on the independent validation cohort to provide baseline performance metrics. Moreover, three deep learning based strategies using CNNs were applied: we (ii) trained models completely from scratch, using 3D-CNNs as well as 2D-CNNs, (iii) applied a transfer learning approach leveraging weights of pre-trained 2D-CNN networks, and (iv) created a deep autoencoder and used its bottleneck features in a traditional CPHM.

Figure 1

Design of the analysis. (i) To provide baseline results, a clinical Cox proportional hazards model (CPHM) was trained on the exploratory cohort and evaluated on the independent validation cohort. (ii)–(iv) Three deep learning approaches were evaluated by training convolutional neural networks in a cross-validation approach. Subsequently, for each approach ensembles were constructed from the models obtained during cross-validation and their performance was evaluated on the independent validation cohort.

Prognostic performance was evaluated by two approaches: calculation of the concordance index (C-index) and assessment of the ability to stratify patients into two risk groups based on the model predictions. The C-index37,38,39 measures the alignment between the observed times of loco-regional recurrence and the model predictions. It is given on a scale between zero and one, with 0.5 indicating no prognostic value of the model. A C-index close to zero represents perfect predictions, since predicted hazards should be lower for patients with a longer recurrence-free time. We emphasise that this is in contrast to the situation of directly predicting event times, where a C-index close to one would be desirable. 95% confidence intervals (CI) for C-indices were computed using the survcomp R package40,41, which implements the method proposed by Pencina et al.42. Models that did not contain the C-index 0.5 within the 95% CI on the independent validation cohort were considered successfully validated.
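The C-index itself was computed with the survcomp R package; purely as an illustration of its orientation in our setting, the following minimal Python sketch (using the lifelines package, which is our choice for this example, with hypothetical data) shows that concordance between predicted hazards and recurrence-free times approaches zero for a well-performing model:

```python
import numpy as np
from lifelines.utils import concordance_index

# Hypothetical data: times to loco-regional recurrence (months),
# event indicators (1 = recurrence observed, 0 = censored) and
# model-predicted log-hazards (higher hazard should mean shorter time).
times = np.array([6.0, 12.0, 24.0, 36.0, 60.0])
events = np.array([1, 1, 1, 0, 0])
predicted_hazards = np.array([2.1, 1.4, 0.3, -0.5, -1.2])

# lifelines counts a pair as concordant when a HIGHER score goes with a
# LONGER time; with hazards as scores, a good model therefore yields a
# C-index close to zero, as described above.
print(concordance_index(times, predicted_hazards, events))  # 0.0 here
```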

Furthermore, based on the model predictions, patients were assigned to two groups, at low or at high risk for loco-regional recurrence. This stratification was based on the hazard values predicted by the models for every individual patient. The median value of these predictions on the exploratory cohort was used as a cutoff. Patients with a predicted hazard exceeding the cutoff were assigned to the high risk group and the remaining patients with hazards smaller or equal to the cutoff were assigned to the low risk group. To stratify patients of the independent validation cohort, the same cutoff was applied. The difference in LRC between the stratified patient groups was assessed using the log-rank test for the Kaplan–Meier (KM) curves of both risk groups. Significance was established for p values below 0.05.
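A minimal sketch of this stratification and test procedure, assuming the lifelines Python package (the package choice and the function name are ours for illustration):

```python
import numpy as np
from lifelines.statistics import logrank_test

def stratify_and_test(pred_explor, pred_valid, times_valid, events_valid):
    """Median-cutoff risk stratification with a log-rank test (sketch)."""
    # cutoff: median of the predicted hazards on the exploratory cohort
    cutoff = np.median(pred_explor)
    # the same cutoff is applied to the independent validation cohort
    high = pred_valid > cutoff
    res = logrank_test(times_valid[~high], times_valid[high],
                       event_observed_A=events_valid[~high],
                       event_observed_B=events_valid[high])
    return cutoff, res.p_value  # significant stratification if p < 0.05
```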

To address the random nature of the CNN training procedure and to leverage the benefits of model ensembles43, we repeated model training three times, each time using 10-fold cross-validation (CV) based on the exploratory cohort, stratified by the LRC event status, for a total of 30 CV runs. By applying CV on the exploratory cohort, splits of the samples into training and internal test folds were obtained. Models were built in each CV run using the data of the training fold. Data of the internal test fold was set aside for optional hyperparameter tuning and data of the independent validation cohort was used to measure model performance on previously unseen data.

Since each of the 30 CV runs resulted in a trained model (which we refer to as a single model), we created ensemble predictions by averaging the network outputs, essentially considering the information of multiple models before making a final prediction.

Image processing

Preprocessing of patient CT scans was carried out using an in-house developed toolkit44 (available from https://github.com/oncoray/mirp) by performing (1) cubic interpolation to an isotropic voxel size of \(1 \, {\text {mm}}^{3}\), (2) cropping of the transversal plane to 224 by 224 pixels (with the tumour's centre of mass as the centre of the cropped slice), (3) clipping of the intensity range of Hounsfield units (HU) to the range [-200, 200] and (4) normalisation of pixel values to the interval (0, 1).
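These four steps can be sketched as follows (a simplified NumPy/SciPy version with hypothetical inputs; the actual pipeline is implemented in the linked mirp toolkit, and the boundary handling here is an assumption):

```python
import numpy as np
from scipy import ndimage

def preprocess(ct, mask, spacing):
    """Sketch of steps (1)-(4) for one scan; `spacing` is (z, y, x) in mm."""
    # (1) cubic interpolation to isotropic 1 mm voxels
    ct = ndimage.zoom(ct, zoom=spacing, order=3)
    mask = ndimage.zoom(mask, zoom=spacing, order=0)
    # (2) crop the transversal (y, x) plane to 224 x 224 around the
    # tumour's centre of mass (out-of-bounds padding omitted for brevity)
    cz, cy, cx = (int(round(c)) for c in ndimage.center_of_mass(mask))
    ct = ct[:, cy - 112:cy + 112, cx - 112:cx + 112]
    # (3) clip Hounsfield units to [-200, 200]
    ct = np.clip(ct, -200, 200)
    # (4) normalise pixel values towards the interval (0, 1)
    return (ct + 200) / 400.0
```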

Multiple image samples of each patient's CT scan were extracted and used for model training and prediction. For all 2D-CNN models, we used 7 slices cranial and 8 slices caudal of the slice with the largest tumour area as provided by the segmentation mask, comprising a total of 16 transversal CT slices per patient. For training of the 3D-CNNs, we used smaller image regions of the axial plane due to GPU memory limitations. We first extracted a \(32 \times 64 \times 64\) (\(\hbox {z} \times \hbox {y} \times \hbox {x}\)) sized volume centred at the tumour's centre of mass. Then, 15 additional random volumes of the same size were extracted for each patient, with volume centres sampled uniformly from a cubic region of edge length 32 around the tumour's centre of mass. Zero padding was added to all extracted volumes where necessary. For each of the volumes, a prediction was computed; these were subsequently averaged to obtain a single prediction for each patient.
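The 3D sampling strategy might look as follows (a sketch; variable names and the padding implementation are ours):

```python
import numpy as np

def extract_volume(ct, centre, shape=(32, 64, 64)):
    """Extract a (z, y, x) volume around `centre`, zero-padded at borders."""
    half = np.array(shape) // 2
    padded = np.pad(ct, [(h, h) for h in half], mode='constant')
    c = np.array(centre) + half  # shift centre into padded coordinates
    return padded[tuple(slice(cc - h, cc + h) for cc, h in zip(c, half))]

# one volume at the tumour centre of mass plus 15 random volumes whose
# centres are drawn uniformly from a cube of edge length 32 around it
rng = np.random.default_rng(42)
com = np.array([40, 120, 118])  # hypothetical tumour centre of mass (z, y, x)
centres = [com] + [com + rng.integers(-16, 17, size=3) for _ in range(15)]
# volumes = [extract_volume(ct_array, c) for c in centres]
# the per-patient prediction is the average of the model outputs over volumes
```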

Cox proportional hazards model

The traditional CPHM fits the effect of p-dimensional covariates \({\varvec{x}}\) on the hazard function h via \(h(t, {\varvec{x}}) = h_{0}(t) \exp \left( \sum _{j=1}^{p}\beta _{j}x_{j}\right) ,\) with an unspecified baseline hazard function \(h_{0}(t)\). We followed Katzman et al.29 in extending this to the more general form of \(h(t, {\varvec{x}}) = h_{0}(t) \exp \left( \gamma _{\varvec{\beta }}({\varvec{x}})\right)\) with \(\varvec{\beta }\) denoting weights learned by a neural network. Log-hazard values \(\gamma _{\varvec{\beta }}({\varvec{x}})\) were estimated from CT image samples \({\varvec{x}}\) by minimisation of (a batch approximation of) the negative of the Cox partial log-likelihood function

$$\begin{aligned} \ln L = \sum _{i=1}^{n} \delta _{i} \left( \gamma _{\varvec{\beta }} ({\varvec{x}}_{i}) - \ln \left( \sum _{\begin{array}{c} j=1\\ t_{j} \ge t_{i} \end{array}}^{n}\exp (\gamma _{\varvec{\beta }}({\varvec{x}}_{j}))\right) \right) , \end{aligned}$$
(1)

letting \(\delta _{i}\) denote an event indicator variable that takes on the value 1 if loco-regional tumour recurrence was observed for CT sample i and 0 otherwise, and n being the total number of available CT samples. Further details on survival analysis and the CPHM are given in “Survival analysis and deep Cox proportional hazards modelling” section of the supplement.
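The negative of Eq. (1) can serve directly as a batch loss. The following is a minimal sketch under our own conventions (TensorFlow 2.x; the paper used TensorFlow 1.12, where the cumulative log-sum-exp would need to be written out manually), not the authors' exact implementation; event times and indicators are packed into y_true:

```python
import tensorflow as tf

def neg_cox_partial_log_likelihood(y_true, y_pred):
    """y_true[:, 0] = time t_i, y_true[:, 1] = event indicator delta_i;
    y_pred[:, 0] = estimated log-hazard gamma(x_i)."""
    time = y_true[:, 0]
    event = y_true[:, 1]
    gamma = y_pred[:, 0]
    # sort by descending time so that the risk set of sample i
    # (all j with t_j >= t_i) is exactly the prefix [0..i]
    order = tf.argsort(time, direction='DESCENDING')
    gamma = tf.gather(gamma, order)
    event = tf.gather(event, order)
    # log of the sum over the risk set, as a running log-sum-exp
    log_risk = tf.math.cumulative_logsumexp(gamma)
    # sum over observed events only; ties are ignored here, as discussed
    # in the "Discussion" section
    return -tf.reduce_sum(event * (gamma - log_risk)) / (
        tf.reduce_sum(event) + 1e-8)
```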

All computations were done using Python 3.6.7 and Keras 2.2.445 with tensorflow (v1.12.0) backend. Our code is publicly available from https://github.com/oncoray/cnn-hnscc and experimental outputs can be downloaded from http://doi.org/10.14278/rodare.255.

Clinical model

To develop the clinical CPHM, we considered the clinical features patient age, gender, cT-stage, cN-stage, UICC-stage, tumour site, p16 status, pathological grading, smoking status, alcohol consumption and primary tumour volume. These features have already been considered in previous studies7,35. Tumour site comprised the values oropharynx, hypopharynx, larynx and oral cavity and was one-hot encoded. Tumour volume was computed by summation of the tumour segmentation mask voxels and division by a factor of 1000 to obtain units of \({\text {cm}}^{3}\), followed by a (natural) logarithmic transformation.

Imputation of missing values for cN-stage, pathological grading and smoking status (1, 14 and 23 cases, respectively) was performed through selection of the most frequent value in the exploratory cohort. Due to more missing values (58 cases), p16 was converted into the variables \({\text {p16}}_{{\text {unknown}}}\) and p16. The same was done for alcohol consumption for which there were 96 missing cases. cT, cN, UICC and pathological grading stages were converted into the binary categories \({\text {cT}}<4\), \({\text {cN}}<2\), \({\text {UICC}}<4\)7 and \({\text {pathological grading}}<2\). Patient age and tumour volume were z-score normalised with means and standard deviations obtained from the exploratory cohort. Clinical features prognostic for LRC were selected by applying a forward variable selection CPHM based on the likelihood ratio test (inclusion \(\alpha = 0.05\), exclusion \(\alpha = 0.1\)) using the exploratory cohort. Finally, a CPHM was trained on the exploratory cohort using the selected features and applied to the independent validation cohort.
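A forward selection step based on likelihood-ratio tests could be implemented along the following lines (a sketch using the lifelines Python package, which is our assumption for illustration; the exclusion step with \(\alpha = 0.1\) is omitted for brevity):

```python
from lifelines import CoxPHFitter
from scipy import stats

def forward_select(df, candidates, time_col='time', event_col='event',
                   alpha_in=0.05):
    """Greedy forward feature selection for a CPHM via likelihood-ratio tests."""
    selected, ll_current = [], None
    while True:
        best_p, best_feat, best_ll = 1.0, None, None
        for feat in (f for f in candidates if f not in selected):
            cols = [time_col, event_col] + selected + [feat]
            cph = CoxPHFitter().fit(df[cols], time_col, event_col)
            if ll_current is None:
                # first feature: likelihood-ratio test against the null model
                stat = cph.log_likelihood_ratio_test().test_statistic
            else:
                # nested models: LR statistic with one degree of freedom
                stat = 2.0 * (cph.log_likelihood_ - ll_current)
            p = stats.chi2.sf(stat, df=1)
            if p < best_p:
                best_p, best_feat, best_ll = p, feat, cph.log_likelihood_
        if best_feat is None or best_p >= alpha_in:
            return selected
        selected.append(best_feat)
        ll_current = best_ll
```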

Model ensembles

Due to our cross-validation approach (10-fold CV repeated three times), 30 different models were trained in every analysis. By averaging the resulting predicted log-hazard values, one final ensemble prediction for the hazard of loco-regional recurrence was obtained for every patient. On the independent validation cohort, a patient's ensemble prediction was computed by averaging over all 30 model predictions. For every patient of the exploratory cohort, both a training and an internal test ensemble prediction were computed, since each patient appeared in training folds as well as in internal test folds. Training ensemble predictions were obtained by averaging, for every patient, over the 27 models for which that patient was part of the training fold. Similarly, internal test ensemble predictions were computed using only the remaining three models for which the patient belonged to the internal test fold. For ensemble stratification of patients into groups at low and high risk of loco-regional recurrence, the cutoff value was determined as the median of the training ensemble predictions.
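In array terms, the three kinds of ensemble predictions reduce to (masked) averages over the 30 models; a sketch with assumed array layouts:

```python
import numpy as np

def ensemble_predictions(preds_expl, in_test, preds_val):
    """preds_expl: (30, n_expl) log-hazards of each CV model on the
    exploratory cohort; in_test: boolean mask of the same shape, True where
    the patient was in that model's internal test fold (3 of 30 models per
    patient); preds_val: (30, n_val) predictions on the validation cohort."""
    train_ens = np.nanmean(np.where(in_test, np.nan, preds_expl), axis=0)
    test_ens = np.nanmean(np.where(in_test, preds_expl, np.nan), axis=0)
    val_ens = preds_val.mean(axis=0)
    cutoff = np.median(train_ens)  # risk-stratification cutoff (see above)
    return train_ens, test_ens, val_ens, cutoff
```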

Training from scratch

Different network architectures of 3D-CNN and 2D-CNN models were trained from scratch. In all training runs, we used the AMSGrad version46 of the Adam optimiser to estimate model parameters. For the 3D-CNN experiments, the same architecture and hyperparameters as given by Hosny et al.23 were used with small changes. Due to a different input shape, the first dense layer contained slightly fewer neurons. In the last layer, a single output neuron with \(\tanh\) activation was used instead of the two neurons with softmax activation which they used for classification purposes. Each model was trained for a fixed number of 200 epochs with a batch size of 24. Neither data augmentation nor callbacks for early stopping or learning rate adjustments were used.
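In Keras terms, this training configuration amounts to the following sketch (the learning rate is an assumption, taken only by analogy to Hosny et al.; `neg_cox_partial_log_likelihood` refers to the loss sketched in the previous section):

```python
from tensorflow.keras.optimizers import Adam

def train_3d_cnn(model, x_train, y_train):
    """Fixed-schedule training: no callbacks, no data augmentation."""
    model.compile(optimizer=Adam(learning_rate=1e-4, amsgrad=True),  # LR assumed
                  loss=neg_cox_partial_log_likelihood)
    model.fit(x_train, y_train, epochs=200, batch_size=24)
    return model
```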

The 2D-CNN architecture (Fig. 2) was loosely inspired by the VGG architecture. Training employed learning rate reduction (\(\min\_\hbox {lr}=10^{-7}\)) via the provided Keras callbacks. We also evaluated performance after replacing the final \(\tanh\) activation with a linear output, essentially allowing for unrestricted log-hazard ranges. Moreover, the effect of inserting batch normalisation (BN) layers between convolutions and ReLU activations was assessed.

Figure 2

Architecture used when training a 2D-convolutional neural network from scratch. Numbers give shapes of computed feature maps. The network consists of convolutional filters (‘conv’, light orange), with ReLU activation functions (orange). These are followed by a flattening layer and fully-connected dense layers (‘fc’, green). Network output is computed through a \(\tanh\) activation (purple). (a) This architecture was used when training only on image data. The model output is given by \(\gamma _{\varvec{\beta }}({\varvec{x}}_{{\text {img}}})\). (b) An additional dense layer was introduced when clinical features were used in addition to image data. The network output in this case is given by \(\gamma _{\varvec{\beta }}({\varvec{x}})\).

The effect of combining clinical features and CT samples as two separate inputs to a 2D-CNN was evaluated. First, Spearman correlation coefficients between the 2D-CNN model output (with BN and \(\tanh\) as final activation) and the clinical features were computed. Then, a second input branch, designed to estimate log-hazard values from the clinical features, was added to the network architecture as depicted in Fig. 2b. It consisted of a single dense neuron with \(\tanh\) activation and BN. The log-hazard estimates coming from the clinical branch and the image branch were then concatenated and fed through the final output layer, consisting again of a single dense neuron with \(\tanh\) activation and BN.
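A functional-API sketch of this two-branch architecture (the convolutional backbone and the exact ordering of dense, BN and \(\tanh\) layers are our reading of the description; the feature count is hypothetical):

```python
from tensorflow.keras import layers, Model

def hazard_neuron(x, name):
    # single dense neuron with batch normalisation and tanh activation
    x = layers.Dense(1, name=name)(x)
    x = layers.BatchNormalization()(x)
    return layers.Activation('tanh')(x)

def conv_backbone(x):
    # placeholder for the convolutional part of Fig. 2a (layer counts assumed)
    for filters in [16, 32, 64]:
        x = layers.Conv2D(filters, 3, padding='same', activation='relu')(x)
        x = layers.MaxPooling2D(2)(x)
    return layers.Flatten()(x)

n_clinical_features = 8  # hypothetical

img_in = layers.Input(shape=(224, 224, 1), name='ct_slice')
img_hazard = hazard_neuron(conv_backbone(img_in), 'image_branch')

clin_in = layers.Input(shape=(n_clinical_features,), name='clinical')
clin_hazard = hazard_neuron(clin_in, 'clinical_branch')

merged = layers.Concatenate()([img_hazard, clin_hazard])
log_hazard = hazard_neuron(merged, 'output')
model = Model(inputs=[img_in, clin_in], outputs=log_hazard)
```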

Transfer learning

We evaluated the capabilities of transfer learning for training 2D-CNNs, using the ResNet50, DenseNet201 and InceptionResNetV2 (IRNV2) architectures with pre-trained weights as foundations. Training employed learning rate reduction (\(\min\_\hbox {lr}= 10^{-7}\)) via the provided Keras callbacks. A batch size of 32 was used and neither data augmentation nor weight regularisation were applied.
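Such a transfer-learning head on top of a pre-trained network might be set up as follows (a sketch: the pooling choice, head size, reduction factor, patience and epoch count are assumptions, as is keeping the base trainable):

```python
from tensorflow.keras.applications import DenseNet201
from tensorflow.keras import layers, Model
from tensorflow.keras.callbacks import ReduceLROnPlateau

base = DenseNet201(weights='imagenet', include_top=False,
                   input_shape=(224, 224, 3))
# use the last convolutional layer of the pre-trained model as foundation
x = layers.GlobalAveragePooling2D()(base.output)  # pooling choice assumed
log_hazard = layers.Dense(1, activation='tanh')(x)
model = Model(base.input, log_hazard)

model.compile(optimizer='adam', loss=neg_cox_partial_log_likelihood)
lr_cb = ReduceLROnPlateau(monitor='val_loss', factor=0.5,  # factor assumed
                          patience=5, min_lr=1e-7)
# model.fit(x_train, y_train, batch_size=32, epochs=100,  # epochs assumed
#           validation_data=(x_test, y_test), callbacks=[lr_cb])
```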

Deep features

Following Wang et al.34, we trained a 2D-CNN autoencoder model that learns to reproduce input CT slices as closely as possible while passing them through a so-called bottleneck layer, which acts as a means of compression and dimensionality reduction. Successful reconstruction requires capturing important image characteristics at the bottleneck, and we assumed that relevant tumour information is also encoded within those features. The model architecture is provided in Fig. 3. It consisted of an encoder part of six convolutional layers with filter size \(3 \times 3\), starting with 16 filters and doubling the number on each subsequent layer. Leaky ReLU (\(\alpha =0.01\)) was used as activation and no BN was applied. Between convolutional layers, max-pooling was used to reduce spatial resolution by a factor of two. Finally, a last \(3 \times 3\) convolutional layer with 64 filters and the same specification as above was applied to reduce the number of features in the bottleneck representation. The decoder was constructed as a mirror image of the encoder, using upsampling layers to double the spatial resolution in each step. The decoder's last layer was a single \(1 \times 1\) convolutional filter with sigmoid activation function to produce outputs with a data range of (0, 1), matching the input image range. Using the binary-crossentropy loss function, we trained the autoencoder for 100 epochs with batches of size 32 using the AMSGrad version of the Adam optimiser with learning rate \(10^{-3}\). We used data augmentation by randomly shearing (shear_range = 0.1), zooming (zoom_range = 0.1) and rotating (rotation_range = 45) the input data.

We then extracted the bottleneck feature maps of each slice, which were of shape \(7 \times 7 \times 64\), corresponding to a reduction to 6.25% of the original image size (\(224 \times 224\)). Those features were flattened into a 3136-dimensional vector, and a principal component analysis (PCA) was performed as a means of dimensionality reduction, using the features of all slices of every patient from the training fold of the CV. Classical CPHMs were subsequently fitted on those training folds using one, two, five and ten PCA features. The learned PCA transformation was then applied to the features of the independent validation cohort before evaluating the performance of the trained CPHMs on those transformed features. In addition, a Lasso-based CPHM (LCPHM)52, which automatically selects relevant features, was fitted on the full set of bottleneck features of each training fold without performing a PCA, for a maximum of 5000 iterations. The best hyperparameter \(\lambda\), which determines the amount of L1 regularisation of the LCPHM, was obtained by a nested CV run on each training fold. This procedure was implemented using the R programming language and the glmnet package53.
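The encoder-decoder described above can be sketched in Keras as follows (shapes follow the description; minor details such as padding mode and the exact placement of pooling layers are assumptions):

```python
from tensorflow.keras import layers, Model
from tensorflow.keras.optimizers import Adam

def conv_lrelu(x, filters):
    x = layers.Conv2D(filters, 3, padding='same')(x)
    return layers.LeakyReLU(alpha=0.01)(x)

inp = layers.Input(shape=(224, 224, 1))
x = inp
for f in [16, 32, 64, 128, 256]:   # encoder: conv + max-pooling, 224 -> 7
    x = conv_lrelu(x, f)
    x = layers.MaxPooling2D(2)(x)
x = conv_lrelu(x, 512)             # sixth encoder convolution at 7 x 7
bottleneck = conv_lrelu(x, 64)     # 7 x 7 x 64 bottleneck features

x = bottleneck                     # decoder mirrors the encoder
for f in [512, 256, 128, 64, 32]:
    x = conv_lrelu(x, f)
    x = layers.UpSampling2D(2)(x)
x = conv_lrelu(x, 16)
out = layers.Conv2D(1, 1, activation='sigmoid')(x)  # match (0, 1) input range

autoencoder = Model(inp, out)
autoencoder.compile(optimizer=Adam(learning_rate=1e-3, amsgrad=True),
                    loss='binary_crossentropy')
encoder = Model(inp, bottleneck)   # used to extract bottleneck features
```

After training, bottleneck features for a patient's slices would be obtained via encoder.predict(...), flattened to 3136-dimensional vectors, and passed to the PCA and CPHM/LCPHM steps described above.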

Figure 3

Architecture of the applied autoencoder. Numbers describe the shapes of computed feature maps. Convolutional layers ('conv') are comprised of convolutional filters (light orange) and Leaky ReLU (\(\alpha =0.01\)) activation functions (orange). Spatial downsampling is performed using max-pooling layers (red), resulting in a set of bottleneck features. Upsampling operations ('up', blue) and convolutional layers are then used to reconstruct the input image. A sigmoid activation (purple) is used as model output to match the range of the input data.

Results

Clinical model

All available clinical features were considered to develop a clinical model for the prediction of LRC hazards. Based on the forward variable selection procedure, only the tumour volume was selected. This univariate CPHM achieved a C-index of 0.39 (95% CI: 0.32–0.45) on the exploratory cohort and a C-index of 0.39 (95% CI: 0.30–0.48) on the independent validation cohort. Stratification of the independent validation cohort into patient groups at low and high risk of loco-regional recurrence based on this clinical model showed a statistical trend towards significance (\(p=0.052\), Supplementary Fig. 1).

Training from scratch

An ensemble of 3D-CNNs was successfully validated for the prediction of LRC. It achieved a C-index of 0.31 (95% CI: 0.22–0.39) on the independent validation cohort (Table 2), outperforming the clinical model. Ensembling slightly improved upon the average single model performance (C-index: 0.32, Supplementary Table 1). Moreover, stratification of patients of the independent validation cohort (Fig. 4, top row) into groups at low and high risk of loco-regional recurrence based on the model predictions revealed significant differences in LRC (\(p=0.001\)). Ensembles of 2D-CNN models trained from scratch were also successfully validated for prognosis of LRC. However, they showed higher C-indices than the 3D model (C-index: 0.38–0.39, Table 2), i.e. a performance comparable to the clinical model. Average single model performance was similar (Supplementary Table 1). All 2D ensemble models led to significant patient stratifications of the independent validation cohort for LRC or showed a statistical trend (\(p=0.051\); Fig. 4, centre row). Table 2 also shows that the inclusion of BN and the choice of final activation did not have a strong impact on performance regarding C-indices or stratification ability on the independent validation cohort. The Spearman correlation coefficient between model predictions and z-score normalised log-tumour volume was moderate across all 30 models (with BN, \(\tanh\) as final activation), with average values of 0.30 and 0.36 for the exploratory and independent validation cohort, respectively. Combining imaging data and tumour volume as network input resulted in decreased performance compared to models with only the CT image as input: a C-index of 0.40 (95% CI: 0.29–0.50) was obtained on the independent validation cohort and model predictions did not result in a statistically significant stratification (\(p=0.070\)).

Table 2 Ensemble training from scratch: C-indices for the endpoint loco-regional control (LRC) are computed by averaging the model predictions of the repeated cross-validation models to build an ensemble model.

Transfer learning

For transfer learning, the ensemble of DenseNet201 models built on its last convolutional layer was successfully validated for prognosis of LRC and achieved the best C-index of 0.37 (95% CI: 0.27–0.47) on the independent validation cohort (Table 3), which was slightly better than the clinical model. Compared to the average single model performance (C-index: 0.41, Supplementary Table 2), this was an improvement of 0.04. Moreover, a statistically significant stratification into groups at low and high risk of loco-regional recurrence was achieved by this ensemble on the independent validation cohort (\(p=0.041\); Fig. 4, bottom row). Using the last convolutional layer as foundation, ensembles of ResNet50 or IRNV2 models were not able to successfully stratify patients of the independent validation cohort. Using layers other than the last convolutional layer of the pre-trained models as input to the newly added dense layers resulted in slightly worse C-indices in all cases.

Table 3 Ensemble of transfer learning models: C-indices for the endpoint loco-regional control (LRC) are computed by averaging the model predictions of the repeated cross-validation models to build an ensemble model.

Boxplots showing the variability of ensemble predictions for patients of the independent validation cohort are provided in Supplementary Figs. 2, 3 and 4 for the ensembles of 3D-CNN models, 2D-CNN models and DenseNet201 models, respectively.

Deep features

The prognostic performance of classical CPHMs using bottleneck features of autoencoder models as covariates is given in Table 4. Model performance was inferior to the clinical model in all scenarios, and none of the models achieved a statistically significant stratification of the independent validation cohort into low and high risk groups. The best C-index on the independent validation cohort was 0.42 (95% CI: 0.32–0.53), obtained by the LCPHM ensemble. The ensemble model improved the C-index on the independent validation cohort by 0.03 compared to the average single model C-index (Supplementary Table 3). The proportion of the total variance of the data captured by the PCA features is provided in Supplementary Table 4.

Table 4 Ensemble of autoencoder models: C-indices for the endpoint loco-regional control (LRC) are computed by averaging the model predictions of the repeated cross-validation models to build an ensemble model.
Figure 4

Ensemble Kaplan–Meier curves: Kaplan–Meier curves for patient groups at low risk (blue) and high risk (orange) of loco-regional recurrence for training and internal test folds as well as for the independent validation cohort. The stratification was created using the median of the training ensemble predictions as cutoff. The top row shows the curves obtained from an ensemble of 3D-CNN models trained from scratch based on the architecture of Hosny et al.23 with \(\tanh\) as final activation. The centre row shows the curves obtained from an ensemble of 2D-CNN models trained from scratch without batch normalisation and \(\tanh\) as final activation. The bottom row shows the curves obtained from an ensemble of transfer learning models based on DenseNet201 with the last convolutional layer as foundation.

Discussion

We investigated deep learning methods in a survival analysis setting for the endpoint LRC, based on treatment-planning CT images of locally advanced HNSCC patients treated with primary radiochemotherapy. The best performance and successful validation were achieved by an ensemble of 3D-CNNs with a C-index of 0.31 on the independent validation cohort. Patient risk groups defined by the model predictions showed significant differences in LRC (\(p=0.001\)). Ensembles of different 2D-CNN approaches performed similarly to a clinical CPHM based on the tumour volume (independent validation C-index of 0.39). Compared to using only a single trained model instance, our analysis revealed benefits of using model ensembles for final predictions, which is in line with the reasoning of Dietterich43.

Overall, the reported performances for 2D-CNNs were comparable to results previously published by our group by Leger et al.17, who evaluated multiple combinations of feature selection algorithms and classical machine learning models based on handcrafted radiomics features on the same dataset. An average independent validation C-index over all combinations of 0.62 was achieved (which corresponds to a C-index of 0.38 in our context, as explained in the "Methods" section). Similarly, Haarburger et al.54 reported comparable findings. Collaborative learning across institutions55 seems to be a promising way to tackle the small sample size problem of medical imaging; this includes setting up infrastructures that allow models to be trained on data of multiple institutions without violating data-privacy regulations. Also, exploring generative adversarial networks for enhancing dataset sizes through simultaneous generation of synthetic image samples and plausible time-to-event labels56,57 might be a promising direction. However, for HNSCC, treatment-planning CT scans may simply not contain much more predictive information, no matter the deep learning approach, model architecture or hyperparameters. As previously indicated3,9,58, considering additional imaging during the course of treatment or additional imaging modalities such as MRI or PET may offer improved predictive potential.

Another limitation of our analysis concerns the Cox partial log-likelihood function, as given by Eq. (1), which does not account for ties in the data. Ties can easily occur if multiple samples of the same patient are present in a single training batch. We therefore plan to use, e.g., Efron's correction method59 in future analyses, but refrained from doing so in our current experiments in order to avoid introducing additional complexity into the loss function. Instead, we experimented with applying slight random perturbations to the observed event times to avoid exact matches, but did not observe noteworthy changes in model performance (see first row of Supplementary Table 6). An alternative to the CPHM is the combination of deep learning with accelerated failure time models, as demonstrated by Chapfuwa et al.57 on clinical data. Due to their fully-parametric nature, direct prediction of event times becomes easier and non-monotonic hazard functions can be modelled.

Deep learning approaches on treatment-planning CT images can be useful building blocks on the way towards personalised radiotherapy. They may be extended using additional information, e.g. from tumour histology or molecular samples. Nevertheless, deep learning approaches should not be considered a universal remedy, since they also bring drawbacks compared to simpler models. These include increased computational complexity and difficulties in understanding the image-based causes of their predictions, leading to decreased model interpretability.

In this study, we implemented CNNs for the prediction of LRC after primary radiochemotherapy of locally advanced HNSCC based on CT imaging. An ensemble of 3D-CNN models was successfully validated and showed improved performance compared to the 2D-CNN approaches and a clinical model. Risk groups defined by these models differed significantly in LRC. In the future, we aim to assess the robustness and translational ability of our trained models by applying them to data of the prospective HNPrädBio trial of the DKTK-ROG as another independent validation (NCT02059668)60.