Deep learning-based transformation of H&E stained tissues into special stains

de Haan, Kevin; Zhang, Yijie; Zuckerman, Jonathan E.; Liu, Tairan; Sisk, Anthony E.; Diaz, Miguel F. P.; Jen, Kuang-Yu; Nobori, Alexander; Liou, Sofia; Zhang, Sarah; Riahi, Rana; Rivenson, Yair; Wallace, W. Dean; Ozcan, Aydogan

doi:10.1038/s41467-021-25221-2

Deep learning-based transformation of H&E stained tissues into special stains

Article
Open access
Published: 12 August 2021

Volume 12, article number 4884, (2021)
Cite this article

Download PDF

You have full access to this open access article

From

View current issue

Deep learning-based transformation of H&E stained tissues into special stains

Download PDF

44k Accesses
105 Citations
50 Altmetric
4 Mentions
Explore all metrics

Abstract

Pathology is practiced by visual inspection of histochemically stained tissue slides. While the hematoxylin and eosin (H&E) stain is most commonly used, special stains can provide additional contrast to different tissue components. Here, we demonstrate the utility of supervised learning-based computational stain transformation from H&E to special stains (Masson’s Trichrome, periodic acid-Schiff and Jones silver stain) using kidney needle core biopsy tissue sections. Based on the evaluation by three renal pathologists, followed by adjudication by a fourth pathologist, we show that the generation of virtual special stains from existing H&E images improves the diagnosis of several non-neoplastic kidney diseases, sampled from 58 unique subjects (P = 0.0095). A second study found that the quality of the computationally generated special stains was statistically equivalent to those which were histochemically stained. This stain-to-stain transformation framework can improve preliminary diagnoses when additional special stains are needed, also providing significant savings in time and cost.

Digital synthesis of histological stains using micro-structured and multiplexed virtual staining of label-free tissue

Article Open access 06 May 2020

Deep learning-enabled virtual histological staining of biological samples

Article Open access 03 March 2023

MulHiST: Multiple Histological Staining for Thick Biological Samples via Unsupervised Image-to-Image Translation

Introduction

Histological analysis of stained human tissue samples is the gold standard for evaluation of many diseases, as the fundamental basis of any pathologic evaluation is the examination of histologically stained tissue affixed on a glass slide using either a microscope or a digitized version of the histologic image following the image capture by a whole slide image (WSI) scanner. The histological staining step is a critical part of the pathology workflow and is required to provide contrast and color to tissue by facilitating a chromatic distinction among different tissue constituents. The most common stain (otherwise referred to as the routine stain) is the hematoxylin and eosin (H&E), which is applied to nearly all clinical cases, covering ~80% of all the human tissue staining performed globally¹. The H&E stain is relatively easy to perform and is widely used across the industry. In addition to H&E, there are a variety of other histological stains with different properties which are used by pathologists to better highlight different tissue constituents. For example, Masson’s trichrome (MT) stain is used to view connective tissue² and periodic acid-Schiff (PAS) can be used to better scrutinize basement membranes. The black staining in the Jones methenamine silver (JMS) stain offers a sharp contrast to visualize glomerular architecture and enables the pathologist to recognize subtle basement membrane abnormalities resulting from remodeling due to various forms of injury. These features have importance for certain disease types such as nonneoplastic kidney disease³. These non-H&E stains are also called special stains and their use is the standard of care in the pathologic evaluation of certain disease entities including nonneoplastic kidney, liver, and lung diseases, among others.

The traditional histopathology workflow can be time-consuming, expensive, and requires laboratory infrastructure. Tissue must first be sampled from the patient, fixed either through freezing in optimal cutting temperature (OCT) compound, or paraffin embedding, sliced into thin (2–10 μm) sections, and mounted onto a glass slide. Only then can these sections be stained using the desired chemical staining procedure. Furthermore, if multiple stains are needed, multiple tissue sections are cut, and a separate procedure must be used for each stain. While H&E staining is performed using a streamlined staining procedure, the special stains often require more preparation time, effort, and monitoring by a histotechnologist, which increases the cost of the procedure and takes additional time to produce. This can in turn increase the time for diagnosis, especially when a pathologist determines that these additional special stains are needed after the H&E stained tissue has been examined. The tissue sectioning and staining procedure may therefore need to be repeated for each special stain, which is wasteful in terms of resources, materials, and might place a burden on both the healthcare system and patients if there is an urgent need for a diagnosis.

Recognizing some of these limitations, different approaches have been developed to improve the histopathology workflow. Histological staining has been reproduced by imaging rapidly labeled tissue sections (usually by a nuclear staining dye) using an alternative contrast mechanism acquired by e.g., nonlinear microscopy⁴ or ultraviolet tissue surface excitation⁵, and digitally transforming the captured images into user-calibrated H&E-like images⁶. These approaches mainly focus on eliminating tissue fixation from the workflow, targeting rapid intraoperative contrast to unfixed specimens. More recently, computational staining techniques known as virtual staining have been developed. Using deep learning, virtual staining has been applied on label-free (i.e., unstained) fixed and glass slide affixed tissue sections using various modalities such as autofluorescence^7,8, hyperspectral imaging⁹, quantitative phase imaging¹⁰, and others^11,12. Virtual staining of label-free tissue not only has the ability to reduce costs and allow for faster staining, but also allows the user to perform further advanced analysis on the tissue since the destructive additional sectioning and staining process is avoided that can cause the specimen to be depleted leading to e.g., additional/unnecessary biopsies from the patients¹³. Furthermore, virtual staining of label-free tissue enables new capabilities such as the use of multiple virtual stains upon a single tissue section, stain normalization (i.e., standardization), the region-of-interest specific digital blending of multiple stains, all of which are challenging or highly impractically with standard histochemical staining workflows^7,8.

An alternative approach that can be used to bypass histochemical tissue staining is to computationally transform the WSI of an already stained tissue into another stain (this will be referred to as stain transformation). This allows users to reduce the number of physical stains required without making any changes to their traditional histopathology workflow, and also carries many of the benefits of the virtual staining techniques such as improving stain consistency and reduction in stain preparation time. Different stain transformations have been demonstrated in the literature, e.g., the transformation of H&E into MT¹⁴ or transformation of fibroblast activation protein-cytokeratin (FAP-CK), duplex immunohistochemistry (IHC) protocol¹⁵, from images of Ki67-CD8 stained slides. Stain transformations have also been used as a tool to improve the effectiveness of image segmentation algorithms^{16,2a demonstrates three examples of such variations for stains produced by the same lab). However, in order for a stain transformation technique to be effective for any practical application, the network must generalize across this wide sample space. As one of the key features of virtual staining is stain normalization⁷, the network requires data augmentation to better facilitate the learning across a wide input staining distribution. For this purpose, we used a set of eight CycleGAN networks to perform this stain data augmentation of the H&E dataset used to train our stain transformation network. The use of CycleGAN networks to perform a stain normalizing style transfer has been shown to be more effective than traditional stain normalization algorithms²³. Furthermore, they have proven to be highly effective at performing data augmentation for medical imaging²⁴. By applying these CycleGAN augmentation networks to our training image dataset, we were able to successfully generalize to various slides used for blind testing. Three examples of this CycleGAN-based stain augmentation results are reported in Supplementary Fig. 2b, which demonstrates that the three different networks are capable of converting the virtually stained tissue to have H&E distributions which match the distributions seen in Fig. S2a. Furthermore, the results show that the same stain transformation network is consistent across these various distributions as there is little variation among the virtual PAS outputs (Supplementary Fig. 2b). These style normalization/transfer networks used in data augmentation can be easily further expanded upon, if needed, using existing databases of H&E images.}

As we have emphasized earlier, these style transfer networks were only used for H&E stain data augmentation and were not included in our stain transformation loss function. We utilized perfectly registered training images generated by virtual staining of label-free tissue; as a result of this, potential hallucinations or artifacts related to unsupervised training with CycleGANs and unpaired training data are eliminated (as can be seen in Supplementary Fig. 3). When the same CycleGAN architecture used for the data augmentation is applied to the various stain transformations, a number of clear hallucinations occur. These hallucinations are particularly evident for the PAS and Jones Silver stain, where the networks incorrectly label the tubular basement membranes (see Supplementary Fig. 3). The tubules are composed of epithelial cells lining basement membranes that stain black on the Jones stain and magenta on the PAS stain. The brush border lining the luminal surface of the epithelial cells is also normally lightly stained black and magenta by the Jones and PAS stains, respectively. The CycleGAN method incorrectly recognized the basement membranes and tubular brush borders leading to incorrect image generation, which is a very significant error. In contrast, the quality and features of the MT stain appear to be more similar between the two techniques. This is believed to be due to the MT stain being relatively similar to H&E, while the other stains require significant structural changes which can cause hallucinations for CycleGANs. These results and observations highlight the significant advantages of our stain-to-stain transformation network compared to standard CycleGAN-based methods.

It is important to note that the current stain-to-stain network is trained to work with H&E stains performed at a few institutions and imaged by different microscopes from the same vendor/model (Leica Biosystems Aperio AT2 slide scanner). Additional data would be required for the network to generalize to samples imaged using microscopes with different specifications or vendors, or any H&E stains which are performed in a significantly different manner. Furthermore, while this study covers a broad range of diseases, it is still a proof of concept. Future studies should be performed which contain both larger training and test datasets in order to conclusively show the technique may be suitable for diagnostic use. Future work may also apply this technique that we have presented to other biomarkers that are currently labeled with IHC to help target specific conditions.

In addition to histological stains, immunofluorescence and electron microscopy²⁵ based evaluation play significant roles in the standard of care for nonneoplastic kidney biopsy evaluation. In this study, we have attempted to isolate the role of standard light microscopy in the nonneoplastic kidney disease evaluation and therefore these other modalities were not included. However, their application in clinical cases would only serve to support the pathologic final diagnosis and add a layer of further confirmation and safety to this resource-saving stain transformation technique.

In this work, we focused on image transformations from H&E to special stains, since H&E is used as the bulk of the staining procedures, covering ~80% of all the human tissue staining procedures¹. However, other stain-to-stain transformations can also be considered. For example, transformations from special stains to H&E or from immunofluorescence to H&E or special stains could be performed using the presented method. Our approach allows pathologists to visualize different tissue constituents without waiting for additional slides to be stained with special stains, and we demonstrated it to be effective for the clinical diagnosis of multiple renal diseases. Another advantage of the presented technique is that it can rapidly perform the stain transformation (at a rate of 1.5 mm²/s on a consumer-grade desktop computer with two GPUs), while saving labor, time, chemicals, and can significantly benefit the patient as well as the healthcare system.

Methods

Training of stain transformation network

All of the stain transformation networks and virtual staining networks used in this paper were trained using GANs. Each of these GANs consists of a generator (G) and a discriminator (D). The generator is used to perform the transformation of the input images (x_input), while the discriminator network is used to help train the network to generate images, which match the distribution of the ground truth stained images. It does this by trying to discriminate between the generated images (G(x_input)) and the ground truth images (z_label). The generator is in turn taught to generate images, which cannot be classified correctly by the discriminator. This GAN loss is used in conjunction with two additional losses: a mean absolute error (L₁) loss and a total variation (TV) loss. The L₁ loss is used to ensure that the transformations are performed accurately in space and color, while the TV loss is used as a regularizer, and reduces noise created by the GAN loss. Together, the overall loss function is described as:

$${l}_{{{{{{\rm{generator}}}}}}}={L}_{1}\{{z}_{{{{{{\rm{label}}}}}}},G({x}_{{{{{{\rm{input}}}}}}})\}+\alpha \times {{{{{\rm{TV}}}}}}\{G({x}_{{{{{{\rm{input}}}}}}})\}+{\beta }\times {(1-D(G({x}_{{{{{{\rm{input}}}}}}})))}^{2}$$

(1)

where α and β are constants used to balance the various terms of the loss function. The stain transformation networks are tuned such that the L₁ loss makes up ~1% of the overall loss, the TV loss makes up only ~0.03% of the overall loss, and the discriminator loss makes up the remaining ~99% of the loss (relative ratios change over the course of the training). The L₁ portion of the loss can be written as:

$${L}_{1}\left({{{{z}}}}{{{{{\mathscr{,}}}}}}G\right)=\frac{1}{P\times Q}\mathop{\sum}\limits_{p}\mathop{\sum}\limits_{q}{{{{{\rm{|}}}}}}{{{{{z}}}}}_{p,q}-{G({x}_{{{{{{\rm{input}}}}}}})}_{p,q}{{{{{\rm{|}}}}}}$$

(2)

where p and q are the pixel indices and P and Q are the total number of pixels in each image. The total variation loss is defined as:

$${{{{{\rm{TV}}}}}}(G({x}_{{{{{{\rm{input}}}}}}}))=\mathop{\sum}\limits_{p}\mathop{\sum}\limits _{q}{{{{{\rm{|}}}}}}{G({x}_{{{{{{\rm{input}}}}}}})}_{p+1,q}-{G({x}_{{{{{{\rm{input}}}}}}})}_{p,q}{{{{{\rm{|}}}}}}+{{{{{\rm{|}}}}}}{G({x}_{{{{{{\rm{input}}}}}}})}_{p,q+1}-{G({x}_{{{{{{\rm{input}}}}}}})}_{p,q}{{{{{\rm{|}}}}}}$$

(3)

The discriminator network has a separate loss function which is defined as:

$${l}_{{{{{{\rm{discriminator}}}}}}}={D(G({x}_{{{{{{\rm{input}}}}}}}))}^{2}+{(1-D({z}_{{{{{{\rm{label}}}}}}}))}^{2}$$

(4)

A modified U-net¹ neural network architecture was used for the generator, while the discriminator used a VGG-style² network. The U-net architecture uses a set of four up-blocks and four down-blocks, each containing three convolutional layers with a 3 × 3 kernel size, activated upon by the LeakyReLU activation function which is described as:

$${{{{{\rm{LeakyReLU}}}}}}\left(x\right)=\left\{\begin{array}{ccc} x & {{{{{\rm{for}}}}}}\; x\, > \, 0\hfill\\ 0.1\; x & {{{{{\rm{otherwise}}}}}}\end{array}\right.$$

(5)

The first down-block increases the number of channels to 32, while the rest each increase the number of channels by a factor of two. Each of these down-blocks ends with an average pooling layer which has both a stride and a kernel size of two. The up-blocks begin with a bicubic up-sampling prior to the application of the convolutional layers. Between each of the blocks of a certain layer, a skip connection is used to pass data through the network without needing to go through all the blocks. After the final up-block, a convolutional layer maps back to three channels.

The discriminator is made up of five blocks. These blocks contain two convolutional layers and LeakyReLU pairs, which together increase the number of channels by a factor of two. These are followed by an average pooling layer with a stride of two. After the five blocks, two fully connected layers reduce the output dimensionality to a single value, which in turn is input into a sigmoid activation function to calculate the probability that the input to the discriminator network is real, i.e., not generated.

Both the generator and discriminator were trained using the adaptive moment estimation (Adam)²⁶ optimizer to update the learnable parameters. A learning rate of 1 × 10⁻⁵ was used for the discriminator network while a rate of 1 × 10⁻⁴ was used for the generator network. For each iteration of the discriminator training, the generator network is trained for seven iterations. This ratio reduces by one every 4000 iterations of the discriminator to a minimum of one discriminator iteration for every three generator iterations. The network was trained for 50000 iterations of the discriminator, with the model being saved every 1000 iterations. The best generator model was chosen manually from these saved models by visually comparing different models. For all three of the generator networks (MT, PAS, and JMS), the 15,000th iteration of the discriminator was chosen as the optimal model.

The stain transformation networks were trained using pairs of 256 × 256-pixel image patches generated by the class conditional virtual staining network (label-free), downsampled by a factor of 2 (to match 20× magnification). These patches were randomly cropped from one of 1013 712 × 712-pixel images coming from ten unique tissue sections, leading to ~7836 unique patches usable for training. Seventy-six additional images coming from three unique tissue sections were used to validate the network. These images were augmented using the eight stain augmentation networks and further augmented through random rotation and flip** of the images. The diagnoses of each of the samples used for training and validation have been added to Supplementary Tables 2 and 3. Each of the three stain transformation networks (MT, PAS, and JMS) were trained using images generated by the label-free virtual staining networks from the same input autofluorescence images. Furthermore, the images were converted to the YCbCr color space²⁷ before being used as either the input or ground truth for the neural networks.

As this stain transformation neural network performs an image-to-image transformation, it learns to transform specific structures using the ~513 million pixels in the dataset that are independently accounted for in the loss function. Furthermore, since the network learns to convert structures which are common throughout many different types of samples, it can be applied to tissues with diseases that the network was not trained with. When used in conjunction with the eight data augmentation networks which convert the values of these pixels, as well as random rotation and flip** (for an additional 8×) augmentation, there are effectively many billions of pixels which are used to learn the desired stain-to-stain transformation. Because of these advantages, a much smaller number of training samples from unique patients can be used than would be required for a typical classification neural network.

Image data acquisition

All of the neural networks were trained using data obtained by microscopic imaging of thin tissue sections coming from needle core kidney biopsies. Unlabeled tissue sections were obtained from the UCLA Translational Pathology Core Laboratory (TPCL) under UCLA IRB 18-001029, from an existing specimen. The autofluorescence images were captured using an Olympus IX-83 microscope (controlled with the MetaMorph microscope automation software, version 7.10.161), using a DAPI filter cube (Semrock OSFI3-DAPI5060C, EX 377/50 nm EM 447/60 nm) as well as a Texas Red filter cube (Semrock OSFI3-TXRED-4040C, EX 562/40 nm EM 624/40 nm) to generate the second autofluorescence image channel.

In order to create the training dataset for the virtual staining network, pairs of matched unlabeled autofluorescence images and brightfield images of the histochemically stained tissue were obtained. H&E, MT, and PAS histochemical staining were performed by the Tissue Technology Shared Resource at UC San Diego Moores Cancer Center. The JMS staining was performed by the Department of Pathology and Laboratory Medicine, Cedars-Sinai Medical Center, Los Angeles, CA, USA. These stained slides were digitally scanned using a brightfield scanning microscope (Leica Biosystems Aperio AT2 slide, using 40x/0.75NA objective). All the slides and digitized slide images were prepared from an existing specimen. Therefore, this work did not interfere with standard practices of care or sample collection procedures. The H&E image dataset used for the study came from the existing UCLA pathology database containing WSIs of stained kidney needle core biopsies, under UCLA IRB 18-001029. These slides were similarly imaged using Aperio AT2 slide scanning microscopes.

Image co-registration

To train label-free virtual staining networks, the autofluorescence images of unlabeled tissue were co-registered to brightfield images of the same tissue after it had been histochemically stained. This image co-registration was done through a multistep process²⁸, beginning with a coarse matching which was progressively improved until subpixel level accuracy is achieved. The registration process first used a cross-correlation-based method to extract the most similar portions of the two images. Next, the matching was improved using multimodal image registration²⁹. This registration step applied an affine transformation to the images of the histochemically stained tissue to correct for any changes in size or rotations. To achieve pixel-level co-registration accuracy, an elastic registration algorithm was then applied. However, this relies upon a local correlation-based matching. Therefore, to ensure that this matching could be accurately performed, an initial rough virtual staining network is applied to the autofluorescence images^7,8. These roughly stained images were then co-registered to the brightfield images of the stained tissue using a correlation-based elastic pyramidal co-registration algorithm³⁰.

Once the image co-registration is complete, the autofluorescence images were normalized by subtracting the average pixel value of the tissue area for the WSI and subsequently dividing it by the standard deviation of the pixel values in the tissue area.

Class conditional virtual staining of label-free tissue

A class conditional GAN was used to generate both the input and the ground truth images to be used during the training of the presented stain transformation networks (Fig. 2a). This class conditional GAN allows multiple stains to be created simultaneously using a single deep neural network⁸. To ensure that the features of the virtually stained images are highly consistent between stains, a single network must be used to generate the stain transformation network input (virtual H&E) and the corresponding ground truth images (virtual special stains) that are automatically registered to each other as the information source is the same image. This is only required for the training of the stain transformation neural networks and is rather beneficial as it allows both the H&E and special stains to be perfectly matched. Furthermore, an alternative image dataset made up of co-registered virtually stained and histochemically stained fields of view will present limitations due to imperfect co-registration and deformities caused by the staining process. These are eliminated by using a single class conditional GAN to generate both the input and the ground truth images.

This network uses the same general architecture as the network described in the previous section, with the addition of a Digital Staining Matrix concatenated to the network input for both the generator and discriminator⁸. This staining matrix defines the stain coordinates within a given image FOV. Therefore, the loss functions for the generator and discriminator are:

$${l}_{{{{{{\rm{generator}}}}}}}={L}_{1}\{{z}_{{{{{{\rm{label}}}}}}},G({x}_{{{{{{\rm{input}}}}}}},\widetilde{{{{{{\bf{c}}}}}}})\}+\alpha \times {{{{{\rm{TV}}}}}}\{G({x}_{{{{{{\rm{input}}}}}}},\widetilde{{{{{{\bf{c}}}}}}})\}+{\beta }\times {(1-D(G({x}_{{{{{{\rm{input}}}}}}},\widetilde{{{{{{\bf{c}}}}}}}),\widetilde{{{{{{\bf{c}}}}}}}))}^{2}$$

(6)

$${l}_{{{{{{\rm{discriminator}}}}}}}={D(G({x}_{{{{{{\rm{input}}}}}}},\widetilde{{{{{{\bf{c}}}}}}}),\widetilde{{{{{{\bf{c}}}}}}})}^{2}+{(1-D({z}_{{{{{{\rm{label}}}}}}},\widetilde{{{{{{\bf{c}}}}}}}))}^{2}$$

(7)

where $\widetilde{{{{{{\bf{c}}}}}}}$ is a one-hot encoded digital staining matrix with the same pixel dimensions as the input image. When used in the testing phase, the one-hot encoding allows the network to generate two separate stains (H&E and the corresponding special stain) for each FOV.

The number of channels in each layer used by this deep neural network was increased by a factor of two compared to the stain transformation architecture described above to account for the larger dataset size and the need for the network to perform two distinct stain transformations.

A set of four adjacent tissue sections were used to train the virtual staining networks for H&E and the three special stains. The H&E portion of all three of the networks was trained with 1058 1424 × 1424-pixel images coming from ten unique patients, the PAS network was trained with 946 1424 × 1424-pixel images coming from 11 unique patients, the Jones network was trained with 816 1424 × 1424-pixel images coming from ten unique patients, and the MT network was trained with 966 1424 × 1424-pixel images coming from ten unique patients. A list of the samples used to train the various networks, and the original diagnoses of the patients can be seen in Supplementary Table 2. All of the stains were validated using the same three validations slides.

Style transfer for H&E image data augmentation

In order to ensure that the stain transformation neural network is capable of being applied to a wide variety of histochemically stained H&E images, we use the CycleGAN¹⁸ model to augment the training dataset by performing style transfer (Fig. 2b). As discussed, these CycleGAN networks only augment the image data used as inputs in the training phase. This CycleGAN model learns to map between two domains $X$ and $Y$ given the training samples $x$ and $y$, where $X$ is the domain for the original virtually stained H&E and Y is the domain for the H&E image generated by a different lab or hospital. This model performs two map**s ${G:X}\to Y$ and ${F:Y}\to X$. In addition, two adversarial discriminators ${D}_{X}$ and ${D}_{Y}$ are introduced. A diagram showing the relationship between these various networks is shown in Supplementary Fig. 4.

The loss function of the generator ${l}_{{{{{{\rm{generator}}}}}}}$ contains two types of terms: adversarial losses ${l}_{{{{{{\rm{adv}}}}}}}$ to match the stain style of the generated images to the style of histochemically stained images in target domain; and cycle consistency losses ${l}_{{{{{{\rm{cycle}}}}}}}$ to prevent the learned map**s $G$ and $F$ from contradicting each other. The overall loss is therefore described by:

$${l}_{{{{{{\rm{generator}}}}}}}=\lambda {\times l}_{{{{{{\rm{cycle}}}}}}}+\varphi \times {l}_{{{{{{\rm{adv}}}}}}}$$

(8)

where$\lambda$ and $\varphi$ are relative weights/constants. For each of the networks, we set $\lambda$ = 10 and $\varphi$ = 1. Each generator is associated with a discriminator, which ensures that the generated image matches the distribution of the ground truth. The adversarial losses for each of the generator networks can be written as:

$${l}_{{{{{{\rm{adv}}}}}}X\to Y}={\left(1-{D}_{Y}\left(G\left(x\right)\right)\right)}^{2}$$

(9)

$${l}_{{{{{{\rm{adv}}}}}}Y\to X}={\left(1-{D}_{X}\left(F\left(y\right)\right)\right)}^{2}$$

(10)

And the cycle consistency loss can be described as:

$${l}_{{{{{{\rm{cycle}}}}}}}={L}_{1}\left\{y,G\left(F\left(y\right)\right)\right\}+{L}_{1}\left\{x,F\left(G\left(x\right)\right)\right\}$$

(11)

The adversarial loss terms used to train ${{{{{{\rm{D}}}}}}}_{{{{{{\rm{X}}}}}}}$ and ${{{{{{\rm{D}}}}}}}_{{{{{{\rm{Y}}}}}}}$ are defined as:

$${l}_{{D}_{X}}={\left({1-D}_{X}\left(x\right)\right)}^{2}+{D}_{X}{\left(F\left(y\right)\right)}^{2}$$

(12)

$${l}_{{D}_{Y}}={\left({1-D}_{Y}\left(y\right)\right)}^{2}+{D}_{Y}{\left(G\left(x\right)\right)}^{2}$$

(13)

For these CycleGAN models, $G$ and $F$ use U-net architectures similar to the stain transformation network. It consists of three down-blocks followed by three up-blocks. Each of these down-blocks and up-blocks are identical to the corresponding blocks in the stain transformation network. ${D}_{X}$ and ${D}_{Y}$ also have similar architectures to the discriminator network of stain transformation network. However, they have four blocks rather than five blocks as in the previous model.

During the training, the Adam optimizer was used to update the learnable parameters with learning rates of 2 × 10⁻⁵ for both the generator and discriminator networks. For each step of discriminator training, one iteration of training was performed for the generator network, and the batch size for training was set to 6.

A list of the original diagnoses of the samples used to train the CycleGAN stain augmentation networks can be seen in Supplementary Table 3. The same table also indicates how many FOVs were used for each sample used to train the CycleGAN network.

Training of single-stain virtual staining networks

In addition to performing multiple virtual stains using a single neural network, separate networks which only generate one individual virtual stain each were also trained. These networks were used to perform the rough virtual staining that enables the elastic co-registration. These networks use the same general architecture as the stain transformation networks, with the only difference being that the first block in both the generator and the discriminator increases the number of channels to 64. The input and output images are the autofluorescence images and the histochemically stained images, respectively, processed using the image registration described in the image co-registration section.

Implementation details

The image co-registration was implemented in MATLAB using version R2018a (The MathWorks Inc.). The neural networks were trained and implemented using Python version 3.6.2 with TensorFlow version 1.8.0. The timing was measured on a Windows 10 computer with two Nvidia GeForce GTX 1080 Ti GPUs, 64GB of RAM, and an Intel I9-7900X CPU.

Pathologic evaluation of kidney biopsies

An initial study of 16 sections—comparing the diagnoses made with H&E only against the diagnoses made with H&E as well as the stain-transformed special stains—was first performed to determine the feasibility of the technique. For this initial evaluation, 16 nonneoplastic kidney cases were selected by a board-certified kidney pathologist (J.E.Z.) to represent a variety of kidney diseases (listed in Supplementary Data 1). For each case, the WSI of the histochemically stained H&E slide, along with a worksheet that included a brief clinical history, were presented to three board-certified renal pathologists (W.D.W, M.F.P.D., and A.E.S.). The diagnostic worksheet can be seen in Supplementary Table 4. The WSIs were exported to the Zoomify format³¹, and uploaded to the GIGAmacro³² website to allow the pathologists to confidentially view the images using a standard web browser. The WSIs were viewed using standard displays (e.g., LCD Monitor, FullHD, 1920 × 1080 pixels).

In the diagnostic worksheet, the reviewers were given the H&E WSI and brief patient history and asked to make a preliminary diagnosis and quantify certain features of the biopsy (i.e., number of glomeruli and arteries) and provide additional comments if necessary. After a >3-week washout period to reduce the pathologists’ familiarity with the cases, the three reviewing pathologists received, in addition to the same histologically stained H&E WSIs and the same patient medical history, three computationally generated special stain WSIs for each case: MT, PAS, and JMS. Being given these slides, they were asked to provide a preliminary diagnosis for a second time. This >3-week washout period was chosen to be 1 week greater than the College of American Pathologists Pathology and Laboratory Quality Center guidelines³³, ensuring that the pathologists were not influenced by previous diagnoses.

To test the hypothesis that using additional stain-transformed WSIs can be used to improve the preliminary diagnosis, the adjudicator pathologist (J.E.Z.) who was not among the three diagnosticians provided judgment to determine Concordance (C), Discordance (D), or Improvements (I) between the diagnosis quality of the first and second round of preliminary diagnoses provided by the group of diagnosticians (see Supplementary Table 4).

To expand the total number of cases to 58 and perform the third study, (Fig. 3) the same set of steps were repeated. To allow for higher throughput, in this case, the WSIs were uploaded to a custom-built online file viewing server based on the Orthanc server package³⁴. Using this online server, the user is able to swap between the various cases. For each case, the patient history is presented, along with the WSI and the option to swap between the various stains, where applicable. The pathologists were asked to input their diagnosis, the chronicity, and any comments that they might have into text boxes within the interface.

Once the pathologists completed the diagnoses with H&E only as well as with H&E and the stain-transformed special stains, another >3-week washout period was observed. Following this second washout period, the pathologists were given WSIs of the original histochemically stained H&E along with the three histochemically stained special stains coming from serial tissue sections. Two of these cases used in the preliminary study were excluded from the final analysis, as WSIs of the three special stains could not be obtained from serial tissue sections. For the first of these excluded cases, all of the pathologist’s diagnoses were improved using stain-to-stain transformation, and for the second, one of the diagnoses was improved while the other two pathologists’ diagnoses were concordant.

The pathologists’ diagnoses and comments can be found in the file Supplementary Data 1. Pathologist 2 was replaced for the expanded study due to time availability. Therefore, there is a separate page containing the initial study diagnoses for this pathologist.

Statistical analysis

Using the preliminary study of 16 samples, we calculated that a total of 41 samples are needed to show statistical significance (using a power of 0.8 and an alpha level of 0.05 and using a one-tailed t-test). Therefore, the total number of patients was increased to 58 to ensure that the study was sufficiently powered.

A one-tailed t-test was used to determine whether a statistically significant number of improvements were made when using either [H&E and stain-transformed special stains], or [H&E and histochemically stained special stains] over only [H&E] images. The statistical analyses were performed by giving a score of +1 to any improvement, −1 to any discordance, and 0 to any concordance. The score for each case was then averaged among the three pathologists who evaluated the case, and the test showed that the amount of improvement (i.e., if the average score is greater than zero) across the 58 cases was statistically significant.

A chi-squared test with two degrees of freedom was used to compare the proportion of improvements, concordances, and discordances between the methods tested above. The improvements, concordances, and discordances for each pathologist was compared individually.

For all tests, a P value of 0.05 or less was considered to be significant.

Reporting Summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Data supporting the results demonstrated by this study are available within the main text and the Supplementary Information. The full set of images used for the stain quality assessment study can be found in the Supplementary Data 2 file as well as at: https://github.com/kevindehaan/stain-transformation. The full pathologist reports and adjudication results can be found in the Supplementary Data 1 file and on GitHub. Examples of patient sample fields of view can also be found on GitHub at https://github.com/kevindehaan/stain-transformation. For each example, the histochemically stained H&E and stain transformed special stains are shown for the same FOV, while the histochemically stained special stains that are shown come from the same biopsy, through serial tissue sections. Raw whole slide images corresponding to patient specimen were obtained under UCLA IRB 18-001029 from the UCLA Health private database for the current study and therefore cannot be made publicly available.

Code availability

The stain-to-stain transformation-related TensorFlow codes used in this manuscript can be found on GitHub at https://github.com/kevindehaan/stain-transformation.

References

Global Transformational Health Research Team at Frost & Sullivan. Global Tissue Diagnostics Market, Forecast to 2022 (Frost and Sullivan 2018).
Alturkistani, H. A., Tashkandi, F. M. & Mohammedsaleh, Z. M. Histological stains: a literature review and case study. Glob. J. Health Sci. 8, 72–79 (2016).
Article Google Scholar
Walker, P. D., Cavallo, T. & Bonsib, S. M., Ad Hoc Committee on Renal Biopsy Guidelines of the Renal Pathology Society. Practice guidelines for the renal biopsy. Mod. Pathol. 17, 1555–1563 (2004).
Article Google Scholar
Tao, Y. K. et al. Assessment of breast pathologies using nonlinear microscopy. Proc. Natl Acad. Sci. USA 111, 15304–15309 (2014).
Article ADS CAS Google Scholar
Fereidouni, F. et al. Microscopy with ultraviolet surface excitation for rapid slide-free histology. Nat. Biomed. Eng. 1, 957–966 (2017).
Article CAS Google Scholar
Glaser, A. K. et al. Light-sheet microscopy for slide-free non-destructive pathology of large clinical specimens. Nat. Biomed. Eng. 1, 1–10 (2017).
Article Google Scholar
Rivenson, Y. et al. Virtual histological staining of unlabelled tissue-autofluorescence images via deep learning. Nat. Biomed. Eng. https://doi.org/10.1038/s41551-019-0362-y (2019).
Zhang, Y. et al. Digital synthesis of histological stains using micro-structured and multiplexed virtual staining of label-free tissue. Light. Sci. Appl. 9, 78 (2020).
Article ADS CAS Google Scholar
Bayramoglu, N., Kaakinen, M., Eklund, L. & Heikkilä, J. Towards virtual H and E staining of hyperspectral lung histology images using conditional generative adversarial networks. In IEEE International Conference on Computer Vision Workshops (ICCVW) (Cucchiara R., Matsushita, Y. Sebe, N. & Soatto, S.) 64–71 (IEEE, 2017).
Rivenson, Y. et al. PhaseStain: the digital staining of label-free quantitative phase microscopy images using deep learning. Light. Sci. Appl. 8, 23 (2019).
Article ADS Google Scholar
Rana, A. et al. Use of deep learning to develop and analyze computational hematoxylin and eosin staining of prostate core biopsy images for tumor diagnosis. JAMA Netw. Open 3, e205111–e205111 (2020).
Article Google Scholar
Borhani, N., Bower, A. J., Boppart, S. A. & Psaltis, D. Digital staining through the application of deep neural networks to multi-modal multi-photon microscopy. Biomed. Opt. Express 10, 1339–1350 (2019).
Article CAS Google Scholar
Roy-Chowdhuri, S. et al. Collection and handling of thoracic small biopsy and cytology specimens for ancillary studies: guideline from the College of American Pathologists in collaboration with the American College of Chest Physicians, Association for Molecular Pathology, American Society of Cytopathology, American Thoracic Society, Pulmonary Pathology Society, Papanicolaou Society of Cytopathology, Society of Interventional Radiology, and Society of Thoracic Radiology. Arch. Pathol. Lab. Med. https://doi.org/10.5858/arpa.2020-0119-CP (2020).
Levy, J. J., Jackson, C. R., Sriharan, A., Christensen, B. C. & Vaickus, L. J. Preliminary Evaluation of the Utility of Deep Generative Histopathology Image Translation at a Mid-sized NCI Cancer Center. in Proceedings of the 13th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2020) - Volume 3: BIOINFORMATICS,(eds. Maria, E. D., Fred, A. L. N. & Gamboa, H.) 302–311 (SCITEPRESS, 2020).
Lahiani, A., Klaman, I., Navab, N., Albarqouni, S. & Klaiman, E. Seamless virtual whole slide image synthesis and validation using perceptual embedding consistency. IEEE J. Biomed. Health Inform. https://doi.org/10.1109/JBHI.2020.2975151 (2020).
Gadermayr, M., Appel, V., Klinkhammer, B. M., Boor, P. & Merhof, D. In Medical Image Computing and Computer Assisted Intervention – MICCAI 2018 (eds Frangi, A. F., Schnabel, J. A., Davatzikos, C., Alberola-López, C. & Fichtinger, G.) 165–173 (Springer, 2018).
Kapil, A. et al. DASGAN–Joint domain adaptation and segmentation for the analysis of epithelial regions in histopathology PD-L1 images. Preprint at ar**v:1906.11118 [cs, eess] (2019).
Zhu, J.-Y., Park, T., Isola, P. & Efros, A. A. Unpaired image-to-image translation using cycle-consistent adversarial networks. In 2017 IEEE International Conference on Computer Vision (ICCV) (eds Cucchiara, R., Matsushita, Y., Sebe, N. & Soatto, S.) 2242–2251 (IEEE, 2017).
Cohen, J. P., Luck, M. & Honari, S. Distribution Matching Losses Can Hallucinate Features in Medical Image Translation. in Medical Image Computing and Computer Assisted Intervention – MICCAI 2018 (eds. Frangi, A. F., Schnabel, J. A., Davatzikos, C., Alberola-López, C. & Fichtinger, G.) 529–536 (Springer International Publishing, 2018). https://doi.org/10.1007/978-3-030-00928-1_60.
Fujitani, M. et al. Re-staining pathology images by FCNN. In 16th International Conference on Machine Vision Applications (MVA) (eds Maki, A. & Favaro, P.) 1–6 (IEEE, 2019).
Mercan, C. et al. Virtual Staining for Mitosis Detection in Breast Histopathology. in 2020 IEEE 17th International Symposium on Biomedical Imaging (ISBI) (IEEE, 2020). https://doi.org/10.1109/isbi45749.2020.9098409.
Bauer, T. W. et al. Validation of whole slide imaging for primary diagnosis in surgical pathology. Arch. Pathol. Lab. Med. 137, 518–524 (2013).
Article Google Scholar
Shaban, M. T., Baur, C., Navab, N. & Albarqouni, S. Staingan: stain style transfer for digital histological images. In 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019) (eds Carbayo, M. L., Ángel, M. & Ballester, G.) 953–956 (IEEE, 2019).
Sandfort, V., Yan, K., Pickhardt, P. J. & Summers, R. M. Data augmentation using generative adversarial networks (CycleGAN) to improve generalizability in CT segmentation tasks. Sci. Rep. 9, 16884 (2019).
Article ADS Google Scholar
Erlandson, R. A. Role of electron microscopy in modern diagnostic surgical pathology. Mod. Surg. Pathol. https://doi.org/10.1016/B978-1-4160-3966-2.00005-9 (2009).
Kingma, D. P. & Ba, J. Adam: a method for stochastic optimization. In 3rd International Conference on Learning Representations, ICLR (eds. Bengio, Y. & LeCun, Y.) (2015).
Convert RGB color values to YCbCr color space - MATLAB rgb2ycbcr. https://www.mathworks.com/help/images/ref/rgb2ycbcr.html (2020).
Wang, H. et al. Deep learning enables cross-modality super-resolution in fluorescence microscopy. Nat. Methods 16, 103–110 (2019).
Article CAS Google Scholar
Register Multimodal MRI Images - MATLAB & Simulink Example. https://www.mathworks.com/help/images/registering-multimodal-mri-images.html (2020).
Culley, S. et al. Quantitative map** and minimization of super-resolution optical imaging artifacts. Nat. Methods 15, 263–266 (2018).
Article CAS Google Scholar
Zoomify — Zoomable web images! http://zoomify.com/ (2020).
GIGAmacro: Exploring Small Things in a Big Way. https://viewer.gigamacro.com/ (2020).
Pantanowitz, L. et al. Validating whole slide imaging for diagnostic purposes in pathology: guideline from the College of American Pathologists Pathology and Laboratory Quality Center. Arch. Pathol. Lab. Med. 137, 1710–1722 (2013).
Article Google Scholar
Jodogne, S. The orthanc ecosystem for medical imaging. J. Digit. Imaging 31, 341–352 (2018).
Article Google Scholar

Download references

Acknowledgements

The authors acknowledge the funding of the NSF Biophotonics Program (USA). Mei Leng from the Department of Medicine Statistics Core at the UCLA Clinical and Translational Science Institute is also acknowledged for hel** to perform the statistical analysis.

Author information

Authors and Affiliations

Electrical and Computer Engineering Department, University of California, Los Angeles, CA, USA
Kevin de Haan, Yijie Zhang, Tairan Liu, Yair Rivenson & Aydogan Ozcan
Bioengineering Department, University of California, Los Angeles, CA, USA
Kevin de Haan, Yijie Zhang, Tairan Liu, Yair Rivenson & Aydogan Ozcan
California NanoSystems Institute (CNSI), University of California, Los Angeles, CA, USA
Kevin de Haan, Yijie Zhang, Tairan Liu, Yair Rivenson & Aydogan Ozcan
Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
Jonathan E. Zuckerman, Anthony E. Sisk, Alexander Nobori, Sofia Liou, Sarah Zhang & Rana Riahi
Kaiser Permanente Los Angeles Medical Center, Department of Pathology, Los Angeles, CA, USA
Miguel F. P. Diaz
Department of Pathology and Laboratory Medicine, University of California at Davis, Sacramento, CA, USA
Kuang-Yu Jen
Department of Pathology and Laboratory Medicine, Keck School of Medicine of USC, Los Angeles, CA, USA
W. Dean Wallace
Department of Surgery, David Geffen School of Medicine, University of California, Los Angeles, CA, USA
Aydogan Ozcan

Authors

Kevin de Haan
View author publications
You can also search for this author in PubMed Google Scholar
Yijie Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan E. Zuckerman
View author publications
You can also search for this author in PubMed Google Scholar
Tairan Liu
View author publications
You can also search for this author in PubMed Google Scholar
Anthony E. Sisk
View author publications
You can also search for this author in PubMed Google Scholar
Miguel F. P. Diaz
View author publications
You can also search for this author in PubMed Google Scholar
Kuang-Yu Jen
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Nobori
View author publications
You can also search for this author in PubMed Google Scholar
Sofia Liou
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Rana Riahi
View author publications
You can also search for this author in PubMed Google Scholar
Yair Rivenson
View author publications
You can also search for this author in PubMed Google Scholar
W. Dean Wallace
View author publications
You can also search for this author in PubMed Google Scholar
Aydogan Ozcan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

T.L. imaged the unlabeled tissue sections. K.d.H. and Y.Z. processed the data. A.E.S., M.F.P.D., and W.D.W. performed the diagnoses for the initial study. A.E.S., W.D.W, and K.Y.J performed the diagnoses for the expanded study. J.E.Z. chose the cases used to test the study and performed the adjudication. A.E.S., W.D.W., and J.E.Z. performed the stain quality assessment study. A.N., S.L, S.Z., and R.R. digitized and performed quality checks on the digital slides. K.d.H., Y.Z., Y.R., W.D.W., and A.O. prepared the manuscript, and all authors contributed to the manuscript. A.O. supervised the research.

Corresponding authors

Correspondence to Yair Rivenson, W. Dean Wallace or Aydogan Ozcan.

Ethics declarations

Competing interests

Y.R. and A.O. are co-inventors of a pending patent application US20210043331A1, which covers the use of label-free autofluorescence images to generate virtually stained images. K.d.H., Y.Z., Y.R., and A.O. have a pending patent application (PCT/US2020/066708), which covers the use of the stain transformation network and the use of multiple stains being performed through a single neural network. K.d.H., Y.R., W.D.W., and A.O. have a financial interest in the commercialization of deep learning-based tissue staining. J.E.Z. is a paid consultant for Leica Biosystems. The remaining authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

de Haan, K., Zhang, Y., Zuckerman, J.E. et al. Deep learning-based transformation of H&E stained tissues into special stains. Nat Commun 12, 4884 (2021). https://doi.org/10.1038/s41467-021-25221-2

Download citation

Received: 09 September 2020
Accepted: 29 July 2021
Published: 12 August 2021
DOI: https://doi.org/10.1038/s41467-021-25221-2
Springer Nature Limited

This article is cited by

Dual contrastive learning based image-to-image translation of unstained skin tissue into virtually stained H&E images
- Muhammad Zeeshan Asaf
- Babar Rao
- Muhammad Shahmir Abbasi
Scientific Reports (2024)
Rapid deep learning-assisted predictive diagnostics for point-of-care testing
- Seungmin Lee
- Jeong Soo Park
- Jeong Hoon Lee
Nature Communications (2024)
Intraoperative margin assessment for basal cell carcinoma with deep learning and histologic tumor map** to surgical site
- Joshua J Levy
- Matthew J Davis
- Matthew R LeBoeuf
npj Precision Oncology (2024)
Virtual histological staining of unlabeled autopsy tissue
- Yuzhu Li
- Nir Pillar
- Aydogan Ozcan
Nature Communications (2024)
AI-based computational H&E staining in lymphomas
- Laura M. Wake
- Rima Koka
- Michael E. Kallen
Journal of Hematopathology (2024)

Deep learning-based transformation of H&E stained tissues into special stains

Abstract

Similar content being viewed by others

Introduction

Methods

Training of stain transformation network

Image data acquisition

Image co-registration

Class conditional virtual staining of label-free tissue

Style transfer for H&E image data augmentation

Training of single-stain virtual staining networks

Implementation details

Pathologic evaluation of kidney biopsies

Statistical analysis

Reporting Summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation