Introduction: The Haematopoietic System

Haematopoietic stem cells (HSCs) are capable of self-renewal and differentiation into all cell lineages found in the blood, providing the lifelong basis for many vital processes including immunological defence, transport of materials, blood clotting following injury, and oxygenation of the body’s tissues. Under homeostatic conditions, the blood system is required to generate in the region of 1011 neutrophils and red blood cells daily [1]. Although HSCs are predominantly quiescent in vivo, and more rapidly cycling transit amplifying or progenitor cells are responsible for generating this large number of mature blood cells in the short term, HSCs are ultimately responsible for ensuring the long-term production of these progenitor cells and are therefore essential for the lifelong maintenance of haematopoiesis. Under conditions of stress, such as following infection, blood loss or the ablation of mature blood cells via chemotherapy or irradiation, HSCs demonstrate enormous regenerative potential.

This property makes HSCs ideal for treatment of acquired and inherited diseases as they can engraft into a recipient patient’s bone marrow space and repopulate the entire haematopoietic system. HSCs from bone marrow, peripheral blood and cord blood have been developed and applied clinically over the last 60 years, and transplantation is the gold standard treatment for many haematological disorders including lymphoma, leukaemia, thalassemia and anaemias [2]. This treatment is not without drawbacks, which include difficulties associated with finding an HLA-matched donor, graft rejection and the donor cells reacting against the recipient’s tissues (graft versus host disease) [3]. All of these can affect the morbidity and mortality of treated patients.

Correction of Autologous Stem Cells

Since HSCs can be harvested from their bone marrow niche and then re-transplanted back into a new recipient (allogeneic transplant) or the original donor (autologous transplant), they are particularly amenable to ex vivo manipulation using gene therapy, to attenuate, augment or correct mutated disease-causing genes. In an autologous setting, this means that HSCs from a patient can be phenotypically corrected and returned to treat blood diseases, without the need of finding a matched donor or the risk of graft rejection or graft versus host disease. To date, integrating viral vector systems such as attenuated replication-defective lentivirus or gammaretrovirus have been predominantly used to deliver a correcting gene copy into the HSCs of patients with inherited haematologic disorders. These vectors integrate their payload into the genome of target cells, a necessity for longevity of transgene expression in cells that have the potential to self-renew for the lifespan of the patient. The majority of current gene therapy approaches relies on the delivery of a functional copy of a gene’s complementary DNA (cDNA) to express the functioning version of a protein, the lack of which is the cause of disease. This is generally performed ex vivo where patient HSCs are removed from the bone marrow or collected from cells mobilized into the peripheral blood. These cells are then enriched using the CD34 cell surface marker which is prevalent, but not unique, in stem cells and cultured to enable gene delivery using a viral vector. Cells are then returned to the patient by intravenous injection, and the corrected cells home back to the bone marrow, engraft in the correct niche and can produce all mature blood cell lineages expressing the correct protein. While this approach is adequate for treating many disorders, it does have limitations including the size of gene that can be delivered, production of non-physiological levels of RNA from a heterologous promoter and the associated lack of response to relevant transcription factors that control endogenous gene expression. Trials to treat disorders including combined immunodeficiencies, metabolic diseases and globinopathies have shown remarkable benefits in early clinical trials [38] where treatment options can be limited if matched HSC donors cannot be found.

The use of integrating vectors is required for long-term therapy, but this approach has additional associated disadvantages. Lack of control over the genomic integration site of the viral vector means that expression in cell populations is heterogeneous and affected by ‘position effects’ where the chromatin state of surrounding DNA can exert control and restrict gene expression. A more serious consequence of integrating cassettes into genomic DNA in an undirected manner is insertional mutagenesis, whereby the foreign vector sequence can unintentionally interfere with genes flanking the integration site, up-regulating or inhibiting them. This can then have severe consequences if oncogenes or tumour suppressor genes are involved. Early gene therapy trials highlighted these pitfalls, where the gene therapy vehicle contributed to initiation of blood cell clonal expansions and leukaemia [911]. These serious adverse effects directed increased research efforts to produce safety modifications in the vectors, and the newest generation of viruses are considered less mutagenic [8, 12, 13], but the unknown effects of non-targeted integration remain.

Genome Engineering

One solution to the lack of control caused by random vector integration would be to place therapeutic sequences in a precise location within the genome or alternatively repair an endogenous gene locus. This would enable a mutation to be engineered out of cells and replaced with the correct sequence or the faulty dominant negative version of a gene to be simply deleted. Alternatively, exogenous coding regions could be placed in ‘safe harbours’ in intrageneic regions [14] or inserted upstream of an endogenous promoter to provide equivalent physiological levels of expression [15, 16]. This genome engineering approach would avoid the random insertion of foreign DNA and exogenous controlling elements while allowing for the seamless correction of mutations or the exact placement of other sequences.

Gene editing can be achieved by using targeting endonucleases, which create double-strand breaks in host cell DNA. Such breaks can be created using an increasing array of tools, including zinc-finger nucleases, homing endonucleases, transcription activator-like effector nucleases (TALENs) and the clustered regularly interspaced short palindromic repeats/CRISPR-associated protein 9 (CRISPR/Cas9) system, which have been reviewed elsewhere [17, 18] and within this edition. This approach creates a genetic lesion which is recognized by the cell and repaired by either the non-homologous end joining (NHEJ) pathway or homologous recombination (HR). NHEJ is error-prone, often introducing deletions, while HR can repair the break specifically using a DNA template which, in the setting of genome engineering, consists of an exogenous synthetic DNA donor molecule. The synthetic donor DNA spans the genomic locus that is to be modified, with the perfect sequence homology between the arms of the synthetic donor and the endogenous locus directing the two DNA molecules to align. The cells’ HR DNA repair machinery then mediates the replacement of the endogenous genomic locus with the DNA template molecule. Normally, this is a very inefficient process and is dependent on the size of homologous region, the cell type targeted and the cell-cycle status of the cell, as HR is up-regulated at progression into mitosis [1921]. Dramatic improvements in efficiency have been made by creating a double strand break in the target DNA which activates the cellular DNA repair machinery and increases the rate of homologous recombination, but the efficiency of HR even with the use of site-specific nucleases is still limiting for most clinical applications using HSC.

If a double-strand break is engineered and the cell uses NHEJ to repair the lesion, it often leads to a loss of function if the break is within a gene coding region due to the low fidelity of ligating the DNA ends, resulting in varying sizes of genomic insertions or deletions. Creating double-strand breaks without providing any donor sequence can help increase the level of NHEJ and is useful for disrupting a disease-causing allele or removing an entire region of DNA.

Gene Editing in Haematopoietic Stem Cells

Experience of ex vivo manipulation and nucleic acid delivery to HSCs made them an obvious early choice for the application for gene editing. Some haematologic disease backgrounds, such as the immunodeficiency SCID-X1, provide an environment where there is a strong selective advantage for cells that express the wild-type protein. This was observed in a SCID-X1 patient who fortuitously developed a point mutation, resulting in attenuation of the disease-causing mutation in the IL2RG gene, thus allowing the progenitor cells to receive survival, growth and differentiation signals sufficient to reconstitute the patient’s immune system and correct the disease phenotype [22••]. Such a clinical outcome proves that correction of minimal numbers of cells could have a major impact in the treatment of some diseases providing principal efficacy for gene editing in small numbers of primitive HSCs. Building upon this, the IL2RG cDNA has been inserted upstream of the endogenous promoter in CD34+ HSC cell populations, resulting in the expression of the transgene with the correct controlling elements [15], providing an approach for correcting any given mutation within the gene. This paradigm exists in other immunodeficiencies [23] and haematologic disease models where the selective proliferative advantage conferred to corrected cells is elicited at the level of the HSC, such as in the most frequently inherited bone marrow failure syndrome, Fanconi anaemia. In Fanconi anaemia, the phenomenon of reverse mosaicism has been observed for a number of patients, where a single somatic HSC has undergone a spontaneous correcting mutation that has conferred a huge selective advantage on the resulting clone and its progeny [2426] suggesting that even genome editing at a very low efficiency may have clinical impact.

A second approach was directed by observations in people who are resistant to HIV infection. These individuals harbour a homozygous mutation, delta 32, in the CCR5 gene which encodes a co-receptor for HIV cell entry [27, 28]. The importance of this was emphasized by reports of the HIV-positive ‘Berlin patient’ who received an HSC transplantation for leukaemia. The bone marrow donor had a homozygous delta 32 CCR5 mutation. As donor T cells developed from the engrafted stem cells, the HIV could no longer interact with cells, effectively curing him of HIV as well as treating the leukaemia [29]. This has led to efforts using gene editing to disrupt the CCR5 locus as a treatment for HIV using zinc-finger nuclease technology in T cells [30, 31] or in HSC [32••]. An alternative approach has been to use this location as a ‘safe harbour’ to integrate gene expression cassettes with minimal genotoxic consequences [14] due to the fact that cells lacking CCR5 do not appear to have diminished functionality, at least in in vivo experiments [32••].

Building on these approaches, modifications are now being attempted in HSC to treat other diseases. Haemoglobinopathies are caused by mutations within genes that produce different globin chains, with mutations within β globin resulting in β-thalassaemia or sickle cell anaemia. Hoban et al. were the first group to demonstrate that gene editing could be used to correct the sickle mutation in β globin in clinically relevant patient bone marrow CD34+ cells [33•].

The discovery of further elements that control globin switching has revealed new targets for gene editing to treat β-thalassaemia and sickle cell anaemia. Persistence of fetal haemoglobin in adults can reduce or ameliorate symptoms in these diseases, as the reliance on β-globin to form functional haemoglobin molecules is reduced [34]. Bcl11A represses expression of fetal haemoglobin after birth, representing a novel target for gene therapy [35]. Application of CRISPR-Cas9 libraries has assisted the map** of erythroid-specific enhancer regions that control Bcl11A [36] revealing potential clinical targets for DNA disruption in non-coding sequences. Modification of these enhancers could down-regulate fetal haemoglobin repression and help alleviate symptoms of β-globinopathies. As a proof of concept, bcl11a expression has been disrupted in mouse erythroleukaemia cells using TALENs, increasing the level of fetal haemoglobin [37].

Challenges Facing Gene Editing in HSC

Despite promising experiments and trials, major challenges remain for the widespread clinical application of gene editing in HSCs. For the majority of diseases, the efficiency of gene correction or disruption is insufficient. More primitive, long-term repopulating stem cells are thought to be more difficult to correct than more differentiated cells [15] probably largely due to the fact that HSCs are extremely difficult to transfect or electroporate. In addition, a number of groups have identified that certain DNA repair pathways, such as HR-mediated repair, may be differentially regulated in HSCs compared to their more mature progeny [38, 39]. Such defects in DNA repair may also negatively impact on the efficiency of genome editing in HSCs. An associated problem relates to the risks inherent to the creation of double-strand breaks in genomic DNA. Designer nucleases greatly improve the specificity of cutting sequences, but off-target cleavage of DNA does occur, especially where sequences are similar. For instance, when targeting the β globin locus, cleavage was also observed in the δ-globin gene [33•]. In this example, δ-globin is functionally dispensable but highlights the challenge of targeting specific sequences. Measuring off-target cleavage effects also poses a significant problem, as non-specific cleavage can be rapidly repaired, often resulting in small insertions or deletions that would require clonal genome-wide sequencing to identify [17, 40]. Bioinformatic analysis can be used to reveal similar genomic sequences that have an increased chance of being affected, so they can be checked in detail, but this approach is limiting. Gene editing in polyclonal populations is effective to a degree [15, 33•, 41] but from a safety perspective, being able to correct and expand a thoroughly characterized cell or limited population of clones would be preferable. While this is possible in induced pluripotent stem cells which can be differentiated into some lineages of blood cell [42], the technology is not yet available to recreate all blood lineages from pluripotent stem cells. Equally, expansion of HSC in vitro is not yet possible as the media used to achieve cell cycling also leads to differentiation, reducing the engraftment and self-renewal capacity of the cells [43]. The lack of suitable culture conditions to effectively maintain or even expand HSCs in vitro is one of the most significant barriers to effective gene editing of these cells.

Expansion of Haematopoietic Stem Cells

Much effort has been put into discovering pathways that control HSC self-renewal and the maintenance of the stem cell phenotype as in vitro approaches to induce HSC replication cause differentiation and loss of self-renewal capacity. Such information could be practically applied to the expansion of HSCs from sources where the number of cells is limited, such as umbilical cord blood. However, the development of such technology would also potentially create a paradigm shift in the setting of HSC gene editing, as it would facilitate the clonal expansion of individual HSCs that had been subject to ex vivo manipulation.

A number of experimental strategies have been explored to facilitate the expansion of murine and/or human HSCs in vitro. Perhaps the best characterised is the expansion and enhanced engraftment of murine and human HSCs via the ectopic overexpression of HOXB4. HOXB4 is a member of the homeobox transcription factor family which is involved in developmental cell-fate determination and embryonic patterning during development. Overexpression of HOXB4 can expand murine HSCs in vivo and in vitro [4446], and it is also able to mediate expansion of human HSCs, although to a much lesser degree than has been shown for mouse. However, constitutive overexpression of HOXB4 has also been associated with some serious deleterious effects, including pronounced dosage-dependent lineage skewing [47], delays to differentiation [48] and, in large animal models, contributing towards leukaemic transformation by blocking differentiation [49]. Although it has been reported that HSC expansion can also be facilitated by the delivery of HOXB4 protein directly to cells in culture [50, 51], this is a much more modest effect than that achieved by retroviral mediated constitutive overexpression. Clearly, for HOXB4 to become clinically relevant, a methodology must be devised for transient or regulated delivery of HOXB4 itself, or activation of its downstream effectors that are important for HSC expansion. In addition to HOXB4, a significant degree of interest has been shown the use of angiopoietin-like proteins for the maintenance and expansion of human and murine HSCs in vitro. Originally, these proteins were implicated in the expansion of HSCs by the laboratory of Harvey Lodish, who recognized that they were a highly expressed component of the cellular secretome from a fetal liver cell population that could facilitate HSC expansion when co-cultured with HSCs [52]. It was subsequently shown that angiopoietin-like proteins (1, 2, 3, 4, 6 and 7) may assist HSC survival and expansion in vivo in media supplemented with additional growth factors [53, 54].

Another promising candidate for facilitating the expansion of human, but not murine, HSCs in vitro is the inhibition of the aryl-hydrocarbon receptor, which has previously been identified as an important mediator of the cellular response to environmental toxins [55]. A novel small molecule antagonist of the aryl-hydrocarbon receptor, StemRegenin (SR1), was identified during a chemical library screen for factors that could improve the frequency of immunophenotypically defined human HSCs in vitro. It has subsequently been shown that SR1 treatment allows functional HSCs to propagate and even expand in culture [55].

Currently, SR1 probably represents the most likely candidate for translation towards clinical expansion of HSCs. However, a number of other potential experimental strategies for HSC expansion have been documented (Table 1), and with further development, these may prove to be even more effective than SR1.

Table 1 Summary of additional approaches to facilitate HSC expansion

Conclusions

Gene editing in HSCs holds a great deal of promise for the treatment of both inherited and acquired defects within the blood in terms of targeting a cell population that can easily be reintroduced back into patients without the need for complicated surgery and which can subsequently regenerate the whole of the haematopoietic system with modified cells. If the efficiency of gene editing within HSCs can be improved, then it will potentially allow the field to completely overcome toxicities associated with viral vector integration effects that have been documented in several gene therapy clinical trials. Advances in our ability to maintain functional stem cells during in vitro culture/manipulation will be an essential step towards realising the promise of HSC gene editing.