Unraveling spatial cellular pattern by computational tissue shuffling

Laruelle, Elise; Spassky, Nathalie; Genovesio, Auguste

doi:10.1038/s42003-020-01323-3

Article
Open access
Published: 23 October 2020

Volume 3, article number 605, (2020)

Communications Biology

Abstract

Cell biology relies largely on reproducible visual observations. Unlike cell culture, tissues are heterogeneous, making difficult the collection of biological replicates that would spotlight a precise location. In consequence, there is no standard approach for estimating the statistical significance of an observed pattern in a tissue sample. Here, we introduce SET (for Synthesis of Epithelial Tissue), a method that can accurately reconstruct the cell tessellation formed by an epithelium in a microscopy image as well as thousands of alternative synthetic tessellations made of the exact same cells. SET can build an accurate null distribution to statistically test if any local pattern is necessarily the result of a process, or if it could be explained by chance in the given context. We provide examples in various tissues where visible, and invisible, cell and subcellular patterns are unraveled in a statistically significant manner using a single image and without any parameter settings.

Introduction

Since the advent of high throughput dispensing and automated microscopy, methods and software tools for large-scale single-cell image analysis have blossomed and enabled profiling and comparison of large ranges of perturbations on cell cultures^1,2,3,4. Variance estimation and statistical testing in these approaches are achieved by simply producing several replicates per condition. This is made possible by the fact that cell culture permits robust, standardized, and sometimes fully automated replication of a sample condition. In contrast, while it is still possible to detect and quantify a single-cell event in a large tissue slide, the comparison and statistical analysis of such events have remained hampered, apart from stereotypic exceptions, by the imprecision of microdissection, spatial inhomogeneity of samples, and notorious replicate variability⁵. In fact, spatial inhomogeneity is what makes tissues interesting to study in comparison to cell culture, which often spreads a single or a few cell types uniformly but barely matches the spatial organization of these cells in an organism. In tissue samples, the state of a cell within its local context is observed once, and this exact context can in general barely be reproduced with precision. Therefore, as heterogeneity across a sample is the rule, obtaining robust standardized replicates is difficult for a single-cell event and impossible for a small cell patch or cell organization pattern observed locally. The unavailability of reliable replicates of such an event makes comparison of between-versus-within group variance irrelevant and statistical evidence unworkable.

However, what is sought in studies that rely on tissue sample observation are the factors underlying cell organization, developmental process, or disease progression, independent of the variability between observations. From this point, how can one deal with the impossibility of obtaining robust standardized replicates of an event? Is it possible to statistically test for the existence of a local cell-to-cell relationship from a single replicate? How can one assess the existence and detect the heterogeneity of a local phenotype across one tissue sample? All these questions can be summarized in one: is the cell organization, observed at a specific location, driven by molecular or mechanical factors, or is it likely to be expected by chance given the distribution of cell shape and size? Being able to systematically answer this question is of growing importance as bridging the gap between profiles of single-cell gene expression and spatial cell relationships and morphology in tissue samples is within reach^{2). Further details on the design of the parametric distance, the identification of the parameters by fitting to a cell contour and the reconstruction of the tessellation, are provided in the sections below. We also describe how the content of each cell can be preserved in the synthetic images and how statistical significance of any observed cell pattern can be obtained using SET.}

Design of a flexible parametric distance

Let y be a bivariate random vector following a centered standard joint uncorrelated (not necessarily normal) distribution such that E(y) = 0 and cov(y) = I, S a diagonal scaling matrix, R a rotation matrix, and μ a translating vector. Scaling, rotating, and translating y yield x = RSy + μ. Similarly, y can be retrieved from x by inverting the transformation y = (RS)⁻¹ (x−μ). It is then straightforward to show that E(x) = μ, cov(x) = RSSR′ and to retrieve that the Euclidean distance between the origin and y is the Mahalanobis distance (parameterized by μ, R, and S) between the origin and x which is the scaled, rotated, and translated vector y:

$$d_{L_2}(0,{\mathbf{y}}) = \sqrt {{\mathbf{y}}^{\prime} {\mathbf{y}}} = \sqrt {({\mathbf{x}} - {\mathbf{\mu }})^{\prime} ({\mathbf{RSSR}}^{\prime} )^{ - 1}({\mathbf{x}} - {\mathbf{\mu }})} = d_{M({\mathbf{\mu }},{\mathbf{R}},{\mathbf{S}})}({\mathbf{x}}).$$

Independently, the Euclidean distance d_L2 can be rewritten in the following uncommon way:

$$d_{L_2}(0,{\mathbf{y}}) = \sqrt {{\boldsymbol{y}}_1^2 + {\boldsymbol{y}}_2^2} = (1^{\prime} {\mathbf{y}}^{ \circ 2})^{\frac{1}{2}},$$

where the symbol ◦ means that the exponent 2 is applied to the vector y elementwise. By simply replacing y, the Mahalanobis distance d_M can then also be rewritten this way:

$$d_{M({\mathbf{\mu }},{\mathbf{R}},{\mathbf{S}})}({\mathbf{x}}) = d_{L_2}(0,{\mathbf{y}}) = \left( {1^{\prime} \left| {({\mathbf{RS}})^{ - 1}({\mathbf{x}} - {\mathbf{\mu }})} \right|^{ \circ 2}} \right)^{\frac{1}{2}}.$$

Unlike the usual quadratic form of the Mahalanobis distance shown earlier, this form has the compelling advantage of allowing Mahalanobis and Minkowski distances (such as Euclidean, Manhattan, and Chebyshev) to be generalized under a single parametric function, which we name the Minkovski Affine Transform (MAT) distance, by introducing a parameter p that denotes the Minkovski order:

$$d_{{\mathrm{{MAT}}}({\mathbf{\mu }},{\mathbf{R}},{\mathbf{S}},p)}({\mathbf{x}}) = d_{L_p}(0,{\mathbf{y}}) = \left( {1^{\prime} \left| {({\mathbf{RS}})^{ - 1}({\mathbf{x}} - {\mathbf{\mu }})} \right|^{ \circ p}} \right)^{\frac{1}{p}}.$$

Note that if p ≥ 1, d_MAT is a metric (in particular, for p = 2 it is the Mahalanobis distance), whereas if p < 1 the triangle inequality is lost and d_MAT is a semi-metric. This formulation offers the possibility to design flexible distance functions based on the affine transformation of any Minkovski metric. The relationship between the original Minkovski metrics and their transformation into a parameterized MAT distance function is illustrated in Supplementary Fig. 11.
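As an illustration, the MAT distance can be sketched in a few lines of NumPy. The function name `mat_distance` and its argument layout are assumptions made for this example and are not taken from the published code; R and S follow the definitions of the rotation and scaling matrices given in this section.

```python
import numpy as np

def mat_distance(x, mu, alpha, s, p):
    """MAT distance of points x (shape (n, 2)) for one parameter set.

    Hypothetical helper: mu is the center, alpha the rotation angle,
    s = (s1, s2) the scales, and p the Minkovski order.
    """
    c, si = np.cos(alpha), np.sin(alpha)
    R = np.array([[c, si], [-si, c]])           # rotation matrix R
    S = np.diag(s)                              # diagonal scaling matrix S
    centered = np.atleast_2d(x) - np.asarray(mu, dtype=float)
    y = np.linalg.solve(R @ S, centered.T).T    # y = (RS)^-1 (x - mu)
    return np.sum(np.abs(y) ** p, axis=1) ** (1.0 / p)
```

With α = 0 and s₁ = s₂ = 1 the function reduces to the plain Minkowski distance of order p, so the point (3, 4) lies at distance 5 from the origin for p = 2 and at distance 7 for p = 1.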

The level sets of MAT are superellipses that are more flexible than the standard ellipses provided by the Mahalanobis distance. They offer the possibility of modeling roundish rectangular shapes, such as some plant cells, or diamond-like cells. However, they are all symmetric about their center and about the two principal axes. In order to obtain a distance function with level sets possibly matching asymmetric cell shapes, we generalized the MAT distance further by introducing two asymmetric terms a₁ and a₂. Each term weights how much the value along one axis influences the value along the other axis, and vice versa, considering the yet unrotated and unscaled vector y. The Asymmetric Minkovski Affine Transform (AMAT) distance d_AMAT we propose reads:

$$d_{{\mathrm{{AMAT}}}({\mathbf{\mu }},{\mathbf{R}},{\mathbf{S}},{\mathbf{A}},p)}({\mathbf{x}}) = \left( {1^{\prime} e^{ - {\mathbf{AJ}}{\mathrm{{diag}}}({\mathbf{y}}){\mathbf{J}}}\left| {\mathbf{y}} \right|^{ \circ p}} \right)^{\frac{1}{p}}$$

with

$${\mathbf{A}} = \left[ {\begin{array}{*{20}{c}} {{\mathrm{a}}_1} & 0 \\ 0 & {{\mathrm{a}}_2} \end{array}} \right]\quad {\mathbf{J}} = \left[ {\begin{array}{*{20}{c}} 0 & 1 \\ 1 & 0 \end{array}} \right]\quad {\mathrm{and}}\quad {\mathbf{AJ}}{\mathrm{{diag}}}\left( {\mathbf{y}} \right){\mathbf{J}} = \left[ {\begin{array}{*{20}{c}} {{\mathrm{a}}_1{\mathrm{y}}_2} & 0 \\ 0 & {{\mathrm{a}}_2{\mathrm{y}}_1} \end{array}} \right]$$

for the sake of clarity we recall here that

$${\mathbf{\mu }} = \left[ {\begin{array}{*{20}{c}} {\mu _1} \\ {\mu _2} \end{array}} \right],\quad {\mathbf{S}} = \left[ {\begin{array}{*{20}{c}} {{\mathrm{s}}_1} & 0 \\ 0 & {{\mathrm{s}}_2} \end{array}} \right],\quad {\mathbf{R}} = \left[ {\begin{array}{*{20}{c}} {{\mathrm{cos}}(\alpha )} & {{\mathrm{sin}}(\alpha )} \\ { - {\mathrm{sin}}(\alpha )} & {{\mathrm{cos}}(\alpha )} \end{array}} \right],\quad {\mathbf{y}} = \left( {{\mathbf{RS}}} \right)^{ - 1}({\mathbf{x}} - {\mathbf{\mu }}).$$

Altogether, the AMAT distance comprises eight parameters: μ₁, μ₂ the coordinates of the cell, s₁ the length of the longest axis containing μ, s₂ the length of the shortest orthogonal axis containing μ, α the angle of the longest axis containing μ with the x-axis, a₁ the degree of asymmetry about the longest axis, a₂ the degree of asymmetry about the shortest orthogonal axis, and p the Minkovski order. This function offers a distance map whose short-range level sets can model a large panel of closed shapes such as those displayed by cells (Supplementary Fig. 12). Note that, unlike the MAT distance, some combinations of parameters a₁, a₂, and p may in theory lead to an AMAT map that contains critical points at places other than the origin, possibly leading to non-closed or disconnected level sets at long range. In short, AMAT is not guaranteed to be a distance for all combinations of parameters. However, we will see that such a situation can easily be handled, as for our modeling purposes we are only interested in short distances defined locally about the cell membrane, that is, about distance 1 from the origin, where AMAT behaves as expected.
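Because AJ diag(y)J is diagonal, its matrix exponential is simply the diagonal matrix of the exponentials of its entries, so the AMAT distance expands to (e^{−a₁y₂}|y₁|^p + e^{−a₂y₁}|y₂|^p)^{1/p}. The following is a minimal NumPy sketch of that expansion; the function name `amat_distance` and its signature are hypothetical and chosen only for this illustration.

```python
import numpy as np

def amat_distance(x, mu, alpha, s, a, p):
    """AMAT distance of points x (shape (n, 2)); hypothetical helper.

    a = (a1, a2) are the asymmetry terms; a = (0, 0) gives back MAT.
    """
    c, si = np.cos(alpha), np.sin(alpha)
    R = np.array([[c, si], [-si, c]])
    centered = np.atleast_2d(x) - np.asarray(mu, dtype=float)
    y = np.linalg.solve(R @ np.diag(s), centered.T).T   # y = (RS)^-1 (x - mu)
    w1 = np.exp(-a[0] * y[:, 1])    # weight on |y1|^p depends on y2
    w2 = np.exp(-a[1] * y[:, 0])    # weight on |y2|^p depends on y1
    return (w1 * np.abs(y[:, 0]) ** p + w2 * np.abs(y[:, 1]) ** p) ** (1.0 / p)
```

With a₁ ≠ 0 the points (1, 1) and (1, −1) are no longer equidistant from the origin, which is the loss of mirror symmetry that the asymmetry terms are meant to introduce.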

Fitting d_AMAT = 1 to a cell contour

The segmented contour of each cell is subsampled to an arbitrary resolution of N regularly spaced points x_i (typically N = 100). The following sum of squared errors is then minimized for each cell:

$$\mathop {{\min }}\limits_{{\mathbf{\mu }},{\mathbf{R}},{\mathbf{S}},{\mathbf{A}},p} \mathop {\sum }\limits_{i = 1}^N \left( {d_{{\mathrm{{AMAT}}}\left( {{\mathbf{\mu }},{\mathbf{R}},{\mathbf{S}},{\mathbf{A}},p} \right)}\left( {{\mathbf{x}}_{\boldsymbol{i}}} \right) - 1} \right)^2.$$

Note that if a₁ = 0, a₂ = 0, and p = 2 are fixed, then no minimization process is needed: the AMAT distance is the Mahalanobis distance, the location is the centroid of the cell, and the scale and rotation parameters can be obtained by diagonalization of the covariance matrix of the pixels of the cell. If any of a₁, a₂, or p is left free to evolve, then the minimization process is needed for all parameters and these values are instead used for initialization. The eight parameters of the AMAT distance are then initialized to the centroid of the cell for μ₁ and μ₂, the lengths of the principal axes of the cell for s₁ and s₂, the angle of the principal axis with the x-axis for α, a₁ = 0, a₂ = 0, and p = 2. The parameter p enables modeling of squarish cells, the parameters a₁ and a₂ enable modeling of triangular or egg-like cells, and most importantly, combinations of all eight parameters enable a large set of complex cell shapes to be modeled. Whether or not some known parameters are arbitrarily fixed, after this fitting each cell is represented by a vector of eight parameters that describe a specific parametrization of the AMAT distance function. Level 1 of this two-dimensional (2D) function then closely matches the contour of the cell (Supplementary Movies 1–5). For numerical optimization, the L-BFGS-B algorithm available in scipy.optimize was used, as it enabled us to make the process more robust by introducing constraints on the range of values the parameters can take.
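The fitting step can be sketched with `scipy.optimize.minimize` and `method="L-BFGS-B"`. This is an illustrative sketch, not the published implementation: the helper names, the bounds, and the initialization from the covariance of the contour points (rather than of the cell pixels) are all assumptions made for this example, and the synthetic contour is a superellipse of order 4 used only as test data.

```python
import numpy as np
from scipy.optimize import minimize

def amat_distance(x, theta):
    """AMAT distance; theta = (mu1, mu2, s1, s2, alpha, a1, a2, p)."""
    mu1, mu2, s1, s2, alpha, a1, a2, p = theta
    c, si = np.cos(alpha), np.sin(alpha)
    R = np.array([[c, si], [-si, c]])
    y = np.linalg.solve(R @ np.diag([s1, s2]), (x - [mu1, mu2]).T).T
    return (np.exp(-a1 * y[:, 1]) * np.abs(y[:, 0]) ** p
            + np.exp(-a2 * y[:, 0]) * np.abs(y[:, 1]) ** p) ** (1.0 / p)

def loss(theta, contour):
    """Sum of squared errors between the AMAT distance and level 1."""
    return np.sum((amat_distance(contour, theta) - 1.0) ** 2)

def initial_guess(contour):
    """Centroid, principal axes and angle, with a1 = a2 = 0 and p = 2."""
    mu = contour.mean(axis=0)
    eigval, eigvec = np.linalg.eigh(np.cov((contour - mu).T))
    order = np.argsort(eigval)[::-1]          # longest axis first
    s = np.sqrt(2.0 * eigval[order])          # heuristic scale (assumption)
    v = eigvec[:, order[0]]
    alpha = np.arctan2(-v[1], v[0])           # first column of R is (cos, -sin)
    return np.array([mu[0], mu[1], s[0], s[1], alpha, 0.0, 0.0, 2.0])

def fit_cell(contour):
    """Fit the eight parameters under illustrative box constraints."""
    bounds = [(None, None), (None, None), (1e-3, None), (1e-3, None),
              (None, None), (-1.0, 1.0), (-1.0, 1.0), (0.5, 10.0)]
    return minimize(loss, initial_guess(contour), args=(contour,),
                    method="L-BFGS-B", bounds=bounds)

# Synthetic test contour: |y1|^4 + |y2|^4 = 1, scaled by (4, 2), centered at (10, 5).
t = np.linspace(0.0, 2.0 * np.pi, 100, endpoint=False)
y = np.column_stack([np.sign(np.cos(t)) * np.abs(np.cos(t)) ** 0.5,
                     np.sign(np.sin(t)) * np.abs(np.sin(t)) ** 0.5])
contour = y * [4.0, 2.0] + [10.0, 5.0]
result = fit_cell(contour)
```

The fitted parameter vector is available as `result.x` and the residual sum of squared errors as `result.fun`. Tighter or looser bounds may be appropriate depending on the tissue.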

Generation of a tessellation from individual cell metrics

While, for instance, five parameters enable us to model an elliptical shape and eight parameters enable us to model triangular or rectangular shapes, it does not mean that the cell shape will end up being reconstructed exactly as an ellipse, a triangle, or a rectangle. In fact, the competition for space between cells, each equipped with its own distance, permits accurate reconstruction of the cell pavement, without any holes. To reconstruct the original image tessellation of K cells, we aim at performing the following minimization:

$$\mathop {{\min }}\limits_{{{C}}_1, \ldots ,{{C}}_K} \mathop {\sum }\limits_{j = 1}^K \mathop {\sum }\limits_{{\mathbf{x}}_i \in C_j} d_{\mathrm{{AMAT}}\left( {{\mathbf{\mu }}_{{j}},{\mathbf{R}}_{{j}},{\mathbf{S}}_{{j}},{\mathbf{A}}_{{j}},p_j} \right)}\left( {{\mathbf{x}}_{{i}}} \right),$$

where C_j denotes the set of pixels x_i that belong to the cell j with μ_j and R_j (the location and orientation parameters) left free to evolve while S_j, A_j, and p_j (the shape parameters) are fixed. It can be solved using a modified Lloyd algorithm. Lloyd is usually employed to obtain a Voronoi tessellation of a 2D or a 3D space with the standard Euclidean distance^8,36,37. In that case all computed distances along the process are similar and do not depend on parameters. Here, we also aim at performing a tessellation but each compartment uses its own parameterized distance function, as described in the previous section, so as to impose the shape of an actual cell. To our knowledge, Lloyd, with a different metric per cell, was not used for modeling cells. Furthermore, the idea of having each of these metrics matching the properties of a real cell is to our knowledge novel. To reconstruct the original image, the first step is similar to Lloyd and consists of computing the distance of all pixels to all cells (using dedicated AMAT distances) and labeling each of those pixels with the label of its closest cell. In the second step, Lloyd was modified such that the location parameters μ₁, μ₂, and α of each cell are updated by minimizing the sum of square error previously described. Those three parameters only are left free to evolve (except for the incomplete cells at the border of the image for which all parameters are fixed). The five other parameters s₁, s₂, a₁, a₂, and p, describing the cell shape, are estimated once from the original cell segmentations and remain constant along the rest of the iteration process for all cells to maintain their shape. These two steps are repeated until no more pixels change label. At the end of this process, a tessellation that is an accurate approximation of the original cell tessellation is obtained with only eight parameters per cell provided at initialization by fitting (Fig. 1b–d). 
To synthesize a random tessellation based on all the cells of a given image, the exact same process is used over the same image dimensions, but the location and orientation parameters are initialized randomly, while the shape parameters of all cells are still kept constant. This process, applied to the image in Supplementary Fig. 1 to produce its reconstruction by SET and three random SET, can be visualized in Supplementary Movie 6. Figure 2 shows that random SET preserves single-cell properties of various tissues while, as expected, breaking cell relationships.
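The first of the two alternating steps, labeling every pixel with its closest cell under that cell's own distance, can be sketched as follows. This is only the assignment step: the update of μ₁, μ₂, and α and the stopping test are not shown. The function names and the dictionary layout used to hold the parameters of each cell are assumptions made for this example.

```python
import numpy as np

def amat_distance(x, mu, alpha, s, a=(0.0, 0.0), p=2.0):
    """AMAT distance of points x (shape (n, 2)) for one cell."""
    c, si = np.cos(alpha), np.sin(alpha)
    R = np.array([[c, si], [-si, c]])
    y = np.linalg.solve(R @ np.diag(s), (x - np.asarray(mu, dtype=float)).T).T
    return (np.exp(-a[0] * y[:, 1]) * np.abs(y[:, 0]) ** p
            + np.exp(-a[1] * y[:, 0]) * np.abs(y[:, 1]) ** p) ** (1.0 / p)

def assign_labels(shape, cells):
    """Label each pixel with the index of its closest cell.

    shape is (n_rows, n_cols); cells is a list of dicts holding the
    keyword arguments of amat_distance (hypothetical layout).
    """
    rows, cols = np.indices(shape)
    # Pixel coordinates in (x, y) order, one row per pixel.
    pixels = np.column_stack([cols.ravel(), rows.ravel()]).astype(float)
    distances = np.stack([amat_distance(pixels, **cell) for cell in cells])
    return distances.argmin(axis=0).reshape(shape)

cells = [dict(mu=(2.0, 2.0), alpha=0.0, s=(1.0, 1.0)),
         dict(mu=(7.0, 2.0), alpha=0.0, s=(3.0, 3.0))]
labels = assign_labels((5, 10), cells)
```

In this toy example the second cell has a larger scale, so the boundary between the two compartments shifts toward the first cell instead of lying halfway between the two centers as it would in a Voronoi tessellation.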

Null distribution and associated p value

It is important to notice that the reconstruction by SET of the original image could possibly be one sample of the random SET, as the construction process is exactly the same. Only the initialization of the positional parameters (location and orientation) deliberately differs: for the reconstruction, these parameters are the original ones, while in synthetic images they are randomly sampled. This is the foundation of the statistical approach we present: a thousand pictures representing alternative random tessellations of the real image are generated and compared to a reconstruction of that real image using the same process. The statistical significance of any quantitative feature computed from a local group of cells can then be obtained in the following way. The considered feature is computed on each random tessellation. Altogether, the sample distribution of these values approximates the null distribution of that feature. Then, the same feature is computed on the reconstruction of the original image. If the value computed from the reconstruction falls within the null distribution, then by definition the null hypothesis cannot be rejected. If the value obtained lies apart from the null distribution, then a p value can be obtained directly as the fraction of random tessellations that display the same or a more extreme value than the value computed from the reconstruction of the original image. Note that if the computed feature is a sum or the mean of independent and identically distributed events over the image, as for Fig. 5e, the null distribution can be approximated by a Gaussian under the central limit theorem (CLT). The latter approach yields a more precise p value while requiring, in principle, the generation of only one random SET.
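The empirical p value described above amounts to counting how many null values are at least as extreme as the observed one. A minimal sketch follows; the function names and the `tail` argument are assumptions made for this example, and the Gaussian variant here simply fits a normal distribution to a sample of null values, which is one possible reading of the CLT approximation rather than the authors' exact procedure.

```python
import numpy as np
from scipy.stats import norm

def empirical_p_value(observed, null_values, tail="greater"):
    """Fraction of null values as extreme as, or more extreme than, observed."""
    null_values = np.asarray(null_values, dtype=float)
    if tail == "greater":
        return float(np.mean(null_values >= observed))
    return float(np.mean(null_values <= observed))

def gaussian_p_value(observed, null_values, tail="greater"):
    """One-sided p value from a normal fit to the null values (assumption)."""
    null_values = np.asarray(null_values, dtype=float)
    z = (observed - null_values.mean()) / null_values.std(ddof=1)
    return float(norm.sf(z) if tail == "greater" else norm.cdf(z))
```

Note that with 1000 random tessellations the empirical p value cannot resolve values below 1/1000, whereas the Gaussian approximation has no such floor.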

Cell texture mapping

Independent of the reconstruction of the cell tessellation, we additionally transport the texture content of each cell so as to enable the possible statistical analysis of organelle positioning within the context of its cell neighborhood. To this aim we used a particular weighting of barycentric coordinates called the mean value coordinates, developed by Michael S. Floater³⁸. The mean value coordinate method offers a way to smoothly morph the content of an arbitrary polygon to the content of another arbitrary polygon with the same number of vertices. As the synthetic cell contour is about the same shape and size as the original cell contour, we do not expect significant distortion of the content if the orientation of the reference coordinates is similar. Therefore, the segmented contour of each cell and its synthetic counterpart were respectively subsampled to an arbitrary resolution of n ordered points p_i and p_i′ (typically n = 100). The first points p₀ and p₀′ of both contours correspond respectively to the orientation of their major axis so as to align the two shapes. For each pixel of the synthetic shape we then computed the n mean value coordinates relative to the n points p_i′ of the synthetic contour and applied the same n weights to the n points p_i of the original contour to compute a floating point location in the original cell image. A bilinear interpolation of the four closest pixels from that location enabled recovery of a color value that was then used in the synthetic cell (Supplementary Fig. 2). Using this approach, all pixel values of all synthetic cells could be recovered (Figs. 2a, 5b and Supplementary Movie 6).
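For illustration, mean value coordinates of a point v strictly inside a polygon with vertices p_i can be computed from the weights w_i = (tan(α_{i−1}/2) + tan(α_i/2)) / ‖p_i − v‖, where α_i is the angle at v between p_i and p_{i+1}, followed by normalization. The sketch below uses hypothetical function names, assumes that v does not coincide with a vertex or lie on an edge, and leaves out the bilinear interpolation step.

```python
import numpy as np

def mean_value_coordinates(v, polygon):
    """Mean value coordinates of point v for an ordered polygon (n, 2)."""
    d = np.asarray(polygon, dtype=float) - np.asarray(v, dtype=float)
    r = np.linalg.norm(d, axis=1)                 # distances ||p_i - v||
    d_next = np.roll(d, -1, axis=0)               # vectors to p_{i+1}
    cross = d[:, 0] * d_next[:, 1] - d[:, 1] * d_next[:, 0]
    dot = np.sum(d * d_next, axis=1)
    half_tan = np.tan(np.arctan2(cross, dot) / 2.0)   # tan(alpha_i / 2)
    w = (np.roll(half_tan, 1) + half_tan) / r     # roll(…, 1)[i] is tan(alpha_{i-1} / 2)
    return w / w.sum()

def map_to_original(v_synth, contour_synth, contour_orig):
    """Location in the original cell matching v_synth in the synthetic cell."""
    weights = mean_value_coordinates(v_synth, contour_synth)
    return weights @ np.asarray(contour_orig, dtype=float)
```

Applying the weights to the polygon they were computed from returns v itself, which is a convenient sanity check before using them to look up a location in the original cell.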

Alternative approaches

Cell compartments constrain the cell centers to be spread apart from one another, such that they do not behave as freely as a point process approach would essentially model. Therefore, regular point process statistics would hardly be relevant for this type of spatial analysis. We thus chose to compare our method to two other approaches that could be considered for such analysis (Fig. 3). These two other approaches, like ours, seek to compare the image observation to a null distribution that should capture the variation about the null hypothesis stating that cells are organized randomly. The difference between the three methods essentially lies in how that null distribution is built by computational means and how relevant it is.

Alternative approach—Shuffle on a hexagonal grid

The first approach (red distribution Fig. 3) uses a honeycomb grid containing as many hexagonal cells as in the original image^39,40. For each run, cell identities were assigned randomly with respect to the observed cell type ratio (83 stem cells from a total cell count of 190 cells for Fig. 3) and the number of contacts between two stem cells was retrieved (Supplementary Fig. 6A).

Alternative approach—Shuffle on the segmentation

The second approach (gray distribution Fig. 3) uses the segmentation of the original cell pattern and cell identities are shuffled in order to preserve the distribution of the cell shapes while producing a realistic graph of cell adjacency (Supplementary Fig. 6B). In practice, this model produces a null distribution that is close to the one obtained with the honeycomb method (Fig. 3c) and led to close conclusions.
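Both alternative approaches reduce to permuting cell identities over a fixed adjacency graph (the honeycomb grid or the graph derived from the segmentation) and counting contacts between two stem cells. A minimal sketch of that shuffling is given below; the function names and the edge-list representation of the adjacency graph are assumptions made for this example.

```python
import numpy as np

def count_stem_contacts(is_stem, edges):
    """Number of adjacency edges joining two stem cells."""
    is_stem = np.asarray(is_stem, dtype=bool)
    edges = np.asarray(edges)
    return int(np.sum(is_stem[edges[:, 0]] & is_stem[edges[:, 1]]))

def shuffled_null(is_stem, edges, n_runs=1000, seed=0):
    """Null distribution of stem-stem contacts under identity shuffling.

    Each run permutes the identities, so the observed cell type ratio
    is preserved exactly.
    """
    rng = np.random.default_rng(seed)
    return np.array([count_stem_contacts(rng.permutation(is_stem), edges)
                     for _ in range(n_runs)])
```

Because only the identities move, cell positions and shapes are never resampled in this scheme, unlike in random SET.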

Raw image information—Mice

Ependymal images were acquired from E18, P1, and P30 mice. The experiments were performed in conformity with French and European Union regulations and the recommendations of the local ethics committee (Comité d’éthique en experimentation animale no. 005). The date of the vaginal plug was recorded as embryonic day (E) 0.5 and the date of birth as postnatal day (P) 0. Healthy, immunocompetent animals were kept in a 12 h light/12 h dark cycle at 22 °C and fed ad libitum. The mice used in this study include OF1 (Charles River Laboratories) and Centrin2-GFP (CB6-Tg(CAG-EGFP/CETN2)3-4Jgg/J; The Jackson Laboratory).

Raw image information—Immunostainings

Wholemounts of the lateral walls of the lateral LV were dissected²⁷ from animals sacrificed by cervical dislocation and fixed for 15 min in pure methanol at −20 °C. The samples were incubated for 1 h in blocking solution (1× PBS with 0.1% Triton X-100 and 10% fetal bovine serum) at room temperature followed by overnight incubation at 4 °C in the primary antibodies diluted in blocking solution. The primary antibodies used targeted ZO1 (1:100, cell junction marker; Thermo Fisher Scientific), FOP (1:600, centriole marker, Abnova Corporation), Sas6 (1:500, pro-centriole marker, Santa Cruz), -Catenin (1:500, cell junction marker, Millipore). The following day, the samples were stained with species-specific AlexaFluor fluorophore-conjugated secondary antibodies (1:400, Thermo Fisher Scientific or Jackson ImmunoResearch Labs). Nuclei were counterstained with a 1:1500 Hoechst solution (from a 20 mg/ml stock, Sigma-Aldrich), containing the secondary antibodies for 2 h at room temperature.

Finally, the wholemounts were redissected to keep only the thin lateral walls of the LV²⁰ which were mounted with Fluoromount-G mounting medium (Southern Biotech, 0100-01).

Raw image information—Others

The root image is of a Col-0 Arabidopsis thaliana sample that was stained with propidium iodide to label cell walls⁴¹ and imaged with a Zeiss 710 confocal. The root image is a slice from a 3D stack. The shoot apical meristem image is an FM4-64 staining of a Col-0 Arabidopsis thaliana and was acquired with a Leica SP2 confocal as described in ref. ⁴². The 3D stack was flattened with merryproj⁴³. The Drosophila image is originally from ref. ⁴⁴. The membranes are visualized with antibodies against E-Cadherin. The image of chick basilar papilla is originally from ref. ⁴⁵ and was recently used in ref. ⁴⁶. The samples were treated with anti-cingulin and anti-hair cell antigen to visualize membrane junctions and cell identity. The Xenopus epidermis image was acquired from a stage 33 larva. The visualization of membranes and cell identities was made possible by phalloidin labeling of actin, acetylated alpha-tubulin-488 for cilia, and lectin-PNA-594 to label mucins in goblet cells and SSCs⁴⁷. The image was extracted from a 3D stack using the SME algorithm⁴⁸.

Cell segmentation

The minimal input to the SET model is an image of segmented cells in which each pixel takes as its value the integer label shared by all the pixels of the same cell. All 2D cell segmentations presented in this manuscript were performed using a modified version of the “Morphological Segmentation” plugin of the MorphoLibJ package of ImageJ/Fiji⁴⁹. However, this preprocessing step can be performed by the numerous other software packages that exist to segment images of cells. In practice, as only one image is needed, full automation of the detection process is not required and the segmentation can be manually corrected. Prior to segmentation, 2D images of Xenopus larva epidermis and mouse ependyma were extracted from 3D stacks using the SME algorithm⁴⁸.

Computational resources

Lloyd relaxation with hundreds to thousands of cells, each equipped with its own metric, to iteratively redistribute labels over millions of pixels can be a demanding process. Our approach is faster when using only five parameters per cell (xy position, rotation angle, and main axes lengths), as only the covariance matrix of the pixels of each cell needs to be computed and the Mahalanobis distance to each cell equipped with its own matrix can be used. The computation of the latter is made very efficient by the cdist function from the scipy Python package (see the code for implementation details). Therefore, when the cells could correctly reproduce the observed image using five parameters (e.g. E18 and adult ependymal cells) we chose this option. In this case the approximate computation time was between 1 h 30 min and 10 h for 1000 SET simulations on 200 Intel Xeon 2400 MHz CPUs, depending on the image size. When cells were highly asymmetric, such that five parameters did not properly reproduce the observation (e.g. Xenopus), we used eight parameters and it took between 9 and 16 days to compute about 300 simulations on the same computer configuration. All calculations were submitted in parallel thanks to the IBENS computing cluster. Note that the code made available offers the possibility to parallelize computation on all CPUs of a single computer. Furthermore, we anticipate that this type of computation would gain significantly from being ported to GPU computing, as it can be highly parallelized per cell; however, we have not investigated this possibility.
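As a rough illustration of the five-parameter fast path, the Mahalanobis distance from every pixel to every cell can be obtained with `scipy.spatial.distance.cdist`, passing the inverse covariance matrix of each cell through the `VI` argument. The helper name and the per-cell loop are assumptions made for this sketch; it uses raw covariance matrices and does not include any rescaling that may be needed for level 1 to match the cell contour.

```python
import numpy as np
from scipy.spatial.distance import cdist

def mahalanobis_to_cells(pixels, centers, covariances):
    """Distance of each pixel to each cell, one column per cell.

    pixels: (n, 2) coordinates; centers: (k, 2) cell centroids;
    covariances: sequence of k (2, 2) covariance matrices.
    """
    columns = [
        cdist(pixels, center[None, :], metric="mahalanobis",
              VI=np.linalg.inv(cov))[:, 0]
        for center, cov in zip(np.asarray(centers, dtype=float), covariances)
    ]
    return np.column_stack(columns)

pixels = np.array([[2.0, 0.0], [0.0, 2.0]])
distances = mahalanobis_to_cells(pixels, [[0.0, 0.0]], [np.diag([4.0, 1.0])])
labels = distances.argmin(axis=1)   # closest cell for each pixel
```

Each column is independent of the others, which is why this computation parallelizes naturally per cell.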

Ethical aspect

The experiments using mice were performed in conformity with French and European Union regulations and the recommendations of the local ethics committee (Comité d’éthique en experimentation animale no. 005). Mice were bred and maintained in the animal facility of IBENS (Agreement 5502 from the French Ministry of Research and Agreement OGM2014 from the Préfecture de Paris-French ministry of interior). The minimal number of animals was used for the project and the procedures implemented ensured their welfare during their lives.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The chick image from ref. ⁴⁶ was provided by David Sprinzak, Guy Richardson, and Richard Goodyear, and was initially from ref. ⁴⁵, Copyright 1997 Society for Neuroscience. The Drosophila image from ref. ⁴⁴ was provided by Yohanns Bellaiche with the permission of AAAS. The Arabidopsis thaliana root image was provided by Jean-Christophe Palauqui. The Arabidopsis thaliana shoot apical meristem image was provided by Katia Belcram. The Xenopus image was provided by Peter Walentek. Mouse ependyma images were produced by Nathalie Spassky. A copy of these image data is made available on the GitHub page along with the code to run the method: https://github.com/biocompibens/cellmodelling.

Code availability

The SET method and all necessary images and code to reproduce the results are available as Python scripts from GitHub (https://github.com/biocompibens/cellmodelling). Version v1.0.0 can be found here⁵⁰.

References

Smith, K. et al. Phenotypic image analysis software tools for exploring and understanding big image data from cell-based assays. Cell Syst. 6, 636–653 (2018).
Rose, F. et al. Compound functional prediction using multiple unrelated morphological profiling assays. SLAS Technol. https://doi.org/10.1177/2472630317740831 (2017).
Genovesio, A. et al. Automated genome-wide visual profiling of cellular proteins involved in HIV infection. J. Biomol. Screen. 16, 945–958 (2011).
Perlman, Z. E. et al. Multidimensional drug profiling by automated microscopy. Science 306, 1194–1198 (2004).
Bankhead, P. et al. QuPath: Open source software for digital pathology image analysis. Sci. Rep. 7, 1–7 (2017).
Xia, C., Fan, J., Emanuel, G., Hao, J. & Zhuang, X. Spatial transcriptome profiling by MERFISH reveals subcellular RNA compartmentalization and cell cycle-dependent gene expression. Proc. Natl. Acad. Sci. USA. https://doi.org/10.1073/pnas.1912459116 (2019).
Lee, J. H. et al. Highly multiplexed subcellular RNA sequencing in situ. Science 343, 1360–1363 (2014).
Osborne, J. M., Fletcher, A. G., Pitt-Francis, J. M., Maini, P. K. & Gavaghan, D. J. Comparing individual-based approaches to modelling the self-organization of multicellular tissues. PLoS Comput. Biol. 13, e1005387 (2017).
Møller, J. & Waagepetersen, R. Some recent developments in statistics for spatial point patterns. Annu. Rev. Stat. Appl. 4, 317–342 (2017).
Kraus, O. Z. et al. Automated analysis of high-content microscopy data with deep learning. Mol. Syst. Biol. 13, 924 (2017).
Scheeder, C., Heigwer, F. & Boutros, M. Machine learning and image-based profiling in drug discovery. Curr. Opin. Syst. Biol. 10, 43–52 (2018).
Zhang, Z. et al. Pathologist-level interpretable whole-slide cancer diagnosis with deep learning. Nat. Mach. Intell. 1, 236–245 (2019).
Wählby, C., Lindblad, J., Vondrus, M., Bengtsson, E. & Björkesten, L. Algorithms for cytoplasm segmentation of fluorescence labelled cells. Anal. Cell. Pathol. 24, 101–111 (2002).
van der Walt, S. et al. scikit-image: image processing in Python. PeerJ 2, e453 (2014).
Heller, D. et al. EpiTools: an open-source image analysis toolkit for quantifying epithelial growth dynamics. Dev. Cell 36, 103–116 (2016).
Aigouy, B., Umetsu, D. & Eaton, S. Segmentation and quantitative analysis of epithelial tissues. Methods Mol. Biol. 1478, 227–239 (2016).
Mashburn, D. N., Lynch, H. E., Ma, X. & Hutson, M. S. Enabling user-guided segmentation and tracking of surface-labeled cells in time-lapse image sets of living tissues. Cytometry A 81, 409–418 (2012).
Berg, S. et al. ilastik: interactive machine learning for (bio)image analysis. Nat. Methods 16, 1226–1232 (2019).
McQuin, C. et al. CellProfiler 3.0: Next-generation image processing for biology. PLoS Biol. 16, e2005970 (2018).
Mirzadeh, Z., Merkle, F. T., Soriano-Navarro, M., Garcia-Verdugo, J. M. & Alvarez-Buylla, A. Neural stem cells confer unique pinwheel architecture to the ventricular surface in neurogenic regions of the adult brain. Cell Stem Cell 3, 265–278 (2008).
Walentek, P. et al. A novel serotonin-secreting cell type regulates ciliary motility in the mucociliary epidermis of Xenopus tadpoles. Development 141, 1526–1533 (2014).
Dubaissi, E. et al. A secretory cell type develops alongside multiciliated cells, ionocytes and goblet cells, and provides a protective, anti-infective function in the frog embryonic mucociliary epidermis. Development 141, 1514–1525 (2014).
Walentek, P. & Quigley, I. K. What we can learn from a tadpole about ciliopathies and airway diseases: using systems biology in Xenopus to study cilia and mucociliary epithelia. Genesis 55, e23001 (2017).
Zhang, S. & Mitchell, B. J. Centriole biogenesis and function in multiciliated cells. Methods Cell Biol. 129, 103–127 (2015).
Deblandre, G. A., Wettstein, D. A., Koyano-Nakagawa, N. & Kintner, C. A two-step mechanism generates the spacing pattern of the ciliated cells in the skin of Xenopus embryos. Development 126, 4715–4728 (1999).
Stubbs, J. L. Radial intercalation of ciliated cells during Xenopus skin development. Development 133, 2507–2515 (2006).
Mirzadeh, Z., Han, Y.-G., Soriano-Navarro, M., García-Verdugo, J. M. & Alvarez-Buylla, A. Cilia organize ependymal planar polarity. J. Neurosci. 30, 2600–2610 (2010).
Song, H. et al. Planar cell polarity breaks bilateral symmetry by controlling ciliary positioning. Nature 466, 378–382 (2010).
Vőfély, R. V., Gallagher, J., Pisano, G. D., Bartlett, M. & Braybrook, S. A. Of puzzles and pavements: a quantitative exploration of leaf epidermal cell shape. N. Phytol. 221, 540–552 (2019).
Carter, R., Sánchez-Corrales, Y. E., Hartley, M., Grieneisen, V. A. & Marée, A. F. M. Pavement cells and the topology puzzle. Development 144, 4386–4397 (2017).
Sapala, A. et al. Why plants make puzzle cells, and how their shape emerges. Elife 7, e32794 (2018).
Jackson, M. D. B. et al. Global topological order emerges through local mechanical control of cell divisions in the Arabidopsis shoot apical meristem. Cell Syst. 8, 53–65.e3 (2019).
Gibson, W. T. et al. Control of the mitotic cleavage plane by local epithelial topology. Cell 144, 427–438 (2011).
Kondo, T. & Hayashi, S. Mitotic cell rounding accelerates epithelial invagination. Nature 494, 125–129 (2013).
Bergmann, D. C., Lukowitz, W. & Somerville, C. R. Stomatal development and pattern controlled by a MAPKK kinase. Science 304, 1494–1497 (2004).
Sánchez-Gutiérrez, D. et al. Fundamental physical cellular constraints drive self-organization of tissues. EMBO J. 35, 77–88 (2016).
Honda, H. Description of cellular patterns by Dirichlet domains: the two-dimensional case. J. Theor. Biol. 72, 523–543 (1978).
Floater, M. S. Mean value coordinates. Comput. Aided Geometric Des. 20, 19–27 (2003).
Gibson, M. C., Patel, A. B., Nagpal, R. & Perrimon, N. The emergence of geometric order in proliferating metazoan epithelia. Nature 442, 1038–1041 (2006).
Ortiz-Álvarez, G. et al. Adult neural stem cells and multiciliated ependymal cells share a common lineage regulated by the geminin family members. Neuron 102, 159–172.e7 (2019).
Truernit, E. et al. High-resolution whole-mount imaging of three-dimensional tissue organization and gene expression enables the study of Phloem development and structure in Arabidopsis. Plant Cell 20, 1494–1503 (2008).
Grandjean, O. et al. In vivo analysis of cell division, cell growth, and differentiation at the shoot apical meristem in Arabidopsis. Plant Cell 16, 74–87 (2004).
de Reuille, P. B., Bohn-Courseau, I., Godin, C. & Traas, J. A protocol to analyse cellular dynamics during plant development. Plant J. 44, 1045–1053 (2005).
Bosveld, F. et al. Mechanical control of morphogenesis by fat/dachsous/four-jointed planar cell polarity pathway. Science 336, 724–727 (2012).
Goodyear, R. & Richardson, G. Pattern formation in the basilar papilla: evidence for cell rearrangement. J. Neurosci. 17, 6289–6301 (1997).
Shaya, O. et al. Cell-cell contact area affects notch signaling and notch-dependent patterning. Dev. Cell 40, 505–511.e6 (2017).
Walentek, P. Manipulating and analyzing cell type composition of the Xenopus mucociliary epidermis. Methods Mol. Biol. 1865, 251–263 (2018).
Shihavuddin, A. et al. Smooth 2D manifold extraction from 3D image stack. Nat. Commun. 8, 15554 (2017).
Legland, D., Arganda-Carreras, I. & Andrey, P. MorphoLibJ: integrated library and plugins for mathematical morphology with ImageJ. Bioinformatics 32, 3532–3534 (2016).
L-EL, Biocomp team (IBENS, CNRS UMR8197, France) & Genovesio, A. biocompibens/cellmodelling: cellModelling-v1.0.0-submission. https://doi.org/10.5281/zenodo.3937609 (2020).

Acknowledgements

We thank our colleagues who provided image data: David Sprinzak, Guy Richardson, and Richard Goodyear for the chick basilar papilla; Yohanns Bellaiche for the Drosophila dorsal thorax; Jean-Christophe Palauqui for the Arabidopsis thaliana root; Katia Belcram for the Arabidopsis thaliana shoot apical meristem; and Peter Walentek for the Xenopus epidermis. We thank Xavier Morin and Mary Ann Letellier for valuable comments on the manuscript, and Felipe Delestro from the Bioinformatics platform of IBENS for the design of the figures. This work received support under the program “Investissements d’Avenir” launched by the French Government and implemented by the ANR, with the references ANR-10-LABX-54 MEMO LIFE and ANR-11-IDEX-0001-02 PSL* Research University.

Author information

Authors and Affiliations

Institut de Biologie de l’Ecole Normale Supérieure (IBENS), CNRS UMR8197, INSERM U1024, PSL Research University, 46 rue d’Ulm, 75005 Paris, France
Elise Laruelle, Nathalie Spassky & Auguste Genovesio

Contributions

A.G. designed and implemented the SET model; E.L. implemented and performed the image analysis and all numerical experiments; N.S. performed all lab experiments and the microscopy acquisition of the mouse ependymal images. A.G. and E.L. wrote the manuscript. All authors edited the manuscript.

Corresponding authors

Correspondence to Nathalie Spassky or Auguste Genovesio.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Movie 1

Supplementary Movie 2

Supplementary Movie 3

Supplementary Movie 4

Supplementary Movie 5

Supplementary Movie 6

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Laruelle, E., Spassky, N. & Genovesio, A. Unraveling spatial cellular pattern by computational tissue shuffling. Commun Biol 3, 605 (2020). https://doi.org/10.1038/s42003-020-01323-3

Received: 21 January 2020
Accepted: 23 September 2020
Published: 23 October 2020
DOI: https://doi.org/10.1038/s42003-020-01323-3
Springer Nature Limited
