Canalization reduces the nonlinearity of regulation in biological networks

Kadelka, Claus; Murrugarra, David

doi:10.1038/s41540-024-00392-y

Canalization reduces the nonlinearity of regulation in biological networks

Article
Open access
Published: 13 June 2024

Volume 10, article number 67, (2024)
Cite this article

Download PDF

You have full access to this open access article

npj Systems Biology and Applications

Canalization reduces the nonlinearity of regulation in biological networks

Download PDF

296 Accesses
2 Altmetric
Explore all metrics

Abstract

Biological networks, such as gene regulatory networks, possess desirable properties. They are more robust and controllable than random networks. This motivates the search for structural and dynamical features that evolution has incorporated into biological networks. A recent meta-analysis of published, expert-curated Boolean biological network models has revealed several such features, often referred to as design principles. Among others, the biological networks are enriched for certain recurring network motifs, the dynamic update rules are more redundant, more biased, and more canalizing than expected, and the dynamics of biological networks are better approximable by linear and lower-order approximations than those of comparable random networks. Since most of these features are interrelated, it is paramount to disentangle cause and effect, that is, to understand which features evolution actively selects for, and thus truly constitute evolutionary design principles. Here, we compare published Boolean biological network models with different ensembles of null models and show that the abundance of canalization in biological networks can almost completely explain their recently postulated high approximability. Moreover, an analysis of random N–K Kauffman models reveals a strong dependence of approximability on the dynamical robustness of a network.

Models of Gene Regulation: Integrating Modern Knowledge into the Random Boolean Network Framework

Modelling the Evolution of Dynamic Regulatory Networks: Some Critical Insights

Emergent Properties of Gene Regulatory Networks: Models and Data

Introduction

Biological systems are frequently represented as networks, which describe the interactions between different biological entities such as genes, proteins, or metabolites. For instance, gene regulatory networks (GRNs) describe how a collection of genes governs key processes within a cell. A static biological network is completely described by a wiring diagram, which contains nodes (e.g., genes) and edges between nodes, which can be undirected (e.g., in protein–protein interaction networks), directed, and even signed (e.g., in gene regulatory networks). Static networks are, however, insufficient to obtain accurate insights into the often complex, non-linear dynamics of biological networks¹. Dynamic biological networks possess additional information on how each node is regulated by the set of regulators. Popular dynamic modeling frameworks include differential equation models and discrete models. While the former harbors the potential for quantitative predictions, it requires a substantial amount of data for accurate inference of its many kinetic parameters. Therefore, many modelers prefer discrete models and their qualitative predictions. Boolean networks constitute the simplest type of discrete model. Here, each node takes on only two values, and time is discretized as well. The two values can be interpreted as low and high concentration, unexpressed and expressed genes or proteins, etc. Particularly for GRNs, Boolean networks have become increasingly popular. Over 160 Boolean GRN models have been curated by experts in their respective fields - most over the course of the last twelve years². These models range in size from 3 to 302 nodes and describe various processes in many species and kingdoms of life.

Over the last few decades, a number of interesting features of biological networks have been identified. At the structural “wiring diagram" level, biological networks are sparsely connected with an average degree of about 2.5 and are enriched for certain network motifs such as coherent feed-forward loops and complex feedback loops, particularly those that contain many negative interactions^2,3. Dynamically, most biological networks operate at the critical edge between order and chaos^2,4,5. For random N − K Kauffman networks, it is well-established that the network dynamics are generally ordered whenever 2Kp(1 − p) < 1 and chaotic whenever 2Kp(1 − p) > 1; at 2Kp(1 − p) = 1, a phase transition happens^6,7. Here, K is the average degree of the network, while p describes the bias of picking one in the Boolean function’s truth table; the unbiased case corresponds to p = 0.5, and the absolute bias can be quantified by 2∣0.5 − p∣ ∈ [0, 1], or alternatively by 1 − 4p(1 − p) ∈ [0, 1], with 0 corresponding in both cases to the unbiased case. Networks with ordered dynamics typically possess few and short attractors, while chaotic dynamics are characterized by the presence of many long attractors⁸.

The dynamic update rules of Boolean biological network models are also remarkable. They are highly canalizing, redundant, and have a high absolute bias^2,9,10. Canalization is a widely used term in biology. First coined by developmental geneticist Waddington in the 1940s¹¹, it refers to the tendency of developmental processes to follow particular trajectories, despite internal and external perturbations¹². In other words, it refers to low variation in phenotypes despite potentially high variation in genotypes and the environment¹³. Correspondingly, Kauffman introduced Boolean canalizing functions as suitable update rules to describe the gene regulatory logic¹⁴. A canalizing function possesses a canalizing variable, which, when it receives its canalizing input, determines the output of the function, irrespective of all other inputs. If the subfunction which is evaluated when the canalizing variable does not receive its canalizing input is also canalizing, the function is 2-canalizing, etc.¹⁵. If all n variables of a function become eventually canalizing, the function is n-canalizing, also known as nested canalizing¹⁶. The number of variables that become eventually canalizing is known as the canalizing depth¹⁵. Every non-zero Boolean function possesses a unique standard monomial form, from which the canalizing depth and the number of variables in each “layer” of canalization can be directly derived^17,18. As the number of variables increases, canalizing and especially nested canalizing functions become increasingly rare^19,20,21. It is, therefore, very surprising that almost all rules in published Boolean biological network models are canalizing and even nested canalizing^2,9.

Another recently discovered feature of biological Boolean network models is the high approximability of their dynamics by linear and low-order continuous Taylor approximations of the Boolean update rules²². Here, the mean approximation error (MAE) is defined as follows: each update rule of a given Boolean network is replaced by a continuous Taylor approximation of a defined order. The MAE describes the mean squared error between the long-term state of the Boolean network and the long-term state of the continuous approximation when starting from a random initial state (see “Methods” for details). Manicka et al found that biological networks were consistently more approximable (i.e., had lower MAE values) than random networks with the same wiring diagram (i.e., matching degree distribution) and matching update rule bias²².

Many of the described remarkable features of biological networks are interrelated and correlated. For instance, canalizing Boolean functions are, on average, more redundant and have a higher absolute bias than random functions². In this paper, we show that the described increased approximability of biological networks can be almost fully explained by the abundance of canalization, which was not considered in²². We further show that the approximability of a Boolean network depends mostly on its dynamic regime, which in turn depends on its update rules (that is, average degree, bias, and amount of canalization)^2,23. A network with ordered dynamics (i.e., few and short attractors) tends to possess much more approximable dynamics than a network with chaotic dynamics. For questions related to the interpretation of approximability from a biological perspective (e.g., what it means for a biological network to be highly approximable), we refer the interested reader to²².

Results

To test the hypothesis that the increased canalization in biological networks explains their increased approximability, we compared the approximability of published expert-curated biological networks with several ensembles of random null models, similar to²². All random networks possessed the same wiring diagram as the respective biological network. The authors in²² considered an “unconstrained” null model, where each biological update rule was replaced by a non-constant random Boolean function (of the same degree), and a “constrained" model (null model type 1 in this study), which additionally matched the bias of each biological update rule. Neither model accounted for the high degree of canalization in biological networks. We therefore considered two additional null models, one which matches the degree and canalizing depth of each biological update rule (null model type 2), and one which matches degree, canalizing depth and bias (null model type 3; see Methods for details). Note that additional null models could have been considered, even more stringent ones by matching, e.g., the exact canalizing layer structure¹⁷. However, such null models would potentially only possess low variation in their dynamics, complicating the interpretation of results. After excluding highly similar biological models (to avoid the introduction of selection bias) and those with a maximal degree of eleven or more (see Methods), we compared the approximability of 110 published expert-curated biological Boolean network models² and the three different ensembles of null models. As in²², we found that random networks of type 1 were less approximable (Fig. 1). However, random networks that accounted for the increased canalization (null models of type 2 and type 3) exhibited similar levels of approximability as the biological networks. Interestingly, the higher the order of employed approximation the more significant were the differences, quantified by p-values from a Wilcoxon signed-rank test, in the MAE distributions between biological and random networks (Fig. 2). Third-order Taylor approximations recovered the dynamics of biological networks slightly better than those of random networks with matched degree, bias and canalizing depth.

**Fig. 1: Canalization explains the high approximability of biological networks.**

**Fig. 2: Mean approximation errors of biological networks and their random null models.**

To ensure these findings are not simply due to a lack of variation in the dynamics of the null models, especially for the most constrained null model of type 3, we computed the variability in the number of network attractors among the null models (Fig. 3). To enable an exhaustive attractor search, we restricted this analysis to the 29 out of 110 Boolean networks with 15 or fewer nodes. We observed no significant change in the standard deviation of the number of attractors between null models of type 1, 2, and 3, indicating that the stringency of the constraints does not affect the findings shown in Figs. 1 and 2. Overall, these results show that the approximability of biological networks can be almost entirely explained by their high degree of canalization, measured by the canalizing depth.

**Fig. 3: Variability in the dynamics of random null models.**

However, a related question, which has implications for the control of Boolean networks²⁴, remains: Why can the dynamics of biological networks be approximated so well by low-order and even linear continuous Taylor approximations? We hypothesized that the approximability of a Boolean network is strongly correlated with its dynamical robustness, which is typically measured by the average sensitivity⁷ and Derrida values^25,26. That is, we thought that networks with robust dynamics are more approximable because they possess typically few and short attractors²⁷. The robustness metrics describe how a small perturbation affects the network over time. If the perturbation gets on average smaller after each node has been synchronously updated once, the system operates in the ordered regime; if, on average, it increases in size, the system is in the chaotic regime, and if it remains, on average, of similar size, the system exhibits criticality. All biological systems that have thus far been modeled as Boolean networks operate close to the critical edge between order and chaos^2,4,5. This is likely because most update rules in biological networks are nested canalizing - in fact, biological networks are even particularly enriched for insensitive nested canalizing functions (NCFs)²—and the expected average sensitivity of an NCF in any number of variables is 1. On the contrary, the average sensitivity of random Boolean functions with degree k and bias p is 2kp(1 − p). That is, it increases as the number of inputs increases and decreases as the function becomes more biased (where p = 0.5 corresponds to the unbiased case). Boolean networks governed by such random functions thus exhibit a phase transition at 2kp(1 − p) = 1^6,7.

To test which features of a biological network make it highly approximable, we computed Spearman correlations (ρ) between the mean approximation errors of the 110 biological networks and several structure- and dynamics-related properties (Fig. 4). Highly connected networks proved less approximable (ρ > 0.6). This is likely due to the fact that a continuous Taylor approximation of order n matches a Boolean function with k ≤ n variables perfectly everywhere. Thus, the higher the average degree, 〈K〉, of a Boolean network, the lower the chance for perfect matches. Highly connected, large networks generally possess more recurring patterns, so-called network motifs. It was thus not surprising that biological networks with many feed-forward loops (FFLs) and/or feedback loops proved less approximable. Across the three approximation orders, the average degree 〈K〉 and the average effective degree 〈K_e〉, defined in¹⁰, proved roughly equally negatively correlated with network approximability. This is somewhat surprising because the latter, which takes into account the importance of Boolean inputs, is a much stronger predictor of the dynamical robustness of a Boolean network, measured by its mean average sensitivity^2,23. In line with this, the strongest predictor of the mean average sensitivity of a Boolean network, 〈K_e〉〈p(1 − p)〉, as well as the mean average sensitivity itself were both not strongly correlated with the approximability of a Boolean network, with the correlation becoming insignificant for higher-order approximations. One possible explanation for this lack of strong correlation between approximability and dynamic robustness is the fact all these metrics, including the mean average sensitivity (that is, the Derrida coefficient), are ineffective measures of the true dynamic regime of a Boolean network. That is, they cannot accurately predict if two states will eventually (i.e., at time t = ∞) transition to the same network attractor or not^28,29,30. On the contrary, the mean normalized canalizing depth of a biological network as well as the proportion of Boolean rules, which are nested canalizing, were fairly strongly correlated with the approximability for all orders of approximation (∣ρ∣ > 0.4). The higher these values, the more approximable the network. Canalizing rules, especially those with a low sensitivity, have typically a fairly high absolute bias. In line with the result on the proportion of NCFs, more biased networks proved more approximable (∣ρ∣ > 0.5). Biological Boolean rules with a higher number of inputs tend to possess a higher absolute bias⁵. Interestingly, the covariance between p(1 − p) and the in-degree was the only property that became more correlated with approximability at higher approximation orders.

**Fig. 4: Predictors of approximability of biological networks.**

Metrics that explicitly describe dynamic aspects of a Boolean network also exhibited interesting correlations with the approximability. Assuming as in the computation of approximability²² a synchronous update of all nodes, we obtained, through simulation, for each biological network a lower bound of the number of attractors, as well as the approximate mean length of the attractors, the proportion of steady state attractors and the entropy of the basin sizes (see “Methods”). While the third-order approximability was not correlated with any of these metrics, networks with more attractors, a lower proportion of steady state attractors, and higher entropy possessed dynamics that were less approximable at first and second order. This goes against our hypothesis that networks with robust dynamics are highly approximable since the presence of many long attractors, and concomitant high entropy is associated with Boolean networks that operate in the chaotic regime²⁷. We further lack an explanation as to why the correlation of the MAE with dynamics-related metrics generally decreases when considering higher-order approximations, while this appears not to be the case for structural metrics.

Given the apparent correlation between many of these structure- and dynamics-related network properties (Supplementary Fig. 1), we employed a linear LASSO regression³¹ with variable regularization strength to identify the most important predictors of first-, second-, and third-order approximability of the biological networks (Fig. 5). First-order and second-order approximability proved well-explained by a linear model involving mean absolute bias, average effective degree or average degree and the number of 3- or 4-loops. Interestingly, the best predictors of third-order approximability included a number of network motif counts (number of 3- and 4-loops as well as the number of coherent FFLs). We lack a hypothesis that may serve as an explanation for this finding, beyond the trivial observation that the signal-to-noise ratio in the distribution of order 3 MAE values is substantially lower than in those for order 1 and order 2 MAE values (Fig. 2).

**Fig. 5: Regularization path of a linear LASSO regression to identify best predictors of approximability.**

To rule out potential confounders such as differences in network size, average degree as well as degree distribution, we considered modified N–K Kauffman networks, first defined in³². In these random networks of size N, each node has a constant degree K. The Boolean update rule of each node is generated by drawing 2^K times randomly with replacement from {0, 1} with probability 1 − p and bias p, respectively. We further required the wiring diagram of each network to be strongly connected since the dynamics decouple otherwise³³. Networks with a higher absolute bias exhibited more approximable dynamics (Fig. 6). Moreover, sparse networks (i.e., with low in-degree) were, on average, more approximable. Interestingly, the MAE did not always decrease as the approximation order increased. For unbiased networks with high in-degree (e.g., K = 5, p = 0.5), the MAE was very close to the maximally observed value of 0.25, even when using fourth-order Taylor approximations. Low-degree functions with a high absolute bias exhibit the highest degree of canalization, irrespective of whether canalization is measured on the variable level^14,16 or the function level^10,34 (Fig. 7). The amount of canalization in N–K Kauffman networks correlates thus highly with their approximability.

**Fig. 6: Effect of bias and in-degree on the approximability of the dynamics of Boolean networks.**

**Fig. 7: Average canalization of Boolean functions with specific bias and degree.**

Since a Boolean function with K inputs is perfectly matched everywhere by a continuous Taylor approximation of order K, the MAE values were zero in these cases. If only J < K of the inputs of a Boolean function are essential, then the Jth order Taylor approximation already provides a perfect match. Note that a Boolean input is non-essential if a change in this input never changes the output of the function. For example, f(x, y) = x has a non-essential input y. To rule out a potentially confounding effect created by perfect matches, we required, in a sensitivity analysis, all update rules to be non-degenerated, i.e., to contain only essential variables (Supplementary Fig. 2). Most MAE values were slightly higher, likely due to the higher effective degree. Qualitatively, the results were, however, very similar.

Combining all 2000 random networks (100 each for combinations of constant in-degree K ∈ {2, 3, 4, 5} and bias p ∈ {0.1, 0.2, 0.3, 0.4, 0.5}), we computed, as before, the Spearman correlation between MAE values and metrics that explicitly describe network dynamics. The dynamical robustness of a network, measured by the mean average sensitivity, was strongly positively correlated with first-, second-, and third-order MAE values (ρ > 0.75; Fig. 8). Given that the average sensitivity of random Kauffman networks is 2Kp(1 − p)⁷, this agrees qualitatively with the results from Fig. 6. Also in line is the finding that random networks are more approximable if they have few and short attractors, a high proportion of steady states, and low entropy in the distribution of the basin sizes. These four properties characterize networks that operate mostly in the ordered and critical dynamical regime. As observed for the biological networks, the correlations were consistently weaker when considering higher-order approximations. Note, however, that by design of the computational experiment, 25% (50%) of the networks perfectly match their second-order (third-order) approximation, which certainly contributed to weaker correlations.

**Fig. 8: Predictors of approximability of random networks.**

To study the effect of canalization on the nonlinearity of regulation in more detail, we modified the random networks such that the update rules were restricted to specific classes of functions. First, we compared the approximability of random networks governed by 4-variable functions with different minimal canalizing depths (see Methods). While networks without required canalization were hardly approximable (MAE ≈ 0.25), the restriction to canalizing update rules gave rise to more approximable dynamics (Fig. 9a). Canalizing networks became increasingly more approximable as the order of the Taylor approximations increased. On the other hand, networks governed by arbitrary 4-input functions were not better approximated by higher-order Taylor approximations, unless obviously perfect matches were used. We note that functions with a higher canalizing depth are, however, on average, also less sensitive³⁵ and exhibit a higher absolute bias (Fig. 7, Supplementary Fig. 1).

**Fig. 9: The approximability of Boolean network dynamics depends on canalization.**

While the canalizing depth provides a crude measure of the amount of canalization in a Boolean function, more detailed information is contained in the canalizing layer structure^17,18,35. To investigate this, we compared the approximability of random networks, each governed entirely by 4-variable NCFs but with different layer structure. Networks governed by NCFs with layer structure k₁ = 4, e.g., an AND-NOT function ${x}_{1}\wedge {\bar{x}}_{2}\wedge {\bar{x}}_{3}\wedge {x}_{4}$, are highly approximable (Fig. 9b). On the other hand, networks governed by NCFs with layer structure k₁ = 1, k₂ = 3, e.g., functions such as x₁ ∨ (x₂ ∧ x₃ ∧ x₄), are much less approximable. Again, as the approximability of these networks decreases, the sensitivity of the underlying NCFs increases, and the absolute bias decreases³⁵.

Discussion

The idea of a probabilistic generalization of Boolean logic dates back all the way to George Boole³⁶. In this manuscript, we study in depth a recent implementation of this idea: using continuous Taylor approximations of Boolean functions to approximate the dynamics of a Boolean network. We show that the high approximability of biological networks first postulated in²², can be almost entirely explained by the abundance of canalization in biological networks. We conjecture that the remaining higher approximability of biological networks is due to the reported increased occurrence of insensitive canalizing rules in biological networks². Through a computational analysis of random networks, we show that the dynamical robustness of a network strongly influences its approximability: Networks with low mean average sensitivity, operating in the ordered and critical dynamical regime and characterized by few and short attractors, possess generally more approximable dynamics. In line with this, networks governed by canalizing or even nested canalizing functions, which possess a high absolute bias and are insensitive to perturbations, proved more approximable. These two findings match because such canalizing networks are known to give rise to particularly ordered dynamics with few and short attractors, mainly steady states^35,37,38.

This study possesses a number of limitations. First, we analyze published expert-curated biological Boolean network models. Since any human is biased, the curated models are affected by bias as well. A shared bias among many modelers may give rise to abundant features in these models that are not due to evolutionary or biological reasons but purely due to this bias. In general, we cannot rule this out unless our understanding of the biology underlying these network models improves dramatically. Second, the approximability of Boolean networks, quantified by the mean approximation error and introduced in²², has potential shortcomings as well. It fails to consider the fact that two states may eventually transition to the same attractor but be time-shifted. Repeating the analyses in this study with a future version of approximability that considers these two states as dynamically equal, similar to the phenotypical robustness, defined in³³, or the quasicoherence, defined in²⁸, would be very interesting. However, the fact that the approximability compares the dynamics of discrete and continuous models will likely complicate this endeavor.

Assuming biological Boolean network models are worth investigating despite their biases, fully disentangling the relative contribution of the related properties canalization, bias, and sensitivity on approximability (see e.g., Fig. 9b) constitutes one of several open questions. Moreover, it remains to be investigated how well non-perfect continuous approximations of Boolean networks perform in the context of predicting control targets or specific dynamical features. A more technical question is whether Boolean functions that can be well approximated by low-order continuous extensions give rise to more approximable Boolean networks.

Methods

Boolean networks

A Boolean network F in variables x₁, …, x_n can be viewed as a function on binary strings of length n, which can be described coordinate-wise by n Boolean update functions f_i: {0, 1}ⁿ → {0, 1}. Every Boolean network defines a canonical map, where the functions are synchronously updated:

$$F:{\{0,1\}}^{n}\to {\{0,1\}}^{n},\,F({x}_{1},\ldots ,{x}_{n})=({f}_{1}(x),\ldots ,{f}_{n}(x)).$$

(1)

In this paper, we only consider this canonical map, i.e., we only consider synchronously updated Boolean networks.

While possible, most update functions in a Boolean network do not depend on all n variables. The wiring diagram describes the dependencies. It contains n nodes, corresponding to the x_i, and a directed edge from x_i to x_j if f_j depends on x_i (that is, if f_j(x₁, …, x_i = 0, …, x_n) ≠ f_j(x₁, …, x_i = 1, …, x_n) for at least some (x₁, …, x_i−1, x_i+1, …, x_n) ∈ {0, 1}ⁿ⁻¹). If f_j depends on x_i, x_i is a essential variable. Otherwise, it is non-essential. From the wiring diagram, the degree of each node can be derived.

Metrics describing Boolean network dynamics

A second graph associated with a synchronously updated Boolean network F, the state space, contains as nodes the 2ⁿ binary strings and a directed edge from x ∈ {0, 1}ⁿ to y ∈ {0, 1}ⁿ if F(x) = y. Each connected component of the state space corresponds to a basin of attraction, consisting of a directed loop, the attractor, as well as trees feeding into the attractor. Attractors can be steady states (also known as fixed points) or limit cycles. Due to its finite size, all states in a Boolean network eventually transition to an attractor. Every attractor in a biological network model typically corresponds to a distinct phenotype³⁹.

Since the number of nodes, n, in the investigated biological Boolean network models differs from 3 to 302, some of the state spaces are huge (size 2ⁿ). We therefore used the following procedure to approximate several dynamics-related metrics. For each biological network F, we randomly picked 1000 different initial values x₀ ∈ {0, 1}ⁿ. For each x₀, we synchronously updated F until a repeated state was reached, indicating the arrival at an attractor. The number of updates between first and second transition to the repeated state corresponds to the length of the attractor. This process yields a non-empty list of attractors {A₁, …, A_s} of length {L₁, …, L_s} with corresponding basin sizes {B₁, …, B_s}. We used $\frac{1}{s}{\sum }_{i}{L}_{i}$ as the approximate mean length of the attractor and $\frac{1}{s}{\sum }_{i}1({L}_{i}=1)$ as the approximate proportion of steady state attractors. We considered an alternative version of these two measures, weighted by the relative basin sizes (that is, $\frac{1}{1000}{\sum }_{i}{B}_{i}{L}_{i}$ and $\frac{1}{1000}{\sum }_{i}{B}_{i}1({L}_{i}=1)$). Since the alternative versions differed barely from the respective base versions (Spearman correlations of ρ > 0.95 across the 110 investigated biological networks), we decided to only use the base versions in the analysis. We approximated the entropy of the basin sizes as $-\frac{1}{1000}{\sum }_{i}\ln (\frac{{B}_{i}}{1000}){B}_{i}\in [0,\infty )$. Note that to ensure we use the same method, we approximated the state space even for networks with small state space.

Finally, we used s as the lower bound of the number of attractors. In a network with many attractors, we almost certainly fail to discover all attractors when starting from only 1000 random states. However, all attractors with a large basin size are discovered with high probability.

For the random Boolean networks of fixed size n = 15, analyzed in Figs. 6, 8, 9 and Supplementary Fig. 2, we computed the entire state space. All dynamics-related metrics, including the number of network attractors, are therefore exact in these analyses.

Continuous extensions of Boolean functions

To compute the approximability of Boolean networks, we use the same approach as in²². We start by defining continuous extensions of Boolean functions. Any Boolean function f: {0, 1}ⁿ → {0, 1} is defined in the corners of the n-dimensional hypercube, {0, 1}ⁿ, and can be extended to the entire hypercube [0, 1]ⁿ by defining a function $\hat{f}:{[0,1]}^{n}\to [0,1]$ such that $\hat{f}(x)=f(x)$ for all x ∈ {0, 1}ⁿ. Specifically, we employ a probabilistic generalization of Boolean logic, already introduced by George Booole³⁶. We consider random variables X_i: {0, 1} → [0, 1] with Bernoulli distributions and set p_i = Prob(X_i = 1). Let X = X₁ × ⋯ × X_n be the product of random variables. Then, we define

$$\hat{f}({p}_{1},\ldots ,{p}_{n})=\sum\limits_{\begin{array}{c}x\in X:\\ f(x)=1\end{array}}\prod\limits_{i=1}^{n}{\hat{p}}_{i}$$

(2)

where

$${\hat{p}}_{i}=\left\{\begin{array}{ll}{p}_{i}\quad &\,{{\mbox{if}}}\,\,{x}_{i}=1,\\ 1-{p}_{i}\quad &\,{{\mbox{if}}}\,\,{x}_{i}=0.\end{array}\right.$$

(3)

By this definition, $\hat{f}:{[0,1]}^{n}\to [0,1]$ is a continuous function that satisfies $\hat{f}(x)=f(x)$ for all x ∈ {0, 1}ⁿ²².

Taylor polynomials of Boolean functions

Since $\hat{f}$ is a continuous-variable function, we can consider different orders of approximation for $\hat{f}$ using its Taylor expansion. As described in²², $\hat{f}$ is a square-free polynomial and its Taylor expansion is finite. More specifically, the nth order approximation will match $\hat{f}$ perfectly, and if only m < n inputs of f are essential, then the mth order approximation already matches $\hat{f}$ perfectly.

For a given α = (α₁, …, α_n) ∈ {0, 1}ⁿ and x ∈ [0, 1]ⁿ, we define

$$| \alpha | ={\alpha }_{1}+\cdots +{\alpha }_{n},$$

(4)

$${x}^{\alpha }={x}_{1}^{{\alpha }_{1}}{x}_{2}^{{\alpha }_{2}}\cdots {x}_{n}^{{\alpha }_{n}},$$

(5)

$${\partial }^{\alpha }\hat{f}={\partial }_{1}^{{\alpha }_{1}}{\partial }_{2}^{{\alpha }_{2}}\cdots {\partial }_{n}^{{\alpha }_{n}}\hat{f}=\frac{{\partial }^{| \alpha | }\hat{f}}{{\partial }_{1}^{{\alpha }_{1}}{\partial }_{2}^{{\alpha }_{2}}\cdots {\partial }_{n}^{{\alpha }_{n}}},$$

(6)

with the convention that ${\partial }_{i}^{0}\hat{f}\equiv \hat{f}$. For p ∈ [0, 1]ⁿ, we have

$$\hat{f}(x)=\sum\limits_{\alpha \in {\{0,1\}}^{n}}\frac{{\partial }^{\alpha }\hat{f}(p)}{| \alpha | !}{(x-p)}^{\alpha }=\hat{f}(p)+\sum\limits_{\begin{array}{c}\alpha \in {\{0,1\}}^{n}\\ 0 < | \alpha | \le n\end{array}}\frac{{\partial }^{\alpha }\hat{f}(p)}{| \alpha | !}{(x-p)}^{\alpha }.$$

(7)

If $p=(\frac{1}{2},\ldots ,\frac{1}{2})$, which represents the unbiased selection of each variable, then $\hat{f}(p)$ equals the output bias of f, as shown in²². The Taylor decomposition yields different approximations of a Boolean function by restricting the sum in Eq. (7) to α with ∣α∣ ≤ m ≤ n. The Taylor polynomial of order m is given by

$${\hat{f}}^{(m)}(x)=\sum\limits_{\begin{array}{c}\alpha \in {\{0,1\}}^{n}\\ | \alpha | \le m\end{array}}\frac{{\partial }^{\alpha }\hat{f}(p)}{| \alpha | !}{(x-p)}^{\alpha }$$

(8)

Approximability of a Boolean network by continuous extensions

Let F = (f₁, ⋯ , f_n): {0, 1}ⁿ → {0, 1}ⁿ be a Boolean network. We define the mth order approximation of F to be

$${\hat{F}}^{(m)} =\left(\max\left(0,\min\left(1,{\hat{f}}_1^{(m)}\right)\right),\ldots,\max\left(0,\min\left(1,{\hat{f}}_n^{(m)}\right)\right)\right):[0,1]^n\to[0,1]^n,$$

(9)

where the update functions of ${\hat{F}}^{(m)}$ are the mth order Taylor approximations of the update functions of F, ${\hat{f}}_{i}^{(m)}$ as defined in Equation (8), rescaled to the interval [0, 1].

With this, we can define the mean approximation error (MAE) as the mean squared error between the long-term state of the Boolean network and the long-term state of its continuous approximation. That is,

$$\,{{\mbox{MAE}}}\,(F,m)=\frac{1}{{2}^{n}}\sum\limits_{{x}_{0}\in {\{0,1\}}^{n}}\parallel {F}^{\infty }({x}_{0})-{\hat{F}}^{(m),\infty }({x}_{0}){\parallel }^{2}$$

(10)

where F^∞(x₀) and ${\hat{F}}^{(m),\infty }({x}_{0})$ describe the long-term state of the Boolean network F and its mth order approximation, respectively. In practice, we approximated the MAE, using the Python library boolion²², by updating both F and ${\hat{F}}^{(m)}$ synchronously 25 times and using 1000 random initial values X ⊂ {0, 1}ⁿ. That is, we approximate MAE(F, m) by computing

$$\frac{1}{1000}\mathop{\sum}\limits_{{x}_{0}\in X}\parallel {F}^{\infty }({x}_{0})-{\hat{F}}^{(m),\infty }({x}_{0}){\parallel }^{2}.$$

(11)

Linear LASSO regression

To better determine the relative importance of a number of correlated structure- and dynamics-related network properties on approximability, we performed a multivariable linear LASSO regression³¹. Assume $\tilde{{{{\bf{y}}}}}$ describes the MAE of order k = 1, 2, or 3 for the N = 110 biological networks, and $\tilde{{{{{\bf{x}}}}}_{{{{\bf{i}}}}}},i=1,\ldots ,d$ describes the explanatory variables shown in Fig. 4 (d = 24). To enable an appropriate interpretation of the results, we first scaled each vector to have mean 0 and standard deviation 1, yielding y and x_i. We then solved the following minimization problem using sklearn.linear_model in Python 3.10:

$$\mathop{\min }\limits_{\beta \in {{\mathbb{R}}}^{d}}\left\{\frac{1}{N}\parallel {{{\bf{y}}}}-X\beta {\parallel }_{2}^{2}+\alpha \parallel \beta {\parallel }_{1}\right\},$$

(12)

where X = [x₁ ⋯ x_d] is the design matrix, $\beta \in {{\mathbb{R}}}^{d}$ the vector of regression parameters, and α ≥ 0 the regularization (i.e., penalization) parameter. Note that the smaller α the larger is typically the number of explanatory variables with non-zero parameters. In the LASSO regularization paths, shown in Fig. 5, we reduced α until nine properties with non-zero parameter β_i had emerged.

Canalization

This study employs several mathematical concepts related to canalization. By¹⁴, a Boolean function f(x₁, …, x_n): {0, 1}ⁿ → {0, 1} is canalizing if there exists a canalizing variable x_i, a canalizing input a ∈ {0, 1} and a canalized output b ∈ {0, 1} such that

$$f({x}_{1},\ldots ,{x}_{n})=\left\{\begin{array}{ll}b\quad &\,{{\mbox{if}}}\,{x}_{i}=a,\\ g({x}_{1},\ldots ,{x}_{i-1},{x}_{i+1},\ldots ,{x}_{n})\quad &\,{{\mbox{otherwise.}}}\,\end{array}\right.$$

(13)

Some authors argue that constant functions are not canalizing, thus requiring the subfunction g to differ from b¹⁷. If g is also canalizing, then f is 2-canalizing, etc. More generally, f is k-canalizing, where 1 ≤ k ≤ n, with respect to the permutation $\sigma \in {{{{\mathcal{S}}}}}_{n}$, inputs a₁, …, a_k, and outputs b₁, …, b_k if

$$f({x}_{1},\ldots ,{x}_{n})=\left\{\begin{array}{ll}{b}_{1}\quad &{x}_{\sigma (1)}={a}_{1},\\ {b}_{2}\quad &{x}_{\sigma (1)}\,\ne\, {a}_{1},{x}_{\sigma (2)}={a}_{2},\\ {b}_{3}\quad &{x}_{\sigma (1)}\,\ne\, {a}_{1},{x}_{\sigma (2)}\,\ne\, {a}_{2},{x}_{\sigma (3)}={a}_{3},\\ \vdots \quad &\vdots \\ {b}_{k}\quad &{x}_{\sigma (1)}\,\ne\, {a}_{1},\ldots ,{x}_{\sigma (k-1)}\,\ne\, {a}_{k-1},{x}_{\sigma (k)}={a}_{k},\\ {f}_{C}\not\equiv {b}_{k}\quad &{x}_{\sigma (1)}\,\ne\, {a}_{1},\ldots ,{x}_{\sigma (k-1)}\,\ne\, {a}_{k-1},{x}_{\sigma (k)}\,\ne\, {a}_{k}.\end{array}\right.$$

(14)

Here, f_C = f_C(x_σ(k+1), …, x_σ(n)) is the core function, a Boolean function on n − k variables. When f_C is not canalizing, then the integer k is the canalizing depth of f¹⁵. If k = n (i.e., if all variables are become eventually canalizing), then f is a nested canalizing function (NCF)¹⁶. By¹⁷, every nonzero Boolean function f(x₁, …, x_n) can be uniquely written as

$$f({x}_{1},\ldots ,{x}_{n})={M}_{1}({M}_{2}(\cdots ({M}_{r-1}({M}_{r}{p}_{C}+1)+1)\cdots )+1)+q,$$

(15)

where each ${M}_{i}=\mathop{\prod }\nolimits_{j = 1}^{{k}_{i}}({x}_{{i}_{j}}+{a}_{{i}_{j}})$ is a non-constant extended monomial, p_C is the core polynomial of f, and $k=\mathop{\sum }\nolimits_{i = 1}^{r}{k}_{i}$ is the canalizing depth. Each x_i appears in exactly one of {M₁, …, M_r, p_C}. The layer structure of f is the vector (k₁, k₂, …, k_r) and describes the number of variables in each layer M_i^18,35.

Published biological Boolean network models

As part of a recent meta-analysis of 122 published biological Boolean network models, a repository of 163 such models was created². All 110 models analyzed in this study come from this repository. As in², we excluded highly similar models to avoid the introduction of selection bias. That is, for each set of models with highly similar variables (where similarity was assessed using the Szymkiewicz-Simpson “overlap" coefficient⁴⁰), we only kept the most final version of the model. This led to the exclusion of 39 of the 163 models. For details, see². Moreover, as the MAE computation for networks with high in-degree is very computationally expensive, we only considered networks with a maximal in-degree of ten or lower. This led to the exclusion of 12 additional models, yielding a total of 110 models, which were analyzed in this study. We note that in the initial analysis of approximability of biological networks, reported in²², highly similar models were not excluded.

Random null models of biological networks

We compared biological Boolean network models to three ensembles of null models that matched different characteristics of the biological networks, as shown in Fig. 1. All null models matched the in-degree of the biological networks. Null models 1 and 3 matched, in addition, the bias of each biological update rule, while null models 2 and 3 matched the canalizing depth.

Let F = (f₁, …, f_n) be a biological Boolean network model. For each f_i, we first simplified the function to only include essential variables, yielding ${\tilde{f}}_{i}:{\{0,1\}}^{k}\to \{0,1\}$, where k is the number of essential variables, i.e., the in-degree. While this step was omitted in²², it appears important for an unbiased comparison given that close to 2% of regulators in biological networks are non-essential². We then computed the number of ones in the truth table of ${\tilde{f}}_{i}$, denoted q, and ${\tilde{f}}_{i}$’s canalizing depth d, following¹⁸.

To obtain a random Boolean function g (for null model 1) with the same bias as ${\tilde{f}}_{i}$ and arbitrary canalizing depth, we simply selected a random subset Ω ⊆ {0, 1}^k of size ∣Ω∣ = q, and set

$$g(x)=\left\{\begin{array}{ll}1\quad &\,{{\mbox{if}}}\,\,x\in \Omega \\ 0\quad &\,{{\mbox{if}}}\,\,x\,\notin \,\Omega .\end{array}\right.$$

(16)

To obtain a random Boolean function g (for null model 2) with exact canalizing depth d and arbitrary bias, we randomly selected d out of ${\tilde{f}}_{i}$’s k essential variables, arranged them in a random order, and randomly selected for each of the d variables a canalizing input value a ∈ {0, 1} and a canalized output value b ∈ {0, 1} (see Equations (13), (14)). Finally, we randomly selected a core function g_C: {0, 1}^k−d → {0, 1}, ensuring that g_C depends on all k − d variables and that g_C is not canalizing, by repeating this random selection process until both conditions were met. We then filled the truth table of g, as outlined in Equation (14). This entire procedure has already been implemented in the Python library canalizing_function_toolbox, published along with².

To obtain a random Boolean function g (for null model 3) with the same bias as ${\tilde{f}}_{i}$ and the same canalizing depth d, we followed the same procedure as for null model 2, with two exceptions. First, we did not randomly select the canalized output values b₁, …, b_d but instead used the canalized output values of ${\tilde{f}}_{i}$. Otherwise, it is impossible to obtain the same bias. Second, we randomly selected a core function g_C of g that has the same number of ones as the core function of ${\tilde{f}}_{i}$ (following the same approach as for null model 1).

Many biological networks contain external parameters, which remain constant over time. In an n-node Boolean network, a variable x_i is an external parameter if its update function is f_i(x₁, …, x_n) = x_i. That is, if x_i(0) = a ∈ {0, 1}, then x_i(t) = a for all t ≥ 0. An n-node Boolean network with m < n of the nodes external parameters can be seen as 2^m distinct Boolean networks since all 2^m state spaces are disconnected. Specifically, the Boolean network has at least 2^m attractors. In the null model generation, we ensured that external parameters remained external parameters. That is, we did not allow for external parameters x_i to obtain the f(x₁, …, x_n) = ¬ x_i even though this rule has the same degree, bias and canalizing depth.

Random Boolean networks

To generate a random Boolean network F = (f₁, …, f_N) (modified N − K Kauffman network), we first generated a random directed graph of N nodes (the wiring diagram), where each node has K incoming edges. We ensured the graph is simple (i.e., does not contain self-edges/auto-regulations). We further ensured the graph is strongly connected since the dynamics decouple otherwise³³.

To obtain the random Boolean update rules f₁, …, f_N, we randomly selected, for the networks analyzed in Figs. 6, 8, any Boolean function g: {0, 1}^K → {0, 1}. In a sensitivity analysis, reported in Supplementary Fig. 2, we ensured that g is non-degenerated, i.e., that all variables of g are essential, by repeating the random selection until this condition was met. For the random Boolean networks with constant degree 4 and minimal canalizing depth d ∈ {0, 1, 2, 4}, analyzed in Fig. 9a, we followed a very similar procedure as for null model 2 (see above), with one exception: We allowed the core function to be canalizing so that the realized canalizing depth may be larger than d. For the random nested canalizing Boolean networks with constant degree 4 and different layer structure, analyzed in Fig. 9b, we followed again a very similar procedure as for null model 2, with the exception that the layer structure determines the canalized output values, b₁, …, b₄³⁵.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Kadelka et al.² contains standardized update rules of the 110 investigated published, expert-curated Boolean biological network models.

Code availability

Original code to compute the approximability of Boolean networks, published along with²², is available at https://gitlab.com/smanicka/boolion. Code to analyze the 110 investigated published, expert-curated Boolean biological network models, as well as the Python library canalizing_function_toolbox, published along with², is available at https://github.com/ckadelka/DesignPrinciplesGeneNetworks. New code underlying the analyses described in this paper is available at https://github.com/ckadelka/ApproximabilityBooleanNetworks.

References

Barrat, A., Barthelemy, M. & Vespignani, A. Dynamical Processes On Complex Networks (Cambridge University Press, 2008).
Kadelka, C. et al. A meta-analysis of Boolean network models reveals design principles of gene regulatory networks. Sci. Adv. 10, eadj0822 (2024).
Article PubMed PubMed Central Google Scholar
Shen-Orr, S. S., Milo, R., Mangan, S. & Alon, U. Network motifs in the transcriptional regulation network of Escherichia coli. Nat. Genet. 31, 64–68 (2002).
Article CAS PubMed Google Scholar
Balleza, E. et al. Critical dynamics in genetic regulatory networks: examples from four kingdoms. PLoS ONE 3, e2456 (2008).
Article PubMed PubMed Central Google Scholar
Daniels, B. C. et al. Criticality distinguishes the ensemble of biological regulatory networks. Phys. Rev. Lett. 121, 138102 (2018).
Article CAS PubMed Google Scholar
Luque, B. & Solé, R. V. Lyapunov exponents in random Boolean networks. Physica A 284, 33–45 (2000).
Article Google Scholar
Shmulevich, I. & Kauffman, S. A. Activities and sensitivities in Boolean network models. Phys. Rev. Lett. 93, 048701 (2004).
Article PubMed PubMed Central Google Scholar
Chandrasekhar, K., Kadelka, C., Laubenbacher, R. & Murrugarra, D. Stability of linear Boolean networks. Physica D 451, 133775 (2023).
Article Google Scholar
Harris, S. E., Sawhill, B. K., Wuensche, A. & Kauffman, S. A model of transcriptional regulatory networks based on biases in the observed regulation rules. Complexity 7, 23–40 (2002).
Article Google Scholar
Gates, A. J., Brattig Correia, R., Wang, X. & Rocha, L. M. The effective graph reveals redundancy, canalization, and control pathways in biochemical regulation and signaling. Proc. Natl Acad. Sci. USA 118, e2022598118 (2021).
Article CAS PubMed PubMed Central Google Scholar
Waddington, C. H. Canalization of development and the inheritance of acquired characters. Nature 150, 563–565 (1942).
Article Google Scholar
Hallgrímsson, B., Willmore, K. & Hall, B. K. Canalization, developmental stability, and morphological integration in primate limbs. Am. J. Phys. Anthropol. 119, 131–158 (2002).
Article Google Scholar
Flatt, T. The evolutionary genetics of canalization. Q. Rev. Biol. 80, 287–316 (2005).
Article PubMed Google Scholar
Kauffman, S. The large scale structure and dynamics of gene control circuits: an ensemble approach. J. Theor. Biol. 44, 167–190 (1974).
Article CAS PubMed Google Scholar
Layne, L., Dimitrova, E. & Macauley, M. Nested canalyzing depth and network stability. Bull. Math. Biol. 74, 422–433 (2012).
Article PubMed Google Scholar
Kauffman, S., Peterson, C., Samuelsson, B. & Troein, C. Random Boolean network models and the yeast transcriptional network. Proc. Natl Acad. Sci. USA 100, 14796–14799 (2003).
Article CAS PubMed PubMed Central Google Scholar
He, Q. & Macauley, M. Stratification and enumeration of Boolean functions by canalizing depth. Physica D 314, 1–8 (2016).
Article Google Scholar
Dimitrova, E., Stigler, B., Kadelka, C. & Murrugarra, D. Revealing the canalizing structure of Boolean functions: algorithms and applications. Automatica 146, 110630 (2022).
Article Google Scholar
Just, W., Shmulevich, I. & Konvalina, J. The number and probability of canalizing functions. Physica D 197, 211–221 (2004).
Article Google Scholar
Li, Y., Adeyeye, J. O., Murrugarra, D., Aguilar, B. & Laubenbacher, R. Boolean nested canalizing functions: a comprehensive analysis. Theor. Comput. Sci. 481, 24–36 (2013).
Article Google Scholar
Kadelka, C., Li, Y., Kuipers, J., Adeyeye, J. O. & Laubenbacher, R. Multistate nested canalizing functions and their networks. Theor. Comput. Sci. 675, 1–14 (2017).
Article Google Scholar
Manicka, S., Johnson, K., Levin, M. & Murrugarra, D. The nonlinearity of regulation in biological networks. NPJ Syst. Biol. Appl. 9, 10 (2023).
Article PubMed PubMed Central Google Scholar
Manicka, S., Marques-Pita, M. & Rocha, L. M. Effective connectivity determines the critical dynamics of biochemical networks. J. R. Soc. Interface 19, 20210659 (2022).
Article PubMed PubMed Central Google Scholar
Borriello, E. & Daniels, B. C. The basis of easy controllability in Boolean networks. Nat. Commun. 12, 5227 (2021).
Article CAS PubMed PubMed Central Google Scholar
Derrida, B. & Weisbuch, G. Evolution of overlaps between configurations in random Boolean networks. J. Phys. 47, 1297–1303 (1986).
Article Google Scholar
Derrida, B. & Pomeau, Y. Random networks of automata: a simple annealed approximation. Europhys. Lett. 1, 45 (1986).
Article Google Scholar
Drossel, B. Random Boolean networks. In Reviews of Nonlinear Dynamics and Complexity (ed. Schuster, H. G.) 69–110 (2008).
Park, K. H., Costa, F. X., Rocha, L. M., Albert, R. & Rozum, J. C. Models of cell processes are far from the edge of chaos. PRX Life 1, 023009 (2023).
Article PubMed PubMed Central Google Scholar
Moreira, A. A. & Amaral, L. A. N. Canalizing kauffman networks: nonergodicity and its effect on their critical behavior. Phys. Rev. Lett. 94, 218702 (2005).
Article PubMed PubMed Central Google Scholar
Zanudo, J. G., Aldana, M. & Martínez-Mekler, G. Boolean threshold networks: virtues and limitations for biological modeling. In Information Processing and Biological Systems (eds. Niiranen, S. & Ribeiro, A.) 113–151 (2011).
Tibshirani, R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B 58, 267–288 (1996).
Article Google Scholar
Kauffman, S. A. Metabolic stability and epigenesis in randomly constructed genetic nets. J. Theor. Biol. 22, 437–467 (1969).
Article CAS PubMed Google Scholar
Kadelka, C., Wheeler, M., Veliz-Cuba, A., Murrugarra, D. & Laubenbacher, R. Modularity of biological systems: a link between structure and function. J. R. Soc. Interface 20, 20230505 (2023).
Article PubMed PubMed Central Google Scholar
Kadelka, C., Keilty, B. & Laubenbacher, R. Collectively canalizing boolean functions. Adv. Appl. Math. 145, 102475 (2023).
Article Google Scholar
Kadelka, C., Kuipers, J. & Laubenbacher, R. The influence of canalization on the robustness of Boolean networks. Physica D 353, 39–47 (2017).
Article Google Scholar
Boole, G. Studies in Logic and Probability (Dover Publications, 2012).
Kauffman, S., Peterson, C., Samuelsson, B. & Troein, C. Genetic networks with canalyzing boolean rules are always stable. Proc. Natl Acad. Sci. USA 101, 17102–17107 (2004).
Article CAS PubMed PubMed Central Google Scholar
Karlsson, F. & Hörnquist, M. Order or chaos in boolean gene networks depends on the mean fraction of canalizing functions. Physica A 384, 747–757 (2007).
Article Google Scholar
Schwab, J. D., Kühlwein, S. D., Ikonomi, N., Kühl, M. & Kestler, H. A. Concepts in Boolean network modeling: what do they all mean? Comput. Struct. Biotechnol. J. 18, 571–582 (2020).
Article PubMed PubMed Central Google Scholar
Szymkiewicz, D. Une contribution statistique à la géographie floristique. Acta Soc. Bot. Pol. 11, 249–265 (1934).
Article Google Scholar

Download references

Acknowledgements

C.K. and D.M. were both partially supported by travel grants from the Simons Foundation (grant numbers 712537 and 850896, respectively). The authors thank Iowa State University for making high-performance computing freely available to C.K.

Author information

Authors and Affiliations

Department of Mathematics, Iowa State University, 411 Morrill Rd, Ames, 50011, IA, USA
Claus Kadelka
Department of Mathematics, University of Kentucky, 719 Patterson Office Tower, Lexington, 40506, KY, USA
David Murrugarra

Authors

Claus Kadelka
View author publications
You can also search for this author in PubMed Google Scholar
David Murrugarra
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization: C.K.; Methodology: C.K. and D.M.; Software & Visualization: C.K.; Formal analysis: C.K.; Writing—Original Draft: C.K. and D.M.; Writing—Review & Editing: C.K. and D.M.

Corresponding author

Correspondence to Claus Kadelka.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kadelka, C., Murrugarra, D. Canalization reduces the nonlinearity of regulation in biological networks. npj Syst Biol Appl 10, 67 (2024). https://doi.org/10.1038/s41540-024-00392-y

Download citation

Received: 14 February 2024
Accepted: 31 May 2024
Published: 13 June 2024
DOI: https://doi.org/10.1038/s41540-024-00392-y
Springer Nature Limited

Canalization reduces the nonlinearity of regulation in biological networks

Abstract

Similar content being viewed by others

Models of Gene Regulation: Integrating Modern Knowledge into the Random Boolean Network Framework

Modelling the Evolution of Dynamic Regulatory Networks: Some Critical Insights

Emergent Properties of Gene Regulatory Networks: Models and Data

Introduction

Results

Discussion

Methods

Boolean networks

Metrics describing Boolean network dynamics

Continuous extensions of Boolean functions

Taylor polynomials of Boolean functions

Approximability of a Boolean network by continuous extensions

Linear LASSO regression

Canalization

Published biological Boolean network models

Random null models of biological networks

Random Boolean networks

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Information

Reporting Summary

Rights and permissions

About this article

Cite this article

Design principles and phenotypic landscapes of biological networks

Navigation

Canalization reduces the nonlinearity of regulation in biological networks

Abstract

Similar content being viewed by others

Models of Gene Regulation: Integrating Modern Knowledge into the Random Boolean Network Framework

Modelling the Evolution of Dynamic Regulatory Networks: Some Critical Insights

Emergent Properties of Gene Regulatory Networks: Models and Data

Introduction

Results

Discussion

Methods

Boolean networks

Metrics describing Boolean network dynamics

Continuous extensions of Boolean functions

Taylor polynomials of Boolean functions

Approximability of a Boolean network by continuous extensions

Linear LASSO regression

Canalization

Published biological Boolean network models

Random null models of biological networks

Random Boolean networks

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Information

Reporting Summary

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation