1 Introduction and Notation

Diagonalizable operators play a fundamental role in classical operator theory. Indeed, they have special geometric features that allow the mathematical treatment of properties in many fields of science, and can be used to approximate linear operators in infinite-dimensional Banach spaces. However, there is no analog for the case of Lipschitz maps in Euclidean spaces, although there are many situations in which both classes (linear and Lipschitz maps) can be naturally associated.

Several authors have recently been interested in the determination and characterization of classes of operators for which the notion of diagonalization could be adapted. To do this, one must begin by giving a definition of eigenvector that makes sense in the specific context under investigation. Nonlinear spectral theory is already a classical subject with a long development, both from a theoretical and a practical point of view (Appell and Dörfner 1997; Dancer and Phillips 1974; López-Gómez 2001). Examples include multilinear maps (da Silva et al. 2021; Milano et al. 2020), polynomials (Mackey et al. 2015) and Lipschitz maps (Erdoğan et al. 2022). In particular, we have studied suitable versions of these notions for Lipschitz maps in Arnau et al. (2023), Erdoğan et al. (2022).

Our aim in the present paper is to adapt for the case of lattice Lipschitz operators the classical extension/representation of diagonalizable linear operators once a basis of eigenvectors is known. The idea is to use the same representation pattern (but in an approximate form) for the case of functions that are approximable by lattice Lipschitz maps, which is a broad class of functions containing linear operators. As we will explain, our method is based on the use of the order in the Euclidean lattice, rather than linearity.

Thus, the class of lattice Lipschitz operators has recently been introduced to provide, for Lipschitz maps, an analog of the useful notion of diagonalizable (linear) operator (Arnau et al. 2023). Using the results presented in the aforementioned paper, we explain here an approximation method for “almost diagonalizable” Lipschitz operators, i.e., Lipschitz functions that can be approximately computed as lattice Lipschitz maps. In this paper we provide specific tools for the (approximate) representation of general Lipschitz maps by means of their eigenvectors, using the vector-valued lattice versions of the classical McShane and Whitney formulas that were obtained in Arnau et al. (2023).

Although some theoretical statements will also be presented, this paper is written from the point of view of applied mathematics, with the idea of providing an efficient algorithm for the approximation of Lipschitz maps that are, as we have said, “almost diagonal”. An explanation of the “initial class” of lattice Lipschitz operators, together with some clarifying examples, is given in Sect. 2. General results on error bounds for such approximations can be found in Arnau et al. (2023), but we go further in this direction in Sect. 3 of the present paper, which is mainly devoted to the approximate calculation of a “sufficient” subset of eigenvectors. As will be explained, our method applies the order-based McShane–Whitney extension to the studied operator restricted to that subset of eigenvectors, as if it were a lattice Lipschitz operator. Once such a set is obtained, we present in Sect. 4.1 how to define a suitable order in Euclidean space that allows the use of the McShane–Whitney expressions. Sections 4.2 and 4.3 are devoted to giving the representation formulas and the final computational algorithms. In Sect. 5 we show an illustrative example, and in Sect. 6 we give some final comments. The algorithm used to compute a set of approximate eigenvectors, programmed in R, is given in Appendix A.

We will use the notation of general topology and mathematical analysis. We will write \(\mathbb {R}\) for the set of the real numbers endowed with its standard Euclidean distance, and E for a Euclidean space such as \({\mathbb {R}}^n,\) \(n \in \mathbb N.\) If \((M,d)\) and \((D,\rho )\) are metric spaces, we say that a map \(T: M \rightarrow D\) is Lipschitz if there is a constant \(K>0\) such that \(\rho \big (T(a),T(b) \big ) \le K \, d \big ( a,b \big )\) for all \(a,b \in M\) (Cobzaş et al. 2019). The Lipschitz constant of T is the infimum of all such constants K.

We will center our attention on Lipschitz-type functions between Euclidean spaces. Recall that the classical McShane–Whitney Theorem [see for example (Cobzaş et al. 2019), Ch. 4] establishes that, if S is a subset of a metric space \((D,d)\) and \(T: S \rightarrow \mathbb {R}\) is a Lipschitz function with Lipschitz constant K, we can always find an extension to D with the same Lipschitz constant. There are two classical ways of computing such an extension, which are given by the formulas

$$\begin{aligned} T^M(b)&= \sup \Big \{T(a)-K\,d(b,a): \, a\in S \Big \}, \quad b \in D, \quad \text {(McShane)}\\ T^W(b)&= \inf \Big \{T(a)+K\, d (b,a): \, a\in S \Big \}, \quad b \in D, \quad \text {(Whitney)}. \end{aligned}$$
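To make the two formulas concrete, here is a minimal numerical sketch (in Python, while the implementation discussed later in the paper is in R; the sample set and the function are hypothetical) computing both extensions of a real-valued Lipschitz function from a finite subset S of the real line:

```python
def mcshane(T_vals, S, K, b):
    """McShane formula: sup over a in S of T(a) - K*d(b, a)."""
    return max(t - K * abs(b - a) for a, t in zip(S, T_vals))

def whitney(T_vals, S, K, b):
    """Whitney formula: inf over a in S of T(a) + K*d(b, a)."""
    return min(t + K * abs(b - a) for a, t in zip(S, T_vals))

# Hypothetical example: T(x) = |x| restricted to S, Lipschitz constant K = 1.
S = [-2.0, -1.0, 1.0, 2.0]
T_vals = [abs(a) for a in S]
K = 1.0

# Both formulas reproduce T on S; off S they bracket every
# K-Lipschitz extension F of T: T^M <= F <= T^W.
print(mcshane(T_vals, S, K, 1.0), whitney(T_vals, S, K, 1.0))
```

On the sample points both formulas return the original values; in general the McShane formula gives the smallest K-Lipschitz extension and the Whitney formula the largest.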

Suitable versions of such formulas, adapted for Lipschitz operators \(T:E \rightarrow E,\) will be the main computational tools in this paper.

Given a Euclidean space X of dimension n and a basis in it, we can always define an associated order in X given by the coordinate-wise ordering of the vectors (two vectors \(x=(x_1,...,x_n), y=(y_1,...,y_n) \in X\) are ordered, \(x \le y,\) if \(x_i \le y_i\) for \(i=1,...,n\)). This gives, when the 2-norm of the coordinates is considered, a Banach lattice structure \((X, \Vert \cdot \Vert , \le ).\) From this point of view, each vector represented by its coordinates in the chosen basis can be considered as a function \(f: \{1,...,n\} \ni w \mapsto f(w)=x_w.\) The standard symbols \(\vee \) and \(\wedge \) are used for the maximum and the minimum of two (or several) vectors in the lattice, respectively.

2 Lattice Lipschitz Operators

As has been shown in Arnau et al. (2023), the operators belonging to the special class of Lipschitz maps, that are called lattice Lipschitz operators, have a relevant property which motivates the method we propose in this paper.

Property. Each lattice Lipschitz operator T can be written as the McShane extension (equivalently, the Whitney extension) of the restriction \(T|_{R({\mathcal {B}})}\) of the operator itself to the set of rays \(R({\mathcal {B}})\) generated by the basis \({\mathcal {B}}\) of eigenvectors of T that provides the order.

Although we cannot expect all Lipschitz maps to behave like lattice Lipschitz operators, we can use this specific class as an approximation family. The idea is to find suitable extensions for each (approximate) eigenvector basis we can obtain for X,  and mix them in a common structure to find good approximations of the Lipschitz operator.

On the other hand, the concept of diagonalizable linear operator on finite dimensional Euclidean spaces can be extended to the setting of Lipschitz maps by means of the notion of lattice Lipschitz operator. Essentially, these are Lipschitz maps that can be written as a combination of real-valued Lipschitz functions acting independently in the directions of a given basis of the space. The main characterizations and results regarding this class of maps have been extensively studied in Arnau et al. (2023). In what follows we recall the main technical definitions and results, as well as some illustrative examples.

Let \({\mathcal {B}}\) be a fixed basis of E,  the finite dimensional space \({\mathbb {R}}^n.\) We will consider the order \(\le \) provided by the pointwise order of the coordinates of the vectors of E in the basis \({\mathcal {B}}.\) As usual, using the coordinate representation given by \({\mathcal {B}},\) every vector \(x=(\alpha _1,...,\alpha _n) \in E\) can be understood as a function \(x: \{1,...,n\} \rightarrow {\mathbb {R}},\) \(x(w)=\alpha _w, \) \(w \in \{1,...,n\}= \Omega .\) The same ideas can be used for any Euclidean space X.

In this setting, we recall the definition of our main tool.

Definition 1

(Definition 1 in Arnau et al. 2023) Let \(X_0 \subseteq X.\) A Lipschitz operator \(T:X_0 \rightarrow X\) is lattice Lipschitz (with respect to the order \(\le \) associated to the basis \({\mathcal {B}}\)) if there is a function \(K:\Omega \rightarrow {\mathbb {R}}^+\) satisfying the inequality

$$\begin{aligned} \big |T(x)-T(y)\big |(w) \le K(w) \big |x-y\big |(w), \quad \text {for all} \,\,\, x,y \in X_0, \quad w \in \Omega . \end{aligned}$$
(1)

In case it exists, the minimum (otherwise, the infimum) of the functions \(K(\cdot )\) satisfying this inequality is called the associated function.

As we will see below, lattice Lipschitz operators can be identified with another class of functions that allows a geometric description. Let us give this definition for the case \(E={\mathbb {R}}^n\) for simplicity, although it clearly makes sense for any Euclidean space X. A Lipschitz operator \(T: E_0 \rightarrow E\) is diagonal with respect to a basis \({\mathcal {B}} = \{ x_1, x_2, \ldots , x_n\}\) of \(E = {\mathbb {R}}^n\) if there exist real functions \(f_i: {\mathbb {R}} \rightarrow {\mathbb {R}}\) for \(1 \le i \le n\) such that

$$\begin{aligned} T\left( \sum _{i=1}^n \alpha _i x_i \right) = \sum _{i=1}^n f_i(\alpha _i) x_i, \qquad \alpha _1, \alpha _2, \ldots , \alpha _n \in {\mathbb {R}}. \end{aligned}$$
(2)

We call the functions \(f_i\) the coordinate functions of T with respect to the basis \({\mathcal {B}}.\)

Example 1

Let us give an example of diagonal map. Consider the function \(S: B_{{\mathbb {R}}^2} \rightarrow {\mathbb {R}}^2\) given by

$$\begin{aligned} S(x,y)= \left( x^2 + y^2, 2 xy \right) , \quad (x,y) \in B_{{\mathbb {R}}^2}. \end{aligned}$$

Both the coordinates \((x,y)\) in the domain and those in the range are assumed to be with respect to the canonical basis of \({\mathbb {R}}^2.\) To find the eigenvectors and the eigenvalue functions, just notice that we have to find the vectors \((x,y)\) such that \(S(x,y)\) and \((x,y)\) are linearly dependent. This means that

$$\begin{aligned} \left| \begin{array}{cc} x^2+y^2 &{} 2 x y \\ x &{} y \end{array} \right| =0, \end{aligned}$$

that is, \(y (x^2+ y^2)= 2 x^2 y\), which leads to the solutions

$$\begin{aligned} y=0 \, \,\, \text {with all} \, \,\, x \in {\mathbb {R}}, \quad \text {or} \quad y^2=x^2, \end{aligned}$$

and so, the set of eigenvectors is

$$\begin{aligned} \left\{ (t,0): |t| \le 1\right\} \cup \left\{ (t,t): |t| \le 1/\sqrt{2}\right\} \cup \left\{ (t,-t): |t| \le 1/\sqrt{2}\right\} . \end{aligned}$$

Consider the basis for \({\mathbb {R}}^2\) given by \({\mathcal {B}}= \{(1,1),(1,-1) \}.\) Then we can consider the change of coordinates provided by \((x,y)= z(1,1)+ v(1,-1),\) which gives

$$\begin{aligned} x=z+v, \quad y=z-v, \end{aligned}$$

and so

$$\begin{aligned} z= \frac{x+y}{2}, \quad v= \frac{x-y}{2}. \end{aligned}$$

In this new basis, we can write the map S as

$$\begin{aligned} S \left( z(1,1)+v(1,-1) \right)&= S(x,y)= \left( (z+v)^2 + (z-v)^2, 2 (z+v)(z-v) \right) \\&= \left( 2z^2+ 2v^2, 2 (z^2-v^2) \right) = 2 z \cdot z \, (1,1) + 2v \cdot v \,(1,-1). \end{aligned}$$

Consequently, S is a diagonal map defined by the Lipschitz eigenvalue functions (that is, the functions that provide the eigenvalue associated to a given eigenvector) \(e_1 \big ( z(1,1) \big )= 2 z\) and \(e_2 \big ( v(1,-1) \big )= 2 v.\) The coordinate functions are \(f_1(z)= 2 z^2\) and \(f_2(v)=2 v^2,\) which are also Lipschitz in the domain. An easy computation taking into account that the function is defined on \(B_{{\mathbb {R}}^2}\) gives that the associated function is \(K(1)=K(2)=2.\)

However, note that S is not diagonal with respect to the basis \({\mathcal {D}}= \{(1,0), (1,1)\}.\) Indeed, note that

$$\begin{aligned} \begin{aligned} S(z(1,0)+ t(1,1))&= S(x,y)= S(z+t,t) \\&=\big ( (z+t)^2+t^2, 2(z+t)t \big )= ( z^2 + 2 t^2+2zt, 2 t^2 + 2zt ). \end{aligned} \end{aligned}$$

Since \(( z^2 + 2 t^2+2zt, 2 t^2 + 2zt)\) cannot be written as \(s_1(z) (1,0) + s_2(t) (1,1),\) we have that S is not diagonal with respect to \({\mathcal {D}}.\)
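The diagonal representation (2) can be checked numerically. The following Python sketch (the helper name is ours) builds a diagonal operator from a basis and its coordinate functions, and reproduces the map S of Example 1 from the basis \({\mathcal {B}}= \{(1,1),(1,-1)\}\):

```python
import numpy as np

def diagonal_operator(B, fs):
    """T(sum_i alpha_i x_i) = sum_i f_i(alpha_i) x_i, as in formula (2).
    B: matrix whose columns are the basis vectors; fs: coordinate functions."""
    B = np.asarray(B, dtype=float)
    B_inv = np.linalg.inv(B)
    def T(v):
        alpha = B_inv @ np.asarray(v, dtype=float)  # coordinates in the basis
        return B @ np.array([f(a) for f, a in zip(fs, alpha)])
    return T

# Basis {(1,1), (1,-1)} with coordinate functions f1(z) = 2z^2, f2(v) = 2v^2
B = np.array([[1.0, 1.0],
              [1.0, -1.0]])
S_diag = diagonal_operator(B, [lambda a: 2 * a**2, lambda a: 2 * a**2])

# S_diag agrees with S(x, y) = (x^2 + y^2, 2xy) on the unit ball
print(S_diag([0.5, 0.3]))  # compare with (0.5^2 + 0.3^2, 2*0.5*0.3)
```

The computation follows Example 1: the inverse change of basis recovers the coordinates \((z,v)\), and the output is the combination \(2z^2(1,1)+2v^2(1,-1).\)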

The next result establishes the identification between diagonal and lattice Lipschitz operators.

Theorem 1

(Theorem 3 in Arnau et al. (2023)) For a Euclidean space X and a Lipschitz operator \(T: X \rightarrow X,\) the following statements are equivalent.

  (1)

    T is lattice Lipschitz (with respect to a certain order \(\le \)) and has an associated function \(K: \Omega \rightarrow {\mathbb {R}}.\)

  (2)

    T is diagonal when the basis \({\mathcal {B}}\) is considered (the basis gives the order \(\le \)), and K(i),  \(i=1,...,n\) are the constants appearing in each coordinate.

Note that, following Theorem 1 and taking into account the last part of Example 1, we know that an operator can be diagonal (and therefore lattice Lipschitz) with respect to a given eigenvector basis \({\mathcal {B}}\), and not be diagonal with respect to another eigenvector basis \({\mathcal {D}}.\) In this case, using the theorem we obtain that the operator is not lattice Lipschitz with respect to the order defined by \({\mathcal {D}}.\)

Although we do not use it in this paper, we should note that Theorem 1 allows a “local” version, i.e., it can be adapted for operators that are only defined on subsets of X satisfying certain properties. In its generality, this result opens the door to the approximation of Lipschitz operators by means of lattice Lipschitz extensions. Since having an eigenvector basis will play a central role in this, we will fix in the next section a method to deal with the (approximate) computation of an eigenvector basis in the case of Lipschitz operators.

Example 2

The previous result can be directly applied to characterize lattice Lipschitz operators. Let us explain with an easy example how to do it. Take the function \(G: {\mathbb {R}}^2 \rightarrow {\mathbb {R}}^2\) given by the formula

$$\begin{aligned} G(x,y)= \left( x-y+ \frac{|y|}{1+|y|}, \frac{|y|}{1+|y|} \right) , \quad (x,y) \in {\mathbb {R}}^2, \end{aligned}$$

where (xy) are the coordinates with respect to the canonical basis, as well as the coordinates in the range.

  (1)

    Let us compute the conditions for a vector to be an eigenvector of the map. The requirement (which has to be computed pointwise) is given by the equation

    $$\begin{aligned} \begin{vmatrix} x-y+ \frac{|y|}{1+|y|}&x \\ \frac{|y|}{1+|y|}&y \\ \end{vmatrix} = xy -y^2 + \frac{y |y|}{1+|y|} - \frac{x |y|}{1+|y|} = 0. \end{aligned}$$

    We directly get that this relation is satisfied for all vectors of the form \((x,0),\) \(x \in {\mathbb {R}},\) and when

    $$\begin{aligned} x \left( y- \frac{|y|}{1+|y|} \right) = y \left( y- \frac{|y|}{1+|y|} \right) , \end{aligned}$$

    which gives \(x=y.\) Therefore, all vectors of the form \((x,0)\) and \((x,x),\) \(x \in {\mathbb {R}},\) are eigenvectors.

  (2)

    Consider the basis \({\mathcal {B}}=\{ (1,0), (1,1) \},\) and take the coordinates of the vectors in this basis to be (zt). The equation \((x,y)= z(1,0)+ t(1,1)\) gives the change of coordinates

    $$\begin{aligned} x= z+t, \quad y=t. \end{aligned}$$

    Therefore, the formula

    $$\begin{aligned} G(z(1,0)+ t(1,1))&= G(z+t,t)= \left( z+ \frac{|t|}{1+|t|}, \frac{|t|}{1+|t|} \right) \\&= z(1,0) + \frac{|t|}{t(1+|t|)} \, t \, (1,1), \end{aligned}$$

    for \((z,t) \in {\mathbb {R}}^2,\) gives a diagonal representation for the function G.

  (3)

    The functions \(g_1(z)=z\) and \(g_2(t)= |t|/(1+|t|)\) are real valued Lipschitz functions with constant 1. It is obvious for \(g_1.\) For \(g_2,\) just note that for all \( t_1, t_2 \in {\mathbb {R}},\)

    $$\begin{aligned} \left| \frac{|t_1|}{1+|t_1|} - \frac{|t_2|}{1+|t_2|} \right|&= \left| \frac{|t_1|(1+|t_2|) }{(1+|t_1|)(1+|t_2|)} - \frac{|t_2|(1+|t_1|)}{(1+|t_1|)(1+|t_2|)} \right| \\&= \frac{ \left| |t_1| - |t_2| \right| }{(1+|t_1|)(1+|t_2|)} \le \left| |t_1|- |t_2| \right| \le |t_1-t_2|, \end{aligned}$$

    which means that the Lipschitz constant is less than or equal to one; letting \(t_1 \rightarrow \infty \) and \(t_2=0\) we see that this constant is exactly 1. The associated function is then given by \(K(1)=1\) and \(K(2)=1.\)

Summing up, all the vectors of the form z(1, 0) and t(1, 1) are eigenvectors, with eigenvalues \(e_1\big (z(1,0)\big )=1\) and \(e_2\big (t(1,1)\big )= |t|/\big (t(1+ |t|)\big ).\) Using Theorem 1, we can say that G is a lattice Lipschitz operator when the order is the one inherited from the basis \({\mathcal {B}}.\)

3 Estimates of the Eigenvectors of a Lipschitz Operator: The Exponential Distribution Monte Carlo Model

Following the arguments provided in the previous sections, we need to find sets of vectors that are approximately eigenvectors of any Lipschitz map we want to analyze. In this section we explain how to perform a statistical procedure to obtain such sets. As we have seen, knowing a basis of eigenvectors with respect to which a given Lipschitz map is diagonal allows an easy representation of a lattice Lipschitz operator. However, we plan to establish an approximation procedure for a broad class of Lipschitz maps, so we cannot expect them to be lattice Lipschitz. We intend to use the same formulas that provide representations of lattice Lipschitz maps, so we need to determine (at least approximately) the set of eigenvectors of a given Lipschitz map in order to fix a convenient basis of the space, if possible.

To find a set of (approximate) eigenvectors we mix geometric and stochastic arguments. Suppose that X is a real Euclidean space, and recall that its dual space can be directly identified with X itself, the dual action being the scalar product. Take a Lipschitz function \(T:X \rightarrow X.\) Following the same ideas used in [Erdoğan et al. (2022), Section 2], we define the diagonal value \(\lambda _T(x)\) of T(x) as the real number that satisfies

$$\begin{aligned} \lambda _T(x)= \frac{\left\langle T(x), x \right\rangle }{\Vert x\Vert ^2}, \quad x \in X. \end{aligned}$$

We will simply write \(\lambda (x)\) instead of \(\lambda _T(x)\) if T is fixed in the context.

Note that if x is an eigenvector of T with associated eigenvalue \(\beta \in {\mathbb {R}},\) the real number \(\lambda (x)\) defined as above gives the eigenvalue, that is,

$$\begin{aligned} \lambda (x)= \left\langle T(x), \frac{x}{\Vert x\Vert ^2} \right\rangle =\left\langle \beta \cdot x, \frac{x}{\Vert x\Vert ^2} \right\rangle = \beta \frac{\langle x, x \rangle }{\Vert x\Vert ^2} = \beta . \end{aligned}$$

The main property of the diagonal value is that it represents the optimal diagonal approximation to T(x). Furthermore, for the case of Euclidean spaces, the optimal diagonal approximation coincides with the projection of T(x) on the unit vector in the direction of x. We explicitly write this in Proposition 2, including the proof (a straightforward consequence of Euclidean geometry) for the sake of completeness.

Let us first define the diagonal projection error, \(\varepsilon _T(x)(\alpha )\), as the function of \(\alpha \in {\mathbb {R}}\) that represents the size of the difference between T(x) and \(\alpha \cdot x.\) After normalization, the formula for this quantity is given by

$$\begin{aligned} \varepsilon _T(x)(\alpha )= \frac{1}{\Vert x\Vert } \left\| T(x) - \alpha \cdot x \right\| , \quad \alpha \in {\mathbb {R}}. \end{aligned}$$

We will write \(\varepsilon _T(x)\) for the minimal value of the diagonal projection error and, as in the case of \(\lambda ,\) \(\varepsilon (x)\) if T has been already fixed.

Proposition 2

Let X be a Euclidean space, \(T:X \rightarrow X\) a Lipschitz operator and \(x \in X.\) Then the diagonal value \(\lambda _T(x)=\lambda (x)\) is the real number that minimizes the diagonal projection error, that is, the diagonal error is given by

$$\begin{aligned} \varepsilon (x)= \frac{1}{\Vert x\Vert } \, \min _{\alpha \in {\mathbb {R}}} \Big \Vert T(x) - \alpha \cdot x \Big \Vert = \frac{1}{\Vert x\Vert } \, \Big \Vert T(x) - \lambda (x) \cdot x \Big \Vert = \varepsilon _T(x) \big (\lambda (x) \big ). \end{aligned}$$

Proof

It is given by a direct computation. Note that the solution of the equation

$$\begin{aligned} \frac{\partial \varepsilon _T(x)^2(\alpha )}{\partial \alpha }&= \frac{1}{\Vert x\Vert ^2} \, \frac{\partial }{\partial \alpha } \Big ( \big \langle T(x) - \alpha \cdot x, T(x) - \alpha \cdot x \big \rangle \Big )\\&= \frac{1}{\Vert x\Vert ^2} \, \frac{\partial }{\partial \alpha } \Big ( \Vert T(x)\Vert ^2 -2 \alpha \langle T(x),x \rangle + \alpha ^2 \Vert x\Vert ^2 \Big )\\&= - \frac{2}{\Vert x\Vert ^2} \, \langle T(x),x \rangle + \frac{2}{\Vert x\Vert ^2} \,\alpha \Vert x\Vert ^2 =0, \end{aligned}$$

is given by \(\alpha = \langle T(x),x \rangle /\Vert x\Vert ^2= \lambda (x).\) Since

$$\begin{aligned} \frac{\partial ^2 \varepsilon _T(x)^2(\alpha )}{\partial \alpha ^2} = 2 \frac{1}{\Vert x\Vert ^2} \, \Vert x\Vert ^2 =2 >0, \end{aligned}$$

we get the result. \(\square \)

Corollary 1

For every point \(x\in X,\) the error \(\varepsilon (x)\) is given by the formula

$$\begin{aligned} \varepsilon (x) = \sqrt{\frac{\Vert T(x)\Vert ^2}{\Vert x\Vert ^2}- \lambda (x)^2}. \end{aligned}$$

Proof

It is a consequence of Proposition 2 and a direct calculation involving the definition of \(\lambda (x)\). Indeed, we have that

$$\begin{aligned} \varepsilon (x)^2&= \frac{1}{\Vert x\Vert ^2} \Big \langle T(x)-\lambda (x) \cdot x ,T(x)-\lambda (x) \cdot x \Big \rangle \\&=\frac{1}{\Vert x\Vert ^2} \left( \Vert T(x)\Vert ^2 - 2 \lambda (x) \langle T(x), x \rangle + \lambda (x)^2 \, \Vert x\Vert ^2 \right) \\&= \frac{\Vert T(x)\Vert ^2}{\Vert x\Vert ^2} - \lambda (x)^2. \end{aligned}$$

\(\square \)
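Numerically, the diagonal value and the diagonal error are one-liners. The sketch below (Python; the map of Example 1 serves as a hypothetical test case) also lets one check the identity of Corollary 1:

```python
import numpy as np

def diagonal_value(T, x):
    """lambda_T(x) = <T(x), x> / ||x||^2, the best diagonal approximation."""
    x = np.asarray(x, dtype=float)
    return np.dot(T(x), x) / np.dot(x, x)

def diagonal_error(T, x):
    """epsilon_T(x) = ||T(x) - lambda_T(x) x|| / ||x|| (Proposition 2)."""
    x = np.asarray(x, dtype=float)
    lam = diagonal_value(T, x)
    return np.linalg.norm(T(x) - lam * x) / np.linalg.norm(x)

# The map S of Example 1; (1, 1) is an exact eigenvector with eigenvalue 2
S = lambda v: np.array([v[0]**2 + v[1]**2, 2 * v[0] * v[1]])
print(diagonal_value(S, [1.0, 1.0]), diagonal_error(S, [1.0, 1.0]))
```

For an eigenvector the error vanishes, and for any other point the error agrees with the closed formula of Corollary 1.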

Using this result, the idea is to perform a Monte Carlo procedure to obtain a large enough set of approximate eigenvectors of the Lipschitz operator T. It must be said that most of the effort in the design of Monte Carlo methods for operator diagonalization comes from quantum physics and stochastic analysis (Husslein et al. 1997; Lee et al. 2001; Williams 2010, 2013). In general, these approaches do not provide a fundamental framework to support a general methodology, since they focus on the actual computation of quantities with some physical or mathematical meaning. Consequently, we propose in what follows our own procedure, based on the functions defined just above.

We fix a suitable uniform value \(\epsilon \) to be accepted for the error committed when the operator is approximated by its diagonal value. We will use the resulting set of approximate eigenvectors in the computation of the McShane and Whitney representation formulas, as a substitute for the exact set of eigenvectors, which may not be computable. A typical example of this situation is the case in which the Lipschitz operator T is the sum of a linear diagonalizable operator L plus a perturbation P with small norm, \(T= L+P.\)

Therefore, we will base our method on a sampling procedure supported by probabilistic arguments, using uniform and Gaussian distributions and following the steps below.

  (1)

    We start by fixing a bounded set in which we will search for our approximate eigenvectors; if no additional information is known, we choose a product P of intervals in \({\mathbb {R}}^n\) centered at 0;  in case we know in advance that the eigenvectors are located in a particular set M,  we use it instead.

  (2)

    We use the uniform distribution to sample a starting set of vectors \(S_0.\) In case we have some previous knowledge on the set, we can introduce Bayesian procedures to fix a more accurate probability distribution \(\Psi \) for doing the sampling.

  (3)

    We compute the functional \(\varepsilon (x)\)—where \(\varepsilon (\cdot )\) is given by the error formula provided in Proposition 2—for all \(x \in S_0.\) Now, we consider the set \(S_1\) of all the points x that are most similar to an eigenvector in terms of having small \(\varepsilon (x)\). This can be done by selecting the points with smallest \(\varepsilon (x)\), for example, the best \(10 \%\) of the points of \(S_0.\)

  (4)

    Now, for every \(s \in S_1\), we start an iterative process. Sample a fixed number of points around s using a Gaussian distribution centered at s and with variance depending on the error \(\varepsilon (s),\) given by the density

    $$\begin{aligned} \psi (x)= \frac{1}{(\tau \, \varepsilon (s) \, \pi )^{n/2}} \, \exp \left( - \frac{\Vert s - x \Vert ^2}{\tau \, \varepsilon (s)} \right) , \end{aligned}$$

    where \(\tau \) is a fitting parameter. Select the point with the smallest \(\varepsilon (\cdot )\) and repeat this step a fixed number of times, changing the value of \(\tau \) if necessary. Taking into account that the only property we know about the function T is that it is Lipschitz, this distribution allows us to center the sampling near the points in which the diagonal error is small. By the Lipschitz inequality \(\Vert T(x)-T(s)\Vert \le K \Vert x-s\Vert ,\) T(x) and T(s) are controlled by the distance between x and s, so using the proposed distribution maximizes the probability of getting approximate eigenvectors in the sampling.
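The four steps above can be sketched in a few lines. The following Python version is only an illustration (the paper's implementation is in R and appears in Appendix A; the parameter names and the per-coordinate variance \(\tau \varepsilon (s)/2,\) which matches the density \(\psi,\) are our choices):

```python
import numpy as np

rng = np.random.default_rng(0)

def approx_eigenvectors(T, lo, hi, N=500, N0=100, N1=10, steps=10, tau=5.0):
    """Steps (1)-(4): uniform sample in the box [lo, hi], keep the N0 points
    with smallest diagonal error, then refine each by Gaussian resampling."""
    n = len(lo)
    def eps(x):
        lam = np.dot(T(x), x) / np.dot(x, x)
        return np.linalg.norm(T(x) - lam * x) / np.linalg.norm(x)
    S0 = rng.uniform(lo, hi, size=(N, n))
    S1 = sorted(S0, key=eps)[:N0]          # points with smallest error
    out = []
    for s in S1:
        for _ in range(steps):
            # Gaussian cloud centered at s; spread shrinks with the error
            cand = s + rng.normal(scale=np.sqrt(tau * eps(s) / 2 + 1e-12),
                                  size=(N1, n))
            best = min(cand, key=eps)
            if eps(best) < eps(s):
                s = best
        out.append(s)
    return np.array(out)
```

For instance, for the diagonal map \(T(v)=(2v_1,3v_2)\) the returned points concentrate near the coordinate axes, which are its eigenvector rays.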

The computations involved in the algorithm sketched above are easy to perform using R. To show concrete situations, we will present some numerical examples of functions \(T:{\mathbb {R}}^2 \rightarrow {\mathbb {R}}^2,\) since they allow a graphical representation. The algorithm used can be found in Appendix A of this work.

Example 3

Consider the parametric family of functions \(R_r: {\mathbb {R}}^2 \rightarrow {\mathbb {R}}^2,\) \(r \in {\mathbb {R}},\) given by the expression

$$\begin{aligned} R_r(x,y)=\Big ( 8x + r \cdot \sin (5xy), 4x^2 + 4xy + y^2 - 2x - \frac{1}{5} \, \sqrt{|x+y|} \Big ), \quad (x,y) \in {\mathbb {R}}^2. \end{aligned}$$

The calculations shown below follow the next scheme. We consider the domain subset \([-5,5] \times [-5,5].\) We start with \(N= 500\) initial points in the domain, which are obtained randomly. We choose the \(N_0=100\) best with respect to the error value. Using the distribution \(\psi \) written in step (4) of the algorithm with \(\tau =5\), for each of these points we generate \(N_1=10\) points around it, from which the best one is selected. We repeat this step 10 times.

Let us show the graphical representation of the results with three different values of the parameter r.

  • \(r=0.\) The sine term in the first coordinate is eliminated. This makes the example simpler, with an easy to understand representation of a suitable subset of approximate eigenvectors.

  • \(r=3.\) In this case, the sine term in the first coordinate causes an important perturbation, producing a more dispersed eigenvector structure.

  • \(r=-10.\) The sine term causes a stronger perturbation (of negative sign). The consequence is that the set of eigenvectors no longer follows (even approximately) clear lines.

Fig. 1

Left: first 500 random points with the 100 chosen points for \(r=0\). Right: final set of approximated eigenvectors

Fig. 2

Left: first 500 random points with the 100 chosen points for \(r=3\). Right: final set of approximated eigenvectors

Fig. 3

Left: first 500 random points with the 100 chosen points for \(r=-10\). Right: final set of approximated eigenvectors

These examples show that, although we cannot expect a diagonal distribution of eigenvectors for non-lattice Lipschitz maps, we can sometimes treat general Lipschitz operators as perturbations of lattice Lipschitz maps, in the case where some diagonal distribution of a set of approximate eigenvectors is still preserved. Since no clearly defined axes are given for such a set, we have to provide a technique for the generation of two straight lines (or n straight lines in the general case) that can play this role. How to do so will be shown in the next section.

4 General algorithm and examples

Let us show in this section how the extension/representation formulas (McShane and Whitney versions) that work for the case of lattice Lipschitz operators can be adapted for any Lipschitz operator \(T: S \subset X \rightarrow X\) in order to obtain an approximate functional expression for T [Definition 2, Proposition 1 and Proposition 2 in Arnau et al. (2023)]. In the case of a lattice Lipschitz operator L, if we fix S to be the union of the rays defined by a certain basis of eigenvectors of L for X,  both of these formulas give exact representations of L [Theorem 4 in Arnau et al. (2023)]. That is, the operator L coincides with both \((L|_S)^M\) and \((L|_S)^W.\) If S is an arbitrary subset of X, the McShane and Whitney formulas give approximations, for which the error expressions are known. These adapted formulas are

$$\begin{aligned} T^M(x)(w):=\bigvee \Big \{T(z)(w)- K(w) \vert x-z\vert (w): z \in S \Big \}, \quad x \in X, \end{aligned}$$

and

$$\begin{aligned} T^W(x)(w):=\bigwedge \Big \{T(z)(w)+K(w) \vert x-z\vert (w): z \in S \Big \}, \quad x \in X, \end{aligned}$$

where each w denotes the index of the corresponding element in the basis \({\mathcal {B}},\) and the function K(w) is the pointwise evaluation of the Lipschitz constant for each coordinate.

Therefore, the use of these formulas explicitly requires an order on the space. Indeed, the expression \(|s-v|(w)\) appearing in them is calculated using the order provided by a basis. In the next subsection we propose a method for defining such an order for the case of general Lipschitz maps.
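As an illustration of the adapted formulas, the following Python sketch (with hypothetical data; for simplicity the coordinates are taken in the canonical basis, so \(|x-z|(w)\) is just the coordinatewise absolute value) applies both formulas to a diagonal operator sampled on the rays of its eigenvector basis:

```python
import numpy as np

def lattice_mcshane(T_S, S, K, x):
    """For each coordinate w: sup over z in S of T(z)(w) - K(w)|x - z|(w).
    S: (m, n) sample points, T_S: (m, n) their images, K: (n,) constants."""
    return np.max(T_S - K * np.abs(x - S), axis=0)

def lattice_whitney(T_S, S, K, x):
    """For each coordinate w: inf over z in S of T(z)(w) + K(w)|x - z|(w)."""
    return np.min(T_S + K * np.abs(x - S), axis=0)

# Diagonal operator T(v) = (2 v_1, 3 v_2) sampled on the two axis rays
t = np.linspace(-5.0, 5.0, 41)
zeros = np.zeros_like(t)
S = np.vstack([np.column_stack([t, zeros]), np.column_stack([zeros, t])])
T_S = S * np.array([2.0, 3.0])
K = np.array([2.0, 3.0])

x = np.array([1.0, 2.0])
print(lattice_mcshane(T_S, S, K, x), lattice_whitney(T_S, S, K, x))
```

Here both formulas return (2, 6) = T(x), illustrating that for a lattice Lipschitz operator restricted to (a fine enough sample of) the rays of its eigenvector basis, the two extensions recover the operator.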

4.1 The Definition of the Order for the Approximate Representation of a Lipschitz Operator

As we are designing an approximation method, the procedure to find a good basis has to be related to the mass distribution of the set of approximate eigenvectors. The main idea consists in defining a partition of the set of approximate eigenvectors into n sets that are intended to describe the mass distribution. The centres of mass of the subsets of the partition give the n vectors of the basis necessary for the definition of the lattice order. Any clustering method could provide a technique for doing so. In this section we explain a method based on observing the mass distribution of the set of approximate eigenvectors obtained, using Principal Component Analysis of the point cloud defined by this set. Fix an operator \(T: {\mathbb {R}}^n \rightarrow {\mathbb {R}}^n.\) The proposed method follows the next steps, which give different solutions depending on the symmetry of the set of approximate eigenvectors that are obtained.

  (1)

    The direct case: if the approximate eigenvectors are distributed around a set of n vectors that are linearly independent, we take them as the adequate basis \({\mathcal {B}}.\)

  (2)

    Otherwise, we compute the PC (Principal Components) of the cloud of approximate eigenvectors of T that has been obtained. This technique is widely used for determining the main trends that can be detected in a point cloud [see for example (Abdi and Williams 2010; Jolliffe and Cadima 2016)]. We define the new (orthonormal) basis \({\mathcal {B}}\) using the PC. This is a candidate for being a good basis in case the symmetry of the sets that define the mass distribution of the eigenvectors coincides with the directions of the vectors provided by the PC. The main problem is that this method always gives an orthogonal basis, which, as we have seen, is not necessarily the best way to describe the eigenvector distribution, even if we have a lattice Lipschitz map.

  (3)

    Suppose now that the cloud of approximate eigenvectors is not oriented following the direction of any set of n vectors, but can be found in a particular region of space. In this case we consider the octants (or hyperoctants) defined by the orthogonal basis provided by the PCA. We will choose a new basis \({\mathcal {C}}\) defined by the vectors crossing these octants along their axes of symmetry. That is, take \(\sigma \) to be any of the elements of \(\{-1,1\}^n.\) We consider the vectors, expressed in the orthogonal basis provided by the PC, given by

    $$\begin{aligned} c= \frac{\sigma }{\Vert \sigma \Vert } = \frac{ \big (\sigma (1),...,\sigma (n) \big )}{n^{1/2}}. \end{aligned}$$

    For example, the first vector of \({\mathcal {C}}\) is \((1,1,1,...,1)/n^{1/2},\) and we choose \(n-1\) further vectors of this form to complete a basis.

In the spirit of fixing a concrete procedure for this article, we will follow the rule explained above for the definition of the lattice order in the next section. As we have said, this is not the only way to obtain such an order; in general, any rule for defining a suitable basis should depend on the symmetry of the problem.
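The basis selection of steps (2) and (3) can be sketched numerically. The following is a minimal Python/numpy illustration; the function names `pca_basis` and `diagonal_basis` are ours, not from the paper, and the sign matrix passed to `diagonal_basis` is assumed to be chosen with linearly independent rows (otherwise the result is not a basis).

```python
import numpy as np

def pca_basis(points):
    """Orthonormal basis of principal directions of a point cloud (rows = points)."""
    centered = points - points.mean(axis=0)
    # Right singular vectors = eigenvectors of the covariance matrix,
    # ordered by decreasing explained variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return vt  # rows are the principal components

def diagonal_basis(pc, signs):
    """Basis C of step (3): unit vectors along the symmetry axes of the
    hyperoctants, expressed back in the canonical coordinates.
    `signs` is an n x n array with rows in {-1, 1}^n."""
    n = pc.shape[0]
    c = signs / np.sqrt(n)   # rows are sigma / ||sigma||, in PC coordinates
    return c @ pc            # change of basis: PC coordinates -> canonical
```

For n = 2, passing `signs = [[1, 1], [1, -1]]` yields the two octant diagonals of the PCA frame, matching the construction of \({\mathcal {C}}\) above.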

4.2 Weakening the Lattice Lipschitz Inequality

Although the definitions of the lattice versions of the McShane and Whitney extensions provide accurate results for diagonal Lipschitz maps, we cannot expect such good behavior for operators that are not exactly lattice Lipschitz. If the mapping T is not diagonalizable or the set of axes is not exactly determined, the assumptions of Theorem 1 are not satisfied, so T may not be a lattice Lipschitz operator.

As a consequence, the associated function K(w) may be too large, and with it the error of the approximation.

The solution we present for this problem is to change the condition (1) to

$$\begin{aligned} \big | T(x) - T(y) \big |(i) \le K(i) \big ( (1 - \alpha ) | x - y |(i) + \alpha \cdot || x - y || \big ), \quad 1 \le i \le n, \end{aligned}$$
(3)

where \(0 \le \alpha \le 1\) and \(|| \cdot ||\) is a norm on \({\mathbb {R}}^n\). In terms of the order of the space E,  this reads

$$\begin{aligned} | T(x) - T(y) | \le K \big ( (1-\alpha ) | x - y | + \alpha || x - y || \cdot {\textbf {1}} \big ), \end{aligned}$$

where \(\textbf{1}\) denotes the constant function one on \(\Omega \).

Observe that if \(\alpha = 0\), condition (3) coincides with (1) and, if \(\alpha = 1\), it is equivalent to every coordinate function \(T_i\) of T being a real-valued Lipschitz function. In the case that \(T = L + P\) is the sum of a lattice Lipschitz operator L and a perturbation P, which we assume to have a small Lipschitz constant, T satisfies

$$\begin{aligned} | T(x) - T(y) | \le \left( \frac{K}{1-\alpha } + \frac{C}{\alpha } \right) \cdot \big ( (1-\alpha ) | x - y | + \alpha || x - y || \cdot {\textbf {1}} \big ), \end{aligned}$$

where K is the associated function of L and C is the Lipschitz constant of P. Note that, to control the function K of (1), the smaller the perturbation P, the smaller \(\alpha \) can be chosen.
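This bound can be verified directly, assuming (as is the case here) that K and C are nonnegative and that \(| v |(i) \le || v ||\) for every coordinate i:

```latex
$$\begin{aligned} | T(x) - T(y) |&\le | L(x) - L(y) | + | P(x) - P(y) | \le K | x - y | + C || x - y || \cdot {\textbf {1}} \\&= \frac{K}{1-\alpha } (1-\alpha ) | x - y | + \frac{C}{\alpha } \, \alpha || x - y || \cdot {\textbf {1}} \\&\le \left( \frac{K}{1-\alpha } + \frac{C}{\alpha } \right) \big ( (1-\alpha ) | x - y | + \alpha || x - y || \cdot {\textbf {1}} \big ). \end{aligned}$$
```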

If a function \(T: S \rightarrow {\mathbb {R}}^n\) defined on a subset S of \({\mathbb {R}}^n\) satisfies (3), we can also define the McShane and Whitney extensions as

$$\begin{aligned} T^M(x)(i)&:= \bigvee \big \{ T(z)(i) - K(i) \big ( (1-\alpha ) |x-z|(i) + \alpha || x - z || \big ) : z \in S \big \}, \\ T^W(x)(i)&:= \bigwedge \big \{ T(z)(i) + K(i) \big ( (1-\alpha ) |x-z|(i) + \alpha || x - z || \big ) : z \in S \big \}. \end{aligned}$$

Following the arguments presented in Arnau et al. (2023), we find that the error bounds become in this case

$$\begin{aligned} \begin{aligned} - 2 K \bigwedge \{ (1-\alpha ) | x - z | + \alpha || x - z || \cdot {\textbf {1}}: \, z \in S \} \le (T|_{S})^M(x) - T(x) \le 0 \\ 0 \le (T|_{S})^W(x) - T(x) \le 2 K \bigwedge \{ (1-\alpha ) | x - z | + \alpha || x - z || \cdot {\textbf {1}}: \, z \in S \}. \end{aligned} \end{aligned}$$
(4)

The idea now is to apply the previous method to approximate a function, using a small \(\alpha > 0\).
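A minimal numerical sketch of these weakened extensions, assuming the Euclidean norm and \(\alpha > 0\); the function names are ours. `best_K` computes the smallest coordinatewise K satisfying (3) on a finite sample (the formula used later in Sect. 4.3), and `mcshane_whitney` evaluates \(T^M\) and \(T^W\):

```python
import numpy as np

def best_K(T_vals, S, alpha):
    """Smallest coordinatewise K satisfying inequality (3) on the sample S.
    S: (m, n) array of points; T_vals: (m, n) array with T_vals[j] = T(S[j])."""
    K = np.zeros(S.shape[1])
    for j in range(len(S)):
        for l in range(j + 1, len(S)):
            d = S[j] - S[l]
            # Denominator of (3); positive for distinct points when alpha > 0.
            denom = (1 - alpha) * np.abs(d) + alpha * np.linalg.norm(d)
            K = np.maximum(K, np.abs(T_vals[j] - T_vals[l]) / denom)
    return K

def mcshane_whitney(x, T_vals, S, K, alpha):
    """Coordinatewise McShane (sup) and Whitney (inf) extensions at a point x."""
    w = (1 - alpha) * np.abs(x - S) + alpha * np.linalg.norm(x - S, axis=1)[:, None]
    TM = np.max(T_vals - K * w, axis=0)   # supremum formula T^M
    TW = np.min(T_vals + K * w, axis=0)   # infimum formula T^W
    return TM, TW
```

By construction \(T^M \le T^W\) and both coincide with T on S; Sect. 5 averages the two formulas, `(TM + TW) / 2`, as the final approximation.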

4.3 General Algorithm

Using the formulas explained above, the complete procedure consists of the following steps.

  (1)

    Fix \(n \in {\mathbb {N}}\) and an operator \(T: \mathbb {R}^n \rightarrow \mathbb {R}^n\) that one wants to reconstruct from its eigenvectors. If the set of eigenvectors is known, sample some points in it, calculate the value of T at them and go to the next step. Otherwise, approximate the eigenvector set by a Monte Carlo procedure based on diagonal error minimization, as explained in Sect. 3. Call S the set of (approximate) eigenvectors.

  (2)

    To fix the order on \({\mathbb {R}}^n\), use the PCA-based method described in Sect. 4.1 (other procedures are also possible).

  (3)

    Fix a small \(\alpha > 0\) (for example \(\alpha = 0.1\)) and compute the best possible K(w) using the formula

    $$\begin{aligned} K(w) = \max \left\{ \dfrac{ | T(x) - T(y) |(w) }{ (1 - \alpha ) | x - y |(w) + \alpha \cdot || x - y || }: \, x, y \in S, \, x \ne y \right\} , \end{aligned}$$

    and the McShane and Whitney lattice-type formulas associated to the order provided in the previous step.

  (4)

    Interpolation with these formulas provides a(n approximate) representation of the original operator T. We control the error committed by these formulas using the error associated to the Lipschitz inequality and the formulas in (4), together with the diagonal error of the approximation of the set of eigenvectors.

5 A Numerical Example

Let us show how the method explained in Sect. 4.2 works in a concrete numerical example. Let \(E = {\mathbb {R}}^2\) and consider the function \(f: {\mathbb {R}}^2 \rightarrow {\mathbb {R}}^2\) given by \((x,y) \mapsto (x+y, x-y)\) (a diagonal map, and a lattice Lipschitz operator) with a small perturbation:

$$\begin{aligned} f(x,y) = \left( x + y + \frac{1}{5} \sin (10x) + \frac{xy}{100}, x - y - \frac{1}{10} \cos (x+5y) \right) . \end{aligned}$$

First of all, we approximate the set of eigenvectors with the Monte Carlo method explained above. Let \(S_0\) be a sample of 250 points drawn from a uniform distribution on the set \(P = [-5,5] \times [-5,5]\). We compute the diagonal error \(\varepsilon (\cdot )\) at each point of \(S_0\) and select the 50 of them with smallest error (\(20\%\) of the points in \(S_0\)). Write \(S_1\) for this set. Now, for each element x in \(S_1\), we start an iterative process to find a set of approximate eigenvectors as explained before. After 5 iterations, the result (the set \(S_2\)) is plotted in Fig. 4.
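A hedged Python sketch of this search follows. The paper's diagonal error \(\varepsilon\) is defined in Sect. 3 and is not reproduced here; as a stand-in we use the distance from T(x) to the line spanned by x (the classical approximate-eigenvector residual), and an accept-if-better random local search for the iterative refinement. Both choices are our assumptions, not the paper's exact procedure.

```python
import numpy as np

def eps(T, x):
    """Stand-in 'diagonal error': distance from T(x) to the line spanned by x.
    Assumed proxy; the definition in Sect. 3 of the paper may differ."""
    u = x / np.linalg.norm(x)
    Tx = T(x)
    return np.linalg.norm(Tx - (Tx @ u) * u)

def refine(T, S1, iters=5, step=0.1, rng=None):
    """Accept-if-better random local search decreasing the diagonal error."""
    if rng is None:
        rng = np.random.default_rng(0)
    S = [np.array(x, dtype=float) for x in S1]
    for _ in range(iters):
        for k, x in enumerate(S):
            cand = x + step * rng.standard_normal(x.shape)
            # Keep the perturbed point only if it strictly improves the error.
            if np.linalg.norm(cand) > 1e-9 and eps(T, cand) < eps(T, x):
                S[k] = cand
    return np.array(S)
```

With this, the example's pipeline reads: sample \(S_0\) uniformly on P, keep the \(20\%\) of points with smallest \(\varepsilon\) to form \(S_1\), and run `refine` to obtain \(S_2\).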

Fig. 4

On the left, first approach to the set of eigenvectors, \(S_0\) in gray and \(S_1\) in black. The set \(S_2\) obtained after 5 iterations can be seen on the right

In the next step we choose the axes that will define the new lattice structure for E. In this case, we apply Principal Component Analysis (PCA) (Abdi and Williams 2010). The resulting new axes can be seen in Fig. 5. These axes “look good” because their distribution is similar to that of the true eigenvectors.

Fig. 5

The new axes resulting from PCA; in red, the first principal component and, in blue, the second one

The final step is to compute the McShane and Whitney formulas (with \(\alpha = 0.1\) and the Euclidean norm \(|| \cdot ||\)), using the points of \(S_2\) and the order given by the orthonormal basis provided by the PCA. The best function K is in this case

$$\begin{aligned} K(1) = 2.24, \quad K(2) = 3.26. \end{aligned}$$

Observe that if the norm modification is not included in the lattice Lipschitz inequality (\(\alpha = 0\)), the best possible K is much larger: \(K(1) = 16.2, K(2) = 220.4\), which would cause a worse approximation. The approximation, computed as the mean value of the McShane and Whitney formulas,

$$\begin{aligned} \widehat{f}(x,y) = \dfrac{f^M(x,y) + f^W(x,y)}{2}, \end{aligned}$$

can be seen in Fig. 6.

Fig. 6

Approximation \(\widehat{f}\) (in blue) of f (in orange)—left: first component, right: second component—

In order to compare our approximation with the original f, we compute the error using a Monte Carlo procedure, obtaining

$$\begin{aligned} \frac{1}{\mu (P)} \big \Vert \big ( \Vert \widehat{f} - f \Vert \big ) \big \Vert _2 = \frac{1}{100} \left( \int _P \Vert \widehat{f} - f \Vert ^2 dx \right) ^\frac{1}{2} \approx 0.65. \end{aligned}$$

The pointwise error is also bounded; using the formulas in (4) we obtain

$$\begin{aligned}{} & {} \big | \widehat{f}(x,y) - f(x,y) \big |(i)\\{} & {} \le K(i) \bigwedge _{(z,t) \in S_2} \Big ( 0.9 (|x-z|,|y-t|) (i) + 0.1 \Vert (x,y) - (z,t) \Vert _2 \Big ), \end{aligned}$$

for each component, \(i = 1, 2\).
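The Monte Carlo error estimate above can be reproduced with a short routine; a sketch under the assumption of uniform sampling on the box P (the function name `mc_error` is ours):

```python
import numpy as np

def mc_error(f, f_hat, low, high, n=20000, rng=None):
    """Monte Carlo estimate of (1/mu(P)) * (integral_P ||f_hat - f||^2 dx)^(1/2)
    on the box P with corner coordinates `low` and `high`."""
    if rng is None:
        rng = np.random.default_rng(0)
    low, high = np.asarray(low, float), np.asarray(high, float)
    X = rng.uniform(low, high, (n, len(low)))       # uniform sample on P
    vol = np.prod(high - low)                       # mu(P)
    sq = np.array([np.sum((f_hat(x) - f(x)) ** 2) for x in X])
    # MC estimate of the integral is vol * mean of the squared norms.
    return np.sqrt(vol * sq.mean()) / vol
```

For the example above one would call it with `low = [-5, -5]`, `high = [5, 5]`, so that `vol` equals \(\mu (P) = 100\).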

6 Conclusions

The notion of lattice Lipschitz operator in finite-dimensional normed spaces has been introduced to provide a suitable set of Lipschitz-type operators that can be used for the design of approximation algorithms. Since every lattice Lipschitz operator admits a diagonal representation, the family of functions to which this approximation method applies consists of nonlinear functions that admit an “almost diagonal” representation.

This requires, first, an approximation method to find the “almost eigenvectors” of the objective function and, in a second step, the determination of a good set of lattice Lipschitz maps that can be used as an approximation family for it. We propose a concrete algorithm and show with an example how it works, taking into account the measure of the error made when using the approximation, whose formulas have also been obtained in the paper.