Abstract
We establish new properties of inhomogeneous spin q-Whittaker polynomials, which are symmetric polynomials generalizing \(t=0\) Macdonald polynomials. We show that these polynomials are defined in terms of a vertex model, whose weights come not from an R-matrix, as is often the case, but from other intertwining operators of \(U'_q({\widehat{\mathfrak {sl}}}_2)\)-modules. Using this construction, we are able to prove a Cauchy-type identity for inhomogeneous spin q-Whittaker polynomials in full generality. Moreover, we are able to characterize spin q-Whittaker polynomials in terms of vanishing at certain points, and we find interpolation analogues of q-Whittaker and elementary symmetric polynomials.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
1.1 Overview
Inhomogeneous spin q-Whittaker polynomials are symmetric polynomials generalizing the classical q-Whittaker functions (specialization of Macdonald symmetric functions at \(t=0\)) by adding two sequences of parameters \({\mathcal {A}}=(a_0, a_1, \dots )\) and \({\mathcal {B}}=(b_0, b_1, \dots )\). In full generality they were recently constructed in [4] using integrable lattice models, and some particular degenerations were described earlier in [10, 29].
One of the main features of the spin q-Whittaker polynomials can be summarized as follows: while the new parameters \({\mathcal {A}}, {\mathcal {B}}\) are added in a non-trivial way, the resulting functions share several defining algebraic-combinatorial properties with the classical q-Whittaker functions. Namely, there exist spin q-Whittaker analogues of the branching rule and the (skew) Cauchy identity. Moreover, there also exists an analogue of the dual Cauchy identity, which involves spin q-Whittaker polynomials and spin Hall–Littlewood functions: a generalization of Hall–Littlewood functions constructed from integrable lattice models in [7, 8]. Other known properties of spin q-Whittaker polynomials include formulae for explicit action of certain first order difference operators [29], and applications to various stochastic models from integrable probability [24, 29].
The above properties were proved using an explicit construction of inhomogeneous spin q-Whittaker polynomials in terms of an integrable vertex model, called the q-Hahn vertex model. In particular, the Cauchy and dual Cauchy identities follow directly from appropriate modifications of the Yang–Baxter equation using zipper or train argument. This idea is not new: vertex model constructions and Yang–Baxter equations were used to study numerous other special functions, examples can be found in [1, 3, 7,8,9, 11, 22, 44, 46]. However, there is an important difference distinguishing the q-Hahn vertex model among the other integrable vertex models. Usually, integrable vertex models originate from matrix coefficients of R-matrices acting on specific representations of quantum groups, and the Yang–Baxter equation for vertex models is a reformulation of the corresponding equation for quantum groups. But the general q-Hahn vertex model cannot be presented in such a fashion, and in [4] somewhat different combinatorial methods were used to show integrability of this model and deduce properties of inhomogeneous spin q-Whittaker polynomials.
In this work we establish several new properties of the spin q-Whittaker polynomials that can be grouped in two clusters. The first group is focused around the integrability of the q-Hahn model. It turns out that, while the q-Hahn vertex model does not come from an R-matrix of a quantum group, it still has a quantum group interpretation, where instead of the R-matrix we consider different intertwining operators between tensor products of certain representations of \(U'_q(\widehat{\mathfrak {sl}}_2)\). Unlike the R-matrix, these intertwining operators change the tensor factors instead of just permuting them. The integrability and the (modified) Yang–Baxter equations for the vertex model follow immediately from this observation, and using this new point of view we can finish the proof of the Cauchy identity for the spin q-Whittaker polynomials, which was partially done in [4] but was not completed in full generality.
The second group of our new results reveals an unexpected connection of spin q-Whittaker polynomials with another area of the theory of symmetric functions: interpolation symmetric functions. These are inhomogeneous deformations of classical symmetric functions which can be characterized by vanishing at specific points. The main known classes of interpolation symmetric functions are factorial Schur functions and interpolation Macdonald functions.Footnote 1 The former ones are well-studied and their interpolation properties have appeared in the contexts of Capelli identities, multiplicity free spaces and asymptotic representation theory, see [18, 36, 41]. The interpolation Macdonald functions are more complicated and still somewhat mysterious, however a number of nice combinatorial properties is known about them and their degenerations, see [15, 26, 31, 32, 35, 38, 42]. Moreover, there exist more general type \(BC_n\) interpolation Macdonald polynomials, which are Laurent symmetric polynomials with connections to Koornwinder polynomials and applications to multivariate analogues of q-hypergeometric transformations, see [23, 33, 39, 40].
In [34] it was shown that, under a certain constraint, all interpolation symmetric polynomials of interest generally fall in one of three classes: factorial monomial functions (which were considered trivial), factorial Schur functions and (type \(BC_n\)) interpolation Macdonald functions. However, we show that inhomogeneous spin q-Whittaker polynomials also can be characterized by vanishing at specific points. Moreover, by specializing inhomogeneous spin q-Whittaker polynomials, one can obtain new symmetric functions which should fill the place of interpolation q-Whittaker and interpolation elementary functions. These new interpolation symmetric functions depend on a countable family of parameters (similarly to the factorial Schur functions and unlike the interpolation Macdonald functions) and both do not naturally arise via the interpolation Macdonald functions. Finally, we are able to find and prove a parallel version of the classification from [34], which identifies our new interpolation functions as unique functions satisfying a constraint similar to the one from [34].
Overall, our new results give two descriptions of inhomogeneous spin q-Whittaker polynomials: an improved constructions in terms of the vertex model from [4] and a characterization in terms of vanishing at specific points. Combination of these two properties is rare: so far and to the best of our knowledge the only other symmetric functions simultaneously having both vertex model description and a vanishing property were factorial Schur functions,Footnote 2Footnote 3 We still don’t know if this situation is an artifact specific, for some reason, to the Schur and q-Whittaker levels, or it might indicate existence of yet uncovered even more general symmetric functions with similar properties, but both possibilities look exciting for us.
Below we briefly state the main results of this work.
1.2 q-Hahn vertex model and intertwining operators
Inhomogeneous spin q-Whittaker polynomials \({\mathbb {F}}_{\lambda }(x_1, \dots , x_n\mid {\mathcal {A}}, {\mathcal {B}})\) are defined as partition functions of a vertex model consisting of a square grid of vertices with weights
Here i, j are coordinates of the vertex, \((x;q)_k\) denote the q-Pochhammer symbol recalled in Sect. 1.5 below, \(a_i, b_i\) are parameters from \({\mathcal {A}},{\mathcal {B}}\), and \(I,J,K,L\in {\mathbb {Z}}_{\ge 0}\) are integers, representing a configuration of edges around the vertex. Note that these weights are orthogonality weights for q-Hahn polynomials, so they are traditionally called the q-Hahn weights. The exact definition of the spin q-Whittaker polynomials \({\mathbb {F}}_{\lambda }(x_1, \dots , x_n\mid {\mathcal {A}}, {\mathcal {B}})\) via such weights is given in Sect. 5 of the text.
Our first main result states that q-Hahn vertex weights are in fact matrix coefficients of an isomorphism between two generically irreducible representations of the affine quantum algebra \(U'_q(\widehat{\mathfrak {sl}}_2)\). The representations in question are tensor products of the evaluation modules \(V(s)_z\), which are induced from the irreducible \(U_q(\mathfrak {sl}_2)\)-module V(s) with highest weight s along the evaluation map \(\textrm{ev}_z:U'_q(\widehat{\mathfrak {sl}_2})\rightarrow U_q(\mathfrak {sl}_2)\). As in the classical \(\mathfrak {sl}_2\) case, representations \(V(s)_z\) have a natural countable basis \(|v_I\rangle \) consisting of weight vectors.
Theorem A
(Theorem 3.1 in the text) For generic parameters \(a_1,a_2,b_1,b_2\) the representations \(V(a_1/b_1)_{a_1b_1}\otimes V(a_2/b_2)_{a_2b_2}\) and \(V(a_1/b_2)_{a_1b_2}\otimes V(a_2/b_1)_{a_2b_1}\) are irreducible and isomorphic, with the isomorphism \(W^b\) given explicitly by
The isomorphism between the representations \(V(a_1/b_1)_{a_1b_1}\otimes V(a_2/b_2)_{a_2b_2}\) and \(V(a_1/b_2)_{a_1b_2}\otimes V(a_2/b_1)_{a_2b_1}\) is not new: one can show that for generic parameters both \(V(a_1/b_1)_{a_1b_1}\otimes V(a_2/b_2)_{a_2b_2}\) and \(V(a_1/b_2)_{a_1b_2}\otimes V(a_2/b_1)_{a_2b_1}\) are irreducible and have the same highest l-weight \(\frac{(a_1^{-1}-a_1u)(a_2^{-1}-a_2u)}{(b_1^{-1}-b_1u)(b_2^{-1}-b_2u)}\) (l-weights are rational functions in u, which extend the usual weights to \(U'_q(\mathfrak {sl}_2)\)-modules, cf. [45]). The nontriviality of our result is the explicit form for matrix coefficients of this isomorphism, which have a surprisingly simple form identical to the q-Hahn weights. Our result can also be extended to the higher rank case, see Theorem 3.1 in the main text for the \(U'_q(\widehat{\mathfrak {sl}}_n)\) version.
We can use the expression for the isomorphism \(W^b\) above to deduce two other results. First, we can obtain an explicit expression for the fully fused R-matrix \(R:V(a_1/b_1)_{a_1b_1}\otimes V(a_2/b_2)_{a_2b_2}\rightarrow V(a_2/b_2)_{a_2b_2}\otimes V(a_1/b_1)_{a_1b_1}\), reproducing in a new way the results from [5, 28], see Proposition 3.2 in the text. Another consequence is the following Cauchy identity for spin q-Whittaker polynomials \({\mathbb {F}}_{\lambda }(x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\):
Theorem B
(Theorem 5.8 in the text) The following equality of formal power series in x, y, q holds:
where the sum is over all partitions \(\lambda \).
Here \({\mathbb {F}}^*_{\lambda }(y_1, \dots , x_n\mid {\overline{{\mathcal {B}}}},{\overline{{\mathcal {A}}}})\) is an explicit renormalization of spin q-Whittaker polynomials, with parameters \(a_i,b_i\) swapped and inverted. The proof of the Cauchy identity is based on an algebraic identity, obtained by choosing specific parameters \(s_i, s_i', z_i, z_i'\) and constructing an isomorphism \(V(s_1)_{z_1}\otimes V(s_2)_{z_2}\otimes V(s_3)_{z_3}\cong V(s'_1)_{z'_1}\otimes V(s'_2)_{z'_2}\otimes V(s'_3)_{z'_3}\) of irreducible representations in two different ways, using our explicit expressions for isomorphisms \(W^b\) and R.
1.3 Vanishing properties and interpolation characterization
For this group of results the starting point is the following vanishing property of inhomogeneous spin q-Whittaker polynomials \({\mathbb {F}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\):
This vanishing property is new, and it leads to an alternative characterization of the inhomogeneous spin q-Whittaker polynomials. Let
and let \({\mathcal {G}}^n_{0}\subset {\mathcal {G}}^n_{1}\subset {\mathcal {G}}^n_{2}\subset \dots \) denote a certain deformation of the natural filtration of the algebra of symmetric functions in n variables, which depends on parameters \(b_1,b_2, \dots \) from \({\mathcal {B}}\) and is defined before Theorem 5.12 in the text.
Theorem C
(Theorem 5.12 in the text) For each partition \(\lambda \in {\mathbb {Y}}^n\) the spin q-Whittaker polynomial \({\mathbb {F}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\) is uniquely characterized, up to a scalar, by the following properties:
-
1.
\({\mathbb {F}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\in {\mathcal {G}}^{n}_{|\lambda |}\).
-
2.
For any partition \(\mu \) such that \(|\mu |\le |\lambda |\) and \(\mu \ne \lambda \) we have \({\mathbb {F}}_\lambda ({\textbf{x}}^n_{{\mathcal {A}}}(\mu )\mid {\mathcal {A}},{\mathcal {B}})=0\).
-
3.
\({\mathbb {F}}_\lambda ({\textbf{x}}^n_{{\mathcal {A}}}(\lambda )\mid {\mathcal {A}},{\mathcal {B}})\ne 0\).
For the normalization we use the values \({\mathbb {F}}_\lambda ({\textbf{x}}^n_{{\mathcal {A}}}(\lambda )\mid {\mathcal {A}},{\mathcal {B}})\) from (3) can be found explicitly using the vertex model description and they are products of q-Pochhammer symbols, see Proposition 5.10 in the text. Note that in the characterization above the parameters \(a_i\) from \({\mathcal {A}}\) determine the interpolation points, while the dependence on the second family \({\mathcal {B}}\) is hidden in the subspaces \({\mathcal {G}}^n_{k}\).
These vanishing and characterization properties closely resemble the defining properties of interpolation symmetric polynomials, which are families of symmetric polynomials \(F_{\lambda }(x_1, \dots , x_n)\) satisfying
-
(i)
\(\deg F_\lambda (x_1, \dots , x_n)\le |\lambda |\);
-
(ii)
\(F_{\lambda }(\mho (\mu ))=0\) unless \(\lambda \subset \mu \);
-
(iii)
\(F_{\lambda }(\mho (\lambda ))\ne 0\)
for some family \(\mho (\mu )\) of n-dimensional points enumerated by partitions \(\mu \). Property (ii), called the vanishing property, is the non-trivial one: it overdefines the functions \(f_{\lambda }\) and makes the existence of these functions for a given family \(\mho \) exceptional. The factorial Schur functions and the interpolation Macdonald functions are examples of such functions \(F_\lambda \) for certain choices of \(\mho \), in both these examples \(\mho \) is of the form
for some functions \(f_1, \dots , f_n\).
The spin q-Whittaker functions \({\mathbb {F}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\) almost satisfy the properties of interpolation symmetric functions \(F_\lambda \), but in the first property we have the function spaces \({\mathcal {G}}^n_k\) instead of the natural degree filtration of the algebra of symmetric polynomials. However, when the parameters \(b_i\) from \({\mathcal {B}}\) tend to \(\infty \), the subspaces \({\mathcal {G}}^n_{k}\) degenerate to this natural filtration, and in this way we can find two degenerations of the spin q-Whittaker functions, which satisfy properties (i)–(iii) for certain choices of \(\mho \). Namely, when \(b_i\rightarrow \infty \) we get functions \({\widetilde{{\mathbb {F}}}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}},\infty )\), which satisfy the interpolation properties (i)–(iii) with
We call these functions interpolation q-Whittaker polynomials, as their top homogeneous components coincide with the usual q-Whittaker polynomials.
The other degeneration is obtained by setting \(b_i=\infty \) and considering the following limit regime:
As a result we obtain symmetric polynomials \({\mathbb {F}}^{el}(r_1, \dots , r_n\mid {\mathcal {C}}, \infty )\) in \(r_1, \dots , r_n\), depending on a family of parameters \({\mathcal {C}}=(c_1, c_2, \dots )\) and satisfying an interpolation property with
Moreover, the top homogeneous component of \({\mathbb {F}}^{el}(r_1, \dots , r_n\mid {\mathcal {C}}, \infty )\) coincides with the elementary symmetric polynomial \(e_{\lambda '}(r_1, \dots , r_n)\) for the conjugated partition \(\lambda '\), so we call this second degeneration elementary interpolation polynomials.
As a final result of our work we show that these two degenerations, interpolation q-Whittaker polynomials and elementary interpolation polynomials, exhaust all interpolation symmetric polynomials for \(\mho (\mu )\) of the form
Note that each part depends on the difference \(\mu _i-\mu _{i+1}\), unlike all previously known interpolation symmetric functions where coordinates depend on single parts \(\mu _i\). A precise result is stated in Theorem 6.2 of the text, below we give its brief reformulation:
Theorem D
(Theorem 6.2 in the text) Assume that \(n\ge 3\) and \(\mho (\mu )\) is a collection of interpolation points of the form
Then the interpolation polynomials \(F_\lambda \) satisfying the properties (i)–(iii) exist only if either
for some \(c,q, a_1, a_2, \dots , a_n\), or if
for some \(d, c_1, c_2, \dots , c _n\). In these cases they coincide with the interpolation q-Whittaker polynomials and the elementary interpolation polynomials, respectively.
1.4 Layout of the paper
In Sect. 2 we recall definitions and necessary properties of the affine quantum algebra \(U'_q({\widehat{\mathfrak {sl}}}_{n+1})\) and its representations \(V(s)_z\). In Sect. 3 we introduce explicit expressions for intertwining operators between tensor products \(V(s_1)_{z_1}\otimes V(s_2)_{z_2}\), and use them to deduce equations needed for the later parts. Section 4 is devoted to row operators and exchange relations, which are a convenient technical tool for working with the vertex model description of the spin q-Whittaker polynomials. In Sect. 5 we collect all results about spin q-Whittaker polynomials, proving the Cauchy identity and the vanishing property. In the same section we also describe degenerations of spin q-Whittaker functions: interpolation q-Whittaker and elementary functions. In Sect. 6 we discuss the general classification for interpolation symmetric polynomials, and prove the version covering interpolation q-Whittaker and elementary functions.
1.5 Notation
Throughout the work we treat \(q^{\frac{1}{2}}\) either as a fixed transcendental complex number such that \(|q^{\frac{1}{2}}|<1\), or as a formal variable. In both cases we set \(q=(q^{\frac{1}{2}})^2\) and
where \(m,n\in {\mathbb {Z}}_{\ge 0}\). We define the q-Pochhammer symbol by
Let \((x;q)_{\infty }:=\lim _{n\rightarrow \infty }(x;q)_n\), which is well-defined both when q is a number such that \(|q|<1\) and when q is a formal variable. Note that we can express the q-binomial coefficients in the following ways
allowing us to extend the definition to the case when either n or m are negative, but not both simultaneously.
A partition \(\lambda \) is an infinite sequence \((\lambda _1,\lambda _2,\lambda _3,\dots )\) of nonnegative integers such that
and all but finitely many of \(\lambda _i\) are equal to 0. Sometimes we omit a tail consisting of zeroes, writing \((\lambda _1,\dots ,\lambda _n)\) instead of \((\lambda _1,\dots ,\lambda _n,0,0,\dots )\). The coordinates \(\lambda _i\) are called the parts of the partition \(\lambda \), the number of nonzero parts \(\lambda _i\) is the length \(l(\lambda )\) of \(\lambda \) and the weight is defined by \(|\lambda |:=\lambda _1+\lambda _2+\cdots \). We use \({\mathbb {Y}}^n\) to denote the set of partitions of length at most n. For a pair of partitions \(\lambda ,\mu \), we write \(\mu \subset \lambda \) if and only if for any \(i\ge 1\) we have \(\mu _i\le \lambda _i\). Furthermore, we say the partition \(\lambda \) interlaces the partition \(\mu \), and write \(\lambda \succ \mu \), if \(\lambda _i\ge \mu _i\ge \lambda _{i+1}\) for all \(i\ge 1\). Note that in this case we also have \(0\le l(\lambda )-l(\mu )\le 1\).
Whenever we claim that a statement holds for generic complex parameters \(a_1,\dots , a_n\), we mean that there exists a countable collection of non-constant polynomials \(F_i\in {\mathbb {C}}[x_1, \dots , x_n]\) such that the statement holds for all \((a_1, \dots , a_n)\in {\mathbb {C}}^n\) satisfying \(F_i(a_1, \dots , a_n)\ne 0\) for all i. In particular, the set of such \((a_1, \dots , a_n)\) is dense in \({\mathbb {C}}^n\).
2 Quantum affine algebra \(U_q'({\widehat{\mathfrak {sl}}}_{n+1})\) and representations \(V(s)_z\)
In this section we describe the necessary background on the quantum affine algebra \(U_q'({{\widehat{\mathfrak {sl}}}}_{n+1})\) and define the representations \(V(s)_z\), which play the main role in Sect. 3. Throughout this section \(q^{\frac{1}{2}}\) is assumed to be a transcendental complex number, though all results are still valid when \(q^{\frac{1}{2}}\) is a formal variable. References for the material in this section are [13, Chapter 12], [14, 45].
2.1 Quantum affine algebra \(U'_q({\widehat{\mathfrak {sl}}}_{n+1})\)
Let \({\mathfrak {h}}\) denote the Cartan subalgebra of \(\mathfrak {sl}_{n+1}\), which we identify with its dual \({\mathfrak {h}}^\vee \) via the Killing form \((\cdot , \cdot )\). Let \(\alpha _i\) denote the simple roots of \(\mathfrak {sl}_{n+1}\), and \(C_{i,j}\) denote the corresponding Cartan matrix. We enumerate the nodes \(I=\{1, \dots , n\}\) of the Dynkin diagram in a way such that \(C_{i,i}=2\) and \(C_{i,i-1}=C_{i-1,i}=-1\), while \(C_{i,j}=0\) for \(|i-j|>1\). We also assume that \((\alpha _i, \alpha _i)=2\), so the simple coroots \(\alpha _i^{\vee }\) coincide with the roots \(\alpha _i\). We use Q (\(Q^+\) and \(Q^-\)) to denote the \({\mathbb {Z}}\)-span (\({\mathbb {Z}}_{\ge 0}\)-span and \({\mathbb {Z}}_{\le 0}\)-span correspondingly) of the simple roots \(\alpha _i\), while \(P:=\{\mu \in {\mathfrak {h}} \mid (\mu , \alpha _i)\in {\mathbb {Z}}\}\) denotes the lattice of integral weights.
Let \(\theta =\alpha _1+\alpha _2+\dots +\alpha _n\) be the maximal positive root, \({\widehat{I}}:=\{0, 1, \dots , n\}\) and \({\widehat{C}}_{i,j}\) be the extended Cartan matrix
The quantum affine algebra \(U'_q({\widehat{\mathfrak {sl}}}_{n+1})\)Footnote 4 is the unital associative algebra over \({\mathbb {C}}\) with generators \(\{k_i^{\pm 1}\}_{i\in {{\widehat{I}}}}\), \(\{x_i^{\pm }\}_{i\in {{\widehat{I}}}}\) and relations
The last relation is called the (quantum) Serre relation. The algebra \(U'_q({\widehat{\mathfrak {sl}}}_{n+1})\) has a Hopf algebra structure which is given by
There is another presentation of \(U'_q({\widehat{\mathfrak {sl}}}_{n+1})\), introduced by Drinfeld [17]. In this presentation the algebra is generated by \(c^{\pm 1}\), \(\{k_i^{\pm 1}\}_{i\in I}\), \(\{h_{i,m}\}_{i\in I, m\in {\mathbb {Z}}\backslash \{0\}}\), \(\{x^{\pm }_{i,m}\}_{i\in I, m\in {\mathbb {Z}}}\), with the relations
Here \(\phi ^{\pm }_{i,m}\) are defined as coefficients in the formal power series
and we set \(\phi ^{\pm }_{i,\pm m}=0\) if \(m<0\). The last relation in (2.2) is taken for every pair of distinct integers \((i,j)\in I^2\) and for every integer sequence \(m_1, \dots , m_{1-C_{i,j}}\), while the first summation is over all permutations \(\pi \) of \(1-C_{i,j}\) elements.
The two presentations of \(U_q'({\widehat{\mathfrak {sl}}}_{n+1})\) above are related as follows: \(\{k_i^{\pm 1}\}_{i\in I}\) denote the same elements in both presentations, \(x^{\pm }_{i,0}=x^{\pm }_i\) for \(i\in I\) and \(c=k_0k_1k_2\dots k_n\). The generators \(x^{\pm }_0\) are given in terms of the generators from Drinfeld’s presentation using the following expressions, cf. [17, 21]:
where we use the notation \([A,B]_u=AB-uBA\).
The subalgebra of \(U_q'({\widehat{\mathfrak {sl}}}_{n+1})\) generated by \(\{k^{\pm 1}_i\}_{i\in I}\), \(\{x^{\pm }_i\}_{i\in I}\) is a Hopf subalgebra isomorphic to the quantized envelo** algebra \(U_q(\mathfrak {sl}_{n+1})\), so any \(U'_q({\widehat{\mathfrak {sl}}}_{n+1})\)-module has also a structure of a \(U_q(\mathfrak {sl}_{n+1})\)-module. We can define a Q-grading on \(U_q'({\widehat{\mathfrak {sl}}}_{n+1})\) and \(U_q({\mathfrak {sl}}_{n+1})\) by setting
Let \(U^{+}\) (resp. \(U^-\) and \(U^0\)) be the subalgebra of \(U_q(\mathfrak {sl}_{n+1})\) generated by \(\{x_i^{+}\}_{i\in I}\) (resp. \(\{x_i^{-}\}_{i\in I}\) for \(U^-\) and \(\{k^{\pm 1}_i\}_{i\in I}\) for \(U^0\)). Similarly, let \({\widehat{U}}^{+}\) (resp. \({\widehat{U}}^{-}\) and \({\widehat{U}}^0\)) be the subalgebra of \(U'_q({\widehat{\mathfrak {sl}}}_{n+1})\) generated by \(\{x_{i,m}^{+}\}_{i\in I, m\in {\mathbb {Z}}}\) (resp. \(\{x_{i,m}^{-}\}_{i\in I, m\in {\mathbb {Z}}}\) and \(\{k^{\pm 1}_i, h_{i,m}\}_{i\in I, m\in {\mathbb {Z}}\backslash \{0\}}\)). The following factorizations are known [13]:
where we use \(U_1.U_2:=\{u_1u_2\mid u_1\in U_1, u_2\in U_2\}\).
There exists an involution of \(U_q'({\widehat{\mathfrak {sl}}}_{n+1})\), which is denoted by \({\widehat{\omega }}\) and is defined by
One can check, using (2.3) and (2.4) that \({{\widehat{\omega }}}(x_0^\pm )=-q^{\mp \frac{n+1}{2}}x_0^{\mp }\). Moreover, \({\widehat{\omega }}\) is a coalgebra anti-automorphism:
For a \(U'_q({\widehat{\mathfrak {sl}}}_{n+1})\)-module V we use \(V^{\omega }\) to denote the pullback of V through \(\omega \).
Note that we have not listed the expressions for the coproduct of the Drinfeld generators. In fact, explicit formulae for \(\Delta x^\pm _{i,m}\) and \(\Delta h_{i,m}\) are not known, but partial expressions are available in the case of the quantum loop algebra \(U_q(L{\mathfrak {sl}}_{n+1})=U_q'({\widehat{\mathfrak {sl}}}_{n+1})/(c-1)\), see [12, Proposition 1.2] and references therein.
Remark 2.1
In the definitions of the quantum algebras we choose to use \(q^{\frac{1}{2}}\) instead of q. Our reasoning behind this choice will be clear in Sect. 5, where our q will match the parameter q of the q-Whittaker and Macdonald symmetric functions.
2.2 Highest l-weight modules
To study representations of \(U_q(\mathfrak {sl}_{n+1})\) and \(U_q'({\widehat{\mathfrak {sl}}}_{n+1})\) it is convenient to reintroduce the notions of weights, weight spaces and highest weight representations in the quantum affine setting. For a \(U_q(\mathfrak {sl}_{n+1})\)-module V we say that \(v\in V\) is a weight vector if
for an n-tuple \(\rho =(\rho _1, \rho _2, \dots , \rho _n)\in ({\mathbb {C}}^\times )^n\) called the weight of v. Treating \({\mathbb {C}}^\times ={\mathbb {C}}\backslash \{0\}\) as an abelian group, we get an abelian group structure on the set of such weights: \(\rho \tau =(\rho _1\tau _1, \dots , \rho _n\tau _n)\). We define an abelian group map \(q^{\frac{\bullet }{2}}:P\rightarrow ({\mathbb {C}}^\times )^n\) by setting \(q^{\frac{\mu }{2}}:=(q^{\frac{1}{2} (\mu ,\alpha _1)}, \dots , q^{\frac{1}{2} (\mu ,\alpha _n)})\). Since we have assumed that \(q^{\frac{1}{2}}\) is transcendental this map is injective.
We call a \(U_q(\mathfrak {sl}_{n+1})\)-module V a weight module if
The spaces \(V_\rho \) are called the weight spaces of V, and \(\rho \) is called a weight of V if \(V_\rho \ne 0\). We say that an \(U_q(\mathfrak {sl}_{n+1})\)-module V is in the category \({\mathcal {O}}\) if
-
V is a weight module and all its weight spaces are finite-dimensional;
-
All weights of V are contained in \(\bigcup _{i=1}^{m}\{\rho _i q^{-\frac{\mu }{2}}\mid \mu \in Q^+\}\) for some weights \(\rho _1, \dots , \rho _m\in ({\mathbb {C}}^{\times })^n\).
Similarly to the classical situation, for each weight \(\rho \) there exists a unique irreducible \(U_q(\mathfrak {sl}_{n+1})\)-module \(V(\rho )\in {\mathcal {O}}\) with the highest weight \(\rho \).
To extend the formalism above to \(U_q'({\widehat{\mathfrak {sl}}}_{n+1})\) we assume for convenience that c always acts by 1; one can check that this assumption is not restrictive as long as we work with simple \(U_q'({\widehat{\mathfrak {sl}}}_{n+1})\)-modules in \({\mathcal {O}}\), cf. [45, Proposition 3.2]. For a \(U'_q({\widehat{\mathfrak {sl}}}_{n+1})\)-module V we call \(v\in V\) an l-weight vector if
where \(\gamma =(\gamma _{i,m}^\pm )_{i\in I, m\in \pm {\mathbb {Z}}_{\ge 0}}\) is a collection of complex numbers satisfying \(\gamma _{i,0}^+\gamma _{i,0}^-=1\) for every \(i\in I\). Such collections of numbers \(\gamma \) are called l-weights.Footnote 5 We say that an l-weight \(\gamma \) is rational if for some rational functions \((f_1(u), \dots , f_n(u))\) the expansions of \(f_i(u)\) around 0 and \(\infty \) are
In this situation we denote this l-weight by \({\varvec{f}}=(f_1(u), \dots , f_n(u))\). Note that rational functions \(f_i(u)\) define a rational l-weight as long as \(f_i(u)\) are regular at \(0,\infty \) and \(f_i(0)f_i(\infty )=1\). For rational l-weights \(\varvec{f}, \varvec{g}\) set \(\varvec{fg}:=(f_1(u)g_1(u), \dots , f_n(u)g_n(u))\).
We call an l-weight vector v singular if \(x^+_{i,m}.v=0\) for any i, m. An \(U'_q({\widehat{\mathfrak {sl}}}_{n+1})\)-module V is called a highest l-weight module when \(V=U'_q({\widehat{\mathfrak {sl}}}_{n+1}).v={\widehat{U}}^-.v\) for a singular l-weight vector v; the l-weight of v is called the highest l-weight of V. The following statement summarizes the information about irreducible highest l-weight modules with rational highest l-weights:
Proposition 2.2
([45]) Let \(\varvec{f}, \varvec{g}\) be rational l-weights.
-
1.
There exists a unique up to isomorphism irreducible representation \(L(\varvec{f})\) of \(U_q'({\widehat{\mathfrak {sl}}}_{n+1})/(c-1)\) with the highest l-weight \({\varvec{f}}\).
-
2.
\(L(\varvec{f})\in {\mathcal {O}}\) as a \(U_q(\mathfrak {sl}_{n+1})\)-module.
-
3.
If \(L(\varvec{f})\otimes L(\varvec{g})\) is an irreducible \(U_q'({\widehat{\mathfrak {sl}}}_{n+1})\)-module then
$$\begin{aligned} L(\varvec{f})\otimes L(\varvec{g})\cong L(\varvec{fg})\cong L(\varvec{g})\otimes L(\varvec{f}). \end{aligned}$$
2.3 The representations \(V(s)_z\)
For a pair of complex numbers \(s,z\in {\mathbb {C}}^\times \) let \(V(s)_z\) denote the \(U_q'({\widehat{\mathfrak {sl}}}_{n+1})\)-module \(L(\varvec{f})\) with \(f_1(u)=s^{-1}\frac{1-szu}{1-s^{-1}zu}\) and \(f_r(u)=1\) for \(r>1\). For \(a,b\in {\mathbb {C}}^\times \) we also set \(V^a_b:=V(a/b)_{ab}\). These representations can be explicitly described, and to do it we use the following notation. By a composition \(\varvec{I}\) we mean an n-tuple of nonnegative integers \((I_1, \dots , I_n)\in {\mathbb {Z}}_{\ge 0}^n\) and for any composition \(\varvec{I}\) we set \(|\varvec{I}|=I_1+I_2+\dots +I_n\). Define \(\varvec{e}^i=(e_1^i, e_2^i,\dots , e_n^i)\) as the composition with \(e^i_j=\delta _{i,j}\), and set \(\varvec{0}:=(0, \dots , 0)\).
Proposition 2.3
The representation \(V(s)_z\) is infinite-dimensional and has basis \(\{v_{\varvec{I}}\}_{\varvec{I}\in {\mathbb {Z}}^n_{\ge 0}}\) if \(s\ne \pm q^{-\frac{m}{2}}\) for any \(m\in {\mathbb {Z}}_{\ge 0}\), and is finite-dimensional with basis \(\{v_{\varvec{I}}\}_{\varvec{I}\in {\mathbb {Z}}^n_{\ge 0}:|\varvec{I}|\le m}\) if \(s=\pm q^{-\frac{m}{2}}\) for \(m\in {\mathbb {Z}}_{\ge 0}\). The action of \(U'_q({\widehat{\mathfrak {sl}}}_{n+1})\) is given explicitly by
where \(r=2, \dots , n\). Moreover, viewed as a \(U_q(\mathfrak {sl}_{n+1})\)-module, \(V(s)_z\) is irreducible with the highest weight \((s^{-1}, 1, \dots , 1)\).
For the finite-dimensional case the claim can be deduced from [30, Proposition 5.1], and the infinite-dimensional situation can be reached using analytic continuation. Below we provide another way to verify Proposition 2.3.
Proof
We first check that (2.5) gives a well-defined \(U'_q({\widehat{\mathfrak {sl}}}_{n+1})\)-module, and then verify that the resulting module is indeed \(V(s)_z\).
Let \(V'\) be a vector space with a basis \(\{v_{\varvec{I}}\}_{\varvec{I}\in {\mathbb {Z}}^n_{\ge 0}}\), and consider \(V'\) as a \(U_q'({\widehat{\mathfrak {sl}}}_{n+1})\)-module with the action defined by (2.5). To check that this action is well-defined we need to verify the defining relations (2.1), which can be done by a direct computation. For instance,
Other relations (including the Serre relations for \(U'_q({\widehat{\mathfrak {sl}}}_2)\), when \({\widehat{C}}_{0,1}=-2\)) are either trivial or similar to the two above, so we omit their verification.
Note that when \(s=\pm q^{-\frac{m}{2}}\) for \(m\in {\mathbb {Z}}_{\ge 0}\) the module \(V'\) defined above has a submodule spanned by \(\{v_{\varvec{I}}\}_{\varvec{I}\in {\mathbb {Z}}^n_{\ge 0}:|\varvec{I}|\le m}\). Indeed, the only generators \(x_i^{\pm }\) which send \(v_{\varvec{I}}\) to \(v_{\varvec{J}}\) with \(|\varvec{J}|>|\varvec{I}|\) are \(x_0^+\) and \(x_1^-\), and we have \(x_0^+v_{\varvec{I}}=0, x_1^-v_{\varvec{I}}=0\) when \(s=\pm q^{-\frac{m}{2}}\) and \(|\varvec{I}|=m\). In this situation, let \(V''\subset V'\) denote this submodule spanned by \(\{v_{\varvec{I}}\}_{\varvec{I}\in {\mathbb {Z}}^n_{\ge 0}:|\varvec{I}|\le m}\), otherwise, when \(s\ne \pm q^{-\frac{m}{2}}\) for any \(m\in {\mathbb {Z}}_{\ge 0}\), set \(V''=V'\). Looking at the action of \(x_i^{\pm }\) for \(i=1, \dots , n\), one can see that \(V''=U_q(\mathfrak {sl}_{n+1}).v_{\varvec{0}}\) and \(v_{\varvec{0}}\) is the only \(U_q(\mathfrak {sl}_{n+1})\)-singular vector of \(V''\), hence \(V''\) is an irreducible \(U_q(\mathfrak {sl}_{n+1})\)-module with the highest weight \((s^{-1},1,\dots ,1)\).
To finish the proof we only need to show that \(V''\cong V(s)_z\). To do it we refer to [45, Proposition 5.5], which claims that a \(U_q'({\widehat{\mathfrak {sl}}}_{n+1})\)-module which is irreducible as a \(U_q(\mathfrak {sl}_{n+1})\)-module and has the highest \(U_q(\mathfrak {sl}_{n+1})\)-weight \((s^{-1}, 1, \dots , 1)\) is isomorphic to \(L(\varvec{g})\) with \(g_1(u)=s^{-1}\frac{1-sz'u}{1-s^{-1}z'u}\), \(g_2(u)=g_3(u)=\dots =1\) for some \(z'\in {\mathbb {C}}\backslash \{0\}\).Footnote 6 So \(V'' \cong L(\varvec{g})\), and we just need to show that \(z=z'\). To do it we can compute the action of \(x^-_{1,1}\) on \(v_{\varvec{0}}\): Note that for any \(A\in U_q'({\widehat{\mathfrak {sl}}}_{n+1})\) such that \([A,x_i^+]=0\), \(k_iAk_i^{-1}=q^{\frac{1}{2}}A\) we have \([x_i^+, [x_i^-, A]_{q^{-\frac{1}{2}}}k_i^{-1}k_{i-1}^{-1}]_{q^{\frac{1}{2}}}=Ak_{i-1}^{-1}\). Using this relation as an inductive step, we see from (2.3) that
In particular,
so \(\phi ^+_{1,1}.v_{\varvec{0}}=(q^{\frac{1}{2}}-q^{-\frac{1}{2}})[x_1^+, x_{1,1}^-].v_{\varvec{0}}=z(s^{-2}-1).v_{\varvec{0}}\). On the other hand, since \(g_1(u)=s^{-1}+z'(s^{-2}-1)u+{{\overline{o}}}(u)\), we have \(\phi ^+_{1,1}.v_{\varvec{0}}=z'(s^{-2}-1).v_{\varvec{0}}\). Hence \(z=z'\) and \(V''\cong V(s)_z\). \(\square \)
Remark 2.4
The representation \(V(s)_z\) can also be viewed as an evaluation module induced from an analytically-continued symmetric tensor power of the standard representation of \(U_q(\mathfrak {sl}_{n+1})\). More rigorously, to obtain \(V(s)_z\) one can consider the extension \(U_q(\mathfrak {sl}_{n+1})\subset U_q(\mathfrak {gl}_{n+1})\), define a \(U_q(\mathfrak {gl}_{n+1})\)-module V(s) and then pull it along the evaluation map \(\mathrm {ev_z}:U'_q({\widehat{\mathfrak {sl}}}_{n+1})\rightarrow U_q(\mathfrak {gl}_{n+1})\) introduced in [20].
The following fact will be crucial in the following section.
Proposition 2.5
The \(U_q'({\widehat{\mathfrak {sl}}}_{n+1})\)-module \(V(s_1)_{z_1}\otimes V(s_2)_{z_2}\otimes \dots \otimes V(s_L)_{z_L}\) is irreducible for a fixed \(L\in {\mathbb {Z}}_{\ge 1}\) and generic complex parameters \(z_1, \dots , z_L, s_1, \dots , s_L\).
Proof
We refer to a much stronger result [12, Theorem 4] for the case when all \(V(s_i)_{z_i}\) are finite-dimensional, that is, when \(s_i\in q^{-\frac{1}{2}{\mathbb {Z}}_{\ge 0}}\) for each i. \(\square \)
Theorem 2.6
([12, Theorem 4]) Let \(m_1, \dots , m_L\in {\mathbb {Z}}_{\ge 0}, z_1,\dots , z_L\in {\mathbb {C}}\backslash \{0\}\). If for any pair i, j such that \(i\ne j\) we have \(z_i/z_j\not \in q^{\frac{1}{2} {\mathbb {Z}}}\), then \(V(q^{-\frac{m_1}{2}})_{z_1}\otimes V(q^{-\frac{m_2}{2}})_{z_2}\otimes \dots \otimes V(q^{-\frac{m_L}{2}})_{z_L}\) is a highest l-weight module.
Below we explain how [12, Theorem 4] implies Proposition 2.5 by first explaining the notation from [12], then showing irreducibility of \(V(q^{-\frac{m_1}{2}})_{z_1}\otimes \dots \otimes V(q^{-\frac{m_L}{2}})_{z_L}\) for generic \(z_1, \dots , z_L\) and then lifting the restriction \(s_i\in q^{-\frac{1}{2}{{\mathbb {Z}}_{\ge 0}}}\).
We start with matching the notation used in [12] with our notation:
-
Our \(q^{\frac{1}{2}}\) corresponds to q in [12], to avoid confusion we keep using our \(q^{\frac{1}{2}}\) even when describing notation from [12].
-
In [12] the irreducible highest l-weight \(U'_q({\widehat{\mathfrak {sl}}}_{n+1})\)-modules of finite dimension are parametrized by n-tuples \(\varvec{\pi }=(\pi _1(u), \dots , \pi _n(u))\) of polynomials \(\pi _i(u)\), instead of rational l-weights \(\varvec{f}\) like in our work. More precisely, the functions \(\pi _i(u)\) corresponding to the representation \(V({\varvec{f}})\) are given by
$$\begin{aligned} \pi _i(u)={\left\{ \begin{array}{ll} {\prod }_{j\ge 1} \frac{f_i(q^{j}u)}{f_i(0)}, \quad |q|<1,\\ {\prod }_{j\ge 0} \frac{f_i(0)}{f_i(q^{-j}u)},\quad |q|>1, \end{array}\right. } \end{aligned}$$and it turns out that for the finite-dimensional representation \(V(q^{-\frac{m}{2}})_z\) these functions \(\pi _i\) are polynomials (regardless of conditions \(|q|>1\) or \(|q|<1\)):
$$\begin{aligned} \pi _1(u)=\prod _{i=1}^m(1-q^{\frac{m}{2}-i+1}zu), \qquad \pi _r(u)=1, \quad r\ge 2. \end{aligned}$$The n-tuple of polynomials \((\pi _1(u), \dots , \pi _n(u))\) above is denoted by \(\varvec{\pi }^1_{m,zq^{1/2}}\) in [12], and our representations \(V(q^{-\frac{m}{2}})_z\) correspond to \(V(\varvec{\pi }^1_{m,zq^{1/2}})\) in [12].
-
In [12] the results are proved over \({\mathbb {C}}(q^{\frac{1}{2}})\), while here we fix \(q^{\frac{1}{2}}\) as a transcendental complex number. Since we work generically and the action of \(U_q'({\widehat{\mathfrak {sl}}}_{n+1})\) can be expressed in terms of matrices over \({\mathbb {Q}}(q^{\frac{1}{2}}, z_1, \dots , z_m)\) this difference is irrelevant in view of Lemma 2.8 below, we will return to this later when discussing the transition from \(s_i=q^{-\frac{m_i}{2}}\) to arbitrary \(s_i\).
So, from [12, Theorem 4] we know that for any fixed \(m_1, \dots , m_L\in {\mathbb {Z}}_{\ge 1}\) and generic \(z_1, \dots , z_L\) the \(U'_q({\widehat{\mathfrak {sl}}}_{n+1})\)-module \(V:=V(q^{-\frac{m_1}{2}})_{z_1}\otimes \dots \otimes V(q^{-\frac{m_L}{2}})_{z_L}\) is highest l-weight. Since the vector \(v:=v_{\varvec{0}}\otimes \dots v_{\varvec{0}}\) is the unique up to a scalar vector with the maximal \(U_q(\mathfrak {sl}_{n+1})\)-weight \(\rho =(q^{\frac{1}{2}\sum _{i}m_i},1\dots , 1)\), we have \(x^+_{i,m}.v=0\) and v is the highest l-weight vector. In other words, [12, Theorem 4] can be rephrased as \(U'_q({\widehat{\mathfrak {sl}}}_{n+1}). v=V\).
To show irreducibility of V we consider \((V^*)^{{{\widehat{\omega }}}}\), that is, we consider the dual representation pulled back along the involution \({\widehat{\omega }}\). For convenience, we identify the underlying vector space of \((V^*)^{{{\widehat{\omega }}}}\) with \(V^*\). We use the following fact from [14].
Lemma 2.7
([14, Proposition 5.1]) There exists a fixed constant c such that for any m, z we have \(((V(q^{-\frac{m}{2}})_z)^*)^{{{\widehat{\omega }}}}\cong V(q^{-\frac{m}{2}})_{cz^{-1}}\).
Hence we have
and so \((V^*)^{{{\widehat{\omega }}}}\) is also a highest l-weight representation for generic \(z_1, \dots , z_L\). By comparing \(U_q(\mathfrak {sl}_{n+1})\)-weights note that the highest l-vector \(v^*\) of \((V^*)^{{{\widehat{\omega }}}}\) is dual to v in the sense that \(v^*\) vanishes on all weight spaces of V other than \({\mathbb {C}} v\). Now assume that \(W\subset V\) is a \(U_q'({\widehat{\mathfrak {sl}}}_{n+1})\)-submodule. Then W should also have a decomposition into \(U_q(\mathfrak {sl}_{n+1})\)-weight spaces, and hence either \(v\in W\) or \(v^*\in W^{\perp }\), where \(W^{\perp }\) is the annihilator of W in \(V^*\). In the first case we have \(U'_q({\widehat{\mathfrak {sl}}}_{n+1}).v=V\subset W\), while in the second case we get
implying \(W=0\). So \(V(q^{-\frac{m_1}{2}})_{z_1}\otimes \dots \otimes V(q^{-\frac{m_L}{2}})_{z_L}\) is irreducible for generic \(z_1, \dots , z_L\).
To establish Proposition 2.5 we now need to go back from \(s_i=q^{-\frac{m_i}{2}}\) to generic \(s_i\), which can be done using the following elementary linear algebra fact:
Lemma 2.8
Fix \(d_1,d_2\in {\mathbb {Z}}_{\ge 1}\) and let \(X_{i,j}^{(r)}\) be formal variables enumerated by \(i,j,r\in {\mathbb {Z}}_{\ge 1}\) such that \(i\le d_1, j\le d_2\). There exist countable families of polynomials \(F_p\) and \(G_t\) over \({\mathbb {Z}}\) in variables \(X_{i,j}^{(r)}\) with the following property:
Assume we are given finite-dimensional vector spaces V, W over \({\mathcal {F}}\) with dimensions \(\dim (W)=d_1, \dim (V)=d_2\), and a countable family of linear operators \(A^{(r)}:W\rightarrow V\). Fix bases of V, W and let \(A^{(r)}_{i,j}\) be the matrix coefficients of \(A^{(r)}\) with respect to those bases. Then \(\bigcup _{r} {\textrm{Im}}\, A^{(r)}=V\) if and only if \(F_p(A^{(r)}_{i,j})\ne 0\) for some p, and \(\bigcap _{r} \textrm{Ker}\, A^{(r)}=0\) if and only if \(G_t(A^{(r)}_{i,j})\ne 0\) for some t. Here \(F_p(A^{(r)}_{i,j}), G_t(A^{(r)}_{i,j})\) denote the results of substitution \(X^{(r)}_{i,j}=A^{(r)}_{i,j}\) into \(F_p\) and \(G_t\). \(\square \)
We apply Lemma 2.8 in the following way. Let, as before, \(V=V(s_1)_{z_1}\otimes \dots \otimes V(s_L)_{z_L}\), \(v=v_{\varvec{0}}\otimes \dots \otimes v_{\varvec{0}}\) and \(\rho =(\prod _{i}s_i^{-1},1\dots , 1)\). Note that, when viewed as \(U_q(\mathfrak {sl}_{n+1})\)-module, all weights of V are of the form \(\rho q^{-\frac{1}{2}\mu }\) for \(\mu \in Q^+\) and the weight spaces can be explicitly described: if \(\mu =K_1\alpha _1+\dots +K_n\alpha _n\) for a composition \(\varvec{K}\in {\mathbb {Z}}_{\ge 0}^n\) then \(V_{\rho q^{-\frac{1}{2}\mu }}\) is spanned by \(v_{\varvec{I}_1}\otimes \dots \otimes v_{\varvec{I}_L}\) such that \(\varvec{I}_1+\dots +\varvec{I}_L=\varvec{K}\) and \(v_{\varvec{I}_i}\in V(s_i)_{z_i}\). The last condition is only relevant when \(s_i=\pm q^{-\frac{m}{2}},m\in {\mathbb {Z}}\), when we require \(|\varvec{I}_i|\le m\). In particular, as long as \(s_i\notin \{\pm 1, \pm q^{-\frac{1}{2}}, \dots , \pm q^{-\frac{|\varvec{K}|-1}{2}}\}\) for each i, we can identify the vector space \(V_{\rho q^{-\frac{1}{2}\mu }}\) for arbitrary \(s_1, \dots , s_L, z_1, \dots , z_L\) with the vector space over \({\mathbb {C}}\) generated by basis vectors \(\{|\varvec{I}_1, \dots , \varvec{I}_L\rangle \}_{\varvec{I}_1+\dots +\varvec{I}_L=\varvec{K}}\), we denote the latter by \(V^{gen}_{\rho q^{-\frac{1}{2}\mu }}\).
Note that V is irreducible if and only if for any \(\mu \in Q^+\) we have \(V_{\rho q^{-\frac{1}{2}\mu }}\subset U'_q({\widehat{\mathfrak {sl}}}_{n+1}).v\) and \(v\in U'_q({\widehat{\mathfrak {sl}}}_{n+1}).v'\) for any \(v'\in V_{\rho q^{-\frac{1}{2}\mu }}\backslash \{0\}\). It is enough to show that for a fixed \(\mu \in Q^+\) both these conditions hold for generic \(s_1,\dots , s_L, z_1, \dots , z_L\). We start with \(V_{\rho q^{-\frac{1}{2}\mu }}\subset U'_q({\widehat{\mathfrak {sl}}}_{n+1}).v\). Recall that we have a Q-grading on \(U_q'({\widehat{\mathfrak {sl}}}_{n+1})\), let \({\mathcal {G}}_{\mu }\) be the set of words in \(k^{\pm 1}_i, x_i^{\pm }, i\in {\widehat{I}}\) whose total Q-degree is \(\mu \). Using \(U'_q({\widehat{\mathfrak {sl}}}_{n+1})\)-action from Proposition 2.3, consider each word \(w\in {\mathcal {G}}_{-\mu }\) as an operator \(A^{(w)}:{\mathbb {C}} v\rightarrow V_{\rho q^{-\frac{1}{2}\mu }}\). Recall that, for a fixed \(K\in {\mathbb {Z}}_{\ge 0}\) depending only on \(\mu \), the vector spaces \(V_{\rho q^{-\frac{1}{2}\mu }}\) for arbitrary \(s_1, \dots , s_L, z_1, \dots , z_L\) can be identified with \(V_{\rho q^{-\frac{1}{2}\mu }}^{gen}\), as long as \(s_i\notin \{\pm 1, \pm q^{-\frac{1}{2}}, \dots , \pm q^{-\frac{K-1}{2}}\}\). Moreover, all matrix coefficients of \(A^{(w)}\) with respect to \(v_{\varvec{I}_1}\otimes \dots \otimes v_{\varvec{I}_l}\) are polynomials in \(s_1, \dots , s_L, z_1,\dots , z_L\), with coefficients in \({\mathbb {Q}}(q^{\frac{1}{2}})\). Since \(V_{\rho q^{-\frac{1}{2}\mu }}\subset U'_q({\widehat{\mathfrak {sl}}}_{n+1}).v\) is equivalent to \(\bigcup _{w}{\textrm{Im}}\, A^{(w)}=V_{\rho q^{-\frac{1}{2}\mu }}\), Lemma 2.8 gives a family of polynomials \(F_p(s_1, \dots , s_L, z_1, \dots , z_L)\) with coefficients from \({\mathbb {Q}}(q^{\frac{1}{2}})\), such that \(V_{\rho q^{-\frac{1}{2}\mu }}\subset U'_q({\widehat{\mathfrak {sl}}}_{n+1}).v\) if and only if \(F_p(s_1, \dots , s_L, z_1, \dots , z_L)\ne 0\) for some p, still assuming \(s_i\notin \{\pm 1, \pm q^{-\frac{1}{2}}, \dots , \pm q^{-\frac{K-1}{2}}\}\). Since we work generically over \(s_i, z_i\), it is enough to show that \(F_p\ne 0\) as a polynomial in \(s_i, z_i\) for at least one p. But from the finite-dimensional case we know that \(V_{\rho q^{-\frac{1}{2}\mu }}\subset U'_q({\widehat{\mathfrak {sl}}}_{n+1}).v\) for generic \(z_1, \dots , z_L\) and \(s_i\in q^{-\frac{1}{2}{\mathbb {Z}}_{\ge 0}}\). Hence, for any \(m_i\ge K\) and generic \(z_i\), we have \(F_p(q^{-\frac{m_1}{2}}, \dots , q^{-\frac{m_L}{2}}, z_1, \dots , z_L)\ne 0\) for some p, implying that \(F_p\ne 0\) as a polynomial in \(s_i, z_i\) for some p. Hence \(V_{\rho q^{-\frac{1}{2}\mu }}\subset U'_q({\widehat{\mathfrak {sl}}}_{n+1}).v\) for generic \(s_i, z_i\). Note in particular, that we have used Lemma 2.8 to rewrite the condition \(V_{\rho q^{-\frac{1}{2}\mu }}\subset U'_q({\widehat{\mathfrak {sl}}}_{n+1}).v\) in terms of vanishing of certain polynomials with coefficients being rational functions in \(q^{\frac{1}{2}}\), this readily implies that we can equivalently consider \(q^{\frac{1}{2}}\) as a transcendental complex number or as a formal variable, making this difference with [12] irrelevant.
To show that generically \(v\in U'_q({\widehat{\mathfrak {sl}}}_{n+1}).v'\) for fixed \(\mu \) and any \(v'\in V_{\rho q^{-\frac{1}{2}\mu }}\backslash \{0\}\) we use the other half of Lemma 2.8. Namely, considering now the words \(w\in {\mathcal {G}}_{\mu }\) as operators \(A^{(w)}:V_{\rho q^{-\frac{1}{2}\mu }}\rightarrow {\mathbb {C}} v\), we have \(v\in U'_q({\widehat{\mathfrak {sl}}}_{n+1}).v'\) for any \(v'\in V_{\rho q^{-\frac{1}{2}\mu }}\backslash \{0\}\) if and only if \(\bigcap _w \textrm{Ker} A^{(w)}=0\). By Lemma 2.8, there exist polynomials \(G_t(s_1, \dots , s_L, z_1, \dots , z_L)\) such that the last condition is equivalent to existence of t such that \(G_t(s_1, \dots , s_L, z_1, \dots , z_L)\ne 0\). Irreducibility of V in finite-dimensional situation implies that at least one polynomial \(G_t\) is nonzero, hence we have \(v\in U'_q({\widehat{\mathfrak {sl}}}_{n+1}).v'\) for any \(v'\in V_{\rho q^{-\frac{1}{2}\mu }}\backslash \{0\}\) generically. \(\square \)
3 Explicit expressions for isomorphisms between tensor products
In this section we present explicit expressions for isomorphisms between representations of the form \(V(s_1)_{z_1}\otimes V(s_2)_{z_2}\). The importance of these expressions is two-fold: on one hand we get explicit expressions for the R-matrix \(V(s_1)_{z_1}\otimes V(s_2)_{z_2}\rightarrow V(s_2)_{z_2}\otimes V(s_1)_{z_1}\), reproducing a result of [5]. On the other hand, using the generic irreducibility of arbitrary tensor products of the form \(V(s_1)_{z_1}\otimes \dots \otimes V(z_L)_{z_L}\), we can use our explicit expressions to explain and generalize the inhomogeneous Yang–Baxter equations found in [4] and used to study the q-Hahn vertex model.
To simplify expressions in this section we use the following notation. Recall that for a pair of parameters \(a,b\in {\mathbb {C}}\backslash \{0\}\) we set \(V^a_b:=V(a/b)_{ab}\). For compositions \(\varvec{X}, \varvec{Y}\) and complex parameters a, b define
Note that, while we usually assume that the compositions are positive, the expression \(\Phi (\varvec{X},\varvec{Y}; a,b)\) makes sense as long as for each i at least one of \(X_i,Y_i\) is positive, while the other might be negative, see Sect. 1.5. Alternative expressions there imply that, if \(X_i+Y_i\ge 0\) for every i, then \(\Phi (\varvec{X},\varvec{Y}; a,b)=0\) unless \(X_i\ge 0\) and \(Y_i\ge 0\) for every i. For parameters \(a_1, a_2, a_3, b_1,b_2, b_3\) and compositions \(\varvec{I},\varvec{J}\) we set
In view of Proposition 2.3 we have a basis of \(V(s_1)_{z_1}\otimes \dots \otimes V(s_L)_{z_L}\) given by \(|v_{\varvec{I}_1}\otimes v_{\varvec{I}_2}\otimes \dots \otimes v_{\varvec{I}_L}\rangle \), let \(\langle v_{\varvec{I}_1}\otimes v_{\varvec{I}_2}\otimes \dots \otimes v_{\varvec{I}_L}|\) denote the dual vectors in \((V(s_1)_{z_1}\otimes \dots \otimes V(s_L)_{z_L})^*\).
3.1 Explicit expressions
We start by describing isomorphisms between \(V^{a_1}_{b_1}\otimes V^{a_2}_{b_2}\), \(V^{a_2}_{b_1}\otimes V^{a_1}_{b_2}\), \(V^{a_1}_{b_2}\otimes V^{a_2}_{b_1}\) and \(V^{a_2}_{b_2}\otimes V^{a_1}_{b_1}\) for generic \(a_1,a_2,b_1,b_2\). Note that all four representations are irreducible by Propositions 2.5 and 2.2 all of them are isomorphic to \(L(\varvec{f})\) where \(f_1(u)=\frac{(a_1^{-1}-a_1u)(a_2^{-1}-a_2u)}{(b_1^{-1}-b_1u)(b_2^{-1}-b_2u)}\) and \(f_r(u)=1\) for \(r\ge 2\).
Theorem 3.1
For generic parameters \(a_1,b_1, a_2, b_2\) the representations \(V^{a_1}_{b_1}\otimes V^{a_2}_{b_2}\) and \(V^{a_2}_{b_1}\otimes V^{a_1}_{b_2}\) are irreducible and isomorphic, with an isomorphism \(W^a\) given explicitly by
In particular, \(\langle v_{\varvec{K}}\otimes v_{\varvec{L}}| W^a|v_{\varvec{I}}\otimes v_{\varvec{J}}\rangle =0\) unless \(\varvec{I}\ge \varvec{K}\) coordinate-wise.
Similarly, for generic parameters \(a_1,b_1, a_2, b_2\) the representations \(V^{a_1}_{b_1}\otimes V^{a_2}_{b_2}\) and \(V^{a_1}_{b_2}\otimes V^{a_2}_{b_1}\) are irreducible and isomorphic, with an isomorphism \(W^b\) given explicitly by
with the expression vanishing unless \(\varvec{J}\ge \varvec{L}\).
Proof
By the discussion above it is enough to check that the morphisms \(W^a\) and \(W^b\) presented above commute with the action of \(U_q'({\widehat{\mathfrak {sl}}}_{n+1})\). We do it by verifying the statement on the generators \(\{k_i^{\pm 1}\}\), \(\{x_i^{\pm }\}\) of \(U'_q({\widehat{\mathfrak {sl}}}_{n+1})\). For \(k_i\) the check is trivial since \(W^a\) and \(W^b\) clearly preserve the weight spaces. The checks for \(x_i^\pm \) are similar to each other and can be readily done by a straightforward but tedious computation. Here we only provide the verification that \(x_1^+\) commutes with the action of \(W^a\).
We need to check that \(\langle v_{\varvec{K}}\otimes v_{\varvec{L}}| W^ax_1^+|v_{\varvec{I}}\otimes v_{\varvec{J}}\rangle =\langle v_{\varvec{K}}\otimes v_{\varvec{L}}|x_1^+W^a|v_{\varvec{I}}\otimes v_{\varvec{J}}\rangle \) for any compositions \(\varvec{I},\varvec{J},\varvec{K},\varvec{L}\). Since \(W^a\) preserves the weight and \(x_1^+\) increases it by \(q^{\frac{1}{2}\alpha _1}\), both sides are zero unless \(\varvec{I}+\varvec{J}-\varvec{e}^1=\varvec{K}+\varvec{L}\). Assuming \(\varvec{I}+\varvec{J}-\varvec{e}^1=\varvec{K}+\varvec{L}\) from now on, note that
where the first equality describes the left action on \(V^{a_1}_{b_1}\otimes V^{a_2}_{b_2}\) with the assumption that \(|v_{\varvec{I}'}\otimes v_{\varvec{J}'}\rangle =0\) when the configurations \(\varvec{I}', \varvec{J}'\) are non-positive, while the second describes the right action on \((V^{a_2}_{b_1}\otimes V^{a_1}_{b_2})^*\). So we need to verify that
Plugging the expression for \(W^a\), cancelling terms, using that \(\varvec{I}+\varvec{J}-\varvec{e}^1=\varvec{K}+\varvec{L}\) and
we can rewrite the needed identity as
Using
we finally reduce the verification to
The latter can be verified by standard algebraic manipulations, using \(\varvec{I}+\varvec{J}-\varvec{e}^1=\varvec{K}+\varvec{L}\). Finally we note that the degenerate cases when \(I_1=0\) or \(I_1=K_1\) are automatically handled by our convention on q-Pochhammer symbols from Sect. 1.5. \(\square \)
Proposition 3.2
For generic parameters \(a_1,b_1, a_2, b_2\) there is an isomorphism
which is given in the following two equivalent ways:
where the sum is over configurations \(\varvec{P}\) such that \(P_i\le \min (J_i,L_i)\);
where the sum is over configurations \(\varvec{P}\) such that \(P_i\le \min (I_i, K_i)\).
Proof
For generic \(a_1,a_2,b_1,b_2\) the representations \(V^{a_1}_{b_1}\otimes V^{a_2}_{b_2}\) and \(V^{a_2}_{b_2}\otimes V^{a_1}_{b_1}\) are irreducible and have one-dimensional highest weight space, so there exists, up to a scalar, at most one isomorphism R between them.
The claim now follows from Proposition 3.1, which allows to construct isomorphism R as above in two ways corresponding to (3.1) and (3.2):
Note that both morphisms above send \(v_{\varvec{0}}\otimes v_{\varvec{0}}\) to \(v_{\varvec{0}}\otimes v_{\varvec{0}}\), so these two isomorphisms coincide. \(\square \)
Remark 3.3
Proposition 3.2 provides an expression for the action of the R-matrix of \(U'_q({\widehat{\mathfrak {sl}}}_{n+1})\) on \(V(s)_z\otimes V(s')_{z'}\). In the case of \(U'_q({\widehat{\mathfrak {sl}}}_2)\) this expression was obtained from various approaches in the works [25, 28], see also [2, 16] for analogous expressions in the more general elliptic case. For \(U_q({\widehat{\mathfrak {sl}}}_{n+1})\) the expression of Proposition 3.2 was first obtained in [5] using the methods of three-dimensional integrability.
3.2 Triple tensor products and inhomogeneous Yang–Baxter equations
By Propositions 2.2 and 2.5 generically we have \(V^{a_1}_{b_1}\otimes \dots \otimes V^{a_m}_{b_m}\cong L(\varvec{f})\) with \(f_1(u)=\prod _i\frac{a_i^{-1}-a_iu}{b_i^{-1}-b_iu}\) and \(f_r(u)=1\) for \(r>1\). Hence, for two collections of generic parameters \(a_1\dots a_m, b_1,\dots , b_m\) and \({{\tilde{a}}}_1\dots {{\tilde{a}}}_m, {{\tilde{b}}}_1,\dots , {{\tilde{b}}}_m\) the representations \(V^{a_1}_{b_1}\otimes \dots \otimes V^{a_m}_{b_m}\) and \(V^{{{\tilde{a}}}_1}_{{{\tilde{b}}}_1}\otimes \dots \otimes V^{{{\tilde{a}}}_m}_{{{\tilde{b}}}_m}\) are isomorphic if and only if, up to sign changes, \({{\tilde{a}}}\) is a permutation of a, \({{\tilde{b}}}\) is a permutation of b, and the total number of sign changes is even. Proposition 3.1 allows to explicitly construct all such isomorphisms, since \(W^a\) (\(W^b\)) is the isomorphism corresponding to a simple transposition of the parameters \(a_i\) (respectively \(b_i\)), while the sign changes are trivial since \(V^a_b=V(a/b)_{ab}=V^{-a}_{-b}\) and we have
where both isomorphisms above are constructed using \(W^a\).
More importantly, since the representations are irreducible, the isomorphism \(V^{a_1}_{b_1}\otimes \dots \otimes V^{a_m}_{b_m}\cong V^{{{\tilde{a}}}_1}_{{{\tilde{b}}}_1}\otimes \dots \otimes V^{{{\tilde{a}}}_m}_{{{\tilde{b}}}_m}\) is unique up to a scalar. Hence, if we have multiple ways of expressing the same isomorphism using \(W^a\), \(W^b\) and R, we obtain nontrivial equations, including the remarkable Yang–Baxter equations. Below we demonstrate this idea for triple tensor products, deriving two equalities which will be used in Sect. 4.
For later use we summarize the expressions form Propositions 3.1 to 3.2, setting:
We do not include \(C_{a_1,b_1;a_2,b_2}(\varvec{I},\varvec{J})\) from Propositions 3.1 to 3.2 in the expressions above, in this way right-hand sides of (3.3)-(3.5) are rational functions in q, not just in \(q^{\frac{1}{2}}\). Note also that we have replaced \(a_i^2, b_i^2\) from Propositions 3.1 to 3.2 by \(a_i, b_i\).
Remark 3.4
For the later use in Sects. 4 and 5 we note that \(\Phi (\varvec{X},\varvec{Y}, x/y, y/z)\) is a polynomial in y, and \(y^{-|\varvec{X}|-|\varvec{Y}|}\Phi (\varvec{X},\varvec{Y}, x/y, y/z)\) is a polynomial in \(y^{-1}\). Hence \(W^a_{a_1,a_2,b_1}(\varvec{I},\varvec{J},\varvec{K},\varvec{L})\) is a polynomial in \(a_2\), \(W^b_{a_2,b_1,b_2}(\varvec{I},\varvec{J},\varvec{K},\varvec{L})\) is a polynomial in \(b_1\) and \(a_2^{-|\varvec{I}|}W^a_{a_1,a_2,b_1}(\varvec{I},\varvec{J},\varvec{K},\varvec{L})\) is a polynomial in \(a_2^{-1}\). Moreover, \(a_2^{-|\varvec{I}|}R_{a_1,b_1,a_2,b_2}(\varvec{I},\varvec{J},\varvec{K},\varvec{L})\) is a polynomial in \(b_1\) and \(a_2^{-1}\).
Proposition 3.5
The following identity of rational functions in \(a_1\), \(a_2\), \(a_3\), \(b_1\), \(b_2\), \(b_3\) holds:
Proof
It is enough to prove that for generic \(a_1, a_2, a_3, b_1, b_2, b_3\) the expression
is equal to
Consider two isomorphisms \(V^{a_1}_{b_1}\otimes V^{a_2}_{b_2}\otimes V^{a_3}_{b_3}\cong V^{a_1}_{b_3}\otimes V^{a_2}_{b_2}\otimes V^{a_3}_{b_1}\):
Both isomorphisms send \(v_{\varvec{0}}\otimes v_{\varvec{0}}\otimes v_{\varvec{0}}\) to \(v_{\varvec{0}}\otimes v_{\varvec{0}}\otimes v_{\varvec{0}}\), so by irreducibility they are equal. The claim now follows by applying Proposition 3.1 to
Note that all coefficients \(C_{a,b;a',b'}(\varvec{I},\varvec{J},\varvec{K},\varvec{L})\) cancel out. \(\square \)
Proposition 3.6
The following identity of rational functions holds:
Proof
The claim follows from the same argument as in Proposition 3.5, applied to the morphisms
\(\square \)
Remark 3.7
The identity from Proposition 3.5 was first established in [4], where it was called an inhomogeneous Yang–Baxter equation and it was proved by algebraic manipulations starting with the Yang–Baxter equation
Back then it was not clear for us if there exists some quantum group reasoning behind the existence of such equations and if there is a systematic way to construct them. The discussion of this section answers both questions by considering instead of isomorphisms of the form \(V_1\otimes V_2\otimes V_3\cong V_3\otimes V_2\otimes V_1\) isomorphisms \(V_1\otimes V_2\otimes V_3\cong {{\tilde{V}}}_3\otimes {{\tilde{V}}}_2\otimes {{\tilde{V}}}_1\), where representations \({{\tilde{V}}}_i\) are different from \(V_i\). This new understanding allows us to obtain Proposition 3.6 here, which is new.
4 Vertex models and transfer matrices
From this point we move away from the quantum affine algebras and focus on applying the relations obtained in Sect. 3 to the algebraic-combinatorial objects of our interest. In this section we introduce row operators \({\mathbb {B}}(x \mid {\mathcal {A}}, {\mathcal {B}})\), \({\mathbb {B}}^*(y\mid {\mathcal {A}}, {\mathcal {B}})\) and prove exchange relations between them by iterating Propositions 3.5 and 3.6. To make our expressions and manipulations clearer we also explain the language of vertex models here.
We use the following notation. Since in what follows we only need Propositions 3.5 and 3.6 when \(n=1\), that is, when the quantum algebra in question is \(U_q'({\widehat{\mathfrak {sl}}}_2)\), we replace all length 1 compositions \(\varvec{I}=(I)\) by nonnegative integers I. From now on we treat q as a formal variable, and let \({\mathcal {A}}=(a_0, a_1, \dots )\), \({\mathcal {B}}=(b_0, b_1,\dots )\) denote two infinite sequences of parameters \(a_i\) and \(b_i\), which we treat as formal variables. Let \({\mathbb {k}}={\mathbb {Q}}(q, {\mathcal {A}}, {\mathcal {B}})\) denote the field of rational functions in \(q,a_0, a_1, \dots \) and \(b_0, b_1, \dots \). For a sequence \({\mathcal {X}}=(\chi _0, \chi _1, \dots )\) we set
4.1 Vertex models
By a vertex model we mean the following data:
-
A collection of oriented lines in the plane, whose intersections are called vertices, while the line segments between vertices are called edges. The edges are oriented in the same way as the underlying lines, and each vertex has exactly two incoming edges and two outgoing edges. An edge is internal if it connects a pair of distinct vertices, and is boundary if it is connected to only one vertex.
-
A collection of pairs of edge parameters (a, b), which are assigned to the edges and are constrained by the following rule: for each vertex if (a, b) and \((a',b')\) are the parameters of the incoming edges of the vertex, then its outgoing edges have either parameters (a, b) and \((a',b')\), or parameters \((a,b')\) and \((a',b)\). See Fig. 1 for the assignments satisfying this constrain.
A configuration of a vertex model is an assignment of non-negative integer labels to the edges. In this text we usually denote these labels by capital letters \(I,J,\dots \). Given a configuration around a vertex, that is, a collection of four edge labels I, J, K, L attached to the adjacent edges, we define the corresponding vertex weight by tracking the behavior of the edge parameters \(a,a',b,b'\) and correspondingly using expressions (3.3)–(3.5) in the way demonstrated in Fig. 1.
In other words, we define four types of vertices \(Id, W^a, W^b, R\), which are determined by the arrangement of the parameters \(a,b,a',b'\). For example, the vertex of type \(W^a\) preserves the parameters \(a,a'\) but swaps the parameters \(b,b'\) between the lines. Note that in each of these situations the vertex weight vanishes unless \(I+J=K+L\), we call this fact the conservation law.
The weight of a configuration of a vertex model is the product of the weights of all vertices. A boundary condition for a vertex model is an assignment of labels to the boundary edges. Given a vertex model and a boundary condition we define the corresponding partition function as the sum of the weights of all configurations satisfying the boundary condition, that is, configurations whose labels of the boundary edges coincide with the boundary condition. For example, both sides of the identities from Propositions 3.5 to 3.6 can be rewritten as partition functions in the following way:
Here to each boundary edge we assign a triple (I; a, b), where I is the label of the boundary condition, and (a, b) are the corresponding edge parameters. To lighten the notation, instead of specifying the edge parameters of the internal edges we specify the types of the vertices, \(W^a,W^b\) or R; the labels of the internal edges can be uniquely reconstructed from this information.
Remark 4.1
The vertex models described above are directly related to the representations from Sect. 3: we can think about the edge with parameters (a, b) as of the representation \(V^{\sqrt{a}}_{\sqrt{b}}\), and each vertex corresponds to one of the operators \(V^{\sqrt{a_1}}_{\sqrt{b_1}}\otimes V^{\sqrt{a_2}}_{\sqrt{b_2}}\rightarrow V^{\sqrt{{{\tilde{a}}}_1}}_{\sqrt{{{\tilde{b}}}_1}}\otimes V^{\sqrt{{{\tilde{a}}}_2}}_{\sqrt{{{\tilde{b}}}_2}}\). Note that the whole partition function corresponds to a matrix coefficient of the isomorphism between two tensor products of the form \(V^{\sqrt{a_1}}_{\sqrt{b_1}}\otimes \dots \otimes V^{\sqrt{a_L}}_{\sqrt{b_L}}\), with the parameters \(a_i\) and \(b_i\) determined by the parameters of the incoming and outgoing edges.
Remark 4.2
One can also write the usual Yang–Baxter equation \((1\otimes R)(R\otimes 1)(R\otimes 1)=(R\otimes 1)(1\otimes R)(R\otimes 1)\) using vertex models:
Note that when \(a_1=a_2=a_3\), the Eq. (4.1) becomes equivalent to the usual Yang–Baxter equation, while setting \(a_1=a_2, b_2=b_3\) makes (4.2) and (4.3) equivalent.
4.2 Row operators
For \(N\in {\mathbb {Z}}_{\ge 0}\) define \(V^{(N)}\) as the vector space over \({\mathbb {k}}\) with a basis consisting of the vectors \(|A_1, A_2, \dots , A_N\rangle \) enumerated by N-tuples \((A_1, \dots , A_N)\in {\mathbb {Z}}^N_{\ge 0}\). Equivalently, the same basis can be enumerated by partitions \(\mu \) of length at most N in the following way:
we use both notations interchangeably. For \(N<M\) we have an embedding \(V^{(N)}\subset V^{(M)}\) which is defined by \(|\mu \rangle \mapsto |\mu \rangle \) where \(l(\mu )\le N<M\), or, equivalently, \(|A_1, \dots , A_N\rangle \mapsto |A_1, \dots , A_N, 0, \dots , 0\rangle \). Let \(\langle \lambda |\), \(\langle B_1, \dots , B_N|\) denote the vectors dual to the basis \(\{|\mu \rangle \}_{l(\mu )\le N}\).
Recall that \({\mathcal {A}}=(a_0, a_1, a_2, \dots )\) and \({\mathcal {B}}=(b_0, b_1, b_2, \dots )\) denote two infinite sequences of parameters. For variables x, y we define row-to-row transfer matrices \(T^{b,N}_{I, L}(x\mid {\mathcal {A}},{\mathcal {B}}), T^{a,N}_{K, J}(y\mid {\mathcal {A}},{\mathcal {B}}):V^{(N)}\rightarrow V^{(N)}\) by setting
where for a sequence \((A_1, \dots , A_N)\) we set \(A_{[1,0]}=0\), \(A_{[1,r]}=A_1+\dots +A_r\) for \(r\ge 1\). The same definitions can be graphically represented using the following partition functions:
Here we follow the same conventions as in (4.1) and (4.2), and in the second partition function we use the vertices of type \(W^a\) rotated by \(90^\circ \). Note that for each partition function above there exists at most one configuration with nonzero weight, and \(L_r, J_r\) from (4.4) to (4.5) respectively are the labels of the internal edges in these configurations.
In the following statement we summarize simple facts about the operators \(T^{b,N}_{I,0}\) and \(T^{a,N}_{K,0}\):
Proposition 4.3
Let \(N\in {\mathbb {Z}}_{\ge 0}\) and \(\lambda ,\mu \) be partitions of length at most N.
(1) Duality: The operators \(T^{a,N}_{K,0}\) and \(T^{b,N}_{K,0}\) are dual in the following sense:
where \(\tau {\mathcal {B}}\) denotes the shifted sequence and \(\psi _\lambda ({\mathcal {A}},{\mathcal {B}})\) is defined by
(2) Polynomiality: \(\langle \lambda |\, T^{b,N}_{I,L}(x\mid {\mathcal {A}}, {\mathcal {B}})\, |\mu \rangle \) is a polynomial in x, and \(\langle \mu |\, T^{a,N}_{K,J}(y\mid {\mathcal {A}}, {\mathcal {B}})\, |\lambda \rangle \) is a polynomial in \(y^{-1}\). Moreover, both are rational functions in q regular at \(q=0\).
(3) Interlacing: \(\langle \lambda |\, T^{b,N}_{I,0}(x\mid {\mathcal {A}}, {\mathcal {B}})\, |\mu \rangle =\langle \mu |\, T^{a,N}_{I,0}(y\mid {\mathcal {A}}, {\mathcal {B}})\, |\lambda \rangle =0\) unless \(\lambda _1-\mu _1=I\) and \(\mu \prec \lambda \), that is,
In particular, for any \(M>N\) we have \(T^{b,M}_{I,0}(x\mid {\mathcal {A}}, {\mathcal {B}})\, V^{(N)}\subset V^{(N+1)}\) and \(T^{a,M}_{K,0}(y\mid {\mathcal {A}}, {\mathcal {B}})\, V^{(N)}\subset V^{(N)}\).
(4) Stability: For any \(M>N\) we have
Proof
Part (1) follows at once from (4.4) to (4.5) and the relation
which is verified using (3.3) and (3.4). Part (2) is also immediate from (3.3) to (3.4), see Remark 3.4.
For the interlacing statement from part (3), by part (1) it is enough to prove it for \(T^{b,N}_{I,0}(x\mid {\mathcal {A}}, {\mathcal {B}})\). Set \(J_r=\mu _{r}-\mu _{r+1}\), \(K_r=\lambda _r-\lambda _{r+1}\) and \(L_r=I+J_{[1,r]}-K_{[1,r]}\). Recall from (3.4) that the weights \(W^b_{a_{r+1}, x, b_r}(L_{r-1},J_r,K_r,L_r)\) vanish unless \(J_r\ge L_r\ge 0\). Hence, for \(\langle K_1, \dots , K_N|\, T^{b,N}_{I,0}(x\mid {\mathcal {A}}, {\mathcal {B}})\, |J_1, \dots , J_N\rangle \) to be nonzero we must have \(J_r\ge L_r\ge 0\) for every r, and moreover \(L_N=0\). Noting that \(L_r= I+\mu _1-\mu _{r+1}-\lambda _1+\lambda _{r+1}\) and, in particular, \(L_N=I+\mu _1-\lambda _1\), we get that \(\langle \lambda |\, T^{b,N}_{I,0}(x\mid {\mathcal {A}},{\mathcal {B}})\, |\mu \rangle =0\) unless \(I=\lambda _1-\mu _1\) and \(\mu _{r}-\mu _{r+1}\ge I+\mu _1-\mu _{r+1}-\lambda _1+\lambda _{r+1}\ge 0\) for every r. The last inequality is equivalent to \(\mu _r\ge \lambda _{r+1}\ge \mu _{r+1}\) for all \(r=1, \dots , N\), and since \(\lambda _1-\mu _1=I\ge 0\) the interlacing \(\mu \prec \lambda \) follows.
The second statement of part (3) follows immediately from the interlacing by noticing that \(\mu \prec \lambda \) implies \(l(\mu )\le l(\lambda )\le l(\mu )+1\). Finally, part (4) follows from the fact that \(W^b_{a,x,b}(0,0,0,0)=W^a_{a,y,b}(0,0,0,0)=1\) and the relation
which holds since \(l(\lambda ),l(\mu )\le N<M\). The claim for \(T^{a,M}_{K,0}\) is handled similarly. \(\square \)
Let \(V:=\bigcup _N V^{(N)}\) be a vector space over \({\mathbb {k}}\) with a basis \(\{|\mu \rangle \}_{\mu \in {\mathbb {Y}}}\) where we have no restrictions on the length of \(\mu \). The natural embeddings \(V^{(N)}\subset V\) are given by identifying the vectors \(|\mu \rangle \) in V and \(V^{(N)}\) when \(l(\mu )\le N\). By parts (3) and (4) of Proposition 4.3 we can define operators \({\mathbb {T}}^b_I(x\mid {\mathcal {A}},{\mathcal {B}}),{\mathbb {T}}^a_K(y\mid {\mathcal {A}},{\mathcal {B}}): V\rightarrow V\) by setting for \(v\in V^{(N)}\)
Using these operators, we can formally define
More precisely, \({\mathbb {B}}^*(y\mid {\mathcal {A}},{\mathcal {B}})\) is an operator \(V\rightarrow V\) such that
which is well-defined since \(\langle \mu |\, {\mathbb {T}}^a_{\lambda _1-\mu _1}(y\mid {\mathcal {A}},{\mathcal {B}})\, |\lambda \rangle \ne 0\) only if \(\mu \prec \lambda \), and there is a finite number of such \(\mu \) given a fixed \(\lambda \). On the other hand, \({\mathbb {B}}(x\mid {\mathcal {A}}, {\mathcal {B}})\) is not a well-defined operator \(V\rightarrow V\), since given a fixed \(\mu \) we have \({\mathbb {T}}^b_r(x\mid {\mathcal {A}}, {\mathcal {B}})\, |\mu \rangle \ne 0\) for infinitely many r in general. However, we can resolve this issue by considering q and x as formal variables: by Proposition 4.3 the matrix coefficients \(\langle \lambda |\, {\mathbb {T}}_{\lambda _1-\mu _1}^b(x\mid {\mathcal {A}}, {\mathcal {B}})\,|\mu \rangle \) can be viewed as elements of the algebra of formal power series \({\mathbb {k}}[[x,q]]\), while the combined degree with respect to x and q of \((x/b_0)^r\frac{(a_1 x^{-1};q)_r}{(q;q)_r}\) is at least r. Hence \({\mathbb {B}}(x\mid {\mathcal {A}}, {\mathcal {B}})\) is a well-defined operator \(V[[x,q]]\rightarrow V[[x,q]]\), where V[[x, q]] is the vector space of formal power series in x, q with coefficients in V.
The operators \({\mathbb {B}}(x\mid {\mathcal {A}},{\mathcal {B}})\) and \({\mathbb {B}}^*(y\mid {\mathcal {A}}, {\mathcal {B}})\) are dual to each other in the following sense:
Proposition 4.4
For any \(\lambda ,\mu \) we have
where \(\psi _\lambda ({\mathcal {A}},{\mathcal {B}})\) is defined by
Proof
Follows from Proposition 4.3, part (1). \(\square \)
4.3 Exchange relations
Now we consider commutation relations between the operators \({\mathbb {B}}(x\mid {\mathcal {A}}, {\mathcal {B}})\) and \({\mathbb {B}}^*(y\mid {\mathcal {A}},{\mathcal {B}})\), which follow from the Eqs. (4.1) and (4.2). In fact, our construction of the operators \({\mathbb {T}}^b_{r}(x\mid {\mathcal {A}}, {\mathcal {B}}),{\mathbb {T}}^a_r(y\mid {\mathcal {A}},{\mathcal {B}})\) is motivated by (4.1) and (4.2) and the desire for the arguments below to work, while the linear combinations \({\mathbb {B}}(x\mid {\mathcal {A}}, {\mathcal {B}})\), \({\mathbb {B}}^*(x\mid {\mathcal {A}}, {\mathcal {B}})\) are distinguished by especially nice commutation relations.
Proposition 4.5
We have
or, more precisely, the following relation holds:
where both sides are viewed as formal power series in \(x,y^{-1}\) and q.
Proof
We prove the latter identity, fixing \(\mu ,\nu \) from the statement and setting \(L_r=\nu _r-\nu _{r+1}\), \(J_r=\mu _r-\mu _{r+1}\).
Step 1: We start by consequently applying (4.2) to get a relation between the operators \(T^{b,N}_{I,0}\) and \(T^{a,N}_{K,0}\). Using (4.2) we have
where \(A_1,A_2,B_1,B_2\) are arbitrary nonnegative integers. Choosing \(N\in {\mathbb {Z}}_{\ge 0}\) such that \(N> l(\mu ),l(\nu )\) and applying these identities with \(r=N, N-1, \dots , 2, 1\) we get for arbitrary \(I,K\in {\mathbb {Z}}_{\ge 0}\)
Comparing with (4.4) and (4.5), the identity of partition functions above is equivalent to
where in the left-hand side the sum is over partitions \(\lambda \) such that \(l(\lambda )\le l(\mu )+1\le N\), with the restriction on length coming from Proposition 4.3, part (3).
From (4.4) we have
where
But \(N>l(\mu )\), so \(\mu _N=0, {{\widetilde{\mu }}}=\mu \) and hence
Recall that \(W^b_{a,x,b}(I,J,K,L)\) vanishes unless \(L\le J\), so the expression above vanishes unless \({{\tilde{I}}}=0\); hence in the left-hand side of (4.8) the nonzero terms have \({{\tilde{I}}}=0\). Since \(R_{a_{N+1},x,y,b_{N+1}}(0,0,{{\tilde{K}}}, 0)=\delta _{{{\tilde{K}}},0}\), the non-zero summands in the left-hand side of (4.8) must also have \({{\tilde{K}}}=0\) and we obtain
Step 2: Now we use (4.9) to obtain the claim. First, using the definitions of \({\mathbb {T}}^b_I\) and \({\mathbb {T}}^a_K\), we obtain
where in the left-hand side we have used that \(\lambda _1=\mu _1+I\) to simplify the term \((y/b_0)^{\lambda _1}\). Both sides are now elements of \({\mathbb {k}}[[q,x,y^{-1}]]\), cf. Remark 3.4 and part (2) of Proposition 4.3. So we can multiply both sides by \(x^I\frac{(a_1/x;q)_I}{(q;q)_I}\) and take the sum over I, K such that \(I+\mu _1=K+\nu _1\), obtaining an identity in \({\mathbb {k}}[[q,x,y^{-1}]]\):
To prove the claim it is now enough to show that for fixed \(J,L\ge 0\)
Indeed, if (4.11) holds we can apply it to the right-hand side of (4.10) to get
Step 3: To finish the proof we need to establish (4.11). Since both sides are power series in \(x,y^{-1},q\) with coefficients in \({\mathbb {C}}(a,b)\), it is enough to consider the case when \(x=q^{S}a\) for \(S\in {\mathbb {Z}}_{\ge 0}\): if for a formal series \(f(q,x)\in {\mathcal {F}}[[q,x]]\) we have \(f(q, q^S)=0\) for all \(S\in {\mathbb {Z}}_{\ge 0}\) then \(f(q,x)=0\), cf. [43, Lemma 3.2]. So, from now on we set \(x=q^{S}a\).
Let \(A\in {\mathbb {Z}}_{\ge 0}\) be a sufficiently large integer. From (4.2) we have
Consider the right-hand side of (4.13). Note that for \({\tilde{J}},{\tilde{L}}\ge 0\) we have
where for the first equality we have used the second line of (3.5), noting that since \(K=0\) the only nonzero summand has \(P=0\). Since \({\tilde{L}}={\tilde{J}}+S\ge S\) and \((q^{-{\tilde{L}}};q)_S\) vanishes unless \({\tilde{L}}\le S\), we get
Hence in the right-hand side (4.13) the configuration around the vertex of type R is fixed, and the partition function is equal to
For the left-hand side of (4.13) we can use (3.3) and (3.4) to write it as
Note that in the summation above we can assume \(I\le S\), since \((q^{-S};q)_I\) vanishes otherwise. In particular, the number of terms is bounded by S regardless of the value of A.
To finish the proof we need to consider the dependence on A. Namely, we rewrite (4.13) in a way that both sides are rational in \(q^A\): using the identity
and the expressions above for the both sides we readily obtain
Note that the both sides of (4.14) are rational functions in \(q^A\), which are equal when A is a sufficiently large integer. Hence (4.14) holds for any value of \(q^A\), and in particular we can set \(q^A=0\) getting
which is exactly (4.11) when \(x=q^Sa\). \(\square \)
Remark 4.6
One can check that (4.11) from the proof above is equivalent to the q-Gauss identity
Proposition 4.7
([4, Proposition 4.5]) The following relation holds
Proof
(Idea of the proof) This is exactly [4, Proposition 4.5], and the proof is similar to Proposition 4.5 above, so we provide only a brief sketch of it. First, similarly to steps 1,2 of the proof of Proposition 4.5, we use (4.1) to get a commutation relation
Then the claim follows from multiplying both sides of the commutation equation above by \((x_1/b_0)^I(x_2/b_0)^J\frac{(a_1/x_1;q)_I(a_2/x_2;q)_J}{(q;q)_I(q;q)_J}\), taking the sum over I, J and applying the relation
where K, L are fixed. The last relation can be proved by taking (4.1) and setting \(A_1=A_2=0, B_1=L, B_2=K\), while \(B_3\rightarrow \infty \) and \(A_3=B_1+B_2+B_3\). \(\square \)
5 Inhomogeneous spin q-Whittaker polynomials
In this section we describe inhomogeneous spin q-Whittaker polynomials, originally introduced in [4], and apply the results from previous sections to establish a new Cauchy-type identity and a new characterization theorem for these functions. We continue to use \({\mathcal {A}}=(a_0, a_1,\dots )\) and \({\mathcal {B}}=(b_0,b_1,\dots )\) to denote sequences of parameters and we continue to use the notation \(\tau {\mathcal {A}}, {{\overline{{\mathcal {A}}}}}\) from Sect. 4.
5.1 Basic properties
For a pair of partitions \(\lambda ,\mu \) and a collection of variables \(x_1, \dots , x_n\) the inhomogeneous spin q-Whittaker polynomial \({\mathbb {F}}_{\lambda /\mu }(x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\) is defined in terms of the operators \({\mathbb {B}}(x\mid {\mathcal {A}},{\mathcal {B}})\) from Sect. 4 by
When \(\mu =\varnothing \) we write \({\mathbb {F}}_{\lambda }\) instead of \({\mathbb {F}}_{\lambda /\varnothing }\).
The following properties of \({\mathbb {F}}_{\lambda /\mu }(x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\) were proved in [4]Footnote 7; for the sake of completeness we sketch their proofs:
Proposition 5.1
We have the following expression for the single-variable function \({\mathbb {F}}_{\lambda /\mu }(x\mid {\mathcal {A}},{\mathcal {B}})\):
Proof
The vanishing part follows from Proposition 4.3, part (3). For the explicit expression we use (3.4) and (4.4) and the relation
\(\square \)
Proposition 5.2
The following branching rule holds:
where the sum is over partitions \(\mu \) such that \(\lambda _{r}\ge \mu _{r}\ge \lambda _{r+1}\) for all r.
Proof
Follows from (5.1) and Proposition 4.3, part (3). \(\square \)
Corollary 5.3
\({\mathbb {F}}_{\lambda /\mu }(x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})=0\) unless \(\mu \subset \lambda \) and \(l(\lambda )\le l(\mu )+n\). \(\square \)
Proposition 5.4
The functions \({\mathbb {F}}_{\lambda /\mu }(x_1, x_2, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\) are symmetric polynomials in \(x_1, \dots , x_n\).
Proof
\({\mathbb {F}}_{\lambda /\mu }(x_1, x_2, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\) is a polynomial since \(\langle \lambda |\, {\mathbb {B}}(x\mid {\mathcal {A}},{\mathcal {B}})\,|\mu \rangle \) is a polynomial in x for any \(\lambda ,\mu \), and \({\mathbb {F}}_{\lambda /\mu }(x_1, x_2, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\) is a finite sum of products of such matrix elements thanks to Proposition 5.2. The symmetry in \(x_1, \dots , x_n\) follows from Proposition 4.7. \(\square \)
Remark 5.5
Using (4.6), we can also obtain a graphical definition of the functions \({\mathbb {F}}_{\lambda /\mu }(x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\) in terms of vertex models:
where \(Z^{r_1, \dots , r_n}_{\lambda /\mu }\) is the partition function from Fig. 2.
Remark 5.6
Spin q-Whittaker polynomials were originally introduced in [10], and later a different but related version was constructed in [29]. The inhomogeneous spin q-Whittaker polynomials \({\mathbb {F}}_{\lambda /\mu }(x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\) generalize both these versions: when \(a_1=a_2=\dots =s\) and \(b_1=b_2=\dots =s^{-1}\) the function \({\mathbb {F}}_{\lambda /\mu }(x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\) degenerates to the corresponding spin q-Whittaker polynomial from [10], while setting \(b_1=b_2=\dots =s^{-1}\), \(a_{n+l(\mu )}=0\) and \(a_i=s\) for all \(i\ne n+l(\mu )\) reduces \({\mathbb {F}}_{\lambda /\mu }(x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\) to the version from [29].
5.2 Cauchy identity
To formulate the Cauchy identity we define dual functions \({\mathbb {F}}^*(y_1, \dots , y_m\mid {\mathcal {A}},{\mathcal {B}})\) by
where \(\psi _\lambda ({\mathcal {A}},{\mathcal {B}})\) is defined in Proposition 4.4:
Equivalently, the dual functions can be defined using the operators \({\mathbb {B}}^*(y\mid {\mathcal {A}},{\mathcal {B}})\) from Sect. 4:
Proposition 5.7
We have
Proof
Follows immediately from Proposition 4.4. Note that \(\psi _\lambda ({\mathcal {A}},{\mathcal {B}})=\psi _\lambda ({\overline{{\mathcal {B}}}},{\overline{{\mathcal {A}}}})\). \(\square \)
Theorem 5.8
For any partitions \(\mu ,\nu \) the following equality of formal power series in \(x_i,y_j,q\) holds:
In particular, setting \(\mu =\nu =\varnothing \) yields
Proof
This follows from Proposition 4.5 in the following way: set
Repeatedly using the exchange relation from Proposition 4.5, shifting \({\mathcal {A}}\) on each step, we have
Setting
and iterating (5.3) for m times, substituting \(y=y_j^{-1}\) for \(j=1, \dots , m\), we get
Evaluating both sides of the last identity at \(\langle \nu |\cdot |\mu \rangle \) yields the claim. \(\square \)
Remark 5.9
In the case \(b_0=b_1=b_2=\dots =1\) Theorem 5.8 was proved in [4] using the dual skew Cauchy identity between spin q-Whittaker functions and spin Hall–Littlewood functions.
5.3 Vanishing and characterization properties
It turns out that the functions \({\mathbb {F}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\) satisfy vanishing and characterization properties similar to interpolation symmetric functions. We start with the vanishing property. Recall that \({\mathbb {Y}}^n\) denotes the set of partitions of length at most n. For a partition \(\mu \in {\mathbb {Y}}^n\) set
Proposition 5.10
Let \(\lambda ,\mu \in {\mathbb {Y}}^n\) be partitions of length at most n. Then we have
Moreover, when \(\lambda =\mu \), we have
where
Proof
The proof is by induction on n, with the case \(n=0\) being trivial.
Assume that we have proved the claim for \(n-1\). Using the branching rule from Proposition 5.2 we get
where \({\widetilde{\mu }}=(\mu _2, \mu _3, \dots )\). Note that by induction hypothesis \({\mathbb {F}}_\nu ({\textbf{x}}_{\tau {\mathcal {A}}}^{n-1}({\widetilde{\mu }})\mid \tau {\mathcal {A}},{\mathcal {B}})=0\) unless \(\nu \subseteq {\widetilde{\mu }}\). But \(\nu \prec \lambda \) implies \({\widetilde{\lambda }}:=(\lambda _2,\lambda _3, \dots )\subseteq \nu \), so if in the sum above the summand corresponding to \(\nu \) does not vanish then \({\widetilde{\lambda }}\subseteq \nu \subseteq {\widetilde{\mu }}\).
At the same time, \({\mathbb {F}}_{\lambda /\nu }(a_1q^{\mu _1-\mu _2}\mid {\mathcal {A}},{\mathcal {B}})\) vanishes unless \(\lambda _1-\nu _1\le \mu _1-\mu _2\), because of the factor \((q^{\mu _{2}-\mu _1};q)_{\lambda _1-\nu _1}\) in
Hence, if \({\mathbb {F}}_\lambda ({\textbf{x}}_{{\mathcal {A}}}^n(\mu )\mid {\mathcal {A}},{\mathcal {B}})\ne 0\), then for some \(\nu \) we have \(\lambda _1-\nu _1\le \mu _1-\mu _2\) and \({\widetilde{\lambda }}\subseteq \nu \subseteq {\widetilde{\mu }}\). But then
so \(\lambda \subseteq \mu \) as desired.
To prove the second statement, note that if \(\lambda =\mu \) then the only possible \(\nu \) in the discussion above is \(\nu ={\widetilde{\lambda }}=(\lambda _2,\lambda _3,\dots )\). Hence
and the second statement readily follows from Proposition 5.1. \(\square \)
Remark 5.11
We can use the graphical definition of \({\mathbb {F}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\) from Remark 5.5 to sketch an alternative proof of Proposition 5.10. Namely, we have
where \(Z^{r_1, \dots , r_n}_\lambda \) is the partition function from Fig. 2 with \(x_i=a_iq^{\mu _i-\mu _{i+1}}\). Note that the factors \((q^{\mu _{i+1}-\mu _i};q)_{r_i}\) force \(r_i\le \mu _i-\mu _{i+1}\) in the sum above. On the other hand, since the weights \(W^b(I,J,K,L)\) vanish unless \(I+J=K+L\) and \(I\le K\), the partition function \(Z^{r_1, \dots , r_n}_\lambda \) vanishes unless \(r_1+\dots +r_n=\lambda _1\) and \(r_1+\dots +r_i\le \lambda _1-\lambda _{i+1}\) for any i. These restrictions on \(r_i\) imply that non-zero terms in the sum above must satisfy \(\lambda _{i}\le r_{i}+\dots +r_n\le \mu _{i}\), hence \({\mathbb {F}}_\lambda ({\textbf{x}}^n_{{\mathcal {A}}}(\mu )\mid {\mathcal {A}},{\mathcal {B}})\) vanishes unless \(\lambda _{i}\le \mu _i\).
Now we can consider a characterization for \({\mathbb {F}}_\lambda (x_1, \dots ,x_n\mid {\mathcal {A}},{\mathcal {B}})\) in terms of the vanishing property. To do so, we need the following notation. For an integer n and a partition \(\mu \in {\mathbb {Y}}^n\) define
Note that \(G_\mu ^{n}\) are symmetric polynomials in \(x_1, \dots , x_n\). Define a filtration \({\mathcal {G}}^n_0\subset {\mathcal {G}}^n_1\subset \dots \subset {\mathbb {k}}[x_1, \dots , x_n]^{S_n}\) of the algebra of symmetric polynomials in \(x_1, \dots ,x_n\) with coefficients in \({\mathbb {k}}={\mathbb {Q}}(q, {\mathcal {A}},{\mathcal {B}})\) by
Theorem 5.12
For each partition \(\lambda \in {\mathbb {Y}}^n\) the function \({\mathbb {F}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\) is uniquely characterized by the following properties:
-
1.
\({\mathbb {F}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\in {\mathcal {G}}^{n}_{|\lambda |}\).
-
2.
For any partition \(\mu \) such that \(|\mu |\le |\lambda |\) and \(\mu \ne \lambda \) we have \({\mathbb {F}}_\lambda ({\textbf{x}}^n_{{\mathcal {A}}}(\mu )\mid {\mathcal {A}},{\mathcal {B}})=0\).
-
3.
\({\mathbb {F}}_\lambda ({\textbf{x}}^n_{{\mathcal {A}}}(\lambda )\mid {\mathcal {A}},{\mathcal {B}})=H_\lambda ({\mathcal {A}},{\mathcal {B}})\), where \(H_\lambda ({\mathcal {A}},{\mathcal {B}})\) is defined in Proposition 5.10 above.
Moreover, for each \(m\in {\mathbb {Z}}_{\ge 0}\) both \(\{{\mathbb {F}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\}_{\begin{array}{c} \lambda \in {\mathbb {Y}}^n\\ |\lambda |\le m \end{array}}\) and \(\{G_\mu ^{n}(x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\}_{\begin{array}{c} \mu \in {\mathbb {Y}}^n\\ |\mu |\le m \end{array}}\) are bases of \({\mathcal {G}}^n_m\).
Proof
We start with the last statement. Let \(\mu \in {\mathbb {Y}}^n\) and consider the Cauchy identity (5.2) with \(m=n\) and \(y_i=q^{\mu _i-\mu _{i+1}}/b_i\). We get
Note that \((q^{\mu _1-\mu _2}/b_1, \dots , q^{\mu _n}/b_n)={\textbf{x}}^n_{{\overline{{\mathcal {B}}}}}(\mu )\), so we can apply the vanishing property from Proposition 5.10 to \({\mathbb {F}}^*_{\lambda }({\textbf{x}}^n_{{{\overline{{\mathcal {B}}}}}}(\mu )\mid {\overline{{\mathcal {B}}}}, {\overline{{\mathcal {A}}}})\), obtaining
where \(\psi _\lambda ({\mathcal {A}},{\mathcal {B}})\) is defined in Proposition 4.4. Hence we can rewrite (5.5) as
for some coefficients \(c_{\lambda \mu }\) such that \(c_{\mu \mu }\ne 0\). Thus, the transition matrix expressing \(G^n_\mu \) in terms of \({\mathbb {F}}_\lambda \) is upper-triangular with respect to the partial order of inclusion of Young diagrams. Moreover, this transition matrix has non-zero diagonal entries \(c_{\mu \mu }\), so it is invertible and
for some other coefficients \({\overline{c}}_{\mu \lambda }\).
Note that (5.6) and (5.7) imply that
On the other hand, the vanishing property implies that the functions \({\mathbb {F}}_\lambda \) are linearly independent: if we have
with \(\alpha _\lambda \) being nonzero for some \(\lambda \), then we get a contradiction by plugging \((x_1, \dots , x_n)={\textbf{x}}^n_{{\mathcal {A}}}(\mu )\) for a minimal \(\mu \) such that \(\alpha _\mu \ne 0\) and using the vanishing property from Proposition 5.10. Hence \(\{{\mathbb {F}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\}_{\begin{array}{c} \lambda \in {\mathbb {Y}}^n\\ |\lambda |\le m \end{array}}\) is a basis of \({\mathcal {G}}^n_m\), and consequently \(\{G_\mu ^{n}(x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\}_{\begin{array}{c} \mu \in {\mathbb {Y}}^n\\ |\mu |\le m \end{array}}\) is also a basis of \({\mathcal {G}}_m^n\) in view of (5.6) and (5.7).
Now we can prove the characterization property. We have already proved that \({\mathbb {F}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\in {\mathcal {G}}^n_m\), and the other properties follow from Proposition 5.10, so we only need to prove that \({\mathbb {F}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\) is the unique function satisfying properties (1)–(3) above. Let \(f(x_1,\dots , x_n)\) be a function which also satisfies properties (1)–(3), and consider the difference \(g(x_1,\dots , x_n)=f(x_1, \dots , x_n)-{\mathbb {F}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\). We have \(g(x_1, \dots , x_n)\in {\mathcal {G}}_m^n\) and \(g({\textbf{x}}^n_{\mathcal {A}}(\mu ))=0\) for any \(\mu \) such that \(|\mu |\le |\lambda |\). Since \(\{{\mathbb {F}}_\lambda (x_1,\dots , x_n\mid {\mathcal {A}},{\mathcal {B}})\}_{\begin{array}{c} \lambda \in {\mathbb {Y}}^n\\ |\lambda |\le m \end{array}}\) is a basis of \({\mathcal {G}}_{|\lambda |}^n\), for some coefficients \(\alpha _\nu \in {\mathbb {k}}\) we have
Assume that \(g\ne 0\) and let \(\mu \) be a minimal partition such that \(\alpha _\mu \ne 0\). Then by Proposition 5.10
which leads to contradiction. Hence, \(g={\mathbb {F}}_\lambda -f=0\), and the uniqueness follows. \(\square \)
5.4 Degenerations of \({\mathbb {F}}_{\lambda }\)
In this section we introduce two degenerations of the functions \({\mathbb {F}}_{\lambda }\), the importance of which is explained in Sect. 6 where the interpolation symmetric functions are discussed.
We start by considering \(b_i\rightarrow \infty \) in the functions \({\mathbb {F}}_{\lambda /\mu }(x_1, \dots , x_n\mid {\mathcal {A}}, {\mathcal {B}})\). Recall that \({\mathbb {F}}_{\lambda /\mu }(x_1, \dots , x_n\mid {\mathcal {A}}, {\mathcal {B}})\) is a rational function in \(b_i\), but Proposition 5.1 implies that when we set \(b_i^{-1}=0\) for all \(i\ge 1\), the function just vanishes. So, to get a meaningful object we first need to renormalize the function: set
and let \({\widetilde{{\mathbb {F}}}}_{\lambda /\mu }(x_1, \dots , x_n\mid {\mathcal {A}},\infty )\) denote the result of the substitution \(b_0^{-1}=b_1^{-1}=\dots =0\).
Almost all the properties of \({\mathbb {F}}_{\lambda /\mu }(x_1,\dots x_n\mid {\mathcal {A}},{\mathcal {B}})\) described earlier in this section can be readily modified to obtain properties of \({\widetilde{{\mathbb {F}}}}_{\lambda /\mu }(x_1, \dots , x_n\mid {\mathcal {A}},\infty )\). For instance, we can compute \({\widetilde{{\mathbb {F}}}}_{\lambda /\mu }(x_1, \dots , x_n\mid {\mathcal {A}},\infty )\) using degenerations of the explicit expression from Proposition 5.1 and the branching rule from Proposition 5.2: we have
In the same manner we can degenerate the vanishing property from Proposition 5.10, since it does not depend on \({\mathcal {B}}\) in a significant way:
Proposition 5.13
Let \(\lambda ,\mu \) be partitions of length at most n. Then we have
Moreover, when \(\lambda =\mu \), we have
\(\square \)
The only property proved earlier which cannot be immediately degenerated to \({\mathcal {B}}=\infty \) is Theorem 5.12, since our definition of the filtration \({\mathcal {G}}^n_m\) makes little sense when \(b_i^{-1}\equiv 0\). To state the appropriate characterization, let \({\mathbb {k}}[x_1, \dots , x_n]^{S_n}_{\le m}\) denote the space of degree \(\le m\) symmetric polynomials with coefficients in \({\mathbb {k}}={\mathbb {C}}({\mathcal {A}},q)\), and let \(P_{\lambda /\mu }(x_1,\dots , x_n; q,0)\) denote the q-Whittaker function corresponding to \(\lambda /\mu \), that is, the \(t=0\) specialization of the Macdonald symmetric polynomial \(P_{\lambda /\mu }(x_1, \dots ,x_n;q,t)\), cf. [27, Chapter VI].
Proposition 5.14
The function \({{\widetilde{{\mathbb {F}}}}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}}, \infty )\) can be characterized in the following two equivalent ways: it is the unique function satisfying
-
(1)
\({{\widetilde{{\mathbb {F}}}}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}},\infty )\in {\mathbb {k}}[x_1, \dots , x_n]^{S_n}_{\le |\lambda |}\),
-
(2)
\({{\widetilde{{\mathbb {F}}}}}_\lambda ({\textbf{x}}^n_{{\mathcal {A}}}(\mu )\mid {\mathcal {A}},\infty )=0\) for any partition \(\mu \) such that \(|\mu |\le |\lambda |\) and \(\mu \ne \lambda \),
-
(3)
\({{\widetilde{{\mathbb {F}}}}}_{\lambda }({\textbf{x}}_{{\mathcal {A}}}^{n}(\lambda )\mid {\mathcal {A}},\infty )=(-1)^{\lambda _1}q^{\frac{\lambda _1^2-\lambda _1}{2}}\displaystyle \prod _{i=1}^n a_i^{\lambda _{i}}\displaystyle \prod _{i,j= 1}^n\left( \frac{q^{\lambda _{i+1}-\lambda _i}a_{j+i}}{a_{i}};q\right) _{\lambda _{j+i}-\lambda _{j+i+1}}\);
and it is also the unique function satisfying
-
(1’)
\({{\widetilde{{\mathbb {F}}}}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}},\infty )\in {\mathbb {k}}[x_1, \dots , x_n]^{S_n}_{\le |\lambda |}\), identically to (1),
-
(2’)
\({{\widetilde{{\mathbb {F}}}}}_\lambda ({\textbf{x}}^n_{{\mathcal {A}}}(\mu )\mid {\mathcal {A}},\infty )=0\) for any partition \(\mu \) such that \(|\mu |< |\lambda |\),
-
(3’)
The top degree homogeneous component of \({{\widetilde{{\mathbb {F}}}}}_{\lambda }(x_1, \dots , x_n\mid {\mathcal {A}},\infty )\) has degree \(|\lambda |\) and is equal to \(P_{\lambda }(x_1, \dots , x_n;q,0)\).
Proof
First we show that \({{\widetilde{{\mathbb {F}}}}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}}, \infty )\) indeed satisfy the properties from the statement. Properties (2), (2’) and (3) follow directly from Proposition 5.13. For the remaining properties (1), (1’) and (3’) note that by (5.9) the one-variable function \({{\widetilde{{\mathbb {F}}}}}_{\lambda /\mu }(x\mid {\mathcal {A}},\infty )\) is a polynomial of degree \(|\lambda |-|\mu |\), whose top term coincides with \(P_{\lambda /\mu }(x; q,0)\), cf. [27, Chapter VI, (7.13’)]. Then (5.8) implies that \({{\widetilde{{\mathbb {F}}}}}_{\lambda }(x_1, x_2, \dots , x_n\mid {\mathcal {A}},\infty )\) has degree at most \(|\lambda |\), and since the branching identical to (5.8) holds for the functions \(P_{\lambda /\mu }\), we deduce that the top-degree component of \({{\widetilde{{\mathbb {F}}}}}_{\lambda }(x_1, \dots , x_n\mid {\mathcal {A}},\infty )\) is indeed \(P_{\lambda }(x_1, \dots , x_n; q,0)\).
To show uniqueness for both characterizations it is enough to prove that if \(g\in {\mathbb {k}}[x_1, \dots , x_n]^{S_n}_{\le m}\) and \(g({\textbf{x}}_{{\mathcal {A}}}^n(\mu ))=0\) for all \(|\mu |\le m\), then \(g=0\). Indeed, for the first chracterization fix \(\lambda \) and assume that \(f\in {\mathbb {k}}[x_1, \dots , x_n]^{S_n}\) satisfies the conditions (1)–(3). Then \(g=f-{{\widetilde{{\mathbb {F}}}}}_{\lambda }(x_1, \dots , x_n\mid {\mathcal {A}},\infty )\) is a polynomial in \({\mathbb {k}}[x_1, \dots , x_n]^{S_n}_{\le |\lambda |}\) such that \(g({\textbf{x}}^n_{{\mathcal {A}}}(\mu ))=0\) for any \(\mu \in {\mathbb {Y}}^n, |\mu |\le |\lambda |\), since the conditions (2) and (3) fix the values of f and \({{\widetilde{{\mathbb {F}}}}}_{\lambda }(x_1, \dots , x_n\mid {\mathcal {A}},\infty )\) at \({\textbf{x}}^n_{{\mathcal {A}}}(\mu )\) for \(|\mu |\le |\lambda |\). Similarly, if \(f'\) is a function satisfying (1’)–(3’) set \(g'=f'-{{\widetilde{{\mathbb {F}}}}}_{\lambda }(x_1, \dots , x_n\mid {\mathcal {A}},\infty )\). By (2’) the functions \(f'\) and \({{\widetilde{{\mathbb {F}}}}}_{\lambda }(x_1, \dots , x_n\mid {\mathcal {A}},\infty )\) both vanish at \({\textbf{x}}^n_{{\mathcal {A}}}(\mu )\) when \(|\mu |\le |\lambda |-1\), and by (3’) the top degree homogeneous components of these functions coincide. Hence \(g'\in {\mathbb {k}}[x_1, \dots , x_n]^{S_n}_{\le |\lambda |-1}\) and \(g'({\textbf{x}}_{{\mathcal {A}}}^n(\mu ))=0\) for \(|\mu |\le |\lambda |-1\).
So, it is enough to prove that if \(g\in {\mathbb {k}}[x_1, \dots , x_n]^{S_n}_{\le m}\) and \(g({\textbf{x}}_{{\mathcal {A}}}^n(\mu ))=0\) for all \(\mu \) such that \(|\mu |\le m\), then \(g=0\). This follows from Lemma 5.15 below, which for later convenience we state in a much greater generality. \(\square \)
Lemma 5.15
Assume that for a function \(\mho :{\mathbb {Y}}^n\rightarrow {\mathbb {k}}^n\) there exists a family of polynomials \(F_\lambda (x_1, \dots , x_n\mid \mho )\) such that
-
\(F_{\lambda }(x_1, \dots , x_n\mid \mho )\in {\mathbb {k}}[x_1, \dots , x_n]^{S_n}_{\le |\lambda |}\);
-
\(F_{\lambda }(\mho (\mu )\mid \mho )=0\) for any partitions \(\lambda ,\mu \in {\mathbb {Y}}^n\) such that \(|\mu |\le |\lambda |, \lambda \ne \mu \);
-
\(F_{\lambda }(\mho (\lambda )\mid \mho )\ne 0\).
Then the following holds
-
1.
The functions \(F_{\lambda }(x_1, \dots , x_n\mid \mho )\) are uniquely determined up to a scalar;
-
2.
Let \(m\in {\mathbb {Z}}_{\ge 0}\). If \(f\in {\mathbb {k}}[x_1, \dots , x_n]^{S_n}_{\le m}\) and \(f(\mho (\mu ))=0\) for all \(\mu \in {\mathbb {Y}}^n\) such that \(|\mu |\le m\), then \(f=0\).
-
3.
For each \(m\in {\mathbb {Z}}_{\ge 0}\) the functions \(F_{\lambda }(x_1, \dots , x_n\mid \mho )\) with \(|\lambda |\le m\) form a basis of \({\mathbb {k}}[x_1, \dots , x_n]^{S_n}_{\le m}\);
-
4.
The degree of \(F_{\lambda }(x_1, \dots , x_n\mid \mho )\) is \(|\lambda |\).
Proof
Fix \(m\in {\mathbb {Z}}_{\ge 0}\) and fix a choice of functions \(F_{\lambda }(x_1, \dots , x_n\mid \mho )\) for \(|\lambda |\le m\). We claim that these functions are linearly independent: assume that
with \(\alpha _\lambda \ne 0\) for some \(\lambda \). Choose \(\mu \) such that \(\alpha _{\mu }\ne 0\) with minimal possible \(|\mu |\). Then for all \(\lambda \ne \mu \) either \(|\lambda |<|\mu |\) and \(\alpha _{\lambda }=0\), or \(|\mu |\le |\lambda |\) and \(F_{\lambda }(\mho (\mu )\mid \mho )=0\) by the definition of \(F_\lambda (x_1, \dots , x_n\mid \mho )\). Hence
leading to contradiction.
Note that the \({\mathbb {k}}\)-dimension of \({\mathbb {k}}[x_1, \dots , x_n]^{S_n}_{\le m}\) is equal to the number of partitions \(\lambda \in {\mathbb {Y}}^n\) such that \(|\lambda |\le m\), hence the functions \(F_{\lambda }(x_1, \dots , x_n\mid \mho )\) with \(|\lambda |\le m\) form a basis of \({\mathbb {k}}[x_1, \dots , x_n]^{S_n}_{\le m}\), proving (3). In particular, \(F_\lambda (x_1, \dots , x_n\mid \mho )\) with \(|\lambda |\le m-1\) form a basis of \({\mathbb {k}}[x_1, \dots , x_n]^{S_n}_{\le m-1}\), so if \(|\lambda |=m\) then the degree of \(F_\lambda (x_1, \dots , x_n\mid \mho )\) is m, proving (4). Finally, (2) implies (1) by considering the difference between two candidates for \(F_\lambda (x_1, \dots , x_n\mid \mho )\), so we only need to prove the former. Let \(f\in {\mathbb {k}}[x_1, \dots , x_n]^{S_n}_{\le m}\) be such that \(f(\mho (\mu ))=0\) for all \(\mu \) satisfying \(|\mu |\le m\), and assume that \(f\ne 0\). Consider the expansion
and choose \(\mu \) such that \(\alpha _\mu \ne 0\) with minimal \(|\mu |\). Then, in the same way as in the first part of the proof,
leading to contradiction. \(\square \)
Another degeneration of \({\mathbb {F}}_\lambda \) is obtained by considering the regime \(q,x_i,a_i\rightarrow 1\) in \({\widetilde{{\mathbb {F}}}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}}, \infty )\). More precisely, let \(\varepsilon , d,r_1, \dots , r_n\) be parameters such that \(d\ne 0\), and let \({\mathcal {C}}=(c_0, c_1, \dots )\) be an infinite sequence of real parameters. Define \({\mathbb {F}}^{el}_{\lambda /\mu }(r_1, \dots , r_n\mid {\mathcal {C}}, \infty )\) as the limit
in the following regime:
From (5.9) we get the following expression for the one-variable function
where we have used the following relation
From (5.11) we see that \({\mathbb {F}}^{el}_{\lambda /\mu }(r\mid {\mathcal {C}},\infty )\) is a polynomial in r of degree \(|\lambda |-|\mu |\), with coefficients depending polynomially on d and \(c_i\). Moreover, the top homogeneous degree term is \(P_{\lambda /\mu }(r_1, \dots , r_n; 1,0)\), that is, it is a \(q=1, t=0\) specialization of the Macdonald polynomial. The branching rule (5.8) implies that \({{\mathbb {F}}}^{el}_{\lambda }(r_1, \dots , r_n\mid {\mathcal {C}}, \infty )\) is a symmetric polynomial in \(r_1, \dots , r_n\) of degree \(|\lambda |\) with coefficients in \({\mathbb {k}}^{el}={\mathbb {Q}}(d,{\mathcal {C}})\), and the top homogeneous degree of \({{\mathbb {F}}}^{el}_{\lambda }(r_1, \dots , r_n\mid {\mathcal {C}}, \infty )\) is \(P_\lambda (r_1, \dots , r_n; 1,0)\). From [27, VI.4] it is known that \(P_\lambda (r_1, \dots , r_n; 1,0)=e_{\lambda '}(r_1, \dots , r_n)\) is the elementary symmetric polynomial, where \(\lambda '\) is the partition conjugate to \(\lambda \), and the elementary symmetric polynomials are defined by
We have the following analogues of the vanishing and characterization properties, obtained as limits of Propositions 5.13 and 5.14. For a partition \(\mu \in {\mathbb {Y}}^n\) set
Proposition 5.16
Let \(\lambda ,\mu \) be partitions of length at most n. Then we have
Moreover, when \(\lambda =\mu \), we have
\(\square \)
Proof
Note that in the regime (5.10)
so we obtain the claim as the limit of Proposition 5.13. \(\square \)
Proposition 5.17
The function \({\mathbb {F}}^{el}_\lambda (r_1, \dots , r_n\mid {\mathcal {C}}, \infty )\) is the unique function satisfying
-
(1)
\({\mathbb {F}}^{el}_\lambda (r_1, \dots , r_n\mid {\mathcal {C}}, \infty )\in {\mathbb {k}}[r_1, \dots , r_n]^{S_n}_{\le |\lambda |}\),
-
(2)
\({\mathbb {F}}^{el}_\lambda ({\textbf{r}}^n_{{\mathcal {C}}}(\mu )\mid {\mathcal {C}},\infty )=0\) for any partition \(\mu \) such that \(|\mu |\le |\lambda |\) and \(\mu \ne \lambda \),
-
(3)
\({\mathbb {F}}^{el}_{\lambda }({\textbf{r}}_{{\mathcal {C}}}^{n}(\lambda )\mid {\mathcal {C}},\infty )=(-1)^{|\lambda |}(-d)^{\lambda _1}\displaystyle \prod _{i,j= 1}^n\prod _{k=1}^{\lambda _{i+j}-\lambda _{i+j+1}}(c_{j+i}-c_i+d(\lambda _{i+1}-\lambda _i+k-1))\);
and it is also the unique function satisfying
-
(1’)
\({\mathbb {F}}^{el}_\lambda (r_1, \dots , r_n\mid {\mathcal {C}},\infty )\in {\mathbb {k}}[r_1, \dots , r_n]^{S_n}_{\le |\lambda |}\), identically to (1),
-
(2’)
\({\mathbb {F}}^{el}_\lambda ({\textbf{r}}^n_{{\mathcal {C}}}(\mu )\mid {\mathcal {C}},\infty )=0\) for any partition \(\mu \) such that \(|\mu |< |\lambda |\),
-
(3’)
The top degree homogeneous component of \({\mathbb {F}}^{el}_{\lambda }(r_1, \dots , r_n\mid {\mathcal {C}},\infty )\) has degree \(|\lambda |\) and is equal to \(e_{\lambda '}(r_1, \dots , r_n)\).
Proof
By the discussion above and Proposition 5.16\({\mathbb {F}}^{el}_\lambda (r_1, \dots , r_n\mid {\mathcal {C}}, \infty )\) satisfies (1)–(3) and (1’)–(3’). The uniqueness follows in the same way as in Theorem 5.14, by applying Lemma 5.15 with \(\mho (\mu )={\textbf{r}}^n_{{\mathcal {C}}}(\mu )\). \(\square \)
In view of Proposition 5.17, we call \({\mathbb {F}}^{el}_\lambda (r_1, \dots , r_n\mid {\mathcal {C}}, \infty )\) interpolation elementary polynomials.
Remark 5.18
For the later use we note that instead of considering the fields \({\mathbb {k}}={\mathbb {Q}}(q,{\mathcal {A}}),{\mathbb {k}}^{cl}={\mathbb {Q}}(d,{\mathcal {C}})\) we can consider \(q, a_i, d, c_i\) above as elements of an arbitrary field \({\mathbb {k}}\). The induced functions \({{\widetilde{{\mathbb {F}}}}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}}, \infty )\in {\mathbb {k}}[x_1, \dots , x_n]^{S_n}\) are well-defined as long as q is not a root of unity, and our proofs of the vanishing property and the characterization for \({{\widetilde{{\mathbb {F}}}}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}}, \infty )\) hold as long as \({{\widetilde{{\mathbb {F}}}}}_\lambda ({\textbf{x}}^n_{{\mathcal {A}}}(\lambda )\mid {\mathcal {A}}, \infty )\ne 0\), which is equivalent to \(a_i/a_j\notin q^{{\mathbb {Z}}_{\ge 0}}\) for any i, j such that \(i\ne j\). Similarly, the functions \({\mathbb {F}}^{el}_\lambda (r_1, \dots , r_n\mid {\mathcal {C}}, \infty )\in {\mathbb {k}}[r_1, \dots , r_n]^{S_n}\) are well-defined as long as the characteristic of \({\mathbb {k}}\) is 0, while for the characterization property we also need \(c_i-c_j\notin d{\mathbb {Z}}\) for any \(i\ne j\).
6 Classification of interpolation symmetric polynomials
Our interest in the vanishing property from Proposition 5.13 and the characterization of Proposition 5.14 comes from the fact that the symmetric functions with similar properties were actively studied earlier. More precisely, there are roughly three classes of known symmetric polynomials with similar vanishing and characterization properties: factorial monomial polynomials, factorial Schur polynomials and interpolation Macdonald functions. In [34] these three classes and their variations were distinguished as the only solutions to a certain interpolation problem. In this section we describe how our new functions fit into this classification, expanding it.
6.1 Interpolation problem
Below we state the general interpolation problem from [34]. Let \({\mathbb {k}}\) be a field; for simplicity we assume that \({\textrm{char}}\, {\mathbb {k}}=0\). As before, we use \({\mathbb {Y}}^n\) to denote the set of partitions of length at most n and \({\mathbb {k}}[x_1, \cdots , x_n]^{S_n}_{\le m}\) to denote the space of symmetric polynomials in n variables, of degree \(\le m\) and with coefficients in \({\mathbb {k}}\).
An n-grid is a map \(\mho :{\mathbb {Y}}^n\rightarrow {\mathbb {k}}^{n}\). An n-grid \(\mho \) is non-degenerate if for every \(\lambda \in {\mathbb {Y}}^n\) there exists a symmetric polynomial \(F_\lambda (x_1, \dots , x_n\mid \mho )\) satisfying:
-
1.
\(F_{\lambda }(x_1, \dots , x_n\mid \mho )\in {\mathbb {k}}[x_1, \dots , x_n]^{S_n}_{\le |\lambda |}\);
-
2.
\(F_{\lambda }(\mho (\mu )\mid \mho )=0\) for every \(\mu \in {\mathbb {Y}}^n\) such that \(|\mu |\le |\lambda |, \lambda \ne \mu \);
-
3.
\(F_{\lambda }(\mho (\lambda )\mid \mho )\ne 0\).
An n-grid \(\mho \) is called perfect if it is non-degenerate and the polynomials \(F_\lambda \) additionally satisfy the following vanishing property:
The main result of [34] is the following classification of all perfect grids of a certain form.
Theorem 6.1
([34]) Assume that \(\mho \) is an n-grid of the form
Then \(\mho \) is perfect if and only if one of the following cases holds:
-
(\(\textbf{E}_{\textbf{1}}\)) \(f_i(j)=c_j\) where \(c_0,c_1, \dots \) are pairwise distinct elements of \({\mathbb {k}}\);
-
(\(\textbf{E}_\textbf{2}\)) \(f_i(j)=c_{j-i}\) where \(\dots , c_{-1},c_0,c_1,\dots \) are pairwise distinct elements of \({\mathbb {k}}\);
-
(\(\textbf{I}\)) \(f_i(j)=a+bq^jt^i+cq^{-j}t^i\) where a, b, c, q, t are elements of a field extension of \({\mathbb {k}}\);
-
(\(\textbf{II}\)) \(f_i(j)=\alpha +\beta j+\beta ' i+\gamma (\beta j+\beta ' i)^2\), where \(\alpha ,\beta ,\beta ',\gamma \in {\mathbb {k}}\);
-
(\(\textbf{III}\)) \(f_i(j)=\alpha +\epsilon ^j{\epsilon '}^i(\alpha '+\beta j+\beta 'i)\) where \(\epsilon ,\epsilon '\in \{\pm 1\}\) and \(\alpha ,\alpha ',\beta ,\beta '\in {\mathbb {k}}\);
-
(\(\textbf{IV}\)) This case only exists when \(n=2\), then \(f_1(j)=\alpha +\beta q^j, f_2(j)=\alpha +\beta ' q^{-j}\) where \(\alpha ,\beta ,\beta ',q\in {\mathbb {k}}\).
All cases above should additionally satisfy \(f_i(j)\ne f_{i'}(j')\) for all integers \(i\le i', j>j'\).
The polynomials \(F_\lambda (x_1, \dots , x_n\mid \mho )\) corresponding to the cases \(\textbf{E}_\textbf{1}, \textbf{E}_\textbf{2},\textbf{I}\) above are respectively factorial monomial polynomials, factorial Schur polynomials and interpolation Macdonald functions. The polynomial \(F_\lambda \) has degree \(|\lambda |\), and in these three cases the top-degree homogeneous component of \(F_\lambda \) is respectively a monomial, Schur or Macdonald symmetric polynomial. Moreover, the functions \(F_\lambda \) for case \(\textbf{II}\) above include interpolation functions related to Jack polynomials, see [26, 37].
Propositions 5.13–5.17 about vanishing and interpolation properties of the functions \({\widetilde{{\mathbb {F}}}}_\lambda (x_1, \dots , x_n\mid {\mathcal {A}},\infty )\) and \({\mathbb {F}}^{el}_{\lambda }(r_1, \dots , r_n\mid {\mathcal {C}},\infty )\) indicate that there is another class of solutions to this interpolation problem, where n-grids have an alternative form \(\mho (\mu )=(f_1(\mu _1-\mu _2),\dots , f_n(\mu _n))\), cf. the expressions for \({\textbf{x}}^n_{{\mathcal {A}}}(\mu )\) and \({\textbf{r}}^n_{{\mathcal {C}}}(\mu )\) from (5.4) to (5.12). In this section we show that the function \({\widetilde{{\mathbb {F}}}}, {{\mathbb {F}}}^{el}\) actually lead to all perfect grids of this alternative form, at least when \(n\ge 3\).
Theorem 6.2
Assume that \(n\ge 3\) and \(\mho \) is an n-grid of the form
Then \(\mho \) is perfect if and only if the functions \(f_i\) have one of the following two forms:
-
1.
\(f_i(k)=c+a_i q^k\) for constants \(c, q, a_1, \dots , a_n\in {\mathbb {k}}\) such that q is not a root of unity, \(q\ne 0\) and \(a_i/a_j\ne q^k\) for any \(i\ne j, k\in {\mathbb {Z}}\). In this case the functions \(F_\lambda (x_1, \dots , x_n\mid \mho )\) are proportional to \({{\widetilde{{\mathbb {F}}}}}_\lambda (x_1-c, x_2-c, \dots , x_n-c\mid {\mathcal {A}},\infty )\), where \(a_i\) are identified with elements of \({\mathcal {A}}\).
-
2.
\(f_i(k)=c_i+kd\) for constants \(d, c_1, c_2, \dots , c_n\in {\mathbb {k}}\) such that \(d\ne 0\) and \(c_i-c_j\ne kd\) for any \(i\ne j, k\in {\mathbb {Z}}\). In this case the functions \(F_\lambda (x_1, \dots , x_n\mid \mho )\) are proportional to \({\mathbb {F}}^{el}_\lambda (x_1,\dots , x_n\mid {\mathcal {C}},\infty )\), where \(c_i\) are identified with elements of \({\mathcal {C}}\).
The remainder of this section is devoted to the proof of Theorem 6.2.
Remark 6.3
Following the existing terminology, the characterization property from Proposition 5.14 allows us to call the functions \({{\widetilde{{\mathbb {F}}}}}\) interpolation q-Whittaker polynomials. Note that, while setting \(t=0\) reduces Macdonald polynomials to q-Whittaker polynomials, setting \(t=0\) in interpolation Macdonald polynomials does not result in any interpolation polynomials, because the characterization property for interpolation Macdonald polynomials does not survive in any form after setting \(t=0\).
Remark 6.4
Note that Theorem 6.2 does not cover perfect 2-grids of the form \(\mho (\mu )=(f_1(\mu _1-\mu _2), f_2(\mu _2))\). While both types of grids listed in Theorem 6.2 are well-defined and perfect when \(n=2\), numerical simulations suggest that there exist more general perfect grids when \(n=2\). However, our proof of Theorem 6.2 does not cover the \(n=2\) case.
6.2 General properties of n-grids
We start with adapting some arguments from [34] to our setting. From now on we always assume that n-grids \(\mho \) have the form
where \(\mho (1;\cdot ), \dots , \mho (n;\cdot )\) are functions \({\mathbb {Z}}_{\ge 0}\rightarrow {\mathbb {k}}\).
For later convenience, we below restate Lemma 5.15 in terms of non-degenerate grids:
Lemma 6.5
Let \(\mho \) be a non-degenerate n-grid.
-
1.
The functions \(F_{\lambda }(x_1, \dots , x_n\mid \mho )\) are uniquely determined up to a scalar;
-
2.
Let \(m\in {\mathbb {Z}}_{\ge 0}\). If \(f\in {\mathbb {k}}[x_1, \dots , x_n]^{S_n}_{\le m}\) and \(f(\mho (\mu ))=0\) for all \(\mu \in {\mathbb {Y}}^n\) such that \(|\mu |\le m\), then \(f=0\).
-
3.
For each \(m\in {\mathbb {Z}}_{\ge 0}\) the functions \(F_{\lambda }(x_1, \dots , x_n\mid \mho )\) with \(|\lambda |\le m\) form a basis of \({\mathbb {k}}[x_1, \dots , x_n]^{S_n}_{\le m}\);
-
4.
The degree of \(F_{\lambda }(x_1, \dots , x_n\mid \mho )\) is \(|\lambda |\).
\(\square \)
In particular, given a non-degenerate n-grid \(\mho \) the functions \(F_\lambda (x_1, \dots , x_n\mid \mho )\) are well-defined up to scalars. We will fix a convenient normalization later.
The following operations on grids will be useful. Let \(\mho \) be an n-grid.
-
For \(m\in {\mathbb {Z}}_{\ge 1}\) such that \(m\le n\) define an m-grid \(\mho _m\) by
$$\begin{aligned} \mho _m(\lambda )=(\mho (1; \lambda _1-\lambda _2), \mho (2; \lambda _2-\lambda _3), \dots , \mho (m;\lambda _m)), \qquad \lambda \in {\mathbb {Y}}^m. \end{aligned}$$That is, \(\mho _m(i; j)=\mho (i;j)\) for \(i=1, \dots , m\).
-
For \(l\in {\mathbb {Z}}_{\ge 0}\) such that \(l<n\) define an \((n-l)\)-grid by
That is, for \(i=1, \dots , n-l\).
-
For \(k\in {\mathbb {Z}}\) define an n-grid \(\mho ^k\) by
$$\begin{aligned} \mho ^k(\lambda )=\mho (\lambda +k^n)=(\mho (1; \lambda _1-\lambda _2), \dots , \mho (n-1;\lambda _{n-1}-\lambda _n), \mho (n;\lambda _n+k)), \end{aligned}$$where \(\lambda +k^n\) denotes the partition with parts \(\lambda _i+k\). In other words, \(\mho ^k(i; j)=\mho (i;j)\) for \(i=1, \dots , n-1\) and \(\mho ^k(n,j)=\mho (n;j+k)\).
Our first goal is to show that the three operations above preserve perfect grids. We start with \(\mho _m\).
Proposition 6.6
Assume that \(\mho \) is a non-degenerate n-grid. Then for any \(m\le n\) the m-grid \(\mho _m\) is non-degenerate with
for each \(\lambda \in {\mathbb {Y}}^{m}\). Moreover, if \(\mho \) is perfect then \(\mho _m\) is perfect as well.
Proof
For the first statement it is enough to show that the functions
satisfy the defining properties of \(F_\lambda (x_1, \dots , x_m\mid \mho _m)\), which readily follows from noticing that for \(\mu \in {\mathbb {Y}}^m\)
The last statement follows immediately from the vanishing property for \(\mho \) and the identity above. \(\square \)
Proposition 6.7
For any non-degenerate n-grid \(\mho \) the following statements hold:
-
1.
\(\mho (i;j)\ne \mho (n;0)\) for any integer pair \((i,j)\ne (n,0)\);
-
2.
\(\mho ^1\) is non-degenerate and \(F_\lambda \) can be chosen so that
$$\begin{aligned} F_\lambda (x_1, \dots , x_n\mid \mho ^1)\prod _{i=1}^n(x_i-\mho (n;0))=F_{\lambda +1^n}(x_1, \dots , x_n\mid \mho ); \end{aligned}$$(6.3) -
3.
If \(\mho \) is perfect, then \(\mho ^1\) is perfect.
Proof
Let \(\lambda \in {\mathbb {Y}}^n\) and consider
Note that \(f_\lambda (x_1, \dots , x_{n-1})\) has degree at most \(|\lambda |+n\), and for any \(\mu \in {\mathbb {Y}}^{n-1}\) such that \(|\mu |\le |\lambda |+n\) we have
since \(\mu \ne \lambda +1^n\). Hence, by Lemma 6.5 we have \(f_\lambda =0\). In other words, , so \(x_n-\mho (n;0)\) divides \(F_{\lambda +1^n}(x_1, \dots , x_n\mid \mho )\). By symmetry, \(x_i-\mho (n;0)\) for \(i=1, \dots , n\) divide \(F_{\lambda +1^n}(x_1, \dots , x_n\mid \mho )\) and so
for some \(g_{\lambda }(x_1, \dots , x_n)\in {\mathbb {k}}[x_1, \dots ,x_n]^{S_n}_{\le |\lambda |}\).
To prove (1) take \(j\in {\mathbb {Z}}_{\ge 0}\) and set \(\lambda =(nj, (n-1)j, \dots , 2j, j)\). Recall that \(F_{\lambda +1^n}(\mho (\lambda +1^n)\mid \mho )\ne 0\), so from (6.4) we have
Hence \(\mho (i;j)\ne \mho (n;0)\) for any \(i<n\) and \(\mho (n;j+1)\ne \mho (n;0)\), which implies (1) since \(j\in {\mathbb {Z}}_{\ge 0}\) was arbitrary.
To prove (2) note that from (6.4) we have for any \(\mu \in {\mathbb {Y}}^n\)
By (1) we know that \(\mho ^1(i;j)-\mho (n;0)=\mho (i;j+\delta _{i,n})-\mho (n;0)\ne 0\) for any \(j\ge 0\), hence (6.5) implies that \(F_{\lambda +1^n}(\mho (\mu +1^n)\mid \mho )=0\) if and only if \(g_{\lambda }(\mho ^1(\mu ))=0\). In particular, for any \(\mu \in {\mathbb {Y}}^n\) such that \(|\mu |\le |\lambda |\) and \(\mu \ne \lambda \) we get \(g_{\lambda }(\mho ^1(\mu ))=0\), while \(g_{\lambda }(\mho ^1(\lambda ))\ne 0\). Since the degree of \(F_{\lambda +1^n}\) is \(|\lambda |+n\), the degree of \(g_{\lambda }\) is not greater than \(|\lambda |\). So, \(g_{\lambda }\) satisfies the defining properties of \(F_{\lambda }(x_1, \dots , x_n\mid \mho ^1)\), which proves (2).
To prove part (3) assume that \(\mho \) is perfect. Then \(F_{\lambda +1^n}(\mho (\mu +1^n)\mid \mho )=0\) unless \(\lambda \subseteq \mu \), and hence \(g_{\lambda }(\mho ^1(\mu ))=F_{\lambda }(\mho ^1(\mu )\mid \mho ^1)=0\) unless \(\lambda \subseteq \mu \). So the vanishing property for \(\mho \) implies the vanishing property for \(\mho ^1\). \(\square \)
Using Proposition 6.7 we can inductively get the following results.
Corollary 6.8
If \(\mho \) is non-degenerate then \(\mho ^k\) is non-degenerate for any \(k\in {\mathbb {Z}}_{\ge 0}\). If \(\mho \) is perfect then \(\mho ^k\) is perfect for any \(k\in {\mathbb {Z}}_{\ge 0}\). \(\square \)
Corollary 6.9
If \(\mho \) is non-degenerate then \( \mho (i;j)\ne \mho (i',j') \) for all pairs \((i,j)\ne (i',j')\).
Proof
Let \(\mho \) be a non-degenerate n-grid and fix \((i,j)\ne (i',j')\). Without loss of generality we can assume that \(i\le i'\). By Proposition 6.6\(\mho _{i'}\) is also non-degenerate, hence replacing \(\mho \) by \(\mho _{i'}\) we can assume that \(i'=n\). If \(i<n\), then \(\mho ^{j'}\) is non-degenerate, and by Proposition 6.7 applied to \(\mho ^{j'}\)
If \(i=i'=n\), then we can additionally assume that \(j>j'\) and use Proposition 6.7 to get
\(\square \)
It turns out that the converse to Corollary 6.9 is also true, so we can classify all non-degenerate grids:
Proposition 6.10
An n-grid \(\mho \) is non-degenerate if and only if \(\mho (i;j)\ne \mho (i',j')\) for all \((i,j)\ne (i',j')\).
Proof
By Corollary 6.9 we only need to prove that \(\mho \) is non-degenerate if \(\mho (i;j)\ne \mho (i',j')\) for all \((i,j)\ne (i',j')\). It is enough to prove the following statement: for any \(\mho \) as in the statement, any \(m\in {\mathbb {Z}}_{\ge 0}\) and any function
there exists a polynomial \(f\in {\mathbb {k}}[x_1, \dots , x_n]^{S_n}_{\le m}\) such that \(f(\mho (\lambda ))=\phi (\lambda )\) for any \(\lambda \in {\mathbb {Y}}^n, |\lambda |\le m\).
We prove the latter claim by induction on n and m, reducing the claim for (n, m) to the claims for \((n',m')\) with either \(n'<n\) or \(m'<m\). Note that the cases when \(n=1\) or \(m=0\) are trivial.
For the inductive step recall that the monomial symmetric functions are defined by
where the sum is over all permutations \(\alpha \) of the n-tuple \((\lambda _1, \dots , \lambda _n)\). Let \(n\ge 2\) and fix n-grid \(\mho \) as in the claim. Define a degree-preserving map
by sending \(m_{\lambda }(x_1-\mho (n;0), \dots , x_{n-1}-\mho (n;0))\) to \(m_{\lambda }(x_1-\mho (n;0), \dots , x_{n}-\mho (n;0))\). Note that for any \(f\in {\mathbb {k}}[x_1, \dots , x_n]^{S_n}\) we have
Let \(\phi (\lambda )\) be an arbitrary function on the partitions \(\lambda \in {\mathbb {Y}}^n, |\lambda |\le m\). We will construct the required function f as
Consider the restriction of \(\phi \) to the partitions from \({\mathbb {Y}}^{n-1}\). By the induction hypothesis, there exists a function in \(n-1\) variables \(f_1\in {\mathbb {k}}[x_1, \dots , x_{n-1}]^{S_{n-1}}_{\le m}\) such that \(f_1(\mho _{n-1}(\lambda ))=\phi (\lambda )\) for all partitions \(\lambda \in {\mathbb {Y}}^{n-1}, |\lambda |\le m\). Since for all \(\lambda \in {\mathbb {Y}}^{n-1},|\lambda |\le m\), the substitution \((x_1,\dots , x_n)=\mho (\lambda )\) sets \(x_n=\mho (n;0)\), by (6.6) we have
If \(n<m\) then any partition \(\lambda \) such that \(|\lambda |\le m\) is in \({\mathbb {Y}}^{n-1}\) and we are done. If \(n\ge m\) consider the following function \(\phi _2\) on \(\{\lambda \in {\mathbb {Y}}^n: |\lambda |\le m-n\}\):
By induction in m and since \(\mho ^1\) also satisfies the assumptions on \(\mho \) from the statement, we can construct a function \(f_2\in {\mathbb {k}}[x_1, \dots , x_n]^{S_n}_{\le m-n}\) such that \(f_2(\mho (\lambda +1^n))=f_2(\mho ^1(\lambda ))=\phi _2(\lambda )\). The resulting function
satisfies \(f\in {\mathbb {k}}[x_1, \dots , x_n]^{S_n}_{\le m}\) and \(f(\mho (\lambda ))=\phi (\lambda )\). \(\square \)
We can use (6.2) and (6.3) to introduce a natural normalization for the functions \(F_{\lambda }\). For an n-grid \(\mho \) let \(\mho ^k_m\) denote the m-grid \((\mho _m)^k\).
Proposition 6.11
For a non-degenerate n-grid \(\mho \) there exists a unique choice of functions \(F_{\lambda }(x_1,\dots , x_m\mid \mho _m^k)\) for all \(m=1,\dots , n\) and \(k\in {\mathbb {Z}}_{\ge 0}\) satisfying
-
1.
\(F_{\varnothing }(x_1, \dots , x_m\mid \mho _m^k)=1\) for all m, k;
-
2.
\(F_{\lambda }\) are consistent with (6.2), that is, for any \(\lambda \in {\mathbb {Y}}^{m-1}\)
$$\begin{aligned} F_{\lambda }(x_1, \dots , x_{m-1}\mid \mho _{m-1})=F_{\lambda }(x_1, \dots , x_{m-1}, \mho (m;k)\mid \mho _{m}^k). \end{aligned}$$Note that \((\mho _m^k)_{m-1}=\mho _{m-1}\).
-
3.
\(F_\lambda \) are consistent with (6.3), that is, for any \(\lambda \in {\mathbb {Y}}^{m}\)
$$\begin{aligned} F_{\lambda +1^m}(x_1, \dots , x_{m}\mid \mho _{m}^k)=F_{\lambda }(x_1, \dots , x_{m}\mid \mho _{m}^{k+1})\prod _{i=1}^m(x_i-\mho (m;k)). \end{aligned}$$
This unique choice of functions \(F_{\lambda }(x_1,\dots , x_m\mid \mho _m^k)\) is determined by setting
for any non-degenerate grid \(\mho \).
Proof
We use induction on n. When \(n=1\) all functions \(F_{(\lambda _1)}(x_1\mid \mho ^k)\) satisfying the conditions (1) and (3) from the statement must be of the form
and the claim follows.
Now assume that \(\mho \) is a non-degenerate n-grid and we have proved the claim for \(\mho _{n-1}\). Note that the functions \(F_\lambda (x_1, \dots , x_m\mid \mho ^k_m)\) are uniquely determined by the numbers \(H_\lambda (\mho ^k_m):=F_\lambda (\mho ^k_m(\lambda )\mid \mho ^k_m)\), defined for \(\lambda \in {\mathbb {Y}}^m\). Moreover, if the choice of \(F_\lambda (x_1, \dots , x_m\mid \mho ^k_m)\) satisfies conditions (1)-(3) from the statement, then by the inductive assumption applied to the \((n-1)\)-grid \(\mho _{n-1}\) we have
for any \(k\in {\mathbb {Z}}_{\ge 0}\), \(m<n\) and \(\lambda \in {\mathbb {Y}}^m\). So we only need to prove that \(H_\lambda (\mho ^k)\) are uniquely determined and are given by (6.7).
Let \(\lambda \in {\mathbb {Y}}^n\). If \(l(\lambda )<n\), then by the consistency with (6.2) we have \(H_{\lambda }(\mho ^k)=H_\lambda (\mho _{n-1})\), and we are done. If \(l(\lambda )=n\), then we can write \(\lambda ={\overline{\lambda }}+a^n\) for some \({\overline{\lambda }}\in {\mathbb {Y}}^{n-1}\) and \(a=\lambda _n\). Then, using consistency with (6.3), we have
Since \({\overline{\lambda }}\in {\mathbb {Y}}^{n-1}\), we already know that \(H_{{\overline{\lambda }}}(\mho ^{k+a})\) is givn by (6.7), and one can readily check that the resulting expression for \(H_{\lambda }(\mho ^k)=H_{{{\overline{\lambda }}}+a^n}(\mho ^k)\) is also consistent with (6.7). \(\square \)
From now on we will assume that \(\{F_\lambda \}_\lambda \) are normalized as in Proposition 6.11.
Proposition 6.12
Let \(\mho \) be a perfect n-grid with \(n\ge 2\). Then is perfect and we have
where \({\widetilde{\lambda }}=(\lambda _2, \lambda _3,\dots )\).
Proof
As before, let \(m_\lambda (x_1, \dots , x_n)\) denote monomial symmetric function corresponding to \(\lambda \), and let \(\mu \preceq \lambda \) denote the lexicographical order on the partitions (that is \(\mu \preceq \lambda \) if and only if either \(\mu =\lambda \) or for some r we have \(\lambda _i=\mu _i\) for \(i<r\) and \(\mu _r<\lambda _r\)).
First we want to prove that if \(\mho \) is a non-degenerate n-grid and \(\lambda \in {\mathbb {Y}}^n\) is a partition such that for \(\mu \in {\mathbb {Y}}^n\)
then we have
We prove the latter statement by induction on n, with \(n=1\) case being trivial. Fix \(\mho ,\lambda \) as above and let \(d=\deg _{x_1} F_{\lambda }(x_1, \dots , x_n\mid \mho )\). If \(d<\lambda _1\) then we are trivially done with \(\alpha _{\lambda \lambda }=0\), so consider the case when \(d\ge \lambda _1\). Define \(g(x_2, \dots , x_n)\) by
Clearly, \(g\ne 0\). Since the total degree of \(F_{\lambda }\) is \(\lambda \), the total degree of g satisfies \(\deg g= |\lambda |-d\le |\lambda |-\lambda _1=|{\widetilde{\lambda }}|\), where \({\widetilde{\lambda }}=(\lambda _2, \lambda _3,\dots )\). Let \(\mu \in {\mathbb {Y}}^{n-1}\) be a partition such that \({\widetilde{\lambda }}\not \subset \mu \). Then the partition \(\mu (k)=(\mu _1+k, \mu _1, \mu _2, \dots )\) satisfies \(\lambda \not \subset \mu (k)\) for any \(k\in {\mathbb {Z}}_{\ge 0}\), so by assumption (6.8)
for any k. In other words, the polynomial vanishes at \(x_1=\mho (1;k)\), and since all these points are distinct by Corollary 6.9, this implies
as a polynomial in \(x_1\). In particular, for \(\mu \in {\mathbb {Y}}^{n-1}\) we get
Recall that the degree of g is at most \(|{\widetilde{\lambda }}|\). By Proposition 6.10 the \((n-1)\)-grid is non-degenerate, so Lemma 6.5 and (6.10) imply that for some \(c\in {\mathbb {k}}\). Note that \(g\ne 0\), hence \(c\ne 0\) and the degree of g is exactly \(|{\widetilde{\lambda }}|\), forcing \(d=\lambda _1\). Moreover, (6.10) implies that the assumption of our statement holds for the pair (), hence by induction
This implies (6.9):
Thus, we have proved that if \(\mho \) is perfect, then the transition matrix expressing \(F_\lambda (x_1, \dots , x_n\mid \mho )\) in terms of \(m_{\lambda }(x_1, \dots , x_n)\) is upper-triangular with respect to the lexicographical order. But both families of functions form bases of the ring of symmetric polynomials, hence the transition matrix must be invertible, and \(\alpha _{\lambda \lambda }\ne 0\) in (6.9). In particular, in the argument above \(d=\lambda _1\) always holds and the case \(d<\lambda _1\) never happens, so the construction of \(g(x_2, \dots , x_n)\) can be performed for any \(\lambda \):
for \(c_{\lambda }\ne 0\). Moreover, (6.10) implies that is perfect.
It only remains to show that \(c_\lambda =1\). Note that it is enough to show that for perfect \(\mho \) we always have \(\alpha _{\lambda \lambda }=1\) in (6.9). Recall that if \(\mho \) is perfect then all grids of the form \(\mho _m^k\) are also perfect, so we can define a renormalization \({{\widetilde{F}}}_{\lambda }(x_1, \dots , x_m\mid \mho _m^k)\) such that
and \({{\widetilde{F}}}_\lambda \) is proportional to \(F_\lambda \). Then the functions \({{\widetilde{F}}}_{\lambda }(x_1, \dots , x_m\mid \mho _m^k)\) satisfy the three conditions of Proposition 6.11, hence \({{\widetilde{F}}}_{\lambda }=F_\lambda \), finishing the proof. \(\square \)
6.3 Pieri rule and explicit expressions for some \(F_\lambda \)
Our next goal is to prove Theorem 6.2 when \(n=3\). From here on our arguments are completely different from [34], and we will need to compute polynomials \(F_{\lambda }\) for perfect 2-grids. To do it, we use the following analogue of the Pieri rule:
Lemma 6.13
Let \(\mho \) be a perfect 2-grid. Then for any \(k\in {\mathbb {Z}}_{\ge 1}\) we have
where
Proof
Fix \(k\ge 1\) and set
Note that \(f\in {\mathbb {k}}[x_1,x_2]^{S_2}_{\le k+1}\), and moreover \(f(\mho (\mu ))=0\) for all \(\mu \) such that \(|\mu |\le k\): for \(\mu \ne \lambda \) we have \(F_{(k)}(\mho (\mu )\mid \mho )=0\), while for \(\mu =\lambda \) we set \(x_1=\mho (1;k), x_2=\mho (2;0)\) and \(x_1+x_2-\mho (1;k)-\mho (2;0)\) vanishes. Hence the expression
with
vanishes for any \((x_1, x_2)=\mho (\mu )\) with \(|\mu |\le k+1\), so by Lemma 6.5 we have
Note that by Lemma 6.12 the top \(x_1\)-degree term on the sides of (6.11) are \(x_1^{k+1}\) and \(\alpha _kx_1^{k+1}\), hence \(\alpha _k=1\). By (6.7), we have
hence \(\beta _k=\kappa _k(\mho )\) from the statement. Finally, by Proposition 6.7 we have
finishing the proof. \(\square \)
Using Lemma 6.13 we can get explicit expressions for \(F_{(k)}(x_1,x_2\mid \mho )\) when \(k=1,2,3\). Let \(\mho \) be a perfect 2-grid. To make expressions below manageable we use the notation \([i;j]:=\mho (i;j)\). By our choice in Proposition 6.11, we clearly have \(F_{\varnothing }(x_1,x_2\mid \mho )=1\). For \(F_{(1)}(x_1,x_2\mid \mho )\) recall that this is a degree 1 symmetric polynomial vanishing at \((x_1,x_2)=\mho (\varnothing )\) and with top term \(x_1+x_2\) by Proposition (6.12). Hence
From now we can use Lemma 6.13. Direct computations give
For Lemma 6.16 below we also need an explicit expression for \(F_{(2)}(x_1, x_2, x_3\mid \mho )\) when \(\mho \) is a perfect 3-grid such that \(\mho (3;0)=0\). To get \(F_{(2)}(x_1, x_2,x_3\mid \mho )\) we can use the expression for \(F_{(2)}(x_1, x_2\mid \mho _2)\) above and follow the construction from Proposition 6.10 to get
6.4 \(n=3\) case
Now we can prove Theorem 6.2 for perfect 3-grids. The main idea is to use vanishings \(F_{\lambda }(\mho (\mu )\mid \mho )=0\) for various \(\lambda \not \subset \mu \) to obtain enough constraints on \(\mho (i;j)\) for the desired classification.
Lemma 6.14
Let \(\mho \) be a perfect n-grid. Then for any \(i,j\in {\mathbb {Z}}_{\ge 1}\) such that \(i\le n-1\) we have
Proof
Replacing \(\mho \) by the 2-grid , which is perfect by Propositions 6.6 and 6.12, we can assume that \(i=1\) and \(\mho \) is a perfect 2-grid. Moreover, replacing \(\mho \) by \(\mho ^{j-1}\), which is perfect by Corollary 6.8, we can assume that \(j=1\).
So, it is enough to prove that for a perfect 2-grid \(\mho \) we have
This follows from the vanishing \(F_{(3)}(\mho ((2,2))\mid \mho )=0\): by explicit computation using (6.13) we have
where
Since \(\mho \) is non-degenerate, \(\mho (i;j)\ne \mho (i';j')\) as long as \((i,j)\ne (i',j')\), hence \(F_{(3)}(\mho ((2,2))\mid \mho )=0\) implies \(G=0\) and the claim follows. \(\square \)
From now on, let \(\mho \) be a perfect 3-grid.
Lemma 6.15
We have
Proof
We first note that if a grid \(\mho \) is perfect then for any constant \(c\in k\) the grid \(\mho +c\), defined by \((\mho +c)(i;j)=\mho (i;j)+c\), is also perfect with
On the other hand, the desired identity (6.16) is equivalent to
The left-hand side above clearly stays intact when we change \(\mho \) to \(\mho +c\), so it is enough to prove the claim for the perfect 3-grid \(\mho -[3;0]\). In other words, it is enough to consider the case \([3;0]=0\).
Since \(\mho \) is perfect, we must have \(F_{(2)}(\mho ((1,1,1))\mid \mho )=0\). Using (6.14), we obtain by explicit computation
Since \(\mho \) is non-degenerate and \([3;0]=0\), we have \([3;1]\ne 0\). Hence \(F_{(2)}(\mho ((1,1,1))\mid \mho )=0\) implies
which is equivalent to the claim when \([3;0]=0\). \(\square \)
Lemma 6.16
We have
Proof
Similarly to the proof of Lemma 6.15, the claim for the grid \(\mho \) is equivalent to the claim for a grid \(\mho +c\) for arbitrary c. So, to simplify computations, we can assume that \([3;0]=0\) throughout the proof without loss of generality.
To prove the claim we consider two different expressions for [3; 2] in terms of [1; 0], [1; 1], [1; 2], [2; 0], [2; 1], and show that equating them implies the claim. For the first expression we use Lemma 6.15 for the grids \(\mho \) and \(\mho ^1\), obtaining
The other expression comes from Lemma 6.14 applied to \((i,j)=(1,1)\) and \((i,j)=(2,1)\), leading to
Subtracting the two expressions for [3; 2] from each other we get
which, after algebraic manipulations, gives
Since \([3;1]\ne [2;0]\), we get
Plugging the values for [3; 1], [2; 2] above results in
and since \([2;0]\ne [1;1]\) and \([2;1]\ne [1;0]\), we finally get
\(\square \)
The constraints proved above are almost sufficient to prove the classification for \(n=3\). Namely, we can now prove the following:
Lemma 6.17
One of the following cases holds for the perfect 3-grid \(\mho \):
-
1.
For some constants \(c,q,a_1, a_2, a_3\in {\mathbb {k}}\) we have
$$\begin{aligned}{} & {} {[}1;0]=c+a_1, \qquad [1;1]= c+a_1q,\qquad [1;2]=c+a_1q^2,\\{} & {} {[}2;k]=c+a_2 q^k, \quad [3;k]= c+a_3q^k,\qquad k\in {\mathbb {Z}}_{\ge 0}. \end{aligned}$$ -
2.
For some constants \(d,c_1,c_2,c_3\in {\mathbb {k}}\) we have
$$\begin{aligned}{} & {} {[}1;0]=c_1, \qquad [1;1]= c_1+d,\qquad [1;2]=c_1+2d,\\{} & {} {[}2;k]=c_2+kd, \quad [3;k]= c_3+kd,\qquad k\in {\mathbb {Z}}_{\ge 0}. \end{aligned}$$
Note that in order to satisfy the non-degeneracy condition from Corollary 6.9, we have to assume that \(q\ne 0\) and q is not a root of unity in the first case.
Proof
Set
We have 2 cases.
Case 1: \(q\ne 1\). Set
so we have
Plugging these values into the constraint from Lemma 6.16 we get
Note that since \([1;0]\ne [2;0]\), we have \(a_1\ne a_2\), and hence the equation above implies \([1;2]=c+a_1q^2\).
To prove that \([2;k]=c+a_2q^k\) we use induction on k, with cases \(k=0,1\) already covered. For the inductive step, assume that \([2;k-1]=c+a_2q^{k-1}, [2;k]=c+a_2q^k\), and apply Lemma 6.14 with \((i,j)=(1,k)\). This gives
Since \([2;k-1]\ne [1;1]\) implies \(a_2q^{k-1}-a_1q\ne 0\), we get \([2;k+1]=c+a_2q^{k+1}\).
Similarly, we can use induction to prove that \([3;k]=c+a_3q^k\). We already know the case \(k=0\). Applying Lemma 6.15 to the perfect grid \(\mho ^k\), we get
which implies the inductive step and finishes the proof of this case.
Case 2: \(q=1\) . The argument is similar to the previous case. Set
where the last equality follows from \(q=1\). Then we have
Plugging these values into Lemma 6.16 we get
Since \([1;0]\ne [2;0]\), we have \(c_1\ne c_2\) and hence the equation above implies \([1;2]=c_1+2d\).
To prove that \([2;k]=c_2+kd\) and \([3;k]=c_3+kd\) we can use Lemmas 6.14 and 6.15 in exactly the same way as in Case 1. \(\square \)
At this point, in order to reach the classification from Theorem 6.2 for \(n=3\), we only need to determine the values of [1; k] for \(k\ge 3\). To do so we focus on the 2-grid \(\mho _2\) and, considering each case separately, compute the functions \(F_{(k)}(x_1,x_2\mid \mho _2)\). The result is summarized below.
Lemma 6.18
Let \({\widetilde{\mho }}\) be a perfect 2-grid.
-
1.
Assume that for some \(q,c,a_1,a_2\in {\mathbb {k}}\) we have
$$\begin{aligned} {\widetilde{\mho }}(1;0)=c+a_1,\qquad {\widetilde{\mho }}(1;1)=c+a_1q,\qquad {\widetilde{\mho }}(2;k)=c+a_2q^k,\quad k\in {\mathbb {Z}}_{\ge 0}. \end{aligned}$$Then we have \({\widetilde{\mho }}(1;k)=c+a_1q^k\) for all k and
$$\begin{aligned} F_{(k)}(x_1+c, x_2+c\mid {\widetilde{\mho }})=\sum _{i=0}^k x_1^ix_2^{k-i}\frac{(a_1/x_1;q)_i(a_2/x_2;q)_{k-i}(q;q)_k}{(q;q)_i(q;q)_{k-i}},\qquad \end{aligned}$$(6.17)where we have shifted the variables \(x_1,x_2\) by c.
-
2.
Assume that for some \(d,c_1,c_2\in {\mathbb {k}}\) we have
$$\begin{aligned} {\widetilde{\mho }}(1;0)=c_1,\qquad {\widetilde{\mho }}(1;1)=c_1+d,\qquad {\widetilde{\mho }}(2;k)=c_2+kd,\quad k\in {\mathbb {Z}}_{\ge 0}. \end{aligned}$$Then we have \({\widetilde{\mho }}(1;k)=c_1+kd\) for all k and
$$\begin{aligned} F_{(k)}(x_1, x_2\mid {\widetilde{\mho }})=\prod _{i=0}^{k-1}(x_1+x_2-c_1-c_2-i\, d). \end{aligned}$$(6.18)
Proof
For both parts we simultaneously prove the expressions for \({\widetilde{\mho }}(1;k)\) and \(F_{(k+1)}(x_1,x_2\mid {\widetilde{\mho }})\) by induction on k.
Part 1. Note that without loss of generality we can replace \({\widetilde{\mho }}\) by \({\widetilde{\mho }}-c\) and assume that \(c=0\). When \(k=0\) we have \({\widetilde{\mho }}(1;0)=a_1\) and
When \(k=1\) we have \({\widetilde{\mho }}(1;1)=a_1q\) and from (6.12) one can obtain
Now assume that \(k\ge 2\) and we have proved the claim for \({\widetilde{\mho }}(1;i)\) and \(F_{(i+1)}(x_1, x_2\mid {\widetilde{\mho }})\) with \(i<k\). Applying the Pieri rule from Lemma 6.13, we have
where
Note that when for any \(j=0, \dots , k\) we plug \(x_1=a_1q^{k-j}\) and \(x_2=a_2q^j\) in the right-hand side of (6.17), there is only one non-zero term in the sum and this term corresponds to \(i=j\). So, we have
Plugging this expression into (6.19), we get
and, consequently,
Since \({\widetilde{\mho }}\) is perfect, we must have
Hence (6.21) implies
Applying (6.20) to \(F_{(k)}\) and \(F_{(k-1)}\), noting for the latter that \(\mho ^1\) also satisfies the assumption of the first part with \(a_2\) multiplied by q, we get after elementary manipulations
This is a linear equation in \({\widetilde{\mho }}(1;k)\) of the form
for some \(\beta \) independent of \({\widetilde{\mho }}(1;k)\). Since \({\widetilde{\mho }}\) is non-degenerate, \(q\ne 1\) and \(a_1q^{k-2}={\widetilde{\mho }}(1;k-2)\ne {\widetilde{\mho }}(2;1)=a_2q\), hence the system is non-degenerate and it can be readily checked that \({\widetilde{\mho }}(1;k)=a_1q^k\) is its unique solution. So we have proved \({\widetilde{\mho }}(1;k)=a_1q^k\).
Returning to (6.21), plugging \({\widetilde{\mho }}(1;k)=a_1q^k\) we now have
Plugging expressions for \(F_{(k)}\) and \(F_{(k-1)}\) and doing elementary manipulations, we get
which can be rewritten as
finishing the proof of this part.
Part 2. We again proceed by induction on k. When \(k=0\) we have \({\widetilde{\mho }}(1;0)=a_1\) and
Similarly, when \(k=1\) we have \({\widetilde{\mho }}(1;0)=a_1+d\) and (6.12) implies
Now assume that \(k\ge 2\) and we have proved the claim for \({\widetilde{\mho }}(1;i)\) and \(F_{(i+1)}(x_1, x_2\mid {\widetilde{\mho }})\) with \(i< k\). Applying Lemma 6.13 again, we have
where
Plugging the expression for \(F_{(k)}(x_1,x_2\mid {\widetilde{\mho }})\), we get
Since \(F_{(k+1)}({\widetilde{\mho }}((k,2))\mid {\widetilde{\mho }})=F_{(k+1)}(c_1+(k-2)d,c_2+2d\mid {\widetilde{\mho }})=0\), we get
Using expressions for \(F_{(k)}\) and \(F_{(k-1)}\), we get after elementary manipulations
which is equivalent to
Note that \(c_1+(k-3)t-c_2={\widetilde{\mho }}(1;k-1)-{\widetilde{\mho }}(2;4)\ne 0\) since \({\widetilde{\mho }}\) is non-degenerate. Hence the equation above implies \({\widetilde{\mho }}(1;k)=c_1+kd\). Plugging this value for \({\widetilde{\mho }}(1;k)\) into (6.22) we get
which implies the claim. \(\square \)
Let us summarize Lemmas 6.17 and 6.18:
Corollary 6.19
Let \(\mho \) be a perfect 3-grid. Then one of the following holds:
-
1.
For some constants \(c,q,a_1, a_2, a_3\in {\mathbb {k}}\) we have
$$\begin{aligned} \mho (1;k)=c+a_1q^k, \quad \mho (2;k)=c+a_2 q^k, \quad \mho (3;k)= c+a_3q^k,\qquad k\in {\mathbb {Z}}_{\ge 0}. \end{aligned}$$ -
2.
For some constants \(d,c_1,c_2,c_3\in {\mathbb {k}}\) we have
$$\begin{aligned} \mho (1;k)=c_1+kd,\quad \mho (2;k)=c_2+kd, \quad \mho (3;k)= c_3+kd,\qquad k\in {\mathbb {Z}}_{\ge 0}. \end{aligned}$$
\(\square \)
6.5 Proof of Theorem 6.2
We first prove that two types of n-grids described in Theorem 6.2 are indeed perfect. Propositions 5.13 and 5.14 combined with Remark 5.18 imply that the following n-grid \(\mho \) is perfect:
for \(q,a_1, \dots , a_n\in {\mathbb {k}}\) such that \(a_i,q\ne 0\), q is not a root of unity, and \(a_i/a_j\ne q^k\) for any \(i\ne j\) and \(k\in {\mathbb {Z}}\). Hence the grid \(\mho +c\) is also perfect for any \(c\in {\mathbb {k}}\), covering the first type from Theorem 6.2. Similarly, Propositions 5.16, 5.17 and Remark 5.18 imply that the grids of the form \(\mho (i;j)=c_i+jd\) are perfect for \(c_1, \dots , c_n, d\in {\mathbb {k}}\) such \(d\ne 0\) and \(c_i-c_j\ne kd\) for any \(i\ne j\) and \(k\in {\mathbb {Z}}\).
Hence, it is enough to prove that all perfect n-grids for \(n\ge 3\) are described by Theorem 6.2. We do it by induction on n, where the base case \(n=3\) follows from Corollary 6.19. Note that the restrictions \(a_i,q\ne 0\), \(a_i/a_j\ne q^k\) and \(c_i-c_j\ne kd\) follow automatically from Corollary 6.9, so its enough to just prove that \(\mho \) has the form \(\mho (i;j)=c+a_iq^j\) or \(\mho (i;j)=c_i+jd\).
For the inductive step assume that we have proved Theorem 6.2 for all \((n-1)\)-grids \(\mho \). Let \(\mho \) be a perfect n-grid. Then the \((n-1)\)-grid \(\mho _{n-1}\) is also perfect, and by inductive assumption we either have \(\mho (i;j)=c+a_iq^j\) or \(\mho (i;j)=c_i+jd\) for \(i\le n-1\). At the same time, the 3-grid is also perfect, and so we either have
for some \({\widetilde{c}},{\widetilde{q}}, {\widetilde{a}}_{n-2}, {\widetilde{a}}_{n-1}, {\widetilde{a}}_{n}\in {\mathbb {k}}\), or
for some \({\widetilde{d}}, {\widetilde{c}}_{n-2}, {\widetilde{c}}_{n-1}, {\widetilde{c}}_{n}\in {\mathbb {k}}\). But note that for \(c,a, q,c',d\in {\mathbb {k}}\) we can simultaneously have \(\mho (n-1; k)=c+a q^k=c'+kd\) for all \(k\in {\mathbb {Z}}_{\ge 0}\) only if \(d=0\) and all numbers \(c+a q^k, c'+kd\) are equal. However, this is impossible since \(\mho \) is non-degenerate and the numbers \(\mho (n-1;k)\) are distinct. Hence, both grids \(\mho _{n-1}\) and are of the same type, and moreover \(q={\widetilde{q}}\) or \(d={\widetilde{d}}\). Thus, if \(\mho (i;j)=c+a_iq^j\) for \(i<n\) then \(\mho (n;j)=c+a_nq^j\) as well, and similarly for \(\mho (i;j)=c_i+jd\). This finishes the proof of Theorem 6.2. \(\square \)
Notes
It is worth noting that for the supersymmetric functions there is also the class of interpolation Q-Schur functions [19].
See [6] for a vertex model description of factorial Schur functions, leading to the vanishing property.
The prime in \(U_q'({{\widehat{\mathfrak {sl}}}}_{n+1})\) reflects that our definition does not include the additional generator d, which is sometimes used in the context of affine algebras.
As a warning, l-weights are not as nicely behaved as the usual weights. For example, if v is an l-weight vector \(x^{\pm }_i.v\) might fail to be an l-weight vector.
Such a \(U_q'({\widehat{\mathfrak {sl}}}_{n+1})\)-module is called a Kirillov–Reshetikhin module corresponding to the first fundamental weight.
References
Aggarwal, A., Borodin, A., Wheeler, M.: Colored Fermionic Vertex Models and Symmetric Functions (2021). ar**v:2101.01605
Aggarwal, A.: Dynamical stochastic higher spin vertex models. Sel. Math. 24, 2659–2735 (2018). ar**v:1704.02499
Brubaker, B., Bump, D., Friedberg, S.: Schur polynomials and the Yang–Baxter equation. Commun. Math. Phys. 308, 281–301 (2011). ar**v:0912.0911
Borodin, A., Korotkikh, S.: Inhomogeneous spin \(q\)-Whittaker polynomials (2021). ar**v:2104.01415
Bosnjak, S., Mangazeev, V.: Construction of \(R\)-matrices for symmetric tensor representations related to \(U_q(\widehat{sl_n})\). J. Phys. A Math. Theor. 49 (2016). ar**v:1607.07968
Bump, D., McNamara, P., Nakasuji, M.: Factorial Schur functions and the Yang–Baxter equation. Comment. Math. Univ. St. Pauli 63, 23–45 (2014). ar**v:1108.3087
Borodin, A.: On a family of symmetric rational functions. Adv. Math. 306, 973–1018 (2017). ar**v:1410.0976
Borodin, A., Petrov, L.: Higher spin six vertex model and symmetric rational functions. Sel. Math. New Ser. 24, 751–874 (2018). ar**v:1601.05770
Buciumas, V., Scrimshaw, T., Weber, K.: Colored five-vertex models and Lascoux polynomials and atoms. J. Lond. Math. Soc. 102(3), 1047–1066 (2020). ar**v:1908.07364
Borodin, A., Wheeler, M.: Spin \(q\)-Whittaker polynomials. Adv. Math. (2020). https://doi.org/10.1016/j.aim.2020.107449. ar**v:1701.06292
Borodin, A., Wheeler, M.: Coloured stochastic vertex models and their spectral theory. ar**v:1808.01866
Chari, V.: Braid group actions and tensor products. Int. Math. Res. Not. 7, 357–382 (2002). ar**v:math/0106241
Chari, V., Pressley, A.: A Guide to Quantum Groups. CUP, Cambridge (1994)
Chari, V., Pressley, A.: Minimal affinizations of representations of quantum groups: the simply-laced case. Lett. Math. Phys. 35, 99–114 (1995). ar**v:hep-th/9410036
Cuenca, C.: Interpolation macdonald operators at infinity. Adv. Appl. Math. 101, 15–59 (2018). ar**v:1712.08014
Date, E., Jimbo, M., Kuniba, A., Miwa, T., Okado, M.: Exactly solvable sos models II: proof of the star-triangle relation and combinatorial identities. Adv. Stud. Pure Math. 16, 17–122 (1988)
Drinfeld, V.G.: A New realization of Yangians and quantized affine algebras. Sov. Math. Dokl. 36, 212–216 (1988)
Howe, R., Umeda, T.: The Capelli identity, the double commutant theorem, and multiplicity-free actions. Math. Ann. 290, 569–619 (1991)
Ivanov, V.: Interpolation analogues of Schur Q-functions. J. Math. Sci. 131, 5495–5507 (2005). ar**v:math/0305419
Jimbo, M.: A \(q\)-analogue of \(U(gl_{N+1})\), Hecke algebra, and the Yang–Baxter equation. Lett. Math. Phys. 11(3), 247–252 (1986)
**g, N.: On Drinfeld realization of quantum affine algebras. Ohio State Univ. Math. Res. Inst. Publ. Gruyter Berlin 7, 195–206 (1998). ar**v:q-alg/9610035
Korff, C.: Cylindric versions of specialised Macdonald functions and a deformed Verlinde algebra. Commun. Math. Phys. 318(1), 173–246 (2013). ar**v:1110.6356
Koornwinder, T.: Okounkov’s \(BC\)-type interpolation Macdonald polynomials and their \(q=1\) limit. Sém. Lothar. Combin. B72a (2015). ar**v:1408.5993
Korotkikh, S.: Hidden diagonal integrability of \(q\)-Hahn vertex model and Beta polymer model (2021). ar**v:2105.05058
Kirillov, A.N., Reshetikhin, N.Y.: Representations of the algebra Uq(sl2), q-orthogonal polynomials, and invariants of links. In: Infinite Dimensional Lie Algebras and Groups, Advances in Mathematical Physics, vol. 7, pp. 285–339. World Scientific, Singapore (1988)
Knop, F., Sahi, S.: Difference equations and symmetric polynomials defined by their zeros. Int. Math. Res. Not. 10, 473–486 (1996). ar**v:q-alg/9610017
Macdonald, I.G.: Symmetric functions and Hall polynomials. In: Oxford Mathematical Monographs, 2nd edn. The Clarendon Press, New York (1995)
Mangazeev, V.: On the Yang–Baxter equation for the six-vertex model. Nucl. Phys. B 882, 70–96 (2014). ar**v:1401.6494
Mucciconi, M., Petrov, L.: Spin q-Whittaker polynomials and deformed quantum Toda. ar**v:2003.14260
Molev, A.I., Tolstoy, V.N., Zhang, R.B.: On irreducibility of tensor products of evaluation modules for the quantum affine algebra. J. Phys. A Math. Gen. 37, 2385–2399 (2004). ar**v:math/0309468
Okounkov, A.: (Shifted) Macdonald polynomials: q-integral representation and combinatorial formula. Comput. Math. 112, 147–182 (1998). ar**v:q-alg/9605013
Okounkov, A.: Binomial formula for Macdonald polynomials and applications. Math. Res. Lett. 4, 533–553 (1997). ar**v:q-alg/9608021
Okounkov, A.: BC-type interpolation Macdonald polynomials and binomial formula for Koornwinder polynomials. Transform. Groups 3(2), 181–207 (1998). ar**v:q-alg/9611011
Okounkov, A.: On Newton interpolation of symmetric functions: a characterization of interpolation Macdonald polynomials. Adv. Appl. Math. 20, 395–428 (1998). ar**v:q-alg/9712052
Olshanski, G.: Interpolation Macdonald polynomials and Cauchy-type identities. J. Comb. Theory Ser. A 162, 65–117 (2019). ar**v:1712.08018
Okounkov, A., Olshanski, G.: Shifted Schur functions, Algebra Analiz 9(2), 73–146 (1997); St. Petersb. Math. J. 9(2), 239–300 (1998). ar**v:q-alg/9605042
Okounkov, A., Olshanski, G.: Shifted Jack polynomials, binomial formula, and applications. Math. Res. Lett. 4, 69–78 (1997). ar**v:q-alg/9608020
Petrov, L.: Refined Cauchy identity for spin Hall–Littlewood symmetric rational functions. J. Comb. Theory Ser. A 184, 105519 (2021). ar**v:2007.10886
Rains, E.: \(BC_n\)-symmetric polynomials. Transform. Groups 10, 63–132 (2005). ar**v:math/0112035
Rains, E.: \(BC_n\)-symmetric abelian functions. Duke Math. J. 135(1), 99–180 (2006). ar**v:math/0402113
Sahi, S.: The spectrum of certain invariant differential operators associated to Hermitian symmetric spaces. Lie theory and geometry. In: Brylinski J.-L. et al. (eds.), Progress in Mathematics, vol. 123, pp. 569–576. Boston: Birkhäuser (1994)
Sahi, S.: Interpolation, integrality, and a generalization of Macdonald’s polynomials. Int. Math. Res. Not. 10, 457–471 (1996)
Stembridge, J.R.: A short proof of Macdonald’s conjecture for the root systems of type \(A\). Proc. Am. Math. Soc. 102, 777–785 (1988)
Tsilevich, N.: Quantum inverse scattering method for the q-boson model and symmetric functions. Funct. Anal. Appl. 40(3), 207–217 (2006). ar**v:math-ph/0510073
Young, C.A.S., Mukhin, E.: Affinization of category O for quantum groups. Trans. Am. Math. Soc. 366, 4815–4847 (2014). arxiv.org/1204.2769
Zinn-Justin, P.: Six-Vertex, Loop and Tiling Models: Integrability and Combinatorics. Habilitation Thesis. Lambert Academic Publishing (2009). https://www.lpthe.jussieu.fr/~pzinn/publi/hdr.pdf. ar**v:0901.0665
Acknowledgements
The author is grateful to Alexei Borodin for his constant support and fruitful suggestions during this project, and to Matteo Mucciconi for his significant help with initiating this project. The author also would like to thank Pavel Etingof and Evgeny Mukhin for their clarifications regarding affine quantum groups and Grigori Olshanski for his explanation of the context around interpolation symmetric functions.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Korotkikh, S. Representation theoretic interpretation and interpolation properties of inhomogeneous spin q-Whittaker polynomials. Sel. Math. New Ser. 30, 40 (2024). https://doi.org/10.1007/s00029-024-00930-w
Accepted:
Published:
DOI: https://doi.org/10.1007/s00029-024-00930-w