A Class of Polynomial Recurrences Resulting in $(n/\log n,\, n/\log ^2n)$–Asymptotic Normality

Hitczenko, Paweł

doi:10.1007/s44007-024-00126-w

Original Research Article
Open access
Published: 15 July 2024

La Matematica (2024)

Paweł Hitczenko ORCID: orcid.org/0000-0001-8863-9295¹

Abstract

We consider sequences of polynomials that satisfy differential–difference recurrences. Polynomials satisfying such recurrences frequently appear as generating polynomials of integer valued random variables that are of interest in discrete mathematics. It is, therefore, of interest to understand the properties of such polynomials and their probabilistic consequences. We identify a class of polynomial recurrences that lead to a normal law with the expected value and the variance proportional to $n/\log n$ and $n/\log ^2n$, respectively. Examples include Stirling numbers of the second kind and other polynomials concerning set partitions as well as polynomials related to Whitney numbers of Dowling lattices.

1 Introduction and Motivation

Consider a sequence of polynomials

$$\begin{aligned} P_n(x)=\sum _{k=0}^q p_{n,k}x^k,\quad n\ge 0, \end{aligned}$$

where $q=q_n$, $p_{n,k}\ge 0$ and $\sum _{k=0}^q p_{n,k}>0$ for every n. We assume $p_{n,k}=0$ for $k>q$ and often use $k\ge 0$ as the range of the summation. Such polynomials are of interest in combinatorial probability since

$$\begin{aligned}\frac{P_n(x)}{P_n(1)}=\sum _{k\ge 0}\frac{p_{n,k}}{P_n(1)}x^k\end{aligned}$$

is the probability generating function of a non–negative, integer valued random variable $X_n$ whose distribution function is given by

$$\begin{aligned} \textbf{P}(X_n=k)=\frac{p_{n,k}}{P_n(1)},\quad k\ge 0.\end{aligned}$$

(1)

When the underlying combinatorial objects are defined recursively, their generating polynomials often follow recurrences of the form

$$\begin{aligned} P_n(x) =a_n(x)P_{n-1}(x)+b_n(x)P_{n-1}^{'}(x)+c_n(x)P_{n-2}(x) \end{aligned}$$

(2)

for specified sequences of functions $(a_n)$, $(b_n)$, and $(c_n)$. It is therefore of interest to analyze such recurrences, and there is by now an enormous literature on the subject going back to Euler, at the very least.

Many historical references and broad background are given in a recent paper [17] where the authors developed the limiting theory for solutions of the above recurrence when the polynomials are of the form:

$$\begin{aligned} a_n(x)=\alpha (x) n+\gamma (x),\quad b_n(x)=\beta (x)(1-x),\quad c_n(x)=0. \end{aligned}$$

(3)

The authors treat more than two hundred examples when $\beta (x)\ne 0$ and more than three hundred when $\beta (x)=0$ found in [24]. In addition, several examples with $c_n(x)\ne 0$ were discussed in Section 9.2 although in these cases the contribution of the term $c_n(x)P_{n-2}(x)$ was generally asymptotically negligible. Within this framework the authors derived many limiting laws with various limiting distributions, including a prominently featured normal law, but also several other discrete as well as continuous distributions. The main approach was through the method of moments, but an analytic approach based on partial differential equations (PDE) and singularity analysis of the generating function was also used. Another alternative approach is through real–rootedness of the generating polynomials (see, e.g., [16] for recent applications or [17] for a broader discussion and more detailed references).

As was stated in [17], the assumption that $b_n(1)=0$ is very important for the method of moments to work. Further, the assumptions made on the coefficient polynomials (particularly on $a_n(x)$) yielded a normal law with both the expected value and the variance linear in n, whenever the limit was Gaussian.

In the present note we concentrate on a situation that yields a normal law with the asymptotic mean and variance proportional to $n/\log n$ and $n/\log ^2n$, respectively, under different assumptions on $a_n(x)$ and $b_n(x)$. While not nearly as extensive as the case of the Eulerian recurrences treated in [17], it still covers a number of cases encountered in the literature. Examples include the classical case of Stirling numbers of the second kind as well as recurrences discussed, e.g., in [1–9, 13, 19–23, 25] or [26].

Specifically, we consider a sequence of polynomials $(P_n(x))$ satisfying the recurrence

$$\begin{aligned} P_n(x)=\gamma (x)P_{n-1}(x)+mxP'_{n-1}(x)+(n-1)c(x)P_{n-2}(x),\quad P_0(x)=1, \end{aligned}$$

(4)

where m is a constant and

$$\begin{aligned} \gamma (x)=\sum _{j=0}^k\gamma _jx^j, \quad c(x)=\sum _{j=0}^\ell c_jx^j \end{aligned}$$

(5)

are polynomials. Let

$$\begin{aligned} d:=\text{ deg }(\gamma (x)+c(x))\quad \text{ and }\quad \alpha _d:=[x^d]\Big ( \sum _{j=1}^k\frac{\gamma _j}{mj}x^j+\sum _{j=1}^\ell \frac{c_j}{m^2j^2}x^j\Big ). \end{aligned}$$

(6)

Then the following holds.

Theorem 1

Let $n\ge 1$. Let $X_n$ be a random variable whose probability generating function is given by (1) where the sequence $(P_n(x))$ satisfies the recurrence (4) with $m>0$. If d and $\alpha _d$ defined in (6) satisfy $d\ge 1$ and $\alpha _d>0$, then as $n\rightarrow \infty $

$$\begin{aligned} \frac{X_n-dn/\log n}{d\sqrt{n}/\log n}{\mathop {\longrightarrow }\limits ^{{\mathcal {L}}}}N(0,1), \end{aligned}$$

where “${\mathop {\longrightarrow }\limits ^{{\mathcal {L}}}}$” denotes convergence in distribution and $N(0,1)$ is the standard normal law.
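To make the setting of Theorem 1 concrete, the following Python sketch iterates recurrence (4) on coefficient lists and computes the exact mean and variance of $X_n$. The helper names (`poly_mul`, `poly_add`, `polys`, `mean_var`) and the list-of-coefficients representation are illustrative choices, not taken from the paper, and integer coefficients are assumed. The example uses $\gamma (x)=x$, $m=1$, $c(x)=0$ (Stirling numbers of the second kind, $d=1$).

```python
from fractions import Fraction
from math import log

def poly_mul(a, b):
    """Product of two polynomials given as coefficient lists (index = power of x)."""
    out = [0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out

def poly_add(a, b):
    """Sum of two coefficient lists of possibly different lengths."""
    n = max(len(a), len(b))
    return [(a[i] if i < len(a) else 0) + (b[i] if i < len(b) else 0) for i in range(n)]

def polys(gamma, m, c, n_max):
    """Return [P_0, ..., P_{n_max}] from recurrence (4) with P_0 = 1."""
    P = [[1]]
    for n in range(1, n_max + 1):
        prev = P[n - 1]
        # gamma(x) P_{n-1}(x) + m x P'_{n-1}(x); the x^k coefficient of x P' is k p_k
        term = poly_add(poly_mul(gamma, prev), [m * k * pk for k, pk in enumerate(prev)])
        if n >= 2:
            # (n-1) c(x) P_{n-2}(x)
            term = poly_add(term, [(n - 1) * t for t in poly_mul(c, P[n - 2])])
        P.append(term)
    return P

def mean_var(p):
    """Exact mean and variance of X_n with P(X_n = k) = p_k / P_n(1)."""
    total = sum(p)
    mean = Fraction(sum(k * pk for k, pk in enumerate(p)), total)
    second = Fraction(sum(k * k * pk for k, pk in enumerate(p)), total)
    return mean, second - mean * mean

n = 200
P = polys(gamma=[0, 1], m=1, c=[0], n_max=n)
mean, var = mean_var(P[n])
print("mean:", float(mean), " leading term n/log n:", n / log(n))
print("var: ", float(var), " leading term n/log^2 n:", n / log(n) ** 2)
```

Because the normalizing quantities involve $\log n$, the exact values approach the leading terms slowly; the sketch is meant to show the mechanics of (4), not the rate of convergence.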

Remark 1

(i) The conditions we imposed on the coefficient polynomials, namely,

$$\begin{aligned} a_n(x)=\gamma (x), \quad b_n(x)=mx,\quad c_n(x)=(n-1)c(x), \end{aligned}$$

seem quite restrictive. Nonetheless, in virtually all recurrences of this type $(b_n)$ do not depend on n. If one drops the requirement that $b_n(1)=0$, then $b_n(x)=mx$ is common (it is also responsible for the asymptotic values of the expectation and the variance). Also, if the polynomials $c_n(x)$ are assumed to be linear functions of n (with coefficients that are polynomials in x), assuming that they are of the form $(n-1)c(x)$ is not a serious restriction as the other term (a polynomial in x not depending on n) would not contribute significantly. The assumption on $(a_n(x))$ (which corresponds to setting $\alpha (x)=0$ in (3)) still covers a number of cases including Stirling numbers of the second kind (see Sect. 3 below for some specific examples). This assumption is complementary to [17] where it was assumed $\alpha (1)>0$ (which, as we mentioned above, led to the limiting normal law with the expected value and the variance linear in n).

(ii) Other approaches to establishing asymptotic normality have been used and some of them are discussed in [17]. One of them is based on showing that the polynomials $(P_n(x))$ have real roots only (in fact this is the case for many of the examples discussed in the references we mentioned earlier). The real–rootedness is of interest in itself and has been studied extensively. Examples of relatively recent work in some degree of generality in that direction include [10, 21]. However, as pointed out in [17], the real–rootedness property seems quite sensitive to variations in the coefficient polynomials $\gamma (x)$, b(x), and c(x). In addition, one still needs to show that the variance of the resulting random variables grows to infinity with n, which often in this context is not a substantially easier task. In some of the referenced papers the asymptotic normality was asserted; in others it was not. The approach requires working with specific recurrences to establish the real–rootedness. Our result provides some generality and uniformity for a class of frequently encountered polynomials. In virtually all of the examples referenced earlier, the degree of $\gamma (x)+c(x)$ is one resulting in the expected value and the variance asymptotic to $n/\log n$ and $n/\log ^2n$, respectively.

2 Proof of Theorem 1

Our proof proceeds along the typical lines. We first find the explicit form of the bivariate generating function F(z, x) that encodes the probability distribution function of the underlying random variables and then carry out the asymptotic analysis of its coefficients.

2.1 A PDE associated with (4)

To derive the bivariate generating function we consider a partial differential equation that F(z, x) satisfies. We note that in our situation the resulting PDE can be solved by the method of characteristics, giving the explicit expression for F(z, x). While this approach is not new, it has not been used much in this context. For more on the method of characteristics we refer to [18] or almost any other textbook on PDEs. One of the advantages is that it provides a simple and transparent way of deriving the bivariate generating function in cases when the resulting PDE can be solved. Our situation is particularly simple. Let

$$\begin{aligned} F(z,x):=\sum _{n=0}^\infty P_n(x)\frac{z^n}{n!}. \end{aligned}$$

(7)

We differentiate (7) with respect to z. Using (4) (and a convention that $P_j(x)=0$ whenever $j<0$) gives

$$\begin{aligned}\frac{\partial }{\partial z}F(z,x)&=\sum _{n=1}^\infty \Big \{\gamma (x)P_{n-1}(x)+mxP'_{n-1}(x)+(n-1)c(x)P_{n-2}(x)\Big \}\frac{z^{n-1}}{(n-1)!}\\ {}&= \gamma (x)F(z,x)+mx\frac{\partial }{\partial x}F(z,x)+c(x)zF(z,x) \end{aligned}$$

or

$$\begin{aligned} \frac{\partial }{\partial z}F(z,x)-mx\frac{\partial }{\partial x}F(z,x)=(\gamma (x)+c(x)z)F(z,x). \end{aligned}$$

(8)

With $F(0,x)=P_0(x)$ (which usually is equal to 1) this is easily solved by the method of characteristics. Namely, by setting

$$\begin{aligned} \frac{dx}{dz}=-mx\end{aligned}$$

(9)

PDE (8) is reduced to the ordinary differential equation:

$$\begin{aligned} \frac{d}{dz}F(z,x(z))=(\gamma (x(z))+zc(x(z)))F(z,x(z)) \end{aligned}$$

whose solution is

$$\begin{aligned} F(z,x(z))=\exp \left\{ \int (\gamma (x(z))+zc(x(z)))dz\right\} , \end{aligned}$$

where by (9) $x(z)=\xi e^{- mz}, $ and the parameter $\xi $ is treated as a constant of integration. Eliminating the parameter $\xi =xe^{mz}$ and choosing the constant of integration so that $F(0,x)=P_0(x)$ gives the explicit expression for F(z, x).

In our case, using (5) we obtain

$$\begin{aligned}&\int \Big (\gamma (\xi e^{-mz})+zc(\xi e^{-mz})\Big )dz=\gamma _0z+c_0\frac{z^2}{2}+C_0\\ {}&\quad -\sum _{j=1}^k\frac{\gamma _j}{jm}\xi ^j(e^{-jmz}+C_j)-\sum _{j=1}^\ell \frac{c_j}{jm}\xi ^j\left( ze^{-jmz}+\frac{e^{-jmz}}{jm}+K_j\right) , \end{aligned}$$

where $C_j$ and $K_j$ are integration constants. Setting $F(0,x)=1$ means that $C_0=0$, $C_j=-1$, $1\le j\le k$ and $K_j=-1/(jm)$, $1\le j\le \ell $. Thus, using $\xi =xe^{mz}$ we obtain

$$\begin{aligned} F(z,x)= \exp \left\{ \gamma _0z+c_0\frac{z^2}{2}+\sum _{j=1}^k\frac{\gamma _j}{jm}x^j(e^{jmz}-1)+\sum _{j=1}^\ell \frac{c_j}{jm}x^j\left( \frac{e^{jmz}-1}{jm}-z\right) \right\} .\end{aligned}$$

(10)
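The closed form (10) can be cross-checked against recurrence (4) with exact rational arithmetic. The sketch below is an illustration under our own assumptions: integer coefficients $\gamma _j$, $c_j$, an integer $m$, and hypothetical helper names (`polys`, `exponent_series`, `exp_series`, `check`). It expands the exponent of (10) in powers of $z$ at a fixed rational $x$, exponentiates the series, and compares $n!\,[z^n]F(z,x)$ with $P_n(x)$.

```python
from fractions import Fraction
from math import factorial

def polys(gamma, m, c, n_max):
    """Coefficient lists of P_0..P_{n_max} from recurrence (4), P_0 = 1."""
    size = n_max * max(len(gamma), len(c)) + 1        # room for every possible degree
    P = [[1] + [0] * (size - 1)]
    for n in range(1, n_max + 1):
        new = [0] * size
        for k, pk in enumerate(P[n - 1]):
            if pk:
                new[k] += m * k * pk                  # m x P'_{n-1}
                for j, gj in enumerate(gamma):
                    new[k + j] += gj * pk             # gamma(x) P_{n-1}
        if n >= 2:
            for k, pk in enumerate(P[n - 2]):
                if pk:
                    for j, cj in enumerate(c):
                        new[k + j] += (n - 1) * cj * pk   # (n-1) c(x) P_{n-2}
        P.append(new)
    return P

def exponent_series(gamma, m, c, x, n_max):
    """Taylor coefficients in z (up to z^n_max) of the exponent in (10) at fixed x."""
    g = [Fraction(0)] * (n_max + 1)
    if n_max >= 1:
        g[1] += gamma[0]                              # gamma_0 z
        for j in range(1, len(c)):                    # the "- z" terms in the last sum
            g[1] -= Fraction(c[j], j * m) * x ** j
    if n_max >= 2:
        g[2] += Fraction(c[0], 2)                     # c_0 z^2 / 2
    for i in range(1, n_max + 1):
        for j in range(1, max(len(gamma), len(c))):
            e_i = Fraction((j * m) ** i, factorial(i))    # [z^i] (e^{jmz} - 1)
            if j < len(gamma):
                g[i] += Fraction(gamma[j], j * m) * x ** j * e_i
            if j < len(c):
                g[i] += Fraction(c[j], (j * m) ** 2) * x ** j * e_i
    return g

def exp_series(g):
    """Coefficients of exp(g(z)) when g[0] = 0, using n F_n = sum_k k g_k F_{n-k}."""
    F = [Fraction(1)] + [Fraction(0)] * (len(g) - 1)
    for n in range(1, len(g)):
        F[n] = sum(k * g[k] * F[n - k] for k in range(1, n + 1)) / n
    return F

def check(gamma, m, c, x, n_max=8):
    """True if n! [z^n] F(z, x) from (10) equals P_n(x) from (4) for all n <= n_max."""
    P = polys(gamma, m, c, n_max)
    F = exp_series(exponent_series(gamma, m, c, x, n_max))
    return all(factorial(n) * F[n] == sum(pk * x ** k for k, pk in enumerate(P[n]))
               for n in range(n_max + 1))

print(check([1, 1], 2, [0], Fraction(3, 2)))      # gamma(x) = x + 1, m = 2, c(x) = 0
print(check([0], 1, [0, 1], Fraction(3, 2)))      # gamma(x) = 0, m = 1, c(x) = x
```

The second call corresponds to the recurrence for partitions without singleton blocks discussed in Sect. 3.4.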

2.2 Asymptotics of $[z^n]F(z,x)$

Functions given by (10) are amenable to the perturbation of the saddle point asymptotics. We refer to Section IX.8 of [12] for a discussion of the perturbation aspects in the bivariate case and to Chapter VIII for a detailed presentation of the saddle point estimation. In our case, we can invoke the general principles developed in Hayman’s work [15] on admissibility. (Essentially, a function is called admissible if it is subject to the saddle point asymptotics; we refer the reader to [12, Section VIII.5] or the original work of Hayman [15] for more details.)

Let $\Omega $ be a small neighborhood of $x=1$ (in particular $Re(x)>0$ for $x\in \Omega $). We fix $x\in \Omega $ for a moment and write the exponent in (10) as

$$\begin{aligned} f(z,x)=Q_1(z,x)+Q_2(xe^{mz}) \end{aligned}$$

(11)

where

$$\begin{aligned}Q_1(z,x)= -\sum _{j=1}^k\frac{\gamma _jx^j}{jm}-\sum _{j=1}^\ell \frac{c_jx^j}{j^2m^2}+\Big (\gamma _0-\sum _{j=1}^\ell \frac{c_jx^j}{jm}\Big )z+c_0\frac{z^ 2}{2} \end{aligned}$$

and

$$\begin{aligned} Q_2(z)=\sum _{j=1}^k\frac{\gamma _j}{jm}z^{j}+\sum _{j=1}^\ell \frac{c_j}{j^2m^2}z^{j}. \end{aligned}$$

Clearly, the function $e^{mz}$ is admissible if $m>0$ (it also follows from [15, Theorem X]). Since the leading coefficient of $Q_2$ is positive and $x>0$, $Q_2(xe^{mz})$ is admissible and so is $Q_1(z,x)+Q_2(xe^{mz})$ by [15, Corollary to Theorem IX]. Thus, f(z, x) is admissible and so is F(z, x) by [15, Theorem VI].

We will apply Hayman’s result to $e^{f(z,x)}$. Choose r so that

$$\begin{aligned} (zf_z(z,x))_{z=r}=n. \end{aligned}$$

(12)

Then

$$\begin{aligned} {}[z^n]F(z,x)= \frac{r^{-n}}{2\pi }\int _{-\pi }^\pi e^{f(re^{i\theta },x)-in\theta }d\theta \sim \frac{r^{-n}e^{f(r,x)}}{\sqrt{2\pi b(r,x)}}, \end{aligned}$$

where $b(r,x)=(zf_z(z,x)+z^2f_{zz}(z,x))_{z=r}$ and where $f_z(z,x)$, $f_{zz}(z,x)$ (and later $f_x(z,x)$, $f_{zx}(z,x)$, etc.) denote the derivative(s) of f with respect to the indicated variable(s) (not just the partial derivative(s) with respect to the first or second argument of f).

By examining the argument above and noting that the dependence on x is polynomial, it is clear that the above estimates are uniform for $x\in \Omega $. We thus infer that the probability generating functions of random variables encoded by F(z, x) are given asymptotically by

$$\begin{aligned} p_n(x)=\frac{F(\rho (x),x)}{F(\rho (1),1)}\left( \frac{\rho (1)}{\rho (x)}\right) ^n(1+o(1)), \end{aligned}$$

where the o(1) error is uniform over $\Omega $ and $\rho (x)=\rho _n(x)$ is a positive solution of the saddle point equation (12) for $x\in \Omega $. In particular, $\rho (x)$ satisfies

$$\begin{aligned} \rho (x)f_z(z,x)_{z=\rho (x)}=n. \end{aligned}$$

(13)

Taking the logarithms and recalling by (6) that the leading coefficient of $Q_2(z)$ is $\alpha _d$ we see that

$$\begin{aligned} \log \rho (x)+\log \left( md\alpha _dx^de^{md\rho (x)}\Big (1+O\Big (\frac{1}{xe^{m\rho (x)}}\Big )\Big )\right) =\log n. \end{aligned}$$

It follows that

$$\begin{aligned} md\rho (x)=\log n-\log \rho (x)-d\log x-\log md\alpha _d+O\Big (\frac{1}{xe^{m\rho (x)}}\Big ). \end{aligned}$$

Successive iterations starting with $x=1$ give

$$\begin{aligned} \rho (x)=\frac{\log n}{md}+O(\log \log n) \end{aligned}$$

(14)

with the uniform behavior in x over $\Omega $.
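The saddle point equation (13) is easy to solve numerically, which gives a feel for (14). The sketch below is restricted, as an assumption made for illustration, to the family $\gamma (x)=x+\gamma _0$, $c(x)=0$, for which $f(z,x)=\gamma _0z+\frac{x}{m}(e^{mz}-1)$, $d=1$ and $md\alpha _d=1$; the function name `saddle_point` and the parameter name `shift` (standing for $\gamma _0\ge 0$) are ours.

```python
from math import exp, log

def saddle_point(n, m=1, shift=0.0, x=1.0):
    """Positive root rho of z * f_z(z, x) = n for f(z, x) = shift*z + (x/m)*(exp(m*z) - 1)."""
    def g(z):
        return z * (shift + x * exp(m * z)) - n
    lo, hi = 0.0, 1.0
    while g(hi) < 0:            # g is increasing on z > 0 when shift >= 0 and x > 0
        hi *= 2.0
    for _ in range(200):        # plain bisection
        mid = 0.5 * (lo + hi)
        if g(mid) < 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

for n in (10**2, 10**4, 10**6):
    m = 1
    rho = saddle_point(n, m=m)
    first = log(n) / m                           # leading term in (14) with d = 1
    second = (log(n) - log(log(n) / m)) / m      # one more iteration of m*rho = log n - log rho
    print(n, round(rho, 4), round(first, 4), round(second, 4))
```

The printed columns show that $\log n/(md)$ overestimates $\rho $ by a term of order $\log \log n$, and that one further iteration of the fixed point relation already reduces the gap.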

Let us denote

$$\begin{aligned}h_n(x):=f(\rho (x),x)-n\log \rho (x),\end{aligned}$$

so that

$$\begin{aligned}p_n(x)=\exp (h_n(x)-h_n(1))(1+o(1)).\end{aligned}$$

Then, by [12, Theorem IX.13] on generalized quasi–powers, the corresponding random variables are asymptotically normal provided

$$\begin{aligned} \frac{h_n'''(x)}{(h'_n(1)+h''_n(1))^{3/2}}\rightarrow 0, \end{aligned}$$

(15)

uniformly over $\Omega $. Moreover, the mean and the variance are asymptotic to $h'_n(1)$ and to $h_n'(1)+h_n''(1)$, respectively. Differentiating $h_n(x)$ we get

$$\begin{aligned}h_n'(x)=\rho '(x)f_z(z,x)_{z=\rho (x)}+f_x(z,x)_{z=\rho (x)}-n\frac{\rho '(x)}{\rho (x)}. \end{aligned}$$

In view of (13)

$$\begin{aligned} \rho '(x)f_z(z,x)_{z=\rho (x)}=\frac{\rho '(x)}{\rho (x)}\rho (x)f_z(z,x)_{z=\rho (x)}=n\frac{\rho '(x)}{\rho (x)}. \end{aligned}$$

Thus, $h'_n(x)$ simplifies to

$$\begin{aligned} h_n'(x)=f_x(z,x)_{z=\rho (x)}. \end{aligned}$$

Differentiating again yields

$$\begin{aligned} h''_n(x)=\rho '(x)f_{zx}(z,x)_{z=\rho (x)}+f_{xx}(z,x)_{z=\rho (x)}. \end{aligned}$$

Finally, implicit differentiation of (13) gives

$$\begin{aligned} \rho '(x)f_z(z,x)_{z=\rho (x)}+\rho (x)\Big (\rho '(x)f_{zz}(z,x)_{z=\rho (x)}+f_{zx}(z,x)_{z=\rho (x)}\Big )=0. \end{aligned}$$

After rearranging the terms and putting $z=\rho (x)$

$$\begin{aligned} \rho '(x)=\frac{-\rho (x)f_{zx}(\rho (x),x)}{f_z(\rho (x),x)+\rho (x)f_{zz}(\rho (x),x)}. \end{aligned}$$

Evaluating at $x=1$ (and writing $\rho =\rho (1)$, $h'_n=h'_n(1)$, etc.) we arrive at

$$\begin{aligned} \rho '=\frac{-\rho f_{zx}(\rho ,1)}{f_z(\rho ,1)+\rho f_{zz}(\rho ,1)}. \end{aligned}$$

(16)

Since $\rho \rightarrow \infty $ as $n\rightarrow \infty $ and f(z, x) is a polynomial in x, z and $e^z$, it is clear that the asymptotic behavior of the relevant expressions depends on the coefficients of highest power of $e^\rho $. Specifically,

$$\begin{aligned}f_x(\rho ,1)&\sim d\alpha _de^{\rho md}\\ f_z(\rho ,1)&\sim md\alpha _de^{\rho md}\\ f_{zz}(\rho ,1)&\sim m^2d^2\alpha _de^{\rho md}\\ f_{zx}(\rho ,1)&\sim md^2\alpha _de^{\rho md}\\ f_{xx}(\rho ,1)&\sim d(d-1)\alpha _de^{\rho md}. \end{aligned}$$

Further, (13) implies that

$$\begin{aligned} \rho md\alpha _de^{\rho md}\sim n. \end{aligned}$$

Thus, using (14),

$$\begin{aligned} h_n'= f_x(\rho ,1)\sim d\alpha _de^{\rho md}\sim \frac{n}{m\rho }\sim \frac{dn}{\log n}. \end{aligned}$$

(17)

Similarly,

$$\begin{aligned} h''_n=\rho 'f_{zx}(\rho ,1)+f_{xx}(\rho ,1)\sim \rho 'md^2\alpha _de^{\rho md}+d(d-1)\alpha _de^{\rho md} \end{aligned}$$

(18)

so that

$$\begin{aligned} h_n'+h_n''\sim (1+m\rho ')d^2\alpha _de^{\rho md}. \end{aligned}$$

By (16)

$$\begin{aligned} \rho '\sim \frac{-\rho md^2\alpha _de^{\rho md}}{md\alpha _de^{\rho md}+\rho m^2d^2\alpha _de^{\rho md}}\sim \frac{-\rho d}{1+\rho md}. \end{aligned}$$

Hence,

$$\begin{aligned}h_n'+h_n''&\sim \left( 1-\frac{\rho md}{1+m\rho d}\right) d^2\alpha _de^{\rho md}\sim \frac{1}{1+\log n}\frac{d}{m}md\alpha _de^{\rho md}\\ {}&\sim \frac{1}{1+\log n}\frac{d}{m}\frac{n}{\rho }\sim \frac{d^2n}{\log ^2 n} . \end{aligned}$$

Finally, it is clear from (17), (18) and the form of f(z, x) that (15) holds uniformly over a small neighborhood $\Omega $ of $x=1$. This completes the proof.
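As a sanity check on the expressions for $h_n'$ and $h_n'+h_n''$, one can compare them with exact moments in the simplest case $\gamma (x)=x$, $m=1$, $c(x)=0$, where $f(z,x)=x(e^z-1)$ and $X_n$ is the number of blocks of a uniformly random partition of an $n$–set. The following sketch is an illustration only; the function names are ours, and the saddle point is found by a basic Newton iteration.

```python
from fractions import Fraction
from math import exp, log

def stirling_row(n):
    """Row n of Stirling numbers of the second kind via S(n,k) = S(n-1,k-1) + k S(n-1,k)."""
    row = [1]
    for _ in range(n):
        prev = row + [0]
        row = [(prev[k - 1] if k else 0) + k * prev[k] for k in range(len(prev))]
    return row

def exact_mean_var(n):
    """Exact mean and variance of the number of blocks, as floats."""
    row = stirling_row(n)
    total = sum(row)
    mean = Fraction(sum(k * s for k, s in enumerate(row)), total)
    second = Fraction(sum(k * k * s for k, s in enumerate(row)), total)
    return float(mean), float(second - mean * mean)

def saddle_mean_var(n):
    """h_n'(1) and h_n'(1) + h_n''(1) for f(z, x) = x (e^z - 1)."""
    rho = log(n)                                   # Newton iteration for rho e^rho = n
    for _ in range(50):
        rho -= (rho * exp(rho) - n) / ((1 + rho) * exp(rho))
    e = exp(rho)
    rho_prime = -rho * e / (e + rho * e)           # (16): f_zx = f_z = f_zz = e^rho at x = 1
    h1 = e - 1                                     # f_x(rho, 1)
    h2 = rho_prime * e                             # rho' f_zx + f_xx, with f_xx = 0 here
    return h1, h1 + h2

n = 200
print("exact :", exact_mean_var(n))
print("saddle:", saddle_mean_var(n))
print("leading terms:", n / log(n), n / log(n) ** 2)
```

The saddle point quantities track the exact moments closely, while the leading terms $n/\log n$ and $n/\log ^2n$ are reached only slowly because of the $\log \log n$ correction in (14).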

3 Examples and Further Comments

3.1 Set Partitions of Type $B_n$

Wang [26] established the normal limit law for the number of non–zero blocks in the colored set partitions of type $B_n$ (we refer to [26] for the definitions and background). This amounted to analyzing polynomial recurrences of the form

$$\begin{aligned} T_n(x)=(x+c)T_{n-1}(x)+mxT'_{n-1}(x),\quad n\ge 1, \quad T_0(x)=1, \end{aligned}$$

(19)

where c and m are positive integers. In order to do it, he showed that each $T_n(x)$ has real, negative roots. This implies that the resulting $X_n$ is a sum of independent indicators. He then used the formulas

$$\begin{aligned} \textbf{E} X_n =\frac{T_{n+1}(1)}{mT_n(1)}-\frac{1+c}{m},\quad \textbf{var}(X_n)=\frac{T_{n+2}(1)}{m^2T_n(1)}-\frac{T^2_{n+1}(1)}{m^2T^2_n(1)}-\frac{1}{m} \end{aligned}$$

to derive the asymptotics

$$\begin{aligned} \textbf{E} X_n \sim \frac{n}{\log n},\quad \textbf{var}(X_n)\sim \frac{n}{\log ^2n}. \end{aligned}$$

The last step relied on the asymptotic analysis of $T_n(1)$. While the calculations for the expected value were a straightforward application of the saddle point method, the variance was more delicate due to cancellations in $T_{n+2}(1)-T^2_{n+1}(1)$.

Alternatively, the asymptotic normality follows from Theorem 1 with $\gamma (x)=x+c$ and $c(x)=0$.
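The moment identities used in this argument follow from (19) by differentiating and setting $x=1$, and they can be verified exactly for small $n$. The sketch below (helper names are ours) checks the mean identity and the variance identity written in ratio form, $\textbf{var}(X_n)=T_{n+2}(1)/(m^2T_n(1))-\big (T_{n+1}(1)/(mT_n(1))\big )^2-1/m$, which is the form obtained from that differentiation.

```python
from fractions import Fraction

def T_polys(m, c, n_max):
    """Coefficient lists of T_0..T_{n_max} from (19): T_n = (x + c) T_{n-1} + m x T'_{n-1}."""
    T = [[1]]
    for _ in range(n_max):
        prev = T[-1] + [0]
        T.append([(prev[k - 1] if k else 0) + (m * k + c) * prev[k]
                  for k in range(len(prev))])
    return T

def moments(p):
    """Exact mean and variance of the distribution with weights p_k."""
    total = sum(p)
    mean = Fraction(sum(k * pk for k, pk in enumerate(p)), total)
    var = Fraction(sum(k * k * pk for k, pk in enumerate(p)), total) - mean ** 2
    return mean, var

def check_identities(m, c, n):
    """Compare direct moments of X_n with the expressions in T_n(1), T_{n+1}(1), T_{n+2}(1)."""
    T = T_polys(m, c, n + 2)
    t0, t1, t2 = (sum(T[n + i]) for i in range(3))
    mean, var = moments(T[n])
    mean_ok = mean == Fraction(t1, m * t0) - Fraction(1 + c, m)
    var_ok = var == Fraction(t2, m * m * t0) - Fraction(t1, m * t0) ** 2 - Fraction(1, m)
    return mean_ok, var_ok

print(check_identities(m=2, c=1, n=6))
```

The variance expression makes the cancellation visible: both $T_{n+2}(1)/T_n(1)$ and $(T_{n+1}(1)/T_n(1))^2$ are of order $(n/\log n)^2$, while their difference is only of order $n/\log ^2n$.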

3.2 Whitney Numbers, Stirling Numbers and Their Generalizations

The coefficients $(T_{n,k})$ of the polynomials $(T_n(x))$ given by (19) satisfy the recurrence

$$\begin{aligned} T_{n,k}=T_{n-1,k-1}+(mk+c)T_{n-1,k},\quad 0\le k\le n.\end{aligned}$$

(20)

Versions of numbers satisfying (20) frequently appear in the literature under various names. In particular, when $c=1$ they are referred to as Whitney numbers of the second kind [3, 4]. The (ordinary) generating functions of Whitney numbers are called Dowling polynomials and the row sums of Whitney numbers are known as Dowling numbers. When $m=2$ Whitney numbers appear as A039755 (and A039756) in [24] under the name B–analogs of Stirling numbers of the second kind. Sequences A007405, A003575–A003582, A364069 and A364070 are Dowling numbers for $m=2,3,\dots , 10$, $m=64$ and $m=624$, respectively.
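Recurrence (20) is straightforward to tabulate. The short sketch below (the function name `triangle` is ours) builds the triangle for given $m$ and $c$ with $T_{0,0}=1$; with $m=2$, $c=1$ it produces the Whitney numbers of the second kind, and the row sums are the corresponding Dowling numbers.

```python
def triangle(m, c, n_max):
    """Rows 0..n_max of the numbers T_{n,k} defined by (20) with T_{0,0} = 1."""
    rows = [[1]]
    for _ in range(n_max):
        prev = rows[-1] + [0]
        rows.append([(prev[k - 1] if k else 0) + (m * k + c) * prev[k]
                     for k in range(len(prev))])
    return rows

whitney = triangle(m=2, c=1, n_max=5)
for row in whitney:
    print(row)
print("row sums:", [sum(row) for row in whitney])
```

Setting `m=1, c=0` in the same function gives the classical Stirling numbers of the second kind.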

The case $c=0$ corresponds to translated Whitney numbers (see, e.g., [1]). Examples in [24] are sequences A075497 through A075505, which correspond to $m=2,\dots ,10$. The case $c=r$ gives the r–Whitney numbers [7]. The latter are also referred to as the $(r,\beta )$–Stirling numbers [9] ($c=r$, $m=\beta $). This is because the numbers for the case $\beta =1$ are essentially the r–Stirling numbers of the second kind [6]. Specifically, the r–Stirling number $\left\{ \begin{array}{l} n \\ k \end{array}\right\} _r$ counts the number of partitions of the set $[n]:=\{1,2,\dots ,n\}$ into k blocks, such that the numbers 1 through r are in different blocks. If $T_{n,k}$ are (r, 1)–Stirling numbers as defined in [9] then the relation is

$$\begin{aligned} \left\{ \begin{array}{l}n\\ k \end{array}\right\} _r=T_{n-r,k-r} \quad \text{ for }\quad n\ge k\ge r. \end{aligned}$$

Since $T_{n,k}$ satisfy (20) with $m=1$, $c=r$, all r–Stirling numbers $\left\{ \begin{array}{l}n \\ k \end{array}\right\} _r$ satisfy (20) with $m=1$, $c=0$ but with different initial condition, namely, $\left\{ \begin{array}{l}n\\ k\end{array}\right\} _r=\delta _{n,r}$, $n\le r$. For $r=2, 3, 4$, r–Stirling numbers are sequences A143494–A143496 in [24]. Of course, classical Stirling numbers of the second kind correspond to $r=1$ and their $(n/\log n,n/\log ^2n)$–asymptotic normality is well–known and was established by Harper [14], see also a discussion in [12, Example III.11 and Proposition IX.20]. By Theorem 1 all the variants mentioned above follow the same distribution.

Whitney numbers were introduced in the context of geometric lattices associated with groups [11], see also [2]. Later, combinatorial interpretations (mostly related to restricted and colored set partitions) were found. A general nature of these restrictions is discussed in [13]. There seem to be overlaps in the literature between these various families. Partially for this reason, we limited references to the papers most relevant here. More details and history may be found therein.

3.3 Further Examples

Other examples of sequences in [24] satisfying (20) are A186695 ($m=2$, $c=-1$) or A111577 ($m=3$, $c=-2$). Both are referred to as Galton triangles and both have $c<0$.

Sequences satisfying (20) with $c=m-1$ are referred to as (scaled) Stirling–Frobenius subset numbers. For $m=1$ through $m=4$ they are A048993, A039755, A225468 and A225469 in [24], respectively.

Numbers (S2[d, a](n, k)) where a, d are non–negative integers with $\text{ gcd }(d,a)=1$ are called Sheffer triangles (see [20] for a general discussion and [19, Example 4] for an example relevant here). They satisfy the recurrence

$$\begin{aligned} S2[d,a](n,k)=dS2[d,a](n-1,k-1)+(a+dk)S2[d,a](n-1,k). \end{aligned}$$

Thus, their generating polynomials satisfy (4) with $\gamma (x)=dx+a$, $m=d$ and $c(x)=0$. Sequences A048993, A039755, A154537, A282629, A225466, A285061 and A225467–A225469 are the numbers S2[d, a] for various values of d and a. The scaled Stirling–Frobenius subset numbers mentioned earlier are special cases $S2[m,m-1]$. We note that the “unscaled” Stirling–Frobenius numbers are numbers satisfying (20) with $c=m-1$. This situation was discussed earlier.

While some of these families of numbers appeared in different contexts, from the point of view of the asymptotics, their behavior is the same. By Theorem 1 they are all asymptotically normal with the mean $n/\log n$ and the variance $n/\log ^2n$. For some of these numbers their asymptotic normality has been explicitly stated; for others it has not, it seems. However, it should be noted that the bivariate generating function is available in the explicit form (and has been derived in some cases). This could then be used to carry out the asymptotic analysis. The methods for deriving the bivariate generating function varied from case to case. The method of characteristics, outlined above, seems to provide some uniformity in this respect.

3.4 Set Partitions Without Small Blocks: s–Associated Stirling Numbers

In most of the cases in the literature c(x) is identically zero. One natural example where this is not the case is provided by the recurrence for $0\le k\le n$:

$$\begin{aligned} D_{n,k}=kD_{n-1,k}+(n-1)D_{n-2,k-1}, \quad D_{0,0}=1. \end{aligned}$$

The numbers $D_{n,k}$ count the number of set partitions of an n–set into k blocks, each of size at least 2 (see e.g. [5] for a relatively recent reference and discussion of some of its properties). It results in the following recurrence for the generating polynomials

$$\begin{aligned} D_n(x)=xD_{n-1}'(x)+(n-1)xD_{n-2}(x),\quad D_0(x)=1. \end{aligned}$$

Theorem 1 applies with $\gamma (x)=0$, $m=1$ and $c(x)=x$. Thus, for partitions of an n–set into blocks of size at least two, the number of blocks is asymptotically normal with the expected value asymptotic to $n/\log n$ and the variance asymptotic to $n/\log ^2n$. In [5], real–rootedness of the polynomials $D_n(x)$ was established and its various consequences have been discussed although the asymptotic normality was not one of them.

This example is actually a special case of the so–called s–associated Stirling numbers of the second kind (see [8, pp. 221–222]). They count the number of set partitions into blocks of sizes at least s. The analogous recurrence is

$$\begin{aligned}D_{n,k}=kD_{n-1,k}+\left( {\begin{array}{c}n-1\\ s-1\end{array}}\right) D_{n-s,k-1}.\end{aligned}$$

This gives the polynomial recurrence

$$\begin{aligned}D_n(x)=xD'_{n-1}(x)+\frac{x}{(s-1)!}(n-1)_{s-1}D_{n-s}(x),\end{aligned}$$

where $(x)_m$ is the falling factorial. When $s\ge 3$ this is technically outside of the scope of Theorem 1, but one can argue in exactly the same way: for

$$\begin{aligned}F(z,x)=\sum _n\frac{z^n}{n!}D_n(x)\end{aligned}$$

PDE (8) takes the form:

$$\begin{aligned}\frac{\partial }{\partial z} F(z,x)-x\frac{\partial }{\partial x}F(z,x)=\frac{x}{(s-1)!}z^{s-1}F(z,x), \end{aligned}$$

and hence, with $x(z)=\xi e^{-z}$,

$$\begin{aligned}F(z,x(z))=\exp \left\{ \frac{\xi }{(s-1)!}\int z^{s-1}e^{-z}dz\right\} =\exp \Big \{\xi \Big (-\sum _{j=1}^{s}\frac{z^{s-j}}{(s-j)!}e^{-z}+C\Big )\Big \}. \end{aligned}$$

As $1=F(0,x)=e^{\xi (-1+C)}$, $C=1$, and eliminating $\xi =xe^z$ gives

$$\begin{aligned} F(z,x)=\exp \Big \{x\Big (e^z-\sum _{j=0}^{s-1}\frac{z^j}{j!}\Big )\Big \}\end{aligned}$$

(21)

as given e.g. in [8]. The asymptotic analysis applies with

$$\begin{aligned}Q_1(z,x)=-x\sum _{j=0}^{s-1}\frac{z^j}{j!}\quad \text{ and }\quad Q_2(z)=z\end{aligned}$$

in (11). This yields, as for the cases $s=1$ and $s=2$, the asymptotically normal law with mean $n/\log n$ and the variance $n/\log ^2n$.
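The recurrence for the s–associated Stirling numbers and the closed form (21) can be compared directly. The following sketch (function names are ours; $x$ is taken to be a rational number) tabulates $D_{n,k}$ from the recurrence and checks that $n!\,[z^n]F(z,x)$ computed from (21) equals $\sum _kD_{n,k}x^k$.

```python
from fractions import Fraction
from math import comb, factorial

def associated_rows(s, n_max):
    """D[n][k]: partitions of an n-set into k blocks of size >= s, with D[0][0] = 1."""
    D = [[0] * (n_max + 1) for _ in range(n_max + 1)]
    D[0][0] = 1
    for n in range(1, n_max + 1):
        for k in range(n + 1):
            val = k * D[n - 1][k]
            if k >= 1 and n >= s:
                val += comb(n - 1, s - 1) * D[n - s][k - 1]
            D[n][k] = val
    return D

def egf_values(s, x, n_max):
    """n! [z^n] exp{x (e^z - sum_{j<s} z^j / j!)} for n = 0..n_max, following (21)."""
    # exponent coefficients: x / i! for i >= s, zero below
    g = [Fraction(x) / factorial(i) if i >= s else Fraction(0) for i in range(n_max + 1)]
    F = [Fraction(1)] + [Fraction(0)] * n_max
    for n in range(1, n_max + 1):
        # exp of a power series: n F_n = sum_k k g_k F_{n-k}
        F[n] = sum(k * g[k] * F[n - k] for k in range(1, n + 1)) / n
    return [factorial(n) * F[n] for n in range(n_max + 1)]

x = Fraction(2, 3)
for s in (1, 2, 3):
    D = associated_rows(s, 8)
    from_recurrence = [sum(D[n][k] * x ** k for k in range(n + 1)) for n in range(9)]
    print(s, from_recurrence == egf_values(s, x, 8))
```

For $s=2$ and $n=6$ the table gives 1, 25 and 15 partitions into one, two and three blocks, matching a direct count by block sizes.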

3.5 Associated r–Whitney Numbers

We close with one more example in the same spirit as the previous one. As we mentioned earlier, a recent paper [13] combined two different restrictions imposed on set partitions. One concerns the sizes of parts (association). The other insists that specific elements are in different blocks of a partition. As we will see, our results apply to the number of blocks in such partitions. As far as we know, the number of blocks in such partitions has not been considered before.

Following [7, 9] we say that a set partition is a Whitney colored r–partition with m colors if it is a partition of $[r+n]$ such that:

(i)
the numbers $1,\dots , r$ are in different blocks,
(ii)
the smallest elements of the blocks are not colored,
(iii)
elements in blocks containing $1,\dots , r$ are not colored,
(iv)
the remaining elements are colored with m colors.

Elements $1,\dots ,r$ are called distinguished and the blocks containing them are called distinguished blocks. All other blocks and elements are called non–distinguished. In [13], the s–associated r–Dowling numbers $(D_{n,m,r}^{\ge s})$ are defined and some of their properties are studied. Combinatorially, $D_{n,m,r}^{\ge s}$ is the number of Whitney colored r–partitions with m colors with the property that each non–distinguished block contains at least s elements. In analogy with Dowling numbers being the row sums of Whitney numbers, we let the s–associated r–Whitney number $W_{n,k,m,r}^{\ge s}$ be the number of such partitions with k non–distinguished blocks. We will show that these numbers are asymptotically normal with the mean $n/\log n$ and the variance $n/\log ^2n$. As m, r and s are fixed we will write $W_{n,k}=W_{n,k,m,r}^{\ge s}$ through the rest of this section. The numbers $W_{n,k}$ satisfy the recurrence (see [13, Proof of Theorem 3] for an argument for the Dowling counterparts)

$$\begin{aligned}W_{n,k}=(r+mk)W_{n-1,k}+\left( {\begin{array}{c}n-1\\ s-1\end{array}}\right) m^{s-1}W_{n-s,k-1}.\end{aligned}$$

Indeed, the first term counts the instances in which $n+r$ is in one of the distinguished blocks or in a non–distinguished block of size larger than s. The second counts instances in which it is in a non–distinguished block of size s. (In the latter case one needs to choose any $s-1$ elements from $\{r+1,\dots ,r+n-1\}$ for that block and color all but the smallest one in $m^{s-1}$ ways.) Row generating polynomials

$$\begin{aligned} W_n(x):=\sum _{k=0}^nW_{n,k}x^k \end{aligned}$$

satisfy

$$\begin{aligned} W_n(x)=rW_{n-1}(x)+mxW_{n-1}'(x)+m^{s-1}\left( {\begin{array}{c}n-1\\ s-1\end{array}}\right) xW_{n-s}(x). \end{aligned}$$

The resulting PDE for the bivariate generating function F(z, x) takes a slightly different form than in the previous example. Namely,

$$\begin{aligned} \frac{\partial }{\partial z}F(z,x)-mx\frac{\partial }{\partial x}F(z,x)=\left( r+\frac{x}{(s-1)!}(mz)^{s-1}\right) F(z,x). \end{aligned}$$

Following the same steps as in that example gives

$$\begin{aligned} F(z,x)=\exp \Big \{rz+C_0+\frac{\xi }{m}\Big (-\sum _{j=0}^{s-1}\frac{(mz)^{j}}{j!}e^{-mz}+C_1\Big )\Big \}, \end{aligned}$$

where $\xi =xe^{mz}$ and $C_0$, $C_1$ are integration constants. The initial condition $F(0,x)=1$ leads to

$$\begin{aligned} F(z,x)=\exp \Big \{rz+\frac{x}{m}\Big (e^{mz}-\sum _{j=0}^{s-1}\frac{(mz)^{j}}{j!}\Big )\Big \}. \end{aligned}$$

The aforementioned asymptotic normality of $(W_{n,k})$ follows by the same asymptotic analysis as before with

$$\begin{aligned} Q_1(z,x)=rz-\frac{x}{m}\sum _{j=0}^{s-1}\frac{m^{j}}{j!}z^j\quad \text{ and } \quad Q_2(z)=\frac{z}{m} \end{aligned}$$

in (11).

We note that the univariate exponential generating function of the sequence $(D_{n,m,r}^{\ge s})$ is

$$\begin{aligned} F(z,1)=\exp \Big \{rz+\frac{1}{m}\Big (e^{mz}-\sum _{j=0}^{s-1}\frac{(mz)^{j}}{j!}\Big )\Big \}, \end{aligned}$$

as was derived in [13, Theorem 2] by a different method.
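The consistency between the recurrence for $W_{n,k}$ and the generating function $F(z,1)$ can be checked numerically. In the sketch below (function names are ours) the row sums of the triangle are compared with $n!\,[z^n]F(z,1)$; the special case $s=1$, $r=1$, $m=2$ reduces to the Whitney numbers of Sect. 3.2.

```python
from fractions import Fraction
from math import comb, factorial

def whitney_rows(m, r, s, n_max):
    """W[n][k] from W_{n,k} = (r + m k) W_{n-1,k} + C(n-1, s-1) m^{s-1} W_{n-s,k-1}."""
    W = [[0] * (n_max + 1) for _ in range(n_max + 1)]
    W[0][0] = 1
    for n in range(1, n_max + 1):
        for k in range(n + 1):
            val = (r + m * k) * W[n - 1][k]
            if k >= 1 and n >= s:
                val += comb(n - 1, s - 1) * m ** (s - 1) * W[n - s][k - 1]
            W[n][k] = val
    return W

def egf_row_sums(m, r, s, n_max):
    """n! [z^n] exp{r z + (1/m)(e^{mz} - sum_{j<s} (mz)^j / j!)} for n = 0..n_max."""
    g = [Fraction(0)] * (n_max + 1)
    for i in range(1, n_max + 1):
        if i == 1:
            g[i] += r                                      # the term r z
        if i >= s:
            g[i] += Fraction(m ** (i - 1), factorial(i))   # (1/m) (m z)^i / i!
    F = [Fraction(1)] + [Fraction(0)] * n_max
    for n in range(1, n_max + 1):
        # exp of a power series: n F_n = sum_k k g_k F_{n-k}
        F[n] = sum(k * g[k] * F[n - k] for k in range(1, n + 1)) / n
    return [factorial(n) * F[n] for n in range(n_max + 1)]

for (m, r, s) in [(2, 1, 2), (3, 2, 3)]:
    W = whitney_rows(m, r, s, 8)
    print((m, r, s), [sum(row) for row in W] == egf_row_sums(m, r, s, 8))
```

With `m=1, r=0, s=1` the row sums are the Bell numbers, as expected for unrestricted set partitions.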

3.6 Final Comments

Recurrences of type (2) are very common in combinatorial probability. In the vast majority of cases $c_n(x)=0$ and $b_n(x)=b(x)$, so the recurrence simplifies to

$$\begin{aligned} P_n(x)=a_n(x)P_{n-1}(x)+b(x)P'_{n-1}(x). \end{aligned}$$

Paper [17] comprehensively treated the case $a_n(x)=\alpha (x)n+\gamma (x)$ with $\alpha (1)>0$ and $b_n(x)=(1-x)\beta (x)$. The present work covers the situation $a_n(x)=\gamma (x)$ and $b(x)=mx$, $m>0$. However, there are examples of recurrences of the above type with the coefficients $a_n(x)$ and b(x) of a different form than those just mentioned. Thus, it would seem worthwhile to study such recurrences for other sequences of interest. For some cases this should be rather straightforward; for others it might be more challenging. In particular, for the method of moments to work well it is very important that the condition $b(1)=0$ holds. The approach based on the characteristics is less sensitive to that requirement. However, its drawback is that the resulting PDE might not have a closed form solution. In such cases one would have to work with an implicitly defined function F(z, x) (see [17, Sections 3.1 and 5.1–5.3] for some discussion of that aspect) or develop other approaches to handle such cases.

Another possible direction for research is to consider more general forms for the terms $c_n(x)P_{n-2}(x)$. (In fact, the last two examples are of that type.) It is unclear, however, how common such recurrences are.

Data availability

Not applicable to this article as no datasets were generated or analyzed during the current study.

References

1. Belbachir, H., Bousbaa, I.E.: Translated Whitney and $r$-Whitney numbers: a combinatorial approach. J. Integer Seq. 16(8), 13.8.6–13.8.7 (2013)
2. Benoumhani, M.: On Whitney numbers of Dowling lattices. Discrete Math. 159, 3–33 (1996). https://doi.org/10.1016/0012-365X(95)00095-E
3. Benoumhani, M.: On some numbers related to Whitney numbers of Dowling lattices. Adv. Appl. Math. 19(1), 106–116 (1997). https://doi.org/10.1006/aama.1997.0529
4. Benoumhani, M.: Log-concavity of Whitney numbers of Dowling lattices. Adv. Appl. Math. 22(2), 186–189 (1999). https://doi.org/10.1006/aama.1998.0621
5. Bóna, M., Mező, I.: Real zeros and partitions without singleton blocks. Eur. J. Combin. 51, 500–510 (2016). https://doi.org/10.1016/j.ejc.2015.07.021
6. Broder, A.Z.: The $r$-Stirling numbers. Discrete Math. 49, 241–259 (1984). https://doi.org/10.1016/0012-365X(84)90161-4
7. Cheon, G.-S., Jung, J.-H.: $r$-Whitney numbers of Dowling lattices. Discrete Math. 312, 2337–2348 (2012). https://doi.org/10.1016/j.disc.2012.04.001
8. Comtet, L.: Advanced Combinatorics. Reidel (1974). ISBN: 90-277-0441-4
9. Corcino, R.B., Corcino, C.B., Aldema, R.: Asymptotic normality of the $(r,\beta )$-Stirling numbers. Ars Combin. 81, 81–96 (2006)
10. Dominici, D., Driver, K., Jordaan, K.: Polynomial solutions of differential-difference equations. J. Approx. Theory 163, 41–48 (2011). https://doi.org/10.1016/j.jat.2009.05.010
11. Dowling, T.A.: A class of geometric lattices based on finite groups. J. Combin. Theory Ser. B 14, 61–86 (1973). Erratum: J. Combin. Theory Ser. B 15, 211 (1973)
12. Flajolet, P., Sedgewick, R.: Analytic Combinatorics. Cambridge University Press (2009). ISBN: 978-0-521-89806-5. https://doi.org/10.1017/CBO9780511801655
13. Gyimesi, E., Nyul, G.: Associated $r$-Dowling numbers and some relatives. C. R. Math. Acad. Sci. Paris 359, 47–55 (2021). https://doi.org/10.5802/crmath.145
14. Harper, L.H.: Stirling behaviour is asymptotically normal. Ann. Math. Statist. 38, 410–414 (1967). https://doi.org/10.1214/aoms/1177698956
15. Hayman, W.K.: A generalisation of Stirling’s formula. J. Reine Angew. Math. 196, 67–95 (1956). https://doi.org/10.1515/crll.1956.196.67
16. Hitczenko, P., Lohss, A.: Probabilistic consequences of some polynomial recurrences. Random Struct. Alg. 53, 652–666 (2018). https://doi.org/10.1002/rsa.20820
17. Hwang, H.K., Chern, H.H., Duh, G.-H.: An asymptotic distribution theory for Eulerian recurrences with applications. Adv. Appl. Math. 112, 101960 (2020). https://doi.org/10.1016/j.aam.2019.101960
18. John, F.: Partial Differential Equations, 4th edn. Springer (1982). ISBN: 0-387-90609-6. https://doi.org/10.1007/978-1-4684-9333-7
19. Lang, W.: On generating functions of diagonals sequences of Sheffer and Riordan number triangles. Preprint at arXiv:1708.01421 (2017)
20. Lang, W.: On sums of powers of arithmetic progressions, and generalized Stirling, Eulerian and Bernoulli numbers. Preprint at arXiv:1707.04451 (2017)
21. Liu, L.L., Wang, Y.: A unified approach to polynomial sequences with only real zeros. Adv. Appl. Math. 38, 542–560 (2007). https://doi.org/10.1016/j.aam.2006.02.003
22. Mező, I.: The $r$-Bell numbers. J. Integer Seq. 14, 11.1.1-11.1.14 (2011)
23. Neuwirth, E.: Recursively defined combinatorial functions: extending Galton’s board. Discrete Math. 239, 33–51 (2001). https://doi.org/10.1016/S0012-365X(00)00373-3
24. OEIS Foundation Inc. (2024). The On-Line Encyclopedia of Integer Sequences. Published electronically at https://oeis.org
25. Suter, R.: Two analogues of a classical sequence. J. Integer Seq. 3, 00.1.8 (2000)
26. Wang, D.G.L.: On colored set partitions of type $B_n$. Cent. Europ. J. Math. 12(9), 1372–1381 (2014). https://doi.org/10.2478/s11533-014-0419-9

Author information

Authors and Affiliations

Department of Mathematics, Drexel University, 3141 Chestnut Street, Philadelphia, PA, 19104, USA
Paweł Hitczenko

Corresponding author

Correspondence to Paweł Hitczenko.

Ethics declarations

Conflict of interest

The author has no relevant financial or non-financial interests to disclose.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Hitczenko, P. A Class of Polynomial Recurrences Resulting in $({\text {n}}/{\text {log}}\,\,{\text {n}}, {\text {n}}/{\text {log}}^2{\text {n}})$–Asymptotic Normality. La Matematica (2024). https://doi.org/10.1007/s44007-024-00126-w

Received: 27 December 2022
Revised: 26 February 2024
Accepted: 02 July 2024
Published: 15 July 2024
DOI: https://doi.org/10.1007/s44007-024-00126-w
