Celebratio Mathematica

by Thomas M. Liggett

Thomas Liggett’s last paper was unpublished at the time of his death in 2020. The text that follows is an edited draft of that paper, prepared by Wenpin Tang, who also provided the annotations throughout. Note: Strong Rayleigh random variables are also called Poisson binomial random variables. Some results in this paper are presented in Section 4 of the survey article of W. Tang and F. Tang (“The Poisson binomial distribution — old & new” (2019), arXiv:1908.10024), which provides further references.

1. Introduction

Suppose that \begin{equation}\label{polyf} f(u)=\sum_{i=0}^nc_iu^i \end{equation} is a polynomial with positive coefficients \( c_i > 0 \).1 Then:

\( f \) is said to be strong Rayleigh (SR) if all of its roots are real (and hence negative). This is equivalent to saying that \( f \) can be factored into polynomials of degree 1 with positive coefficients.
\( f \) is said to be Hurwitz (H) if all of its roots have negative real part. Since \[ (u-z)(u-\bar z)=u^2-2u\operatorname{Re}(z)+|z|^2, \] this is equivalent to saying that \( f \) can be factored into polynomials of degree at most 2 with positive coefficients.

In this paper we consider the following question: When can \( f \) be factored into polynomials of degree at most \( j \) with positive coefficients? We will call this property \( P_j \).

One motivation for this is the following: Except for the normalization \( f(1)=1 \), the \( f \) in \eqref{polyf} is the probability generating function (pgf) \( Eu^X \) of a random variable \( X \) with distribution \( c_i=P(X=i) \). If it2 has property \( P_j \), then there are independent random variables \( X_i \) with values in \( \{0,1,\dots, j\} \) so that \( X \) and \( \sum_iX_i \) have the same distribution. This property was exploited in [◊] in the case \( j=1 \) in order to obtain various probabilistic limit theorems.

Another motivation comes from [◊]. In our consideration of multivariate central limit theorems for SR vectors, we raised the following question: if \( X \) has an SR pgf and \( j,k\geq 1 \), how well can one approximate \( \frac jk X \) by an SR random variable? We were able to solve this problem for \( j=1 \), showing that \( \bigl\lfloor \frac 1k X\bigr\rfloor \) is SR. It turns out that \( \bigl\lfloor\frac 2k X\bigr\rfloor \) is very far from being SR, in the sense that the imaginary parts of some roots of the pgf are large. Nevertheless, we will see that a combination of the results in [◊] with the Hermite–Biehler theorem (see [◊] for example) implies that \( \bigl\lfloor\frac 2k X\bigr\rfloor \) is H. We then speculate about possible extensions of this to \( j\geq 3 \). For the applications we have in mind, this property is generally as useful as SR.

A sufficient condition for SR is that the coefficients of \( f \) satisfy \begin{equation}\label{SSR} c_i^2 > 4c_{i-1}c_{i+1},\quad 1\leq i\leq n-1; \end{equation} see [◊]. A sufficient condition for H if \( n > 5 \) is that the coefficients of \( f \) satisfy \begin{equation}\label{SHB} c_i c_{i+1}\geq (2.147\dots)c_{i-1}c_{i+2},\quad 1\leq i\leq n-2; \end{equation} see [◊].

These sufficient conditions are quite restrictive. For example, if \( f(u)=(u+1)^n \), so that \( c_i=\binom ni \), then \[ \frac {c_i^2}{c_{i-1}c_{i+1}}=\frac{i+1}i\frac{n-i+1}{n-i}. \] Therefore \[ \lim_{i\rightarrow\infty}\lim_{n\rightarrow\infty}\frac {c_i^2}{c_{i-1}c_{i+1}}=1, \] so \eqref{SSR} fails for large \( n \).

The Hermite–Biehler theorem gives an NASC3 for H:

Theorem 1: (Hermite–Biehler)4 Given a polynomial \( f \) as in \eqref{polyf} with positive coefficients, write \[ f(u)=\sum_{i=0}^{1}u^ih_i(u^2). \] Then \( f \) has property \( P_2 \) if and only if the roots of \( h_0,h_1 \) are negative and interlace, with the largest of these being a root of \( h_0 \).

By analogy with this result, we consider the following: Given a polynomial \( f \) as in \eqref{polyf} with positive coefficients, write \begin{equation}\label{HBext}f(u)=\sum_{i=0}^{j-1}u^ih_i(u^j).\end{equation} We will say that \( f \) has property \( Q_j \) if the roots of \( h_0,h_1,\dots, h_{j-1} \) are negative and interlace, with the largest being a root of \( h_0 \), the second largest being a root of \( h_1 \), etc.

Note that \( Q_1=P_1 \). The Hermite–Biehler theorem is the statement that \( Q_2=P_2 \). However, \( Q_3\neq P_3 \) as the following example shows. Let \[ f(u)=u^5+u^4+u^3+2u^2+\textstyle\frac32 u+\frac13. \] By definition \eqref{HBext}, we have \begin{equation*} h_0(u) = u + \textstyle\frac{1}{3}, \quad h_1(u) = u + \frac{3}{2}, \quad h_2(u) = u+2. \end{equation*} The roots of \( f \) are \( z_1 \), \( \bar z_1 \), \( z_2 \), \( \bar z_2 \), \( w \), where the (rounded) values are \[ z_1=-.725+.100i,z_2=.435+1.137i, w=-.420. \] Then \[ (u-w)(u-z_2)(u-\bar z_2)=.623+1.116u-.449u^2+u^3, \] so \( f \) is not \( P_3 \). However, the roots of \( h_0,h_1,h_2 \) are \( -\frac13 \), \( -\frac32 \), \( -2 \) respectively, which do interlace correctly. We are not aware of a useful criterion for \( P_j \) for \( j\geq 3 \).

In the following section, we prove that \( \bigl\lfloor\frac2kX\bigr\rfloor \) is \( P_2 \)5 if \( X \) is \( P_1 \), and speculate about the likely situation for \( \bigl\lfloor\frac3kX\bigr\rfloor \). In particular, we conjecture that \( \bigl\lfloor\frac34X\bigr\rfloor \) is \( P_3 \). \( \bigl\lfloor\frac35X\bigr\rfloor \) is not \( P_3 \), but we conjecture that it is nearly so, in that the pgf can be factored into polynomials of degree at most 3 with at most one of the factors having a negative coefficient. We will call this situation almost \( P_3 \). More generally, it may be the case that \( \bigl\lfloor\frac3kX\bigr\rfloor \) is \( P_3 \) if \( k\equiv1 \pmod{3} \) and almost \( P_3 \) if \( k\equiv2 \pmod{3} \).

2. Some examples

Suppose the roots of \( h_0,h_1,h_2 \) are negative and interlace. Let these roots be denoted by \[ t_m < t_{m-1} < \dots < t_0 < 0. \] Then \[ f(u)=h_0(u^3)+uh_1(u^3)+u^2h_2(u^3), \] where \( h_0 \) has roots \( t_0,t_3,\dots, \) and \( h_1 \) has roots \( t_1,t_4,\dots, \) and \( h_2 \) has roots \( t_2,t_5,\dots. \) The degree of \( f \) is \( m+3 \).

We begin with the case \( m=1 \), so \[ h_0(y)=a(y-t_0),\quad h_1(y)=b(y-t_1),\quad h_2(y)=c, \] with \( a,b,c > 0 \). Then \begin{equation}\label{fourth}f(u)=-at_0-bt_1u+cu^2+au^3+bu^4.\end{equation} If we write \[ f(u)=(u+d)(c_3u^3+c_2u^2+c_1u+c_0) \] and solve, we get \[ c_3=b,\quad c_2=a-bd,\quad c_1=c-ad+bd^2,\quad c_0=-bt_1-dc+ad^2-bd^3, \] where \( f(-d)=0 \). Now we seek the condition under which \( f \in P_3 \). For the coefficients to be nonnegative, we need \[ b \geq0,\quad a-bd\geq0,\quad c-ad+bd^2\geq0,\quad -bt_1-dc+ad^2-bd^3\geq 0. \] Writing \[ f(-d)=d(bd^3-ad^2+cd+bt_1)-at_0=d^2(bd^2-ad+c)-(-bdt_1+at_0), \] we see that these inequalities are equivalent to \[ a-bd\geq0,\quad -bt_1d+at_0\geq0. \]

A necessary and sufficient condition for the existence of a \( d > 0 \) so that6 \[ f(-d)=0,\quad\frac{at_0}{bt_1}\leq d\leq \frac ab,\] is that \( f \) take on opposite signs (including 0) at two points in the interval \[ \bigg(-\frac{at_0}{bt_1},-\frac ab\bigg). \] To find a useful sufficient condition, write \[ b^3f\bigg(-\gamma\frac ab\bigg)=a^2bc\gamma^2-ab^3(-\gamma t_1+t_0)-\gamma^3(1-\gamma)a^4. \] If \( \gamma=1 \), this is \[ a b[ac-b^2(-t_1+t_0)]. \] This is \( \geq0 \) if and only if \( -t_1+t_0\leq ac/b^2 \). It is \( \leq 0 \) if \( \gamma t_1\leq t_0 \) and \( bc\leq \gamma(1-\gamma)a^2 \). Therefore, a sufficient condition for the existence of the required \( d \) is that there exist a \( \gamma\in [0,1] \) so that \begin{equation}\label{suff4}t_0-t_1\leq\frac{ac}{b^2},\quad \gamma t_1\leq t_0,\quad\text{and}\quad bc\leq\gamma(1-\gamma)a^2.\end{equation}

When is \( Q_3 \) satisfied? If \( n=3 \), \( P_3 \) and \( Q_3 \) are both automatic. If \( n=4 \), \( Q_3 \) is equivalent to \( c_1c_3\geq c_0c_4 \).7 If \( f \) is \( P_3 \) but not \( P_2 \), then \[ \eqalign{ f(u)&=(a_0+a_1u)(b_0+b_1u+b_2u^2+b_3u^3), \cr c_1c_3-c_0c_4&=a_1^2b_0b_2+a_0a_1b_1b_2+a_0^2b_1b_3, } \] so \( f \) is \( Q_3 \). However, \( P_2 \) does not imply \( Q_3 \). For example, \( f(u)=(1+u^2)^2 \) is \( P_2 \) but not \( Q_3 \), since \( c_1c_3-c_0c_4=-1 \). More generally, if \[ f(u)=(a_0+a_1u+a_2u^2)(b_0+b_1u+b_2u^2), \] then \[ c_1c_3-c_0c_4=(a_0b_1+a_1b_0)(a_1b_2+a_2b_1)-a_0a_2b_0b_2. \] This is nonnegative if \( a_0a_2\leq 4a_1^2 \), \( b_0b_2\leq 4b_1^2 \).8

If \( n=5 \), \( Q_3 \) is equivalent to \( c_1c_3\geq c_0c_4 \) and \( c_2c_4\geq c_1c_5 \).9 If \[ f(u)=(a_0+a_1u+a_2u^2)\bigl(\textstyle\frac{25}2+\frac12 u^2+u^3\bigr), \] then \[ c_1c_3-c_0c_4=\textstyle\frac{25}2(a_1^2-a_0a_2). \] In this case, \( f \) is \( P_3 \) but not \( P_2 \), but it may not be \( Q_3 \).

3. The approximations

We begin by showing that in general, \( X \) being SR does not imply that \( \bigl\lfloor\frac{2}3X\bigr\rfloor \) is SR, and in fact that this is very far from being the case.

Theorem 2: Suppose \( X \) is \( B\bigl(3n,\frac12\bigr) \), and \( z_i \) are the roots of the pgf of \( \bigl\lfloor\frac{2}3X\bigr\rfloor \). Then \[ \max [\operatorname{Im}(z_i)]^2\geq\textstyle\frac12 (9n^2-9n-1). \]

Proof. Up to a constant multiple, the pgf of \( \bigl\lfloor\frac{2}3X\bigr\rfloor \) is \[ f(u)=\sum_{i= 0}^{3n}\binom {3n}iu^{\lfloor 2i/3\rfloor}=\sum_{i\geq 0}\binom{3n+1}{3i+1}u^{2i}+\sum_{i\geq 0}\binom{3n}{3i+2}u^{2i+1}, \] which has degree \( d=2n \). Let \( z_1,z_2,\dots \) be the complex roots with positive imaginary part and \( w_1,w_2,\dots \) be the real roots. Then \[ f(u)=\prod_j(u^2-2u\operatorname{Re}(z_j)+|z_j|^2)\prod_j(u-w_j). \] Both of these expressions have \( u^d \) coefficients equal to 1. Identifying the coefficients of \( u^{d-1} \) gives \begin{equation}\label{id1} -2\sum_j\operatorname{Re}(z_j)-\sum_jw_j=3n. \end{equation} Identifying the coefficients of \( u^{d-2} \) gives \[ 4\sum_{i < j}\operatorname{Re}(z_i)\operatorname{Re}(z_j)+\sum_j|z_j|^2 +2\sum_j\operatorname{Re}(z_j)\sum_iw_i+\sum_{i < j}w_iw_j=\binom{3n+1}{3n-2}. \] Combining this with \eqref{id1} leads to \[ \sum_i[\operatorname{Im}(z_i)]^2-\sum_i[\operatorname{Re}(z_i)]^2-{\textstyle\frac12}\sum_iw_i^2 =\textstyle\frac12 n(9n^2-9n-1). \] Therefore, since there are at most \( n \) pairs of complex conjugate roots, \[ n\max [\operatorname{Im}(z_i)]^2\geq\sum_i[\operatorname{Im}(z_i)]^2\geq \textstyle\frac12 n(9n^2-9n-1). \quad ◻ \]

Here is the result from [◊] that we used to show that \( \bigl\lfloor \frac 1k X\bigr\rfloor \) is SR if \( X \) is SR. For a polynomial \( f \) of degree \( n \), write \begin{equation}\label{poly} f(x)=\sum_{i=0}^{k-1}x^ig_i(x^k), \end{equation} where \( g_i \) is a polynomial of degree \( \bigl\lfloor\frac{n-i}k\bigr\rfloor \).

Theorem 3: ([◊],Theorem 4.3) If \( f \) is SR with degree \( n \), the corresponding polynomials \( g_i \) are SR as well. Furthermore, their roots are interlaced in the sense that if the collection of all \( n-k+1 \) roots \( s_j \) of the \( g_i \)’s are placed in increasing order, \[ s_{n-k} < \cdots < s_4 < s_3 < s_2 < s_1 < s_0 < 0, \] then the roots of \( g_i \) are \( s_i \), \( s_{i+k} \), \( s_{i+2k}, \dots. \)

This statement is illustrated in the following array, in the case \( k=3 \): \[ \begin{pmatrix} &\cdots&s_{6}&s_{5}&s_{4}&s_{3}&s_{2}&s_{1}&s_0&0\\\\ g_0&\cdots&0&+&+&0&-&-&0&+\\ g_1&\cdots&+&+&0&-&-&0&+&+\\ g_2&\cdots&+&0&-&-&0&+&+&+ \end{pmatrix}. \] Note that each row is periodic of period 6, and each row is obtained from the previous row via a shift. The proof in [◊] is by induction on the degree of \( f \). The pgf of \( \bigl\lfloor\frac 1k X\bigr\rfloor \) is \( \sum_ig_i \), which alternates signs at the points \( s_{mk} \). Therefore, it has a root in each of the intervals \( (s_{(m+1)k},s_{mk}) \), thus showing that \( \bigl\lfloor \frac 1k X\bigr\rfloor \) is SR \( (=P_1) \).

Theorem 4: If \( X \) is \( P_1 \), then \( \bigl\lfloor\frac 2k X\bigr\rfloor \) is \( P_2 \).

Proof. It suffices to prove the result for \( k \) odd. If \( k \) is odd, the pgf of \( \bigl\lfloor \frac 2k X\bigr\rfloor \) is \begin{equation}\label{2x/k} \sum_{i=0}^{(k-1)/2}g_i(u^2)+u\sum_{i=(k+1)/2}^{k-1}g_i(u^2). \end{equation} By the Hermite–Biehler theorem, all roots of \eqref{2x/k} have negative real parts if and only if all the roots of \[ \sum_{i=0}^{(k-1)/2}g_i(-u^2)\quad\text{and}\quad\sum_{i=(k+1)/2}^{k-1}g_i(-u^2) \] are negative and interlace, with the largest of these being a root of \[ \sum_{i=0}^{(k-1)/2}g_i(-u^2). \]

If \( k=3 \), one can see from the above array that \[ g_0(\,\cdot\,)+g_1(\,\cdot\,)\text{ has a root in each of the intervals }\cdots (s_7,s_6),(s_4,s_3),(s_1,s_0) \] and \[ g_2(\,\cdot\,)\text{ has a root in each of the intervals }\cdots (s_9,s_7),(s_6,s_4),(s_3,s_1), \] since the values of the function at the two endpoints of each interval have opposite signs. Therefore, the negative roots interlace. For larger odd \( k \), the argument is similar. Now \[ \sum_{i=0}^{(k-1)/2}g_i(\,\cdot\,)\text{ has a root in each of the intervals }\cdots (s_{(k-1)/2+2k},s_{2k}),(s_{(k-1)/2+k},s_k),(s_{(k-1)/2},s_0) \] and \[ \sum_{i=(k+1)/2}^{k-1}g_i(\,\cdot\,)\text{ has a root in each of the intervals }\cdots (s_{3k-1},s_{(k+1)/2+2k}),(s_{2k-1},s_{(k+1)/2+k}),(s_{k-1},s_{(k+1)/2}). \] It follows that \( \bigl\lfloor \frac 2k X\bigr\rfloor \) is H \( (=P_2) \). ◻

The situation for the case \( \bigl\lfloor \frac 3k X\bigr\rfloor \), where \( k \) is not a multiple of 3, is similar. Its pgf is \[ h_0(u)+uh_1(u)+u^2h_2(u), \] where \[ h_0(u)=\sum_{i=0}^{\lfloor k/3\rfloor}g_i(u),\quad h_1(u)=\sum_{\lceil k/3\rceil}^{\lfloor 2k/3\rfloor}g_i(u),\quad h_2(u)=\sum_{i=\lceil 2k/3\rceil}^{k-1}g_i(u). \] Then \[ \eqalign{ &h_0\text{ has a root in each of the intervals } (s_{\lfloor k/3\rfloor+ik},s_{ik})\text{ for }i\geq 0, \cr &h_1\text{ has a root in each of the intervals } (s_{\lfloor 2k/3\rfloor+ik},s_{\lceil k/3\rceil+ik})\text{ for }i\geq 0, \cr &h_2\text{ has a root in each of the intervals } (s_{k-1+ik},s_{\lceil 2k/3\rceil+ik})\text{ for }i\geq 0. } \] In a few cases, the interval is of the form \( (a,a) \), which is interpreted as the singleton \( \{a\} \). So, the roots of \( h_0,h_1,h_2 \) are negative and interlace. Unfortunately, as was pointed out earlier, this property does not imply \( P_3 \).

The case of \( \bigl\lfloor \frac 3k X\bigr\rfloor \) is more challenging, since for each root with positive real part, one must find a negative root that compensates for it. It is not simply a matter of proving some property for each root, as was the case for \( P_1 \) and \( P_2 \). All we can offer at this point is speculation based on computations of roots in special cases. If \( f \) is \( P_3 \) but not \( P_2 \), \( f \) will have complex roots of positive real part. For a negative real root \( w \) and a conjugate pair of roots \( z,\bar z \) with positive real part, let \[ \langle z,w\rangle=(u-w)(u-z)(u-\bar z). \] For this cubic polynomial to have positive coefficients, it is necessary and sufficient that \[ z+\bar z\leq -w\leq\frac{z \bar z}{z+\bar z}. \]

Here is what seems to be the case. Let \( w_1,w_2,\dots \) be the real roots of \( f \) ordered in increasing order of their absolute values, and \( z_1,z_2,\dots \) be the roots with positive real and imaginary parts, ordered in increasing order of the real parts. Assume that \( X \) is SR and takes values \( 0,1,\dots,n \).

Let \( f \) be the pgf of \( \bigl\lfloor\frac 34X\bigr\rfloor \). The degree of \( f \) is \( d=\bigl\lfloor\frac34 n\bigr\rfloor \). Then \( f \) appears to be \( P_3 \) in general. The number of \( z_i \)’s and the number of \( w_i \)’s are close to \( \frac n4 \). This has been checked in a number of cases, including \( X\sim B(n,p) \) for \( n=54 \) and 75, and \( p=\frac1{100} \), \( \frac12 \) and \( \frac{100}{101} \).10 If \( X\sim B\bigl(n,\frac12\bigr) \), the pgf of \( \bigl\lfloor\frac 34X\bigr\rfloor \) is \( P_2 \) for \( n\leq 9 \). If \( 10\leq n\leq 30 \), it is \( P_3 \) and can be factored into \( \bigl\lfloor\frac n4\bigr\rfloor \) irreducible cubics, with an extra linear factor if \( d\equiv 1\mod 3 \) and an extra irreducible quadratic factor if \( d\equiv 2\bmod 3 \). Here irreducible means that it cannot be further factored into polynomials with positive coefficients. The appropriate pairing of real and complex roots is \( w_i \) with \( z_i \). This picture continues to hold for many \( X \)’s that are sums of independent Bernoulli random variables with randomly chosen parameters.

Here is some more detail when \( X \) is \( B\bigl(n,\frac12\bigr) \) and \( f \) is the pgf of \( \bigl\lfloor\frac 34X\bigr\rfloor \), omitting multiplicative constants: \[ \eqalign{ &n=2:\quad f(u)=u+3 \text{ is } P_1, \cr &n=3:\quad f(u)=u^2+3u+4 \text{ is } P_2, \cr &n=4:\quad f(u)=u^3+4u^2+6u+5=(u+a)(u^2+bu+c) \text{ is } P_2, \cr &\phantom{n=4:}\quad f(-a)=0,\quad b=4-a,\quad c=\textstyle\frac 5a,\quad f(0)=5,\quad f(-4)=-19, } \] so \( f(-a)=0 \) for some \( 0 < a < 4 \). Therefore \( f \) is \( P_2 \). It is not \( P_1 \) since the discriminant of the quadratic factor is \( < 0 \) for \( 0 < a < 4 \). The actual values are \( a=2.35 \), \( b=1.65 \), \( c=2.12 \). \[ \eqalign{ &n=5:\quad \textstyle f(u)=u^3+\frac53 u^2+\frac 53 u+1=(u+a)(u^2+bu+c) \text{ is } P_2, \cr &\phantom{n=5:}\quad f(-a)=0,\quad c=\textstyle\frac 1a,\quad b=\frac53-a,\quad f(0)=1,\quad f\bigl(-\frac 53\bigr)=-\frac{16}9, } \] so \( f(-a)=0 \) for some \( 0 < a < \frac53 \). Again, \( f \) is \( P_2 \) but not \( P_1 \). The actual values are \( a=1 \), \( b=\frac23 \), \( c=1 \). \[ \eqalign{ &n=6:\quad f(u)=u^4+21u^3+20 u^2+15 u+7=(u+a)(u+b)(u^2+cu+d) \text{ is } P_2, \cr &\phantom{n=6:}\quad f(-a)=f(-b)=0,\quad d=\textstyle\frac 7{ab},\quad c=21-a-b. } \] Computing \( f(x+iy) \) with \( y\neq 0 \) and setting it \( =0 \) leads to \[ f(x)-\textstyle \frac12 f^{\prime\prime}(x)y^2+y^4=0,\quad f^{\prime}(x)-\textstyle\frac16 y^2f^{\prime\prime\prime}(x)=0. \] Solving the second equation for \( y^2 \) and using that in the first equation leads to \[ f(x) [f^{\prime\prime\prime}(x)]^2-3f^{\prime\prime}(x)f^{\prime}(x)f^{\prime\prime\prime}(x)+36[f^{\prime}(x)]^2=0. \] Expanding this yields a polynomial in \( x \), all of whose coefficients are negative, so \( x < 0 \). It follows that \( f \) is \( P_2 \). The actual values are \( a=20.04, b=.66,c=.30,d=.53 \).

The first case that is \( P_3 \) is \( n=10 \). Now \[ \eqalign{ f(u)&=11 + 45 u + 120 u^2 + 462 u^3 + 210 u^4 + 120 u^5 + 55 u^6 + u^7 \cr& =(u+53)(u^3+.24u^2+.10u+.03)(u^3+1.96u^2+3.23u+7.69). } \] Setting \( f(x+iy)=0 \) for \( y\neq 0 \) gives \[ \sum_{j=0}^3\frac{(-1)^jf^{(2j)}(x)y^{2j}}{(2j)!}=0,\quad \sum_{j=0}^3\frac{(-1)^jf^{(2j+1)}(x)y^{2j}}{(2j+1)!}=0. \]

Consider small \( n \) with \( X \) general. Use \( e_j \) for the elementary symmetric functions.

For \( n=3 \), \[ f(u)=u^2e_0+ue_1+(e_2+e_3)\] is \( P_2 \).

For \( n=4 \), \[ f(u)=u^3e_0+u^2e_1+ue_2+(e_3+e_4) \] is \( P_3 \), but may not be \( P_2 \).

For \( n=5 \), \[ f(u)=u^3(e_0+e_1)+u^2e_2+ue_3+(e_4+e_5) \] is \( P_3 \), but may not be \( P_2 \).

For \( n=6 \), \[ f(u)=u^4e_0+u^3(e_1+e_2)+u^2e_3+ue_4+(e_5+e_6). \] Setting this equal to \[ (u+w)(u^3+a u^2+b u+c) \] with \( w\geq 0 \) gives \( f(-w)=0 \) and \[ \eqalign{ w^3a&=w^2e_3-we_4+e_5+e_6=w^3(e_1+e_2-e_0w), \cr w^2b&=we_4-e_5-e_6, \cr wc&=e_5+e_6. } \] So that \( a,b,c\geq 0 \), we need \[ \frac{e_5+e_6}{e_4}\leq w\leq\frac{e_1+e_2}{e_0}. \] Compute \[ \displaylines{ e_0^2f\biggl(-\frac{e_1+e_2}{e_0}\biggr)=e_1(e_1e_3-e_0e_4)+e_2(2e_1e_3-e_0e_4)+e_2^2e_3+e_0^2(e_5+e_6)\geq 0, \cr 16e_0^3f\biggl(-\frac{e_1+e_2}{2e_0}\biggr)=-e_1^4-4e_1^2(e_1^2-e_0e_3)-8e_0^2(e_1e_4-2e_0e_5), \cr -2e_1e_2(3e_1e_2-4e_0e_3)-8e_0^2(e_2e_4-2e_0e_6)-4e_2^2(e_1e_2-4e_0e_3)-e_2^4\leq 0. } \] Since \[ \frac{e_5+e_6}{e_4}\leq\frac{e_1+e_2}{2e_0}, \] it follows that \( f \) has a root \( -w \) which makes \( a,b,c\geq0 \). The inequalities above follow from the Newton inequalities \[ \frac16\frac{e_1}{e_0}\geq\frac 25\frac{e_2}{e_1}\geq\frac34\frac{e_3}{e_2}\geq\frac43\frac{e_4}{e_3}\geq\frac52\frac{e_5}{e_4}\geq\frac61\frac{e_6}{e_5}. \] It follows that \( f \) is \( P_3 \).

For \( n=7 \), \[ f(u)=u^5e_0+u^4e_1+u^3(e_2+e_3)+u^2e_4+ue_5+(e_6+e_7). \] Set this equal to \[ (u^2+au+b)(u^3+cu^2+du+g). \] Solving for the coefficients gives \[ \eqalign{ c&=e_1-a, \cr d&=a^2-b-ae_1+e_2+e_3, \cr g&=-a^3+2ab+a^2e_1-be_1-a(e_2+e_3)+e_4, } \] with \[ b^2-b[3a^2+(e_2+e_3)-2ae_1)]+a^4-a^3e_1+a^2(e_2+e_3)-ae_4+e_5=0 \] and \[ (e_1-2a)b^2+b[a^3-a^2e_1+a(e_2+e_3]-e_4]+(e_6+e_7)=0. \] Combining these two equations gives (and nothing is lost if \( e_1\neq 2a \)) \[ b=\frac{2a^5-3e_1a^4+[e_1^2+2(e_2+e_3)]a^3-[e_1(e_2+e_3)+2e_4]a^2+(e_1e_4+2e_5)a-[(e_1e_5-(e_6+e_7)]}{5a^3-6e_1a^2+[2e_1^2+(e_2+e_3)]a-[e_1(e_2+e_3)-e_4]}. \] Using this value of \( b \) in either of those equations implies that \( a \) must be a root of a degree-10 polynomial \( H(a) \). Also, letting \[ D=5a^3-6e_1a^2+[2e_1^2+(e_2+e_3)]a-[e_1(e_2+e_3)-e_4], \] we have \[ \eqalign{ Dd&=3a^5-8e_1a^4+[7e_1^2+4(e_2+e_3)]a^3-[2e_1^3+7e_1(e_2+e_3)-3e_4]a^2 \cr&\quad +[(e_2+e_3)(e_2+e_3+3e_1^2)-2(e_1e_4+e_5)]a \cr&\qquad -[e_1(e_2+e_3)^2-e_4(e_2+e_3)-e_1e_5+(e_6+e_7)] } \] and \[ \eqalign{ Dg&=-a^6+3e_1a^5-[3e_1^2+2(e_2+e_3)]a^4+[e_1^3+4e_1(e_2+e_3)]a^3 \cr&\quad -[(e_2+e_3)(e_2+e_3+2e_1^2)+(e_1e_4-4e_5)]a^2+[e_1(e_2+e_3)^2+e_1^2e_4-4e_1e_5+2(e_6+e_7)]a \cr&\qquad -[e_1e_4(e_2+e_3)-e_4^2-e_1^2e_5+e_1(e_6+e_7)]. } \] Using PolynomialReduce, \[ H(a)=[a^4-a^3e_1+a^2(e_2+e_3)-ae_4+e_5]Dg-\textstyle\frac 13(e_6+e_7)[13Dd+K(a)], \] where \[ \eqalign{ K(a)&=29e_1a^4-2[20e_1^2+17(e_2+e_3)]a^3+[14e_1^3+61(e_2+e_3)-24e_4]a^2 \cr&\quad -[27e_1^2(e_2+e_3)+10(e_2+e_3)^2-17e_1e_4-20e_5]a \cr&\qquad +10[e_1(e_2+e_3)^2-e_4(e_2+e_3)-e_1e_5+e_6+e_7]. } \] Now the Newton inequalities are \[ \frac17\frac{e_1}{e_0}\geq\frac 26\frac{e_2}{e_1}\geq\frac35\frac{e_3}{e_2}\geq\frac44\frac{e_4}{e_3}\geq\frac53\frac{e_5}{e_4}\geq\frac62\frac{e_6}{e_5}\geq\frac71\frac{e_7}{e_6}. \]

Unfortunately, the pgf of \( f \) \( \bigl\lfloor\frac 35X\bigr\rfloor \) is not necessarily \( P_3 \). Take \( X \) to be \( B\bigl(n,\frac12\bigr) \). For \( n\leq 16 \), \( f \) is \( P_2 \), and for \( 17\leq n\leq 20 \) is \( P_3 \). For \( n=21 \), \( d=12 \), and there are 4 \( w_i \)’s and 4 \( z_j \)’s. For \( f \) to be \( P_3 \), there would have to be a pairing of the \( w_i \)’s and \( z_j \)’s so that for each such pair, \( \langle z_i,w_j\rangle \) has positive coefficients. But, no such pairing exists, since \( \langle w_1,z_i\rangle \) has a negative quadratic coefficient for \( i=1,2,3,4 \), so \( f \) is not \( P_3 \). However, \( \langle z_i,w_i\rangle \) and \( \langle z_{i-1},w_i\rangle \) have positive coefficients for \( i=2,3,4 \), so one can choose a factorization of \( f \) so that only one factor has a negative coefficient. Note that it is not \( P_4 \) either. It is \( P_6 \), since \[ (u-w_1)(u-w_2)(u-z_1)(u-\bar z_1)(u-z_2)(u-\bar z_2) \] has positive coefficients. This may or may not be useful.

However, all is not lost, due to the following result.

Proposition 5: Suppose \( U \) and \( V \) are nonnegative integer-valued random variables with \( P(U=m)=p_m \) and \( P(V=m)=q_m \) whose pgf’s satisfy \[ Eu^U=Eu^V(au^3+bu^2+cu+d). \] If \( b+c+d\geq 0 \), \( c+d\geq 0 \), \( d\geq 0 \), then \( U,V \) can be coupled so that \( U\leq V+3 \) and then \[ E(V+3-U)=b+2c+3d. \]

Proof. The distributions of \( U,V \) satisfy \[ p_m=aq_{m-3}+bq_{m-2}+cq_{m-1}+dq_m. \] Since \( a+b+c+d=1 \), summing gives \[ P(U\leq m)=P(V\leq m-3)+(b+c+d)q_{m-2}+(c+d)q_{m-1}+dq_m. \] Therefore, if \( b+c+d\geq 0 \), \( c+d\geq 0 \), \( d\geq 0 \), \( U \) and \( V \) can be coupled so that \( U\leq V+3 \) almost surely.11 Computing the first moment, we see that \[ \eqalign{ EU&=\sum_m m[aq_{m-3}+bq_{m-2}+cq_{m-1}+dq_m] \cr& =a(EV+3)+b(EV+2)+c(EV+1)+dEV=EV+(3a+2b+c), } \] so using again \( a+b+c+d=1 \), we conclude that \[ E(V+3-U)=b+2c+3d. \quad ◻ \]

For the above example, \( X\sim B\bigl(21,\frac12\bigr) \), we indicate the situation below, with the \( (i,j) \) entry corresponding to \( \langle z_i,w_j\rangle \). If the entry is \( + \), all coefficients of \( \langle z_i,w_j\rangle \) are positive. Otherwise, the entry indicates which coefficient is negative, and gives the value of \( b+2c+3d \): \[ \begin{pmatrix} b;.036&+&c;.48&c; .92\\ b;.61\phantom{0}&+&+&c;1.5\\ b;1.80&b;1.99&+&+\\ b;2.01&b;2.20&+&+ \end{pmatrix}. \] Writing \[ f=\prod_{i=1}^4\langle z_i,w_i\rangle, \] we see that there is a coupling of \( \bigl\lfloor\frac35X\bigr\rfloor \) with \( V \), a sum of three independent random values with values \( 0,1,2,3 \), so that \( \bigl\lfloor\frac35X\bigr\rfloor\leq V+3 \) and \[ \textstyle E\bigl(V+3-\bigl\lfloor\frac35X\bigr\rfloor\bigr)=.035\dots. \]

Let \( f \) be the pgf of \( \bigl\lfloor\frac 35X\bigr\rfloor \) where \( X \) is a general SR random variable taking values \( 0,1,\dots,n \). The degree of \( f \) is \( d=\bigl\lfloor\frac35 n\bigr\rfloor \). Then the following appears to be the case. There is a (not in general unique) index \( m \) so that \( \langle z_i,w_{i+1}\rangle \) has positive coefficients if \( i < m \) and \( \langle z_i,w_{i}\rangle \) has positive coefficients if \( i > m \). Then the proposition can be applied to the case \( \langle z_m,w_1\rangle \). In many cases, this partition of the \( z_i \)’s is not necessary, and \( f \) is actually \( P_3 \). This appears to be the case, for example, if \( X \) is \( B(21,p) \) for \( p \) close to 0 or 1. In one of these cases, \( \langle z_i,w_{i+1}\rangle \) has positive coefficients for all \( i \), and in the other \( \langle z_i,w_{i}\rangle \) has positive coefficients for all \( i \).

For \( 21\leq n\leq 26 \), one can choose all but one factor to have positive coefficients, and the (normalized) cubics corresponding to \( w_1 \) are \[ \eqalign{ n&=21:\quad .015+.984u-.002u^2+.003u^3, \cr n&=22:\quad .013+.990u-.011u^2+.007u^3, \cr n&=23:\quad .012+.997u-.024u^2+.015u^3, \cr n&=24:\quad .011+1.003u-.040u^2+.026u^3, \cr n&=25:\quad .011+1.007u-.059u^2+.042u^3, \cr n&=26:\quad .010+1.008u-.0819u^2+.063u^3. } \]

Here is a larger example, with \( X \) \( B\bigl(33,\frac12\bigr) \), \( d=19 \). Now there are 7 \( w_i \)’s and 6 \( z_i \)’s: \[ \begin{pmatrix} - & + & - & - & - & - & -\\ - & b & + & c & - & - & -\\ b & b & + & + & c & c & c\\ b & b & b & + & + & c & c\\ b & b & b & b & + & c & c\\ b & b & b & b & b & + & c \end{pmatrix}. \] Now, \( + \) means that all four coefficients are positive, \( b \) or \( c \) means that the coefficient is negative, but the hypotheses of the proposition are satisfied, and \( - \) means that the hypotheses of the proposition are not satisfied at all. In the context of the last paragraph, the possible values of \( m \) are 2, 3, 4, 5.

From the examples we have looked at, it appears that the pgf of \( \bigl\lfloor\frac 3kX\bigr\rfloor \) “usually” satisfies \( P_3 \), and when it does not, it can be factored into cubics that have positive coefficients except for one. That one can be treated using the proposition. Typically, the way this arises is that \( \langle z_i,w_i\rangle \) has positive coefficients for one range of \( i \)’s, and \( \langle z_i,w_{i+1}\rangle \) has positive coefficients for a complementary range of \( i \)’s. The exceptional factor appears in a transition from one range to the other.

Write \( z_i\sim w_j \) to mean that \( \langle z_i,w_j\rangle \) has positive coefficients. Take \( X \) to be \( B\bigl(n,\frac12\bigr) \).

If \( n=33 \), \( d=19 \), and \( \bigl\lfloor\frac 35X\bigr\rfloor \) satisfies \[ \eqalign{ z_i&\sim w_{i+1}&\quad&\text{for }1\leq i\leq 4, \cr z_i&\sim w_{i}&\quad&\text{for }3\leq i\leq 6. } \] If \( n=51 \), \( d=30 \), and \( \bigl\lfloor\frac 35X\bigr\rfloor \) satisfies \[ \eqalign{ z_i&\sim w_{i+1}&\quad&\text{for }1\leq i\leq 6, \cr z_i&\sim w_{i}&\quad&\text{for }4\leq i\leq 10. } \] In neither case is \( P_3 \) satisfied.

If \( n=105 \), \( d=45 \), and \( \bigl\lfloor\frac 37X\bigr\rfloor \) satisfies \[ \eqalign{ z_i&\sim w_{i+1}&\quad&\text{for }i=14, \cr z_i&\sim w_{i}&\quad&\text{for }1\leq i\leq 14, \cr z_i&\sim w_{i-1}&\quad&\text{for }i=14,15, } \] so \( P_3 \) is satisfied. We do not have a counterexample to \( P_3 \) for \( \bigl\lfloor\frac 37X\bigr\rfloor \).

If \( n=105 \), \( d=39 \), and \( \bigl\lfloor\frac 38X\bigr\rfloor \) satisfies \[ \eqalign{ z_i&\sim w_{i+1}&\quad&\text{for }1\leq i\leq 9, \cr z_i&\sim w_{i}&\quad&\text{for }5\leq i\leq 13, \cr z_i&\sim w_{i-1}&\quad&\text{for }i=13, } \] so \( P_3 \) is not satisfied.

4. The sufficient conditions

Motivated by the sufficient conditions \eqref{SSR} and \eqref{SHB}, we consider polynomials \[ f(u)=\sum_{i=0}^nc_iu^i \] satisfying12 \[ c_i^2=ac_{i-1}c_{i+1},\quad c_0=c_n. \] Then for \( n=2 \) \[ f\text{ is } \begin{cases} P_1&\text{if }a\geq 4,\\ P_2&\text{if }a < 4. \end{cases} \] If \( n=3 \), \[ f(z)=c_0(1+az+az^2+z^3)=c_0(z+1)(z^2+(a-1)z+1), \] so \[ f\text{ is } \begin{cases} P_1&\text{if }a\geq 3,\\ P_2&\text{if }1\leq a < 3,\\ P_3&\text{if }a < 1. \end{cases} \] At the transition cases, except for the factor of \( c_0 \), \[ f(z)= \begin{cases} (z+1)^3&\text{if }a=3,\\ (z+1)(z^2+1)&\text{if }a=1,\\ (z^3+1)&\text{if }a=0. \end{cases} \]

If \( n=4 \), \[ f(z)=c_0(1+b^3z+b^4z^2+b^3z^3+z^4), \] where \( a=b^2 \), so \[ f\text{ is } \begin{cases} P_1&\text{if }a > 3.236,\\ P_2&\text{if }\sqrt 2 < a < 3.236,\\ P_4&\text{if }a < \sqrt 2. \end{cases} \] If \( a=3.236 \), \( f(z) \) has double roots at \( -2.513 \) and \( -.398 \). If \( a=\sqrt 2 \), \[ f(z)=c_0(1+z^2)(1+2^{3/4}z+z^2). \]

If \( n=5 \), \[ \eqalign{ f(z)&=c_0(1+a^2z+a^3z^2+a^3z^3+a^2z^4+z^5) \cr& =c_0(z+1)[z^4+(a^2-1)z^3+(a^3-a^2+1)z^2+(a^2-1)z+1], } \] so \[ f\text{ is } \begin{cases} P_1&\text{if }a > 3.234,\\ P_2&\text{if }1.465 < a < 3.234,\\ P_3&\text{if }1\leq a < 1.465,\\ P_4&\text{if } a < 1. \end{cases} \]

If \( a=1 \), \[ (z)=c_0(1+z^3)(1+z+z^2). \]

If \( n=6 \), \[ f(z)=c_0(1+b^5z+b^8z^2+b^9z^3+b^8z^4+b^5z^5+z^6), \] where \( a=b^2 \), so for various values of \( a \) \[ 0 < P_6 < 1.019 < P_4 < 1.091 < P_2 < 1.341 < P_1. \] This suggests the following general conjecture.

Conjecture 6: If \( c_i^2\geq\sqrt 2 c_{i-1} c_{i+1} \) (or \( c_{i-1}c_{i+1}\geq ac_{i-2}c_{i+2} \) for some \( a \)), then \( f \) is \( P_3 \).

Given the polynomial \[ f(u)=\sum_{i=0}^nc_iu^i \] and a \( b > 0 \), define a polynomial \[ g(u)=\sum_{i=0}^nc_iu^{\lfloor bi\rfloor}=\sum_{j=0}^{\lfloor bn\rfloor}d_ju^j,\quad d_j=\sum_{i:j\leq bi < j+1}c_i. \]

If \( f \) is the pgf of \( X \), then \( g \) is the pgf of \( \lfloor bX\rfloor \). Then \[ d_j^2-ad_{j-1}d_{j+1}=\sum_m\biggl[\sum_{\substack{ j\leq bi < j+1\\bm-j-1 < bi\leq bm-j}}c_ic_{m-i}-a\sum_{\substack{ j-1\leq bi < j\\bm-j-2 < bi\leq bm-j-1}}c_ic_{m-i}\biggr]. \] We will say that the coefficients \( c_i \) satisfy the Newton inequalities if \begin{equation}\label{Newton} \frac{c_i^2}{\binom ni^2}\geq \frac{c_{i-1}}{\binom n{i-1}}\frac{c_{i+1}}{\binom n{i+1}}; \end{equation} see [◊], for example.

If \( b=\frac1k \), \[ d_j^2-ad_{j-1}d_{j+1}=\sum_{i=0}^{k-1}\sum_{l=0}^{k-1}[c_{kj+i}c_{kj+l}-ac_{kj+i-k}c_{kj+l+k}]. \] As pointed out earlier, even if \( k=1 \) and \( c_i=\binom ni \), \eqref{SSR} fails for large \( n \).

Acknowledgment

The editor thanks Subhro Ghosh for reviewing the paper, and for his various suggestions which improved the presentation of the paper.

Footnotes

1:It is also possible to assume that the coefficients satisfy \( c_i \ge 0 \). Then \( f \) is strong Rayleigh if all of its roots are negative or zero. As will be explained, an SR random variable with generating function \( f \) can be written as the sum of independent Bernoulli variables, with probabilities equal to the roots of \( f \). The zero roots correspond to Bernoulli variables always taking value 1. To exclude degeneracy, the author assumes that \( c_i > 0 \).

2:It refers to the pgf of the \( X \).

3:NASC refers to “necessary and sufficient condition”.

4:The polynomial \( f(u) \) is decomposed into the polynomial \( h_0(u^2) \) with even degrees and the polynomial \( uh_1(u^2) \) with odd degrees. Thus, either \( h_0 \) and \( h_1 \) have the same degree, or the degree of \( h_0 \) is larger than that of \( h_1 \) by 1. Given that \( h_0, h_1 \) have all real roots:

If \( h_0 \) and \( h_1 \) have the same degree, let \( v_1, \dots, v_k \) be the roots of \( h_0 \) and \( (w_1, \dots, w_k) \) be those of \( h_1 \). We say that the roots of \( h_0 \) and \( h_1 \) interlace if \begin{equation*} v_1 < w_1 < v_2 < \dots < v_k < w_k \quad \mbox{or} \quad w_1 < v_1 < w_2 < \cdots < w_k < v_k. \end{equation*}
If the degree of \( h_0 \) is larger than that of \( h_1 \) by 1, let \( v_1, \dots, v_k \) be the roots of \( h_0 \) and \( (w_1, \dots, w_{k-1}) \) be those of \( h_1 \). We say that the roots of \( h_0 \) and \( h_1 \) interlace if \begin{equation*} v_1 < w_1 < v_2 < \cdots < v_{k-1} < w_{k-1} < v_k. \end{equation*}

5:We say that a random variable \( X \) with range \( \{1, \dots, n\} \) is \( P_j \) if its pgf has property \( P_j \).

6:Recall that \( t_1 < t_0 < 0 \), so \( -bt_1d+at_0\geq0 \) implies that \( d \ge at_0/(bt_1) \).

7:For \( n = 4 \), we have \[ h_0(u) = c_3 u + c_0, \quad h_{1}(u) = c_u + c_4, \quad h_2(u) = c_2. \] By the definition of \( Q_3 \), the largest is a root of \( h_0 \), then second largest is a root of \( h_1 \), etc. Thus, \( -c_0/c_3 \ge -c_1/c_4 \) which implies \( c_1c_3 \ge c_0c_4 \).

8:Notice that \[ (a_0b_1+a_1b_0)(a_1b_2+a_2b_1)-a_0a_2b_0b_2 = a_0a_2b_0b_2 \biggl(\!\biggl(\frac{b_1}{b_0}+ \frac{a_1}{a_0}\biggr)\biggl(\frac{b_1}{b_2}+\frac{a_1}{a_2}\biggr) -1\biggr). \] By the Cauchy–Schwarz inequality, \[ \biggl(\frac{b_1}{b_0}+\frac{a_1}{a_0}\biggr)\biggl(\frac{b_1}{b_2}+\frac{a_1}{a_2}\biggr) \ge \biggl(\sqrt{\frac{b_1^2}{b_0b_2}} + \sqrt{\frac{a_1^2}{a_0a_2}}\biggr)^2. \] The latter is greater or equal to 1 if \( a_0a_2\leq 4a_1^2, b_0b_2\leq 4b_1^2 \).

9:The condition holds for the same reason as in the footnote 7.

10:These cases were checked by the author using Mathematica.

11:The coupling of \( U \) and \( V \) from stochastic domination follows from Strassen’s theorem.

12:The idea is to classify the polynomial \( f \) according to property \( P_j \) in terms of the value of \( a \).

Thomas Milton Liggett

Approximating multiples of strong Rayleigh random variables