1. Introduction
The Poisson–Dirichlet distribution $\mathrm{PD}(0,\theta)$ , $\theta>0$ , was proposed in [Reference Kingman9] in the simplex $\overline{\nabla}_{\infty} = \big\{x\in[0,1]^{\infty}\mid x_{1}\geq x_{2}\geq\cdots\geq0,\ \sum_{i=1}^{\infty}x_i\leq 1\big\}$ . The one-parameter Poisson–Dirichlet distribution $\mathrm{PD}(0,\theta)$ was generalized to the two-parameter Poisson–Dirichlet distribution $\mathrm{PD}(\alpha,\theta)$ , $\alpha\in(0,1)$ , $\theta+\alpha>0$ , in [Reference Pitman15]. They are also the representing measure of the Ewens–Pitman partition structure. The partition structure coined in [Reference Kingman10] is an exchangeable partition distribution M of the set $\mathbb{N}=\{1,2,\ldots,n,\ldots\}$ . Due to its exchangeability, the restriction of M to $\{1,2,\ldots,n\}$ is $M_n(\eta)=\dim\!(\eta)\varphi(\eta)$ , where $\eta=(\eta_1,\ldots,\eta_l)$ , $|\eta| \;=\!:\; \sum_{i=1}^l\eta_i=n$ , is an integer partition of n. Moreover, $\varphi(\eta)$ is the probability of a single partition with cluster sizes $\eta$ . Due to exchangeability, two partitions have the same probability as long as their cluster sizes are the same. So, $\dim\!(\eta)$ is the multiplicities of such partitions with cluster size $\eta$ . Then, the family of distributions $\{M_{n},n\geq1\}$ will satisfy a natural consistency condition and they are uniquely determined by a representing measure in the Kingman simplex $\overline{\nabla}_{\infty}$ due to its exchangeability and the de Finetti theorem.
Let $\Gamma_n$ be the totality of integer partitions of n; then $\Gamma=\cup_{n\geq0}^{\infty}\Gamma_n$ exhausts all integer partitions, and $\Gamma_{0}=\emptyset$ is treated as an empty partition. Usually $\Gamma$ can be the vertex set of a graded branching diagram $(\Gamma,\chi)$ , where $\chi(\eta,\omega)$ assigns positive weight to an edge joining $\eta$ and $\omega$ , and 0 otherwise (see Figure 2). Each integer partition is represented by a Young diagram (see Figure 1); we say $\eta\subset\zeta$ if the Young diagram of $\eta$ is contained in the Young diagram of $\zeta$ . There are various kinds of edge weights. In the algebra $\mathcal{A}$ of symmetric functions with variables $(x_1,\ldots,x_n,\ldots)$ , there are various kinds of linear bases $\{f_{\eta},\eta\in\Gamma\}$ (refer to [Reference MacDonald12] or the appendix), such as the monomial functions, the Shur functions, and the Jack functions. These functions usually satisfy the Pieri formula $\big(\sum_{i=1}^{\infty}x_i\big)f_{\eta} = \sum_{\eta\subset\zeta, |\eta|+1=|\zeta|}\chi(\eta,\zeta)f_{\zeta}$ , where $\eta$ can be obtained from $\zeta$ by removing a box. The edge weights are chosen to be the coefficients in the above Pieri formula. The weights from the monomial functions, the Shur functions, and the Jack functions define the Kingman graph, the Young graph, and the Jack graph respectively. The specialization of the symmetric functions can be regarded as an algebra homomorphism $\Phi\colon \mathcal{A}\to \mathbb{R}$ . In particular, a specialization mapping base functions to positive values will determine a positive harmonic function $\varphi(\eta)=\Phi(f_{\eta})$ on $\Gamma$ , satisfying $\varphi(\eta)=\sum_{\eta\subset\zeta, |\eta|+1=|\zeta|}\chi(\eta,\zeta)\varphi(\zeta)$ . Under pointwise convergence topology, the space $\mathcal{C}$ of all positive harmonic functions becomes compact and convex. Therefore, we can expect that $\varphi(\eta)=\int_{E} K(\eta,x)\mu({\mathrm{d}} x)$ , where E is the Martin boundary of $\mathcal{C}$ , $K(\eta,x)$ is the extremity, and $\mu({\mathrm{d}} x)$ is a probability measure. For the Kingman graph, this representation is the Kingman’s one-to-one correspondence of the partition structures [Reference Kingman10] and $K(\eta,x)$ is the continuous extension of the monomial functions. For the Jack graph, the representation is also established in [Reference Kerov, Okounkov and Olshanski8], and $K(\eta,x)$ is the extended Jack functions. The Z-partition structure is a partition structure defined on the Jack graph, and its representing measure is called the Z-measure, denoted as $\mathcal{Z}(z,z^{\prime},\vartheta)$ , $z,z^{\prime}\in\mathbb{C}$ , $\vartheta>0$ .
The two-parameter Poisson–Dirichlet diffusion with the stationary distribution $\mathcal{PD}(\alpha,\theta)$ was established through an up–down Markov chain on the Kingman graph in [Reference Petrov14].
Similarly, [Reference Olshanski13] established a reversible diffusion on the Thoma simplex,
Its stationary distribution is the measure $\mathcal{Z}(z,z^{\prime},\vartheta)$ . In this paper, we will use the Z-measure diffusion to refer to the diffusion obtained in [Reference Olshanski13].
Interestingly, both types of diffusion have similar explicit transition densities, and the spectrums of their generators are also the same, $\big\{0,\frac{1}{2}{n(n-1+\theta)}\mid n\geq2\big\}$ , where $\theta={zz^{\prime}}/{\vartheta}>0$ for the Z-measure diffusion.
Recently, spectral expansion was again applied to the Z-measure diffusion in [Reference Korotkikh11] to derive its explicit transition density. A similar method was previously used to establish the transition density of the two-parameter Poisson–Dirichlet diffusion in [Reference Feng, Sun, Wang and Xu6]. In fact, this method was first adopted in [Reference Ethier4]. Surprisingly, by rearranging the density in [Reference Feng, Sun, Wang and Xu6], [Reference Zhou17] obtained the expression
where $p_n(x,y)$ is a transition kernel and $\big\{\widetilde{d}_1^{\theta}(t),d_n^{\theta}(t),n\geq2\big\}$ , $\theta>-1$ , is the distribution of a pure death process related to the Kingman coalescent in [Reference Zhou17].
In this paper, we rearrange the transition density of the Z-measure diffusion as in [Reference Zhou17]. We obtain a similar expression for the Z-measure diffusion,
where $\mathcal{K}_{n}(\sigma,\omega)$ is also a transition kernel and $\big\{\widetilde{d}_1^{\theta}(t),d_n^{\theta}(t),n\geq2\big\}$ , $\theta>0$ , are the same as the coefficients in (1).
The next natural question would be why their transition densities have the same coefficients. This question has been resolved for the two-parameter Poisson–Dirichlet diffusion in [Reference Griffiths, Spanó, Ruggiero and Zhou7] by a dual process method. In order to understand why the Z-diffusion also has similar coefficients in its density expression (2), we also apply the dual process method to the Z-measure diffusion $Y_t$ . But we can only find the dual process of the Z-measure diffusion when $\vartheta=1$ because we rely on [Reference Borodin and Olshanski3, (5)]. When $\vartheta\neq 1$ , a similar equation is not known. For $\vartheta=1$ , the dual process $\mathcal{D}_{t}$ is also a partition-valued jump process characterized by the generator
where $p_1^{\downarrow}(\eta,\zeta)$ is the transition probability of the down Markov chain in the Young graph discussed in [Reference Borodin and Olshanski3], and f is the continuous function on the Thoma simplex. The dual relation is defined through a bivariate function $F(\eta, \omega) = {s_{\eta}^{o}(\omega)}/{{\mathbb{E}}_{z,z^{\prime},1}s_{\eta}^{o}}$ , which is the normalized kernel in the representation of the Young graph. The notation ${\mathbb{E}}_{z,z^{\prime},1}$ should always be interpreted as the expectation with respect to the Z-measure $\mathcal{Z}(z,z^{\prime},1)$ . We show that duality relation reads as ${\mathbb{E}}_{\omega}F(\eta, Y_t)={\mathbb{E}}_{\eta}F(\mathcal{D}_{t}, \omega)$ . Because the distribution of $\mathcal{D}_{t}$ is easier to calculate, then the expression in (2) can be obtained. Now the radial process $|\mathcal{D}_{t}|$ of the dual process $\mathcal{D}_{t}$ will be exactly the same as that in the two-parameter Poisson–Dirichlet diffusion. This will eventually determine the coefficients in the density expression (2). Although when $\vartheta\neq1$ we cannot find the dual process, the result when $\vartheta=1$ encourages us to conjecture that the dual process of the Z-measure diffusion should also be a partition-valued jump process whose jump rate is $\frac{1}{2}{n(n-1+\theta)}$ and its embedded chain is the down Markov chain in [Reference Olshanski13].
The plan of the paper is as follows. In Section 2 we introduce the branching diagrams and the up and down Markov chains. In Section 3, we talk about the Z-measure diffusion and its transition density. In Section 4 we use the dual process method to derive the transition density of the Z-measure diffusion when $\vartheta=1$ . In the last section, a few conjectures will be presented.
2. Branching diagram and up–down Markov chain
2.1. Branching diagram
For $n,l\in\mathbb{Z}_{+}$ , $\eta=(\eta_1,\ldots,\eta_l)\in\mathbb{N}^{l}$ is called an integer partition of n if $\eta_1\geq\eta_2\geq\cdots\geq\eta_l>0$ and $|\eta|\;:\!=\;\sum_{i=1}^{l}\eta_i=n$ . Define $\alpha_i(\eta)=\#\{1\leq j\leq l\mid \eta_j=i\}$ , $1\leq i\leq n$ ; then $(\alpha_1(\eta),\ldots,\alpha_n(\eta))$ is a different representation of the partition $\eta$ . Denote by $\Gamma_n$ the set of all integer partitions of n; then $\Gamma=\cup_{n\geq0}\Gamma_n$ is the set of all integer partitions, where $\Gamma_0=\{\emptyset\}$ and $\emptyset$ is an empty partition. An integer partition $\eta=(\eta_1,\ldots,\eta_l)$ may also be represented by a Young diagram (Figure 1) defined by attaching boxes at position (i, j), where $1\leq i\leq l$ , $1\leq j\leq \eta_i$ . Here the row number increases from top to bottom and the column number increases from left to right.
In this paper, we will interchangeably use $\eta$ to represent either an integer partition or its Young diagram. We define the diagonal line of the Young diagram $\eta$ as the set of boxes $\{(i,i)\mid i\leq \eta_{i},\eta^{\prime}_{i}\}$ , and r as the length of the diagonal line of $\eta$ . The transposition of $\eta$ with respect to its diagonal line will give us a new partition $\eta^{\prime}$ called the conjugate of $\eta$ . We say $\eta\subset \nu$ if the Young diagram of $\eta$ is contained in the Young diagram of $\nu$ . Then, $\subset$ is a partial ordering in $\Gamma$ . For each box $b=(i,j)$ in the partition $\eta$ , we define its arm length as $a(b)=\eta_i-j$ and its leg length as $l(b)=\eta^{\prime}_j-i$ . A partition $\eta$ may also be represented by its Frobenius coordinates $\widetilde{\eta}= \big(a(1,1)+\frac{1}{2},\ldots,a(r,r)+\frac{1}{2};\; l(1,1)+\frac{1}{2},\ldots,l(r,r)+\frac{1}{2}\big)$ .
A branching diagram is a graded graph $(\Gamma,\chi)$ (Figure 2), where $\Gamma=\cup_{n\geq0}\Gamma_n$ is the vertex set. There is an edge joining $\eta\in\Gamma_n$ and $\nu\in\Gamma_{n+1}$ if and only if $\eta\subset \nu$ , and their edge weight is $\chi(\eta,\nu)$ .
When the edge weights are determined by the coefficients in the Pieri formula of the monomial symmetric functions, we will have a Kingman graph. Its edge weight is defined as $\chi^\mathrm{K}(\eta,\zeta)=\alpha_{\zeta_{i}}(\zeta)$ if $\zeta-\eta=(i,j)$ , i.e. $\zeta$ can be obtained from $\eta$ by attaching a box at (i, j). If we choose the edge weight to be the coefficients in the Pieri formula of the Jack functions, then we end up with the Jack graph, whose edge weight, denoted as $\chi^\mathrm{J}_{\vartheta}(\eta,\zeta)$ , will be
where b runs over all boxes in the jth column of $\eta$ if $\zeta-\eta=(i,j)$ . When $\vartheta=1$ , then $\chi^\mathrm{J}_{1}=1$ and the Jack graph reduces to the Young graph.
We define the weight of a path $t\colon \eta=\eta(0)\subset\cdots\subset\eta(k)=\zeta$ as $\prod_{i=0}^{k-1}\chi(\eta(i),\eta(i+1))$ . Then we define the total weight between $\eta$ and $\zeta$ in the Kingman graph as $\dim^\mathrm{K}\!(\eta,\zeta)=\sum_{\eta=\eta(0)\subset\cdots\subset\eta(k)=\zeta}\prod_{i=0}^{k-1}\chi(\eta(i),\eta(i+1))$ . Similarly, the total weight between $\eta$ and $\zeta$ in the Jack graph will be $\dim^\mathrm{J}_{\vartheta}\!(\eta,\zeta)=\sum_{\eta=\eta(0)\subset\cdots\subset\eta(k)=\zeta}\prod_{i=0}^{k-1}\chi(\eta(i),\eta(i+1))$ . In particular, when $\eta=\emptyset$ , we regard $\dim^\mathrm{K}\!(\nu)=\dim^{\textrm{K}}\!(\emptyset,\nu)$ as the total weight of $\nu$ in the Kingman graph and $\dim^\mathrm{J}_{\vartheta}\!(\nu)=\dim^\mathrm{J}_{\vartheta}\!(\emptyset,\nu)$ as the total weight of $\nu$ in the Jack graph. Naturally, we have
In the Kingman graph, the total weight of $\eta$ is $\dim^\mathrm{K}\!(\eta)={n!}/({\eta_1!\cdots\eta_l!})$ if $\eta=(\eta_1,\ldots,\eta_l)\in\Gamma_n$ . In the Jack graph, the total of $\eta$ is $\dim^\mathrm{J}_{\vartheta}\!(\eta)={n!}/({H(\eta;\;\vartheta)H'(\eta;\;\vartheta)})$ , where $H(\eta;\;\vartheta)=\prod_{b\in\eta}\!(a(b)+1+l(b)\eta)$ , $H'(\eta;\;\vartheta)=\prod_{b\in\eta}\!(a(b)+(1+l(b))\eta)$ . When $\vartheta=1$ , the total weight of $\eta$ is $\dim_{1}^\mathrm{J}\!(\eta)={n!}/{H^2(\eta;\;1)}$ in the Young graph.
2.2. Down–up Markov chain
Due to (3), we can easily construct a down Markov chain $\{D_{n}^\mathrm{K},n\geq1\}$ in the Kingman graph with the transition probability
This down Markov chain is the embedded chain of the dual process in [Reference Griffiths, Spanó, Ruggiero and Zhou7]. Similarly, we can also construct a down Markov chain $\{D_{n}^{\mathrm{J},\vartheta},n\geq1\}$ in the Jack graph with the transition probability
When $\vartheta=1$ , this down Markov chain is the embedded chain of the dual process of the Z-measure diffusion that we are going to discuss in Section 4.
As you may see, the down Markov chain depends only on the edge weights of the graph. But we can also construct an up Markov chain if we have exchangeable partition structures in the branching diagram. The Ewens–Pitman partition structure can be used to construct an up Markov chain $\{U_n^{\mathrm{K},\theta,\alpha},n\geq1\}$ in the Kingman graph [Reference Petrov14]. Then we can define an up–down chain $\{UD^{\mathrm{K},\theta,\alpha,n}_{m},m\geq1\}$ on $\Gamma_n$ , updating itself as
Naturally, the partition distribution $M_n^{\mathrm{K},\theta,\alpha}$ will serve as the stationary distribution of this up–down chain. The usual space and time scaling yields the two-parameter Poisson–Dirichlet diffusion in [Reference Petrov14]. The Ewens–Pitman partition structure can be replicated by the Blackwell–MacQueen urn model, and it has found many applications in classification problems through Bayesian statistics [Reference Blei, Ng and Jordan1].
For the Jack graph there is also a special Z partition structure [Reference Borodin and Olshanski2]
where either (i) $z\in\mathbb{C}-(\mathbb{Z}_{\leq0}+\vartheta \mathbb{Z}_{\geq0})$ and $z^{\prime}=\overline{z}$ (principal case), or (ii) $\vartheta$ is rational number and $z,z^{\prime}$ are real numbers belonging to an interval between two consecutive lattice points in $\mathbb{Z}+\vartheta\mathbb{Z}$ (complementary case). Here, $(z)_{\eta;\;\vartheta}=\prod_{(i,j)\in\eta}(z+(j-1)-(i-1)\vartheta)$ . The representing measure $\mathcal{Z}(z,z^{\prime},\vartheta)$ of the partition structure (5) is the Z-measure. Similarly, we can construct an up Markov chain $\{U^{\mathrm{J},z,z^{\prime},\vartheta}_{n},n\geq1\}$ [Reference Olshanski13]. By mimicking the update in (4), we can also construct an up–down Markov chain $\{UD^{\mathrm{J},z,z^{\prime},\vartheta,n}_{m},m\geq1\}$ on $\Gamma_n$ . Then the usual space and time scaling in Section 3 yields the Z-diffusion in [Reference Olshanski13]. To the best of the author’s knowledge, no quick replication of the Z partition structure $M^{\mathrm{J},z,z^{\prime},\vartheta}_n(\eta)$ has been identified. So whether the Z partition structure can be applied to classification problems is still open.
3. The Z-measure diffusion and its transition density
3.1. The Z-measure diffusion
The diffusion approximation of the up–down Markov chain $\{UD^{\mathrm{J},z,z^{\prime},\vartheta,n}_{m},m\geq1\}$ on $\Gamma_n$ can be carried out by the space scaling
where $\widetilde{\eta}$ is the Frobenius coordinates of $\eta$ . As $n\to\infty$ , [Reference Olshanski13] showed that $\pi(UD^{\mathrm{J},z,z^{\prime},\vartheta,n}_{[n^2t]})$ converges to the Z-measure diffusion $Y_t$ on the Thoma simplex $\Omega$ with the pre-generator
The core of $ A_{z,z^{\prime},\vartheta}$ is spanned by $\{\varphi_j^{o},j\geq1\}$ , where $\varphi_j^{o}$ is the image of $\varphi_j(x)=\sum_{i=1}^{\infty}x_i^j$ under the special algebra homomorphisms $\Phi_{\omega}\colon \mathcal{A}\to\mathbb{R}$ , $\omega=(\alpha,\beta)\in\Omega$ , , $n\geq2$ , $\Phi_{\omega}(\varphi_{1})=1$ . Since the Jack functions $J_{\zeta}(x;\;\vartheta)$ can be written as linear combinations of $\varphi_{\eta_1}\cdots\varphi_{\eta_l}$ , where $\eta=(\eta_1,\ldots,\eta_l)\subset \zeta$ , then $J_{\zeta}^{o}(\omega;\;\vartheta)\;:\!=\;\Phi_{\omega}(J_{\zeta}(x;\;\vartheta))$ .
In particular, when $\vartheta=1$ we have the extended Shur function $s^{o}_{\zeta}(\omega)\;:\!=\;\Phi_{\omega}(J_{\zeta}(x;\;1))$ . The Z-measure diffusion reduces to the diffusion in [Reference Borodin and Olshanski3].
3.2. Transition density of the Z-measure diffusion $\boldsymbol{Y}_\boldsymbol{t}$
By spectral expansion of $A_{z,z^{\prime},\vartheta}$ in the Hilbert space $L^2(\Omega, \mathcal{Z}(z,z^{\prime},\vartheta))$ , the following explicit transition density of $Y_t$ was obtained in [Reference Korotkikh11].
Proposition 1. The transition density of $Y_t$ is
where $\theta = {zz^{\prime}}/{\vartheta}$ , $\lambda_m = \frac{1}{2}{m(m-1+\theta)}$ , and
In particular, when $n=0,1$ , $\mathcal{K}_n(\sigma,\omega)=1$ .
In [Reference Korotkikh11], it was shown that
By the following Proposition 2, [Reference Borodin and Olshanski2] showed that the representing measure of the Z partition structure is the Z-measure $\mathcal{Z}(z,z^{\prime},\vartheta)$ .
Proposition 2. For a partition structure $\{M_{n},n\geq1\}$ on the Jack graph, there is a unique probability measure $\mu$ on the Thoma simplex $\Omega$ such that $M_n(\eta)=\int_{\Omega}j_{\eta}(\omega;\;\vartheta)\mu({\mathrm{d}}\omega)$ . Moreover, $\mu_n({\mathrm{d}}\omega) = \sum_{\eta\in\Gamma_n}M_n(\eta)\delta_{{\widetilde{\eta}}/{n}}({\mathrm{d}}\omega)$ will converge weakly to $\mu$ . Here, $\widetilde{\eta} = \big(a(1,1)+\frac{1}{2},\ldots,a(r,r)+\frac{1}{2};\; l(1,1)+\frac{1}{2}, \ldots,l(r,r)+\frac{1}{2}\big)$ is the Frobenius coordinate of $\eta$ .
Therefore, similar to the Ewens sampling formula, we have
and $K_n^{o}(\sigma,\omega)={\mathcal{K}_n(\sigma,\omega)}/({n!(\theta)_{n}})$ . Then, we can easily see that (6) is equivalent to the density in [Reference Korotkikh11].
In this paper, we will rearrange the right-hand side of (6) to yield a new representation.
Theorem 1. The transition density of $Y_t$ is
where
Due to the estimation of $G_m$ in [Reference Korotkikh11], the proof of Theorem 1 is the same as the proof of [Reference Zhou17, Theorem 2.1].
Proof. By [Reference Korotkikh11, Proposition 16], we know there exist positive constants c, d such that $\sup_{\sigma,\omega\in\Omega}|G_m(\sigma,\omega)|\leq c m^{dm}$ . Then, we can show that the series in (6) is uniformly convergent (see [Reference Zhou17] or [Reference Korotkikh11]). We can then switch the order of summation in (6):
Since $\mathcal{K}_1(\sigma,\omega)=\mathcal{K}_0(\sigma,\omega)=1$ , we have
Note that $(\theta+1)_{(m-1)}m-(\theta)_{(m-1)}=0$ for $m=1$ . Therefore,
Corollary 1. The diffusion $Y_t$ satisfies the ergodic inequality
This inequality can be easily derived from an inequality of tail probabilities (see [Reference Tavaré16]): $\sum_{n=2}^{\infty}d_n^{\theta}(t) \leq \frac{1}{2}(\theta+1)(\theta+2){\mathrm{e}}^{-(\theta+1)t}$ .
4. Dual process of the Z-measure diffusion when $\vartheta=1$
The transition density of the Z-measure diffusion looks very similar to the transition density of the two-parameter Poisson–Dirichlet diffusion. Given their huge differences, it is surprising that their transition densities are both mixtures of distributions with exactly the same coefficients. To figure out why they have the same coefficients, we adopt the dual process method used in [Reference Griffiths, Spanó, Ruggiero and Zhou7]. The duality between $Y_t$ and its dual process $\mathcal{D}_{t}$ is defined through a bivariate function $F(\eta,\omega)$ , where $\eta\in\Gamma$ and $\omega\in\Omega$ . The bivariate function $F(\eta,\omega)$ is usually chosen to be the normalized kernel $F(\eta,\omega) = {j_{\eta}(\omega;\;\vartheta)}/{{\mathbb{E}}_{z,z^{\prime},\vartheta}j_{\eta}}$ in Proposition 2. Then, $Y_t$ and its dual $\mathcal{D}_t$ satisfy ${\mathbb{E}}_{\eta}^*F(\mathcal{D}_t,\omega)={\mathbb{E}}_{\omega}F(\eta, Y_t)$ , where ${\mathbb{E}}_{\eta}^*$ is the expectation with respect to the distribution of $\mathcal{D}_t$ . In this paper, we use this dual equation to derive the transition density (8) directly when $\vartheta=1$ . This derivation clearly explains why the coefficients $d_n^{\theta}(t)$ show up in the transition density (8). However, when $\vartheta\neq1$ , we fail to verify the dual process of $Y_t$ because we don’t know whether the Z-measure diffusion is similar to [Reference Olshanski13, (5)].
4.1. Dual process
In this section we consider test functions $g_{\eta}(\omega) = {j_{\eta}(\omega;\;1)}/{{\mathbb{E}}_{z,z^{\prime},\vartheta}j_{\eta}}$ , so $\hbox{${\mathbb{E}}_{z,z^{\prime},\vartheta}g(\eta)=1$}$ . When $\vartheta=1$ , $J_{\eta}(x;\;1)$ is just the Shur function $s_{\eta}(x)$ , so $j_{\eta}(\omega;\;1)$ is $\dim_{1}^\mathrm{J}\!(\eta)\Phi_{\omega}(s_{\eta}) = \dim_{1}^\mathrm{J}\!(\eta)s^{o}_{\eta}(\omega)$ . Moreover, by (7), we know that
We define $F(\eta,\omega) = g_{\eta}(\omega)$ , $\eta\in\Gamma$ , $\omega\in\Omega$ . So,
Now consider a jump process $\mathcal{D}_t$ defined by
where ${\dim_{1}^\mathrm{J}\!(\zeta)}/{\dim^\mathrm{J}_{1}\!(\eta)} = p^{\downarrow,J}_{1}(\eta, \zeta)$ is the transition probability of the down Markov chain $D^{\mathrm{J},1}_n$ in the Young graph. Moreover, $\mathcal{L}^\mathrm{J}_{1}1=0$ and. because $s_{(1)}^{o}=1$ , $\mathcal{D}_t$ will be absorbed at state $\eta=(1)$ . The radial process $|\mathcal{D}_t|$ of $\mathcal{D}_t$ is only determined by the jump rates $\frac{1}{2}n(n-1+\theta)$ , $n\geq1$ . Therefore, $|\mathcal{D}_t|$ is exactly the radial process in the Kingman coalescent by collapsing states 0 and 1 as a new state 1. The distribution of $|\mathcal{D}_t|$ is obtained in [Reference Tavaré16, Reference Zhou17]. So as long as the jump rates are the same for the dual process, the coefficients in their transition density, if it exists, will be the same.
Theorem 2. When $\vartheta=1$ , the Z-measure diffusion $Y_t$ and $\mathcal{D}_t$ satisfy the duality
Proof. By [Reference Borodin and Olshanski3, Lemma 5.4], we know that
From (9), we know that
Replacing $s^{o}_{\eta}$ and $s^{o}_{\zeta}$ in (11) yields
Because
we have
Therefore, $A_{z,z^{\prime},1}F(\eta,\omega)=\mathcal{L}^\mathrm{J}_{1}F(\eta,\omega)$ . By [Reference Ethier and Kurtz5, Theorem 4.4.1], we can show that the Z-measure diffusion $Y_t$ and $\mathcal{D}_t$ satisfy the dual equation (10).
Proposition 3. The dual process $\mathcal{D}_t$ has the transition probability ${\mathbb{P}}(\mathcal{D}_t = \eta \mid \mathcal{D}_0=\nu) = d_{mn}^{\theta}(t)\mathcal{H}(\eta,\nu)$ , $\eta\in\Gamma_{n}$ , $\nu\in \Gamma_m$ , $n\leq m$ . Here,
Proof. By the definition of $|\mathcal{D}_{t}|$ and [Reference Zhou17, Proposition 2.1], we know that ${\mathbb{P}}(|\mathcal{D}_{t}| = n \mid |\mathcal{D}_{0}| = m) = d_{mn}^{\theta}(t)$ . Then, $\mathbb{P}_{\nu}(\mathcal{D}_t = \eta) = {\mathbb{P}}(\mathcal{D}_{t} = \eta \mid \mathcal{D}_0 = \nu) = \mathbb{P}(\mathcal{D}_t=\eta \mid |\mathcal{D}_t| = n, \mathcal{D}_0=\nu) \mathbb{P}(|\mathcal{D}_{t}|=n \mid \mathcal{D}_0=\nu) = d_{mn}^{\theta}(t)\mathbb{P}(\mathcal{D}_t=\eta \mid |\mathcal{D}_t|=n,\mathcal{D}_0=\nu)$ . For $\nu\in\Gamma_{m}$ , $\eta\in\Gamma_{n}$ , there are paths of length $k=m-n$ joining $\nu$ and $\eta$ . These paths are the realizations of the embedded down Markov chain $D^{\mathrm{J},1}_n$ . Thus,
Proposition 4. For $j_{\eta}(\omega;\;1), \eta\in\Gamma_m$ , we have
Proof. By the duality equation (10), we know that
Then, we have (12) if we replace
4.2. Proof of Theorem 1 through the dual process method
Next we use Proposition 4 to deduce the transition density (8). By Proposition 2, we know that $\mu_{m}(d\omega) = \sum_{|\eta|=m}{\mathbb{E}}[j_{\eta}(Y_t;\;1) \mid Y_0 = \omega]\delta_{{\widetilde{\eta}}/{m}}(d\omega)$ will converge weakly to the distribution of $Y_t$ , i.e. the transition probability $q(t,\omega, \cdot) = {\mathbb{P}}_{\omega}(Y_t\in\cdot)$ . By (12), we know that
where
As $m\to\infty$ , we need to show that $\mu_{1,m}({\mathrm{d}}\omega)\to \mathcal{Z}(z,z^{\prime},\vartheta)({\mathrm{d}}\omega)$ and $\mu_{n,m}({\mathrm{d}}\omega)\to j_{\zeta}(\omega;\;1)\mathcal{Z}(z,z^{\prime},\vartheta)({\mathrm{d}}\omega)$ . The first claim can be directly obtained from Proposition 2; now we are going to show the second one. For any $f\in C(\Omega)$ , we have
where the second equality is due to [Reference Kerov, Okounkov and Olshanski8, Theorems 6.1 and 7.1]. Since $\lim_{m\to\infty}d_{mn}^{\theta}(t)=d_{n}^{\theta}(t)$ , we have derived the representation in Theorem 1.
5. Further discussion
5.1. Conjectures on diffusions with given stationary measures
The conclusions in this paper indicate that the dual process is only determined by the weights in the branching diagram, whether it be the Kingman graph or the Jack graph. The dual processes are determined by the generator
The diffusions, however, depend on the partition structures on the Kingman graph or the Jack graph. Due to Kingman’s representation theorem and the representation theorem in Proposition 2, the partition structures are uniquely determined by their representing measures, which will be the stationary distribution of the diffusions constructed through the up–down Markov chains.
More generally, for a given probability measure $\mu$ on the Kingman simplex $\overline{\nabla}_{\infty}$ , we can consider the generator $\mathcal{B}_{\mu}$ defined on an algebra $\mathcal{A}^{o}_\mathrm{K}$ spanned by $\{1,\varphi_k,k\geq2\}$ as follows:
where $\{g_{\eta}(x)={m_{\eta}(x)}/({{\mathbb{E}}_{\mu} m_{\eta}}) \mid \eta\in\Gamma\}$ is a linear base and ${\mathbb{E}}_{\mu}$ is the expectation with respect to the probability distribution $\mu$ . It will uniquely determine the operation of $\mathcal{B}_{\mu}^\mathrm{K}$ on $\mathcal{A}^{o}_\mathrm{K}$ .
Conjecture 1. For a given probability measure $\mu$ on the Kingman simplex $\overline{\nabla}_{\infty}$ , the generator defined in (13) will determine a reversible diffusion $W_t^\mathrm{K}$ with the stationary distribution $\mu$ . Its transition density is $p^\mathrm{K}(t,x,y) = d_0^{\theta}(t) + d_1^{\theta}(t) + \sum_{n=2}^{\infty}d_n^{\theta}(t)p_n(x,y)$ , where
Moreover, for a given probability measure $\mu$ in the Thoma simplex $\Omega$ , we can also consider the generator $\mathcal{B}_{\mu}^\mathrm{J}$ defined on an algebra $\mathcal{A}^{o}_\mathrm{J}$ spanned by $\{1,\varphi_k^{o},k\geq2\}$ as follows:
where $\{g_{\eta}(x)={j_{\eta}(x;\;\vartheta)}/({{\mathbb{E}}_{\mu} j_{\eta}}) \mid \eta\in\Gamma\}$ is a linear base. It will uniquely determine the operation of $\mathcal{B}_{\mu}^\mathrm{J}$ on $\mathcal{A}^{o}_\mathrm{J}$ .
Conjecture 2. For a given probability measure $\mu$ on the Thoma simplex $\Omega$ , the generator defined in (14) will determine a reversible diffusion $W_t^\mathrm{J}$ with the stationary distribution $\mu$ . Its transition density is $p^\mathrm{J}(t,x,y) = d_0^{\theta}(t) + d_1^{\theta}(t) + \sum_{n=2}^{\infty}d_n^{\theta}(t)\mathcal{K}_n(x,y)$ , where
Appendix A. Symmetric functions
In this appendix we discuss a few facts about the symmetric functions. Please refer to [Reference MacDonald12] for further details. Symmetric functions are defined as inverse limits of symmetric polynomials. For $n\in\mathbb{Z}_{+}$ , let $\Lambda_{n}$ be the ring of symmetric polynomials of variables $x_1,\ldots,x_n$ . Define $\rho_{n+1,n}\colon \Lambda_{n+1}\to\Lambda_{n}$ as follows:
Then $\Lambda$ is the inverse limit of $\Lambda_n$ , $n\geq1$ .
A.1. Monomial symmetric functions
Consider the symmetric polynomials $m_{\eta}^n(x_1,\ldots,x_n) = \sum_{\eta=(a_{(1)},\ldots,a_{(n)})}x_{1}^{a_1}\cdots x_{n}^{a_n} $ , where $(a_{(1)},\ldots,a_{(n)})$ is the descending arrangement of $(a_1,\ldots, a_n)$ . We can see that (15) is also true for $m_{\eta}^n(x_1,\ldots,x_n)$ . We can then define $m_{\eta}(x)$ as an inverse limit of $\{m_{\eta}^n\mid n\geq1\}$ ; this is called a monomial symmetric function. The evaluation of $m_{\eta}(x)$ in the Kingman simplex $\overline{\nabla}_{\infty}$ can be performed through continuous extension of $m_{\eta}|_{\nabla_{\infty}}$ . Moreover, $m_{\eta}(x)$ satisfies the Pieri formula $m_{\eta}(x)=\sum_{\eta\subset\nu,|\eta|+1=|\nu|} \chi^\mathrm{K}(\eta,\nu)m_{\nu}(x)$ , and $1=\sum_{|\eta|=n}\dim^\mathrm{K}\!(\eta)m_{\eta}(x)$ .
A.2. Shur functions
Denote a symmetric group as $S_n$ . Consider the symmetric polynomials
where $\eta=(\eta_1,\ldots,\eta_n)$ and $\delta=(n-1,n-2,\ldots,1,0)$ are integer partitions, and $a_{\eta}(x_1,\ldots,x_n) = \sum_{\sigma\in S_n}\mathrm{Sgn}(\sigma)x_{\sigma(1)}^{\eta_1}\cdots x_{\sigma(n)}^{\eta_n}$ . We can also show that $\{s^n_{\eta}\mid n\geq1\}$ satisfy (15). Then, the Shur function $s_{\eta}(x)$ is defined to be the inverse limit of $\{s^n_{\eta}\mid n\geq1\}$ . Moreover, the Shur functions also satisfy the Pieri formula, $s_{\eta}(x)=\sum_{\eta\subset\nu,|\eta|+1=|\nu|} \chi^\mathrm{J}_{1}(\eta,\nu)s_{\nu}(x)$ , and $1=\sum_{|\eta|=n}\dim^\mathrm{J}_{1}\!(\eta)s_{\eta}(x)$ , where $\dim^\mathrm{J}_{1} = {n!}/{H(\eta;\;1)}$ .
A.3. Jack functions
Jack polynomials $J_{\lambda}^n(x_1,\ldots,x_n;\;\vartheta)$ are defined to be the eigenfunctions of the Sekiguchi operators:
We can also show that $\{J^n_{\eta}(x;\;\vartheta)\mid n\geq1\}$ satisfy (15). Then the Jack symmetric function $J_{\lambda}(x_1,\ldots,x_n,\ldots;\;\vartheta)$ is defined to be the inverse limit of $\{J^n_{\eta}(x;\vartheta)\mid n\geq1\}$ . Moreover, the Jack functions satisfy the Pieri formula, $J_{\eta}(x;\vartheta)=\sum_{\eta\subset\nu,|\eta|+1=|\nu|} \chi^\mathrm{J}_{\vartheta}(\eta,\nu)J_{\nu}(x;\vartheta)$ , and $\big(\sum_{i=1}^{\infty}x_i\big)^{n}=\sum_{|\eta|=n}\dim_{\vartheta}^\mathrm{J}\!(\eta)J_{\eta}(x;\;\vartheta)$ . In particular, when $\vartheta=1$ , $J_{\eta}(x;1)=s_{\eta}(x)$ .
Acknowledgements
The author would like to thank Stewart N. Ethier for introducing him to [Reference Korotkikh11]. Moreover, gratitude should also go to Robert C. Griffiths for many stimulating discussions on dual processes. Last, but not least, the author would like to thank the two referees for their insightful comments.
Funding information
This research is supported by NSFC: 11701570.
Competing interests
There were no competing interests to declare which arose during the preparation or publication process of this article.