Chapter 8 Branching Process (Lecture on 01/28/2021)
Branching process is first introduced to understand the population size of the British royal family. It is assumed that each individual can give birth to k sons with probability pk, and define Xn as the number of individuals in the family at the nth generation. We are interested in the following things.
- E(Xn);
- Var(Xn);
- P(Xn=0) as n→∞.
Since Xn+1 denotes the number of individuals in the family at generation n+1, it can therefore be written as Xn+1=Xn∑r=1ξr where ξr is the number of sons born out of the rth individual in the nth generation. P(ξr=k)=pk for k=0,1,⋯.
We define a few generating functions: φ(s)=∞∑k=0pksk and φn(s)=∞∑k=0P(Xn=k)sk Obviously from the definition, we have φ(1)=∞∑k=0pk=1 and φn(1)=∞∑k=0P(Xn=k)=1 for n=0,1,⋯. We have also found from Lecture 7 that φn+1(s)=φ(φn(s)) Let m=E(X1)=∑∞k=1kpk and σ2=Var(X1), assume they both exist and are finite. Then by definition of φn(s), we have E(Xn)=∞∑k=1kP(Xn=k)=∞∑k=1sk−1kP(Xn=k)|s=1=φ′n(1) In addition, since φn+1(s)=φn(φ(s)), we have φ′n+1(s)=[φn(φ(s))]′=φ′n(φ(s))φ′(s) Put s=1, from (8.8) we have φ′n+1(1)=φ′n(1)φ′(1) (8.9) gives us a recursive relationship, from which we can get φ′n+1(1)=φ′n(1)φ′(1)=φ′n−1(1)[φ′(1)]2=⋯=[φ′(1)]n+1 If we assume that φ′(1)=m=E(X1), then φ′n+1(1)=mn+1. Now note that by (8.7), φ′n+1(1)=E(Xn+1), we finally obtain that E(Xn)=mn
Now we tend to compute Var(Xn). First note that φ′′n(s)=∞∑k=2k(k−1)P(Xn=k)sk−2 Thus, φ′′n(1)=E(Xn(Xn−1))=E(X2n)−E(Xn) Then use (8.6) we have φ′n+1(s)=φ′(φn(s))φ′n(s) and φ′′n+1(s)=φ′′(φn(s))[φ′n(s)]2+φ′(φn(s))φ′′n(s) Putting s=1, we then have φ′′n+1(1)=φ′′(φn(1))[φ′n(1)]2+φ′(φn(1))φ′′n(1)=φ′′(1)[φ′n(1)]2+φ′(1)φ′′n(1) The second equation uses the result of (8.5). Denote φ′′(1)=E(X21)−E(X1)=σ2+m2−m=M, then from (8.16) we get φ′′n+1(1)=M(mn)2+mφ′′n(1)=Mm2n+mφ′′n(1) which is also a recursive relationship. Extend it, we have φ′′n+1(1)=Mm2n+mφ′′n(1)=Mm2n+m[Mm2(n−1)+mφ′′n−1(1)]=⋯=M{m2n+m2n−1+⋯+mn} Now note that φ′′n+1(1)=E(X2n+1)−E(Xn+1)Var(Xn+1)=E(X2n+1)−E2(Xn+1)=E(X2n+1)−m2(n+1) Thus, we can get Var(Xn+1)=φ′′n+1(1)+mn+1−m2(n+1)=M{m2n+m2n−1+⋯+mn}+mn+1−m2(n+1)=(σ2+m2−m){m2n+m2n−1+⋯+mn}+mn+1−m2(n+1)=σ2{m2n+m2n−1+⋯+mn}+(m2−m){m2n+m2n−1+⋯+mn}+mn+1−m2(n+1)=σ2{m2n+m2n−1+⋯+mn} The last equation is becasue (m2−m){m2n+m2n−1+⋯+mn} is a telescoping sequence, and it cancels with the last two terms. Therefore, we finally have the formula for Var(Xn) as Var(Xn)={nσ2m=1σ2mn−1(mn−1m−1)m≠1
Now we tend to calculate the extinction probability, which is defined as the probability that the population of the royal families will eventually die out, denote at P(Xn=0.∃n). Note that Xn=0 implies that Xk=0,∀k≥n. Note also that extinction nevers occurs if p0=0. This is because p0 is the probability of having no son. Therefore, p0=0 means each individual have probability one to have at least one son and the extinction never happens. We assume p0>0 in the following calculation. We also assume p0+p1<1.
Note that φ(s)=∑∞k=0pksk, which is a power series in s with non-negative coefficients and p0>0 and p0<1. Hence, φ(s) is a strictly increasing function. Let qn=P(Xn=0)=φn(0) (by (8.3)), then qn+1=P(Xn+1=0)=φn+1(0)=φ(φn(0))=φ(qn) which is another recursive relationship. Now since q1=φ1(0)=P(X1=0)=P(ξ1=0)=p0>0 (we assume X0=1), and from (8.2), q2=φ(q1)>φ(0)=q1 since φ(s) is a strictly increasing function. In particular, if we assume qn>qn−1, we have φ(qn)>φ(qn−1), that is, qn+1>qn. Therefore, by induction, {qn}n≥1 is an increasing sequence of real numbers and also qn≤1. Thus, {qn}n≥1 is a monotonically increasing sequence and bounded above by 1, lim exists. Let \pi:=\lim_{n\to\infty}q_n, obviously 0<\pi\leq 1. Now, from (8.22), taking limit on both side, we have \begin{equation} \pi=\varphi(\pi) \tag{8.23} \end{equation} \pi is the probability of eventual extinction and \pi is the root of the equation \varphi(s)=s.
Claim: \pi is the smallest root of \varphi(s)=s.
Proof: Let s_0 be any other root of the equation, it implies q_1=\varphi(0)<\varphi(s_0)=s_0. Very similarly, we can show any q_n<s_0, which implies that \pi=\lim_{n\to\infty}q_n<s_0.
Notice also \varphi(1)=\sum_{k=0}^{\infty}p_k=1, so 1 is a root of the equation \varphi(s)=s. In addition, \begin{equation} \varphi^{\prime\prime}(s)=\sum_{k=2}^{\infty}k(k-1)s^{k-2}p_ks^{k-2}>0 \tag{8.24} \end{equation} since by our assumption, p_0+p_1<1 and there exist p_i,i\geq 2 such that p_i>0. Since any function with the second derivatives greater than 0 is a convex function, we have \varphi is a convex function. Therefore, there can be only two cases about \pi (Figure 8.1 illustrate this):
Case 1: If \varphi^{\prime}(1)>1, then f_1(s)=\varphi(s) and f_2(s)=s intersect twice in the region of s\in[0,1], we have \pi<1.
Case 2: If \varphi^{\prime}(1)\leq1, then f_1(s)=\varphi(s) and f_2(s)=s intersect only once in the region of s\in[0,1], which is just 1. Therefore, \pi=1.
FIGURE 8.1: Illustration of the cases for \varphi(s)=s.
Notice that \varphi^{\prime}(1)=E(X_1)=m, summing up the above result, we have \begin{equation} \boxed{\mathbf{\left\{\begin{aligned} & \boldsymbol{\pi}=1 & m=E(X_1)\leq 1\\ & 0<\boldsymbol{\pi}<1 & m=E(X_1)>1\end{aligned}\right. }} \tag{8.25} \end{equation}