
14 Branching processes

In this chapter we will consider a random model for population growth in the absence of spatial or any other resource constraints. So, consider a population of individuals which evolves according to the following rule: in every generation n = 0,1,2, . . ., each individual produces a random number of offspring in the next generation, independently of other individuals. The probability mass function for offspring is often called the offspring distribution and is given by

pi = P(number of offspring = i),

for i = 0, 1, 2, . . .. We will assume that p0 < 1 and p1 < 1 to eliminate the trivial cases. This model was introduced by F. Galton in the late 1800s to study the disappearance of family names; in this case pi is the probability that a man has i sons.

We will start with a single individual in generation 0 and generate the resulting random family tree. This tree is either finite (when some generation produces no offspring at all) or infinite; in the former case, we say that the branching process dies out and, in the latter, that it survives.

We can look at this process as a Markov chain, where Xn is the number of individuals in generation n. Let us start with the following observations:

• If Xn reaches 0, it stays there, so 0 is an absorbing state.

• If p0 > 0, then P(Xn+1 = 0 | Xn = k) > 0, for all k.

• Therefore, by Proposition 13.5, all states other than 0 are transient if p0 > 0; the population must either die out or increase to infinity. If p0 = 0, then the population cannot decrease, and each generation increases with probability at least 1 − p1; therefore, it must increase to infinity.

It is possible to write down the transition probabilities for this chain, but they have a rather complicated explicit form, as

P(Xn+1 = i | Xn = k) = P(W1 + W2 + . . . + Wk = i),

where W1, . . . , Wk are independent random variables, each with the offspring distribution. This suggests using moment generating functions, which we will indeed do. Recall that we are assuming that X0 = 1.

Let

δn = P(Xn = 0)

be the probability that the population is extinct by generation (which we also think of as time) n.

The probability π0 that the branching process dies out is, then, the limit of these probabilities:

π0 = P(the process dies out) = P(Xn = 0 for some n) = lim_{n→∞} P(Xn = 0) = lim_{n→∞} δn.

Note that π0 = 0 if p0 = 0. Our main task will be to compute π0 for general probabilities pk. We start, however, by computing the expectation and variance of the population at generation n.

Let µ and σ^2 be the expectation and the variance of the offspring distribution, that is,

µ = EX1 = ∑_{k=0}^∞ k pk, and σ^2 = Var(X1).

Let mn = E(Xn) and vn = Var(Xn). Now, Xn+1 is the sum of a random number Xn of independent random variables, each with the offspring distribution. Thus, we have by Theorem 11.1,

mn+1 = mn µ, and vn+1 = mn σ^2 + vn µ^2.

Together with the initial conditions m0 = 1, v0 = 0, the two recursive equations determine mn and vn. We can very quickly solve the first recursion to get mn = µ^n and so

vn+1 = µ^n σ^2 + vn µ^2.

When µ = 1, mn = 1 and vn = n σ^2. When µ ≠ 1, the recursion has the general solution vn = A µ^n + B µ^{2n}. The constant A must satisfy

A µ^{n+1} = σ^2 µ^n + A µ^{n+2}, so that

A = σ^2 / (µ(1 − µ)).

From v0 = 0 we get A + B = 0, and the solution is given in the next theorem.

Theorem 14.1. Expectation mn and variance vn of the nth generation count.

We have mn = µ^n and

vn = σ^2 µ^n (1 − µ^n) / (µ(1 − µ)) if µ ≠ 1,
vn = n σ^2 if µ = 1.

We can immediately conclude that µ < 1 implies π0 = 1, as

P(Xn ≠ 0) = P(Xn ≥ 1) ≤ EXn = µ^n → 0;

if the individuals have less than one offspring on average, the branching process dies out.

Now, let φ be the moment generating function of the offspring distribution. It is more convenient to replace e^t in our original definition with s, so that

φ(s) = φX1(s) = E(s^X1) = ∑_{k=0}^∞ pk s^k.

In combinatorics, this would be exactly the generating function of the sequence pk. Then, the moment generating function of Xn is

φXn(s) = E[s^Xn] = ∑_{k=0}^∞ P(Xn = k) s^k.

We will assume that 0 ≤ s ≤ 1 and observe that, for such s, this power series converges. Let us get a recursive equation for φXn by conditioning on the population count in generation n − 1:

φXn(s) = E[s^Xn]
= ∑_{k=0}^∞ E[s^Xn | Xn−1 = k] P(Xn−1 = k)
= ∑_{k=0}^∞ E[s^{W1+···+Wk}] P(Xn−1 = k)
= ∑_{k=0}^∞ E(s^W1) E(s^W2) · · · E(s^Wk) P(Xn−1 = k)
= ∑_{k=0}^∞ φ(s)^k P(Xn−1 = k)
= φXn−1(φ(s)).

So, φXn is the nth iterate of φ,

φX2(s) = φ(φ(s)), φX3(s) = φ(φ(φ(s))), . . . ,

and we can also write

φXn(s) = φ(φXn−1(s)).
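The composition rule can be checked numerically: for a toy offspring distribution (the pmf [4/9, 4/9, 1/9] below is a made-up example), compute the pmf of X2 directly by conditioning on X1 and convolving, and compare its generating function with φ(φ(s)):

```python
def convolve(a, b):
    """Pmf of the sum of two independent counts with pmfs a and b."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out

p = [4/9, 4/9, 1/9]            # made-up offspring pmf
phi = lambda s: sum(c * s ** k for k, c in enumerate(p))

# pmf of X_2: condition on X_1 = k and add k independent offspring counts
x2 = [0.0] * (2 * (len(p) - 1) + 1)
sum_k = [1.0]                  # pmf of W_1 + ... + W_k, starting with k = 0
for pk in p:
    for i, q in enumerate(sum_k):
        x2[i] += pk * q
    sum_k = convolve(sum_k, p)

s = 0.37                       # arbitrary point in [0, 1]
direct = sum(c * s ** k for k, c in enumerate(x2))
print(abs(direct - phi(phi(s))))  # agreement up to rounding error
```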

Next, we take a closer look at the properties of φ. Since π0 = 0 when p0 = 0, we assume p0 > 0 from now on. Clearly, φ(0) = p0 > 0 and

φ(1) = ∑_{k=0}^∞ pk = 1.

Moreover, for s > 0,

φ′(s) = ∑_{k=1}^∞ k pk s^{k−1} > 0,

so φ is strictly increasing, with φ′(1) = µ.

Finally,

φ′′(s) = ∑_{k=2}^∞ k(k − 1) pk s^{k−2} ≥ 0,

so φ is also convex. The crucial observation is that

δn = φXn(0),

and so δn is obtained by starting at 0 and computing the nth iteration of φ. It is also clear that δn is a nondecreasing sequence (because Xn−1 = 0 implies that Xn = 0). We now consider separately two cases:

• Assume that φ is always above the diagonal, that is, φ(s) ≥ s for all s ∈ [0, 1]. This happens exactly when µ = φ′(1) ≤ 1. In this case, δn converges to 1, and so π0 = 1. This is shown in the right graph of the figure below.

• Now, assume that φ is not always above the diagonal, which happens when µ > 1. In this case, there exists exactly one s < 1 which solves s = φ(s). As δn converges to this solution, we conclude that π0 < 1 is the smallest solution to s = φ(s). This is shown in the left graph of the figure below.

[Figure: cobweb plots of the iterates δ1, δ2, . . . of φ started at 0; left: µ > 1, where they increase to the fixed point π0 < 1; right: µ ≤ 1, where they increase to 1.]

The following theorem is a summary of our findings.

Theorem 14.2. Probability that the branching process dies out.

If µ ≤ 1, then π0 = 1. If µ > 1, then π0 is the smallest solution on [0, 1] to s = φ(s).
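Since δn = φ(δn−1) increases to π0, the smallest fixed point can be approximated by simply iterating φ from 0. A sketch (the offspring distribution p0 = 1/4, p1 = 1/4, p2 = 1/2 below is a made-up example with µ = 5/4 > 1, for which φ(s) = (1 + s + 2s^2)/4 = s has the roots 1/2 and 1):

```python
def extinction_prob(phi, tol=1e-12, max_iter=10_000):
    """Approximate pi_0 as the limit of delta_n = phi(delta_{n-1}), delta_0 = 0."""
    s = 0.0
    for _ in range(max_iter):
        t = phi(s)
        if abs(t - s) < tol:
            return t
        s = t
    return s

phi = lambda s: (1 + s + 2 * s ** 2) / 4   # p0, p1, p2 = 1/4, 1/4, 1/2
print(extinction_prob(phi))                # approximately 0.5
```

For µ ≤ 1 the same iteration climbs to 1, in line with the theorem.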

Example 14.1. Assume that a branching process is started with X0 = k instead of X0 = 1. How does this change the survival probability? The k individuals all evolve independent family trees, so that the probability of eventual death is π0^k. It also follows that

P(the process ever dies out | Xn = k) = π0^k

for every n.

If µ is barely larger than 1, the probability π0 of extinction is quite close to 1. In the context of family names, this means that the ones with already a large number of representatives in the population are at a distinct advantage, as the probability that they die out by chance is much lower than that of those with only a few representatives. Thus, common family names become ever more common, especially in societies that have used family names for a long time. The most famous example of this phenomenon is in Korea, where three family names (Kim, Lee, and Park in English transcription) account for about 45% of the population.

Example 14.2. Assume that

pk = p^k (1 − p), k = 0, 1, 2, . . . .

This means that the offspring distribution is Geometric(1 − p) minus 1. Thus,

µ = 1/(1 − p) − 1 = p/(1 − p)

and, if p ≤ 1/2, π0 = 1. Now suppose that p > 1/2. Then we have to compute

φ(s) = ∑_{k=0}^∞ s^k p^k (1 − p) = (1 − p)/(1 − ps).

The equation φ(s) = s has two solutions, s = 1 and s = (1 − p)/p. Thus, when p > 1/2,

π0 = (1 − p)/p.
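The geometric computation can be double-checked numerically: truncate the series for φ and compare with the closed form (1 − p)/(1 − ps), then verify that (1 − p)/p is indeed a fixed point. A quick sketch with the arbitrary choice p = 0.7:

```python
p = 0.7
phi_series = lambda s: sum(s ** k * p ** k * (1 - p) for k in range(200))
phi_closed = lambda s: (1 - p) / (1 - p * s)

for s in (0.0, 0.3, 0.9):
    assert abs(phi_series(s) - phi_closed(s)) < 1e-9

pi0 = (1 - p) / p
print(abs(phi_closed(pi0) - pi0))   # fixed point: phi(pi_0) = pi_0
```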

Example 14.3. Assume that the offspring distribution is Binomial(3, 1/2). Compute π0. As µ = 3/2 > 1, π0 is given by

φ(s) = 1/8 + (3/8)s + (3/8)s^2 + (1/8)s^3 = s,

with solutions s = 1, −√5 − 2, and √5 − 2. The one that lies in (0, 1), √5 − 2 ≈ 0.2361, is the probability π0.
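For the Binomial(3, 1/2) example one can confirm both the root and the limit of the iterates δn = φ(δn−1):

```python
import math

phi = lambda s: (1 + 3 * s + 3 * s ** 2 + s ** 3) / 8   # Binomial(3, 1/2)

root = math.sqrt(5) - 2
assert abs(phi(root) - root) < 1e-12   # sqrt(5) - 2 solves phi(s) = s

s = 0.0
for _ in range(200):                    # delta_n = phi(delta_{n-1})
    s = phi(s)
print(round(s, 4))                      # -> 0.2361
```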

Problems

1. For a branching process with offspring distribution given by p0 = 1/6, p1 = 1/2, p3 = 1/3, determine (a) the expectation and variance of X9, the population at generation 9, (b) the probability that the branching process dies by generation 3, but not by generation 2, and (c) the probability that the process ever dies out. Then, assume that you start 5 independent copies of this branching process at the same time (equivalently, change X0 to 5), and (d) compute the probability that the process ever dies out.

2. Assume that the offspring distribution of a branching process is Poisson with parameter λ. (a) Determine the expected combined population through generation 10. (b) Determine, with the aid of a computer if necessary, the probability that the process ever dies out for λ = 1/2, λ = 1, and λ = 2.

3. Assume that the offspring distribution of a branching process is given by p1 = p2 = p3 = 1/3. Note that p0 = 0. Solve the following problem for a = 1, 2, 3. Let Yn be the proportion of individuals in generation n (out of the total number of Xn individuals) from families of size a. (A family consists of individuals that are offspring of the same parent from the previous generation.) Compute the limit of Yn as n → ∞.

Solutions to problems

1. For (a), compute µ = 3/2 and σ^2 = 7/2 − (3/2)^2 = 5/4, and plug into the formulas. Then compute

φ(s) = 1/6 + (1/2)s + (1/3)s^3.

For (b),

P(X3 = 0) − P(X2 = 0) = φ(φ(φ(0))) − φ(φ(0)) ≈ 0.0462.

For (c), we solve φ(s) = s, that is, 0 = 2s^3 − 3s + 1 = (s − 1)(2s^2 + 2s − 1), and so π0 = (√3 − 1)/2 ≈ 0.3660.
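The numbers in (b) and (c) are easy to reproduce by iterating φ from 0 (a sketch; delta below holds the successive values of δn = P(Xn = 0)):

```python
phi = lambda s: 1/6 + s/2 + s ** 3 / 3

delta = [0.0]                          # delta_n = phi(delta_{n-1})
for _ in range(3):
    delta.append(phi(delta[-1]))
print(round(delta[3] - delta[2], 4))   # -> 0.0462

s = delta[-1]
for _ in range(500):                   # keep iterating to approach pi_0
    s = phi(s)
print(round(s, 4))                     # -> 0.366
```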

For (d), the answer is π0^5.

2. For (a), µ = λ and

E(X0 + X1 + . . . + X10) = EX0 + EX1 + . . . + EX10 = 1 + λ + · · · + λ^10 = (λ^11 − 1)/(λ − 1),

if λ ≠ 1, and 11 if λ = 1. For (b), if λ ≤ 1, then π0 = 1, but if λ > 1, then π0 is the solution for s ∈ (0, 1) to

e^{λ(s−1)} = s.

This equation cannot be solved analytically, but we can numerically obtain the solution for λ = 2 to get π0 ≈ 0.2032.
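The λ = 2 value can be reproduced with the same fixed-point iteration, now applied to φ(s) = e^{λ(s−1)}:

```python
import math

lam = 2.0
s = 0.0
for _ in range(500):         # delta_n = exp(lam * (delta_{n-1} - 1)) climbs to pi_0
    s = math.exp(lam * (s - 1))
print(round(s, 4))           # -> 0.2032
```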

3. Assuming Xn−1 = k, the number of families at time n is also k. Each of these has, independently, a members with probability pa. If k is large (which it will be for large n, as the branching process cannot die out), then, with overwhelming probability, the number of individuals in families of size a is about a·pa·k, while Xn is about µk. Then, the proportion Yn is about a·pa/µ, which works out to be 1/6, 1/3, and 1/2, for a = 1, 2, and 3.
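The limiting proportions can also be observed in a simulation of the process from Problem 3 (a sketch; the generation count 15 is an arbitrary choice, and the printed fractions fluctuate around 1/6, 1/3, 1/2):

```python
import random
from collections import Counter

random.seed(0)

# Offspring distribution p1 = p2 = p3 = 1/3; the process never dies out.
families = [1]                 # family sizes making up the current generation
for _ in range(15):
    parents = sum(families)    # every current individual becomes a parent
    families = random.choices([1, 2, 3], k=parents)

counts = Counter()
for a in families:
    counts[a] += a             # a family of size a contributes a individuals

total = sum(counts.values())
print({a: round(counts[a] / total, 3) for a in sorted(counts)})
```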