• Nem Talált Eredményt

9 Convergence in probability

One of the goals of probability theory is to extricate a useful deterministic quantity out of a random situation. This is typically possible when a large number of random effects cancel each other out, so some limit is involved. In this chapter we consider the following setting: given a sequence of random variables, Y1, Y2, . . ., we want to show that, when nis large, Yn is approx-imately f(n), for some simple deterministic function f(n). The meaning of “approximately” is what we now make clear.

A sequenceY1, Y2, . . . of random variables converges to a number ain probability if, as n→ ∞, P(|Yn−a| ≤ǫ) converges to 1, for any fixedǫ >0. This is equivalent to P(|Yn−a|> ǫ)→0 as n→ ∞, for any fixed ǫ >0.

Example 9.1. Toss a fair coinntimes, independently. LetRn be the “longest run of Heads,”

i.e., the longest sequence of consecutive tosses of Heads. For example, ifn= 15 and the tosses come out

HHTTHHHTHTHTHHH, thenRn= 3. We will show that, asn→ ∞,

Rn

log2n →1,

in probability. This means that, to a first approximation, one should expect about 20 consecutive Heads somewhere in a million tosses.

To solve a problem such as this, we need to find upper bounds for probabilities that Rn is large and that it is small, i.e., forP(Rn≥k) andP(Rn≤k), for appropriately chosenk. Now, for arbitrary k,

P(Rn≥k) = P(kconsecutive Heads start at some i, 0≤i≤n−k+ 1)

= P(

nk+1

[

i=1

{iis the first Heads in a succession of at least kHeads})

≤ n· 1 2k.

For the lower bound, divide the string of size n into disjoint blocks of size k. There are

nk⌋ such blocks (ifn is not divisible by k, simply throw away the leftover smaller block at the end). Then, Rn ≥ k as soon as one of the blocks consists of Heads only; different blocks are independent. Therefore,

P(Rn< k)≤

1− 1 2k

nk

≤exp

− 1 2k

jn k

k ,

using the famous inequality 1−x≤ex, valid for allx.

Below, we will use the following trivial inequalities, valid for any real number x≥2: ⌊x⌋ ≥ x−1, ⌈x⌉ ≤x+ 1, x−1≥ x2, and x+ 1≤2x.

To demonstrate that logRn

2n →1,in probability, we need to show that, for any ǫ >0,

A little fussing in the proof comes from the fact that (1±ǫ) log2n are not integers. This is common in such problems. To prove (1), we plug k=⌊(1 +ǫ) log2n⌋ into the upper bound to

The most basic tool in proving convergence in probability is the Chebyshev inequality: ifX is a random variable with EX=µand Var(X) =σ2, then

P(|X−µ| ≥k)≤ σ2 k2,

for anyk >0. We proved this inequality in the previous chapter and we will use it to prove the next theorem.

Theorem 9.1. Connection between variance and convergence in probability.

Assume that Yn are random variables and thatais a constant such that

EYn→a, Var(Yn)→0, as n→ ∞. Then,

Yn→a, as n→ ∞, in probability.

Proof. Fix anǫ >0. Ifnis so large that

|EYn−a|< ǫ/2, then

P(|Yn−a|> ǫ) ≤ P(|Yn−EYn|> ǫ/2)

≤ 4Var(Yn) ǫ2

→ 0,

asn→ ∞. Note that the second inequality in the computation is the Chebyshev inequality.

This is most often applied to sums of random variables. Let Sn=X1+. . .+Xn,

where Xi are random variables with finite expectation and variance. Then, without any inde-pendence assumption,

ESn=EX1+. . .+EXn and

E(Sn2) =

n

X

i=1

EXi2+X

i6=j

E(XiXj),

Var(Sn) =

n

X

i=1

Var(Xi) +X

i6=j

Cov(Xi, Xj).

Recall that

Cov(X1, Xj) =E(XiXj)−EXiEXj and

Var(aX) =a2Var(X).

Moreover, ifXi are independent,

Var(X1+. . .+Xn) = Var(X1) +. . .+ Var(Xn).

Continuing with the review, let us reformulate and prove again the most famous convergence in probability theorem. We will use the common abbreviation i. i. d. for independent identically distributed random variables.

Theorem 9.2. Weak law of large numbers. LetX, X1, X2, . . . be i. i. d. random variables with with EX=µ and Var(X) =σ2<∞. Let Sn=X1+. . .+Xn. Then, as n→ ∞,

Sn

n →µ in probability.

Proof. Let Yn= Snn. We have EYn=µand Var(Yn) = 1

n2 Var(Sn) = 1

n2n σ2 = σ2 n. Thus, we can simply apply the previous theorem.

Example 9.2. We analyze a typical “investment” (the accepted euphemism for gambling on financial markets) problem. Assume that you have two investment choices at the beginning of each year:

• a risk-free “bond” which returns 6% per year; and

• a risky “stock” which increases your investment by 50% with probability 0.8 and wipes it out with probability 0.2.

Putting an amounts in the bond, then, gives you 1.06safter a year. The same amount in the stock gives you 1.5swith probability 0.8 and 0 with probability 0.2; note that the expected value is 0.8·1.5s= 1.2s >1.06s. We will assume year-to-year independence of the stock’s return.

We will try to maximize the return to our investment by “hedging.” That is, we invest, at the beginning of each year, a fixed proportion x of our current capital into the stock and the remaining proportion 1−x into the bond. We collect the resulting capital at the end of the year, which is simultaneously the beginning of next year, and reinvest with the same proportion x. Assume that our initial capital is x0.

It is important to note that the expected value of the capital at the end of the year is maximized when x= 1, but by using this strategy you will eventually lose everything. Let Xn be your capital at the end of yearn. Define the average growth rate of your investment as

λ= lim

We will expressλin terms ofx; in particular, we will show that it is a nonrandom quantity.

Let Ii =I{stock goes up in yeari}. These are independent indicators withEIi= 0.8.

Xn = Xn−1(1−x)·1.06 +Xn−1·x·1.5·In

= Xn−1(1.06(1−x) + 1.5x·In) and so we can unroll the recurrence to get

Xn=x0(1.06(1−x) + 1.5x)Sn((1−x)1.06)nSn,

in probability, as n→ ∞. The last expression defines λ as a function ofx. To maximize this, we set dx = 0 to get

0.8·0.44

1.06 + 0.44x = 0.2 1−x. The solution isx= 227, which gives λ≈8.1%.

Example 9.3. Distributenballs independently at random intonboxes. LetNnbe the number of empty boxes. Show that 1nNn converges in probability and identify the limit.

Note that

Moreover,

1. Assume that nmarried couples (amounting to 2n people) are seated at random on 2n seats around a round table. Let T be the number of couples that sit together. Determine ET and Var(T).

2. There are n birds that sit in a row on a wire. Each bird looks left or right with equal probability. Let N be the number of birds not seen by any neighboring bird. Determine, with proof, the constantc so that, asn→ ∞, 1nN →c in probability.

3. Recall the coupon collector problem: sample fromncards, with replacement, indefinitely, and let N be the number of cards you need to get so that each of ndifferent cards are represented.

Find a sequencean so that, asn→ ∞,N/an converges to 1 in probability.

4. Kings and Lakers are playing a “best of seven” playoff series, which means they play until one team wins four games. Assume Kings win every game independently with probability p.

(There is no difference between home and away games.) LetN be the number of games played.

Compute EN and Var(N).

5. An urn contains n red and m black balls. Select balls from the urn one by one without replacement. LetX be the number of red balls selected before any black ball, and let Y be the number of red balls between the first and the second black one. Compute EX and EY.

Solutions to problems

1. LetIi be the indicator of the event that theith couple sits together. Then,T =I1+· · ·+In. Moreover,

EIi = 2

2n−1, E(IiIj) = 22(2n−3)!

(2n−1)! = 4

(2n−1)(2n−2), for any iand j6=i. Thus,

ET = 2n 2n−1 and

E(T2) =ET+n(n−1) 4

(2n−1)(2n−2) = 4n 2n−1, so

Var(T) = 4n

2n−1− 4n2

(2n−1)2 = 4n(n−1) (2n−1)2.

2. LetIi indicate the event that birdiis not seen by any other bird. Then, EIi is 12 ifi= 1 or i=n and 14 otherwise. It follows that

EN = 1 +n−2

4 = n+ 2 4 .

Furthermore, Ii and Ij are independent if |i−j| ≥ 3 (two birds that have two or more birds between them are observed independently). Thus, Cov(Ii, Ij) = 0 if|i−j| ≥3. AsIi and Ij are indicators, Cov(Ii, Ij)≤1 for any iand j. For the same reason, Var(Ii)≤1. Therefore,

Var(N) =X

i

Var(Ii) +X

i6=j

Cov(Ii, Ij)≤n+ 4n= 5n.

Clearly, if M = 1nN, then EM = n1EN → 14 and Var(M) = n12Var(N) → 0. It follows that c= 14.

3. LetNi be the number of coupons needed to getidifferent coupons after havingi−1 different ones. Then N = N1+. . .+Nn, and Ni are independent Geometric with success probability

ni+1 and the final result can be obtained by plugging in EIi and by the standard formula

Var(N) =E(N2)−(EN)2.

5. Imagine the balls ordered in a row where the ordering specifies the sequence in which they are selected. LetIi be the indicator of the event that theith red ball is selected before any black ball. Then,EIi= m+11 , the probability that in a random ordering of theith red ball and allm black balls, the red comes first. As X=I1+. . .+In,EX = m+1n .

Now, letJi be the indicator of the event that theith red ball is selected between the first and the second black one. Then,EJi is the probability that the red ball is second in the ordering of the above m+ 1 balls, soEJi=EIi, and EY =EX.