Expected value of a function of a continuous random variable

We learned in Chapter 17 of Part II that if we make N experiments for a discrete random variable X, substitute the experimental results X₁, X₂, ..., X_N into the function y = t(x), and consider the values t(X₁), t(X₂), ..., t(X_N), then their average is close to the expected value of t(X):

$$\frac{t(X_1) + t(X_2) + \dots + t(X_N)}{N} \approx E(t(X))$$

The same stabilization rule is true in the case of a continuous random variable. Let X be a continuous random variable with density function f(x), and let t(x) be a continuous function. The expected value of the random variable t(X) is calculated by the integral:

$$E(t(X)) = \int_{-\infty}^{\infty} t(x)\, f(x)\, dx$$
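The rule can be checked numerically. The following is a minimal sketch, assuming that X follows the exponential distribution with parameter λ = 2 and that t(x) = x²; both choices, and all function names, are illustrative assumptions and are not taken from the text. It compares the average of t over simulated values with a midpoint-rule approximation of the integral.

```python
import math
import random

LAMBDA = 2.0  # assumed parameter of the exponential distribution


def t(x):
    # Illustrative choice of the function: t(x) = x^2
    return x * x


def f(x):
    # Density function of the exponential distribution with parameter LAMBDA
    return LAMBDA * math.exp(-LAMBDA * x) if x >= 0 else 0.0


def average_of_t(n, seed=0):
    # Average of t(X_1), ..., t(X_n) for n simulated experimental results
    rng = random.Random(seed)
    return sum(t(rng.expovariate(LAMBDA)) for _ in range(n)) / n


def integral_of_t_times_f(upper=20.0, steps=200_000):
    # Midpoint rule on [0, upper]; the density is zero below 0 and
    # negligible above `upper` for this choice of LAMBDA
    dx = upper / steps
    return sum(t((i + 0.5) * dx) * f((i + 0.5) * dx) for i in range(steps)) * dx


avg = average_of_t(200_000)
integral = integral_of_t_times_f()
print(avg, integral)
```

For these choices the integral equals 2/λ² = 0.5, so both printed numbers should be close to 0.5.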

Motivation of the declared formula. We give here some motivation for the declared formula of the expected value of t(X). For this purpose, let us take a continuous random variable X and a continuous function t(x), and let X₁, X₂, ..., X_N be the experimental results for X.

We will show that the average of the function values of the experimental results is close to the above integral:

$$\frac{t(X_1) + t(X_2) + \dots + t(X_N)}{N} \approx \int_{-\infty}^{\infty} t(x)\, f(x)\, dx$$

In order to show this, we choose the fixed points ..., y_i, y_{i+1}, ... on the real line so that all the differences ∆y_i = y_{i+1} − y_i are small. Then we introduce a discrete random variable Y so that the value of Y is derived from the value of X by rounding down to the closest y_i value on the left side of X, that is,

$$Y = y_i \quad \text{if and only if} \quad y_i \le X < y_{i+1}$$

Applying the rounding operation to each experimental result, we get the values Y₁, Y₂, ..., Y_N.


162 PROBABILITY THEORY WITH SIMULATIONS

Since all the differences ∆y_i = y_{i+1} − y_i are small, and the function t(x) is continuous, we have that

$$\frac{t(X_1) + t(X_2) + \dots + t(X_N)}{N} \approx \frac{t(Y_1) + t(Y_2) + \dots + t(Y_N)}{N}$$
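The rounding step can be sketched as follows. This is only an illustration under assumed choices: an equally spaced grid y_i = i·∆y with ∆y = 0.01, the exponential distribution with parameter 1 for X, and t(x) = x². None of these choices comes from the text.

```python
import math
import random


def round_down(x, dy):
    # Grid point y_i = i * dy with y_i <= x < y_i + dy
    return math.floor(x / dy) * dy


def t(x):
    # Illustrative continuous function
    return x * x


def averages(n=100_000, dy=0.01, seed=1):
    rng = random.Random(seed)
    xs = [rng.expovariate(1.0) for _ in range(n)]   # results X_1, ..., X_n
    ys = [round_down(x, dy) for x in xs]            # rounded values Y_1, ..., Y_n
    avg_x = sum(t(x) for x in xs) / n
    avg_y = sum(t(y) for y in ys) / n
    return avg_x, avg_y


avg_x, avg_y = averages()
print(avg_x, avg_y)
```

Making ∆y smaller should bring the two averages closer together, which is what the continuity of t(x) is used for in the argument.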

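Obviously, Y is a discrete random variable with the possible values ..., y_i, ..., so that the probability of y_i is

$$p_i = \int_{y_i}^{y_{i+1}} f(x)\, dx \approx f(y_i)\, \Delta y_i$$

and thus, the expected value of t(Y) is

$$E(t(Y)) = \sum_i t(y_i)\, p_i \approx \sum_i t(y_i)\, f(y_i)\, \Delta y_i \approx \int_{-\infty}^{\infty} t(x)\, f(x)\, dx$$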
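To see how close these sums are to the integral, here is a small sketch under assumed choices: X exponential with parameter 1, t(x) = x², and an equally spaced grid with ∆y = 0.001 cut off at 40. It computes the sum of t(y_i)·p_i with the exact interval probabilities p_i = F(y_{i+1}) − F(y_i), and the sum of t(y_i)·f(y_i)·∆y_i.

```python
import math

LAMBDA = 1.0  # assumed parameter of the exponential distribution


def F(x):
    # Distribution function of the exponential distribution
    return 1.0 - math.exp(-LAMBDA * x) if x > 0 else 0.0


def f(x):
    # Density function of the exponential distribution
    return LAMBDA * math.exp(-LAMBDA * x) if x >= 0 else 0.0


def t(x):
    # Illustrative continuous function
    return x * x


def discretized_sums(dy=0.001, upper=40.0):
    n = int(round(upper / dy))
    sum_with_p = 0.0        # sum of t(y_i) * p_i
    sum_with_density = 0.0  # sum of t(y_i) * f(y_i) * dy
    for i in range(n):
        y = i * dy
        p = F(y + dy) - F(y)  # exact probability of the interval [y_i, y_{i+1})
        sum_with_p += t(y) * p
        sum_with_density += t(y) * f(y) * dy
    return sum_with_p, sum_with_density


sum_with_p, sum_with_density = discretized_sums()
print(sum_with_p, sum_with_density)
```

For this distribution the integral of x²·f(x) equals 2, so both sums should be close to 2.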

We know that the average of the function values of the experimental results of a discrete random variable is close to its expected value, so

$$\frac{t(Y_1) + t(Y_2) + \dots + t(Y_N)}{N} \approx \sum_i t(y_i)\, p_i$$

From all these approximations we get that

$$\frac{t(X_1) + t(X_2) + \dots + t(X_N)}{N} \approx \int_{-\infty}^{\infty} t(x)\, f(x)\, dx$$

The expected value of Xⁿ is called the nth moment of X:

$$E(X^n) = \int_{-\infty}^{\infty} x^n f(x)\, dx$$

Specifically, the second moment of X is:

$$E(X^2) = \int_{-\infty}^{\infty} x^2 f(x)\, dx$$

The expected value of (X − c)ⁿ is called the nth moment about the point c:

$$E((X - c)^n) = \int_{-\infty}^{\infty} (x - c)^n f(x)\, dx$$

Specifically, the second moment about the point c is:

$$E((X - c)^2) = \int_{-\infty}^{\infty} (x - c)^2 f(x)\, dx$$

Second moment of some continuous distributions:

1. Uniform distribution on an interval (A;B):

$$E(X^2) = \frac{A^2 + AB + B^2}{3}$$
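The formula for the uniform distribution can be compared with a simulation. The sketch below assumes the interval (1; 4) purely for illustration; the function names are hypothetical.

```python
import random


def second_moment_formula(a, b):
    # (A^2 + AB + B^2) / 3 for the uniform distribution on (A; B)
    return (a * a + a * b + b * b) / 3.0


def second_moment_simulated(a, b, n=200_000, seed=2):
    # Average of the squares of n simulated uniform values
    rng = random.Random(seed)
    return sum(rng.uniform(a, b) ** 2 for _ in range(n)) / n


exact = second_moment_formula(1.0, 4.0)
simulated = second_moment_simulated(1.0, 4.0)
print(exact, simulated)
```

With A = 1 and B = 4 the formula gives (1 + 4 + 16)/3 = 7, and the simulated average should be close to this value.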

tankonyvtar.ttk.bme.hu Vetier András, BME

Part III. Continous distributions in one-dimension 163

Here we recognize that the integral in the last line is the expected value of the exponential distribution with parameter λ, which is equal to 1/λ, so we get the result as it was stated.

File to study the expected value of several functions of RND.

Demonstration file: E(t(RND)), expected value of functions of a random number 200-58-00


Section 49 ***Median

In this chapter, we learn about the notion of the median, which is a kind of a "center" of a data-set or of a distribution. In the next chapter, we will learn the notion of the expected value also for continuous random variables and distributions, which is a kind of "center", too, and then we will be able to compare them.

If a data-set consists of n numbers, then we may find the smallest of these numbers, let us denote it by $z^*_1$; the second smallest, let us denote it by $z^*_2$; the third smallest, let us denote it by $z^*_3$; and so on; the kth smallest, let us denote it by $z^*_k$; and so on; the nth smallest, which is actually the largest, let us denote it by $z^*_n$.

Using Excel. In Excel, for a data-set, the function SMALL (in Hungarian: KICSI) can be used to find the kth smallest element in an array:

$z^*_k$ = SMALL(array;k)

Now we may arrange the numbers $z_1, z_2, \dots, z_n$ in increasing order: $z^*_1, z^*_2, \dots, z^*_n$. If the number n is odd, then there is a well-defined center element in the list $z^*_1, z^*_2, \dots, z^*_n$. This center element is called the median of the data-set. If n is even, then there are two center elements. In this case, the average of these two center elements is the median of the data-set.

Using Excel. In Excel, the function MEDIAN (in Hungarian: MEDIÁN) is used to calculate the median of a data-set:

MEDIAN(array)
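Outside Excel, the same two operations can be sketched in a few lines. The functions below are illustrative helpers (their names are hypothetical); they follow the definitions above, with k counted from 1 as in SMALL.

```python
import statistics


def kth_smallest(data, k):
    # k-th smallest element of the data-set, k = 1, ..., n
    if not 1 <= k <= len(data):
        raise ValueError("k must be between 1 and the size of the data-set")
    return sorted(data)[k - 1]


def median_of_data(data):
    # Center element for odd n, average of the two center elements for even n
    z = sorted(data)
    n = len(z)
    mid = n // 2
    if n % 2 == 1:
        return z[mid]
    return (z[mid - 1] + z[mid]) / 2


data = [5, 1, 4, 2]
print(kth_smallest(data, 2), median_of_data(data), statistics.median(data))
```

The standard library function `statistics.median` uses the same convention for an even number of elements, so it can serve as a cross-check.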

The median of a continuous random variable or distribution is the value c for which both the probability of being less than c and the probability of being greater than c are equal to 1/2:

$$P((-\infty, c)) = \frac{1}{2}$$

$$P((c, \infty)) = \frac{1}{2}$$

The median is the solution to the equation

$$F(x) = \frac{1}{2}$$

For a continuous distribution, this equation has a solution. If the inverse of F(x) exists, and it is denoted by F⁻¹(y), then

$$c = F^{-1}\left(\frac{1}{2}\right)$$

Using the density function, the median can obviously be characterized by the property

$$\int_{-\infty}^{c} f(x)\, dx = \frac{1}{2}$$
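As a worked illustration, consider the exponential distribution with parameter λ, for which F(x) = 1 − e^(−λx) for x > 0 and therefore F⁻¹(y) = −ln(1 − y)/λ. The sketch below assumes λ = 2 (an arbitrary choice) and checks the median both through F and through a midpoint-rule integral of the density.

```python
import math


def F(x, lam):
    # Distribution function of the exponential distribution
    return 1.0 - math.exp(-lam * x) if x > 0 else 0.0


def F_inverse(y, lam):
    # Inverse of F for 0 <= y < 1
    return -math.log(1.0 - y) / lam


def density_integral_up_to(c, lam, steps=100_000):
    # Midpoint rule for the integral of the density from 0 to c
    # (the density is zero below 0)
    dx = c / steps
    return sum(lam * math.exp(-lam * (i + 0.5) * dx) for i in range(steps)) * dx


lam = 2.0  # assumed parameter
c = F_inverse(0.5, lam)
print(c, F(c, lam), density_integral_up_to(c, lam))
```

The median obtained this way is ln(2)/λ, and both checks should return a value very close to 1/2.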

The notion of the median can be defined for discrete distributions, too, but the definition is a little bit more complicated. The median of a discrete random variable or distribution is a value c for which both the probability of being less than or equal to c and the probability of being greater than or equal to c are at least 1/2:

$$P((-\infty, c]) \ge \frac{1}{2} \qquad P([c, \infty)) \ge \frac{1}{2}$$
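A short sketch can make the discrete case concrete. It assumes the condition in the form P(X ≤ c) ≥ 1/2 and P(X ≥ c) ≥ 1/2, and uses a fair die as an illustrative distribution; the helper name `is_median` is hypothetical.

```python
from fractions import Fraction


def is_median(pmf, c):
    # pmf maps each possible value to its probability.
    # Assumed condition: P(X <= c) >= 1/2 and P(X >= c) >= 1/2.
    at_most_c = sum(p for value, p in pmf.items() if value <= c)
    at_least_c = sum(p for value, p in pmf.items() if value >= c)
    half = Fraction(1, 2)
    return at_most_c >= half and at_least_c >= half


# Fair die: each of the values 1, ..., 6 has probability 1/6
die = {k: Fraction(1, 6) for k in range(1, 7)}
print([c for c in range(1, 7) if is_median(die, c)])
```

For the fair die every value between 3 and 4 satisfies the condition, which shows that a discrete distribution may have more than one median.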

In a long sequence of experiments, the median of the experimental results for a random variable stabilizes around the median of the distribution of the random variable: if X₁, X₂, ..., X_N are experimental results for a random variable X, and N is large, then the median of the data-set X₁, X₂, ..., X_N, the so-called experimental median, is close to the median of the distribution of the random variable.
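This stabilization can be observed in a small simulation. The sketch assumes the exponential distribution with parameter 1, whose median is ln 2; the sample sizes and the seed are arbitrary choices.

```python
import math
import random
import statistics


def experimental_median(n, lam=1.0, seed=3):
    # Median of n simulated experimental results
    rng = random.Random(seed)
    return statistics.median(rng.expovariate(lam) for _ in range(n))


theoretical = math.log(2)  # median of the exponential distribution, lam = 1
for n in (100, 10_000, 100_000):
    print(n, experimental_median(n), theoretical)
```

As N grows, the experimental median should typically move closer to the theoretical value.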

Here is a file to study the notion of the median.

Demonstration file: Median of the exponential distribution 200-57-00

Minimal property of the median. If X is a continuous random variable with the density function f(x), and c is a constant, then the expected value of the distance between X and c is

$$E(|X - c|) = \int_{-\infty}^{\infty} |x - c|\, f(x)\, dx$$


This integral is minimal if c is the median.

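Proof. Let us denote the value of the integral, which depends on c, by h(c):

$$h(c) = \int_{-\infty}^{\infty} |x - c|\, f(x)\, dx = \int_{-\infty}^{c} (c - x)\, f(x)\, dx + \int_{c}^{\infty} (x - c)\, f(x)\, dx$$

$$h(c) = c\, F(c) - \int_{-\infty}^{c} x\, f(x)\, dx + \int_{c}^{\infty} x\, f(x)\, dx - c\, (1 - F(c))$$

Let us take the derivative of each term with respect to c:

$$\left( c\, F(c) \right)' = F(c) + c\, f(c)$$

$$\left( -\int_{-\infty}^{c} x\, f(x)\, dx \right)' = -c\, f(c)$$

$$\left( \int_{c}^{\infty} x\, f(x)\, dx \right)' = -c\, f(c)$$

$$\left( -c\, (1 - F(c)) \right)' = -(1 - F(c)) + c\, f(c)$$

Now adding the 6 terms on the right sides, the terms c f(c) cancel each other, and what we get is

$$h'(c) = 2F(c) - 1$$

Since the equation

$$2F(c) - 1 = 0$$

is equivalent to the equation

$$F(c) = \frac{1}{2}$$

and the solution to this equation is the median, we get that

$$h'(c) = 2F(c) - 1 = 0 \quad \text{if } c = \text{median}$$

$$h'(c) = 2F(c) - 1 < 0 \quad \text{if } c < \text{median}$$

$$h'(c) = 2F(c) - 1 > 0 \quad \text{if } c > \text{median}$$

which means that the minimum of h(c) occurs if c = median.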
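The minimal property can also be explored numerically. The sketch below replaces the integral by the average of |x − c| over simulated values, which is an approximation and not part of the proof; the exponential distribution with parameter 1 and the grid of candidate values for c are assumptions made for illustration.

```python
import math
import random


def mean_abs_distance(xs, c):
    # Average of |x - c| over the simulated values; approximates E(|X - c|)
    return sum(abs(x - c) for x in xs) / len(xs)


rng = random.Random(4)
xs = [rng.expovariate(1.0) for _ in range(10_000)]

# Candidate values of c on a grid from 0 to 3 with step 0.02
candidates = [i * 0.02 for i in range(151)]
best_c = min(candidates, key=lambda c: mean_abs_distance(xs, c))

print(best_c, math.log(2))
```

The candidate with the smallest average distance should lie close to ln 2, the median of this distribution.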


Section 50
