Machine learning and portfolio selections. II.

László (Laci) Györfi
Department of Computer Science and Information Theory,
Budapest University of Technology and Economics, Budapest, Hungary

September 22, 2007

e-mail: gyorfi@szit.bme.hu
www.szit.bme.hu/~gyorfi
www.szit.bme.hu/~oti/portfolio
Dynamic portfolio selection: general case

$\mathbf{x}_i = (x_i(1), \dots, x_i(d))$ is the return vector on day $i$;
$\mathbf{b} = \mathbf{b}_1$ is the portfolio vector for the first day, and $S_0$ is the initial capital:
$$S_1 = S_0 \cdot \langle \mathbf{b}_1, \mathbf{x}_1 \rangle.$$
For the second day, $S_1$ is the new initial capital and the portfolio vector is $\mathbf{b}_2 = \mathbf{b}(\mathbf{x}_1)$:
$$S_2 = S_0 \cdot \langle \mathbf{b}_1, \mathbf{x}_1 \rangle \cdot \langle \mathbf{b}(\mathbf{x}_1), \mathbf{x}_2 \rangle.$$
On the $n$th day a portfolio strategy is $\mathbf{b}_n = \mathbf{b}(\mathbf{x}_1, \dots, \mathbf{x}_{n-1}) = \mathbf{b}(\mathbf{x}_1^{n-1})$, so
$$S_n = S_0 \prod_{i=1}^{n} \langle \mathbf{b}(\mathbf{x}_1^{i-1}), \mathbf{x}_i \rangle = S_0\, e^{n W_n(\mathbf{B})}$$
with the average growth rate
$$W_n(\mathbf{B}) = \frac{1}{n} \sum_{i=1}^{n} \ln \langle \mathbf{b}(\mathbf{x}_1^{i-1}), \mathbf{x}_i \rangle.$$
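As an illustration (mine, not from the slides), the wealth recursion above can be sketched in a few lines of Python; the strategy below is the constantly rebalanced uniform portfolio, and the return data are synthetic:

```python
import math
import random

def wealth_and_growth(returns, strategy, s0=1.0):
    """Run the wealth recursion S_n = S_0 * prod_i <b(x_1^{i-1}), x_i>.

    returns:  list of return vectors x_i (x_i(j) = price ratio of asset j on day i)
    strategy: maps the history x_1^{i-1} to a portfolio vector b (non-negative, sums to 1)
    Returns (S_n, W_n), where W_n = (1/n) ln(S_n / S_0) is the average growth rate.
    """
    s = s0
    for i, x in enumerate(returns):
        b = strategy(returns[:i])                    # b_i may depend on the past only
        s *= sum(bj * xj for bj, xj in zip(b, x))    # one day's growth factor <b, x_i>
        assert s > 0, "portfolio must keep strictly positive wealth"
    w = math.log(s / s0) / len(returns)
    return s, w

# Example: d = 2 assets, uniform (constantly rebalanced) portfolio, synthetic returns.
random.seed(0)
data = [(random.uniform(0.9, 1.1), random.uniform(0.95, 1.06)) for _ in range(250)]
sn, wn = wealth_and_growth(data, lambda past: (0.5, 0.5))
```

With constant returns $\mathbf{x}_i = (1, 1)$ the wealth stays at $S_0$ and $W_n = 0$, which is a convenient sanity check on the recursion.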
Log-optimum portfolio

$\mathbf{X}_1, \mathbf{X}_2, \dots$ are drawn from a vector-valued stationary and ergodic process.
The log-optimum portfolio $\mathbf{B}^* = \{\mathbf{b}^*(\cdot)\}$ satisfies
$$\mathbf{E}\{\ln \langle \mathbf{b}^*(\mathbf{X}_1^{n-1}), \mathbf{X}_n \rangle \mid \mathbf{X}_1^{n-1}\} = \max_{\mathbf{b}(\cdot)} \mathbf{E}\{\ln \langle \mathbf{b}(\mathbf{X}_1^{n-1}), \mathbf{X}_n \rangle \mid \mathbf{X}_1^{n-1}\},$$
where $\mathbf{X}_1^{n-1} = \mathbf{X}_1, \dots, \mathbf{X}_{n-1}$.
Optimality

Algoet and Cover (1988): If $S_n^* = S_n(\mathbf{B}^*)$ denotes the capital after day $n$ achieved by a log-optimum portfolio strategy $\mathbf{B}^*$, then for any portfolio strategy $\mathbf{B}$ with capital $S_n = S_n(\mathbf{B})$ and for any process $\{\mathbf{X}_n\}_{-\infty}^{\infty}$,
$$\limsup_{n \to \infty} \left( \frac{1}{n} \ln S_n - \frac{1}{n} \ln S_n^* \right) \le 0 \quad \text{almost surely};$$
for a stationary ergodic process $\{\mathbf{X}_n\}_{-\infty}^{\infty}$,
$$\lim_{n \to \infty} \frac{1}{n} \ln S_n^* = W^* \quad \text{almost surely, where}$$
$$W^* = \mathbf{E}\left\{ \max_{\mathbf{b}(\cdot)} \mathbf{E}\{\ln \langle \mathbf{b}(\mathbf{X}_{-\infty}^{-1}), \mathbf{X}_0 \rangle \mid \mathbf{X}_{-\infty}^{-1}\} \right\}$$
is the maximal growth rate of any portfolio.
Martingale difference sequences

For the proof of optimality we use the concept of martingale differences.

Definition. Let $\{Z_n\}$ and $\{X_n\}$ be two sequences of random variables such that
$Z_n$ is a function of $X_1, \dots, X_n$, and
$$\mathbf{E}\{Z_n \mid X_1, \dots, X_{n-1}\} = 0 \quad \text{almost surely.}$$
Then $\{Z_n\}$ is called a martingale difference sequence with respect to $\{X_n\}$.
A strong law of large numbers

Chow's theorem: If $\{Z_n\}$ is a martingale difference sequence with respect to $\{X_n\}$ and
$$\sum_{n=1}^{\infty} \frac{\mathbf{E}\{Z_n^2\}}{n^2} < \infty,$$
then
$$\lim_{n \to \infty} \frac{1}{n} \sum_{i=1}^{n} Z_i = 0 \quad \text{a.s.}$$
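A quick numerical illustration (my own, not from the slides): i.i.d. centered variables are a special case of a martingale difference sequence with bounded $\mathbf{E}\{Z_n^2\}$, so Chow's condition holds and the running average tends to 0:

```python
import random

def running_average(zs):
    """Running averages (1/n) * sum_{i<=n} Z_i of a sequence."""
    total, out = 0.0, []
    for n, z in enumerate(zs, start=1):
        total += z
        out.append(total / n)
    return out

random.seed(1)
# Z_n uniform on [-1, 1]: i.i.d. with mean 0, hence a martingale difference
# sequence, with E{Z_n^2} = 1/3, so sum_n E{Z_n^2}/n^2 < infinity.
zs = [random.uniform(-1.0, 1.0) for _ in range(100_000)]
avgs = running_average(zs)
```

After $10^5$ draws the running average is well inside $\pm 0.05$, consistent with the a.s. convergence the theorem asserts.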
A weak law of large numbers

Lemma: If $\{Z_n\}$ is a martingale difference sequence with respect to $\{X_n\}$, then the $\{Z_n\}$ are uncorrelated.

Proof. Put $i < j$. Then
$$\mathbf{E}\{Z_i Z_j\} = \mathbf{E}\{\mathbf{E}\{Z_i Z_j \mid X_1, \dots, X_{j-1}\}\} = \mathbf{E}\{Z_i\, \mathbf{E}\{Z_j \mid X_1, \dots, X_{j-1}\}\} = \mathbf{E}\{Z_i \cdot 0\} = 0.$$

Corollary:
$$\mathbf{E}\left\{\left(\frac{1}{n}\sum_{i=1}^{n} Z_i\right)^2\right\} = \frac{1}{n^2}\sum_{i=1}^{n}\sum_{j=1}^{n} \mathbf{E}\{Z_i Z_j\} = \frac{1}{n^2}\sum_{i=1}^{n} \mathbf{E}\{Z_i^2\} \to 0$$
if, for example, $\mathbf{E}\{Z_i^2\}$ is a bounded sequence.
Constructing a martingale difference sequence

Let $\{Y_n\}$ be an arbitrary sequence such that $Y_n$ is a function of $X_1, \dots, X_n$. Put
$$Z_n = Y_n - \mathbf{E}\{Y_n \mid X_1, \dots, X_{n-1}\}.$$
Then $\{Z_n\}$ is a martingale difference sequence: $Z_n$ is a function of $X_1, \dots, X_n$, and
$$\mathbf{E}\{Z_n \mid X_1, \dots, X_{n-1}\} = \mathbf{E}\{Y_n - \mathbf{E}\{Y_n \mid X_1, \dots, X_{n-1}\} \mid X_1, \dots, X_{n-1}\} = 0$$
almost surely.
Optimality

The log-optimum portfolio $\mathbf{B}^* = \{\mathbf{b}^*(\cdot)\}$:
$$\mathbf{E}\{\ln \langle \mathbf{b}^*(\mathbf{X}_1^{n-1}), \mathbf{X}_n \rangle \mid \mathbf{X}_1^{n-1}\} = \max_{\mathbf{b}(\cdot)} \mathbf{E}\{\ln \langle \mathbf{b}(\mathbf{X}_1^{n-1}), \mathbf{X}_n \rangle \mid \mathbf{X}_1^{n-1}\}.$$
If $S_n^* = S_n(\mathbf{B}^*)$ denotes the capital after day $n$ achieved by a log-optimum portfolio strategy $\mathbf{B}^*$, then for any portfolio strategy $\mathbf{B}$ with capital $S_n = S_n(\mathbf{B})$ and for any process $\{\mathbf{X}_n\}_{-\infty}^{\infty}$,
$$\limsup_{n \to \infty} \left( \frac{1}{n} \ln S_n - \frac{1}{n} \ln S_n^* \right) \le 0 \quad \text{almost surely.}$$
Proof of optimality

$$\frac{1}{n} \ln S_n = \frac{1}{n} \sum_{i=1}^{n} \ln \langle \mathbf{b}(\mathbf{X}_1^{i-1}), \mathbf{X}_i \rangle
= \frac{1}{n} \sum_{i=1}^{n} \mathbf{E}\{\ln \langle \mathbf{b}(\mathbf{X}_1^{i-1}), \mathbf{X}_i \rangle \mid \mathbf{X}_1^{i-1}\}
+ \frac{1}{n} \sum_{i=1}^{n} \left( \ln \langle \mathbf{b}(\mathbf{X}_1^{i-1}), \mathbf{X}_i \rangle - \mathbf{E}\{\ln \langle \mathbf{b}(\mathbf{X}_1^{i-1}), \mathbf{X}_i \rangle \mid \mathbf{X}_1^{i-1}\} \right)$$
and
$$\frac{1}{n} \ln S_n^* = \frac{1}{n} \sum_{i=1}^{n} \mathbf{E}\{\ln \langle \mathbf{b}^*(\mathbf{X}_1^{i-1}), \mathbf{X}_i \rangle \mid \mathbf{X}_1^{i-1}\}
+ \frac{1}{n} \sum_{i=1}^{n} \left( \ln \langle \mathbf{b}^*(\mathbf{X}_1^{i-1}), \mathbf{X}_i \rangle - \mathbf{E}\{\ln \langle \mathbf{b}^*(\mathbf{X}_1^{i-1}), \mathbf{X}_i \rangle \mid \mathbf{X}_1^{i-1}\} \right).$$
In both decompositions the second sum is an average of martingale differences, so it tends to 0 a.s. by Chow's theorem, while the first sum for $\mathbf{B}^*$ dominates the first sum for $\mathbf{B}$ term by term, by the definition of the log-optimum portfolio.
Universally consistent portfolio

These limit relations give rise to the following definition.

Definition. An empirical (data-driven) portfolio strategy $\mathbf{B}$ is called universally consistent with respect to a class $\mathcal{C}$ of stationary and ergodic processes $\{\mathbf{X}_n\}_{-\infty}^{\infty}$ if, for each process in the class,
$$\lim_{n \to \infty} \frac{1}{n} \ln S_n(\mathbf{B}) = W^* \quad \text{almost surely.}$$
Empirical portfolio selection

$$\mathbf{E}\{\ln \langle \mathbf{b}^*(\mathbf{X}_1^{n-1}), \mathbf{X}_n \rangle \mid \mathbf{X}_1^{n-1}\} = \max_{\mathbf{b}(\cdot)} \mathbf{E}\{\ln \langle \mathbf{b}(\mathbf{X}_1^{n-1}), \mathbf{X}_n \rangle \mid \mathbf{X}_1^{n-1}\}$$
For a fixed integer $k > 0$,
$$\mathbf{E}\{\ln \langle \mathbf{b}(\mathbf{X}_1^{n-1}), \mathbf{X}_n \rangle \mid \mathbf{X}_1^{n-1}\} \approx \mathbf{E}\{\ln \langle \mathbf{b}(\mathbf{X}_{n-k}^{n-1}), \mathbf{X}_n \rangle \mid \mathbf{X}_{n-k}^{n-1}\}$$
and
$$\mathbf{b}^*(\mathbf{X}_1^{n-1}) \approx \mathbf{b}_k(\mathbf{X}_{n-k}^{n-1}) = \arg\max_{\mathbf{b}(\cdot)} \mathbf{E}\{\ln \langle \mathbf{b}(\mathbf{X}_{n-k}^{n-1}), \mathbf{X}_n \rangle \mid \mathbf{X}_{n-k}^{n-1}\}.$$
Because of stationarity,
$$\mathbf{b}_k(\mathbf{x}_1^k) = \arg\max_{\mathbf{b}(\cdot)} \mathbf{E}\{\ln \langle \mathbf{b}(\mathbf{x}_1^k), \mathbf{X}_{k+1} \rangle \mid \mathbf{X}_1^k = \mathbf{x}_1^k\} = \arg\max_{\mathbf{b}} \mathbf{E}\{\ln \langle \mathbf{b}, \mathbf{X}_{k+1} \rangle \mid \mathbf{X}_1^k = \mathbf{x}_1^k\},$$
which is the maximization of the regression function
$$m_{\mathbf{b}}(\mathbf{x}_1^k) = \mathbf{E}\{\ln \langle \mathbf{b}, \mathbf{X}_{k+1} \rangle \mid \mathbf{X}_1^k = \mathbf{x}_1^k\}.$$
Regression function

$Y$ is real-valued, $\mathbf{X}$ is the observation vector.
Regression function: $m(\mathbf{x}) = \mathbf{E}\{Y \mid \mathbf{X} = \mathbf{x}\}$.
i.i.d. data: $D_n = \{(\mathbf{X}_1, Y_1), \dots, (\mathbf{X}_n, Y_n)\}$.
Regression function estimate: $m_n(\mathbf{x}) = m_n(\mathbf{x}, D_n)$.
Local averaging estimates:
$$m_n(\mathbf{x}) = \sum_{i=1}^{n} W_{ni}(\mathbf{x}; \mathbf{X}_1, \dots, \mathbf{X}_n)\, Y_i.$$
L. Györfi, M. Kohler, A. Krzyzak, H. Walk (2002). A Distribution-Free Theory of Nonparametric Regression. Springer-Verlag, New York.
Correspondence

$$\mathbf{X} \sim \mathbf{X}_1^k, \qquad Y \sim \ln \langle \mathbf{b}, \mathbf{X}_{k+1} \rangle,$$
$$m(\mathbf{x}) = \mathbf{E}\{Y \mid \mathbf{X} = \mathbf{x}\} \;\sim\; m_{\mathbf{b}}(\mathbf{x}_1^k) = \mathbf{E}\{\ln \langle \mathbf{b}, \mathbf{X}_{k+1} \rangle \mid \mathbf{X}_1^k = \mathbf{x}_1^k\}.$$
Partitioning regression estimate

Partition $\mathcal{P}_n = \{A_{n,1}, A_{n,2}, \dots\}$; $A_n(\mathbf{x})$ is the cell of the partition $\mathcal{P}_n$ into which $\mathbf{x}$ falls:
$$m_n(\mathbf{x}) = \frac{\sum_{i=1}^{n} Y_i\, I_{[\mathbf{X}_i \in A_n(\mathbf{x})]}}{\sum_{i=1}^{n} I_{[\mathbf{X}_i \in A_n(\mathbf{x})]}}.$$
Let $G_n$ be the quantizer corresponding to the partition $\mathcal{P}_n$: $G_n(\mathbf{x}) = j$ if $\mathbf{x} \in A_{n,j}$. With the set of matches
$$I_n(\mathbf{x}) = \{i \le n : G_n(\mathbf{x}) = G_n(\mathbf{X}_i)\},$$
$$m_n(\mathbf{x}) = \frac{\sum_{i \in I_n(\mathbf{x})} Y_i}{|I_n(\mathbf{x})|}.$$
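The match-based form of the estimate can be sketched directly (my own illustration, assuming 1-D data and a uniform grid of cell width $h$ as the partition):

```python
def partitioning_estimate(x, data, h=0.5):
    """Partitioning regression estimate on a uniform 1-D grid of cell width h.

    G_n(x) = floor(x / h) plays the role of the quantizer; the estimate is the
    average of the Y_i whose X_i fall into the same cell as x (the "matches").
    """
    cell = int(x // h)                                          # G_n(x)
    matches = [y for (xi, y) in data if int(xi // h) == cell]   # I_n(x)
    if not matches:                                             # empty cell
        return 0.0
    return sum(matches) / len(matches)

# Toy data with Y = X, so within a cell the estimate is the mean of the X_i there.
data = [(0.1, 0.1), (0.2, 0.2), (0.8, 0.8), (1.3, 1.3)]
est = partitioning_estimate(0.15, data, h=0.5)   # cell [0, 0.5): averages 0.1 and 0.2
```

The return value 0.0 on an empty cell is an arbitrary convention for this sketch; the slides' estimator is simply undefined when no $\mathbf{X}_i$ shares the cell.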
Partitioning-based portfolio selection

Fix $k, \ell = 1, 2, \dots$; let $\mathcal{P}_\ell = \{A_{\ell,j},\, j = 1, 2, \dots, m_\ell\}$ be finite partitions of $\mathbb{R}^d$, and let $G_\ell$ be the corresponding quantizer: $G_\ell(\mathbf{x}) = j$ if $\mathbf{x} \in A_{\ell,j}$. Write $G_\ell(\mathbf{x}_1^n) = G_\ell(\mathbf{x}_1), \dots, G_\ell(\mathbf{x}_n)$, and define the set of matches
$$J_n = \{k < i < n : G_\ell(\mathbf{x}_{i-k}^{i-1}) = G_\ell(\mathbf{x}_{n-k}^{n-1})\}.$$
Then
$$\mathbf{b}^{(k,\ell)}(\mathbf{x}_1^{n-1}) = \arg\max_{\mathbf{b}} \sum_{i \in J_n} \ln \langle \mathbf{b}, \mathbf{x}_i \rangle$$
if the set $J_n$ is non-void, and $\mathbf{b}_0 = (1/d, \dots, 1/d)$ otherwise.
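A rough sketch of $\mathbf{b}^{(k,\ell)}$ for $d = 2$ assets (my own illustration: the quantizer is a simple one-bit-per-asset rule, and the arg max is a 1-D grid search over the simplex rather than a proper convex solver):

```python
import math

def quantize(x):
    """Toy quantizer G_l for d = 2 assets: one bit per asset (return >= 1 or not)."""
    return tuple(int(xj >= 1.0) for xj in x)

def elementary_portfolio(returns, k, grid=100):
    """Sketch of b^(k,l)(x_1^{n-1}) for d = 2: collect the past days whose
    preceding k quantized return vectors match today's, then grid-search the
    portfolio maximizing the sum of log-returns over those days."""
    pattern = [quantize(x) for x in returns[-k:]]        # G_l(x_{n-k}^{n-1})
    matches = [t for t in range(k, len(returns))         # J_n, as list indices
               if [quantize(x) for x in returns[t - k:t]] == pattern]
    if not matches:
        return (0.5, 0.5)                                # b_0 = (1/d, ..., 1/d)
    best_b, best_val = (0.5, 0.5), -math.inf
    for j in range(grid + 1):                            # search the 2-simplex
        b = (j / grid, 1.0 - j / grid)
        val = sum(math.log(b[0] * returns[t][0] + b[1] * returns[t][1])
                  for t in matches)
        if val > best_val:
            best_b, best_val = b, val
    return best_b
```

In practice the inner maximization is a concave program over the simplex and would be solved with a convex optimizer; the grid search here only keeps the sketch dependency-free.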
Elementary portfolios

For fixed $k, \ell = 1, 2, \dots$, the strategies $\mathbf{B}^{(k,\ell)} = \{\mathbf{b}^{(k,\ell)}(\cdot)\}$ are called elementary portfolios.

That is, $\mathbf{b}_n^{(k,\ell)}$ quantizes the sequence $\mathbf{x}_1^{n-1}$ according to the partition $\mathcal{P}_\ell$ and browses through all past appearances of the last-seen quantized string $G_\ell(\mathbf{x}_{n-k}^{n-1})$ of length $k$. Then it designs a fixed portfolio vector according to the returns on the days following the occurrences of the string.
Combining elementary portfolios

How to choose $k$ and $\ell$?
small $k$ or small $\ell$: large bias;
large $k$ and large $\ell$: few matches, large variance.
Machine learning: combination of experts.
N. Cesa-Bianchi and G. Lugosi, Prediction, Learning, and Games. Cambridge University Press, 2006.
Exponential weighting

Combine the elementary portfolio strategies $\mathbf{B}^{(k,\ell)} = \{\mathbf{b}_n^{(k,\ell)}\}$. Let $\{q_{k,\ell}\}$ be a probability distribution on the set of all pairs $(k, \ell)$ such that $q_{k,\ell} > 0$ for all $k, \ell$. For $\eta > 0$, put
$$w_{n,k,\ell} = q_{k,\ell}\, e^{\eta \ln S_{n-1}(\mathbf{B}^{(k,\ell)})};$$
for $\eta = 1$,
$$w_{n,k,\ell} = q_{k,\ell}\, e^{\ln S_{n-1}(\mathbf{B}^{(k,\ell)})} = q_{k,\ell}\, S_{n-1}(\mathbf{B}^{(k,\ell)}),$$
and
$$v_{n,k,\ell} = \frac{w_{n,k,\ell}}{\sum_{i,j} w_{n,i,j}}.$$
The combined portfolio $\mathbf{b}$:
$$\mathbf{b}_n(\mathbf{x}_1^{n-1}) = \sum_{k,\ell} v_{n,k,\ell}\, \mathbf{b}_n^{(k,\ell)}(\mathbf{x}_1^{n-1}).$$
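A small sketch of the $\eta = 1$ mixture (my own illustration): each expert's weight is its prior times its accumulated wealth, and the combined portfolio is the weighted average of the experts' current vectors:

```python
def combine_experts(priors, wealths, expert_portfolios):
    """Exponentially weighted mixture of expert portfolios with eta = 1.

    priors:            q_{k,l}             (positive, summing to 1)
    wealths:           S_{n-1}(B^{(k,l)})  accumulated wealth of each expert
    expert_portfolios: b_n^{(k,l)}         each expert's portfolio for today
    Returns b_n = sum_{k,l} v_{n,k,l} * b_n^{(k,l)} with v proportional to q * S.
    """
    w = [q * s for q, s in zip(priors, wealths)]   # w_{n,k,l} = q_{k,l} * S_{n-1}
    total = sum(w)
    v = [wi / total for wi in w]                   # normalized weights v_{n,k,l}
    d = len(expert_portfolios[0])
    return tuple(sum(v[e] * expert_portfolios[e][j] for e in range(len(v)))
                 for j in range(d))

# Two experts on d = 2 assets: the wealthier expert dominates the mixture.
b = combine_experts(priors=[0.5, 0.5], wealths=[3.0, 1.0],
                    expert_portfolios=[(1.0, 0.0), (0.0, 1.0)])
# v = (0.75, 0.25), so b = (0.75, 0.25)
```

Since the weights $v_{n,k,\ell}$ sum to 1 and each expert portfolio lies in the simplex, the combined vector is again a valid portfolio.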