• Nem Talált Eredményt

Discrete Applied Mathematics

N/A
N/A
Protected

Academic year: 2022

Ossza meg "Discrete Applied Mathematics"

Copied!
8
0
0

Teljes szövegt

(1)

Contents lists available atSciVerse ScienceDirect

Discrete Applied Mathematics

journal homepage:www.elsevier.com/locate/dam

Commute times of random walks on trees

Mokhtar Konsowa

a,Ď

, Fahimah Al-Awadhi

a

, András Telcs

b,

aDepartment of Statistics and Operations Research, Faculty of science, Kuwait University, Safat 13060, P.O.Box 5969, Kuwait

bDepartment of Quantitative Methods, Faculty of Economics, Veszprém, Egyetem utca 10, 8200 Veszprém, Hungary

a r t i c l e i n f o

Article history:

Received 12 November 2011

Received in revised form 4 October 2012 Accepted 7 October 2012

Available online 8 November 2012 Keywords:

Random walk Commute time

Spherically symmetric tree Resistance

a b s t r a c t

In this paper we provide exact formula for the commute times of random walks on spherically symmetric random trees. Using this formula we sharpen some of the results presented in Al-Awadhi et al. to the form of equalities rather than inequalities.

©2012 Elsevier B.V. All rights reserved.

1. Introduction

The commute time is a particular measure of random walks on weighted graphs. It has several nice properties which has been revealed independently partly or fully by many authors, see for example [5,2,3,14,21,1]. It is still in the focus of the research of computer scientists, probabilists and physicists as well. As examples, consider the tasks of graph embedding [9,16,18,22], graph sparsification [20], social network analysis, [13], proximity search [19], collaborative filtering [8], clustering [23], semisupervised learning [24], dimensionality reduction [10] image processing [17], graph labeling [11], and theoretical computer science [4,6]. For an extensive list of literature we refer the reader to [15]. Random walks on random graphs have been subject of permanent interest in the last three decades. Interestingly enough, very little is published on commute times of random walks on random graphs. The present paper studies commute times on very simple random objects, on spherically symmetric random trees,SSRT. Explicit results are presented in the annealed case, averaged commute times over the probability field of trees.

2. Commute times

Consider a random walk on a weighted graphG

= (

V

,

E

)

where a weight (conductivity)cxy

=

cyxis assigned to edge xy

E. The commute time between two verticesrandsis the mean number of steps it takes the random walk to go fromr tosand back torand will be denoted byE

(τ) =

E

r,s

)

. We know, see [5], for a finite connected graph,

E

(τ) =

2

ρ

rs

µ

rs

,

(2.1)

where

ρ = ρ

rsis the effective resistance betweenrandsand

µ = µ

rs

=

1

2

eEce. If the assigned weights are all equal 1, then

E

(τ) =

2

ρ

m

,

Corresponding author. Tel.: +36 303753896; fax: +36 14633157.

E-mail addresses:fahimah@kuc01.kuniv.edu.kw(F. Al-Awadhi),telcs.szit.bme@gmail.com(A. Telcs).

ĎOur co-author and friend Mokhtar Konsowa passed away with tragic suddenness while we were revising this submission.

0166-218X/$ – see front matter©2012 Elsevier B.V. All rights reserved.

doi:10.1016/j.dam.2012.10.006

(2)

wherem

= |

E

|

is the number of undirected edges ofG. We confine our study to investigating the commute time of random walk on spherically symmetric random treesSSRTin which the degree of a vertex depends only on its distance from the rootr.

The second probability space is given on the spherically symmetric trees of infinite heights and the corresponding probability and expectation will be denoted byPandE.

This type of trees is completely determined by its degree sequence

{

dn

;

n

0

}

wherednis the degree of every node at leveln. Let

n

= σ (

d1

,

d2

, . . . ,

dn

)

. Then, for each realizationT

(ω)

of a random treeT,

E

(τ |ℑ

n

) =

2

ρµ.

We are interested in the expected value with respect toP

(ω)

, probability distribution on the set of all possible treesT. In such a case,

E

(τ) =

2E

(ρµ).

It was shown in [1] thatE

(τ) ≤

2E

(ρ)

E

(µ)

. It can easily be seen that this inequality can not be strengthened to equality.

We first note that for positive nondegenerate random variableX, the functionf

(

X

) =

1

X is strictly convex and hence E

(

1

/

X

)

1

/

E

(

X

).

Consider now a treeT of height 1 rooted atrwhich has random degreed0. Thenm

=

d0and

ρ

rs

=

1

/

d0. Hence,E

(τ) =

2E

m

) =

2. On the other hand, 2E

(ρ)

E

(

m

)

2

.

Now we seek for asymptotic equality forE

(τ)

. LetSibe the sphere of radiusiand centered atr; that is the set of vertices at distanceifromr. Let

ρ

i

= ρ (

Si1

,

Si

)

,

µ

i

= µ (

Si1

,

Si

)

, andE

(τ)

is the commute time between the root and the sphere of radiusnshorted in one vertex. Then

E

(τ) =

2E

(ρµ) =

2E



n

i=1

µ

i

 

n

j=1

ρ

j



=

2E

n

i=1 n

j=1

µ

i

ρ

j

=

2E

n

i=1 n

j=1 j̸=i

µ

i

ρ

j

 +

2n

,

(2.2)

where the last step uses the fact that

ρ

i

=

1

i. Belowd+j will denote the outdegree of statej; that isd+j

=

dj

1. Now we concentrate on the double sum.

In

+

Jn

:=

E

n

i=2 i1

j=1

µ

i

ρ

j

 +

E

n1

i=1 n

j=i+1

µ

i

ρ

j

 .

Now,

In

=

E

n

i=2 i1

j=1

µ

i

ρ

j

=

E

n1

j=1

ρ

j

n

i=j+1

µ

i



=

E

n1

j=1

ρ

j

µ 

Sj

,

Sn

=

n1

j=1

E

 ρ

j

µ 

Sj

,

Sn



=

n1

j=1

k=1

E

 ρ

j

µ 

Sj

,

Sn

 | µ

j

=

k

P

µ

j

=

k

=

k=1 n1

j=1

E

1

k

j+1

+ µ

j+2

+ · · · + µ

n

) | µ

j

=

k

P

µ

j

=

k

=

k=1 n1

j=1

E

1

kk

(

d+j

+

d+j d+j+1

+ · · · +

d+jd+j+1

· · ·

d+n1

)

P

µ

j

=

k

=

n1

j=1 n1

x=j

E

Πix=jd+i

=

n1

x=1 x

j=1

E

Πix=jd+i

.

(3)

On the other hand, Jn

=

E

n1

i=1 n

j=i+1

µ

i

ρ

j

=

k=1

E

n1

i=1 n

j=i+1

µ

i

ρ

j

| µ

i

=

k

P

i

=

k

)

=

k=1

E

n1

i=1

k

i+1

+ ρ

i+2

+ · · · + ρ

n

) | µ

i

=

k

P

i

=

k

)

=

E

n1

i=1

k1 k

1 d+i

+

1

d+i d+i+1

+ · · · +

1 d+i d+i+1

· · ·

d+n1



=

n1

i=1

E

1 d+i

+

1

d+i d+i+1

+ · · · +

1 d+i d+i+1

· · ·

d+n1

=

n1

i=1 n1

x=i

E

Πjx=i

1 d+j

=

n1

x=1 x

i=1

E

Πjx=i

1 d+j

 ,

where in the third step the condition

µ

i

=

kis used to calculate the resistance ofkparallel branches starting in leveli. Finally, E

(τ) =

2n

+

2

n1

x=1 x

j=1

E

Πix=jd+i

 +

E

Πix=j

1 d+i



.

(2.3)

3. Spherically symmetric trees

In this section we use

τ

rsto denote the commute time between the rootrof aSSRTΓand the levelnshorted in one node s, while

τ

rxndenotes the commute time betweenrand a leafxnof leveln. We assign a unit resistance to every edge ofΓ. We also used+n to refer the outdegree of each node of levelnandZnto refer to the number of vertices in the leveln. Then Zn

=

Πkn=01dk. We will assume thatd+n’s are independent random variables. We need the following lemma from [12].

Lemma 1. Consider two nonnegative sequences anand bnsuch that

nbnis divergent. Iflimnabn

n

=

L, thenlimn

n k=1ak

n

k=1bk

=

L.

Notation 1. We use an

=

Θ

(

bn

)

forlimnan

bn

ϵ (

0

, ∞ )

.

The following theorem strengthen Theorem 1 of [1] that gives only an upper bound for the commute time.

Theorem 1. Consider a spherically symmetric random treeΓ such that d+n

=

1 with probab.1

qn 2 with probab. qn and

qn

< ∞ .

Then

τ

rs

=

Θ

n2

P-a.s.

and

τ

rxn

=

Θ

n2

P-a.s.

.

Proof. Applying the Borel Cantelli Lemma to the infinite outdegree sequence

{

d+n

}

shows thatp

(

dn

=

1 eventually

) =

1.

As such, there isNsuch thatp

(

dn

=

1

,

n

N

) =

1. Then

µ

rs

=

n

k=1

Zk

1

=

N

k=1

Zk

+

n

k=N+1

Zk

1

=

Θ

(

n

)

a.s.

(4)

whereZk

= |

Sk

|

. Similarly,

ρ

rs

=

Θ

(

n

)

a.s.

Therefore,

µ

rs

ρ

rs

=

Θ

n2

a.s.

and the result follows from Eq.(2.1).

The following lemma is presented in [7, p. 63].

Lemma 2. For any c

>

0, Πjn=1

1

+

c

j

nc

.

The following lemma is presented in [12, p. 66].

Lemma 3. For0

< α <

1,

n

k=1

1 kα

n

1α 1

− α .

Theorem 2.Consider a SSRTΓ such that d+n

=

1 with probab.1

qn 2 with probab. qn where qn

=

min

(

1

,

c

/

n

) ,

c

>

0. Then

E

rs

) =

Θ

n2logn

if c

=

1

,

E

rs

) =

Θ

n2

if c

<

1 E

rs

) =

Θ

nc+1

if c

>

1

.

Moreover, for any c

>

0

,

E

 τ

rxn

 =

Θ

nc+2

.

Proof. We first note that as long as theΘ-asymptotic behavior of the commute time is our concern and sinceInis greater thanJn, it is enough to calculateIn. It follows forc

=

1 that

E

Πix=jd+i

=

Πix=j

1

+

1

i

=

x

+

1 j

.

Then,

In

=

n1

x=1 x

j=1

x

+

1 j

=

n1

x=1

(

x

+

1

) α

xlogx

; α

j

−→

1

n1

x=1

(

x

+

1

)

logx

=

Θ

n2logn

,

where the last equality follows from the proof of Theorem 11 of [1]. Let us recall that inJnthe product of expected values is

x+12

j while inInx+j1. It follows that Jn

=

In

n1

x=1 x

j=1

1 2j

(5)

which means that In

Jn

n1

x=1

1 2logx

.

But

n1

x=1 1

2logx

=

o

n2logn

and since limnn2Ilogn n

∈ (

0

, ∞ )

we have that limnnI2n+logJnn

=

limn 2In

n2logn

∈ (

0

, ∞ ) .

We consider now the casec

<

1

.

It follows fromLemma 2that

Πix=j

1

+

c

i

=

x

c

α

x

jc

α

j

; α

x

−→ α ∈ (

0

, ∞ ) ,

we see

In

=

n1

x=1 x

j=1

xc

α

x

jc

α

j

=

n1

x=1

xc

α

x x

j=1

1

jc

α

j

.

(3.1)

It follows fromLemmas 1and3that In

n1

x=1

α

xxc

λ

x

xc+1

c

+

1

; λ

x

→ λ ∈ (

0

, ∞ )

=

Θ

n2

.

While forc

>

1

,

x

j=1

1 jc

α

j

=

Θ

(

1

)

and then, from Eq.(3.1)andLemma 1, In

=

n1

x=1

xc

α

x x

j=1

1 jc

α

j

n1

x=1

xc

α

x

n1

x=1

xc

.

Hence,

In

=

Θ

nc+1

.

The result forE

rxn

)

follows from the fact that

ρ

rxn

=

nand applyingLemma 2gives

n

k=1

E

(

Zk

) =

n

k=1

Πik=1

1

+

c

i

n

k=1

kc

=

Θ

nc+1

.

The following lemma is analogous to Theorem 3, p. 64 of [12].

Lemma 4. Consider a positive decreasing function f and define a sequence ak, k

=

1

,

2

, . . .

such that f

(

t

) =

at. Let In

=

n

1 f

(

t

)

dt andSn

= 

n

k=1ak. If limnIn

= ∞

then limn

Sn

In

=

1

.

Proof. Sincefis decreasing, then forj

=

2

,

3

, . . .

j+1 j

f

(

x

)

dx

aj

j j1

f

(

x

)

dx

.

(6)

By summing overj, we obtain

n+1 2

f

(

x

)

dx

Sn

a1

n 1

f

(

x

)

dx

.

That is,

In+1

I2

Sn

a1

In

.

(3.2)

It follows also that

n+1 n

f

(

x

)

dx

2 1

f

(

x

)

dx

C

.

As such,

lim

n

n+1 n f

(

x

)

dx

In

=

0

,

which implies that

limn

In+1 In

=

lim

n

1

+

n+1 n f

(

x

)

dx

In

=

1

,

and the result follows from Eq.(3.2).

Theorem 3.Consider a SSRTΓ such that for0

< α <

1 d+n

=

 

 

1 with probab.1

1 nα 2 with probab. 1

nα

.

Then for any

ϵ <

11α

,

there exists N such that for n

N, the following inequalities hold

(

i

)

Θ

nα+1exp



1 1

− α − ϵ

n1α



E

 τ

rxn

 ≤

Θ

nα+1exp



1 1

− α + ϵ

n1α



, (

ii

)

Θ

nαexp



1 1

− α − ϵ

n1α



E

rs

) ≤

Θ

nαexp



1 1

− α + ϵ

n1α



.

Remark 1. The case

α >

1 is covered byTheorem 1and the case

α =

1 is covered byTheorem 2.

Proof. LetSn

=

logE

(

Zn

) = 

n1 k=0log

1

+

1

kα

.

We first show that limn

Sn

n1α

=

1

1

− α .

(3.3)

Since In

=

n 1

log

1

+

1

xα

dx

=

nlog

1

+

1

nα

log 2

+ α 

n 1

1

1

+

xαdx

,

(3.4)

and lim

n

nlog

1

+

1

nα

n1α

=

1

,

and also

limn

n 1

1 1+xαdx n1α

=

lim

n 1 1+nα

(

1

− α)

nα

=

1 1

− α ,

then, from(3.4),

limn

In

n1α

=

1

+ α

1

− α =

1 1

− α .

(7)

It follows fromLemma 4that lim

n

Sn

n1α

=

lim

n

Sn

In

·

In n1α

=

lim

n

In

n1α

=

1 1

− α .

As such,

E

(

Zn

) =

exp

 γ

nn1α

; γ

n

1

1

− α .

(3.5)

That is, for arbitrary small

ϵ >

0, there is a sufficiently largeNsuch that forn

N

,

exp



1 1

− α − ϵ

n1α

E

(

Zn

) ≤

exp



1 1

− α + ϵ

n1α

,

(3.6)

and

n

k=N

exp



1 1

− α − ϵ

k1α

n

k=N

E

(

Zk

) ≤

n

k=N

exp



1 1

− α + ϵ

k1α

 .

Since,

lim

n

n

Nexp



1

1α

+ ϵ 

x1α

dx nαexp



1

1α

+ ϵ 

n1α

 =

1 1

+ ϵ (

1

− α) ,

and

n

Nexp



1

1α

− ϵ 

x1α

dx nαexp



1

1α

− ϵ 

n1α

 =

1 1

− ϵ (

1

− α) ,

then

Θ

nαexp



1 1

− α − ϵ

n1α



n

k=N

E

(

Zk

) ≤

Θ

nαexp



1 1

− α + ϵ

n1α



and the result forE

 τ

rxn

follows since

ρ

rxn

=

n

.

ForE

rs

)

, we follow the same argument of computingInas in the proof ofTheorem 2. From Eq.(3.5), we see that E

(

Πix=jd+i

) =

Πix=j

1

+

1

iα

=

exp

γ

xx1α

− γ

jj1α

; γ

x

1 1

− α .

As such,

x

j=1

E

(

Πix=jd+i

) =

x

j=1

exp

γ

x

(

x1α

) − γ

j

(

j1α

) 

= 

exp

γ

x

x1α



x

j=1

exp

− γ

j

(

j1α

)  .

Using the fact that the two series

anand

2νa2ν have the same convergence behavior, we can see that

x

j=1

exp

− γ

j

(

j1α

) 

=

Θ

(

1

)

and hence,

x

j=1

E

(

Πix=jd+i

) ∼

exp

x

(

x1α

)).

It follows then that for arbitrary small

ϵ >

0, and sufficiently largeN, exp



1 1

− α − ϵ

x1α

x

j=1

E

(

Πix=jd+i

) ≤

exp



1 1

− α + ϵ

x1α

;

x

N

n

x=N

exp



1 1

− α − ϵ

x1α

n1

x=N x

j=1

E

(

Πix=jd+i

) ≤

n

x=N

exp



1 1

− α + ϵ

x1α

.

(8)

The same argument of part (i) shows that Θ

nαexp



1 1

− α − ϵ

n1α



In

Θ

nαexp



1 1

− α + ϵ

n1α



,

and this proves part (ii).

References

[1] F. Al-Awadhi, M. Konsowa, Z. Najeh, Commute times and the effective resistances of random trees, Probability in the Engineering and Informational Sciences 23 (4) (2009) 649–660.

[2] D. Aldous, Random walk covering of some special trees, Journal of Mathematical Analysis and Applications 157 (1991) 271–283.

[3] D. Aldous, J. Fill, Reversible Markov chains and random walks on graphs, in: Monograph. Available at:

http://www.stat.berkeley.edu/~aldous/RWG/book.html(in preparation).

[4] R. ALeliunas, R. Karp, R. Lipton, L. Lovasz, C. Rackoff, Random walks, universal traversal sequences, and complexity of maze problem, in: 20th Annual Symposium on Foundations of Computer Science, 1979, pp. 218–223.

[5] A. Chandra, P. Raghavan, W. Ruzzo, R. Smolensky, P. Tiwari, The electrical resistance of a graph captures its commute and cover times, in: Proceedings of the 21st Annual Symposium on Theory of Computing, ACM Association for Computing Machinery, New York, 1989, pp. 574–586.

[6] C. Cooper, A. Frieze, The cover time of random geometric graphs, SODA (2009) 48–57.

[7] W. Feller, An Introduction to Probability Theory and Its Applications, Vol. I, second ed., John Wiley & Sons, New York, 1968.

[8] A. Fouss, J. Pirotte, M. Renders, M. Saerens, A novel way of computing dissimilarities between nodes of a graph with application to collaborative filtering and subspace projection of the graph nodes, Technical Report IAG WP 06/08, Universite cathlolique de Louvain, 2006.

[9] S. Guattery, Graph embeddings, symmetric real matrices, and generalized inverses, Technical Report, Institute for Computer Applications in Science and Engineering, NASA Langley Research Center, 1998.

[10] J. Ham, D.D. Lee, S. Mika, B. Scholkopf, A kernel view of the dimensionality reduction of manifolds, in: International Conference on Machine Learning, ACM, 2004, pp. 369–376.

[11] M. Herbster, M. Pontil, Prediction on a graph with a perceptron, in: Bernhard Scholkopf, John Platt, Thomas Hoffman (Eds.), New Information Processing Systems, 2006, pp. 577–584.

[12] K. Knopp, Infinite Sequences and Series, Dover Publications, INC., New York, 1956.

[13] Liben-Nowell, J. Kleinberg, The link prediction problem for social networks, in: Proceedings of the 2003 ACM CIKM International Conference of Information and Knowledge Management, CIKM-03, 2003, pp. 556–559.

[14] L. Lovasz, Random walks on graphs: a survey, combinatorics, Bolyai Society Mathematical Studies (1993) 353–397.

[15] U. Luxburg, A. Radl, M. Hein, Hitting and commute times in large graphs are often misleading,arXiv:1003.1266v2[CS.ds], 2011.

[16] H. Qiu, E. Hancock, Graph embedding using commute time, Structural, Syntactic, and Statistical Pattern Recognition (2006) 441–449.

[17] H. Qiu, E.R. Hancock, Image segmentation using commute times, in: Proceedings of the 16th British Machine Vision Conference, 2005, pp. 929–938.

[18] M. Saerens, F. Fouss, L. Yen, P. Dupont, The principal components analysis of a graph and its relationship to spectral clustering, in: Proceedings of the 15th European Conference on Machine Learning, Springer, Berlin, 2004, pp. 371–383.

[19] P. Sarkar, A. Moore, A. Prakash, Fast incremental proximity search in large graphs, in: Proceedings of the 25th International Conference of Machine Learning, 2008, pp. 896–903.

[20] D. Spielman, N. Srivastava, Graph sparsification by effective resistances, in: Proceedings of the 40th Annual Symposium on Theory of Computing, 2008, pp. 563–568.

[21] P. Tetali, Random walks and effective resistances of networks, Journal of Theoretical Probability 4 (1) (1991) 101–109.

[22] D.M. Wittmann, D. Schmidl, F. Blochl, F.J. Theis, Reconstruction of graphs based on random walks, Theoretical Computer Science 410 (38–40) (2009) 3826–3838.

[23] L. Yen, D. Vanvyve, F. Wouters, F. Fouss, M. Verleysen, M. Saerens, Proceedings of the 13th Annual Symposium on Artificial Neural Networks, 2005, pp. 317–324.

[24] D. Zhou, B. Scholkopf, Learning from labeled and unlabeled data using random walks, Pattern Recognition, in: Proceedings of the 26th DAGM Symposium, Berlin, Germany, 2004, pp. 237–244.

Hivatkozások

KAPCSOLÓDÓ DOKUMENTUMOK

For more than four tree degree sequences on a small number of vertices, it is hard to prove the existence of a rainbow matching of size k − 1 within an arbitrary k − 1 of

A polynomial counterpart of the Seiberg-Witten invariant associated with a negative definite plumbed 3-manifold has been proposed by earlier work of the authors. It is provided by

In complex analysis, the behaviour of random power series near the radius of convergence has been thorougly examined, partly due to the following classical problem: if we consider

We will gather theoretical and computational evidence that the proposed edge coloring provides better estimates for the clique size than the node coloring and can be used to divide

P APP , Optimal pebbling and rubbling of graphs with given diameter Discrete Applied Mathematics, 266 (2019) pp.. P APP , Optimal Pebbling Number of the Square Grid Graphs

There is coverage on bits and pieces on certain aspects of WAN optimization such as data compression, which has been widely studied and reported in several books or survey papers,

If on-line picking procedure is applied in the warehouse, it can be unambiguously identified if a certain pallet has been transferred to the picking area and when, where and how

In gossip learning, models perform random walks on the network and are trained on the local data using stochastic gradient descent.. Besides, several models can perform random walks