Contents lists available atSciVerse ScienceDirect
Discrete Applied Mathematics
journal homepage:www.elsevier.com/locate/dam
Commute times of random walks on trees
Mokhtar Konsowa
a,Ď, Fahimah Al-Awadhi
a, András Telcs
b,∗aDepartment of Statistics and Operations Research, Faculty of science, Kuwait University, Safat 13060, P.O.Box 5969, Kuwait
bDepartment of Quantitative Methods, Faculty of Economics, Veszprém, Egyetem utca 10, 8200 Veszprém, Hungary
a r t i c l e i n f o
Article history:
Received 12 November 2011
Received in revised form 4 October 2012 Accepted 7 October 2012
Available online 8 November 2012 Keywords:
Random walk Commute time
Spherically symmetric tree Resistance
a b s t r a c t
In this paper we provide exact formula for the commute times of random walks on spherically symmetric random trees. Using this formula we sharpen some of the results presented in Al-Awadhi et al. to the form of equalities rather than inequalities.
©2012 Elsevier B.V. All rights reserved.
1. Introduction
The commute time is a particular measure of random walks on weighted graphs. It has several nice properties which has been revealed independently partly or fully by many authors, see for example [5,2,3,14,21,1]. It is still in the focus of the research of computer scientists, probabilists and physicists as well. As examples, consider the tasks of graph embedding [9,16,18,22], graph sparsification [20], social network analysis, [13], proximity search [19], collaborative filtering [8], clustering [23], semisupervised learning [24], dimensionality reduction [10] image processing [17], graph labeling [11], and theoretical computer science [4,6]. For an extensive list of literature we refer the reader to [15]. Random walks on random graphs have been subject of permanent interest in the last three decades. Interestingly enough, very little is published on commute times of random walks on random graphs. The present paper studies commute times on very simple random objects, on spherically symmetric random trees,SSRT. Explicit results are presented in the annealed case, averaged commute times over the probability field of trees.
2. Commute times
Consider a random walk on a weighted graphG
= (
V,
E)
where a weight (conductivity)cxy=
cyxis assigned to edge xy∈
E. The commute time between two verticesrandsis the mean number of steps it takes the random walk to go fromr tosand back torand will be denoted byE(τ) =
E(τ
r,s)
. We know, see [5], for a finite connected graph,E
(τ) =
2ρ
rsµ
rs,
(2.1)where
ρ = ρ
rsis the effective resistance betweenrandsandµ = µ
rs=
12
e∈Ece. If the assigned weights are all equal 1, then
E
(τ) =
2ρ
m,
∗Corresponding author. Tel.: +36 303753896; fax: +36 14633157.
E-mail addresses:fahimah@kuc01.kuniv.edu.kw(F. Al-Awadhi),telcs.szit.bme@gmail.com(A. Telcs).
ĎOur co-author and friend Mokhtar Konsowa passed away with tragic suddenness while we were revising this submission.
0166-218X/$ – see front matter©2012 Elsevier B.V. All rights reserved.
doi:10.1016/j.dam.2012.10.006
wherem
= |
E|
is the number of undirected edges ofG. We confine our study to investigating the commute time of random walk on spherically symmetric random treesSSRTin which the degree of a vertex depends only on its distance from the rootr.The second probability space is given on the spherically symmetric trees of infinite heights and the corresponding probability and expectation will be denoted byPandE.
This type of trees is completely determined by its degree sequence
{
dn;
n≥
0}
wherednis the degree of every node at leveln. Letℑ
n= σ (
d1,
d2, . . . ,
dn)
. Then, for each realizationT(ω)
of a random treeT,E
(τ |ℑ
n) =
2ρµ.
We are interested in the expected value with respect toP
(ω)
, probability distribution on the set of all possible treesT. In such a case,E
(τ) =
2E(ρµ).
It was shown in [1] thatE
(τ) ≤
2E(ρ)
E(µ)
. It can easily be seen that this inequality can not be strengthened to equality.We first note that for positive nondegenerate random variableX, the functionf
(
X) =
1X is strictly convex and hence E
(
1/
X)
1/
E(
X).
Consider now a treeT of height 1 rooted atrwhich has random degreed0. Thenm=
d0andρ
rs=
1/
d0. Hence,E(τ) =
2E(ρ
m) =
2. On the other hand, 2E(ρ)
E(
m)
2.
Now we seek for asymptotic equality forE
(τ)
. LetSibe the sphere of radiusiand centered atr; that is the set of vertices at distanceifromr. Letρ
i= ρ (
Si−1,
Si)
,µ
i= µ (
Si−1,
Si)
, andE(τ)
is the commute time between the root and the sphere of radiusnshorted in one vertex. ThenE
(τ) =
2E(ρµ) =
2E
n
i=1
µ
i
n
j=1
ρ
j
=
2E
n
i=1 n
j=1
µ
iρ
j
=
2E
n
i=1 n
j=1 j̸=i
µ
iρ
j
+
2n,
(2.2)where the last step uses the fact that
ρ
i=
1/µ
i. Belowd+j will denote the outdegree of statej; that isd+j=
dj−
1. Now we concentrate on the double sum.In
+
Jn:=
E
n
i=2 i−1
j=1
µ
iρ
j +
E
n−1
i=1 n
j=i+1
µ
iρ
j .
Now,In
=
E
n
i=2 i−1
j=1
µ
iρ
j
=
E
n−1
j=1
ρ
j
n
i=j+1
µ
i
=
E
n−1
j=1
ρ
jµ
Sj,
Sn
=
n−1
j=1
E
ρ
jµ
Sj,
Sn
=
n−1
j=1
∞
k=1
E
ρ
jµ
Sj,
Sn | µ
j=
k
P
µ
j=
k
=
∞
k=1 n−1
j=1
E
1k
(µ
j+1+ µ
j+2+ · · · + µ
n) | µ
j=
k
P
µ
j=
k
=
∞
k=1 n−1
j=1
E
1kk
(
d+j+
d+j d+j+1+ · · · +
d+jd+j+1· · ·
d+n−1)
P
µ
j=
k
=
n−1
j=1 n−1
x=j
E
Πix=jd+i
=
n−1
x=1 x
j=1
E
Πix=jd+i
.
On the other hand, Jn
=
E
n−1
i=1 n
j=i+1
µ
iρ
j
=
∞
k=1
E
n−1
i=1 n
j=i+1
µ
iρ
j| µ
i=
k
P
(µ
i=
k)
=
∞
k=1
E
n−1
i=1
k
(ρ
i+1+ ρ
i+2+ · · · + ρ
n) | µ
i=
k
P
(µ
i=
k)
=
E
n−1
i=1
k1 k
1 d+i+
1d+i d+i+1
+ · · · +
1 d+i d+i+1· · ·
d+n−1
=
n−1
i=1
E
1 d+i+
1d+i d+i+1
+ · · · +
1 d+i d+i+1· · ·
d+n−1
=
n−1
i=1 n−1
x=i
E
Πjx=i1 d+j
=
n−1
x=1 x
i=1
E
Πjx=i1 d+j
,
where in the third step the condition
µ
i=
kis used to calculate the resistance ofkparallel branches starting in leveli. Finally, E(τ) =
2n+
2n−1
x=1 x
j=1
E
Πix=jd+i
+
E
Πix=j1 d+i
.
(2.3)3. Spherically symmetric trees
In this section we use
τ
rsto denote the commute time between the rootrof aSSRTΓand the levelnshorted in one node s, whileτ
rxndenotes the commute time betweenrand a leafxnof leveln. We assign a unit resistance to every edge ofΓ. We also used+n to refer the outdegree of each node of levelnandZnto refer to the number of vertices in the leveln. Then Zn=
Πkn=−01dk. We will assume thatd+n’s are independent random variables. We need the following lemma from [12].Lemma 1. Consider two nonnegative sequences anand bnsuch that
nbnis divergent. Iflimnabn
n
=
L, thenlimnn k=1ak
n
k=1bk
=
L.Notation 1. We use an
=
Θ(
bn)
forlimnanbn
ϵ (
0, ∞ )
.The following theorem strengthen Theorem 1 of [1] that gives only an upper bound for the commute time.
Theorem 1. Consider a spherically symmetric random treeΓ such that d+n
=
1 with probab.1−
qn 2 with probab. qn and
qn
< ∞ .
Thenτ
rs=
Θ
n2
P-a.s.and
τ
rxn=
Θ
n2
P-a.s.
.
Proof. Applying the Borel Cantelli Lemma to the infinite outdegree sequence
{
d+n}
shows thatp(
dn=
1 eventually) =
1.As such, there isNsuch thatp
(
dn=
1,
n≥
N) =
1. Thenµ
rs=
n
k=1
Zk
−
1=
N
k=1
Zk
+
n
k=N+1
Zk
−
1=
Θ(
n)
a.s.whereZk
= |
Sk|
. Similarly,ρ
rs=
Θ(
n)
a.s.Therefore,
µ
rsρ
rs=
Θ
n2
a.s.and the result follows from Eq.(2.1).
The following lemma is presented in [7, p. 63].
Lemma 2. For any c
>
0, Πjn=1
1+
cj
∼
nc.
The following lemma is presented in [12, p. 66].
Lemma 3. For0
< α <
1,n
k=1
1 kα
∼
n1−α 1
− α .
Theorem 2.Consider a SSRTΓ such that d+n
=
1 with probab.1−
qn 2 with probab. qn where qn=
min(
1,
c/
n) ,
c>
0. ThenE
(τ
rs) =
Θ
n2logn
if c
=
1,
E(τ
rs) =
Θ
n2
if c
<
1 E(τ
rs) =
Θ
nc+1
if c
>
1.
Moreover, for any c>
0,
E
τ
rxn =
Θ
nc+2
.
Proof. We first note that as long as theΘ-asymptotic behavior of the commute time is our concern and sinceInis greater thanJn, it is enough to calculateIn. It follows forc
=
1 thatE
Πix=jd+i
=
Πix=j
1+
1i
=
x+
1 j.
Then,In
=
n−1
x=1 x
j=1
x
+
1 j=
n−1
x=1
(
x+
1) α
xlogx; α
j−→
1∼
n−1
x=1
(
x+
1)
logx=
Θ
n2logn
,
where the last equality follows from the proof of Theorem 11 of [1]. Let us recall that inJnthe product of expected values is
x+12
j while inInx+j1. It follows that Jn
=
In−
n−1
x=1 x
j=1
1 2j
which means that In
−
Jn∼
n−1
x=1
1 2logx
.
But
n−1x=1 1
2logx
=
o
n2logn
and since limnn2Ilogn n
∈ (
0, ∞ )
we have that limnnI2n+logJnn=
limn 2Inn2logn
∈ (
0, ∞ ) .
We consider now the casec<
1.
It follows fromLemma 2thatΠix=j
1+
ci
=
xc
α
xjc
α
j; α
x−→ α ∈ (
0, ∞ ) ,
we seeIn
=
n−1
x=1 x
j=1
xc
α
xjc
α
j=
n−1
x=1
xc
α
x x
j=1
1
jc
α
j.
(3.1)It follows fromLemmas 1and3that In
∼
n−1
x=1
α
xxcλ
xx−c+1
−
c+
1; λ
x→ λ ∈ (
0, ∞ )
=
Θ
n2
.
While forc>
1,
x
j=1
1 jc
α
j=
Θ(
1)
and then, from Eq.(3.1)andLemma 1, In
=
n−1
x=1
xc
α
x x
j=1
1 jc
α
j∼
n−1
x=1
xc
α
x∼
n−1
x=1
xc
.
Hence,In
=
Θ
nc+1
.
The result forE
(τ
rxn)
follows from the fact thatρ
rxn=
nand applyingLemma 2givesn
k=1
E
(
Zk) =
n
k=1
Πik=1
1+
ci
∼
n
k=1
kc
=
Θ
nc+1
.
The following lemma is analogous to Theorem 3, p. 64 of [12].
Lemma 4. Consider a positive decreasing function f and define a sequence ak, k
=
1,
2, . . .
such that f(
t) =
at. Let In=
n1 f
(
t)
dt andSn=
nk=1ak. If limnIn
= ∞
then limnSn
In
=
1.
Proof. Sincefis decreasing, then forj
=
2,
3, . . .
j+1 jf
(
x)
dx≤
aj≤
j j−1f
(
x)
dx.
By summing overj, we obtain
n+1 2f
(
x)
dx≤
Sn−
a1≤
n 1f
(
x)
dx.
That is,In+1
−
I2≤
Sn−
a1≤
In.
(3.2)It follows also that
n+1 nf
(
x)
dx≤
2 1f
(
x)
dx≤
C.
As such,lim
n
n+1 n f(
x)
dxIn
=
0,
which implies thatlimn
In+1 In
=
limn
1+
n+1 n f(
x)
dxIn
=
1,
and the result follows from Eq.(3.2).Theorem 3.Consider a SSRTΓ such that for0
< α <
1 d+n=
1 with probab.1
−
1 nα 2 with probab. 1nα
.
Then for any
ϵ <
1−1α,
there exists N such that for n≥
N, the following inequalities hold(
i)
Θ
nα+1exp
1 1− α − ϵ
n1−α
≤
E τ
rxn ≤
Θ
nα+1exp
1 1− α + ϵ
n1−α
, (
ii)
Θ
nαexp
1 1− α − ϵ
n1−α
≤
E(τ
rs) ≤
Θ
nαexp
1 1− α + ϵ
n1−α
.
Remark 1. The case
α >
1 is covered byTheorem 1and the caseα =
1 is covered byTheorem 2.Proof. LetSn
=
logE(
Zn) =
n−1 k=0log
1
+
1kα
.
We first show that limnSn
n1−α
=
11
− α .
(3.3)Since In
=
n 1log
1+
1xα
dx
=
nlog
1+
1nα
−
log 2+ α
n 11
1
+
xαdx,
(3.4)and lim
n
nlog
1+
1nα
n1−α=
1,
and alsolimn
n 11 1+xαdx n1−α
=
limn 1 1+nα
(
1− α)
n−α=
1 1− α ,
then, from(3.4),limn
In
n1−α
=
1+ α
1
− α =
1 1− α .
It follows fromLemma 4that lim
n
Sn
n1−α
=
limn
Sn
In
·
In n1−α=
limn
In
n1−α
=
1 1− α .
As such,E
(
Zn) =
exp γ
nn1−α
; γ
n→
11
− α .
(3.5)That is, for arbitrary small
ϵ >
0, there is a sufficiently largeNsuch that forn≥
N,
exp
1 1− α − ϵ
n1−α
≤
E(
Zn) ≤
exp
1 1− α + ϵ
n1−α
,
(3.6)and
n
k=N
exp
1 1− α − ϵ
k1−α
≤
n
k=N
E
(
Zk) ≤
n
k=N
exp
1 1− α + ϵ
k1−α .
Since,lim
n
nNexp
11−α
+ ϵ
x1−α
dx nαexp
11−α
+ ϵ
n1−α
=
1 1+ ϵ (
1− α) ,
and
nNexp
11−α
− ϵ
x1−α
dx nαexp
11−α
− ϵ
n1−α
=
1 1− ϵ (
1− α) ,
thenΘ
nαexp
1 1− α − ϵ
n1−α
≤
n
k=N
E
(
Zk) ≤
Θ
nαexp
1 1− α + ϵ
n1−α
and the result forE
τ
rxn
follows sinceρ
rxn=
n.
ForE
(τ
rs)
, we follow the same argument of computingInas in the proof ofTheorem 2. From Eq.(3.5), we see that E(
Πix=jd+i) =
Πix=j
1+
1iα
=
exp
γ
xx1−α− γ
jj1−α
; γ
x→
1 1− α .
As such,x
j=1
E
(
Πix=jd+i) =
x
j=1
exp
γ
x(
x1−α) − γ
j(
j1−α)
=
expγ
x
x1−α
x
j=1
exp
− γ
j(
j1−α) .
Using the fact that the two series
anand
2νa2ν have the same convergence behavior, we can see that
x
j=1
exp
− γ
j(
j1−α)
=
Θ(
1)
and hence,x
j=1
E
(
Πix=jd+i) ∼
exp(γ
x(
x1−α)).
It follows then that for arbitrary small
ϵ >
0, and sufficiently largeN, exp
1 1− α − ϵ
x1−α
≤
x
j=1
E
(
Πix=jd+i) ≤
exp
1 1− α + ϵ
x1−α
;
x≥
Nn
x=N
exp
1 1− α − ϵ
x1−α
≤
n−1
x=N x
j=1
E
(
Πix=jd+i) ≤
n
x=N
exp
1 1− α + ϵ
x1−α
.
The same argument of part (i) shows that Θ
nαexp
1 1− α − ϵ
n1−α
≤
In≤
Θ
nαexp
1 1− α + ϵ
n1−α
,
and this proves part (ii).References
[1] F. Al-Awadhi, M. Konsowa, Z. Najeh, Commute times and the effective resistances of random trees, Probability in the Engineering and Informational Sciences 23 (4) (2009) 649–660.
[2] D. Aldous, Random walk covering of some special trees, Journal of Mathematical Analysis and Applications 157 (1991) 271–283.
[3] D. Aldous, J. Fill, Reversible Markov chains and random walks on graphs, in: Monograph. Available at:
http://www.stat.berkeley.edu/~aldous/RWG/book.html(in preparation).
[4] R. ALeliunas, R. Karp, R. Lipton, L. Lovasz, C. Rackoff, Random walks, universal traversal sequences, and complexity of maze problem, in: 20th Annual Symposium on Foundations of Computer Science, 1979, pp. 218–223.
[5] A. Chandra, P. Raghavan, W. Ruzzo, R. Smolensky, P. Tiwari, The electrical resistance of a graph captures its commute and cover times, in: Proceedings of the 21st Annual Symposium on Theory of Computing, ACM Association for Computing Machinery, New York, 1989, pp. 574–586.
[6] C. Cooper, A. Frieze, The cover time of random geometric graphs, SODA (2009) 48–57.
[7] W. Feller, An Introduction to Probability Theory and Its Applications, Vol. I, second ed., John Wiley & Sons, New York, 1968.
[8] A. Fouss, J. Pirotte, M. Renders, M. Saerens, A novel way of computing dissimilarities between nodes of a graph with application to collaborative filtering and subspace projection of the graph nodes, Technical Report IAG WP 06/08, Universite cathlolique de Louvain, 2006.
[9] S. Guattery, Graph embeddings, symmetric real matrices, and generalized inverses, Technical Report, Institute for Computer Applications in Science and Engineering, NASA Langley Research Center, 1998.
[10] J. Ham, D.D. Lee, S. Mika, B. Scholkopf, A kernel view of the dimensionality reduction of manifolds, in: International Conference on Machine Learning, ACM, 2004, pp. 369–376.
[11] M. Herbster, M. Pontil, Prediction on a graph with a perceptron, in: Bernhard Scholkopf, John Platt, Thomas Hoffman (Eds.), New Information Processing Systems, 2006, pp. 577–584.
[12] K. Knopp, Infinite Sequences and Series, Dover Publications, INC., New York, 1956.
[13] Liben-Nowell, J. Kleinberg, The link prediction problem for social networks, in: Proceedings of the 2003 ACM CIKM International Conference of Information and Knowledge Management, CIKM-03, 2003, pp. 556–559.
[14] L. Lovasz, Random walks on graphs: a survey, combinatorics, Bolyai Society Mathematical Studies (1993) 353–397.
[15] U. Luxburg, A. Radl, M. Hein, Hitting and commute times in large graphs are often misleading,arXiv:1003.1266v2[CS.ds], 2011.
[16] H. Qiu, E. Hancock, Graph embedding using commute time, Structural, Syntactic, and Statistical Pattern Recognition (2006) 441–449.
[17] H. Qiu, E.R. Hancock, Image segmentation using commute times, in: Proceedings of the 16th British Machine Vision Conference, 2005, pp. 929–938.
[18] M. Saerens, F. Fouss, L. Yen, P. Dupont, The principal components analysis of a graph and its relationship to spectral clustering, in: Proceedings of the 15th European Conference on Machine Learning, Springer, Berlin, 2004, pp. 371–383.
[19] P. Sarkar, A. Moore, A. Prakash, Fast incremental proximity search in large graphs, in: Proceedings of the 25th International Conference of Machine Learning, 2008, pp. 896–903.
[20] D. Spielman, N. Srivastava, Graph sparsification by effective resistances, in: Proceedings of the 40th Annual Symposium on Theory of Computing, 2008, pp. 563–568.
[21] P. Tetali, Random walks and effective resistances of networks, Journal of Theoretical Probability 4 (1) (1991) 101–109.
[22] D.M. Wittmann, D. Schmidl, F. Blochl, F.J. Theis, Reconstruction of graphs based on random walks, Theoretical Computer Science 410 (38–40) (2009) 3826–3838.
[23] L. Yen, D. Vanvyve, F. Wouters, F. Fouss, M. Verleysen, M. Saerens, Proceedings of the 13th Annual Symposium on Artificial Neural Networks, 2005, pp. 317–324.
[24] D. Zhou, B. Scholkopf, Learning from labeled and unlabeled data using random walks, Pattern Recognition, in: Proceedings of the 26th DAGM Symposium, Berlin, Germany, 2004, pp. 237–244.