topologies On the combinatorial design of data centre network Journal of Computer and System Sciences

(1)

Contents lists available atScienceDirect

Journal of Computer and System Sciences

www.elsevier.com/locate/jcss

On the combinatorial design of data centre network topologies ^✩ ^, ^✩✩

Iain A. Stewart

SchoolofEngineeringandComputingSciences,DurhamUniversity,ScienceLabs,SouthRoad,DurhamDH13LE,UK

a rt i c l e i n f o a b s t ra c t

Articlehistory:

Received28April2016

Receivedinrevisedform 26January2017 Accepted29May2017

Availableonline13June2017

Keywords:

Datacentrenetworks

Switch-centricdatacentrenetworks Fat-Trees

Combinatorialdesigns Bipartitegraphs Pathdiversity

Thetheoryofcombinatorialdesignshasrecentlybeenusedinordertobuildswitch-centric data centre networks incorporating alarge number ofservers, in comparison with the popularFat-Treedatacentrenetwork.Weclarifyandextendtheseresultsandprovethat inthesedatacentrenetworks:therearepairwiselink-disjointpathsjoiningalltheservers adjacenttosomeswitchwithalltheserversadjacenttoanyotherswitch;and thereare pairwiselink-disjointpathsfromalltheserversadjacenttosomeswitchtoanyidentically- sizedcollectionoftargetservers wherethesetargetserversneednot beadjacenttothe sameswitch. In bothcases,wealways control the pathlengths.Our constructionsand analysisareundertakenonbipartitegraphswiththeapplicationstodatacentrenetworks being easily derived. Our results show the potential of the application of results and methodologiesfromcombinatoricstodatacentrenetworkdesign.

©²⁰¹⁷^TheÂuthor(s).^Published^byÊlsevierÎnc.^Thisîsânôpenâccessârticleûnder^the CCBYlicense(http://creativecommons.org/licenses/by/4.0/).

1. Introduction

1.1. Thedatacentrenetworkcontext

Datacentresareexpandingbothintermsoftheirsizeandtheirimportanceascomputationalplatformsforcloudcom- puting, websearch, social networking,andso on.There isan increasing demandthat datacentres incorporatemoreand moreserversbutsothatoverallcomputationaleﬃciencyisnotcompromisedthroughexcessivetraﬃc.A keyfactorastothe eventualperformanceofadatacentreisthedatacentrenetwork(DCN);thatis,theinterconnectionfabricoftheserversand switcheswithinthedatacentre.Aswestrivetoincorporatemoreandmoreservers,newtopologiesarebeingdevelopedso astocopewiththeincreaseinscaleandbestutilizetheadditionalcomputationalpower.Itiswithtopologicalaspects of DCNsthatweareconcernedinthispaper.

ThetraditionaldesignofaDCNisswitch-centricsothattheroutingintelligenceresidesamongsttheswitches,withthe serversbehavingonlyascomputationalnodes.Inswitch-centricDCNs,therearenodirectserver-to-serverlinks;onlyserver- to-switch andswitch-to-switchlinks.Switch-centric DCNsaretraditionallytree-likewithserverslocatedatthe‘leaves’ of the tree-likestructure.Examples includeElasticTree [1],VL2 [2], HyperX[3], Portland[4],andFlattenedButterﬂy[5],al- thoughthedominatingswitch-centricDCNisFat-Tree[6].Whilst itisgenerallyacknowledgedthattree-like,switch-centric

✩ ThisworkwassupportedbytheUKEngineeringandPhysicalSciencesResearchCouncil(EPSRC)grant‘InterconnectionNetworks:Practiceuniteswith Theory(INPUT)’[grantnumberEP/K015680/1].

✩✩ ApreliminaryversionofthispaperappearedasanextendedabstractintheProceedingsof20thInternationalSymposiumonFundamentalsofComputation Theory(A.Kosowski,I.Walukiewicz,eds.),Gdansk,Poland,August17–192015,LectureNotesinComputerScience,Volume9210,Springer,2015,283–295.

E-mailaddress:i.a.stewart@durham.ac.uk.

http://dx.doi.org/10.1016/j.jcss.2017.05.015

0022-0000/©²⁰¹⁷^TheÂuthor(s).^Published^byÊlsevierÎnc.^Thisîsânôpenâccessârticleûnder^the^CC^BY^license (http://creativecommons.org/licenses/by/4.0/).

(2)

concernedhere.

Itisextremelydifficultto designcomputationallyefficient(switch-centric)DCNssoastoincorporatelargenumbersof serversastherearemanyadditionalconsiderationstotake intoaccount.Forexample,switchesand(especially)servers in data centreshave a limitednumber ofportswith a consequencebeingthat the more servers there are,the greater the averageorworst-caselink-countbetweentwodistinctservers;hence,thereisapacketlatencyoverheadtobeborne.Also, so astobetter support routing,fault-tolerance, andload-balancing, we wouldprefer that thereare numerousalternative pathswithintheDCNjoininganytwodistinctservers;thatis,thatthereispathdiversity.IrrespectiveoftheDCNparadigm within whichone works,there aremanyother designparameters tobear inmindrelatingto,forexample,(incremental) scalability, throughput, cost, oversubscription, energy consumption, latency, and security (see, for example, [8,9] for an overview).TheupshotisthattheDCNdesignerhastosimultaneouslysecureanumberofperformancecharacteristics,some ofwhicharecompetingagainsteachother;thismakestheDCNdesignspacecomplexanddifficulttoworkin.

1.2. UsingcombinatorialdesignstobuildDCNs

Arecentproposalin[10]advocatedtheuseofcombinatorialdesigntheoryinordertodesignswitch-centricDCNs;these DCNshavebeneﬁcialpropertiesasregardsincorporatingmoreserversandpossessingpathdiversityyetitispossibletolimit theworst-caselink-lengthofserver-to-servershortestpaths(andso,ultimately,achieve bettercontroloverpacketlatency inaDCN).Theuseofcombinatorialdesignswithinthestudyofgeneralinterconnectionnetworksisnotnewandoriginated in[11]wherethetargetednetworksinvolvedprocessorscommunicatingviabuses(thereaderisreferredto[12]forarange ofapplications ofcombinatorialdesigntheory within computer science).A hypergraph frameworkwas developed in[11]

where the hypergraph nodes representthe processors andthe hyperedges the buses. Likewise, an analogousframework was developed in [10] butwhere the hypergraph nodes andedges both represent switches so that the pendantservers

‘hangoff’someoftheswitches(wepresentadetaileddescriptionofthisframeworkinSection3.3).In[10],theubiquitous switch-centricFat-TreeDCNfrom[6]wasusedasayardstickagainstwhichtocomparethenewDCNdesignsdevelopedin [10]underthenormalizationthatallDCNsaretohavethesameworst-caselink-lengthofserver-to-servershortestpaths, namely 6,asthis equals theworst-case link-lengthofserver-to-server shortestpaths in the Fat-Tree DCN. Itwas shown thatmoreserverscanbeincorporatedwithinthenewDCNsyet,crucially,theresultingDCNshavegoodpathdiversity.Itis thealgebraicproperties(relatingtosymmetryandbalance)possessedbytransversaldesignsthatenable theconstructions andanalysisasdescribed in[10].One slightdiﬃculty withtheoriginalandnovelapproach takenin[10] isthat some of thepathdiversityresultsderivedthereareincorrect(asweexplainlaterinSection4.1).Notonlyhascombinatorialdesign theoryfeaturedasregardsthedesignofinterconnectionnetworksbutotheraspects ofalgebrahavetoo;indeed,therehas beenrecentworkontherelevanceofCayleygraphs,Hamminggraphs,andhyperbolicitytoDCNdesign(see,e.g.,[13–15]).

1.3. Ourcontribution

Inthispaperwe returntotheframework of[10]andformulateandprove pathdiversityresultsfortheswitch-centric DCNsconstructedusingthemethodsofthat paper.AsourconcernisentirelywithtopologicalpropertiesofDCNs,henceforth we abstract our DCNsas undirectedgraphs where thenodes are to representservers and switches andthe edges point-to-pointlinks.Thecruxoftheconstructionin[10]is(essentially)tobuildabipartitegraphusingasystematicmethod, calledthe3-stepmethod,involvinga different‘base’bipartite graphandatransversaldesign,andtoconverttheresulting bipartite graphintoswitch-centricDCNs(in avariety ofways).Afterexplaininghow hypergraphsandtransversaldesigns canall beconsideredasbipartitegraphs inSection2,inSection3we provideadetaileddescriptionofthe3-stepframe- workfrom[10]andexplainhowthebipartitegraphsconstructedareconvertedintoswitch-centricDCNs.Next,werevisit theresultsfrom[10].Inparticular,inSection4wecorrectandextendtheanalysisin[10]andaﬃrmthatusingthe3-step methodfrom[10],we canbuildswitch-centricDCNs:withmanymoreservers thantheFat-TreeDCNyetsothat,likethe Fat-Tree,everyserver-to-servershortestpathhaslengthatmost6;andsothat(assumingsomenumericconditionsonthe basebipartitegraphandthetransversaldesign)wecanﬁndpairwiselink-disjointpathsfromalloftheserversadjacentto aparticularswitchtoalloftheserversadjacenttoanyotherswitch.Moreover,weprovideanupperboundonthelengths ofthe pathsconstructed intermsofthe diameterofthebasebipartite graph (seeTheorem 4).We alsodeal withasce- nariomissingfrom[10](seepart(b)ofTheorem 4).Asweexplain,thegeneralsituationismoresubtlethanwas assumed in[10].

The DCNpath diversity,aswe havedescribed itabove,comes aboutfrombuildingbipartite graphs(which aresubse- quentlyconvertedtoDCNs)sothatgivenanytwodistinctnodes,therearenumerousnode-disjointpathsjoiningthesetwo

(3)

nodes;that is,thesebipartite graphshaveone-to-one pathdiversity.InSection 5,wegoontoshowthat wecanactually build numerousedge-disjointpathsfroma sourcenodeto differentdestinationnodesinourbipartitegraphs; thatis,we haveone-to-manypathdiversity(one-to-oneandone-to-manypathdiversityaredeﬁnedinSection2.1).TheDCNsobtained fromthesebipartitegraphsaresuchthat(assumingsomenumericconditionsonthebasebipartitegraphandthetransver- saldesign)wecanﬁndpairwise link-disjointpathsfromalloftheserversadjacenttosomeswitchtoanyidentically-sized collection of servers (irrespective of which switch they are adjacent to). Consequently, we show that our DCNs provide supportforadditionalcommunicationpatternsthatareprevalentwithindatacentrenetworks.Itshouldbenotedthatone- to-manyandmany-to-manycommunicationpatternsarecommonplaceindatacentres;forexample,in‘bigdata’processing applicationssuchasMapReduce,Hadoop,Spark,andStorm(see,e.g.,thesurvey[16]).

Thispaperisunashamedlytheoretical.However,wedemonstratethatnotonlyisthereinterestingcombinatoricswithin thepractical worldofDCNdesignbutthatcombinatorialmathematicscanpotentiallycontributeto theDCNdesignspace on apractical level.Wefeel thatthemathematical aspectsofDCNshaveso farremainedalmostcompletely unexamined andweadvocateaclosertheoreticalscrutinyofDCNsbothasamodelofcomputationandinrelationtothevastswathesof researchongeneralinterconnectionnetworks.Wementionsomepracticalconsiderationsanddirectionsforfurtherresearch intheConclusion.

2. Basicconcepts

We begin by brieﬂy reviewing some architectural aspects of switch-centric DCNs that are pertinent to our subsequent research. We then move on to the discrete structures featuring in [10,11], namely hypergraphs, bipartite graphs, andtransversaldesigns.So thatwe mightfullydescribe andunderstandthe constructionsin[10,11],aswellasourown upcominganalysisofswitch-centricDCNs,weeventuallyamalgamatehypergraphs,bipartitegraphs,andtransversaldesigns so that bythe endofthissection,we willhave developedan encompassingbipartite graphframework forthedesign of switch-centricDCNs.ThereadershouldbeawarethatitwillnotbeuntilSection 3.3thatwetransformthebipartitegraphs that we have beenconstructing up untilthen intoswitch-centric DCNs. Asa hintasto thistransformation (andso that thereaderdoesnotlosesightofoureventualgoal),roughlyspeakingweshallregard thenodesofoneofourconstructed bipartite graphs as switch-nodes and attach to some of these switch-nodes additional server-nodes in order to get our switch-centricDCN.Generalgraph-theoreticconceptscanbeobtainedin[17].

2.1. Switch-centricDCNs

Aswitch-centricDCNisabstractedasagraph(whichwealsorefertoasaDCN)wherethenodesarepartitionedintotwo sets: thereareserver-nodes;andthereareswitch-nodes.Ofcourse,theserver-nodescorrespond toserversintheDCNand theswitch-nodestoswitches;notethatimmediatelytherearepracticaldesignlimitationsimposedbythenumberofports inarealswitchandthenumberofNICportsinarealserver(wesometimesrefertothenumberofportsofaswitch-node ratherthanitsdegree). Furthermore,inswitch-centricDCNstherearenolinksjoiningoneserver-nodedirectlytoanother server-node(becauseallroutingwithinaswitch-centricDCNfallswithinthepurviewoftheswitches).Ofconcerntousin thispaperwillbe incorporatingacomparativelylargenumberofserver-nodeswithinourDCNsbutsothatthemaximum lengthofashortestpathjoininganytwoserver-nodes,thatis,thediameteroftheDCN,iskeptwithinagivenbound,where the lengthofsuch a pathis thenumberofdistinct linkson thepath.Essentially, wewill becomparing DCNsasto how manyserver-nodestheyincorporatebutwhentheirdiametersarenormalized.

However,DCNsmustalsopossessotherpropertiestomakethemusablewithinadatacentrecontext.Forexample,they alsoneed to:bescalableandincrementallyscalable(thatis,havethecapacityto copewithincreasesincomponentsand data); havelowmessagelatency;provideforhighoverallthroughput(underarangeoftrafficpatterns);beabletotolerate (alimitednumberof)faults;beenergyefficient;bebotheconomicallyandphysicallyviable;andsupportvirtualization(that is, thepartitioningof theDCNinto virtual networksona dynamic basis),amongst manyother things. Supportforsome ofthesepropertiescanbe measuredusinggraphtheory;forexample,thediameteroftheDCNgivesguidanceasregards theexpectedmessagelatency.Ofparticularinteresttouswillbepathdiversitywhichwe define(somewhatinformally)as thecapacitytosenddatawithoutinducingadditionalcongestionorsoastocopewithexistingcongestionorfaults.There aretwocontexts ofinteresttous:theone-to-one(orunicast)context,whenasourceserver-nodewishestosenddatatoa destination server-nodeby theutilizationofindependentpaths (wewillreturntowhat wemeanby‘independent’soon);

and the one-to-many (or multicast) context, when a source server-node wishes to send datato a number of destination server-nodessothat thedifferenttransmissionsdonotinducecongestion.Path diversityishighly relevanttoanumberof the above propertiessuch aslatency andscalability,wheredifferent paths areused to splitandbalance loads,andfault tolerance, where differentpaths provide alternative means of transitin the caseof faults. Path diversity isimportant in boththeone-to-one andone-to-manycontexts,withthisimportanceaccentuatedinthelattercontextwhenadatacentre needs tosupportdatareplicationandapplicationslikeMapReduce[18].Anadditionaldimensionisaddedwithrespectto virtualization when we havevirtual machinesembedded within adata centrethat sharethe sameresources butrequire traﬃc to be routed via different routes. As we shall soon see, just as with latency, the independence of paths can be consideredgraph-theoretically.

(4)

numberofhyperedgescontainingitandtherankofahyperedgeisitssizeasasubsetofV.A hypergraphisregular(resp.

uniform)ifeverynodehasthesamedegree(resp.everyhyperedgehasthesamerank)withthisdegree(resp.rank)being thedegree(resp.rank)ofthehypergraph.EverygraphG=

(

V

,

E

)

hasa naturalrepresentationasahypergraph:thenodes ofthehypergraphare V;andthehyperedgesareE,wherethehyperedgeeconsistsofthepairofnodesincidentwiththe edgeeofG.

2.3. Hypergraphsandbipartitegraphs

WecanrepresentanyhypergraphH=

(

V

,

E

)

asabipartitegraph:thenodesetofthebipartitegraphisV∪^E;^and^there isanedge

(

v

,

e

)

,forv∈^V ândê∈Ê,ⁱⁿ^the^bipartite^graphîf,ândônlyîf,^v∈êⁱⁿ^thehypergraph.Itisclearthatthisyields aone-to-onecorrespondencebetweenhypergraphsandbipartitegraphs(withoutisolatednodes)thatcomecompletewith apartitionoftheelementsintoa‘left-handside’,whichwillcorrespondtothenodesofthehypergraph,anda‘right-hand side’,whichwillcorrespondtothehyperedgesofthehypergraph(rememberthatinahypergraph,everynodeisinatleast onehyperedgeandeveryhyperedgecontains atleastone node,sowecannot haveisolatednodesinourbipartitegraphs).

Weassume (henceforth)that everybipartitegraphcomesequippedwithsuch apartitionandforclarityfromnowonwe refertothenodesontheleft-handsideasnodesandthenodesontheright-handsideasblocks(thisisinkeepingwithour upcomingrealisationoftransversaldesignsasbipartitegraphs).Likewise,werefertothedegreeofanodeasitsdegreeand thedegree ofa blockasits rank.A bipartitegraphcorresponding to aregular, uniformhypergraph ofdegreed andrank

is calleda

(

d

, )

-bipartitegraph.Every bipartite graph(andso every hypergraph)alsodescribesits dualbipartitegraph (oralternativelyitsdualhypergraph)wheretherolesofthenodesontheleft-handsideandtheblocksontheright-hand side of the partition are reversed in thedeﬁnition of thebipartite graph;so, for example,the dual bipartite graph ofa

(

d

, )

-bipartitegraphisregularofdegree

anduniformofrankd.

Notethatif G isabipartite graphthenitcorresponds toahypergraph viaour representationabove andit alsocorre- spondstoahypergraphviathenaturalrepresentationhighlightedinSection2.2.Thetwohypergraphscorrespondingtothe samebipartite grapharedifferentandweare neverinterested intherepresentationofabipartite graphasahypergraph viathenaturalrepresentationofSection2.2.

2.4. Pathsinhypergraphs

Apathinsome hypergraph H=

(

V

,

E

)

(orthecorresponding bipartite graph)isan alternating sequenceofnodesand hyperedges sothat all nodesare distinct,all hyperedges are distinct,anda node v∈^V ^followsôr^{precedes a} ^hyperedge e∈Ê ⁱⁿ^the^sequenceônlyîf ^v∈ê ⁱⁿ^the^hypergraph ^(or

(

v

,

e

)

isan edgeinthecorrespondingbipartite graph).Thefirst elementofsomepathisthesourceandthefinalelementthedestination.Thelengthofanypathisitslengthinthebipartite graphcorrespondingtothehypergraph,andthedistancebetweentwodistinctelementsofV∪Ê îs^the^lengthôfâ^shortest pathjoiningthesetwoelementsinthecorresponding bipartitegraph.Thediameter ofH isthemaximumofthedistances betweeneverypairofdistinctnodesofV,andtheline-diameterofH isthemaximumofthedistancesbetweeneverypair ofdistincthyperedgesofE.

Wehavetworemarks.First,wehavetraditionalnotionsofdiameterandline-diameterinanybipartitegraph.Notethat ournotionofdiameterinabipartitegraph,whichisthelongestshortestnode-to-nodepath(andsoignoresnode-to-block and block-to-block paths),is different fromthe usual graph-theoretic notionof diameter in a bipartite graph (the same commentcanbe madeasregardsline-diameter). Whenwetalkofthediameterorline-diameter ofa bipartitegraph,we meanwithrespecttoournotionofdiameterorline-diameter,respectively;ifweneedto talkofthetraditionalnotionof graphdiameterthenwe willmake thisclear.Second, ournotionofpathlength inahypergraph differsfromthat in[10]

wherethelengthisthenumberofnodes(resp.hyperedges) inahyperedge-to-hyperedge(resp.node-to-node)path.There isno realconsequenceto thisdifference; essentially, ournotionofpath lengthis doublethat in[10]. However, weshall soonmovetoanexclusivelybipartitegraph-theoreticformulationinwhichournotionoflengthisthenaturalonetoadopt.

Weshallbeinterested inbuildingsets ofpaths insomehypergraph H sothatthepathsmighthavethesamesources ordestinations;moreover,weshallrequirethatthesepathsdonot‘interfere’withoneanother(orare‘independent’aswe mentionedearlier).Wesaythataset P ofpathsinH is:

• ^pairwiseinternally-disjointifanysourceordestinationofsomepathofP onlyappearsasasourceordestinationonany pathof P,andanynodeorhyperedgethatisnotasourceordestinationappearsonatmostonepathof P

(5)

•^pairwiseedge-disjointifeverypair

(

v

,

e

)

∈^V×Ê îs^such^that ^v^followsôr^precedesêôn^some^pathât^mostônceâcross allpathsfromthesetP.

2.5. Hypergraphsasswitch-centricDCNs

Given some hypergraph H=

(

V

,

E

)

, our intention is to ultimately transform this hypergraph into a DCN by consid- ering both the nodes andthe hyperedges as switch-nodes so that the switch-nodes corresponding to thenodes (which we shalllater call the level-1 switch-nodes, withthe switch-nodes corresponding to the hyperedgesthe level 2-switch- nodes) alsohave adjacentserver-nodes,which wehave yetto deﬁne(this intentionis bestappreciatedby workingwith the corresponding bipartite graph ratherthan the hypergraph; the upcoming Fig. 5provides a visualization ofwhat we mean). Consequently,we canregard ahypergraph H asmodellinga switch-centricDCN N wherethereare twolevels of switch-nodes.

Suppose that we haveaset P ofpairwise internally-disjointpathsfroma node u of H toanothernode v of H.This translatesto aset P ofpairwise internally-disjointpathsin N fromthecorresponding level-1 switch-node u tothecor- responding level-1 switch-node v. We can usethe paths of P forthe simultaneous transfer of data from server-nodes adjacent to thelevel-1 switch-node u to server-nodes adjacentto thelevel-1 switch-node v (see Fig. 5). In orderto fa- cilitatethisdatatransfer werequirethat level-1 switch-nodesarenon-blockingwhereas thelevel-2 switch-nodescanbe blocking; recallthat aswitch-nodeis non-blocking whennocontentionariseswhensimultaneously sendingdata through the switch-nodeontwodistinct inputlinksandout ontwodistinct outputlinks, andblocking otherwise.Thisisbecause we need to beable tosimultaneously move datafromall servers adjacentto the level-1 switch-nodeu in N across the switch-node andout alongdifferentlinks(the samecanbe saidfor v). Ifour pathsin H areonlypairwise edge-disjoint thenwerequirethatlevel-1 andlevel-2 switch-nodesofN arenon-blocking(aswemighthaveswitch-nodesappearingon morethanonepathofP,eventhoughnolinkdoes).

2.6. Transversaldesigns

Thenotionofatransversaldesigniscrucialtowhatfollows.

Deﬁnition1.Letk

,

≥^2. ^A[,^k]-transversaldesign T isa triple

(X ,

D,U)^where: |X|=

k; D=

(

D₁

,

D₂

, . . . ,

D

)

is a partitionofX ^into

equal-sizedgroups(eachofsizek);andU= {^Uj:^j=¹

,

2

, . . . ,

k²}îsâ^familyôf^k²^subsetsôfX^,êach ofsize

andcalledablock,sothat

• |^Di∩^Uj|=^1,^forⁱ=¹

,

2

, . . . ,

, j=¹

,

2

, . . . ,

k²

•êach^pairôfêlements{^xi

,

xj}^,^where^xi∈^Di,xj∈^Dj andi=^j,^is^containedⁱⁿ^exactly^{1 block.}

We adoptagraph-theoreticperspective ontransversaldesigns asdefinedinDefinition 1:wethinkofthe[,^k]^-trans- versaldesignT asabipartitegraphwheretheelements ofX ^(resp.U⁾ ^lieôn^the^left-hand^side ^(resp.^right-hand^side)ôf thepartition,andsoarecallednodes(resp.blocks)withinthebipartitegraph,andsothatinthisbipartitegraphthereisan edge

(

p

,

Q

)

,for p∈X ând^Q ∈U^,îf,ândônlyîf,ⁱⁿ^thetransversaldesigntheelement pisintheblock Q.Notethatthe bipartite graphcorrespondingtothetransversaldesignfromDefinition 1isa

(

k

, )

-bipartitegraph.Henceforth,weadopt our bipartite graph frameworkand regard both hypergraphsandtransversaldesigns asbipartite graphs (unless we state otherwise).

There is an intimate relationship involving transversaldesigns, orthogonalarrays andmutuallyorthogonallatinsquares, although thereisnoneedtogive deﬁnitionshere. However,itiswell known:that thereare

mutuallyorthogonal latin squaresoforderkif,andonlyif,thereisa[+2

,

k]-orthogonalarrayif,andonlyif,thereisa[+2

,

k]-transversaldesign;

andthat thereare atmostk−^{1 mutually} ôrthogonal^latin ^squaresôfôrder^k ^(see,^forêxample,^[19]).^Hence, îf^we ^have a [,^k]-transversal designthen

≤^k+^1. Âlso, îf^k îs â ^prime ^power^then â [,^k]-transversal design exists whenever 2≤

≤^k+^{1 (again,}^see^[19]).^We ^shallûse^these^facts ^laterôn.^The^studyôf^theêxistenceôf[,^k]-transversaldesigns, forvarious

andk,isalong-standingareaofresearch.

Werequireoneﬁnalbitofnotation.IfT issometransversaldesign,asinDeﬁnition 1,andxandy arenodesindistinct groupsthenwerefertotheuniqueblockadjacenttobothxandy astheblockgeneratedbyxand y.

3. The3-stepconstructionanditsextensions

We now describe the 3-stepconstructionfor buildingbipartite graphs (or, equivalently,hypergraphs) by usinga ‘base’

bipartite graphandatransversaldesign(whichwe thinkofasabipartite graph).Thisconstructionoriginatedin[11]and was used in [10]. We then explain how this construction was subsequently extended in [10] both by iteration and by compositionsoastoyieldswitch-centricDCNs.

(6)

Fig. 1.A(d, )-bipartite graphH0.

Fig. 2.A[,k]-transversal designT.

3.1. The3-stepconstruction

The3-stepconstructionproceedsasfollows.

Step 1:LetH0beaconnected

(

d

, )

-bipartitegraphsothattherearennodes(ontheleft-handsideofthepartition,each ofdegreed)andeblocks(ontheright-handside,eachofrank

).SuchanH₀canbevisualizedasinFig. 1(ordinarily,we representnodesascirclesandblocksassquares).

Step 2:LetT bea[

,

k]-transversaldesign.Inparticular,thereare

groupsofknodes(ontheleft-handside)aswell as k² blocks(ontheright-handside).Sucha T canbevisualizedasinFig. 2.BuildthebipartitegraphH asfollows.Forevery node pofH0,introduceagroup Gp ofknodesofH;wesaythatthegroupofnodesGp ofHisassociatedwiththenode p ofH0.Foreveryblock Q ofH0,adjacenttothenodesp1

,

p2

, . . . ,

pin H0,introduceacopyofT,denotedTQ,rootedon the

groupsofnodesG_p₁

,

G_p₂

, . . . ,

G_p; so,associatedwiththeblock Q ofH₀,wehaveaset B_Q ofk² blocksin H.We refertothe

groupsofnodes Gp₁

,

Gp₂

, . . . ,

Gp astherootsofthecopyTQ ofT in H.Suchabipartitegraph H canbe visualizedasinFig. 3wheretwoofthecopiesofT arepartiallyshown(notethattheymighthavesomerootsincommon buttheirrespectivesetsofblocksarealwaysdisjointasaretheirsetsofedges).ThebipartitegraphH0providesatemplate astohowweintroducecopiesofT toformH.

Notethat:

• êach^nodeôf^H ^can^beîndexedâsâp,j,wherep∈ {^pi:ⁱ=¹

,

2

, . . . ,

n}^and ^j∈ {¹

,

2

, . . . ,

k}^,^so^that ^p^is^the^node^of^H0 towhichthegroup Gp inwhichap,j sitsisassociatedand jistheindexofthenodeap,jinthisgroup

• êach^blockôf^H^can^beîndexedâs^BQ,U,whereQ ∈ {^Qi:ⁱ=¹

,

2

, . . . ,

e}^and^U∈ {¹

,

2

, . . . ,

k²}^,^so^that ^Q ^is^the^block^of H0 towhichthesetofblocksBQ inwhichBQ,U sitsisassociatedandU istheblockofT towhichBQ,U corresponds.

Inaddition,eachnodeofT canbeindexedu_i_,_j,wherei∈ {¹

,

2

, . . . , }

^and ^j∈ {¹

,

2

, . . . ,

k}^,^so^that^Diisthegroupofnodes inwhichui,j sitsand j istheindexofui,jinthatgroup.

(7)

Fig. 3.AmalgamatingH0andT to getH.

Step 3: LetH^∗ be thebipartitegraphobtainedfromthebipartite graphH by reversingtherolesofnodesandblocks(so, H^∗ isthedualbipartitegraphofH).Notethatthebipartitegraph H^∗ isregularofdegree

anduniformofrankdk.

Werefertothe

(

dk

, )

-bipartitegraphH (resp.the

(,

dk

)

-bipartitegraph H^∗) constructedaboveashavingbeencon- structed bythe2-step(resp.3-step)methodusingthe

(

d

, )

-bipartitegraph H0 andthe[

,

k]-transversal designT.Note that H (resp.H^∗)hasnknodes(resp.ek² nodes)andek²blocks(resp.nkblocks).

Ourintentionwithourconstructionsistoultimatelydesignswitch-centricDCNswithbeneﬁcial properties(asweout- lined in Section 2). Whilst there are many properties we would like our DCNs to have, it is important that DCNs can integrate alarge number ofserver-nodes sothat the server-node-to-server-node distances are shortandso that there is redundancy astowhich(short)server-node-to-server-noderouteswechoosetouse.Inourframeworkofbipartitegraphs, thistranslatesasbuildingbipartite graphswithalargenumberofnodesandwithredundant(short)node-to-nodepaths.

As a ﬁrst step,thefollowing resultwas proven in[11] (it isactually derivablefromthe proofsofour upcoming results) andallowsuscontroloverthelengthofshortestblock-to-blockpathsin2-stepconstructions(andsoshortestnode-to-node pathsin3-stepconstructions).

Theorem2([11]).Supposethatthe

(

dk

, )

-bipartitegraphH hasbeenconstructedusingthe2-stepmethodusingthe

(

d

, )

-bipartite graphH0andthe[

,

k]-transversaldesignT .IfH0hasline-diameter

λ

≥⁴^then^{H has}line-diameter

λ

.

Ofcourse,if H^∗ isthedualbipartitegraphof H inTheorem 2thenit hasdiameter

λ

.Wereiteratethatournotionof diameterandline-diameterdiffersfromthatin[11,10](wherethelengthofablock-to-blockpathisthenumberofnodes onthatpath;so,in[11,10]thebound

λ

≥^{4 in}ôur^{Theorem 2}âppearsâs

λ

≥^2).

3.2. Iteration

Wecaniteratethe3-stepconstruction(aswasdonein[10]).NotethatifH0 isa

(

d

, )

-bipartitegraphofline-diameter

λ

≥^4,^withⁿ^nodes^and^e^blocks,^then^the^bipartite^graph ^H1 resultingfromthe2-stepconstruction(using H0 andsome [,^k]-transversal design T)isa

(

dk

, )

-bipartite graphofline-diameter

λ

.So,repeating the2-stepconstruction butwith H1replacingH0(wekeepthesameT,althoughwedonothaveto)yieldsa

(

dk²

, )

-bipartitegraphH2ofline-diameter

λ

. Byiteratingthisconstruction,wecanclearlyobtaina

(

dkⁱ

, )

-bipartitegraphHiofline-diameter

λ

.Converting Hi intoH^∗_i resultsinabipartitegraphwithek²ⁱnodes,withnkⁱblocks,withdiameter

λ

,andthatisregularofdegree

anduniform ofrankdkⁱ.

3.3. Composition

Weare nowinapositiontotransformourbipartite graphsintoswitch-centric DCNs.Aswell astheconstructions,and their associatedproofs, that were presentedin [10], newmethods of composing bipartite graphs (builtaccording to the

(8)

Fig. 4.Building a switch-centric DCN via MethodAwhenc>1.

Fig. 5.Building a switch-centric DCN via MethodAwhenc=^1.

Fig. 6.Building a switch-centric DCN via MethodB.

3-stepconstruction)soastoobtainswitch-centricDCNswere alsoderived.In[10], 4 suchmethods weregiven:Methods M₁,M₂ andM₃ aredifferentcasesofMethod A,below;andMethodM₄isMethodB.

Inwhatfollows,let Hbea

(, δ)

-bipartitegraphwhere

< δ

andwheretherearennodesandeblocks.

MethodA:Wetakec copiesofH where

δ

−^c

>

0 andc≥^1.^Forêach ^nodeûôf ^H:^we^remove^thecorrespondingnode ineachofthec copiesofH andintroduceanewswitch-node(commontoallcopiesofH);wemakeallofthec

edges incident withthec original nodesincident withthisnewswitch-node; andweattach

ρ

=

δ

−^c

pendantserver-nodes to thenew switch-node.Allblocks of H are considered asswitch-nodes.We follow[10] andcallthe newswitch-nodes level-1switch-nodes,and the original switch-nodeslevel-2switch-nodes.The construction ofthe switch-centric DCN N

(

H

)

fromH viathismethodcanbevisualisedasinFig. 4,whereweonlyshowtheconstructionforthec nodescorresponding toonenodeofH.Notethatevery switch-nodeofN

(

H

)

has

δ

ports.Also,thereissomechoiceasregardstheparameterc (sothat choosingdifferentvaluesforc yields differentvaluesfor

ρ

). Weillustrate thespecialcasewhen c=^{1 in}^{Fig. 5,} whereH isa

(

3

,

5

)

-bipartitegraph.Thegeneralcasewhenc≥1 correspondstoMethodM₂ of[10];thespecialcasewhen c=1 correspondstoMethod M₁;andthespecialcasewhenc= ^δ²corresponds toMethodM₃.Inthislattercase,the aimis toensure that every level-1 switch-node isadjacent toroughly the samenumberoflevel-2 switch-nodes asitis server-nodes.Notethat: thenumberofserver-nodesin N

(

H

)

isn

(δ

−^c

)

;the numberoflevel-1 switch-nodes isn; and thenumberoflevel-2 switch-nodesisce.

MethodB:Wenowworkwithaswitch-centricDCNasconstructedbyMethod A.Leteverylevel-1 switch-nodehave

ρ

adjacentserver-nodes.Supposethatthereisanevennumberoflevel-1 switch-nodes.Partitionthesetoflevel-1 switch-nodes intopairs.Foreachpairofswitch-nodes

(

S

,

S

)

:remove^ρ₂server-nodesthatareadjacenttoSandremove^ρ₂^server- nodesthat areadjacentto S; andmakeevery server-nodethat isadjacenttothe switch-node S ortheswitch-node S alsoadjacenttotheother switch-node.Notethatthenumberofportsofanyswitch-nodehasnotchangedbutthatevery server-nodeisnowadjacentto2 switch-nodes.Thephilosophybehindthisconstructionistobettertoleratethefailureofa level-1switch-node.TheconstructioncanbevisualizedasinFig. 6wherepairedlevel-1 switch-nodeshavethesameshade ofgreyandwhere

ρ

=^3.

(9)

Table 1

Comparingswitch-centricDCNsbuiltwithswitch-nodeswith64 ports.

Network # switch ports Diameter # server-nodes # switch-nodes

Fat-Tree 64 6 65,536 5,120

H^∗ 64 4 54,720 6,840

N¹_A(H^∗) 64 6 3,064,320 61,560

N²_A(H^∗) 64 6 437,760 102,600

N³_A(H^∗) 64 6 1,751,040 82,080

NB(H^∗) 64 6 1,532,160 61,560

H¯^∗ 64 4 20,480 1,280

N¹_A(H¯^∗) 64 6 1,228,800 21,760

3.4. SomeillustrationsofDCNs

In [10],switch-centricDCNs constructedusingthe3-stepmethodallied withMethods A and B werefavourablycom- paredwiththe3-levelFat-TreeDCNfrom[6]withregardtothenumberofserver-nodesthereinwhenthediameterandthe numberofportsofaswitch-nodeareheldconstant.Thereaderisreferredto[6,10]forfulldetailsasregardsthetopology ofFat-TreeandtoTables2–4in[10]forthecompletecomparison;however,weincludeareplicatedtableherepurelyfor illustrative purposes.InTable 1(whichis Table 2from[10]):thenumberofportsofanyswitch-nodeisforcedtobe 64;

thediametersoftheDCNsresultingfromusingthe3-stepmethod,iterationandcompositionareforcedtobe(atmost)6 (likethatofFat-Tree);andthenumbersofserver-nodesandswitch-nodesintheresultingDCNsareasgiven(notethatthe lengthofaserver-node-to-server-nodepathasdeﬁnedin[10]isthenumberofswitch-nodesonit,whichisonelessthan ournotionoflengthwhichisthenumberoflinksonthepath).

•^The ^bipartite ^graph ^H^∗ îs ôbtainedûsing ^the ^3-step ^method ^starting ^withâ

(

8

,

8

)

-bipartite graph H₀, that has 855 nodes,855 blocks,anddiameterandline-diameter4 (suchabipartitegraphH₀ exists;see[20]),anda[⁸

,

8]-transversal design T.The DCNH^∗ inTable 1istheDCNobtainedbysimplyregardingevery nodeofthebipartite graphH^∗ asa server-node(notethatinthisDCNwerequirethateveryserver-nodehas8 NICports);theDCNN¹_A

(

H^∗

)

(resp.N²_A

(

H^∗

)

, N³_A

(

H^∗

)

) is obtainedby employing Method A withc=1 (resp. c=7, c=4); and the DCN NB

(

H^∗

)

is obtained by employingMethod BwithN¹_A

(

H^∗

)

(notethatthenumberofswitch-nodesentryinTable 2in[10]isincorrect).

•^The ^bipartite ^graph H¯ isobtained usingthe 3-step methoditerated twice, starting witha

(

4

,

4

)

-bipartite graph H¯0, that has80 nodes, 80 blocks,and diameterand line-diameter 4 (such a bipartite graph H¯₀ exists; see [20]), and a [⁴

,

4]-transversal designT¯ (actually,in[10]thistransversaldesignisnotmentioned;itdoes,however,exist).TheDCN H¯^∗ in Table 1is theDCNobtainedby simplyregarding everynode ofthe bipartite graph H¯^∗ asa server-node(note thatthenumberofserver-nodesentryinTable 2in[10]isincorrect,thoughthecorrectnumberisstatedinthetext);

andtheDCN N¹_A

(

H^∗

)

isobtainedbyemploying Method A withc=^{1 (note} ^that^the ^numbers^ofserver-nodesandof switch-nodesentriesinTable 2in[10]areincorrect).

ItisclearfromTable 1(andfrom[10])thatwecanbuildmuchbiggerserver-centricDCNsusingthe3-stepmethodandthe subsequent iterationsandcompositionsthan Fat-Treebutwithoutincreasing thediameter(which isa proxyforlatency);

ofcourse,wewouldwishthenewDCNstohaveother propertiesthat makethemattractivewithinadatacentrecontext.

Establishing such propertieswas essentially the whole point of[10] andwe continue withthisline ofresearch in what follows. Note that we provide additional illustrations of our constructions of switch-centric DCNs, in tandem with our upcomingresults,inSection4.3.

Beforewemovetoourmainresults,letuscommentonusingthe2-stepmethodasopposedtothe3-stepmethodwhen building ourswitch-centric DCNs(the same commentwas madein [10]). Notethat when one usesthe (iterated)2-step method,whilsttherankoftheresultingbipartitegraphstaysthesame,thedegreegrows.Werewetoattachserver-nodes totheswitch-nodesthat replacethenodesofthe2-stepbipartite graphH,ratherthanthe3-stepbipartite graphH^∗,the number ofports ofthe level-2 switch-nodes(which would be

) wouldbe much lessthan thenumber ofportsof the level-1 switch-nodes.Hence,itmakesmoresensetoproceedaswehavedoneabove.

4. One-to-onepathdiversity

So far, we haveset the scenefrom[10] anddescribed a methodby which we can build bipartite graphs(the 3-step method)whichcanthenbetransformedintoswitch-centricDCNswithmanymoreserversthanFat-Treewhilstmaintaining thediameterofFat-Tree,i.e.,6.However,aswementionedearlier,therearemanymoreaspectstothedesignofDCNswith an important onebeing pathdiversity. Inwhat follows,we highlightsome problemswiththe proofsofone-to-one path diversity in [10] for bipartite graphs builtusing the 3-stepmethod. We then provide not only correctproofs asregards one-to-one pathdiversity butwe alsoextendandimprove theanalysisin [10] withnewresults.We endthe section by applyingourconstructionssoastobuildDCNswithgoodone-to-onepathdiversityproperties.

(10)

are claimedin thesituationwhenthetwo blocks B_Q_,_U and B_Q,U aresuch that Q =^Q ^(recallôur^methodôfîndexing inSection3.1whichweadopthere).However, thereareseriousflawsintheproofofTheorem 3of[10],somuchsothat thetheoremisuntrue.Inshort,Theorem 3of[10]claims thatifthereare

ω

pairwiseinternally-disjointpathsinH0 from Q to Q then there are min{

ω ,

k

ω

_} pairwise internally-disjoint paths in H from BQ,U to BQ,U. This doesnot make sense:themaximumnumberofpairwise internally-disjointpaths in H from B_Q_,_U to B_Q,U is

(as thebipartite graph H has rank

) andso we must havethat min{

ω ,

k

ω

}≤

. Forinstance,in Example1 of[10], thebipartite graph H0 isthe cycleoflength 10 (H0 is derived fromthecycleoflength 5 using its naturalrepresentationas ahypergraph;see Section2.2),sothatd=

=^2,ⁿ=ê=^5,ând^thereâre2 internally-disjointpathsfromanyblockofH0 toanyotherblock ofH₀.A[²

,

3]-transversaldesignT isusedandthebipartitegraphH^∗ builtbythe3-stepmethodhasrank6 anddegree2.

However,ifTheorem 3of[10]were truethen therewouldbe4 pairwise disjointpaths fromBQ,U to B_Q,U in H^∗ which clearlycannotbethecase.

4.2. Theone-to-onescenario

Wenowresurrect(someof)theproofsofthemainresultsfrom[10] andextendtheresultsclaimedinthatpaper.The followinglemmaprovesmostuseful.

Lemma3.LetT besome[

,

k]-transversaldesignwithgroupsofnodes{^D1

,

D2

, . . . ,

D}^.^Let^{U be}^some^block^of^{T .}^For^each i∈ {¹

,

2

, . . . ,

}^,^let^ri∈^DibetheuniquenodeofDithatisadjacenttoU .SetR= {^ri:ⁱ=¹

,

2

, . . . ,

}^.^Let^{P be}^a^set^of^distinct^pairs ofnodessothat:exactlyonenodeofanypairinP isinR andnonodeofR isinmorethanonepairofP ;andnopairinP issuchthat bothnodeslieinthesamegroup.TheblocksgeneratedbythepairsinP arealldistinctanddifferentfromU .

Proof. Supposethat {^ri

,

x}∈^P^,^where^x∈^Dl\^R ^with^l=ⁱ ^and^whereⁱ∈ {¹

,

2

, . . . ,

}^.^LetÛr_i,x betheblockgeneratedby riandx.IfUri,x=Û ^thenÛ îsâdjacent^to^the^distinct^nodes^rl andxinDl whichyieldsacontradiction.

Supposethat {^rj

,

y}∈^P\ {{^ri

,

x}}^,^where ^j∈ {¹

,

2

, . . . , }

^.^LetÛrj,y be theblock generatedby r_j and y. Supposethat Ur_i,x=Ûr_j,y; hence, Ur_i,x is adjacent to both r_i andr_j with i= ^j. Âs âny ^two ^nodes ^lying ⁱⁿ ^distinct ^groups ⁱⁿ ^T âre adjacenttoauniqueblockof T,wemusthavethatUr_i,x=Ûr_j,y=Û^;^but^this^yieldsâcontradictionasabove.Hence,the blocksgeneratedbythepairsinP arealldistinctandalldifferentfromU. 2

Weusethislemmathroughout,bothexplicitlyandimplicitly.

Ourmainresultintheone-to-onecontext isconcerned withbuildingasmanypairwiseinternally-disjointpathsaswe canfromanyblocktoanyotherblockinthebipartitegraphbuiltusingthe2-stepmethod(or,equivalently,fromanynode toanyother node inthebipartitegraphbuiltusingthe 3-stepmethod).We explainthe impactoftheexistence ofthese paths onthepath diversityofsubsequentlybuiltDCNs presently.Oneaddedandsigniﬁcant complicationintheproof of thefollowingresultcomesaboutwhenthetransversaldesignT isa[^k+¹

,

k]-transversaldesign(so,thereisthepotential for

=^k+¹

>

kpaths).

Theorem4.Letk

, ,

d≥^2.^Let^{H be}^built^by^the^2-step^method^from^the

(

d

, )

-bipartitegraph H₀usingthe[,^k]-transversal designT .

(a) LetQ andQbedistinctblocksofH0sothatthereare

λ

≥¹^pairwiseinternally-disjointpathsinH0fromQ toQ,eachoflength atmost

μ

.Therearemin{,^k}^pairwiseinternally-disjointpathsfromanyblockB_Q_,_V of H toanyotherblockB_Q,V of H . Furthermore,if

λ

≥²^then^there^are

pairwiseinternally-disjointpathsfromanyblockBQ,V ofH toanyotherblockB_Q,V of H .Allpathshavelengthatmost

μ

+^4.

(b) IfBQ,VandBQ,VaredistinctblocksofH thenthereare

pairwiseinternally-disjointpathsfromBQ,VtoBQ,V,eachoflength atmost6andlyingentirelywithinT_Q.

Proof. RecallthatwementionedinSection2.6thatnecessarily

≤^k+^1.

Case (a)(i):Supposethat:

=^k+^1;

λ

≥^2;^and^the^distinct^nodes ^p1andp2arecommonneighboursinH0 ofQ andQ. We‘batch’thegroupsofnodesof T_Q andT_Q togethersothatineachofT_Q andT_Q,thek+^{1 groups}^of^nodes^form 1 batchofkgroupsand1 batchof1 groupasfollows:

(11)

Fig. 7.The basic set-up in Case (a)(i).

•^forⁱ∈ {¹

,

2}^,^deﬁne^Gⁱ₀=^Gp_i=^H₀ⁱ

•^the ^remaining ^k−^{1 groups} ^within ^TQ are G¹₁

,

G¹₂

, . . . ,

G¹_k₋₁ and the remaining k−^{1 groups} ^within ^TQ are H¹₁

,

H¹₂

, . . . ,

H¹_k₋₁sothat:

– any groupoftheformG¹_j,where j

>

0,isassociatedwithsomenode p∈ {

/

^p1

,

p₂}^of ^H0 thatisadjacenttoboth Q and Qif,andonlyif,thegroup H¹_j isassociatedwiththesamenode pof H0 (so,ifG¹_j andH¹_j areassociatedwith thesamenode p∈ {

/

^p1

,

p₂}^of^H0thentheyarethesamegroupinH).

Foreach j∈ {⁰

,

1

, . . . ,

k−¹}^,^let^r¹_j∈^G¹_j ^(resp.^s¹_j∈^H¹_j⁾^be ^theûnique^node ôf^G¹_j ^(resp. ^H¹_j⁾ ^that îsâdjacent^to ^BQ,V (resp. BQ,V) in H.Notethat thepairr¹_j ands¹_j lieinthesamegroup ofnodesin Hif, andonlyif,both G¹_j and H¹_j are associatedwiththesamenode pofH0 andthisnodepisadjacenttobothQ andQinH0.Thesituationcanbevisualized asinFig. 7(whereinthiscase Q and Q havea+²≥^{2 common}^neighboursⁱⁿ ^H0 andwhere,forexample,r¹₁=^s¹₁ ^but r_a¹=^s¹a).

LetG¹₀= {^r₀¹

,

t₁

, . . . ,

t_k₋₁}^and^H₀¹= {^s¹₀

,

w₁

, . . . ,

w_k₋₁}^so^that:

•^if^r₀¹=^s¹₀ ^then^tj=^wj,for j=¹

,

2

, . . . ,

k−¹

•^if^r₀¹=^s¹₀ ^then^r₀¹=^w1,s¹₀=^t1 andtj=^wj,for j=²

,

3

, . . . ,

k−^1.

WearenowreadytogeneratesomeblockswithinTQ andTQ inH.Foreach j∈ {¹

,

2

, . . . ,

k−¹}^:

•^let ^B_r¹

j,tj betheuniqueblockofT_Q inHgeneratedbythenodesr¹_j∈^G¹_j ^and^tj∈^G¹₀

•^let ^B_s1

j,wj betheuniqueblockofT_Q inH generatedbythenodess¹_j∈^H¹_j ^and^wj∈^H¹₀^.

So,we havegeneratedk−^{1 blocks}ⁱⁿ^TQ andk−^{1 blocks}ⁱⁿ ^TQ.Notethatanyblockof T_Q isnecessarilydistinctfrom anyblockofT_Q.ByLemma 3appliedtwicetoboth T_Q andT_Q,allblocksof{^B_r¹

j,tj:^j=¹

,

2

, . . . ,

k−¹}^are^distinct^and differentfromBQ,V,andallblocksof{^B_s1

j,wj:^j=¹

,

2

, . . . ,

k−¹}^are^distinct^and^different^from^BQ,V.Callthesetwosets ofblocksourworkingsetsofblocks.

WearenowinapositiontobuildsomepathsfromB_Q_,_V to B_Q,V in H.Ifr¹₀=^s¹₀ ^then^deﬁne^the^paths:

•

π

₀¹asBQ,V

,

r₀¹

,

BQ,V

•

π

₁¹asB_Q_,_V

,

r₁¹

,

B_Q,V,ifr¹₁=^s¹₁^,^and^as^BQ,V

,

r¹₁

,

B_r1 1,t1

,

t₁

,

B

s¹₁,w₁

,

s¹₁

,

B_Q,V,ifr¹₁=^s¹₁ ^(note^that^t1=^w1).

Ifr¹₀=^s¹0 thendeﬁnethepaths:

•

π

₀¹asBQ,V

,

r₀¹

,

B

s¹₁,w1

,

s¹₁

,

B_Q,V (notethat w1=^r₀¹⁾

•

π

₁¹asB_Q_,_V

,

r₁¹

,

B_r1

1,t1

,

s¹₀

,

B_Q,V (notethatt₁=^s¹₀^).

We’ll nowbuild pathsfrom BQ,V to B_Q,V usingnodesfromthegroups{^G¹₀}∪ {^G¹_j

,

H¹_j:^j=²

,

3

, . . . ,

k−¹}^.^For^each j∈ {²

,

3

, . . . ,

k−¹}^:

•^if^r¹_j=^s¹_j ^then^deﬁne^the^path:

–

π

¹_j asBQ,V

,

r¹_j

,

B_r1 j,t_j

,

tj

,

B

s¹_j,wj

,

s¹_j

,

BQ,V (notethattj=^wj)

•^if^r¹_j=^s¹_j ^then^deﬁne^the^path:

–

π

¹_j asBQ,V

,

r¹_j

,

BQ,V.

topologies On the combinatorial design of data centre network Journal of Computer and System Sciences

Journal of Computer and System Sciences

On the combinatorial design of data centre network topologies ✩ , ✩✩

Iain A. Stewart

(

,

)

(

,

)

(

,

)

(

, )

(

, )

(

,

)

(

,

)

(

,

)

(

,

)

,

(X ,

(

,

, . . . ,

)

,

, . . . ,

,

, . . . ,

,

, . . . ,

,

(

,

)

(

, )

,

,

(

, )

,

,

, . . . ,

,

, . . . ,

,

, . . . ,

,

, . . . ,

,

, . . . ,

,

, . . . ,

,

, . . . ,

,

, . . . , }

,

, . . . ,

(

, )

(,

)

(

, )

,

(

, )

(

On the combinatorial design of data centre network topologies ^✩ ^, ^✩✩