Growing a mapping - Parameterized Complexity of Graph Modiﬁcation and Stable Matching Problems

3.2 Trees

3.2.2 Growing a mapping

From now on, we assume that Reductions A and B cannot be applied. At this point, the algorithm checks whether the conditions of Lemma 3.2.1 are fulfilled, and correctly outputs

’No’ if the conditions do not hold. Letφdenote the isomorphism fromT to TS that we are looking for. As in Section 3.1, we try to grow a partial mapping fromT toTS, which is always a restriction ofφ. To begin, the algorithm chooses an arbitrary starting vertexr0 inT, and branches on the choice ofφ(r0) inG, which means|V(G)|possibilities.

Throughout its running, the algorithm may modifyG by deleting vertices of S from it.

We denote byGⁱ the graph obtained fromGafter thei-th step, withG=G⁰. Assume that in thei-th step of the algorithm there is a subtreeDⁱofT on whichφis already known. The algorithm proceeds step by step, choosing a leafrⁱ ofDⁱ in the i-th step that has not been examined yet. For the chosen vertex rⁱ, it determines φ on NT(rⁱ) by applying a method described below. This means also that it addsNT(rⁱ) toDⁱto getDⁱ⁺¹, deletesNG(φ(rⁱ))∩S from Gⁱ to get Gⁱ⁺¹, and checks whether φ is still an isomorphism. When determining φ onNT(rⁱ), the algorithm may branch into a bounded number of branches, or may proceed

3.2. Trees 43 with a single branch. Accordingly, we distinguish betweenbranching andsimple cases.

Let us describe the details of a single step executed by the algorithm. First, it checks whether|V(Gⁱ)| ≥ |V(T)|holds, outputting ’No’ if the condition fails. Next, it verifies some simple conditions considering the neighbors ofrⁱ and φ(rⁱ) =r⁰ⁱ. To do this, it determines the minimal connected subgraphKⁱ ofGⁱ containing every cycle ofGⁱ. Note thatKⁱcan be constructed fromGⁱ easily in linear time, as the 2-connected components of a graph can be determined in linear time, e.g. by applying depth first search [37].

To proceed, let us introduce some new notation. We divide the vertices of NT(rⁱ) into two groups as follows:

• those neighbors ofrⁱ that are inDⁱ,

• those neighbors ofrⁱ that are not inDⁱ. Lettⁱ₁, . . . , tⁱ_αi denote these vertices, and letT_jⁱ be the tree component ofT−rⁱ containingtⁱ_j.

Similarly, we classify the vertices ofNG(r⁰ⁱ) into three groups:

• those neighbors ofr⁰ⁱ that are inφ(Dⁱ),

• those neighbors of r⁰ⁱ outside φ(Dⁱ) that are connected to r⁰ⁱ by edges not in Kⁱ. Let t⁰₁ⁱ, . . . , t⁰_βⁱi denote these vertices, and T_j⁰ⁱ denote the component of Gⁱ−r⁰ⁱ that includest⁰_jⁱ. Observe that either T_j⁰ⁱ is a tree, orr⁰ⁱ∈/ V(Kⁱ) andT_j⁰ⁱ containsKⁱ.

• those neighbors ofr⁰ⁱoutsideφ(Dⁱ) that are connected tor⁰ⁱ by edges inKⁱ. Letγⁱbe the number of such vertices.

Clearly, αⁱ ≤βⁱ+γⁱ, and the equality holds if and only if NGⁱ(r⁰ⁱ)∩S =∅. Thus, if the algorithm finds thatαⁱ> βⁱ+γⁱ, then it outputs ’No’.

First, let us observe that if the treeT_hⁱ is isomorphic toT_j⁰ⁱ for somehandj, then w.l.o.g.

we can assume thatφ(T_hⁱ) =T_j⁰ⁱ. As the trees of a forest can be classified into equivalence classes with respect to isomorphism in time linear in the size of the forest [6, 78], this case can be noticed easily. Given two isomorphic trees, an isomorphism between them can also be found in linear time, so the algorithm can extendφonT_hⁱ, adding alsoT_hⁱ to the subgraphDⁱ. Hence, we only have to deal with the following case: no treeT_hⁱ (h∈[αⁱ]) is isomorphic to one of the graphs T_j⁰ⁱ (j ∈ [βⁱ]). This argument makes our situation significantly easier, since every graph T_j⁰ⁱ must contain some vertex from S. Therefore βⁱ ≤ |S| = k. Clearly, ifr⁰ⁱ ∈/ V(Kⁱ) thenγⁱ= 0. Ifr⁰ⁱ∈V(Kⁱ) thenr⁰ⁱ can have degree at mostk²+kinK⁰, and thus inKⁱ, by Lemma 3.2.1. Thus, we getγⁱ≤k²+k, implying alsoαⁱ≤βⁱ+γⁱ≤k²+ 2k.

The algorithm determinesαⁱ, βⁱandγⁱin each step, and outputs ’No’ if these bounds do not hold for them.

The algorithm faces one of the following two cases at each step.

Simple case: βⁱ +γⁱ ≤ 1. In this case, αⁱ ≤ 1. If βⁱ+γⁱ = 0 then αⁱ = 0, hence the algorithm proceeds with the next step by choosing another leaf of Dⁱ not yet visited.

Otherwise, letv be the unique vertex inNGⁱ(r⁰ⁱ)\V(φ(Dⁱ)). Ifαⁱ= 0 thenv must be inS, otherwiseφ(tⁱ₁) =v. According to this, the algorithm deletes v or extendsφ ontⁱ₁, adding alsotⁱ₁ toDⁱ.

Branching case: βⁱ+γⁱ ≥ 2. In this case, the algorithm branches on every possible choice of determiningφ onNT(rⁱ). Guessingφ(v) for a vertexv∈NV(T−Dⁱ)(rⁱ) can result in at mostβⁱ+γⁱ possibilities, so the number of possible branches in a branching step is at most (βⁱ+γⁱ)^αⁱ ≤(k²+ 2k)^k²^+2k. After guessingφ(v) for each vertexv∈N_V_(T₋_Dⁱ₎(rⁱ), the algorithm puts the remaining verticesNG(r⁰ⁱ)\ {φ(v)| v ∈ NT(rⁱ)} into S, deleting them fromGⁱ.

Lemma 3.2.2. In a single branch of a run of the algorithm described above on a solvable input for the Cleaning(Tree,−)problem with parameterk, there can be at most g(k) + 2k−2 = 2k³(k+ 1) + 5k−2 =O(k⁴)branching steps.

Proof. We use the notation applied in the description of the algorithm. The i-th step can only be a branching case if eitherγⁱ ≥2,βⁱ ≥2, or βⁱ =γⁱ = 1 holds. For each of these cases, we give an upper bound on the number of steps in a single branch of a run of the algorithm where these cases can happen.

To determine a bound for the case γⁱ ≥2, let r^∗ be the first vertex in T examined in a step such that φ(r^∗) is inK⁰. Recall thatK⁰−S is a tree, so supposing φ(rⁱ)∈V(K⁰) we get that if rⁱ 6= r^∗ then for the edge e incident to rⁱ in Dⁱ it must hold that φ(e) is in K⁰. Now, observe that if γⁱ≥2 holds, then this implies that eitherrⁱ =r^∗ or φ(rⁱ) has at least three edges incident to it inK⁰. The latter means thatrⁱ ∈K3, where K3 denotes the vertices ofK⁰ having degree at least three in K⁰. Thus, the condition γⁱ≥2 can hold in at most|K3|+ 1≤g(k) steps, by Lemma 3.2.1.

Now, if the algorithm finds βⁱ≥2, then recall that bothT₁⁰ⁱ andT₂⁰ⁱinclude at least one vertex fromS, and thusGⁱ−φ(Dⁱ) has more connected components containing vertices ofS thanGⁱ−φ(Dⁱ−rⁱ) has. It is easy to see that this can be true for only at most|S| −1 such vertices rⁱ, so this case can happen at mostk−1 times in a single branch of a run of the algorithm.

Finally, letS^∗denote those vertices ofS that are not contained inK⁰. Clearly, ifs∈S^∗, then s is not contained in any cycle of G, so |NG(s)∩V(TS)| ≤ 1. Now, if βⁱ = γⁱ = 1, then r⁰ⁱ ∈ V(K) and the edge r⁰ⁱt⁰₁ⁱ must be one of the edges that connect to K⁰ a tree in G−K⁰ containing a vertex in S^∗. Observe that there can be at most |S^∗| ≤k−1 such edges. Therefore, the claim follows.

As Lemma 3.2.2 only bounds the number of branching steps for solvable inputs, the algorithm ensures the same bound on every input by maintaining a counter for these steps.

Thus, it outputs ’No’ if it encounters a branching case for the (g(k) + 2k−1)-th time.

As the number of branches in a branching case is k^O(k²⁾, and the number of branching cases in a single branch of a run of the algorithm isO(k⁴), the number of leaves in the search tree explored by the algorithm (i.e. the number of steps where the algorithm stops, regarding all the branches in total) isk^O(k⁶⁾. At each vertex, the algorithm uses time at most linear in|V(G)|. The number of steps performed in a single branch of a run of the algorithm is at most|V(T)|, hence the algorithm needs quadratic time after choosingφ(r0) for the starting vertexr0. Trying all possibilities forφ(r0) increases this to cubic time. ReductionsAandB can also be executed in cubic time, as argued before, so we can conclude:

Theorem 3.2.3. The Cleaning(Tree,−)problem on input(T, G)can be solved in k^O(k⁶⁾n³ time, wheren=|V(T)| and|V(G)|=n+k.

CHAPTER 4 Induced Subgraph Isomorphism on interval graphs

In this chapter, we discuss the parameterized complexity of the following problem: given two interval graphs G and H, decide whether we can delete some vertices of G to obtain a graph isomorphic to H. On the one hand, we prove that this special case of Induced Subgraph Isomorphismis NP-hard, and we show that it is W[1]-hard when parameterized by the|V(H)|, denoting the number of vertices in the smaller graph.

On the other hand, we present a newly developed FPT algorithm for this problem, when parameterized by the number |V(G)| − |V(H)|, denoting the number of vertices which we have to delete fromGto obtain a graph isomorphic toH. Using the notation of the previous chapter, we will denote the resulting parameterized problem byCleaning(Interval, Interval), withInterval standing for the class of interval graphs.

Interval graphs form an important and widely studied class of graphs. Thanks to their strict structure, many NP-hard problems become polynomial-time solvable when restricted to interval graphs [64, 51]. They have numerous applications in scheduling problems but also in various areas of computational biology. Apart from the theoretical interest, the investigation of the Cleaning problem for interval graphs is also motivated by its similarity with an important problem in biology, namely theArc-Preserving Subsequenceproblem [109].

In Section 4.1 we give a brief introduction to a data structure called labeled PQ-trees which yield a canonical form for interval graphs. Section 4.2 contains the obtained hardness results, and Sections 4.3 and 4.4 cover our FPT algorithm forCleaning(Interval, Interval).

The results of this chapter appear in [108].

4.1 Interval graphs and labeled PQ-trees

LetGbe an interval graph, meaning thatGcan be regarded as the intersection graph of a set of intervals. Formally, an interval representation ofG is a set{Ii |i∈[n]} of intervals, where Ii andIj intersect each other if and only ifvi andvj are adjacent. We say that two intervalsproperly intersect, if they intersect, but none of them contains the other.

LetC(G) be the set of all maximal cliques inG, and letC(v) ={C|v∈C, C∈ C(G)}for somev∈V(G). It is known that a graph is an interval graph if and only if its maximal cliques can be ordered consecutively, i.e. there is an ordering ofC(G) such that the cliques inC(v)

form a consecutive subsequence [65]. Note that any interval representation gives rise to a natural ordering ofC(G), which is always a consecutive ordering. The set of all consecutive orderings ofC(G) are usually represented by PQ-trees, a data structure introduced by Booth and Lueker [22].

APQ-treeofGis a rooted treeT with ordered edges with the following properties: every non-leaf node is either a Q-node or a P-node, each P-node has at least 2 children, each Q-node has at least 3 children, and the leaves ofTare bijectively associated with the elements ofC(G).

For an illustration, see Figure 4.1. Thefrontier F(T) of the PQ-treeT is the permutation ofC(G) that is obtained by ordering the cliques associated with the leaves ofT simply from left to right. Two PQ-treesT1 and T2 are equivalent, if one can be obtained from the other by applying a sequence of the following transformations: permuting the children of a P-node arbitrarily, or reversing the children of a Q-node. The consecutive orderings of the maximal cliques of a graph can be represented by a PQ-tree in the following sense: for each interval graphGthere exists a PQ-treeT, such that{F(T⁰)|T⁰is a PQ-tree equivalent toT}yields the set of all consecutive orderings ofC(G). Such a PQ-treerepresents G. For any interval graphGa PQ-tree representing it can be constructed in linear time [22].

This property of PQ-trees can be used in the recognition of interval graphs. However, to examine isomorphism of interval graphs, the information stored in a PQ-tree is not sufficient.

For this purpose, a new data structure, the labeled PQ-tree has been defined [96, 35]. For a PQ-treeT and some nodes ∈ V(T), let Ts denote the subtree of T rooted at s. For each vertex v in G, let the characteristic node R(v) of v in a PQ-treeT representing Gbe the deepest nodesinT such that the frontier ofTscontainsC(v). For a nodes∈V(T), we will also writeR⁻¹(s) ={x∈V(G)|R(x) =s}, and ifT⁰ is a subtree ofT, thenR⁻¹(T⁰) ={x∈ V(G) | R(x) ∈ V(T⁰)}. Observe that if R(v) is a P-node, then every clique in the frontier ofTR(v)containsv. It is also true that ifR(v) is a Q-node with childrenx1, x2, . . . , xm, then those children ofR(v) whose frontier contains v form a consecutive subseries ofx1, . . . xm. Formally, there must exist two indicesi < j such thatC(v) ={C|C∈F(Txh) for somei≤ h≤j}.

A labeled PQ-tree of G is a labeled version of a PQ-treeT of G where the labels store the following information. Ifxis a P-node or a leaf, then its label is simply |R⁻¹(x)|. If q is a Q-node with childrenx1, x2, . . . , xm (from left to right), then for eachv ∈R⁻¹(Tq) we defineQq(v) to be the pair [a, b] such thatxa andxbare the leftmost and rightmost children of q whose frontier in T contains C(v). (See again Figure 4.1.) Also, if Qq(v) = [a, b] for some vertex v, then we let Q^left_q (v) =a and Q^right_q (v) = b. For some 1 ≤ a ≤b ≤m, the pair [a, b] is a block of q. Considering blocks of a Q-node, we will use a terminology that treats them like intervals, so two blocks can be disjoint, intersecting, they contain indices, etc. The labelL(q) of qencodes the values|Lq(a, b)| for eacha < bin [m], whereLq(a, b) is the set{v∈R⁻¹(q)|Qq(v) = [a, b]}.

Note that a PQ-tree can be labeled in linear time. Two labeled PQ-trees are identical, if they are isomorphic as rooted trees and the corresponding vertices have the same labels.

Two labeled PQ-trees areequivalent, if they can be made identical by applying a sequence of transformations as above, with the modification that when reversing the children of a Q-node, its label must also be adjusted correctly. The key theorem that yields a way to handle isomorphism questions on interval graphs is the following:

Theorem 4.1.1([96]). LetG1andG2be two interval graphs, and letT^L(G1)andT^L(G2)be the labeled version of a PQ-tree representingG1 andG2, respectively. ThenG1 is isomorphic toG2 if and only if T^L(G1)is equivalent to T^L(G2).

Given a Q-nodeqin a PQ-treeT, letx1, . . . , xmdenote its children from left to right. For a given childxi of q, we defineMq(i) to be the set of verticesv ∈ R⁻¹(q) for which Qq(v)

In document Parameterized Complexity of Graph Modiﬁcation and Stable Matching Problems (Pldal 48-53)