Single-source shortest path methods - Selected chapters from algorithms

Graphs have a plenty of applications, of which we will investigate only one here:

the problem of finding the shortest path from a vertex to another (typically used in route planners, among others). Since there is no difference in worst case time complexity whether we look for the shortest path from a given vertex to a single other one or to all others (and there is no separate algorithm either), we investigate the problem of finding the shortest paths from a single vertex (also called the source).

Breadth-first search

We have seen in subsection “Binary search trees” how binary trees can be walked.

The same problem, i.e. walking the vertices arises on graphs in general. Two basic methods are the depth-first search and the breadth-first search. The depth-first search is a backtracking algorithm (see page 8). It starts from the source and goes along a path as far as it can without revisiting vertices. If it gets stuck, it tries the remaining edges running out from the actual vertex, and when there are no more, it steps back one vertex on its original path coming from the source and keeps on trying there.

The breadth-first search is not only simpler to implement than the depth-first search but it is also the basis for several important graph algorithms, therefore we are going to investigate this in details in the following. Breadth-first search can be imagined as an explosion in a mine where the edges of the graph represent the galleries of the mine which are assumed to have equal lengths. After the explosion

a shockwave starts from the source reaching the vertices adjacent to it first, then the vertices adjacent to these, etc. The breadth-first search algorithm usually cannot process all neighbors of a vertex at the same time on a computer, as in the mine example (except for the case of parallel computing), hence they are processed in a given order.

The following pseudocode executes a breadth-first search from the source vertex s in a graph given with its adjacency matrix A.

BreadthFirstSearch(A,s,D,P)

Parameters D and P are references to two one-dimensional arrays where the procedure stores the vertices’ distances from the source during the search and their predecessors on the paths where they were reached. Initially it stores 0 in

Figure 24. Breadth-first search from the source 1 in a graph.

The labels written in the vertices denote the shortest distance from the source and the predecessor on a shortest path from the source in parentheses, respectively

all elements of P indicating that no predecessors have been assigned to the vertices yet, and ∞ in the elements of D (in reality any value greater than any distance that can be found during the search, e.g. the number A.CountRows of vertices of the graph) to indicate that the vertices have not been searched yet.

The algorithm supplies a FIFO queue Q, where it puts the vertices that have already been reached but whose neighbors have not been processed yet. In the for-loop it checks all the neighbors j of vertex v (in line 9 it verifies whether vertex j is adjacent to v and if it has not been visited yet) and sets the values for the distance of the source and the predecessor on the path coming from the source.

Knowing the predecessors stored in array P of the vertices, the path leading to a vertex from the source can be restored recursively any time.

Dijkstra’s algorithm

The breadth-first search immediately delivers shortest paths from a given source to any of the vertices, however, only if the length of all edges is considered as 1.

In reality the edges of a network where shortest paths are to be determined have different weights. A very similar algorithm to the breadth-first search is Dijkstra’s algorithm which can provide the shortest paths provided that every weight is positive which is mostly the case in real-life applications.

The difference between the breadth-first search and Dijkstra’s algorithm is that whilst the former never modifies a (shortest) distance value once given to a vertex, Dijkstra’s algorithm visits all vertices adjacent to the vertex just being checked, independently from whether any of them have already been labeled or not.

Figure 25. The result of Dijkstra’s shortest path method started from source vertex 5.

The bold values over the edges are the weights, the rest of the notation is similar to that of Figure 24.

However, to achieve an optimal solution it always chooses the one with the least distance value among the unprocessed vertices (those whose neighbors have not been checked yet). It realizes this idea using a minimum priority queue (that can be implemented using minimum heaps, see Page 37) denoted by M in the following pseudocode. The priority queue’s order is defined by the values of array D of distances.

Dijkstra(A,s,D,P)

1 for i  1 to A.CountRows 2 do P[i]  0

3 D[i]  ∞

4 D[s]  0

5 for i  1 to A.CountRows 6 do M.Enqueue(i) 7 repeat

8 v  M.ExtractMinimum 9 for j  1 to A.CountColumns 10 do if A[v,j] > 0

11 then if D[j] > D[v] + A[v,j]

12 then D[j]  D[v] + A[v,j]

13 P[j]  v

14 until M.IsEmpty

The idea mentioned above is simply an extension of the breadth-first search. We can even visualize this if the weights are natural numbers as in the example of Figure 25. In this case, if we insert virtual vertices in the graph the problem is reduced to a breadth-first search in an unweighted graph (see Figure 26, the virtual vertices are shaded). The choice formulated by taking the vertex with the minimal distance value in line 8 of the pseudocode translates in the converted problem to the order of vertices in the FIFO queue Q of the breadth-first search.

The time complexity of Dikstra’s algorithm adds up from initializing the data structures (setting initial values in lines 1-4 and enqueuing all vertices into M in lines 5-6) and executing the search itself (the loop construct in lines 7-14).

Initialization of arrays D and P takes 𝑂(𝑛) time (if denoting the number of vertices in the graph by n). Building a heap (using this optimal implementation of priority queue M) in lines 5-6 is linear, so this is 𝑂(𝑛) again (see page 39). The repeat loop is executed n times, for each iteration cycle consuming at most 𝑂(log 𝑛 + 𝑛) time,

resulting in 𝑂(𝑛(log 𝑛 + 𝑛)) = 𝑂(𝑛²) in worst case. Since the initialization part does not worsen this, the final result is 𝑇(𝑛) = 𝑂(𝑛²).

Exercises

69 Demonstrate how the breadth-first search works on the example graph of Figure 24.

70 Demonstrate how Dijkstra’s algorithm works on the example graph of Figure 25.

71 Write the pseudocode of a procedure that lists a shortest path from the source to a given vertex calling Dijkstra’s algorithm.

2 (4)

3 (1)

1 (5)

0 (0)

2 (3)

3 (e)

1 (5)

2 (f)

2 (c)

3 (b)

1 (5)

Figure 26. Visualizing the basic idea of Dijkstra’s algorithm by converting the example of Figure 25 to a breadth-first search problem in an unweighted graph using virtual vertices.

In document Selected chapters from algorithms (Pldal 94-99)