Single Source Shortest Path

One type of the shortest path problem.

Dijkstra’s and Bellman-Ford algorithms are two of the most well-known algorithms.

Table of contents

Problem Definition
General Algorithm Structure
1. Initialization
2. Relaxation
Path-Relaxation Property
Shortest Path is Always Well-Defined in DAG
1. How to Find the Topological Order?
Dijkstra’s Algorithm
Bellman-Ford Algorithm
1. Why $|V| - 1$ iterations?
2. DP Solution for Bellman-Ford

Problem Definition

We are given a source vertex $s \in V$.

The single-source shortest-paths problem (SSSP):

Find the shortest path from $s$ to every other vertex $v \in V$.

The result is a shortest path tree rooted at $s$.

The following variants can be reduced to the single-source problem:

Single-Destination: Find the shortest path to a given vertex.
- Reverse the direction of all edges
Single-Pair: Find the shortest path between two given vertices.
- Trivially solved

However, all-pairs is (sort of) a different setting. It is solvable by running the single-source algorithm once for each vertex, but the time complexity is improved by a different algorithm. We will discuss it later.

General Algorithm Structure

Each vertex $v$ stores two attributes:
- Distance $v.d$: the tentative shortest path weight estimate from $s$ to $v$
- Predecessor $v.\pi$: the vertex preceding $v$ in the tentative shortest path
We want to be able to reconstruct the path, that’s why we store the predecessor.

SSSP algorithms initialize distance estimate and tentative predecessor for each vertex, then iteratively relax edges to improve the estimates.

Initialization

def init_single_source(G, s):
  for v in G.V:
    v.d = float('inf')
    v.p = None
  s.d = 0

$O(|V|)$ time complexity.

Relaxation

Relaxing an edge $(u,v)$ tests whether our shortest path estimate $v.d$ can be improved by $u.d + w(u,v)$ (going through $u$).

If going through $u$ is better, update $v$:

def relax(u, v):
  if v.d > u.d + w(u, v):
    v.d = u.d + w(u, v)
    v.p = u

On the term "Relaxation"

I found the term “relaxation” a bit confusing, so I decided to think of it as a bezier curve.

You had a pretty rigid bezier curve to $v$. But you add control point $u$ which makes the curve smoother and relaxes the curve to $v$.

Path-Relaxation Property

Let path $p = (v_0, v_1, \ldots, v_k)$ where $w(p) = \delta(v_0, v_k)$

In other words, $p$ is the shortest path from $v_0$ to $v_k$.

If we relax the edges of $p$ in order $(v_0, v_1), (v_1, v_2), \ldots, (v_{k-1}, v_k)$, then $v_k.d = \delta(v_0, v_k)$.

The important thing is that these relaxations do not need to be done in a strictly sequential order. Relaxations on any other edges can be interleaved.

Proof by Induction

Base case: $k = 0$: Trivially, $v_0.d = 0 = \delta(v_0, v_0)$.
Inductive hypothesis: $v_{k-1}.d = \delta(v_0, v_{k-1})$.
Inductive step:

We now relax $(v_{k-1}, v_k)$.

This means that now $v_k.d = v_{k-1}.d + w(v_{k-1}, v_k)$.

By the inductive hypothesis,

\[v_k.d = \delta(v_0, v_{k-1}) + w(v_{k-1}, v_k) = \delta(v_0, v_k)\]

This property is crucial for the correctness of the algorithms below.

The algorithms somehow manage to relax the edges in the topological order defined by the shortest path, which is why they work.

Shortest Path is Always Well-Defined in DAG

Directed Acyclic Graphs (DAGs) is just a tree with directed edges. So we can always find a topological order of the vertices.

By logic, the shortest path has to be in this topological order, otherwise, it would not be the shortest path.

If we do a topological sort of the vertices and edges, and relax the edges in this order, by the path-relaxation property, we can always find the shortest path in a DAG.

How to Find the Topological Order?

The question now is how do we find the topological order?

One way to do this is to use depth-first traversal (DFT).

def topo_sort(G):
  order = []
  for v in G.V:
    v.visited = False
  for v in G.V:
    if v.visited == False:
      df_visit(G, v, order)
  
  return reversed(order)

def df_visit(G, u, order):
  u.visited = True
  for v in G.adj[u]:
    if v.visited == False:
      df_visit(G, v)
  order.append(u)

Dijkstra’s Algorithm

Dijkstra’s algorithm is a greedy algorithm.
Dijkstra’s algorithm works for non-negative edge weights.
Dijkstra’s algorithm is optimal.
- Dijkstra’s algorithm is not optimal for negative edge weights.
Dijkstra’s algorithm relaxes each edge once.
Dijkstra’s algorithm uses a minimum priority queue.
- Generalization of BFS to weighted graphs.

Pseudocode

def dijkstra(G, s):
  init_single_source(G, s)  # O(V)

  S = set()
  Q = PriorityQueue()  
  for v in G.V:  # O(V), since no heapify
    Q.put(v)
  
  while not Q.empty():
    u = Q.get()
    for v in G.adj[u]:
      # Relaxation with PQ update
      if v.d > u.d + w(u, v):
        v.d = u.d + w(u, v)
        v.p = u
        Q.update(v)  # Decrease-key operation
    S.add(u) 

This algorithm has a time complexity of $O((V+E) \log V)$:

Q.get() called $O(V)$ times: $O(V \log V)$
Q.update() called $O(E)$ times: $O(E \log V)$

If given graph is dense, $O(E \log V)$.

See C++ implementation here.

Bellman-Ford Algorithm

While Dijkstra’s algorithm relaxes one edge with the smallest shortest path estimate at a time, Bellman-Ford relaxes all edges in each iteration.

Bellman-Ford is slower than Dijkstra’s algorithm.
Bellman-Ford is optimal even for negative edge weights.
You can mandate a certain edge to be selected by giving it a large negative weight $-\infty$.
Bellman-Ford can be parallelized.
- Per each vertex
Bellman-Ford can detect negative-weight cycles.

Psuedocode

def bellman_ford(G, s):
  init_single_source(G, s)  # O(V)

  for _ in range(G.V - 1):  # We will reason why this is enough
    for u, v in G.E:
      relax(u, v)

  # The above steps should have saturated the shortest paths
  # If not, there is a negative-weight cycle
  for u, v in G.E:
    if v.d > u.d + w(u, v):  # Further relaxation possible
      return False  # Negative-weight cycle

  return True

We relax $|E|$ edges $|V| - 1$ times, $O(|V| \cdot |E|)$.

Why $|V| - 1$ iterations?

How do we know that after $|V| - 1$ iterations, Bellman-Ford algorithm has found the shortest path?

Exploiting the Path-Relaxation Property

Bellman-Ford exploits the path-relaxation property above by relaxing all edges in each iteration.

Shortest path is acyclic and thus a tree of all vertices in the graph, which means it will contain $|V| - 1$ edges.

Every edge is relaxed at each iteration, and there are $|V| - 1$ iterations.

So Bellman-Ford covers every permutation of the relaxation orders. So it is guaranteed to find the shortest path.

See C++ implementation here.

DP Solution for Bellman-Ford

See here