Bully Algorithm — Leader Election

Overview

What this concept solves

The Bully algorithm is the textbook starting point for leader election. Published by Hector Garcia-Molina in 1982, it makes one ruthlessly simple promise: the process with the highest ID is always the leader. When the leader fails, any process that notices can start an election — and the algorithm guarantees that the election always converges on the highest-ID survivor, however many crashes have happened along the way.

It works on a fully connected network where every process knows every other process's ID. The dynamic is exactly what the name suggests: a low-ID process that starts an election gets bullied out by every higher-ID process that's still alive — they all reply OK and take over the election themselves. The highest-ID alive process eventually wins by default: nobody bullies it back. It then broadcasts COORDINATOR to everyone and the cluster goes back to work.

Bully is mostly of historical and pedagogical importance now. Real systems use Raft, ZAB, or lease-based election because Bully's all-to-all messaging is O(n²) at worst, it assumes a fully connected network, and the textbook version offers no guarantee against split-brain during a partition. But it captures the essence of leader election in two pages — the right place to start before the modern machinery makes sense.

Mechanics

How it works

The protocol

Every process has a known unique ID. The process with the highest ID is by definition the leader.
Each process periodically expects to hear from the current leader (a heartbeat or successful RPC). If the leader goes silent past a timeout, the process suspects failure and starts an election.
Election: the suspecting process P sends an ELECTION message to every process with an ID higher than its own.
If no higher-ID process answers within a timeout, P wins. It broadcasts COORDINATOR(P) to every process — done.
If any higher-ID process Q answers OK, P drops out of contention (it has been bullied). Each Q now starts its own election, repeating step 3.
Eventually the highest-alive ID has no higher-ID responder, declares itself leader, and broadcasts COORDINATOR.

Message complexity

Best case — the highest-alive ID notices first. One round of ELECTIONs (none answered) + one COORDINATOR broadcast = O(n) messages.
Worst case — the lowest-alive ID notices first. Each higher ID then runs its own election, cascading up to the top. The textbook worst case is O(n²) messages.
Stable case — once a leader is elected, the only traffic is the periodic heartbeat / RPC checks, plus occasional election noise. O(1) per heartbeat interval.

Two leaders during a partition

If the network partitions so that, say, P5 (the leader) is on one side and {P1..P4} on the other, the lower side will run an election and crown P4 — because they cannot hear P5. Both sides now believe they have a leader. The textbook Bully algorithm has no quorum and no fence token to prevent this. Production usage either layers a quorum on top, or — much more commonly — picks a different algorithm. Bully on its own is not safe against partitions.

Why "highest ID"?

Any total order over processes works — Bully would still be correct if the rule were "lowest ID wins" or "longest hostname wins." Highest ID is just the convention. What matters is that the order is total and known: every process agrees on who is bigger than whom, so the algorithm converges on a single winner.

Interactive prototype

Run it. Break it. Tune it.

Sandboxed simulation embedded right in the page. No setup, no install.

simulation › Bully Algorithm

About this simulation

Five processes with IDs P1..P5. Pick a scenario — Leader dies (P3 notices, runs an election, gets bullied by a higher ID), Cascade failure (two leaders die in a row, the next-highest survives), or Old leader returns (the failed top-ID rejoins and immediately re-bullies). Free play lets you crash any node and trigger your own elections; the log card below holds only the last two messages.

Hands-on

Try these on your own

Open the prototype above, run each experiment, predict the answer, then verify.

try 01

Walk Leader dies

Open Leader dies and step through. P5 (the leader, ID 5) crashes. P3 — a mid-ID process — notices the leader's silence and starts an election. It sends ELECTION to P4 and P5, only P4 answers OK, P3 backs off, P4 then runs its own election, sees no higher alive ID, and broadcasts COORDINATOR. Notice the cascade — that's the textbook O(n²) shape.

try 02

Walk Cascade failure

Run Cascade failure. P5 and P4 both crash; only P1, P2, P3 are alive. P1 starts an election, gets bullied by P3 (P4 and P5 don't answer), and P3 wins by default. Watch how the algorithm always converges on the highest-alive ID, no matter how many nodes died.

try 03

Walk Old leader returns

Run Old leader returns. P5 dies and P4 becomes leader; then P5 restarts. As soon as P5 is back, it immediately starts an election and re-takes the crown — because the rule is highest ID always wins. This is the famous "bully" dynamic: a higher-ID rejoiner can disrupt a working cluster just by waking up.

try 04

Free play — break it yourself

Open Free play. Kill any node by clicking it. Trigger an election from any process you like. Try killing the leader and a higher ID simultaneously. Notice that the algorithm always converges on the same winner — the highest alive ID — regardless of who started the election or in what order failures occurred. That deterministic outcome is the appeal of Bully.

In practice

When to use it — and what you give up

When it's the right tool

Teaching leader election — the algorithm is two pages, the dynamic is intuitive, and it's the cleanest introduction to the problem.
Very small static clusters (3–10 nodes) on a reliable fully-connected network, where simplicity beats safety guarantees you don't need.
Embedded systems / routing — IS-IS DIS election and OSPF DR election use Bully-style "highest priority wins" inside a single broadcast segment.
Heuristics inside a larger protocol — Bully sometimes appears as a tie-break inside a richer election (e.g. "if tied on log length, highest server ID wins").

When to reach for something else

Replicated state machine over a real network — use Raft. You need quorum-based safety, monotonic terms, and crash-recovery semantics Bully cannot give you.
Network can partition — Bully has no split-brain defence. Use any quorum-based algorithm (Raft, Paxos, ZAB) or a lease against an external store.
Large clusters (hundreds of nodes) — the O(n²) worst case bites hard. Lease-based or ZooKeeper's ephemeral-znode scheme scale far better.
You need a fence token for safe writes after election — Bully's COORDINATOR carries no monotonic epoch. Use Raft's term, ZooKeeper's zxid, or a lease revision.

Pros

Two-page algorithm — easy to teach, easy to implement, easy to debug.
Deterministic outcome — given the same set of alive processes, the same winner is always picked (the highest ID).
No external dependency — needs no shared store, no coordinator, no consensus library. Just point-to-point messaging.
Fast in the common case — when the highest-alive ID notices first, the election finishes in one round.

Cons

O(n²) worst-case message complexity — every node may run its own election in cascade.
Assumes a fully connected network — every process must be able to address every other. Doesn't fit ring or partial-mesh topologies.
No partition safety — separated subnetworks happily elect their own leaders. Split-brain by construction.
No fence token — a paused-then-resumed old leader can write to backing stores after the new one is elected, corrupting state.
Liveness during turbulence is poor — frequent failures cause repeated elections, each O(n²); the cluster can spend more time electing than working.

Reference

Code & further reading

A minimal reference implementation and pointers worth bookmarking.

bully_algorithm.go

// Bully algorithm. Each process can be a candidate. Highest ID always wins.
package election

import (
	"sync"
	"time"
)

const electionTimeout = 500 * time.Millisecond

type ProcID = int

type Msg struct {
	Type string // "election" | "ok" | "coord"
	From ProcID
	To   ProcID
}

type BullyProcess struct {
	ID    ProcID
	Peers []ProcID
	Send  func(m Msg)
	Alive func(id ProcID) bool

	mu     sync.Mutex
	leader ProcID // -1 means none
	busy   bool
}

// StartElection is called when this process suspects the leader is dead.
func (p *BullyProcess) StartElection() {
	p.mu.Lock()
	if p.busy {
		p.mu.Unlock()
		return
	}
	p.busy = true
	p.mu.Unlock()
	defer func() {
		p.mu.Lock()
		p.busy = false
		p.mu.Unlock()
	}()

	var higher []ProcID
	for _, peer := range p.Peers {
		if peer > p.ID {
			higher = append(higher, peer)
		}
	}
	if len(higher) == 0 {
		p.declareSelfLeader()
		return
	}

	// Race the asks against the election timeout.
	acks := make(chan []ProcID, 1)
	go func() { acks <- p.askHigher(higher) }()

	select {
	case answers := <-acks:
		if len(answers) == 0 {
			// No higher-ID process answered: I am the new leader.
			p.declareSelfLeader()
		}
		// Otherwise: some higher-ID process took over. Wait for its COORDINATOR.
	case <-time.After(electionTimeout):
		// Timed out without an OK: I am the new leader.
		p.declareSelfLeader()
	}
}

func (p *BullyProcess) askHigher(higher []ProcID) []ProcID {
	var (
		mu      sync.Mutex
		wg      sync.WaitGroup
		answers []ProcID
	)
	for _, to := range higher {
		wg.Add(1)
		go func(to ProcID) {
			defer wg.Done()
			p.Send(Msg{Type: "election", From: p.ID, To: to})
			if p.Alive(to) { // they will reply OK + start their own
				mu.Lock()
				answers = append(answers, to)
				mu.Unlock()
			}
		}(to)
	}
	wg.Wait()
	return answers
}

func (p *BullyProcess) OnMessage(m Msg) {
	if m.Type == "election" && m.From < p.ID {
		p.Send(Msg{Type: "ok", From: p.ID, To: m.From})
		go p.StartElection() // I'm higher -> take over the election
	}
	if m.Type == "coord" {
		p.mu.Lock()
		p.leader = m.From
		p.mu.Unlock()
	}
}

func (p *BullyProcess) declareSelfLeader() {
	p.mu.Lock()
	p.leader = p.ID
	p.mu.Unlock()
	for _, peer := range p.Peers {
		if peer != p.ID {
			p.Send(Msg{Type: "coord", From: p.ID, To: peer})
		}
	}
}

References & further reading

6 sources

Knowledge check

Did the prototype land?

Quick questions, answers revealed on submit. Sign in to save your best score.

question 01 / 03

In the Bully algorithm, which process becomes leader after an election?

question 02 / 03

What is the worst-case message complexity of the Bully algorithm?

question 03 / 03

Why is Bully alone usually inadequate for a production replicated database?

0/3 answered