Raft-Style Election — Leader Election

Overview

What this concept solves

Raft-style election is the leader-election half of the Raft consensus algorithm — extracted here on its own because every modern replicated system uses some variant of it. The idea: each server runs a randomized election timer (commonly 150–300 ms). Whoever's timer fires first becomes a Candidate, advances to the next term, and asks every other server for a vote. Whoever wins a majority of votes is the new Leader for that term. Heartbeats from the Leader reset everyone else's timers, so as long as the Leader is alive, no election happens.

Three properties give Raft election its production-grade safety. Terms are a monotonically increasing integer; every message carries the term, and any server seeing a higher term immediately steps down. At most one leader per term falls out of "each server votes at most once per term and only with a quorum." Election restriction: a server only grants its vote to a candidate whose log is at least as up-to-date as its own — which is what guarantees committed entries survive across leader changes. The ID does not pick the winner; timing, term, and log freshness do.

Random timeouts are the trick that defeats split votes. If every server picked the same timeout, two would routinely fire together and split the vote in half forever. Randomising across a window means the first timer to expire usually wins outright; even when two do fire close together, the next round randomises them apart. This is how Raft elections converge in O(1) rounds in practice. The whole election protocol fits in two RPCs: RequestVote and the heartbeat-style AppendEntries.

Mechanics

How it works

Roles and the election timer

Follower — passive; expects heartbeats from the Leader. Maintains a randomized election timer (typical: 150–300 ms).
Candidate — a follower whose timer expired. Has just incremented its term and is asking for votes.
Leader — won a majority of votes; sends periodic heartbeats to reset all followers' timers and stay in power.
Term — a monotonically increasing integer attached to every message. Higher term always wins; servers seeing a stale term reject the message.

The protocol

Leader L sends heartbeats every ~50 ms. Each heartbeat resets every follower's election timer.
If a follower's timer fires (≈ Leader is gone), it transitions to Candidate, increments the term, votes for itself, and sends RequestVote(term, lastLogIndex, lastLogTerm) to every other server.
Each server votes at most once per term. It grants the vote only if (a) it hasn't voted this term, and (b) the candidate's log is at least as up-to-date as its own.
If the Candidate collects votes from a majority (e.g. 3 of 5), it becomes Leader and starts sending heartbeats — the election is over.
If two Candidates split the vote and neither wins, the term ends without a Leader. Followers' timers fire again with new random values; whoever fires first next round is much more likely to win cleanly.
If a Candidate receives a message with a higher term, it immediately steps down to Follower for that term — the future has already happened without it.

Majority is what kills split-brain

Even if the network partitions and both sides try to elect a Leader, only the side with a strict majority can succeed. The minority side runs Candidates that never collect enough votes; they remain stuck in the election loop until they can reach the majority again. That's why Raft is safe across partitions and Bully isn't — the quorum requirement makes "at most one leader per term" mechanical, not optimistic.

The up-to-date log rule

When choosing between two candidates, a follower picks the one whose log is at least as up-to-date as its own. The rule: "the candidate with the higher last-log-term wins; if tied, the longer log wins." This Election Restriction is what guarantees previously committed entries are never lost — only candidates whose log already contains every committed entry can win a majority. It's the safety lemma that makes the whole Raft state machine correct.

Interactive prototype

Run it. Break it. Tune it.

Sandboxed simulation embedded right in the page. No setup, no install.

simulation › Raft-Style Election

About this simulation

Five servers running Raft elections. Pick a scenario — Election timeout (the leader crashes, a follower's randomized timer fires, it wins a majority), Split vote (two candidates start in the same term, neither gets quorum, a new term begins with randomized backoff), or Stale leader returns (a partitioned old leader rejoins with an outdated term and steps down). Free play lets you crash any server and trigger elections; the log card below holds only the last two messages.

Hands-on

Try these on your own

Open the prototype above, run each experiment, predict the answer, then verify.

try 01

Walk Election timeout

Open Election timeout and step through. The Leader (S1) crashes; heartbeats stop. S2's randomized timer fires first, S2 becomes a Candidate, advances to term 2, votes for itself, and sends RequestVote to the rest. S3/S4/S5 grant their votes (no one else has voted this term, and the logs match). S2 reaches 3 of 5, becomes Leader, and starts sending heartbeats. Notice that the ID didn't pick the winner — the timer did.

try 02

Walk Split vote

Run Split vote. The Leader crashes and S2 and S4 happen to time out at almost the same instant. Both become Candidates for term 2. S3 votes for S2; S5 votes for S4. Each Candidate has 2 votes — less than the majority of 3. Term 2 ends without a Leader. New random timers fire in term 3; this time only one Candidate appears and wins cleanly. The random timeout window is the entire mechanism that breaks the deadlock.

try 03

Walk Stale leader returns

Run Stale leader returns. The Leader S1 is partitioned away in term 1. The majority side elects S2 in term 2. The partition heals. S1 tries to send heartbeats with term 1; every server replies "I have seen term 2" and S1 immediately steps down to Follower for term 2. The monotonic term is the fence token: an old leader cannot quietly resume control. This is the safety property Bully and Ring lack.

try 04

Free play — break it yourself

Open Free play. Trigger elections manually, crash any server, and try to engineer a partition where you'd expect two leaders to appear. They won't — because the minority side cannot reach a quorum, the Candidate on the minority side just keeps re-trying without success. Try splitting 2 vs 3 nodes: the 3-node side elects, the 2-node side spins. That mechanical safety is the whole reason Raft displaced Bully and Ring for serious replicated systems.

In practice

When to use it — and what you give up

When it's the right tool

Replicated state machine over an unreliable network — Raft election + log replication is the default in 2026 (etcd, Consul, CockroachDB, TiKV, Kafka KRaft).
You need at-most-one leader per term, even across partitions — the quorum rule makes split-brain impossible by construction.
You want a fence token for safe writes — every Raft write carries the term; followers reject writes from stale terms automatically.
Team needs to onboard quickly — Raft's three-state diagram and two RPCs are explicitly engineered for understandability.

When to reach for something else

Singleton job on top of an existing strongly-consistent store — use lease-based election against etcd/Kubernetes instead of running your own Raft cluster.
Very small static cluster on a trusted LAN — Bully or Ring may be enough, with much less code.
Byzantine fault model — Raft assumes crash failures only. Use PBFT or HotStuff if nodes might actively lie.
You need leaderless writes — Raft funnels everything through the leader. EPaxos, Mencius, or Generalized Paxos remove that bottleneck.

Pros

Provably at-most-one leader per term — the quorum rule plus monotonic terms eliminate split-brain.
Fast failover — bounded by the election timeout (typically 100–500 ms).
Built-in fence token — the term is the epoch; old leaders' writes are automatically rejected.
Random timeouts solve split votes — converges in O(1) rounds in practice.
Mature library ecosystem — etcd/raft, hashicorp/raft, Apache Ratis, TiKV, OpenRaft, and dozens more.

Cons

Brief unavailability on leader failure — the cluster cannot accept writes during the election window.
Leader is a write bottleneck — every write funnels through it; throughput caps at one node's resources.
Higher message complexity than lease-based on a strongly-consistent store — needs O(n) RequestVotes per election even when an external lock would do.
Quorum requirement — a cluster of 5 tolerates 2 failures; 3 tolerates 1. You pay even-number nodes for nothing.
Hard to get right from scratch — Raft is the most understandable consensus algorithm, but "understandable" is not the same as "easy to implement." Use a library.

Reference

Code & further reading

A minimal reference implementation and pointers worth bookmarking.

raft_election.go

// Raft-style leader election. Two RPCs: RequestVote (election) and AppendEntries (heartbeat).
package election

import (
	"math/rand"
	"time"
)

type Role int

const (
	Follower Role = iota
	Candidate
	Leader
)

type ServerID = int

type LogEntry struct {
	Term int
	Cmd  string
}

type VoteReply struct {
	Term    int
	Granted bool
}

type RaftElectionServer struct {
	ID          ServerID
	Peers       []ServerID
	CurrentTerm int
	VotedFor    *ServerID
	role        Role
	Log         []LogEntry

	rpc func(to ServerID, kind string, args any) VoteReply
	now func() int64 // unix millis

	electionDeadline int64
}

// ResetElectionTimer sets a randomized 150-300ms timer; reset on heartbeat.
func (s *RaftElectionServer) ResetElectionTimer() {
	s.electionDeadline = s.now() + 150 + rand.Int63n(150)
}

// OnAppendEntries is called by AppendEntries (heartbeat) from current leader.
func (s *RaftElectionServer) OnAppendEntries(term int, leaderID ServerID) (int, bool) {
	if term < s.CurrentTerm {
		return s.CurrentTerm, false
	}
	if term > s.CurrentTerm {
		s.CurrentTerm = term
		s.VotedFor = nil
	}
	s.role = Follower
	s.ResetElectionTimer()
	return s.CurrentTerm, true
}

// StartElection is called when our election timer fires.
func (s *RaftElectionServer) StartElection() {
	s.role = Candidate
	s.CurrentTerm++
	s.VotedFor = &s.ID
	votes := 1 // myself
	myLastIdx := len(s.Log) - 1
	myLastTerm := 0
	if len(s.Log) > 0 {
		myLastTerm = s.Log[len(s.Log)-1].Term
	}

	for _, p := range s.Peers {
		r := s.rpc(p, "RequestVote", map[string]int{
			"term": s.CurrentTerm, "candidateId": s.ID,
			"lastLogIndex": myLastIdx, "lastLogTerm": myLastTerm,
		})
		if r.Term > s.CurrentTerm {
			s.CurrentTerm = r.Term
			s.role = Follower
			return
		}
		if r.Granted {
			votes++
		}
	}

	if votes >= len(s.Peers)/2+2 { // majority of (peers + self)
		s.role = Leader
		// start sending heartbeats
	} else {
		// split vote or lost - wait for next randomized timeout
		s.role = Follower
		s.ResetElectionTimer()
	}
}

// OnRequestVote is called when a peer sends us RequestVote.
func (s *RaftElectionServer) OnRequestVote(
	term, candidateID, lastLogIndex, lastLogTerm int,
) VoteReply {
	if term > s.CurrentTerm {
		s.CurrentTerm = term
		s.VotedFor = nil
		s.role = Follower
	}
	myLastIdx := len(s.Log) - 1
	myLastTerm := 0
	if len(s.Log) > 0 {
		myLastTerm = s.Log[len(s.Log)-1].Term
	}
	upToDate := lastLogTerm > myLastTerm ||
		(lastLogTerm == myLastTerm && lastLogIndex >= myLastIdx)
	free := s.VotedFor == nil || *s.VotedFor == candidateID
	grant := term == s.CurrentTerm && free && upToDate
	if grant {
		s.VotedFor = &candidateID
		s.ResetElectionTimer()
	}
	return VoteReply{Term: s.CurrentTerm, Granted: grant}
}

var _ = time.Now

References & further reading

7 sources

Knowledge check

Did the prototype land?

Quick questions, answers revealed on submit. Sign in to save your best score.

question 01 / 03

What triggers a follower to become a candidate?

question 02 / 03

Two candidates start an election in the same term, and neither collects a majority. What happens next?

question 03 / 03

Why does Raft election survive network partitions safely?

0/3 answered