Intermediate

Cache Eviction

When the cache fills up, something has to go — and which one you pick decides your hit rate. Ten classic policies, side-by-side.

performancememorypatterns

Start with LFU — Least Frequently Used

What is Cache Eviction?

The 60-second primer

Cache eviction is the policy a cache uses to decide what to throw out when it runs out of room. A cache is a small, fast store sitting in front of something larger and slower. Once it's full, every new entry has to push an old one out — and which one you push out is what makes or breaks the hit rate.

The goal is simple to state: keep the items most likely to be requested next, evict the rest. The hard part is predicting the future from the access pattern you've seen so far. Each algorithm is a different bet about what the past tells you about the future.

There are ten classic policies worth knowing. Some lean on recency (LRU, MRU, FIFO, Clock). Some lean on frequency (LFU). Some try to balance both (ARC, 2Q, LIRS). And a few intentionally ignore the access pattern (Random, TTL). Pick the wrong one and you spend most of your time fetching from the slow store — you have a cache in name only.

Why the policy choice matters

Hit rate is everything — a cache that hits 95% of the time is twenty times better than one that hits 50%. The eviction policy is the lever that moves that number.
Caches are finite by design — RAM costs money, CPU caches are tiny, CDN edge nodes have hard disk caps. You will run out of room; the question is what you keep.
Wrong policy = thrashing — if your policy evicts the hot keys, every request becomes a miss and the cache turns into pure overhead.
Workload-dependent — there's no universal winner. A policy that crushes web traffic can lose to FIFO on a database scan workload.

Where eviction lives

Operating systems use it for page caches (Linux uses a Clock-variant). Databases use it for buffer pools (Postgres uses Clock-Sweep, MySQL InnoDB uses a 2Q-like LRU). CDNs use it at every edge. Your CPU's L1/L2 caches use pseudo-LRU in hardware. Redis lets you pick the policy at runtime.

Side-by-side

How they compare

The same concepts, on the same axes. Use this as a map; the individual pages are the territory.

Algorithm	Signal used	Hit rate	Overhead	Best for
01LFU	Frequency	Great for stable hotspots	`O(1) — freq buckets`	Long-tailed, skewed traffic
02LRU	Recency	Good on most workloads	`O(1) — HashMap + DLL`	General-purpose default
03MRU	Anti-recency	Crushes LRU on scans/loops	`O(1) — HashMap + DLL`	Sequential scans, looping workloads
04FIFO	Insertion order	Mediocre — ignores re-use	`O(1) — single queue`	Streaming, one-shot reads
05Random	None	Surprisingly close to LRU	`O(1)`	Embedded, hot loops, sampling
06TTL	Age	Depends on TTL choice	`O(1) — expiry per key`	Time-bounded data (DNS, sessions)
07Clock	Recency (approx)	≈ LRU, cheaper	`O(N) — 1 bit/entry`	OS page caches, hot paths
082Q	Recency, two-tier	Better than LRU on scans	`O(N) — two queues`	Buffer pools that face scans
09ARC	Recency + frequency	Excellent, self-tuning	`O(N) — 4 lists`	Mixed workloads (ZFS, Postgres-adj)
10LIRS	Inter-reference recency	Beats LRU on weak-locality	`O(N) — stack + queue`	DB buffer pools, MySQL-like

01LFU

Signal used: Frequency
Hit rate: Great for stable hotspots
Overhead: O(1) — freq buckets
Best for: Long-tailed, skewed traffic

02LRU

Signal used: Recency
Hit rate: Good on most workloads
Overhead: O(1) — HashMap + DLL
Best for: General-purpose default

03MRU

Signal used: Anti-recency
Hit rate: Crushes LRU on scans/loops
Overhead: O(1) — HashMap + DLL
Best for: Sequential scans, looping workloads

04FIFO

Signal used: Insertion order
Hit rate: Mediocre — ignores re-use
Overhead: O(1) — single queue
Best for: Streaming, one-shot reads

05Random

Signal used: None
Hit rate: Surprisingly close to LRU
Overhead: O(1)
Best for: Embedded, hot loops, sampling

06TTL

Signal used: Age
Hit rate: Depends on TTL choice
Overhead: O(1) — expiry per key
Best for: Time-bounded data (DNS, sessions)

07Clock

Signal used: Recency (approx)
Hit rate: ≈ LRU, cheaper
Overhead: O(N) — 1 bit/entry
Best for: OS page caches, hot paths

082Q

Signal used: Recency, two-tier
Hit rate: Better than LRU on scans
Overhead: O(N) — two queues
Best for: Buffer pools that face scans

09ARC

Signal used: Recency + frequency
Hit rate: Excellent, self-tuning
Overhead: O(N) — 4 lists
Best for: Mixed workloads (ZFS, Postgres-adj)

10LIRS

Signal used: Inter-reference recency
Hit rate: Beats LRU on weak-locality
Overhead: O(N) — stack + queue
Best for: DB buffer pools, MySQL-like

Decision guide

Which one should you use?

A practical tour of when each algorithm wins.

Decision guide

You don't know your workload yet → LRU. Boring, well-understood, hard to beat without measurement.
A small set of keys is asked for much more often than the rest → LFU. Frequency dominates recency for skewed traffic.
Sequential scans or loops larger than the cache → MRU. The counter-intuitive opposite of LRU — drop what you just touched, because you won't come back to it for a while.
One-shot reads, streams, append-only logs → FIFO. Re-use doesn't happen, so don't pay to track it.
You want LRU but in the OS kernel or a hot lock-free path → Clock. One reference bit per page, no list shuffling on hits.
Big scans are wrecking your buffer pool's hit rate → 2Q or LIRS. Both protect frequently-used pages from being flushed by one-shot scans.
Mixed workload, you'd rather not tune → ARC. Adapts the recency/frequency balance on the fly.
You need rock-bottom overhead and can tolerate noisy evictions → Random. Try it before assuming it's bad.
Data has a natural lifetime (DNS records, session cookies, OTP codes) → TTL. Eviction by clock, not by capacity.

When in doubt, start with LRU and measure

LRU is the default in nearly every cache library (Caffeine, Guava, Redis with allkeys-lru) for good reason. Get a baseline hit rate, then only swap in something fancier if the data shows you should.

Concepts in this track

10 concepts, in order

Each links to a concept page with its own explanation, prototype, and quiz.

LFU — Least Frequently Used

Evict the entry with the fewest accesses. Wins on skewed, long-tailed traffic.

Intermediate9 mintry it

LRU — Least Recently Used

Evict the entry that was accessed the longest time ago. The default in nearly every cache.

Beginner8 mintry it

MRU — Most Recently Used

The opposite of LRU. Throws away what you just touched — perfect for loops bigger than the cache.

Intermediate7 mintry it

FIFO — First In, First Out

Evict by insertion age. Simple, predictable, and surprisingly useful for streams.

Beginner6 mintry it

Random

Throw out a random entry. Embarrassingly close to LRU on real workloads.

Beginner6 mintry it

TTL — Time To Live

Eviction by clock, not by capacity. The right tool for data with an intrinsic lifetime.

Beginner7 mintry it

Clock — Second Chance

An approximate LRU with one bit per entry. Cheap enough to run in a kernel.

Intermediate9 mintry it

2Q — Two Queue

A small FIFO probation queue protects an LRU main queue from being flushed by scans.

Intermediate10 mintry it

ARC — Adaptive Replacement Cache

Self-tunes between recency and frequency using 'ghost' lists of recently-evicted keys.

Advanced12 mintry it

LIRS — Low Inter-reference Recency Set

Classify keys by the gap between accesses. Top-tier hit rate; tricky to implement.

Advanced12 mintry it

Related tracks

If this one clicks, try these next.

Cache Write Policies

Three ways to handle a write when you have a cache in front of the store. Each policy is a different bet about durability, throughput, and how stale your data is allowed to get.

Intermediate3 concepts · 25 min

performancememorypatterns

Memory Allocation

Before garbage collection ever runs, something has to hand out the memory. Six allocators — four ways to pick a hole, plus the two structured schemes real kernels actually ship.

Intermediate6 concepts · 70 min

memoryruntimeperformance

Garbage Collection

How a runtime reclaims memory you stopped using — without you ever calling free(). Eight algorithms, from the counter on every object to the collectors that run alongside your program.

Advanced8 concepts · 90 min

memoryruntimeperformance