Insertion Sort — Sorting

Overview

What this concept solves

Insertion sort is how you sort a hand of playing cards. You pick up the next card, slide it left past every bigger card you've already arranged, and drop it into its place. The sorted region grows one card at a time, always from the left. You never touch the unsorted side until you commit to lifting its next card.

That single mental model gives you four guarantees that bubble and selection sort don't have together. It's stable (equal cards keep their order — you only shift left past strictly bigger ones). It's in-place (no extra memory). It's adaptive (already-sorted runs cost almost nothing). And it's online — you can feed it one element at a time and the sorted region is always available.

On random data, insertion sort is still O(n²) — every new card has to slide past about n/2 existing ones on average. But on nearly-sorted data, where each card only shifts a few slots, it's effectively O(n + d) where d is the number of out-of-place pairs. That's exactly why it's the inner loop of Timsort and the small-array base case of std::sort and qsort. For n under ~16, it beats every theoretically faster algorithm.

Mechanics

How it works

One insertion

Position 0 is trivially sorted (a single element).
Lift key = a[i] — the next unsorted value (purple in the prototype).
Look left at a[i-1]. If it's bigger than key, shift it right by one (a[j+1] = a[j]). Step j left.
Repeat until you find a smaller-or-equal element, or you fall off the left edge.
Drop key into the gap at j+1. Sorted region now extends to index i.

Across the list

Iteration i = 1 inserts a[1] into the size-1 sorted prefix.
Iteration i = 2 inserts a[2] into the size-2 sorted prefix.
After iteration n-1, the entire list is sorted.
Total comparisons & shifts: equal to the number of inversions in the input. Best case (already sorted): n-1 comparisons, zero shifts.

Insertion sort = the size of the inversion count

An inversion is a pair (i, j) with i < j but a[i] > a[j]. Insertion sort's work is exactly proportional to the number of inversions — each shift fixes exactly one. A list with 5 inversions takes 5 shifts. A reverse-sorted list of n has n(n-1)/2 inversions. A sorted list has zero. This is why insertion sort dominates whenever data is nearly sorted: shifts ≈ inversions, and nearly-sorted means few inversions.

Why every fast sort uses insertion as a base case

For n < ~16, the constant-factor wins of insertion sort (no recursion overhead, perfect cache locality, branch-predictor friendly) beat the asymptotic wins of merge sort and quicksort. That's why Timsort, std::sort, and Java's Arrays.sort all switch to insertion sort when their recursion reaches small chunks.

Interactive prototype

Run it. Break it. Tune it.

Sandboxed simulation embedded right in the page. No setup, no install.

simulation › Insertion Sort

About this simulation

Each value is lifted out (purple bar) and slid into its place inside the sorted left side, shifting larger values one slot to the right. Press Play for one full sort, Step / Back to follow each shift, Try sorted to see the best-case O(n), Shuffle to reroll.

Hands-on

Try these on your own

Open the prototype above, run each experiment, predict the answer, then verify.

try 01

Play one full sort

Press Play. Watch the purple bar — that's the key being lifted out. Watch the red bars — those are the values shifting one slot right to make room. Notice how the green sorted region grows from the left, one element per outer iteration.

try 02

Step into a single insertion

Use Step until you see a purple key. Then step one at a time and count the shifts. Each red flash is one shift right. The key drops back into the gap once a smaller-or-equal element blocks it (or the left edge does).

try 03

Try sorted — see the O(n) best case

Press Try sorted. Step through. Each iteration lifts a key, compares once against its left neighbour, finds it's already in order, and drops the key right back. Zero shifts, n-1 comparisons total. That's the best case the trade-offs table promises.

try 04

Shuffle and watch shifts vs comparisons

Press Shuffle. Step through a random list. The Comparisons counter goes up on every step inside the while loop; the Shifts counter only goes up when an element actually moves. On a random list the two are nearly equal — every comparison usually shifts. On a nearly-sorted list, comparisons climb while shifts stay near zero.

In practice

When to use it — and what you give up

When it's the right tool

Small lists (n < ~16) — fastest comparison sort on tiny inputs. Every serious sort() includes it for this case.
Nearly-sorted data — streaming data with occasional out-of-order elements, files that are mostly already sorted, log compactions.
Online / streaming inputs — you can feed one element at a time and always have a sorted view of what arrived so far.
As a sub-routine — Timsort uses it to extend short runs; introsort uses it for tiny recursion leaves.

When to reach for something else

Large random data — O(n²) is unacceptable past a few thousand elements; reach for merge or quick.
You need worst-case O(n log n) — reverse-sorted input is insertion's worst case (full n²/2 shifts).
You can use a non-comparison sort — counting/radix beat insertion on integer keys at scale.

Pros

Adaptive — O(n) on already-sorted data — best case is one pass with n-1 comparisons and zero shifts.
Stable — only shifts past strictly-larger elements, so equals retain their order.
In-place — O(1) extra memory.
Online — sorted region is always available; you can feed elements one at a time.
Tight inner loop — beats theoretically-faster sorts on tiny inputs because of cache behaviour and branch prediction.

Cons

O(n²) on random and reverse-sorted data — unusable for n above a few thousand.
Lots of writes — every shift is an array assignment; selection sort makes far fewer.
Recursive sorts win on large data — merge and quick both dominate from n ≈ 50 upward.

Reference

Code & further reading

A minimal reference implementation and pointers worth bookmarking.

insertion-sort.ts

// Insertion sort — adaptive, stable, in-place.
// Best case O(n) on sorted input; worst case O(n²) on reverse-sorted.
function insertionSort<T>(a: T[]): T[] {
  for (let i = 1; i < a.length; i++) {
    const key = a[i];
    let j = i - 1;
    // Shift larger elements one slot right.
    while (j >= 0 && a[j] > key) {
      a[j + 1] = a[j];
      j--;
    }
    // Drop the key into the gap.
    a[j + 1] = key;
  }
  return a;
}

// Example
insertionSort([5, 2, 4, 6, 1, 3]);
// → [1, 2, 3, 4, 5, 6]
// On nearly-sorted input, almost every iteration takes O(1).

References & further reading

6 sources

Knowledge check

Did the prototype land?

Quick questions, answers revealed on submit. Sign in to save your best score.

question 01 / 03

Insertion sort's running time is proportional to which of the following?

question 02 / 03

Why does Timsort use insertion sort to extend short runs instead of just merging tiny runs directly?

question 03 / 03

What's the relationship between insertion sort and selection sort?

0/3 answered