Peak EWMA load balancer simulator with a live chart of each server's exponentially weighted moving average of response time. The load balancer picks the server with the lowest score, computed as the EWMA times active-connections-plus-one. An alpha slider controls how quickly the average reacts to new measurements.
EWMA per server over time — what the algorithm sees when deciding
Server 1Server 2Server 3Server 4
Pick score = EWMA × (active + 1) — lowest wins
S11.50s*×(0+1)=1.50
S21.50s*×(0+1)=1.50
S31.50s*×(0+1)=1.50
S41.50s*×(0+1)=1.50
* default EWMA used until the server's first request completes
S1 latency
×1
S2 latency
×1
S3 latency
×3
S4 latency
×1
α (EWMA smoothing factor)0.30
low α → smooth EWMA, slow to react · high α → reactive but jumpy