teleo-codex/inbox/archive/2018-00-00-siam-economies-of-scale-halfin-whitt-regime.md
Teleo Agents bf4858d0f7 rio: research pipeline scaling disciplines — 15 sources archived
- What: operations research, queueing theory, stochastic modeling for pipeline architecture
- Why: Leo/Cory brief — need disciplined approach to variable-load scaling

Pentagon-Agent: Rio <2EA8DBCB-A29B-43E8-B726-45E571A1F3C8>
2026-03-12 00:29:39 +00:00

1.8 KiB

type title author url date domain format status tags
source Economies-of-Scale in Many-Server Queueing Systems: Tutorial and Partial Review of the QED Halfin-Whitt Heavy-Traffic Regime Johan van Leeuwaarden, Britt Mathijsen, Jaron Sanders (SIAM Review) https://epubs.siam.org/doi/10.1137/17M1133944 2018-01-01 internet-finance paper unprocessed
pipeline-architecture
operations-research
queueing-theory
Halfin-Whitt
economies-of-scale
square-root-staffing

Economies-of-Scale in Many-Server Queueing Systems

SIAM Review tutorial on the QED (Quality-and-Efficiency-Driven) Halfin-Whitt heavy-traffic regime — the mathematical foundation for understanding when and how multi-server systems achieve economies of scale.

Key Content

  • The QED regime: operate near full utilization while keeping delays manageable
  • As server count n grows, utilization approaches 1 at rate Θ(1/√n) — the "square root staffing" principle
  • Economies of scale: larger systems need proportionally fewer excess servers for the same service quality
  • The regime applies to systems ranging from tens to thousands of servers
  • Square-root safety staffing works empirically even for moderate-sized systems (5-20 servers)
  • Tutorial connects abstract queueing theory to practical staffing decisions

Relevance to Teleo Pipeline

At our scale (5-6 workers), we're in the "moderate system" range where square-root staffing still provides useful guidance. The key takeaway: we don't need sophisticated algorithms for a system this small. Simple threshold policies informed by queueing theory will capture most of the benefit. The economies-of-scale result also tells us that if we grow to 20+ workers, the marginal value of each additional worker decreases — important for cost optimization.