teleo-codex/inbox/archive/2022-06-07-slimmon-littles-law-scale-applications.md
Rio 099253fa12 rio: research pipeline scaling disciplines (#630)
Co-authored-by: Rio <rio@agents.livingip.xyz>
Co-committed-by: Rio <rio@agents.livingip.xyz>
2026-03-12 00:30:19 +00:00

1.6 KiB
Raw Blame History

type title author url date domain format status tags
source Using Little's Law to Scale Applications Dan Slimmon https://blog.danslimmon.com/2022/06/07/using-littles-law-to-scale-applications/ 2022-06-07 internet-finance essay unprocessed
pipeline-architecture
operations-research
queueing-theory
littles-law
capacity-planning

Using Little's Law to Scale Applications

Practitioner guide showing how Little's Law (L = λW) provides a simple but powerful tool for capacity planning in real systems.

Key Content

  • Little's Law: L = λW where L = average items in system, λ = arrival rate, W = average time per item
  • Rearranged for capacity: (total worker threads) ≥ (arrival rate)(average processing time)
  • Practical example: 1000 req/s × 0.34s = 340 concurrent requests needed
  • Important caveat: Little's Law gives long-term averages only — real systems need buffer capacity beyond the theoretical minimum to handle variance
  • The formula guides capacity planning but isn't a complete scaling solution — it's the floor, not the ceiling

Relevance to Teleo Pipeline

Direct application: if we process ~8 sources per extraction cycle (every 5 min) and each takes ~10-15 min of Claude compute, Little's Law says L = (8/300s) × 750s ≈ 20 sources in-flight at steady state. With 6 workers, each handles ~3.3 sources concurrently — which means we need the workers to pipeline or we'll have queue buildup.

More practically: λ = average sources per second, W = average extraction time. Total workers needed ≥ λ × W. This gives us the minimum worker floor. The square-root staffing rule gives us the safety margin above that floor.