Library · paper

Towards a Science of Scaling Agent Systems

Yubin Kim, Ken Gu, Chanwoo Park, Chunjong Park, Samuel Schmidgall, A. Ali Heydari, Yao Yan, Zhihan Zhang, Yuchen Zhuang, Yun Liu, Mark Malhotra, Paul Pu Liang, Hae Won Park, Yuzhe Yang, Xuhai Xu, Yilun Du, Shwetak Patel, Tim Althoff, Daniel McDuff & Xin Liu
2025·arXiv preprint (2512.08296)

Source: https://arxiv.org/abs/2512.08296

Full text: arXiv preprint

The first quantitative scaling principles for multi-agent AI systems, derived from 260 configurations across six benchmarks.

Three findings matter: independent agent swarms can amplify baseline errors up to 17 times; tool-heavy tasks suffer disproportionately from multi-agent overhead; and a capability saturation effect means adding agents only helps when single-agent accuracy is below roughly 45%.

The paper also distinguishes centralized from decentralized topologies — centralized orchestration contains errors better but at higher coordination cost.

For anyone designing agentic architectures, this is the paper that replaces intuition with measurement, turning the single-vs-multi question into an engineering tradeoff with quantifiable thresholds.