Question 1

When should I use multiple agents instead of a single agent?

Accepted Answer

Kim et al. identify a capability saturation threshold: multi-agent systems only outperform single agents when baseline accuracy is below ~45%. Above that threshold, adding agents introduces coordination overhead without meaningful improvement. This gives you a measurable decision point: run a single-agent baseline first, and only invest in multi-agent orchestration if performance falls below this threshold.

Question 2

Why do error rates sometimes explode in multi-agent systems?

Accepted Answer

Independent agent swarms can amplify baseline errors up to 17 times due to compounding failures across agents. The paper distinguishes centralized vs. decentralized topologies: centralized orchestration mitigates this error cascade but requires higher coordination cost, while decentralized systems are cheaper to run but more vulnerable to error propagation.

Question 3

How do tool integrations affect multi-agent scaling?

Accepted Answer

Tool-heavy tasks suffer disproportionately from multi-agent overhead, meaning each additional agent adds more friction relative to the performance gain. If your product requires agents to coordinate around external tools (APIs, databases, etc.), the scaling calculus shifts unfavorably—you'll need stronger justification for multi-agent architectures than you would for purely reasoning-based tasks.

Towards a Science of Scaling Agent Systems

Central argument

Critique

Why it matters for product