A Google and MIT study identifies problems with multi-agent systems

By EngineAI Team | Published on December 19, 2025 | Updated on December 19, 2025
A Google and MIT study identifies problems with multi-agent systems
In a recent study, Google and MIT researchers examined if adding more AI agents to challenges would enhance outcomes. They discovered that task structure had a significant impact on performance.

The specifics:

Using the identical prompts and token budgets, the team conducted 180 experiments on models from OpenAI, Google, and Anthropic.

While Minecraft jobs requiring step-by-step effort deteriorated by up to 70%, financial analysis tasks divided across agents witnessed an 81% improvement.

Adding more agents usually resulted in inferior performance when a single agent had already achieved 45% accuracy on a task, with numerous agents rapidly depleting tokens.

Companies and customers are being pushed toward sophisticated multi-agent workflows by the agentic hype, but this research may indicate that more isn't always better. A well-designed single agent may perform better than a complex system at a fraction of the cost for many enterprise tasks that call for step-by-step reasoning.

🔗 External Resource:
Visit Link →