Multi-Agent Team

Built on the Vercel AI SDK

Nine ways to make AI agents work as a team

The same request, solved by nine different multi-agent architectures — from a single coordinator delegating to specialists, to peers negotiating on a bus, to a market where agents bid for work. Watch them think, stream live, and see what each pattern costs.

This is a public demo — bring your own OpenAI or Anthropic API key in Settings to run it.

How a team handles one request

The same job, coordinated four different ways. Pick a pattern — or watch them cycle.

“Research the state of multi-agent AI and write a brief.”

Coordinator: plans and delegates
Researcher: searches the web
Writer: drafts the piece
Editor: polishes and checks
Result: synthesized brief

A coordinator plans, delegates to specialists, and synthesizes the result.

The nine architectures

Open any architecture for how it works, the agents, notes, and references.

Orchestrated

coordinator + research / write / edit

Best for: Linear content pipelines where the steps and their order are clear up front.

Trade-off: A single coordinator is a bottleneck and a single point of failure; no parallelism.
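
A minimal sketch of the orchestrated pattern, assuming each specialist is a plain function standing in for a model call. The names `specialists` and `orchestrate` are hypothetical and not taken from the app. The coordinator fixes the plan up front and runs the steps strictly in order, which is where the bottleneck and the lack of parallelism come from.

```typescript
// Hypothetical stand-ins for model-backed agents; a real app would call an LLM here.
type Specialist = (input: string) => string;

const specialists: Record<"research" | "write" | "edit", Specialist> = {
  research: (topic) => `notes on ${topic}`,
  write: (notes) => `draft from ${notes}`,
  edit: (draft) => `polished ${draft}`,
};

// The coordinator decides the plan up front and runs each step in order,
// feeding every specialist the previous specialist's output.
function orchestrate(request: string): { result: string; trace: string[] } {
  const plan: Array<keyof typeof specialists> = ["research", "write", "edit"];
  const trace: string[] = [];
  let current = request;
  for (const step of plan) {
    current = specialists[step](current);
    trace.push(`${step}: ${current}`);
  }
  return { result: current, trace };
}

const orchestrated = orchestrate("multi-agent AI");
console.log(orchestrated.result);
```

If any specialist throws, the whole run fails with it, which mirrors the single point of failure noted above.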

Choreographed

backend / frontend / design peers

Best for: Cross-functional design tasks where peers must negotiate a shared artifact.

Trade-off: Peer negotiation can loop or stall; harder to guarantee convergence than a coordinator.
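
One way to picture choreography is a message bus with no coordinator: a peer publishes a proposal, the other peers either accept or counter, and the first objection becomes the next proposal. The sketch below is an assumption-laden illustration; the `Bus` class, the `pageSize` artifact, and each peer's rule are hypothetical. The message cap is what stops a negotiation that loops or stalls.

```typescript
type Proposal = { from: string; pageSize: number };
// A peer returns a counter-proposal, or null to accept.
type Peer = { name: string; react: (p: Proposal) => Proposal | null };

class Bus {
  private queue: Proposal[] = [];
  publish(p: Proposal): void { this.queue.push(p); }
  next(): Proposal | undefined { return this.queue.shift(); }
}

// Hypothetical peers, each defending one constraint on the shared artifact.
const peers: Peer[] = [
  { name: "backend", react: (p) => (p.pageSize > 50 ? { from: "backend", pageSize: 50 } : null) },
  { name: "frontend", react: (p) => (p.pageSize < 20 ? { from: "frontend", pageSize: 20 } : null) },
  {
    name: "design",
    react: (p) =>
      p.pageSize % 10 !== 0 ? { from: "design", pageSize: Math.round(p.pageSize / 10) * 10 } : null,
  },
];

function negotiate(opening: Proposal, maxMessages = 20): Proposal | null {
  const bus = new Bus();
  bus.publish(opening);
  for (let i = 0; i < maxMessages; i++) {
    const proposal = bus.next();
    if (proposal === undefined) return null;
    const counters = peers
      .filter((peer) => peer.name !== proposal.from)
      .map((peer) => peer.react(proposal))
      .filter((c): c is Proposal => c !== null);
    const [firstCounter] = counters;
    if (firstCounter === undefined) return proposal; // every other peer accepted
    bus.publish(firstCounter); // the first objection becomes the next proposal
  }
  return null; // no convergence within the message cap
}

const agreed = negotiate({ from: "frontend", pageSize: 100 });
console.log(agreed);
```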

Hierarchical

a lead spawns sub-agents on the fly

Best for: Open-ended tasks that naturally break into nested, independent subtasks.

Trade-off: Emergent tree shape is less predictable; recursive spawning and synthesis cost more tokens.
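
The recursive spawning can be sketched as a function that either answers a task directly or splits it and calls itself on each part. This is an illustrative sketch only: `splitTask` is a hypothetical rule that splits on the first " and ", where a real lead agent would ask a model how to decompose. Counting spawned agents makes the extra cost visible.

```typescript
// Hypothetical decomposition rule: split on the first " and ", or return [] for a leaf task.
function splitTask(task: string): string[] {
  const idx = task.indexOf(" and ");
  return idx === -1 ? [] : [task.slice(0, idx), task.slice(idx + 5)];
}

type TreeResult = { answer: string; agentsSpawned: number };

function solveTree(task: string, depth = 0, maxDepth = 3): TreeResult {
  // The depth cap keeps the emergent tree from growing without bound.
  const subtasks = depth < maxDepth ? splitTask(task) : [];
  if (subtasks.length === 0) {
    return { answer: `[${task}]`, agentsSpawned: 1 }; // leaf agent answers directly
  }
  const children = subtasks.map((t) => solveTree(t, depth + 1, maxDepth));
  return {
    answer: children.map((c) => c.answer).join(" + "), // synthesis step
    agentsSpawned: 1 + children.reduce((n, c) => n + c.agentsSpawned, 0),
  };
}

const tree = solveTree("survey papers and compare frameworks and list risks");
console.log(tree.answer, tree.agentsSpawned);
```

Three leaf subtasks end up using five agents here, because every intermediate node is an agent too.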

Evaluator–Optimizer

generate → critique → revise, until it passes

Best for: A single artifact you want iteratively improved to a quality bar — a draft, spec, or snippet.

Trade-off: Cost grows with each round; a never-satisfied critic can burn the full round budget.
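
The generate, critique, revise loop can be sketched as follows, assuming stub functions in place of model calls. The names `generate`, `critique`, `revise`, and `refine`, and the length-based quality bar, are hypothetical. The `maxRounds` cap is the round budget a never-satisfied critic would exhaust.

```typescript
type Critique = { pass: boolean; feedback: string };

// Hypothetical stubs; in practice each would be a separate model call.
const generate = (task: string): string => `${task}.`;
const critique = (draft: string): Critique =>
  draft.length >= 40 ? { pass: true, feedback: "" } : { pass: false, feedback: "add detail" };
const revise = (draft: string, feedback: string): string => `${draft} (${feedback})`;

function refine(task: string, maxRounds = 5): { draft: string; rounds: number; passed: boolean } {
  let draft = generate(task);
  for (let round = 1; round <= maxRounds; round++) {
    const verdict = critique(draft);
    if (verdict.pass) return { draft, rounds: round, passed: true };
    draft = revise(draft, verdict.feedback); // each failed round costs another revision
  }
  return { draft, rounds: maxRounds, passed: false }; // budget spent without passing
}

const refined = refine("Brief on agents");
console.log(refined);
```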

Debate

opposing sides argue, a judge decides

Best for: Decisions and trade-offs where the strongest case for each side should be heard first.

Trade-off: Adds rounds of argument before any answer; the verdict quality depends on the judge.
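
A rough sketch of a debate, under the assumption that each argument carries a numeric `evidence` score and the judge simply totals it per side. Both the scoring and the `debaters` stubs are hypothetical; a real judge would be another model reading the transcript. The example shows how the verdict depends on the judge's rule and on how many rounds are run.

```typescript
type Side = "pro" | "con";
type Argument = { side: Side; text: string; evidence: number };

// Hypothetical debaters: each returns one argument per round for its side.
const debaters: Record<Side, (motion: string, round: number) => Argument> = {
  pro: (motion, round) => ({ side: "pro", text: `Round ${round}: adopt ${motion}`, evidence: 2 }),
  con: (motion, round) => ({ side: "con", text: `Round ${round}: avoid ${motion}`, evidence: round }),
};

// Hypothetical judge: totals evidence per side; ties go to "pro".
function judgeDebate(transcript: Argument[]): Side {
  const score = (side: Side): number =>
    transcript.filter((a) => a.side === side).reduce((sum, a) => sum + a.evidence, 0);
  return score("pro") >= score("con") ? "pro" : "con";
}

function debate(motion: string, rounds: number): { verdict: Side; transcript: Argument[] } {
  const transcript: Argument[] = [];
  for (let r = 1; r <= rounds; r++) {
    transcript.push(debaters.pro(motion, r), debaters.con(motion, r));
  }
  return { verdict: judgeDebate(transcript), transcript };
}

const shortDebate = debate("microservices", 2);
const longDebate = debate("microservices", 4);
console.log(shortDebate.verdict, longDebate.verdict);
```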

Blackboard

agents share one workspace; a controller picks who acts

Best for: Problems whose answer assembles from many partial contributions converging on a shared artifact.

Trade-off: Controller selection can loop; no direct peer messaging means coordination is slower.
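
A blackboard can be sketched as a shared object plus a controller that repeatedly picks one agent able to contribute. In this illustrative sketch the `Board` fields, the three sources, and the "first eligible source wins" controller are all assumptions. Agents never message each other; they only read and write the board, and the step cap guards against a controller that keeps looping.

```typescript
type Board = { facts?: string; outline?: string; brief?: string };
type KnowledgeSource = {
  name: string;
  canAct: (b: Board) => boolean;
  act: (b: Board) => void;
};

// Hypothetical sources; each one contributes a single piece to the shared board.
const sources: KnowledgeSource[] = [
  { name: "writer", canAct: (b) => !!b.outline && !b.brief, act: (b) => { b.brief = `brief(${b.outline})`; } },
  { name: "planner", canAct: (b) => !!b.facts && !b.outline, act: (b) => { b.outline = `outline(${b.facts})`; } },
  { name: "researcher", canAct: (b) => !b.facts, act: (b) => { b.facts = "facts"; } },
];

function runBlackboard(maxSteps = 10): { board: Board; order: string[] } {
  const board: Board = {};
  const order: string[] = [];
  for (let step = 0; step < maxSteps && !board.brief; step++) {
    // Controller: pick the first source that can act on the current board state.
    const next = sources.find((s) => s.canAct(board));
    if (next === undefined) break;
    next.act(board);
    order.push(next.name);
  }
  return { board, order };
}

const blackboardRun = runBlackboard();
console.log(blackboardRun.order.join(" -> "), blackboardRun.board.brief);
```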

Market

agents bid on tasks; best bid wins

Best for: Heterogeneous work where the best agent for each task is not obvious up front.

Trade-off: The bid round adds extra LLM calls; it is only worth it for larger, varied agent pools.
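
The bidding step might look like the sketch below, assuming each agent reports a self-assessed confidence for a task kind and the highest bid wins. The `bidders` pool and its skill numbers are hypothetical. Note that every task triggers one bid per agent before any work starts, which is the overhead described above.

```typescript
type MarketTask = { id: string; kind: string };
type Bidder = { name: string; skill: Partial<Record<string, number>> };

// Hypothetical agent pool with self-assessed skill per task kind.
const bidders: Bidder[] = [
  { name: "coder", skill: { code: 0.9, prose: 0.3 } },
  { name: "writer", skill: { code: 0.2, prose: 0.8 } },
  { name: "generalist", skill: { code: 0.5, prose: 0.5, data: 0.6 } },
];

function award(task: MarketTask): { winner: string; bidCount: number } {
  // Bid round: in a real system each bid would be its own model call.
  const bids = bidders.map((b) => ({ agent: b.name, confidence: b.skill[task.kind] ?? 0 }));
  // Highest confidence wins; ties keep the earlier bidder. Assumes a non-empty pool.
  const best = bids.reduce((top, b) => (b.confidence > top.confidence ? b : top));
  return { winner: best.agent, bidCount: bids.length };
}

const codeAward = award({ id: "t1", kind: "code" });
const dataAward = award({ id: "t2", kind: "data" });
console.log(codeAward.winner, dataAward.winner);
```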

Self-Consistency

sample in parallel, judge the best

Best for: Questions where one attempt is noisy but agreement across attempts signals quality.

Trade-off: Sampling N attempts in parallel costs about N× the tokens of a single attempt, before the judging step is counted.
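
A minimal sketch of self-consistency, with two stated assumptions: the judge here is a plain majority vote rather than a model, and `sampleAnswer` is a hypothetical deterministic stub that returns a noisy answer. The samples are produced in a loop for simplicity; real model calls would typically be issued concurrently.

```typescript
// Hypothetical sampler: mostly answers "42", sometimes "41", to imitate noisy attempts.
const sampleAnswer = (question: string, seed: number): string =>
  (question.length + seed) % 3 === 0 ? "41" : "42";

function selfConsistent(question: string, n: number): { answer: string; agreement: number } {
  const samples = Array.from({ length: n }, (_, i) => sampleAnswer(question, i));
  // Tally identical answers.
  const votes = new Map<string, number>();
  for (const s of samples) votes.set(s, (votes.get(s) ?? 0) + 1);
  // Majority vote stands in for the judge.
  let answer = "";
  let top = 0;
  votes.forEach((count, candidate) => {
    if (count > top) {
      answer = candidate;
      top = count;
    }
  });
  return { answer, agreement: n > 0 ? top / n : 0 };
}

const consensus = selfConsistent("6 x 7", 5);
console.log(consensus);
```

The `agreement` ratio is a cheap signal of how much to trust the winning answer.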

Swarm

identical agents build on a shared scratchpad

Best for: Open-ended ideation and refinement that benefits from many cheap passes converging.

Trade-off: No structure means redundancy and drift; convergence isn’t guaranteed, so it’s round-capped.
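
The swarm can be sketched as identical agents that each read a shared scratchpad and append at most one new idea per pass. Everything here is an assumption made for illustration: the fixed `ideaPool`, the `swarmAgent` rule, and the "stop when a round adds nothing" convergence check. The round cap is the backstop for runs that never settle.

```typescript
// Hypothetical idea source; a real agent would generate ideas with a model.
const ideaPool = ["cache results", "batch requests", "stream output"];

// Every agent runs the same logic: add the first idea not yet on the scratchpad.
function swarmAgent(scratchpad: string[]): string | null {
  const idea = ideaPool.find((candidate) => scratchpad.indexOf(candidate) === -1);
  return idea === undefined ? null : idea;
}

function runSwarm(agents: number, maxRounds: number): { scratchpad: string[]; rounds: number } {
  const scratchpad: string[] = [];
  let rounds = 0;
  while (rounds < maxRounds) {
    rounds++;
    let added = 0;
    for (let a = 0; a < agents; a++) {
      const idea = swarmAgent(scratchpad);
      if (idea !== null) {
        scratchpad.push(idea);
        added++;
      }
    }
    if (added === 0) break; // a full round with no new contributions counts as converged
  }
  return { scratchpad, rounds };
}

const swarmRun = runSwarm(2, 5);
console.log(swarmRun);
```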

Pick a pattern and watch it run

Switch architectures from a dropdown, stream the agents’ reasoning live, and see the cost of every step.