Skip to main content

6 posts tagged with "Multi-Agent"

Coordinating multiple AI agents — delegation, hierarchical teams, inter-agent messaging, deadlock and loop prevention.

View All Tags

Lead-to-Proposal Multi-Agent Pipeline in LangGraph

· 25 min read
Vadim Nicolai
Senior Software Engineer

From Lead to Proposal: Building a Multi-Agent Pipeline with LangGraph

A lead-to-proposal pipeline in LangGraph runs an autonomous lead→proposal loop: a raw lead enters, and three specialized agents qualify it, research it from grounded facts, and draft a tailored proposal — every intermediate node executing unattended, with no sales rep between them. That is the whole point of decomposing the work into a multi-agent graph rather than one prompt. The loop earns its autonomy by stopping at exactly one place: a human gate on the send, the single action that carries legal and reputational weight.

That gate is what most implementations get wrong. They either automate everything and lose human oversight at the consequential step, or keep a human in every node and forfeit the throughput the automation was supposed to buy. The pipeline below takes neither path. It automates the expensive cognitive labour — qualify, research, draft — and holds the final verify for an operator, who approves a grounded draft rather than composing one from scratch. The bottleneck was never the proposal itself; it is everything upstream of it, and that is precisely what the loop absorbs.

Hierarchical Coach→Worker Delegation for Agent Teams

· 26 min read
Vadim Nicolai
Senior Software Engineer

A flat agent swarm caps its own autonomy. Let every worker talk to every peer with no leader tracking progress, and the system can run for hours without anyone — human or machine — able to say whether the work was actually done. That is the ceiling this article is about. Hierarchical coach→worker delegation raises it: a single coach plans once, delegates to specialized workers, and those workers act unattended against that one plan instead of re-improvising every step. The autonomy gain is not that more agents run; it is that one durable plan governs many executions over time, so the plan→act→verify loop stops being per-run and becomes a property of the whole campaign.

On the fleet's autonomy ladder this capability sits high. The coach automates the plan step across an entire multi-touch campaign — a sequence that unfolds over weeks, not a single run — and worker subgraphs act against that plan unattended, with the human verify preserved only at each draft's approval. This article grounds that argument in two flag-gated graphs from one production agentic-sales fleet: a campaign-level coach (AA02) and a single-email organized team (AA06). It connects both to the organized-teams paper by Guo et al. (2024) and to decades of organizational evidence. The constants, enums, and feature flags below are read from the code, not from a benchmark. The claim is contrarian because the zeitgeist says "swarm good, hierarchy bad." The evidence says the opposite.

Design-Thinking Multi-Agent Panels for Campaign Strategy

· 25 min read
Vadim Nicolai
Senior Software Engineer

Design-thinking multi-agent campaign strategy is what you get when you let an agent fleet own the plan step that a human normally improvises in their head. Instead of a hard-coded six-touch weekly drip, one LangGraph graph simulates a room of human experts — a strategist, a skeptic, a brand-voice lens — arguing over how a multi-touch outreach sequence should be shaped before the first email is ever drafted. On the fleet's autonomy ladder this capability sits medium: it automates the deliberation over what a campaign's touch sequence should be, then hands the resulting plan to the durable engine, which still holds every individual email for human approval before it acts. Autonomy is earned, not asserted — the panel's output is only a seed (cadence and per-touch angles), never a send.

Deadlock & Infinite-Loop Prevention in Multi-Agent Sales

· 22 min read
Vadim Nicolai
Senior Software Engineer

How to Prevent Deadlocks and Infinite Loops in Multi-Agent Sales Workflows

Deadlock and infinite-loop prevention in multi-agent sales workflows starts with one ugly trace: a sales agent sits idle while a competitor closes the deal. Two nodes trade the same lead back and forth — rechecking CRM fields, re-requesting approval, re-updating scores — until the opportunity ages out. No cancellation, no escalation, no crash. Just an infinite loop that burns credits, writes no value, and slips past every per-message quality gate, because each individual draft looks fine.

This is article #8 of The Autonomous Sales Fleet — one production LangGraph + DeepSeek + Cloudflare-D1 + LangSmith system where each article realizes one 2026 reliability paper as one real graph node. The constraints stay constant across the series. A three-plane architecture splits the work: a LangGraph control plane, a Cloudflare data plane, and a LangSmith observability plane. DeepSeek-only egress runs through a single AI Gateway. A 0.80 eval gate sits on every prompt path. Grounding-First provenance tags every persisted decision, and every send waits on draft-first human approval. This piece adds the liveness layer: structural deadlock and infinite-loop prevention that runs before any model judges anything.

This is a guardrail, not a rung on the autonomy ladder. It is one of the constraints that earns the autonomy the higher rungs exercise — the CRM orchestrator, the coach→worker teams, the lead-to-proposal pipeline. Every plan→act→verify loop that runs unattended needs a deterministic floor under it. That floor proves the loop will actually terminate; without it, the act step has no safe upper bound. This guard is the thing that lets the fleet trust a self-directed loop at all.

Red Teaming LLM Applications with DeepTeam: A Production Implementation Guide

· 21 min read
Vadim Nicolai
Senior Software Engineer

Your LLM application passed all its unit tests. It's still dangerously vulnerable. This isn't just about a bug; it's about a fundamental misunderstanding of risk in autonomous systems. Consider this: an AI agent with a seemingly robust 85% accuracy per individual step has only a ~20% chance of successfully completing a 10-step task. That's the brutal math of compound probability in agentic workflows. The gap between functional correctness and adversarial safety is where silent, catastrophic failures live -- failures that manifest as cost-burning "Tool Storms" or logic-degrading "Context Bloat".

The stakes are not hypothetical. Stanford researchers found that GPT-4 hallucinated legal facts 58% of the time on verifiable questions about federal court cases. In Mata v. Avianca (2023), a lawyer was sanctioned $5,000 for filing a ChatGPT-generated brief with six fabricated cases. Since then, over $31K in combined sanctions have been levied across courts, and 300+ judges now require AI citation verification in their standing orders. The compound failure isn't a rare edge case -- it's the baseline behavior of unsupervised LLM applications in high-stakes domains.

Red teaming is the disciplined, automated process of finding these systemic flaws before they reach production. In this guide, I'll walk through a production implementation using DeepTeam, an open-source adversarial testing framework. We'll move beyond theory into the mechanics of architecting your judge model, enforcing safety thresholds in CI, and grounding everything in two real case studies: a high-stakes therapeutic audio agent for children, and a 6-agent adversarial pipeline that stress-tests legal briefs using the same adversarial structure that has powered legal systems for centuries.

CrewAI's Genuinely Unique Features: An Honest Technical Deep-Dive

· 14 min read
Vadim Nicolai
Senior Software Engineer

TL;DR — CrewAI's real uniqueness is that it models problems as "build a team of people" rather than "build a graph of nodes" (LangGraph) or "build a conversation" (AutoGen). The Crews + Flows dual-layer architecture is the core differentiator. The role-playing persona system and autonomous delegation are ergonomic wins, not technical breakthroughs. The hierarchical manager is conceptually appealing but broken in practice. This post separates what's genuinely novel from what's marketing.