agents / mcp / cli

Anthropic’s MCP (Model Context Protocol), launched in Nov 2024 and now widely adopted, promised a unified open standard for models to call external tools, but production use in late 2025–2026 revealed a big drawback: MCP’s tool metadata consumes huge amounts of model context tokens, driving cost and latency. The context bloat—examples show dozens of tool schemas eating tens of thousands of tokens—prompted Anthropic engineers to acknowledge the issue. The debate intensified when Peter Steinberger

0.2

Steady

News Items

Articles

Sources

First Seen

2026-05-28 03:51:38

30-Day Trend

05-28

05-29

05-30

Source Breakdown

Zeli (1)sopilot (1)HN (1)Dev.to (1)

Key Entities

MCPQuandri EngineeringGPT-4o(OpenAI)Claude Code(Anthropic)OpenClawAnthropic

Why It Matters

Tech professionals building agent systems must balance interoperability with runtime costs; MCP's metadata-driven approach affects latency, reliability, and inference costs. Decisions about agent tooling standards will influence architecture, deployment, and vendor choices.

Latest Changes

Independent tests show MCP tool schemas can use tens of thousands of context tokens for real stacks
Engineers report MCP-driven context bloat causes higher latency and cost in production agents
Debate intensifies over MCP practicality versus alternative CLI or lightweight integration approaches

Timeline

2024-11-01 — Anthropic launched the Model Context Protocol as an open standard for model-tool integration
2025-11-01 — MCP saw wide adoption across agents and tooling ecosystems
2026-05-28 — Reports and guides note production issues with MCP context bloat and cost
2026-05-29 — Quandri Engineering published tests showing examples where 77 tools consumed about 21k tokens
2026-05-29 — Public discussion escalated highlighting reliability, latency, and duplication problems with MCP

What to Watch

Responses or mitigations from Anthropic or standards groups to reduce MCP metadata in model context
Adoption of alternative patterns like CLI-style connectors or external tool registries to avoid context bloat

Dossier last updated: 2026-05-29 23:34:00

Recent News (4)

MCP Is Dead

Quandri Engineering tested Model Context Protocol (MCP) on a real stack and finds it costly: MCP tool schemas consume substantial LLM context windows (e.g., 77 tools ≈ 21k tokens, using 10.5% of a 200k-token Claude window and 16.5% of GPT-4o’s 128k), reducing space for actual prompts. They measured large per-tool sizes (Linear definitions alone ~12.8k tokens) and report reliability and performance problems—init failures, crashes, slower responses due to extra process round-trips, and opaque permissions. MCP also duplicates functionality available via CLIs/APIs, losing composability, debuggability, and human parity. An update notes Claude Code’s Deferred Loading mitigates context bloat, but Quandri argues performance, debugging, and architectural drawbacks remain relevant.

26pts

Zelinadis20h ago

MCP Is Dead

Quandri Engineering analyzed the Model Context Protocol (MCP) and found it impractical for production use: MCP consumes substantial LLM context, introduces reliability and latency problems, and duplicates existing CLI/API capabilities. Measurements on a Quandri stack show 77 tools from four MCP servers consuming ~21k tokens (10.5% of a 200k-token Claude window, 16.5% of a 128k GPT-4o window), with individual tool schemas like linear/save_issue using ~619 tokens. Operational issues include init failures, process maintenance, slower round-trips (benchmarks show multix slower than direct REST), mid-session crashes, and opaque permissions. Quandri argues CLI/API paths remain more composable, debuggable, and efficient; note: Claude Code’s deferred loading later reduces context bloat but other concerns persist.

372pts

HNnadis21h ago