Semble boosts agent code search; MCP rules matter

A new CPU-only code search engine, Semble, aims to speed and shrink agent-driven code retrieval by returning concise snippets and using roughly 98% fewer tokens than a grep-and-read workflow. Built with static Model2Vec embeddings, BM25, RRF fusion and code-aware reranking, Semble indexes repos in ~250 ms and answers queries in ~1.5 ms on CPU while matching ~99% of a 137M code transformer's retrieval quality. It can run locally as an MCP server for agents like Claude Code or Codex. Separately, AWS’s Agent Toolkit shows that MCP servers and curated skills only work reliably when agents are given a short rules file directing them to prefer MCP tools and verify APIs—omitting that file causes agents to ignore available skills and fall back to training data.

Latest Changes

Semble open-sourced as a CPU-only code search optimized for agents

Semble claims ~98% token reduction vs grep+read with ~99% retrieval parity to a 137M code transformer

Semble indexes repos in ~250 ms and answers queries in ~1.5 ms on CPU

Semble uses Model2Vec embeddings, BM25, RRF fusion and code-aware reranking

AWS Agent Toolkit requires a short rules file to make agents prefer MCP tools and verify APIs

Timeline

2026-05-06 — AWS released the Agent Toolkit with MCP server, sandboxed execution, and 20+ curated skills

2026-05-14 — Reporting highlighted that the AWS Toolkit needs a short rules file for agents to load MCP skills reliably

2026-05-17 — Semble was posted on Show HN as a new agent-focused code-search library using 98% fewer tokens

2026-05-18 — MinishLab announced Semble as a fast, accurate CPU-only code search engine matching ~99% retrieval quality

What to Watch

Adoption of Semble as a local MCP server for agent toolchains like Claude Code or Codex

Whether agents using the AWS Toolkit include the rules file that enforces MCP/tool preference and API verification

Independent evaluations comparing Semble's retrieval quality and token savings across diverse codebases

Recent News (5)

Semble：为 AI Agent 打造的代码搜索，比 grep 节省 98% token

Semble is a lightweight code-search library designed for AI agents that returns precise snippets with about 98% fewer tokens than a grep-and-read workflow. It indexes a typical repo in ~250 ms and answers queries in ~1.5 ms on CPU, claiming ~200x faster indexing and ~10x faster queries than a code-specialized transformer while maintaining 99% of retrieval quality (NDCG@10 ≈ 0.854). Semble runs locally (no GPU or API keys), can operate as an MCP server compatible with Claude Code, Codex, Cursor, OpenCode and others, and supports searching local paths or git URLs via a CLI (semble search / find-related). The tool reduces token usage for agents, speeds agent access to code, and enables on-device, privacy-friendly code retrieval for developer workflows and LLM agent integrations.

src_agent-collectGitHub / Hacker News1d ago

MinishLab/semble: Fast and Accurate Code Search for Agents. Uses ~98% fewer tokens than grep+read

123pts

GitHubMinishLab2d ago

Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep

Semble is a new code-search library optimized for AI agents that claims to deliver fast, accurate, token-efficient retrieval, returning only relevant snippets and using ~98% fewer tokens than a grep-and-read workflow. It indexes repositories in ~250 ms and answers queries in ~1.5 ms on CPU, while matching 99% of retrieval quality of code-specialized transformer models (NDCG@10 ≈ 0.854) and offering ~200x faster indexing and ~10x faster queries in their benchmarks. Semble runs locally with no API keys or GPUs, can act as an MCP server for agents like Claude Code, Codex, Cursor, and OpenCode, or be invoked via a CLI/Bash integration, and supports local paths or git URLs with automatic re-indexing. This matters for agent-driven development workflows and cost/latency-sensitive code-assistance tools.

Semble boosts agent code search; MCP rules matter

Why It Matters

Latest Changes

Timeline

What to Watch

Articles

AI as Infrastructure, Agent Tools, and Cheap Edge AI Hacks

Recent News (5)