Get ChunkHound running

Install the CLI, pick your stack, index your code. Three commands, two minutes.

Install

# Skip if you already have uv
curl -LsSf https://astral.sh/uv/install.sh | sh
uv tool install chunkhound

Verify the install resolved on your shell:

chunkhound --version

Choose your editor, embedding provider, and LLM. Copy the three commands below and run them from your project root.

For local or proxied OpenAI-compatible LLM backends like Ollama or vLLM, keep the generated llm.model value in place. ChunkHound requires an explicit model name for custom base_url endpoints, including per-role overrides that resolve to OpenAI-compatible providers.

Pick your stack

Works with your agent
+ any MCP-compatible
Embedding provider
+ any OpenAI-compatible
LLM provider
echo .chunkhound.json >> .gitignore
cat > .chunkhound.json <<'CHUNKHOUND_EOF'
{
  "embedding": {
    "provider": "voyageai",                // embedding service identifier
    "model": "voyage-3.5",                 // model name
    "api_key": "<YOUR_VOYAGE_API_KEY>"     // replace with your API key
  },
  "llm": {
    "provider": "anthropic",               // which provider runs `chunkhound research`
    "api_key": "<YOUR_ANTHROPIC_API_KEY>"  // replace with your API key
  }
}
CHUNKHOUND_EOF

mkdir -p .cursor
cat > .cursor/mcp.json <<'CHUNKHOUND_EOF'
{
  "mcpServers": {
    "ChunkHound": {
      "command": "chunkhound",
      "args": [
        "mcp"
      ]
    }
  }
}
CHUNKHOUND_EOF

.chunkhound.json holds your API keys

The first command adds it to .gitignore so you don't commit secrets. Replace the <YOUR_*_API_KEY> placeholders with real keys before running. Local OpenAI-compatible backends still need an explicit model. Need Azure OpenAI, a self-hosted endpoint, or a proxy?

Index and verify

Run these four commands in order. Each one verifies a different layer of the stack: database, regex search, embeddings, then LLM.

1. Index the project

chunkhound index .

2. Confirm regex search

chunkhound search --regex "import"

3. Confirm semantic search

chunkhound search "authentication flow"

4. Confirm research

chunkhound research "How does authentication work?"

Use it from your agent

Your agent already has ChunkHound — the editor command from Pick your stack registered it as an MCP tool. The biggest unlock is calling code_research before writing code, not after.

code_research synthesises a cited markdown report covering architecture, key locations, and cross-file flows. One call usually replaces 5–10 manual searches.

Example prompts

The pattern is one line. Paste it into your editor chat with a topic of your own:

Use chunkhound research to ...

A few directions to start with:

  • explain how authentication works end to end, with file:line citations
  • map every caller of the email subsystem and the helpers they share
  • trace the login flow from form submit to Set-Cookie
  • find every file-upload handler and compare their size limits, MIME validation, and storage targets
  • diagnose duplicate webhook deliveries by tracing the handler outward through retries, queues, and idempotency keys

Need just a file or symbol? Ask the agent to use chunkhound search instead — it's faster and skips the LLM.

MCP integration

ChunkHound exposes code_research, search, and a handful of other tools as an MCP server; your editor connected to it via the command in Pick your stack.

Where to next

Stuck? Ask in Discord or open an issue.