Local memory for coding agents. Your agent remembers decisions, bugs, and context across sessions — no cloud, no API keys, no cost.
Install · Features · How it works · Commands · Blog post
EchoVault gives your agent persistent memory. Every decision, bug fix, and lesson learned is saved locally and automatically surfaced in future sessions. Your agent gets better the more you use it.
Coding agents forget everything between sessions. They re-discover the same patterns, repeat the same mistakes, and forget the decisions you made yesterday. I tried other tools like Supermemory and Claude Mem — both are great, but they didn't fit my use case.
Supermemory saves everything in the cloud, which was a deal breaker since I work with multiple companies as a consultant and don't want codebase decisions stored remotely. Claude Mem caused my sessions to consume too much memory, making it hard to run multiple sessions at the same time.
I built EchoVault to solve this: local memory persistence for coding agents that's simple, fast, and private.
Works with 4 agents — Claude Code, Cursor, Codex, OpenCode. One command sets up MCP config for your agent.
MCP native — Runs as an MCP server exposing memory_save, memory_search, and memory_context as tools. Agents call them directly — no shell hooks needed.
Local-first — Everything stays on your machine. Memories are stored as Markdown in ~/.memory/vault/, readable in Obsidian or any editor. No data leaves your machine unless you opt into cloud embeddings.
Zero idle cost — No background processes, no daemon, no RAM overhead. The MCP server only runs when the agent starts it.
Hybrid search — FTS5 keyword search works out of the box. Add Ollama, OpenAI, or OpenRouter for semantic vector search.
Secret redaction — 3-layer redaction strips API keys, passwords, and credentials before anything hits disk. Supports explicit <redacted> tags, pattern detection, and custom .memoryignore rules.
Cross-agent — Memories saved by Claude Code are searchable in Cursor, Codex, and OpenCode. One vault, many agents.
Obsidian-compatible — Session files are valid Markdown with YAML frontmatter. Point Obsidian at ~/.memory/vault/ and browse your agent's memory visually.
pip install git+https://github.com/mraza007/echovault.git
memory init
memory setup claude-code # or: cursor, codex, opencodeThat's it. memory setup installs MCP server config automatically.
By default config is installed globally. To install for a specific project:
cd ~/my-project
memory setup claude-code --project # writes .mcp.json in project root
memory setup opencode --project # writes opencode.json in project root
memory setup codex --project # writes .codex/config.toml + AGENTS.mdEmbeddings enable semantic search. Without them, you still get fast keyword search via FTS5.
Generate a starter config:
memory config initThis creates ~/.memory/config.yaml with sensible defaults:
embedding:
provider: ollama # ollama | openai | openrouter
model: nomic-embed-text
enrichment:
provider: none # none | ollama | openai | openrouter
context:
semantic: auto # auto | always | never
topup_recent: trueWhat each section does:
embedding— How memories get turned into vectors for semantic search.ollamaruns locally,openaiandopenroutercall cloud APIs.nomic-embed-textis a good local model for Ollama.enrichment— Optional LLM step that enhances memories before storing (better summaries, auto-tags). Set tononeto skip.context— Controls how memories are retrieved at session start.autouses vector search when embeddings are available, falls back to keywords.topup_recentalso includes recent memories so the agent has fresh context.
For cloud providers, add api_key under the provider section. API keys are redacted in memory config output.
By default, EchoVault stores data in ~/.memory.
You can change that in two ways:
MEMORY_HOME=/path/to/memory(highest priority, per-shell/per-process)memory config set-home /path/to/memory(persistent default)
Useful commands:
memory config set-home /path/to/memory
memory config clear-home
memory configmemory config now shows both memory_home and memory_home_source (env, config, or default).
Once set up, your agent uses memory via MCP tools:
- Session start — agent calls
memory_contextto load prior decisions and context - During work — agent calls
memory_searchto find relevant memories - Session end — agent calls
memory_saveto persist decisions, bugs, and learnings
The MCP tool descriptions instruct agents to save and retrieve automatically. No manual prompting needed in most cases.
Auto-save hooks (Claude Code) — Optional hooks that ensure Claude always saves learnings before ending a session. See docs/auto-save-hooks.md for setup.
You can also use the CLI directly:
memory save --title "Switched to JWT auth" \
--what "Replaced session cookies with JWT" \
--why "Needed stateless auth for API" \
--impact "All endpoints now require Bearer token" \
--tags "auth,jwt" --category "decision" \
--details "Context:
Options considered:
- Keep session cookies
- Move to JWT
Decision:
Tradeoffs:
Follow-up:"
memory search "authentication"
memory details <id>
memory context --projectFor long details, use --details-file notes.md. To scaffold structured details automatically, use --details-template.
~/.memory/
├── vault/ # Obsidian-compatible Markdown
│ └── my-project/
│ └── 2026-02-01-session.md
├── index.db # SQLite: FTS5 + sqlite-vec
└── config.yaml # Embedding provider config
- Markdown vault — one file per session per project, with YAML frontmatter
- SQLite index — FTS5 for keywords, sqlite-vec for semantic vectors
- Compact pointers — search returns ~50-token summaries; full details fetched on demand
- 3-layer redaction — explicit tags, pattern matching, and
.memoryignorerules
| Agent | Setup command | What gets installed |
|---|---|---|
| Claude Code | memory setup claude-code |
MCP server in .mcp.json (project) or ~/.claude.json (global) |
| Cursor | memory setup cursor |
MCP server in .cursor/mcp.json |
| Codex | memory setup codex |
MCP server in .codex/config.toml + AGENTS.md fallback |
| OpenCode | memory setup opencode |
MCP server in opencode.json (project) or ~/.config/opencode/opencode.json (global) |
All agents share the same memory vault at your effective memory_home path (default ~/.memory/). A memory saved by Claude Code is searchable from Cursor, Codex, or OpenCode.
| Command | Description |
|---|---|
memory init |
Create vault at effective memory home |
memory setup <agent> |
Install MCP server config for an agent |
memory uninstall <agent> |
Remove MCP server config for an agent |
memory save ... |
Save a memory (--details-file and --details-template supported) |
memory search "query" |
Hybrid FTS + semantic search |
memory details <id> |
Full details for a memory |
memory delete <id> |
Delete a memory by ID or prefix |
memory context --project |
List memories for current project |
memory sessions |
List session files |
memory config |
Show effective config |
memory config init |
Generate a starter config.yaml |
memory config set-home <path> |
Persist default memory location |
memory config clear-home |
Remove persisted memory location |
memory reindex |
Rebuild vectors after changing provider |
memory mcp |
Start the MCP server (stdio transport) |
memory uninstall claude-code # or: cursor, codex, opencode
pip uninstall echovaultTo also remove all stored memories: rm -rf ~/.memory/
I Built Local Memory for Coding Agents Because They Keep Forgetting Everything
Everything stays local by default. If you configure OpenAI or OpenRouter for embeddings, those API calls go to their servers. Use Ollama for fully local operation.
MIT — see LICENSE.