Subcog

A persistent memory system for AI coding assistants. Subcog captures decisions, learnings, and context from coding sessions and surfaces them when relevant.

Overview

Subcog delivers:

Single-binary distribution (<100MB, <10ms cold start)
Three-layer storage architecture: SQLite persistence, FTS5 indexing, usearch HNSW vectors
MCP server integration for AI agent interoperability
Claude Code plugin with hooks for seamless IDE integration
Semantic search with hybrid vector + BM25 ranking (RRF fusion)
Knowledge graph with entity extraction and relationship inference
Faceted storage with project, branch, and file path filtering

ADR Compliance

ADR compliance is tracked in docs/adrs/README.md. Current compliance is 95% (55/58 active ADRs) with documented partials in ADR-0003 and ADR-0039.

Benchmark Results

Subcog achieves 97% accuracy on factual recall (LongMemEval) and 57% on personal context (LoCoMo), compared to 0% baseline without memory. See full benchmark results.

Benchmark	With Subcog	Baseline	Improvement
LongMemEval	97%	0%	+97%
LoCoMo	57%	0%	+57%
ContextBench	24%	0%	+24%
MemoryAgentBench	28%	21%	+7%

Features

Core (Always Available)

Memory capture with automatic embedding generation (384-dimensional vectors)
Real semantic search using all-MiniLM-L6-v2 via fastembed-rs
Hybrid search combining BM25 text search + vector similarity (RRF fusion)
Normalized scores (0.0-1.0 range) for intuitive relevance understanding
SQLite persistence as single source of truth (ACID-compliant)
Faceted storage with project_id, branch, and file_path fields
Multi-domain memories (project, user, organization)
10 memory namespaces (decisions, learnings, patterns, blockers, etc.)
Branch garbage collection for tombstoning stale branch memories
Migration tools for upgrading existing memories to use embeddings

Enhanced (Opt-in)

Entity and temporal extraction
Secrets filtering (API keys, PII detection)
OpenTelemetry observability
Full Claude Code hook integration

LLM-Powered (Requires Provider)

Implicit capture from conversations
Memory consolidation and summarization
Supersession detection
Temporal reasoning queries

Installation

Multiple installation methods are available. See INSTALLATION.md for detailed instructions.

# Cargo (recommended - Rust developers)
cargo install subcog

# Homebrew (macOS/Linux)
brew install zircote/tap/subcog

# Docker
docker run --rm ghcr.io/zircote/subcog --help

# Binary download
curl -LO https://github.com/zircote/subcog/releases/latest/download/subcog-VERSION-TARGET.tar.gz

# npm/npx (fallback if binary install unavailable)
npx @zircote/subcog --help

Method	Platforms	Auto-update
Cargo	All	`cargo install`
Homebrew	macOS, Linux	`brew upgrade`
Docker	linux/amd64, linux/arm64	Pull latest tag
Binary	All	Manual
npm/npx	macOS, Linux, Windows	Via npm

Quick Start

# Capture a memory
subcog capture --namespace decisions "Use PostgreSQL for primary storage due to ACID requirements"

# Search memories (semantic search with normalized scores 0.0-1.0)
subcog recall "database storage decision"

# Search with raw RRF scores (for debugging)
subcog recall "database storage decision" --raw

# Check status
subcog status

# Migrate existing memories to use real embeddings
subcog migrate embeddings

Score Normalization

Search results return normalized scores in the 0.0-1.0 range:

1.0: Best match in the result set
>=0.7: Strong semantic match
>=0.5: Moderate relevance
<0.5: Weak match

Use --raw flag to see the underlying RRF (Reciprocal Rank Fusion) scores.

MCP Server

Run as an MCP server for AI agent integration:

subcog serve

Configure in Claude Desktop's claude_desktop_config.json:

{
  "mcpServers": {
    "subcog": {
      "command": "subcog",
      "args": ["serve"]
    }
  }
}

Note: This configuration requires the subcog binary to be installed and in your PATH. Install via cargo install subcog, Homebrew, or download from GitHub Releases. If you cannot install the binary, use npx as a fallback:
{ "command": "npx", "args": ["-y", "@zircote/subcog", "serve"] }

Available MCP Tools

Subcog exposes ~22 consolidated MCP tools (see ADR-0061 for the consolidation design):

Category	Tools	Description
Core	`subcog_capture`, `subcog_recall`, `subcog_status`	Memory CRUD and search
Lifecycle	`subcog_consolidate`, `subcog_reindex`	Maintenance operations
CRUD	`subcog_get`, `subcog_update`, `subcog_delete`, `subcog_list`	Individual memory operations
Bulk	`subcog_delete_all`, `subcog_restore`, `subcog_history`	Bulk and recovery operations
Graph	`subcog_graph`, `subcog_entities`, `subcog_relationships`	Knowledge graph queries
Prompts	`subcog_prompts`, `prompt_understanding`	Prompt template management
Templates	`subcog_templates`	Context template management
Session	`subcog_init`, `subcog_get_summary`, `subcog_namespaces`	Session and namespace info
Compliance	`subcog_gdpr_export`	Data export for compliance

When working with an agent, treat inputs that match MCP tool names as tool invocations (not shell commands) unless you explicitly say "shell" or "run in terminal".

Transport Security

Subcog supports two transport modes with different security models:

Stdio Transport (Default)

The stdio transport is the default and recommended mode for local development:

Property	Description
Trust Model	Process isolation via OS - parent spawns subcog as child process
Network Exposure	None - communication only via stdin/stdout pipes
Authentication	Implicit - same-user execution, no credentials required
Confidentiality	Data stays on local machine (SQLite is authoritative)
Integrity	OS guarantees pipe integrity, no MITM attacks possible

When to use: Local development with Claude Desktop or other MCP clients that spawn subcog directly.

HTTP Transport (Optional)

Enable the HTTP transport for network-accessible deployments:

# Enable HTTP transport
subcog serve --http --port 8080

# With JWT authentication (recommended for production)
subcog serve --http --port 8080 --jwt-secret "your-secret-key"

Property	Description
Trust Model	JWT token authentication required
Network Exposure	Binds to specified port (localhost by default)
Authentication	JWT tokens with configurable expiry
CORS	Configurable allowed origins
TLS	Use a reverse proxy (nginx, Caddy) for HTTPS

When to use: Team environments, remote access, or integration with web-based clients.

Data Protection

Both transports include built-in security features:

Secrets Detection: API keys, tokens, and passwords are detected and optionally redacted
PII Filtering: Personal information can be masked before storage
Encryption at Rest: Enable with encryption_enabled = true (default: true)
Audit Logging: All operations are logged for compliance (SOC2, GDPR)

See environment-variables.md for security configuration options.

Claude Code Hooks

Subcog integrates with all 5 Claude Code hooks:

Hook	Purpose
`SessionStart`	Inject relevant context at session start
`UserPromptSubmit`	Detect capture signals in prompts
`PostToolUse`	Surface related memories after file operations
`PreCompact`	Analyze conversation for auto-capture
`Stop`	Finalize session, capture pending memories

Configure in ~/.claude/settings.json:

{
  "hooks": {
    "SessionStart": [{ "command": "subcog hook session-start" }],
    "UserPromptSubmit": [{ "command": "subcog hook user-prompt-submit" }],
    "PostToolUse": [{ "command": "subcog hook post-tool-use" }],
    "PreCompact": [{ "command": "subcog hook pre-compact" }],
    "Stop": [{ "command": "subcog hook stop" }]
  }
}

Migration

Upgrade existing memories to use real embeddings:

# Dry-run (see what would be migrated)
subcog migrate embeddings --dry-run

# Migrate all memories without embeddings
subcog migrate embeddings

# Force re-generation of all embeddings
subcog migrate embeddings --force

# Migrate from a specific repository
subcog migrate embeddings --repo /path/to/repo

The migration command:

Scans all memories in the index
Generates embeddings using fastembed (all-MiniLM-L6-v2)
Stores embeddings in the vector backend (usearch HNSW)
Skips memories that already have embeddings (unless --force)
Shows progress with migrated/skipped/error counts

Architecture

Subcog uses a three-layer storage architecture to separate concerns:

System Architecture Diagram

flowchart TB
    subgraph Access["Access Layer"]
        CLI["CLI<br/>subcog capture/recall/status"]
        MCP["MCP Server<br/>JSON-RPC over stdio"]
        Hooks["Claude Code Hooks<br/>SessionStart, UserPrompt, Stop"]
    end

    subgraph Services["Service Layer"]
        Capture["CaptureService<br/>Memory ingestion"]
        Recall["RecallService<br/>Hybrid search"]
        GC["GCService<br/>Branch cleanup"]
        Dedup["DeduplicationService<br/>3-tier duplicate detection"]
        Context["ContextBuilder<br/>Adaptive injection"]
    end

    subgraph Storage["Three-Layer Storage"]
        subgraph Persistence["Persistence Layer<br/>(Authoritative)"]
            SQLiteP["SQLite<br/>(default)"]
            PostgresP["PostgreSQL"]
            FS["Filesystem"]
        end

        subgraph Index["Index Layer<br/>(Searchable)"]
            SQLiteI["SQLite + FTS5<br/>(default)"]
            PostgresI["PostgreSQL FTS"]
            Redis["RediSearch"]
        end

        subgraph Vector["Vector Layer<br/>(Embeddings)"]
            usearch["usearch HNSW<br/>(default)"]
            pgvector["pgvector"]
            RedisV["Redis Vector"]
        end
    end

    subgraph External["External Systems"]
        FastEmbed["FastEmbed<br/>all-MiniLM-L6-v2"]
        LLM["LLM Provider<br/>Anthropic/OpenAI/Ollama"]
    end

    CLI --> Capture
    CLI --> Recall
    MCP --> Capture
    MCP --> Recall
    Hooks --> Context
    Hooks --> Capture

    Capture --> Persistence
    Capture --> Index
    Capture --> Vector
    Capture --> FastEmbed
    Capture --> Dedup

    Recall --> Index
    Recall --> Vector
    Recall --> FastEmbed

    Context --> Recall
    Context --> LLM

    Dedup --> Recall
    Dedup --> FastEmbed

    GC --> Persistence
    GC --> Index

    style Access fill:#e1f5fe
    style Services fill:#fff3e0
    style Storage fill:#e8f5e9
    style External fill:#fce4ec

Data Flow Diagram

sequenceDiagram
    participant User
    participant CLI/MCP
    participant CaptureService
    participant Dedup
    participant FastEmbed
    participant Persistence
    participant Index
    participant Vector

    User->>CLI/MCP: subcog capture "decision..."
    CLI/MCP->>CaptureService: CaptureRequest
    CaptureService->>Dedup: Check duplicate
    Dedup->>Index: Hash tag lookup (exact)
    Dedup->>FastEmbed: Generate embedding
    Dedup->>Vector: Similarity search (semantic)
    Dedup-->>CaptureService: Not duplicate

    CaptureService->>FastEmbed: Generate embedding
    FastEmbed-->>CaptureService: [384-dim vector]

    par Store in all layers
        CaptureService->>Persistence: Store memory
        CaptureService->>Index: Index for FTS
        CaptureService->>Vector: Store embedding
    end

    CaptureService-->>CLI/MCP: CaptureResult{id, urn}
    CLI/MCP-->>User: Memory captured

Hybrid Search Flow

flowchart LR
    Query["Query: 'database storage decision'"]

    subgraph Search["Parallel Search"]
        BM25["BM25 Search<br/>(Index Layer)"]
        VectorSearch["Vector Search<br/>(Vector Layer)"]
    end

    subgraph Results["Raw Results"]
        BM25Results["id1: 2.3<br/>id2: 1.8<br/>id3: 1.2"]
        VectorResults["id2: 0.92<br/>id1: 0.85<br/>id4: 0.78"]
    end

    RRF["RRF Fusion<br/>score = sum(1/(k+rank))"]

    Final["Final Results<br/>(normalized 0.0-1.0)<br/>id2: 1.00<br/>id1: 0.87<br/>id3: 0.45<br/>id4: 0.38"]

    Query --> BM25
    Query --> VectorSearch
    BM25 --> BM25Results
    VectorSearch --> VectorResults
    BM25Results --> RRF
    VectorResults --> RRF
    RRF --> Final

ASCII Architecture Reference

                              +-----------------+
                              |   Access Layer  |
                              +-----------------+
                              |  CLI | MCP | Hooks
                              +--------+--------+
                                       |
                              +--------v--------+
                              |  Service Layer  |
                              +-----------------+
                              | Capture | Recall | GC
                              +--------+--------+
                                       |
        +------------------------------+------------------------------+
        |                              |                              |
+-------v-------+             +--------v-------+             +--------v-------+
|  Persistence  |             |     Index      |             |     Vector     |
|    Layer      |             |     Layer      |             |     Layer      |
+---------------+             +----------------+             +----------------+
|               |             |                |             |                |
| - Authoritative             | - Full-text    |             | - Embeddings   |
|   source of truth           |   search (BM25)|             |   (384-dim)    |
| - ACID storage              | - Faceted      |             | - Similarity   |
| - Durable                   |   filtering    |             |   search (ANN) |
|               |             |                |             |                |
+-------+-------+             +--------+-------+             +--------+-------+
        |                              |                              |
+-------v-------+             +--------v-------+             +--------v-------+
|    SQLite     |             | SQLite + FTS5  |             |    usearch     |
|  (default)    |             |   (default)    |             |   (HNSW)       |
+---------------+             +----------------+             +----------------+
|  PostgreSQL   |             |  PostgreSQL    |             |   pgvector     |
+---------------+             +----------------+             +----------------+
|  Filesystem   |             |  RediSearch    |             | Redis Vector   |
+---------------+             +----------------+             +----------------+

Layer Responsibilities

Layer	Purpose	Default Backend	Alternatives
Persistence	Authoritative storage, ACID guarantees	SQLite	PostgreSQL, Filesystem
Index	Full-text search, BM25 ranking	SQLite + FTS5	PostgreSQL, RediSearch
Vector	Embedding storage, ANN search	usearch (HNSW)	pgvector, Redis Vector

For detailed architecture documentation, see src/storage/traits/mod.rs.

Development

Prerequisites

Rust 1.88+ (Edition 2024)
Git 2.30+
cargo-deny for supply chain security

Setup

git clone https://github.com/zircote/subcog.git
cd subcog

# Build
cargo build

# Run tests
cargo test

# Run all checks
cargo fmt -- --check && \
cargo clippy --all-targets --all-features -- -D warnings && \
cargo test && \
cargo doc --no-deps && \
cargo deny check

Project Structure

src/
├── lib.rs              # Library entry point
├── main.rs             # CLI entry point
├── models/             # Data structures (Memory, Domain, Namespace)
├── storage/            # Storage backends (SQLite, PostgreSQL, usearch)
│   └── traits/         # Backend trait definitions (see mod.rs for docs)
├── services/           # Business logic (Capture, Recall, GC)
├── mcp/                # MCP server implementation
├── hooks/              # Claude Code hook handlers
├── embedding/          # Vector embedding generation
└── observability/      # Tracing, metrics, logging

docs/
├── QUICKSTART.md       # Getting started guide
├── TROUBLESHOOTING.md  # Common issues and solutions
├── PERFORMANCE.md      # Performance tuning guide
├── research/           # Research documents
└── spec/               # Specification documents

Configuration

Configuration file at ~/.config/subcog/config.toml:

[storage]
backend = "sqlite"  # "sqlite", "postgres", "filesystem"
data_dir = "~/.local/share/subcog"

[embedding]
model = "all-MiniLM-L6-v2"
dimensions = 384

[hooks]
enabled = true
session_start_timeout_ms = 2000
user_prompt_timeout_ms = 50

[llm]
provider = "anthropic"  # Optional: for Tier 3 features

Faceted Capture

Memories can be tagged with project, branch, and file path:

# Capture with facets (auto-detected from git context)
subcog capture --namespace decisions "Use PostgreSQL"

# Capture with explicit facets
subcog capture --namespace decisions --project my-project --branch feature/auth "Added JWT support"

# Search within a project
subcog recall "authentication" --project my-project

# Search within a branch
subcog recall "bug fix" --branch feature/auth

# Include tombstoned memories
subcog recall "old decision" --include-tombstoned

Branch Garbage Collection

Clean up memories from deleted branches:

# GC current project (dry-run)
subcog gc --dry-run

# GC specific branch
subcog gc --branch feature/old-branch

# Purge tombstoned memories older than 30 days
subcog gc --purge --older-than 30d

Performance Targets

Metric	Target	Actual
Cold start	<10ms	~5ms
Capture latency	<30ms	~25ms
Search latency (100 memories)	<20ms	~82us
Search latency (1,000 memories)	<50ms	~413us
Search latency (10,000 memories)	<100ms	~3.7ms
Binary size	<100MB	~50MB
Memory (idle)	<50MB	~30MB

All performance targets are exceeded by 10-100x. Benchmarks run via cargo bench.

For performance tuning, see docs/PERFORMANCE.md.

Documentation

Document	Description
INSTALLATION.md	Complete installation guide (npm, Docker, Homebrew, Cargo)
QUICKSTART.md	Getting started guide
TROUBLESHOOTING.md	Common issues and solutions
PERFORMANCE.md	Performance tuning guide
environment-variables.md	Environment variable reference
URN-GUIDE.md	Memory URN scheme documentation

Specification

Active Work

See docs/spec/active/ for current work in progress.

Completed Specifications

Full specification documents for the storage architecture are in docs/spec/completed/2026-01-03-storage-simplification/:

REQUIREMENTS.md - Product requirements
ARCHITECTURE.md - Technical architecture
IMPLEMENTATION_PLAN.md - Phased implementation
DECISIONS.md - Architecture decision records

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

fastembed - Embedding generation
usearch - Vector similarity search
rmcp - MCP protocol implementation

Name		Name	Last commit message	Last commit date
Latest commit History 548 Commits
.claude-plugin		.claude-plugin
.claude		.claude
.github		.github
.vscode		.vscode
benches		benches
commands		commands
docker		docker
docs		docs
hooks		hooks
npm		npm
scripts		scripts
skills		skills
src		src
tests		tests
.claude_settings.json		.claude_settings.json
.dockerignore		.dockerignore
.editorconfig		.editorconfig
.gitattributes		.gitattributes
.gitignore		.gitignore
.markdownlint.json		.markdownlint.json
.trivy.yaml		.trivy.yaml
.trivyignore		.trivyignore
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CODEOWNERS		CODEOWNERS
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
SECURITY.md		SECURITY.md
clippy.toml		clippy.toml
deny.toml		deny.toml
example.config.toml		example.config.toml
rustfmt.toml		rustfmt.toml
subcog.mcp.json		subcog.mcp.json

License

zircote/subcog

Folders and files

Latest commit

History

Repository files navigation

Subcog

Overview

ADR Compliance

Benchmark Results

Features

Core (Always Available)

Enhanced (Opt-in)

LLM-Powered (Requires Provider)

Installation

Quick Start

Score Normalization

MCP Server

Available MCP Tools

Transport Security

Stdio Transport (Default)

HTTP Transport (Optional)

Data Protection

Claude Code Hooks

Migration

Architecture

System Architecture Diagram

Data Flow Diagram

Hybrid Search Flow

ASCII Architecture Reference

Layer Responsibilities

Development

Prerequisites

Setup

Project Structure

Configuration

Faceted Capture

Branch Garbage Collection

Performance Targets

Documentation

Specification

Active Work

Completed Specifications

License

Acknowledgments

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 25

Packages 0

Uh oh!

Uh oh!

Contributors 5

Uh oh!

Languages

Packages