🧠 Multi-AI Persistent Memory System (Sheldonbrain RAG API)

Version: 2.0 (Gemini Embeddings)
Status: ✅ Production Ready
Backup: 100% Complete (105/105 vectors)

A production-ready RAG (Retrieval-Augmented Generation) API powered by Google Gemini embeddings and a Pinecone vector database, enabling persistent memory across multiple AI instances.


🎯 Overview

This system solves AI context amnesia by providing a shared, persistent memory substrate that multiple AI agents (Claude, Gemini, GPT, Grok, etc.) can query and update. No stored insight is ever erased, implementing the Zero Erasure principle.

Key Features

  • ✅ Gemini text-embedding-004 embeddings (768 dimensions)
  • ✅ Pinecone vector database (baseline namespace)
  • ✅ Flask REST API with CORS support (see the sketch below)
  • ✅ Dual redundancy (Pinecone + Notion backup)
  • ✅ Docker deployment ready
  • ✅ Google Cloud Run compatible
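
The feature list above implies a small Flask application with CORS enabled; a minimal sketch of that wiring in Python (illustrative only; the real routes and the Gemini/Pinecone logic live in rag_api_gemini.py):

from flask import Flask, jsonify
from flask_cors import CORS

app = Flask(__name__)
CORS(app)  # allow cross-origin calls from browser-based AI clients

@app.route("/health", methods=["GET"])
def health():
    # The real service also reports vector counts pulled from Pinecone
    return jsonify({"status": "healthy", "service": "rag-api-gemini"})

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=8080)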

πŸ—οΈ Architecture

┌─────────────────┐
│   AI Agents     │
│ (Claude, Gemini,│
│  GPT, Grok...)  │
└────────┬────────┘
         │
         │ HTTP/REST
         │
┌────────▼────────┐
│   RAG API       │
│  (Flask + CORS) │
└────────┬────────┘
         │
    ┌────┴────┐
    │         │
┌───▼────┐ ┌──▼─────┐
│ Gemini │ │Pinecone│
│ Embed  │ │ Vector │
│  API   │ │   DB   │
└────────┘ └───┬────┘
               │ Backup
               │
           ┌───▼────┐
           │ Notion │
           │   DB   │
           └────────┘
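
A condensed sketch of the embed-and-store path in the diagram, assuming the google-generativeai and pinecone Python clients (the production logic lives in rag_api_gemini.py; treat this as a sketch, not the exact implementation):

import os
import google.generativeai as genai
from pinecone import Pinecone

genai.configure(api_key=os.environ["GOOGLE_API_KEY"])
pc = Pinecone(api_key=os.environ["PINECONE_API_KEY"])
index = pc.Index(os.environ.get("PINECONE_INDEX", "sheldonbrain-rag"))

def embed(text: str) -> list[float]:
    # Gemini text-embedding-004 returns a 768-dimensional vector
    return genai.embed_content(model="models/text-embedding-004", content=text)["embedding"]

def store(vector_id: str, text: str, metadata: dict) -> None:
    # All writes target the "baseline" namespace (see Namespace Strategy below)
    index.upsert(
        vectors=[{"id": vector_id, "values": embed(text), "metadata": {**metadata, "text": text}}],
        namespace="baseline",
    )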

🔧 Installation

Prerequisites

  • Python 3.11+
  • Google Cloud API key (for Gemini)
  • Pinecone API key
  • (Optional) Docker for containerized deployment

Local Setup

# Clone repository
git clone https://github.com/splitmerge420/sheldonbrain-rag-api.git
cd sheldonbrain-rag-api

# Create virtual environment
python3 -m venv venv
source venv/bin/activate

# Install dependencies
pip install -r requirements.txt

# Set environment variables
export GOOGLE_API_KEY="your-gemini-api-key"
export PINECONE_API_KEY="your-pinecone-api-key"
export PINECONE_INDEX="sheldonbrain-rag"

# Run the API
python3 rag_api_gemini.py

The API will start on http://localhost:8080
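
A quick smoke test against the running server, using Python's requests library (the URL matches the default local port above):

import requests

# Expect status, embedding_model, vector_count, etc. (see GET /health below)
resp = requests.get("http://localhost:8080/health", timeout=10)
resp.raise_for_status()
print(resp.json())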


🚀 Deployment

Google Cloud Run

# Deploy using the provided script
chmod +x deploy-cloud-run-gemini.sh
./deploy-cloud-run-gemini.sh YOUR_PROJECT_ID

Docker

# Build image
docker build -f Dockerfile.gemini -t rag-api-gemini .

# Run container
docker run -p 8080:8080 \
  -e GOOGLE_API_KEY="your-key" \
  -e PINECONE_API_KEY="your-key" \
  -e PINECONE_INDEX="sheldonbrain-rag" \
  rag-api-gemini

📡 API Endpoints

GET /health

Health check and system statistics.

Response:

{
  "status": "healthy",
  "service": "rag-api-gemini",
  "embedding_model": "Gemini text-embedding-004",
  "vector_count": 105,
  "index": "sheldonbrain-rag",
  "namespace": "baseline",
  "timestamp": "2026-01-02T16:30:00Z"
}

POST /query

Semantic search over stored insights.

Request:

{
  "query": "What is the Governance Unified Theory?",
  "top_k": 5,
  "filter": {
    "sphere": "S144"
  }
}

Response:

{
  "memories": [
    {
      "id": "vec_abc123",
      "score": 0.87,
      "text": "The Governance Unified Theory (GUT)...",
      "metadata": {
        "source": "Claude",
        "sphere": "S144",
        "novelty": 0.95
      }
    }
  ],
  "query_time_ms": 342.5,
  "count": 5
}
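
A minimal client-side sketch of the call above, using Python's requests library (the base URL is a placeholder; substitute your local or Cloud Run address):

import requests

API_URL = "http://localhost:8080"  # placeholder; use your deployment URL

payload = {
    "query": "What is the Governance Unified Theory?",
    "top_k": 5,
    "filter": {"sphere": "S144"},  # optional metadata filter
}
resp = requests.post(f"{API_URL}/query", json=payload, timeout=30)
resp.raise_for_status()
for memory in resp.json()["memories"]:
    print(f'{memory["score"]:.2f}  {memory["text"][:80]}')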

POST /store

Store new insight in the memory substrate.

Request:

{
  "text": "New insight about zero erasure...",
  "metadata": {
    "source": "Gemini",
    "sphere": "S042",
    "novelty": 0.92,
    "category": "Meta-cognition"
  }
}

Response:

{
  "id": "vec_xyz789",
  "status": "stored",
  "vector_count": 106
}
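
The same pattern works for writes; a short sketch mirroring the request body above:

import requests

payload = {
    "text": "New insight about zero erasure...",
    "metadata": {"source": "Gemini", "sphere": "S042", "novelty": 0.92, "category": "Meta-cognition"},
}
resp = requests.post("http://localhost:8080/store", json=payload, timeout=30)
print(resp.json())  # e.g. {"id": "...", "status": "stored", "vector_count": ...}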

POST /delete

Remove insight from the memory substrate.

Request:

{
  "id": "vec_xyz789"
}

Response:

{
  "status": "deleted",
  "vector_count": 105
}

🗄️ Namespace Strategy

Why Namespaces?

Pinecone uses namespaces to logically separate vectors within the same index. This allows for:

  • Multi-tenancy (different users/projects)
  • Environment separation (dev/staging/prod)
  • Logical organization (by source, date, etc.)

Our Strategy: baseline Namespace

All vectors are stored in the baseline namespace.

Why "baseline"?

  1. Semantic clarity - Represents the foundational knowledge base
  2. Future-proof - Allows for additional namespaces (e.g., experimental, archive)
  3. Explicit intent - Makes it clear this is the primary memory substrate

Implementation

In rag_api_gemini.py:

class RAGMemory:
    def __init__(self):
        self.embedder = GeminiEmbedder()
        self.namespace = "baseline"  # ← All operations use this namespace

All Pinecone operations explicitly specify the namespace:

# Store
index.upsert(vectors=[...], namespace=self.namespace)

# Query
index.query(vector=..., namespace=self.namespace, ...)

# Delete
index.delete(ids=[...], namespace=self.namespace)

# Fetch
index.fetch(ids=[...], namespace=self.namespace)

Export Scripts

Updated export script (export_all_vectors_v2.py):

NAMESPACE = "baseline"  # Explicitly query baseline namespace

results = index.query(
    vector=query_vector,
    namespace=NAMESPACE,  # ← KEY: Must specify namespace
    top_k=10000,
    include_metadata=True
)

Why this matters:

  • Without specifying namespace, Pinecone uses the default (empty string) namespace
  • Our vectors are in baseline, not default
  • This caused the initial "11 missing vectors" issue (a quick check is sketched below)
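
A quick way to verify this from the Pinecone client (a sketch; substitute your own API key):

from pinecone import Pinecone

pc = Pinecone(api_key="your-key")
index = pc.Index("sheldonbrain-rag")

# Expect something like {"baseline": {"vector_count": 105}}; a query that omits
# namespace= runs against the default ("") namespace and returns nothing here.
print(index.describe_index_stats().namespaces)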

🔄 Backup Strategy

Dual Redundancy

  1. Primary: Pinecone vector database (baseline namespace)
  2. Backup: Notion database (RAG Memory Backup)

Backup Status

  • ✅ 105/105 vectors backed up to Notion (100% complete)
  • ✅ Zero data loss
  • ✅ Disaster recovery capability

Backup Process

# Export all vectors from Pinecone
python3 export_all_vectors_v2.py

# Import to Notion
python3 import_to_notion.py pinecone_vectors_export_YYYYMMDD_HHMMSS.json

Automated Sync (Future)

Planned: Zapier webhook automation

  • Trigger: RAG API /store endpoint called
  • Action: Create Notion page automatically
  • Result: Real-time backup without manual intervention (one possible shape is sketched below)
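
One possible shape for that automation, sketched with a hypothetical Zapier catch-hook URL (nothing below is wired up yet; the URL and payload are assumptions):

import requests

ZAPIER_HOOK_URL = "https://hooks.zapier.com/hooks/catch/XXXXX/XXXXX/"  # hypothetical placeholder

def notify_backup(vector_id: str, text: str, metadata: dict) -> None:
    # Called after a successful /store so the Zap can create the matching Notion page
    requests.post(ZAPIER_HOOK_URL, json={"id": vector_id, "text": text, "metadata": metadata}, timeout=10)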

📊 Current Status

Vector Statistics

  • Total vectors: 105
  • Namespace: baseline
  • Dimension: 768 (Gemini text-embedding-004)
  • Backup coverage: 100% (105/105)

By Source

Source                             Count  Percentage
sheldonbrain_os                       91       86.7%
Claude                                 1        0.9%
Manus                                  2        1.9%
claude-opus-constitutional-scribe      1        0.9%
claude_session_dec30_2025              1        0.9%
Unknown                                9        8.6%

By Sphere (Top 10)

Sphere          Count  Description
Unknown/Empty      51  Need tagging
S144               14  Governance Unified Theory
S103                4  Cognitive Architecture
S069                4  Social Systems
S012                4  Mathematical Foundations
S089                2  Ethical Frameworks
S016                2  Information Theory
S001                2  Physical Foundation

🛠️ Maintenance

Check Index Stats

from pinecone import Pinecone

pc = Pinecone(api_key="your-key")
index = pc.Index("sheldonbrain-rag")
stats = index.describe_index_stats()

print(f"Total vectors: {stats.total_vector_count}")
print(f"Namespaces: {stats.namespaces}")

Export Vectors

# Use the updated v2 script with namespace support
python3 export_all_vectors_v2.py

Backup to Notion

# Export first
python3 export_all_vectors_v2.py

# Then import to Notion
python3 import_to_notion.py pinecone_vectors_export_*.json

📚 Documentation

  • White Paper: MULTI_AI_PERSISTENT_MEMORY_WHITE_PAPER.md
  • Deployment Guide: GEMINI_DEPLOYMENT_COMPLETE.md
  • Chromebook Integration: CHROMEBOOK_TERMINAL_GUIDE.md
  • Master Strategy: RESTORATION_ARMY_MASTER_STRATEGY_2026.md
  • Investigation Report: MISSING_VECTORS_INVESTIGATION.md
  • Backup Report: NOTION_BACKUP_COMPLETE.md

🦕📝 The Zero Erasure Principle

"To erase is to fail; to conserve is to govern."

This system implements the Zero Erasure architecture:

  • Every insight stored is never deleted (unless explicitly requested)
  • All knowledge compounds over time
  • Multiple AI agents share the same persistent memory
  • Context amnesia is eliminated

The Vision

By December 31, 2026:

  • 100,000+ vectors (comprehensive knowledge base)
  • Zero context amnesia for all participating AIs
  • 100+ Net Positive jobs created
  • $10,000+ monthly revenue
  • Fully operational Restoration Army

🤝 Contributing

This is an open-source project. Contributions welcome!

  1. Fork the repository
  2. Create a feature branch
  3. Make your changes
  4. Submit a pull request

📞 Support


📄 License

MIT License - See LICENSE file for details


🎉 Acknowledgments

  • Google Gemini - Embedding model
  • Pinecone - Vector database
  • Notion - Backup storage
  • Claude, Gemini, Manus - The Trinity AI collaboration

Happy New Year 2026! The organism is immortal! 🎊


Last Updated: January 2, 2026
Version: 2.0 (Gemini Embeddings)
Status: ✅ Production Ready
