Handles ingestion, enrichment, storage, and retrieval of research knowledge.
- Connect to external data sources / papers / datasets
- Chunking, embedding generation
- Vector + metadata store operations
- Retrieval + ranking interfaces
- Vector DB: (e.g. PostgreSQL + pgvector / Milvus / Pinecone)
- Embeddings: (e.g. OpenAI / local models)
knowledge/
ingestion/
processing/
storage/
retrieval/
tests/
- Define ingestion pipeline contract
- Implement first retriever
- Caching & freshness policy
See CODEOWNERS.