Architecture Overview
Thoth is a semantic search system for handbook documentation, designed to provide AI assistants with fast, relevant answers from your organization’s knowledge base.
System Components
flowchart TB
    subgraph External["External"]
        GL[GitLab]
        AI[AI Assistants]
    end
    subgraph GCP["Google Cloud Platform"]
        subgraph Services["Cloud Run Services"]
            MCP[MCP Server]
            ING[Ingestion Worker]
        end
        subgraph Storage["Storage"]
            GCS[(Cloud Storage)]
            SM[Secret Manager]
        end
        CT[Cloud Tasks]
    end
    GL -->|webhooks/polling| ING
    ING -->|batch tasks| CT
    CT -->|process batch| ING
    ING -->|store vectors| GCS
    GCS -->|load vectors| MCP
    SM -->|credentials| MCP & ING
    AI <-->|MCP protocol| MCP
Component Responsibilities
| Component | Responsibility |
|---|---|
| MCP Server | Handles semantic search queries from AI assistants via the MCP protocol |
| Ingestion Worker | Processes markdown files into vector embeddings |
| Cloud Tasks | Orchestrates parallel batch processing |
| Cloud Storage | Stores raw files and vector database backups |
| Secret Manager | Manages API tokens and credentials |
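The batch hand-off between the Ingestion Worker and Cloud Tasks can be pictured with a minimal sketch. It only shows how a file list might be split into task payloads. The function names, the payload fields (`batch_index`, `files`), and the batch size are illustrative assumptions, not Thoth's actual code, and no Cloud Tasks API is called.

```python
from typing import Dict, Iterator, List


def make_batches(paths: List[str], batch_size: int) -> Iterator[List[str]]:
    """Yield consecutive slices of `paths`, each at most `batch_size` long."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    for start in range(0, len(paths), batch_size):
        yield paths[start:start + batch_size]


def build_task_payloads(paths: List[str], batch_size: int = 2) -> List[Dict]:
    """Build one payload per batch (hypothetical shape, for illustration only).

    In a real deployment each payload would be enqueued as a task that calls
    back into the Ingestion Worker to process that batch.
    """
    return [
        {"batch_index": index, "files": batch}
        for index, batch in enumerate(make_batches(paths, batch_size))
    ]


payloads = build_task_payloads(["a.md", "b.md", "c.md"], batch_size=2)
```

Because each payload is independent, batches can be processed in parallel and a failed batch can be retried without repeating the others.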
Data Flow Summary
Ingestion: GitLab → Chunker → Embedder → LanceDB → GCS (LanceDB writes to GCS natively in cloud deployments)
Query: AI Assistant → MCP Server → LanceDB vector search → Results
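The two flows above can be sketched end to end with the standard library alone. This is a toy model under stated assumptions: the hashing `embed` function stands in for the sentence-transformers model, a plain Python list stands in for the LanceDB table, and every function name here is hypothetical rather than taken from Thoth.

```python
import hashlib
import math
from typing import Dict, List, Tuple

DIM = 16  # toy dimension; all-MiniLM-L6-v2 produces 384-dimensional vectors

Row = Tuple[str, str, List[float]]  # (source path, chunk text, embedding)


def chunk(text: str, max_words: int = 8) -> List[str]:
    """Split text into consecutive chunks of at most `max_words` words."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def embed(text: str) -> List[float]:
    """Toy stand-in embedder: hash each word into a bucket, then L2-normalize."""
    vec = [0.0] * DIM
    for word in text.lower().split():
        digest = hashlib.md5(word.encode("utf-8")).digest()
        vec[digest[0] % DIM] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def ingest(docs: Dict[str, str]) -> List[Row]:
    """Ingestion flow: chunk each document, embed each chunk, store the rows."""
    rows: List[Row] = []
    for path, text in docs.items():
        for piece in chunk(text):
            rows.append((path, piece, embed(piece)))
    return rows


def search(rows: List[Row], query: str, k: int = 1) -> List[Tuple[float, str, str]]:
    """Query flow: embed the query and rank rows by cosine similarity."""
    q = embed(query)
    scored = [
        (sum(a * b for a, b in zip(q, vec)), path, piece)
        for path, piece, vec in rows
    ]
    scored.sort(key=lambda item: item[0], reverse=True)
    return scored[:k]


rows = ingest({
    "leave.md": "Employees accrue vacation days monthly",
    "expenses.md": "Submit travel receipts within thirty days",
})
results = search(rows, "Submit travel receipts within thirty days")
```

The vectors are normalized at embedding time, so the dot product in `search` equals cosine similarity. A real embedding model captures meaning rather than word overlap, which this toy embedder does not.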
Architecture Documents
Ingestion Pipeline - Document processing and embedding pipeline
MCP Server - Query handling and MCP protocol implementation
Deployment - CI/CD and infrastructure deployment
Technology Stack
| Layer | Technology |
|---|---|
| Vector Database | LanceDB |
| Embeddings | sentence-transformers (all-MiniLM-L6-v2) |
| API Protocol | Model Context Protocol (MCP) |
| Transport | HTTP/SSE |
| Compute | Google Cloud Run |
| Storage | Google Cloud Storage |
| IaC | Terraform Cloud |
| CI/CD | GitHub Actions |
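MCP messages are JSON-RPC 2.0, and a client invokes a server tool with the `tools/call` method. The sketch below builds such a request to show what an AI assistant might send to the MCP Server. The tool name `search_handbook` and the `query` and `limit` arguments are assumptions made for illustration; the tools Thoth actually exposes are described in the MCP Server document.

```python
import json
from typing import Any, Dict


def build_tool_call(request_id: int, tool_name: str, arguments: Dict[str, Any]) -> str:
    """Serialize a JSON-RPC 2.0 request that invokes an MCP tool."""
    message = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }
    return json.dumps(message)


# Hypothetical tool name and arguments, for illustration only.
request = build_tool_call(
    1,
    "search_handbook",
    {"query": "How do I request parental leave?", "limit": 3},
)
```

The server's reply carries the same `id`, which lets the client match responses to requests over a shared connection.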