thoth.ingestion.flows.batch¶
Batch processing workflow.
Handles the /ingest-batch endpoint which processes a specific batch of files in parallel. Each batch writes to its own isolated LanceDB table.
Functions
|
Extract trace ID from X-Cloud-Trace-Context header. |
|
Create a job-scoped logger adapter. |
|
Return the global JobManager singleton (creates on first call). |
|
Return the global SourceRegistry singleton (creates on first call). |
|
Process a specific batch of files (called by Cloud Tasks). |
|
Set the trace context for the current request/task. |
|
Create and configure a logger with structured JSON output. |
Classes
|
Special type indicating an unconstrained type. |
|
Orchestrates the complete ingestion pipeline. |
|
|
|
Statistics for a job. |
|
|
|
Vector store for document embeddings using LanceDB. |