New workspace member crate `kb-store-sqlite` (allowed deps only:
kb-core, kb-config, rusqlite[bundled], refinery, serde, serde_json,
time, blake3, tracing, anyhow, thiserror; dev-deps add kb-parse-md /
kb-normalize / kb-chunk for the contract round-trip test).
Migration V001 replaces the P0-1 stub with the full §5 DDL (assets,
documents, document_tags, blocks, chunks with policy_hash,
embedding_records, jobs, ingest_runs, answers, eval_runs,
eval_query_results) plus the §5 indexes. FTS5 virtual table + triggers
remain deferred to V002 (P2-1).
Public surface per task spec:
SqliteStore::open / run_migrations / put_asset_with_bytes
impl DocumentStore for SqliteStore (7 trait methods)
impl JobRepo for SqliteStore (4 trait methods)
StoreError { Sqlx, Migration, Conflict }
Behavior:
- Pragmas at open: foreign_keys=ON, journal_mode=WAL,
synchronous=NORMAL, temp_store=MEMORY.
- Asset writer: byte_len ≤ copy_threshold_mb * 1MiB → copy to
data_dir/assets/<aa>/<asset_id> (mode 0o644 on Unix), else
reference. blake3(bytes) verified against asset.checksum; mismatch →
Conflict.
- Idempotency: put_document UPSERTs and bumps doc_version + 1 on
conflict; put_blocks / put_chunks DELETE-then-INSERT; document_tags
re-derived per put_document.
- get_document rehydrates blocks via payload_json ordered by stream
ordinal.
- list_documents builds dynamic WHERE from DocFilter (lang / trust_min
/ path_glob via GLOB / tags_any via document_tags subquery).
- JobRepo: jobs.kind/status are stored as lowercase enum tags; create
mints a 32-hex JobId via blake3(kind || payload || nanos).
Tests follow in subsequent commits.
24 lines
931 B
Rust
24 lines
931 B
Rust
//! `kb-store-sqlite` — SQLite-backed implementations of
|
|
//! [`kb_core::DocumentStore`] and [`kb_core::JobRepo`] (§7.2), plus the
|
|
//! asset writer that copies (or references) raw bytes per design §5.2.
|
|
//!
|
|
//! Schema is owned by `migrations/V001__init.sql` (workspace root), which
|
|
//! ships the full §5 DDL minus the FTS5 virtual table + triggers (those
|
|
//! land in P2-1's `V002`).
|
|
//!
|
|
//! Allowed deps per task spec: `kb-core`, `kb-config`, `rusqlite`,
|
|
//! `refinery`, `serde_json`, `time`, `blake3`, `tracing`, `anyhow`,
|
|
//! `thiserror`. NOT allowed: `kb-parse-*`, `kb-normalize`, `kb-chunk`,
|
|
//! `kb-store-vector`, `kb-source-fs`, etc. (`kb-parse-md`, `kb-normalize`,
|
|
//! `kb-chunk` may appear as **dev-deps** — see `Cargo.toml` — to drive
|
|
//! the contract round-trip test off a real Markdown fixture.)
|
|
|
|
mod error;
|
|
mod schema;
|
|
mod store;
|
|
mod documents;
|
|
mod jobs;
|
|
|
|
pub use error::StoreError;
|
|
pub use store::SqliteStore;
|