altair823
8dcedc4b11
feat(p10-r2): V007 trigram migration + design §5.5 + fts diff-check
Task A2 + A3 한 묶음.
migrations/V007__fts_trigram.sql 신규:
- chunks_fts shadow 를 DROP + 재생성 (tokenize = trigram).
- chunks_ai/ad/au trigger 재생성 (V002 와 동일).
- chunks 에서 backfill INSERT — 사용자 re-ingest 불필요, V007 자동.
- V002 는 historical cold-upgrade replay 위해 그대로 유지.
design §5.5 갱신:
- verbatim block 의 tokenize 만 trigram 으로 교체.
- §5.5 본문 상단에 한국어 채택 사유 + trade-off (영어 lexical 변경,
BM25 분포, 디스크 ~2-10x, contentless 아님) prose 한 단락 추가.
crates/kebab-store-sqlite/tests/fts.rs:
- fts_v002_matches_design_section_5_5_verbatim →
fts_v007_matches_design_section_5_5_verbatim 으로 rename.
- extract_migration_5_5_verbatim_block() 의 include_str! path 를
V007__fts_trigram.sql 로 변경. 주석/assertion msg V007 로.
- V002 cold-upgrade test 들 (fts_v002_backfill_*) 은 그대로 유지.
검증: cargo test -p kebab-store-sqlite --test fts → 10/10 PASS
(`fts_v007_matches_design_section_5_5_verbatim` 포함).
Codex round 1/2 의 design §5.5 contentless 정정·trigram tokenizer
채택 사유 명시 발견 반영.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-23 00:52:40 +00:00
..
2026-05-23 00:52:40 +00:00
2026-05-07 19:11:47 +09:00
2026-05-07 10:58:24 +09:00
2026-05-07 14:44:09 +09:00
2026-05-07 17:29:30 +09:00
2026-05-08 18:00:10 +09:00
2026-05-09 14:10:08 +09:00
2026-05-09 17:36:51 +09:00
2026-05-10 00:45:29 +09:00
2026-05-10 03:26:40 +09:00
2026-05-10 12:05:31 +09:00
2026-05-10 17:40:47 +09:00
2026-05-10 22:05:09 +09:00
2026-05-10 22:59:03 +09:00
2026-05-10 18:55:05 +09:00
2026-05-10 20:05:27 +09:00
2026-05-20 13:24:16 +00:00
2026-05-23 00:43:31 +00:00