fix(rag): fb-41 PR-8 — multi-hop synthesize safety in depth (pool 15 + self-check rule)

v0.18 cut 전 fb-41 multi-hop RAG **layered defense** — PR-7 의 pre-decompose probe gate 위에 추가 safety. PR-7 의 fix 만으로는 hybrid mode 의 RRF top_score 가 gate 통과 시 (도그푸딩 S7 의 caffeine query) hallucination 여전히 발생 — synthesize 단계 자체의 safety 보강 필요. **중요**: 본 PR 만으로는 S7 hallucination 완전 차단 안 됨 (gemma3:4b 의 prompt-following 한계 — 추가 dogfood S7 retest 에서 확인). 진짜 fix 는 PR-9 (NLI-based post-synthesis verification). PR-8 은 그 사이의 *partial mitigation + safety in depth* — latency 4× 개선 (614s → 158s) + future larger LLM 용 prompt rule. 설계: docs/superpowers/specs/2026-05-25-p9-fb-41-multi-hop-rag-design.md 계획: /build/cache/dogfood-v018/results/PR-9-DESIGN.md (사용자 결정 후 spec/plan 으로 promotion) ## 변경 - `crates/kebab-config/src/lib.rs`: - `RagCfg::multi_hop_max_pool_chunks` default **30 → 15**. - rationale doc — gemma3:4b 가 30-chunk large prompt 에서 citation rule 잃는 측정 결과. - 2 unit test (`default_*` rename + `legacy_*` assert) 갱신. - `crates/kebab-rag/src/pipeline.rs`: - `MULTI_HOP_SYNTHESIZE_SYSTEM_PROMPT` 에 **답하기 전 self-check** rule 추가 — "[원본 질문] 의 핵심 entity (고유명사, 화학식, 수치 단위, 코드명, 약자) 가 [근거] 본문에 literal 으로 등장하지 않으면 다른 entity 의 정보로 답을 합성하지 말고 '근거가 부족하다' 답한다". example (caffeine + Adam optimizer chunk) 도 명시. ## 도그푸딩 결과 (retest with PR-7 + PR-8) | query | path | grounded | latency | answer | |---|---|---|---|---| | caffeine formula | single-pass | false (LlmSelfJudge) | 30s | "근거가 부족하다" ✓ | | caffeine formula | multi-hop pre-fix | true ✗ | 141s | hallucination | | caffeine formula | multi-hop PR-7 | true ✗ | 143s | hallucination (probe gate top_score 0.5 > 0.30) | | caffeine formula | multi-hop PR-8 | true ✗ | **158s** | hallucination (LLM 가 새 rule 무시) — **latency 4× 개선** | PR-8 의 부분 성과: - pool 30→15 로 synthesize prompt size ↓ → latency 614s → 158s. - prompt rule 은 future larger LLM (gemma2:9b, qwen2.5:7b 등) 에서 가치 ↑. PR-8 의 한계: - gemma3:4b 의 prompt-following 한계 — strong rule 도 무시하고 다른 entity chunk (Adam optimizer formula) 의 본문을 caffeine 화학식 출처로 인용. - LLM-self-judge 기반 safety 의 ceiling. ## 진짜 fix → PR-9 (별 PR) 학계 / industry 표준 검색 결과 (Self-RAG, CRAG, Auto-GDA, MedTrust-RAG): deterministic post-synthesis verification 이 정답 path. **NLI-based groundedness check** — mDeBERTa-v3-base-xnli (280 MB multilingual) ONNX model 이 (premise=packed_chunks, hypothesis=answer) entailment 검사. score < 0.5 면 refuse. PR-8 위에 layered defense. ## 검증 - `cargo test -p kebab-config -p kebab-rag -j 1` — 모든 test 통과 (config default test 2개 갱신, rag tests 영향 없음). - `cargo clippy -p kebab-config -p kebab-rag --all-targets -j 1 -- -D warnings` clean. - 단일 crate 직렬 build (16 GB RAM 제약). - S7 dogfood retest — hallucination 여전 (PR 본문에 정직 명시). ## 변경 없음 - Wire schema — additive (config knob default 만 변경). - PR-7 의 probe gate — 그대로 작동 (gate 통과 시 PR-8 의 추가 safety layer). - 다른 도그푸딩 P1 항목 (citation 일관성, binary path) — 별 PR. ## 다음 - **PR-9a/b/c**: NLI-based post-synthesis verification — 진짜 fix. - PR-9 머지 후 dogfood S7 재검증 (예상: refuse + nli_score < 0.5). - v0.18.0 cut. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-25 12:44:31 +00:00
parent 71fb2cbcb3
commit 52a97303dc
2 changed files with 17 additions and 7 deletions
--- a/crates/kebab-config/src/lib.rs
+++ b/crates/kebab-config/src/lib.rs
@@ -202,8 +202,14 @@ pub struct RagCfg {
    /// p9-fb-41: hard ceiling on the deduped chunk pool. When the
    /// accumulated pool would exceed this many chunks the pipeline
    /// stops accepting new retrieval results and forces synthesize
-    /// with `forced_stop = true`. Default `30` — generous for
-    /// 5-hop / 10-hits multi-hop runs while still bounded.
+    /// with `forced_stop = true`.
+    ///
+    /// Default `15` — tuned down from the original 30 in the v0.18
+    /// pre-cut dogfood (`tasks/HOTFIXES.md` 2026-05-25 fb-41 post-PR-7
+    /// entry). With 30 chunks the synthesize prompt was large enough
+    /// for gemma3:4b to lose the citation rule + drift into unrelated
+    /// chunks; 15 keeps the prompt tight while still allowing 3-iter
+    /// cross-doc reasoning over ~5 chunks per iter.
    #[serde(default = "default_multi_hop_max_pool_chunks")]
    pub multi_hop_max_pool_chunks: u32,
 }
@@ -217,7 +223,7 @@ fn default_multi_hop_max_sub_queries_per_iter() -> u32 {
 }

 fn default_multi_hop_max_pool_chunks() -> u32 {
-    30
+    15
 }

 /// Settings for the image ingest pipeline (P6). `ocr` controls OCR
@@ -1164,8 +1170,11 @@ theme = "dark"
    }

    #[test]
-    fn default_multi_hop_max_pool_chunks_is_30() {
-        assert_eq!(Config::defaults().rag.multi_hop_max_pool_chunks, 30);
+    fn default_multi_hop_max_pool_chunks_is_15() {
+        // v0.18 dogfood (HOTFIXES 2026-05-25 fb-41 post-PR-7) tuned
+        // this down from 30 → 15 to keep the synthesize prompt tight
+        // enough for gemma3:4b to follow the citation rule.
+        assert_eq!(Config::defaults().rag.multi_hop_max_pool_chunks, 15);
    }

    #[test]
@@ -1200,7 +1209,8 @@ theme = "dark"
            .expect("parse legacy config");
        assert_eq!(c.rag.multi_hop_max_depth, 3);
        assert_eq!(c.rag.multi_hop_max_sub_queries_per_iter, 5);
-        assert_eq!(c.rag.multi_hop_max_pool_chunks, 30);
+        // v0.18 dogfood (post-PR-7): pool default 30 → 15.
+        assert_eq!(c.rag.multi_hop_max_pool_chunks, 15);
    }

    #[test]