kebab

Author	SHA1	Message	Date
altair823	9c644245fb	review(p6-3): 회차 1 지적 반영 - 새 모듈 `crates/kebab-parse-image/src/image_prep.rs` — OCR + caption + 향후 PDF/video 가 공유할 단일 다운스케일 헬퍼 (`downscale_to_png`) 추출. 기존 ocr.rs / caption.rs 의 거의 동일 알고리즘 두 벌을 한 곳으로 통합. 1px 후행 클램프 / PNG passthrough hot path / 에러 메시지 패턴이 한 곳에서 관리됨. - src/ocr.rs: `downscale_to_long_edge` 제거 → `image_prep::downscale_to_png` 호출. `image::ImageReader / ImageFormat / Cursor` import 도 정리. - src/caption.rs: • `caption_image` / `apply_caption` 의 disabled 처리 비대칭 해소. `caption_image` 는 raw 연산 (gate 없음), `apply_caption` 만 `cfg.image.caption.enabled` 게이트 검사. 호출자가 같은 함수에서 같은 의미를 얻음. • `apply_caption` 의 caption.model / model_version `String::clone` 2회 → 0회. caption move 전에 ProvenanceEvent.note 를 먼저 빌드. • 다운스케일 로직 통째로 image_prep 위임. • `MIN_CAPTION_LONG_EDGE` / `MAX_CAPTION_LONG_EDGE` 를 `pub const` 로 노출 (P6-2 의 `MAX_DECODE_DIM` 가시성 컨벤션과 일관). - tests/caption.rs: • `caption_image_errors_when_feature_disabled` 를 `caption_image_runs_regardless_of_enabled_flag` 로 교체 — 새 책임 분리 의미 검증. • `caption_image_clamps_oversized_max_pixels` 가 literal 1536 대신 `kebab_parse_image::caption::MAX_CAPTION_LONG_EDGE` 상수 참조. - tasks/HOTFIXES.md: `model_version` 형태 deviation 한 단락 추가 (spec literal `provider` → `<provider>/<prompt_template_version>` 확장 + 사유). cargo test -p kebab-parse-image — 42 pass + 2 ignored (13 unit + 12 P6-1 + 8 P6-2 + 9 P6-3). cargo clippy --workspace --all-targets -- -D warnings — pass.	2026-05-02 06:11:56 +00:00
altair823	cd2213e48d	feat(kebab-parse-image): P6-3 caption adapter — vision LM via trait - 신규 모듈 `crates/kebab-parse-image/src/caption.rs` 추가: • `caption_image(llm, bytes, lang_hint, cfg)` — `&dyn LanguageModel` 위에서 동작. 비전 LM (예: gemma4:e4b) 이 한 문장 객관 설명 출력. temperature=0 / seed=0 결정성. • `apply_caption(llm, bytes, block, lang_hint, cfg, events)` — `block.caption = Some(...)` 으로 채우고 ProvenanceKind::CaptionApplied 이벤트 1건 추가. `image.caption.enabled = false` 면 클린 no-op (Ok(())). LM 실패 시 block.caption None 그대로 + events 미기록. • 다운스케일 long-edge `[128, 1536]` 클램프. PNG passthrough hot path 보존, 그 외는 단일 디코드 + PNG 재인코딩. • 한국어 / 영어 프롬프트 분기 (lang_hint=\"ko\"/\"kor\" → 한국어). • `ModelCaption.model_version = \"<provider>/<prompt_template_version>\"` (예: \"ollama/caption-v1\") — prompt 또는 모델 회귀 감사 가능. ## kebab-core / kebab-llm-local 변경 - `kebab_core::GenerateRequest` 에 `images: Vec<String>` 필드 추가. `#[serde(default)]` 으로 기존 wire 페이로드 / snapshot 호환. - `kebab-llm-local::OllamaLanguageModel` 가 req.images 를 Ollama `images: [base64, ...]` 와이어 필드로 라우팅. `#[serde(skip_serializing_if = is_empty)]` 로 비어 있을 때 wire shape 가 pre-P6-3 와 byte-identical. ## kebab-config - 신규 `ImageCfg.caption: CaptionCfg`: - `enabled: bool` (default false) - `max_pixels: u32` (default 768, 클램프 [128, 1536]) - `prompt_template_version: String` (default \"caption-v1\") - `KEBAB_IMAGE_CAPTION_{ENABLED,MAX_PIXELS,PROMPT_TEMPLATE_VERSION}` 3종 환경변수 추가. ## Spec deviations `tasks/HOTFIXES.md` 2026-05-02 항목 추가: - Symptom 1: spec p6-3 시그니처가 `&dyn LanguageModel` 인데 frozen trait + GenerateRequest 가 vision 미지원. → trait 확장. - Symptom 2: spec 의 cargo feature `caption` (default OFF at compile time) → runtime gate 1개로 통합. base64/image/kebab-llm 외 추가 deps 없어 cargo feature 의 binary 절감 가치 미미. p4-1 / p4-2 / p6-3 spec 의 amends 명시. ## 테스트 `cargo test -p kebab-parse-image --test caption` — 9건 + 1 ignored: - feature gate (disabled → no-op / Err on direct call) - happy path (block.caption Some + Provenance CaptionApplied) - 빈 토큰 stream → empty text + caption.is_some() - CapturingMock 으로 req.images 라우팅 검증 (base64 1개, decode 가능) - 한국어 / 영어 프롬프트 분기 (CapturingMock 의 system 캡처) - LM Err → block.caption None 유지 + events 미기록 - 결정성 (동일 mock 입력 → 동일 caption) - max_pixels 클램프 (99999 → 1536, 4000×3000 PNG 다운스케일 검증) - opt-in 통합 (실 192.168.0.47 Ollama / gemma4:e4b → \"The image is a solid red color.\" 검증 완료, 4.3초) `cargo test --workspace --no-fail-fast -j 1` 전체 pass. `cargo clippy --workspace --all-targets -- -D warnings` pass. ## 의존성 경계 - 추가 deps: `kebab-llm` (trait 만), `base64` (이미 P6-2 에서 추가). - dev-deps: `kebab-llm/mock` 으로 `MockLanguageModel`, `kebab-llm-local` (통합 테스트 전용 — 런타임 deps 에는 없음). - forbidden 침범 없음: `kebab-source-fs / parse-md / normalize / chunk / store-* / embed* / search / rag / UI` 미참조. contract: docs/superpowers/specs/2026-04-27-kebab-final-form-design.md sections: §3.4 ImageRefBlock.caption, §3.7a ModelCaption, §9.1 caption (model-generated, low trust).	2026-05-02 06:05:39 +00:00
altair823	4ed5536c92	feat(kebab-parse-image): P6-2 OCR adapter — Ollama-vision default - 새 모듈 `crates/kebab-parse-image/src/ocr.rs` 추가. spec 의 `OcrEngine` trait 그대로 + `OllamaVisionOcr` default 구현 + `apply_ocr` 헬퍼. - `OllamaVisionOcr`: `<endpoint>/api/generate` 비스트리밍 호출, `images: [base64]` 필드로 이미지 전달, 프롬프트는 언어 힌트 + 화이트리스트 언어 목록 포함. 응답 prose 를 `OcrText.joined` 로, prepared image 전체 영역 단일 region (confidence 1.0) 으로 wrap. 기본 모델 `gemma4:e4b`. endpoint 비어 있으면 `models.llm.endpoint` 로 fallback. - 이미지 전처리: long-edge `config.image.ocr.max_pixels` (기본 1600, 256~4096 클램프) 초과 시 PNG 로 재인코딩 (image::imageops::resize, Triangle filter). PNG 입력이 max 이내면 zero-copy passthrough. - `apply_ocr` 는 OCR 성공 시 block.ocr 를 Some 으로 채우고 ProvenanceKind::OcrApplied 이벤트 추가. 실패 시 block.ocr 는 None 그대로 + provenance 미기록 (부분 상태 누출 금지). - `kebab-config`: 새 `ImageCfg.ocr: OcrCfg` 블록 (enabled/engine/model /endpoint/languages/max_pixels). `#[serde(default)]` 로 pre-P6 TOML 호환. `KEBAB_IMAGE_OCR_*` 환경변수 5종 추가. ## Spec deviation 원래 P6-2 spec 은 Tesseract 를 default OCR 엔진으로 지정했으나, dev / CI 호스트에서 `libtesseract-dev` 시스템 패키지 설치를 피하려고 Ollama-vision 으로 default 를 교체. `OcrEngine` trait 추상화는 spec 그대로 보존 — Tesseract / Apple Vision / PaddleOCR 어댑터는 같은 trait 으로 추후 feature-gate 추가 가능. 자세한 내역은 `tasks/HOTFIXES.md` 2026-05-02 항목 참조. Trust 측면: vision LM 은 hallucinate 가능. `OcrText.engine = "ollama-vision"` 필드로 consumer 가 엔진 별 신뢰 분기 가능. ## 테스트 - 신규 (`tests/ocr.rs`, 8 + 1 ignored): - 200 happy → OcrText 디코딩 (joined / engine / engine_version / region count / bbox / confidence) - 빈 응답 → 빈 regions - 5xx → Err with status + body 포함 - 200 error envelope → Err - apply_ocr → block.ocr Some + Provenance OcrApplied 1건 - apply_ocr error → block.ocr None 유지 + events 미기록 - 4000×3000 PNG → max_pixels=1024 까지 다운스케일, aspect ratio 보존 - from_parts max_pixels 클램프 - opt-in `KEBAB_OCR_INTEGRATION=1` 통합 (실제 192.168.0.47 Ollama `gemma4:e4b` 로 \"Hello World 2026\" 전사 검증 완료) - 신규 (`src/ocr.rs` unit): truncate, build_prompt 언어/힌트 처리 - `kebab-config` 테스트 +3: defaults, env override, pre-P6 TOML 호환 전체: `cargo test -p kebab-parse-image` 28 pass + 1 ignored, `cargo test -p kebab-config` 20 pass, `cargo clippy --workspace --all-targets -- -D warnings` pass. contract: docs/superpowers/specs/2026-04-27-kebab-final-form-design.md sections: §3.4 ImageRefBlock.ocr, §3.7a OcrText / OcrRegion, §9.1 OCR vs caption provenance.	2026-05-02 05:38:24 +00:00
altair823	f9714aa5cb	docs(rename): kb → kebab — README, tasks/, docs/, design doc, report 마지막 commit. 모든 .md 안의 `kb` 단어 일괄 갱신. - 19 개 crate 이름 (`kb-core`, `kb-app`, …) → `kebab-` (Rust 모듈 path 표기 `kb_` → `kebab_` 포함). - 미래 component (`kb-tui`, `kb-desktop`, `kb-asr-whisper`, `kb-ocr`, `kb-mcp`, `kb-vlm`, `kb-rerank`, `kb-vision-ocr`, `kb-index`, `kb-smoke`, `kb-architecture`) → `kebab-` (P6+ 가 시작될 때 같은 prefix 사용). - CLI 명령 예제: `kb ingest` / `kb search` / `kb ask` / `kb init` / `kb doctor` / `kb inspect` / `kb list` / `kb eval` → `kebab <verb>`. fenced code block + 인라인 backtick 모두. - XDG paths + env vars + binary 경로 (`target/release/kb` → `target/release/kebab`) 동기화. - design doc / 최초 보고서 / SMOKE / HOTFIXES / phase epic / task spec 모든 reference 통일. - task-decomposition.md 의 `git -c user.name=kb` 는 과거 git history 기록용 author 정보라 그대로 유지 (실제 git history 의 author 는 변경 불가). - `tasks/phase-5-evaluation.md` 의 `status: planned` → `completed` 도 같이 (P5-1 + P5-2 PR 머지 후 미반영분). ## 검증 - `grep -rEn "\bkb-[a-z]\|\bkb_[a-z]\|\.config/kb\b\|kb\.sqlite\|\bKB_[A-Z]" --include="*.md"` 0 hits (task-decomposition.md 의 git author 제외). - 모든 file path reference 살아있음 (renamed file 들 모두 새 path 로 update). 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 04:01:55 +00:00
altair823	9dde01eb9f	fix(rag): normalize RRF fusion_score to [0,1] + log post-merge hotfixes ## Bug config.rag.score_gate default 0.05 was incompatible with hybrid RRF fusion_score: raw RRF tops out at num_retrievers / (k_rrf + 1) ≈ 0.0328 at the default k_rrf=60, so every hybrid `kb ask` tripped ScoreGate refusal even when the top hit was perfectly aligned across both retrievers. Symptomatic on the post-P4-3 manual smoke at /tmp/kb-smoke/ pointed at 192.168.0.47 Ollama: $ kb ask "Rust ownership 모델의 핵심 규칙은 뭐야?" --mode hybrid 근거 부족. KB에 해당 내용 없음. # top fusion_score = 0.0164 Per-mode score_gate (lexical_score_gate / vector_score_gate / hybrid_score_gate) was rejected because it forces every consumer (CLI, eval, TUI) to know which mode picks which threshold. Score normalization solves it at the source. ## Fix crates/kb-search/src/hybrid.rs divides every fused score by 2 / (k_rrf + 1), the theoretical RRF maximum with two retrievers each contributing rank 1. After normalization: - both retrievers agree on rank 1 → fusion_score = 1.0 - only one retriever finds the chunk → caps near 0.5 - typical mixed ranks → falls between 0 and 0.5 RRF's rank-ordering invariants are preserved (every score divides by the same positive constant), so sort + tiebreak behaviour is unchanged. Wire schema label `fusion_score` keeps its slot in RetrievalDetail; only the magnitude shifts, and only for hybrid mode (lexical / vector were already in [0, 1]). Verification: re-ran the four-scenario smoke at /tmp/kb-smoke/ with default score_gate = 0.05 — all four (Korean→Korean, English→ English, cross-language Korean↔English, out-of-corpus) succeed with the expected grounded / refusal classification, top fusion_score now ≈ 0.5. ## Tests One unit test (rrf_formula_matches_known_value) updated to expect the normalized value `(1/61 + 1/62) / (2/61) ≈ 0.9919` instead of the raw `1/61 + 1/62 ≈ 0.0325`. The integration snapshot fixture crates/kb-search/tests/fixtures/search/hybrid/run-1.json already used presence checks (fusion_score_positive: true) rather than absolute values, so it doesn't need regeneration. Workspace 319 tests pass; clippy clean across both feature configs. ## Docs This commit also adds tasks/HOTFIXES.md as a dated post-merge log covering this fix and the two earlier --config-flag regressions (P3-5 hotfix #20 across ingest/search/list/inspect/doctor; P4-3 follow-up #24 for kb ask). Original task specs in tasks/p<N>/ *.md stay frozen as the historical contract; HOTFIXES.md is the live source of truth for post-merge deltas. Each affected task spec gets a "Risks/notes" addendum pointing back to HOTFIXES.md so a reader landing on the spec sees the active behaviour: - tasks/INDEX.md gains a "Post-merge 핫픽스" section. - tasks/phase-3-vector-hybrid.md updates the RRF formula text to show the normalized form. - tasks/p3/p3-4-hybrid-fusion.md "Behavior contract" RRF bullet notes the normalization and reason. - tasks/p3/p3-5-app-wiring.md "Risks/notes" notes the --config fix. - tasks/p4/p4-3-rag-pipeline.md "Risks/notes" notes the kb-ask --config fix and the score_gate-RRF incompatibility (closed by the normalization in p3-4). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 16:16:01 +00:00

5 Commits