kebab

Author	SHA1	Message	Date
altair823	3d45994693	refactor(config): signature paddle 경로 미디어화 + 바이트 불변 골든 ocr_engine_version_for_sig 가 det/rec/dict 를 호출자(미디어별)로부터 받도록 인자화 — image 는 [ingest.image.ocr], pdf 는 [ingest.pdf.ocr]. v2 의 pdf↔image paddle 비대칭 제거. engine_version_for_paths 신설(kebab-parse-image). 출력 문자열은 값 기반이라 v2 와 바이트 동일(불변식 #1). test seam + 골든 추가. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-04 12:44:27 +00:00
altair823	d5c69f6715	refactor(config): v3 경로 call-site sweep (kebab-app/kebab-eval/kebab-parse-image) 부모 경로에 .ingest 삽입(leaf 구조체 불변). src + 테스트 call-site 전부. kebab-cli 테스트의 v2 TOML fixture 는 from_file 자동변환(T6) 경로 검증용으로 유지. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-04 12:40:06 +00:00
altair823	f3a7222ec5	fix(ocr): PR #206 round-1 리뷰 반영 — 골든 CI 테스트 + PDF 튜닝 문서 + threshold const + mutex 복구 - [MEDIUM] 골든 CI 단위테스트 2건 추가: ctc_greedy_decode_golden (argmax_idx one-hot → decoded 문자열 검증), det_box_score_golden (box_score/unclip_rect golden corner 검증). 모델/ONNX 불요, CI 상주. ctc_greedy_decode를 자유 함수(ctc_greedy_decode_with_dict)로 추출하여 테스트 가능하게 함. - [MEDIUM] PDF paddle 튜닝 비대칭 문서화: build_pdf_ocr_engine에 paddle-onnx가 image.ocr.* 사용(pdf.ocr.* 아님) 이유 명시 + PdfOcrCfg.engine 필드 doc 갱신. - [MEDIUM] DBNet 이진화 매직넘버 0.3 → DET_BIN_THRESH const 추출 + score_thresh 기본값 느슨한 이유 1줄 주석. - [LOW] Mutex poison 복구: det/rec .expect("poisoned") → .unwrap_or_else(PoisonError::into_inner). 자산 panic이 ingest abort 안 되도록. - [LOW] DetBox.score dead field 제거 (box_score 결과는 필터에만 사용, 저장 불요). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-04 09:13:27 +00:00
altair823	375a0693e4	chore(ocr): T11/T12 — clippy clean + docs + v0.27.0 bump T11: fix 12 clippy lints in paddle_onnx.rs/paddle_e2e.rs (doc overindent, finish_non_exhaustive, map_or_else, RangeInclusive::contains, cast_lossless, is_some_and, usize::from). Full-workspace clippy -D warnings = 0. Smoke (paddle-onnx, real binary): clean_paragraph OCR verbatim-correct, real per-region confidence (0.99/0.96/0.95), FTS5 lexical hit on Korean(검색)+ English(embedding), parser_version folds \|ocr:1:paddle-onnx:<ver>. Big page <4s inference (5.6s ingest incl. one-time session load). T12: README [image.ocr].engine + ARCHITECTURE OCR row + SMOKE paddle-onnx config + HANDOFF + HOTFIXES dated entry. Workspace version 0.26.2 → 0.27.0 (minor: new engine value + config keys). .gitattributes: onnx as plain blobs (no git-lfs). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-04 08:36:19 +00:00
altair823	8cc4e6d563	fix(ocr): T10/T11 — unclip edge-offset (CER 0.26→0.005) + e2e gate + error tests Root cause found at T11 e2e: unclip_rect pushed corners radially from the centroid. For a wide/short text box the diagonal is near-horizontal, so the box barely grew in height and clipped character tops (ㄷ→ㄴ, 다→나). Rewrote unclip as a proper per-edge polygon offset along the rect's own (u,v) axes — height and width each grow by 2*distance, matching PaddleOCR pyclipper. Result (synthetic-ocr-bench, real inference): mean gate CER 0.2585 → 0.0049 (clean_paragraph/korean_heavy/numbers_table/tech_terms = 0.0), beating the 0.976 PoC baseline. Big page 3.9s < 5s. T10: dict-length-mismatch construction error + undecodable-bytes recognize error. T11 e2e: tests/paddle_e2e.rs CER<=0.05 gate (skips cleanly when assets absent). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-04 08:22:47 +00:00
altair823	901416d8e9	feat(ocr): T7-T9 — config overrides + engine factory + signature cascade T7: OcrCfg gains det_model/rec_model/dict overrides + score_thresh/ unclip_ratio/max_boxes (serde default, KEBAB_IMAGE_OCR_* env). OnnxPaddleOcr::new threads them via ModelPaths::from_config. T8: build_image_ocr_engine / build_pdf_ocr_engine factories return Box<dyn OcrEngine>; match on engine string (ollama-vision\|paddle-onnx\|err). ImagePipeline.ocr_engine + pdf_ocr_engine signatures switched to &dyn OcrEngine. OcrEngine gains model() for the progress label. T9: ingest_config_signature image/pdf branches emit \|ocr:1:{engine}:{engine_version} (memoized blake3 per asset-triple, m3-safe). Unit tests (a)(b)(c) added. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-04 08:15:30 +00:00
altair823	b706e3e88c	feat(ocr): T2-T6 OnnxPaddleOcr core engine — det/rec ONNX + DBNet postproc + CTC PP-OCRv5 ONNX OCR engine on the pinned ort rc.9 (no Python, no oar-ocr dep). Implements the recognize() pipeline end-to-end (compiles + unit-tested): - T2: OnnxPaddleOcr skeleton, OcrEngine impl, det/rec Session loaded once (Mutex-wrapped → Send+Sync), engine_version = blake3(det+rec+dict) cached once at construction, dict bounds-check (11945 lines vs 11947 rec classes). - T2 preproc: det ImageNet mean/std NCHW + limit_side_len 960 → ×32 round (golden 192x900→896x192 pinned); rec height-48 keep-aspect, (x-0.5)/0.5. - T3 det postproc: threshold 0.3 → imageproc contours → min-area rect via pure-Rust rotating calipers + convex hull → mean-prob box-score filter → pure-Rust unclip(ratio 1.5). No clipper2/OpenCV. - T4 crop+rectify: corner ordering + bilinear perspective warp to horizontal. - T5 rec+CTC: greedy decode with the T0a-confirmed mapping (idx0=blank, 1..=11945=dict[idx-1], 11946=space), rec-class bounds-check. - T6 assembly: reading-order OcrText with per-region bbox + real confidence. Unit tests (4 pass): det_target_dims golden, convex hull, min-area rect, unclip expansion. Large *.onnx assets stay untracked pending T12 LFS decision. Remaining: T7 config overrides, T8 factory (4 sites), T9 signature cascade, T10 error matrix, T11 gates (clippy/e2e CER), T12 docs+bump+PR. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-04 07:52:39 +00:00
altair823	aeaa18a564	feat(ingest): 진행 로그 개선 — 파일명/phase/heartbeat/slowest 요약 OCR/caption 켜진 볼트 ingest 가 중간부터 느릴 때 TTY 진행바가 파일명·phase· 모델·경과시간을 안 보여 "멈춤"처럼 보이던 문제 해결. - 신규 wire AssetPhase{idx,total,phase,model} + AssetTimings.ocr_ms/caption_ms (additive, ingest_progress.v1 유지) - app: apply_ocr/apply_caption/embed 진입 시 AssetPhase emit + ocr/caption 시간 측정 - cli: TTY 진행바에 현재 파일명 + phase(model) + asset 경과초(heartbeat), 종료 시 최장 소요 파일 top-5 요약(quiet 여도 출력, --json 미출력) - wire schema / README / HANDOFF / HOTFIXES 동기화, version 0.26.0 → 0.27.0 검증(리더): clippy 0, kebab-app/cli 61그룹·parse-image/tui 14그룹 0실패(-j8). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-03 10:52:26 +00:00
altair823	685007789a	style: cargo fmt --all (round 4 ingest log feature follow-up) Phase C4 executor 의 마지막 `fix(test): clippy + fmt fixes` commit 이 test file 부분만 fmt 적용. workspace 전체 fmt 누락 발견 → cargo fmt --all 적용. 모든 import alphabetical reorder + line wrapping 정합. 추가 untracked artifact 동시 commit: - docs/superpowers/specs/2026-05-28-v0.20-ingest-log-spec.md (491 line, ACCEPT) - docs/superpowers/plans/2026-05-28-v0.20-ingest-log-plan.md (616 line, ACCEPT) workspace test: 1370 passed / 0 failed / 50 ignored, ingest_log_smoke green. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-28 04:18:40 +00:00
altair823	7c85de065a	chore: workspace-wide cleanup — clippy::pedantic baseline + auto-fix cut PR v0.18.0 전 마지막 정리. 사용자 요청: "전체 코드베이스를 깔끔하고 알아보기 쉽게". ## Workspace lints - `Cargo.toml` 의 `[workspace.lints.clippy]` 에 `pedantic = "warn"` (priority -1) + 의도적 allow-list 추가: - cast_possible_truncation / cast_possible_wrap / cast_sign_loss / cast_precision_loss — ONNX i64 / hash modular reduction 등 의도적 truncation. - doc_markdown / missing_errors_doc / missing_panics_doc — cosmetic doc style. - too_many_lines / module_name_repetitions / must_use_candidate / needless_pass_by_value / manual_let_else / items_after_statements / similar_names — informational only. - format_collect / match_wildcard_for_single_variants / trivially_copy_pass_by_ref / unnecessary_wraps — intentional patterns (exhaustive match, future Result variants 등). - default_trait_access — `Foo::default()` 가 idiomatic. - float_cmp — NLI / RRF score 의 explicit threshold 비교 의도. - struct_excessive_bools / case_sensitive_file_extension_comparisons / naive_bytecount / ignore_without_reason — domain-specific 의도. - format_push_string / return_self_not_must_use / match_same_arms — builder / wire-label / hot-path 패턴 보존. - needless_continue / used_underscore_binding / nonminimal_bool / unreadable_literal / many_single_char_names / doc_link_with_quotes / assigning_clones / collapsible_str_replace / trivial_regex / elidable_lifetime_names / range_plus_one / explicit_iter_loop / implicit_hasher / ref_option — remaining low-value style. - 각 24 crate `Cargo.toml` 에 `[lints] workspace = true` 추가. ## Auto-fix `cargo clippy --workspace --all-targets --fix` 적용 — 128 files changed, 552 insertions / 472 deletions. 주로: - uninlined_format_args (~18): `format!("{}", x)` → `format!("{x}")`. - redundant_closure_for_method_calls (~33): `.map(\|x\| x.foo())` → `.map(T::foo)`. - 그 외 mechanical refactor. ## 검증 - `cargo clippy --workspace --all-targets -j 1 -- -D warnings` clean (pedantic + 모든 lint group). - `cargo test --workspace --no-fail-fast -j 1` — 1293 tests pass + 1 pre-existing flaky fail (`kebab-mcp::tools_call_ask_multi_hop::ask_tool_routes_multi_hop_true_to_decompose_first`, HOTFIX candidate, cleanup 무관). 회귀 0. Wire 영향: 없음. Behavior 영향: 없음 (mechanical refactor only). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-26 03:01:58 +00:00
altair823	41c5edc517	feat(image.ocr): request_timeout_secs config knob + closure of v0.17.1 미진행 v0.17.1 (PR #162) 가 LLM 쪽 hard-coded 300s 를 [models.llm] request_timeout_secs 로 풀어준 것과 같은 패턴을 OCR 어댑터에 적용. 사용자 결정으로 별 노브 분리 ([image.ocr] request_timeout_secs) — OCR 는 LLM 대비 cold start 패턴이 달라 독립 조절이 편함. - OcrCfg.request_timeout_secs: u64 (serde default 300) - KEBAB_IMAGE_OCR_REQUEST_TIMEOUT_SECS env override - OllamaVisionOcr::build / from_parts 시그니처에 timeout 인자 추가 - REQUEST_TIMEOUT 상수 제거 - 3 신규 unit test (default / env / legacy parse) — LlmCfg 패턴 그대로 - HOTFIXES 2026-05-25 v0.17.1 entry 의 두 미진행 항목 모두 closure (OCR timeout = 본 PR, --stream docs = PR #163 에서 이미 완료) 기존 config / 옛 KB 영향 없음 — 새 필드는 default 로 채워지고 동작도 동일 (300s). vision 모델 cold start 가 길면 env 또는 config 로 늘릴 수 있음. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 05:06:53 +00:00
th-kim0823	bf4ebf8d2a	feat(p10-1a-1): add Metadata.repo / git_branch / git_commit / code_lang Four optional, serde-skipped-when-None fields added to `Metadata` for code ingest context. All 11 downstream construction sites patched with `repo: None, git_branch: None, git_commit: None, code_lang: None`. Full workspace check (`--tests`) and per-crate test suite pass clean. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-15 15:44:18 +09:00
altair823	f867b36afb	feat(kebab-core): p9-fb-23 task 2 — CanonicalDocument gains last_chunker_version + last_embedding_version Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-04 17:50:25 +00:00
altair823	b40b0b3992	review(p6-3): 회차 2 — image_prep 회귀 테스트 + doc 일반화 - src/image_prep.rs: • 신규 unit 테스트 6건 — PNG passthrough (zero-decode + 바이트 동일성), JPEG → PNG 재인코딩, 1px 후행 클램프 (max=1601 / long=4001 irrational scale), aspect ratio (4:3 보존, 2% 이내), 손상 PNG Err, 인식 불가 바이트 Err. • 모듈 doc-comment 의 \"send to vision models\" 표현을 \"image-to-LM pipeline / channel\" 으로 일반화. 미래 PDF / video keyframe 등 호출자가 doc 만 보고 호출 의도 파악 가능. cargo test -p kebab-parse-image — 48 pass + 2 ignored (19 unit (+6 image_prep) + 12 P6-1 + 8 P6-2 + 9 P6-3). cargo clippy -p kebab-parse-image --all-targets -- -D warnings — pass.	2026-05-02 06:14:27 +00:00
altair823	9c644245fb	review(p6-3): 회차 1 지적 반영 - 새 모듈 `crates/kebab-parse-image/src/image_prep.rs` — OCR + caption + 향후 PDF/video 가 공유할 단일 다운스케일 헬퍼 (`downscale_to_png`) 추출. 기존 ocr.rs / caption.rs 의 거의 동일 알고리즘 두 벌을 한 곳으로 통합. 1px 후행 클램프 / PNG passthrough hot path / 에러 메시지 패턴이 한 곳에서 관리됨. - src/ocr.rs: `downscale_to_long_edge` 제거 → `image_prep::downscale_to_png` 호출. `image::ImageReader / ImageFormat / Cursor` import 도 정리. - src/caption.rs: • `caption_image` / `apply_caption` 의 disabled 처리 비대칭 해소. `caption_image` 는 raw 연산 (gate 없음), `apply_caption` 만 `cfg.image.caption.enabled` 게이트 검사. 호출자가 같은 함수에서 같은 의미를 얻음. • `apply_caption` 의 caption.model / model_version `String::clone` 2회 → 0회. caption move 전에 ProvenanceEvent.note 를 먼저 빌드. • 다운스케일 로직 통째로 image_prep 위임. • `MIN_CAPTION_LONG_EDGE` / `MAX_CAPTION_LONG_EDGE` 를 `pub const` 로 노출 (P6-2 의 `MAX_DECODE_DIM` 가시성 컨벤션과 일관). - tests/caption.rs: • `caption_image_errors_when_feature_disabled` 를 `caption_image_runs_regardless_of_enabled_flag` 로 교체 — 새 책임 분리 의미 검증. • `caption_image_clamps_oversized_max_pixels` 가 literal 1536 대신 `kebab_parse_image::caption::MAX_CAPTION_LONG_EDGE` 상수 참조. - tasks/HOTFIXES.md: `model_version` 형태 deviation 한 단락 추가 (spec literal `provider` → `<provider>/<prompt_template_version>` 확장 + 사유). cargo test -p kebab-parse-image — 42 pass + 2 ignored (13 unit + 12 P6-1 + 8 P6-2 + 9 P6-3). cargo clippy --workspace --all-targets -- -D warnings — pass.	2026-05-02 06:11:56 +00:00
altair823	cd2213e48d	feat(kebab-parse-image): P6-3 caption adapter — vision LM via trait - 신규 모듈 `crates/kebab-parse-image/src/caption.rs` 추가: • `caption_image(llm, bytes, lang_hint, cfg)` — `&dyn LanguageModel` 위에서 동작. 비전 LM (예: gemma4:e4b) 이 한 문장 객관 설명 출력. temperature=0 / seed=0 결정성. • `apply_caption(llm, bytes, block, lang_hint, cfg, events)` — `block.caption = Some(...)` 으로 채우고 ProvenanceKind::CaptionApplied 이벤트 1건 추가. `image.caption.enabled = false` 면 클린 no-op (Ok(())). LM 실패 시 block.caption None 그대로 + events 미기록. • 다운스케일 long-edge `[128, 1536]` 클램프. PNG passthrough hot path 보존, 그 외는 단일 디코드 + PNG 재인코딩. • 한국어 / 영어 프롬프트 분기 (lang_hint=\"ko\"/\"kor\" → 한국어). • `ModelCaption.model_version = \"<provider>/<prompt_template_version>\"` (예: \"ollama/caption-v1\") — prompt 또는 모델 회귀 감사 가능. ## kebab-core / kebab-llm-local 변경 - `kebab_core::GenerateRequest` 에 `images: Vec<String>` 필드 추가. `#[serde(default)]` 으로 기존 wire 페이로드 / snapshot 호환. - `kebab-llm-local::OllamaLanguageModel` 가 req.images 를 Ollama `images: [base64, ...]` 와이어 필드로 라우팅. `#[serde(skip_serializing_if = is_empty)]` 로 비어 있을 때 wire shape 가 pre-P6-3 와 byte-identical. ## kebab-config - 신규 `ImageCfg.caption: CaptionCfg`: - `enabled: bool` (default false) - `max_pixels: u32` (default 768, 클램프 [128, 1536]) - `prompt_template_version: String` (default \"caption-v1\") - `KEBAB_IMAGE_CAPTION_{ENABLED,MAX_PIXELS,PROMPT_TEMPLATE_VERSION}` 3종 환경변수 추가. ## Spec deviations `tasks/HOTFIXES.md` 2026-05-02 항목 추가: - Symptom 1: spec p6-3 시그니처가 `&dyn LanguageModel` 인데 frozen trait + GenerateRequest 가 vision 미지원. → trait 확장. - Symptom 2: spec 의 cargo feature `caption` (default OFF at compile time) → runtime gate 1개로 통합. base64/image/kebab-llm 외 추가 deps 없어 cargo feature 의 binary 절감 가치 미미. p4-1 / p4-2 / p6-3 spec 의 amends 명시. ## 테스트 `cargo test -p kebab-parse-image --test caption` — 9건 + 1 ignored: - feature gate (disabled → no-op / Err on direct call) - happy path (block.caption Some + Provenance CaptionApplied) - 빈 토큰 stream → empty text + caption.is_some() - CapturingMock 으로 req.images 라우팅 검증 (base64 1개, decode 가능) - 한국어 / 영어 프롬프트 분기 (CapturingMock 의 system 캡처) - LM Err → block.caption None 유지 + events 미기록 - 결정성 (동일 mock 입력 → 동일 caption) - max_pixels 클램프 (99999 → 1536, 4000×3000 PNG 다운스케일 검증) - opt-in 통합 (실 192.168.0.47 Ollama / gemma4:e4b → \"The image is a solid red color.\" 검증 완료, 4.3초) `cargo test --workspace --no-fail-fast -j 1` 전체 pass. `cargo clippy --workspace --all-targets -- -D warnings` pass. ## 의존성 경계 - 추가 deps: `kebab-llm` (trait 만), `base64` (이미 P6-2 에서 추가). - dev-deps: `kebab-llm/mock` 으로 `MockLanguageModel`, `kebab-llm-local` (통합 테스트 전용 — 런타임 deps 에는 없음). - forbidden 침범 없음: `kebab-source-fs / parse-md / normalize / chunk / store-* / embed* / search / rag / UI` 미참조. contract: docs/superpowers/specs/2026-04-27-kebab-final-form-design.md sections: §3.4 ImageRefBlock.caption, §3.7a ModelCaption, §9.1 caption (model-generated, low trust).	2026-05-02 06:05:39 +00:00
altair823	1539367692	review(p6-2): 회차 3 cosmetic — build() 회귀 테스트 + lib doc trust note - src/ocr.rs: • `OllamaVisionOcr` 에 `#[derive(Debug)]` 추가 (test 의 expect_err 바운드 충족용; reqwest::blocking::Client 도 Debug 구현). • 신규 unit 테스트 3건 (`build_rejects_empty_endpoint`, `build_rejects_empty_model_after_trim`, `build_clamps_max_pixels_outside_legal_range`) — 회차 2 에서 추가된 `fn build` 가드의 회귀 신호. - src/lib.rs: • 모듈-레벨 doc-comment 에 OCR 트러스트 정책 한 줄 추가 (\"LLM-driven default can hallucinate; OcrText.engine carries source identity\"). lib 사용자가 ocr 모듈 doc 까지 안 들어가도 의도 캐치 가능. cargo test -p kebab-parse-image — 31 pass + 1 ignored (11 unit + 12 P6-1 integration + 8 P6-2 integration). cargo clippy -p kebab-parse-image --all-targets -- -D warnings — pass.	2026-05-02 05:51:00 +00:00
altair823	2bede0030f	review(p6-2): 회차 2 지적 반영 - src/ocr.rs: • `OllamaVisionOcr::new` 와 `from_parts` 의 입력 검증을 공통 `fn build` 으로 통합. 두 생성자가 빈 endpoint / 빈 model / `max_pixels` 클램프 동일 invariant 를 공유 — \"테스트는 통과하지만 프로덕션은 panic\" 분기 차단. • `max_pixels` clamp 가 실제로 발동 시 `tracing::warn!` 로 사유 기록 (사용자가 \"왜 항상 4096?\" 디버깅 가능). • `downscale_to_long_edge` 의 long-axis 가 `f32` 라운딩으로 1px 초과하는 코너 케이스 (예: max=1601, long=4001) 후행 클램프로 엄격히 묶음. doc-comment 의 \"long edge is at most max_long_edge\" 가 실제 동작과 정확히 일치. - tests/ocr.rs: • 통합 테스트의 이중 게이트 (`#[ignore]` + `KEBAB_OCR_INTEGRATION=1`) 제거. `--ignored` 만으로 실행 의도 단일 신호화 — `kebab-llm-local` 의 통합 테스트 컨벤션과 일관됨. endpoint / model 의 env 오버라이드는 유지. cargo test -p kebab-parse-image — 28 pass + 1 ignored. cargo test -p kebab-config — 21 pass. cargo clippy --workspace --all-targets -- -D warnings — pass.	2026-05-02 05:48:23 +00:00
altair823	e869710d82	review(p6-2): 회차 1 지적 반영 - crates/kebab-config/src/lib.rs: • `OcrCfg.endpoint: String` (\"\" sentinel) → `Option<String>` 으로 교체. `#[serde(default)]` 적용. `KEBAB_IMAGE_OCR_ENDPOINT=\"\"` (빈 값) 도 None 으로 매핑하는 분기 추가. • 신규 회귀 테스트 `image_ocr_endpoint_empty_env_value_is_none`. - crates/kebab-parse-image/src/ocr.rs: • `OllamaVisionOcr::new` 의 endpoint fallback 로직을 새 `Option<String>` 스키마에 맞춰 정리 (`as_deref` + match). • `OllamaGenerateResponse` 의 dead `_other: HashMap<String, Value>` 필드 제거. `serde_json::Value` import 도 같이 정리. • `OllamaGenerateRequest.images: Vec<&'a str>` → `[&'a str; 1]` (호출당 vec! 알로케이션 제거, multi-image 는 OcrEngine trait 가 단일 이미지를 받으므로 OOS). • `downscale_to_long_edge` 단일-디코드로 리팩터. PNG passthrough hot path 보존 (header sniff 만으로 분기), 그 외 모든 경로는 decode 1회 + (필요 시) resize + PNG re-encode 1회로 통일. • `pub fn max_pixels(&self) -> u32` accessor 추가 — clamp 결과 검증 용 (단순 inspector). - crates/kebab-parse-image/tests/ocr.rs: • `cfg_for_endpoint` / 통합 테스트가 `Some(endpoint)` 형태로 갱신. • `from_parts_clamps_max_pixels_into_legal_range` 가 새 accessor 로 실제 클램프 결과 (256 / 4096 / 1024) 를 검증하도록 강화. • 통합 테스트가 폰트 부재 시 panic 대신 skip 하도록 분기. - crates/kebab-parse-image/tests/common/mod.rs: • `hello_world_png` 가 `anyhow::Result<Vec<u8>>` 반환하도록 변경. expect(\"DejaVu Sans Bold required\") 메시지를 \"only the opt-in OCR integration fixture needs this font\" 로 의도 명확화. cargo test -p kebab-parse-image — 28 pass + 1 ignored. cargo test -p kebab-config — 21 pass (+1 회귀). cargo clippy --workspace --all-targets -- -D warnings — pass. Reviewer-suggested workspace.dependencies 통합 (reqwest / base64) 은 P6-3 와 함께 처리할 수 있도록 follow-up 으로 두고 본 PR scope 에서 제외 (회차 1 본문에서 명시).	2026-05-02 05:45:25 +00:00
altair823	4ed5536c92	feat(kebab-parse-image): P6-2 OCR adapter — Ollama-vision default - 새 모듈 `crates/kebab-parse-image/src/ocr.rs` 추가. spec 의 `OcrEngine` trait 그대로 + `OllamaVisionOcr` default 구현 + `apply_ocr` 헬퍼. - `OllamaVisionOcr`: `<endpoint>/api/generate` 비스트리밍 호출, `images: [base64]` 필드로 이미지 전달, 프롬프트는 언어 힌트 + 화이트리스트 언어 목록 포함. 응답 prose 를 `OcrText.joined` 로, prepared image 전체 영역 단일 region (confidence 1.0) 으로 wrap. 기본 모델 `gemma4:e4b`. endpoint 비어 있으면 `models.llm.endpoint` 로 fallback. - 이미지 전처리: long-edge `config.image.ocr.max_pixels` (기본 1600, 256~4096 클램프) 초과 시 PNG 로 재인코딩 (image::imageops::resize, Triangle filter). PNG 입력이 max 이내면 zero-copy passthrough. - `apply_ocr` 는 OCR 성공 시 block.ocr 를 Some 으로 채우고 ProvenanceKind::OcrApplied 이벤트 추가. 실패 시 block.ocr 는 None 그대로 + provenance 미기록 (부분 상태 누출 금지). - `kebab-config`: 새 `ImageCfg.ocr: OcrCfg` 블록 (enabled/engine/model /endpoint/languages/max_pixels). `#[serde(default)]` 로 pre-P6 TOML 호환. `KEBAB_IMAGE_OCR_*` 환경변수 5종 추가. ## Spec deviation 원래 P6-2 spec 은 Tesseract 를 default OCR 엔진으로 지정했으나, dev / CI 호스트에서 `libtesseract-dev` 시스템 패키지 설치를 피하려고 Ollama-vision 으로 default 를 교체. `OcrEngine` trait 추상화는 spec 그대로 보존 — Tesseract / Apple Vision / PaddleOCR 어댑터는 같은 trait 으로 추후 feature-gate 추가 가능. 자세한 내역은 `tasks/HOTFIXES.md` 2026-05-02 항목 참조. Trust 측면: vision LM 은 hallucinate 가능. `OcrText.engine = "ollama-vision"` 필드로 consumer 가 엔진 별 신뢰 분기 가능. ## 테스트 - 신규 (`tests/ocr.rs`, 8 + 1 ignored): - 200 happy → OcrText 디코딩 (joined / engine / engine_version / region count / bbox / confidence) - 빈 응답 → 빈 regions - 5xx → Err with status + body 포함 - 200 error envelope → Err - apply_ocr → block.ocr Some + Provenance OcrApplied 1건 - apply_ocr error → block.ocr None 유지 + events 미기록 - 4000×3000 PNG → max_pixels=1024 까지 다운스케일, aspect ratio 보존 - from_parts max_pixels 클램프 - opt-in `KEBAB_OCR_INTEGRATION=1` 통합 (실제 192.168.0.47 Ollama `gemma4:e4b` 로 \"Hello World 2026\" 전사 검증 완료) - 신규 (`src/ocr.rs` unit): truncate, build_prompt 언어/힌트 처리 - `kebab-config` 테스트 +3: defaults, env override, pre-P6 TOML 호환 전체: `cargo test -p kebab-parse-image` 28 pass + 1 ignored, `cargo test -p kebab-config` 20 pass, `cargo clippy --workspace --all-targets -- -D warnings` pass. contract: docs/superpowers/specs/2026-04-27-kebab-final-form-design.md sections: §3.4 ImageRefBlock.ocr, §3.7a OcrText / OcrRegion, §9.1 OCR vs caption provenance.	2026-05-02 05:38:24 +00:00
altair823	a4f895e8cc	review(p6-1): 회차 3 cosmetic 정리 - src/dims.rs: `with_guessed_format()` 의 `map_err(...)` 를 `.context()?` 로 정리. 회차 2 의 `match Some/None` → `.context()?` 정리와 호출 스타일 통일. - src/lib.rs: `(*format).to_string()` → `format.to_string()`. `format` 이 `&&'static str` 이라 명시 deref 없이 자동 호출 가능. - tests/common/mod.rs: `ImageFixture::workspace_root` / `config` 가시성을 `pub` → 모듈-비공개로 축소. 외부 호출자가 두 필드를 직접 읽지 않고 `ctx()` 만 사용함. cargo test -p kebab-parse-image — 16건 pass. cargo clippy -p kebab-parse-image --all-targets -- -D warnings — pass.	2026-05-02 05:19:13 +00:00
altair823	58d56467e5	review(p6-1): 회차 2 지적 반영 — GPS 안전성 + 디버깅 - src/exif_extract.rs: • `gps_decimal` 에 ±90 / ±180 범위 검증 추가. 비정상 EXIF (예: 위도 300°) 가 들어와도 wire 에 흘러나가지 않고 silent drop. • GPSLatitudeRef / GPSLongitudeRef 가 빠진 좌표는 양수 가정으로 내보내지 않고 None 반환 — 모호한 부호를 그대로 두는 대신 손상된 메타데이터로 처리. • `read_from_container` 실패 시 `tracing::debug!` 한 줄로 사유 기록 (운영시 \"EXIF 없음\" vs \"EXIF 손상\" 구분 단서). - src/dims.rs: `match Some/None` 을 `anyhow::Context::context()?` 로 압축. import 한 줄 추가. - src/lib.rs: `Vec::with_capacity` 를 dim_warning 분기에 따라 `2` / `3` 으로 정확히 맞추고 의미 주석 한 줄 추가. - tests/common/mod.rs: `build_exif_blob_gps` 를 `GpsFlavor` 파라미터로 일반화 (`Valid` / `NoRef` / `OutOfRange`). JPEG 스플라이스 로직은 `splice_exif_into_jpeg` 헬퍼로 추출. - tests/extractor.rs: 회귀 테스트 2건 추가 — `*Ref` 누락 좌표 드롭, out-of-range 위도 드롭 (경도는 정상 통과 검증). cargo test -p kebab-parse-image — 16건 (4 unit + 12 integration) pass. cargo clippy -p kebab-parse-image --all-targets -- -D warnings — pass.	2026-05-02 05:16:37 +00:00
altair823	194dd34668	review(p6-1): 회차 1 지적 반영 - Cargo.toml: 미사용 deps 제거 (`serde`, `thiserror`) + dev-deps 의 `serde_json` 중복 선언 제거. - src/lib.rs: 변수명 `decode_warning` → `dim_warning` (16k cap 초과 분기까지 포괄하므로 더 정확). - src/exif_extract.rs: `ascii_field` / `u32_field` 의 dead-flexibility `In` 인자 제거 (모든 호출이 `In::PRIMARY` 였음). 두 단 `if let` 을 Rust 2024 let-chain 으로 정리. EXIF 화이트리스트 출력 키를 workspace wire-schema 컨벤션에 맞춰 snake_case 로 통일 (`Make` → `make`, `DateTimeOriginal` → `date_time_original` 등). - tests/common/mod.rs: 호출되지 않는 `fake_path` 헬퍼 + `Path` import 제거. - tests/extractor.rs: snake_case 키로 assertion 갱신. cargo test -p kebab-parse-image — 14건 모두 pass. cargo clippy -p kebab-parse-image --all-targets -- -D warnings — pass.	2026-05-02 05:11:40 +00:00
altair823	d11a810119	feat(kebab-parse-image): P6-1 image extractor + EXIF whitelist - 새 crate kebab-parse-image 추가 (workspace 19개째). MediaType::Image(_) 자산을 단일-블록 CanonicalDocument 로 변환하는 ImageExtractor 구현. - parser_version "image-meta-v1" (§9 versioning). - 본문은 Block::ImageRef 1건만 포함 — OCR / caption 필드는 None 으로 남겨 두고 P6-2 / P6-3 에서 채운다. - EXIF 화이트리스트 (§9.1, PII 표면 최소화): Make / Model / Software / DateTimeOriginal / Orientation / GPSLatitude(+Ref) / GPSLongitude(+Ref). MakerNote / Thumbnail / 기타 태그는 폐기. DateTime 은 EXIF "YYYY:MM:DD HH:MM:SS" → ISO-8601 변환. GPS DMS triple + N/S/E/W ref → signed decimal degree. - 차원: image::ImageReader 헤더만 읽어 (w, h, format) 획득. 16k×16k cap 초과 또는 디코드 실패 → metadata.user.dimensions = null + Provenance Warning 이벤트 (Err 아님). 포맷 자체 인식 실패 → anyhow::Error (caller skip). - SourceSpan::Region { 0, 0, w, h } 으로 전체 이미지 영역 표기. 결정성: 동일 bytes + 동일 parser_version → 동일 doc_id + block_id (§4.2 ID recipe 그대로 사용). - metadata.source_type = Reference, trust_level = Primary, lang = "und". title = 확장자 제외 파일명, alt = 파일명. - 의존성 경계 (§8): kebab-core 만 + image 0.25 (default features off, png/jpeg/webp/gif/tiff 만), kamadak-exif 0.6, anyhow / serde / serde_json / time / tracing / thiserror. kebab-source-fs · parse-md · store-* · embed* · llm* · rag · UI crate 미참조. - 테스트 14개 (4 unit + 10 integration): • PNG 차원 추출, JPEG EXIF GPS 추출 (DMS → decimal 변환 정확도 1e-6), EXIF 없는 PNG → 빈 map, 손상 PNG → warning + null dims (panic 없음), 인식 불가 bytes → Err, 결정성, 스냅샷, supports() 매칭, media_type 불일치 거부. • 픽스처는 in-memory 생성 (PNG 는 image crate, EXIF JPEG 는 kamadak Writer 로 EXIF blob 만든 뒤 SOI 직후 APP1 splice) — 바이너리 fixture 커밋 없음. - HEIC / RAW 는 spec 상 v1 out of scope (image crate 미지원, Apple Vision sidecar 가 추후 P+ 에서 채움). - tasks/p6/p6-1-image-extractor-exif.md status: planned → completed. contract: docs/superpowers/specs/2026-04-27-kebab-final-form-design.md sections: §3.4 Block::ImageRef + ImageRefBlock, §3.7a OcrText / ModelCaption stubs, §9.1 image extraction policy, §9 versioning.	2026-05-02 05:05:47 +00:00

24 Commits