altair823
901416d8e9
feat(ocr): T7-T9 — config overrides + engine factory + signature cascade
T7: OcrCfg gains det_model/rec_model/dict overrides + score_thresh/
unclip_ratio/max_boxes (serde default, KEBAB_IMAGE_OCR_* env). OnnxPaddleOcr::new
threads them via ModelPaths::from_config.
T8: build_image_ocr_engine / build_pdf_ocr_engine factories return
Box<dyn OcrEngine>; match on engine string (ollama-vision|paddle-onnx|err).
ImagePipeline.ocr_engine + pdf_ocr_engine signatures switched to &dyn OcrEngine.
OcrEngine gains model() for the progress label.
T9: ingest_config_signature image/pdf branches emit |ocr:1:{engine}:{engine_version}
(memoized blake3 per asset-triple, m3-safe). Unit tests (a)(b)(c) added.
Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-04 08:15:30 +00:00
..
2026-06-04 08:15:30 +00:00
2026-05-02 03:28:08 +00:00
2026-05-25 06:45:32 +00:00
2026-06-03 14:14:15 +00:00
2026-06-03 14:14:15 +00:00
2026-06-02 21:37:58 +00:00
2026-05-09 17:49:23 +09:00
2026-05-28 04:18:40 +00:00
2026-05-28 04:18:40 +00:00
2026-05-28 04:18:40 +00:00
2026-05-28 04:18:40 +00:00
2026-05-28 04:18:40 +00:00
2026-05-28 04:18:40 +00:00
2026-05-28 04:18:40 +00:00
2026-05-28 06:17:47 +00:00
2026-05-28 04:18:40 +00:00
2026-06-02 13:58:27 +00:00
2026-05-28 04:18:40 +00:00
2026-05-28 04:18:40 +00:00
2026-05-27 13:45:38 +00:00
2026-05-28 11:01:00 +00:00
2026-05-26 04:42:37 +00:00
2026-05-28 04:18:40 +00:00
2026-05-28 06:34:01 +00:00
2026-06-03 14:14:15 +00:00
2026-05-28 04:18:40 +00:00
2026-05-28 04:18:40 +00:00
2026-05-27 23:58:10 +00:00
2026-05-28 04:18:40 +00:00
2026-05-28 12:06:01 +00:00
2026-06-02 21:37:58 +00:00
2026-05-28 04:18:40 +00:00
2026-05-28 04:18:40 +00:00
2026-05-28 04:18:40 +00:00
2026-05-28 04:18:40 +00:00
2026-05-28 04:18:40 +00:00