altair823
901416d8e9
feat(ocr): T7-T9 — config overrides + engine factory + signature cascade
T7: OcrCfg gains det_model/rec_model/dict overrides + score_thresh/
unclip_ratio/max_boxes (serde default, KEBAB_IMAGE_OCR_* env). OnnxPaddleOcr::new
threads them via ModelPaths::from_config.
T8: build_image_ocr_engine / build_pdf_ocr_engine factories return
Box<dyn OcrEngine>; match on engine string (ollama-vision|paddle-onnx|err).
ImagePipeline.ocr_engine + pdf_ocr_engine signatures switched to &dyn OcrEngine.
OcrEngine gains model() for the progress label.
T9: ingest_config_signature image/pdf branches emit |ocr:1:{engine}:{engine_version}
(memoized blake3 per asset-triple, m3-safe). Unit tests (a)(b)(c) added.
Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-04 08:15:30 +00:00
..
2026-06-04 08:15:30 +00:00
2026-06-02 21:37:58 +00:00
2026-06-03 11:02:16 +00:00
2026-06-04 08:15:30 +00:00
2026-06-02 21:36:44 +00:00
2026-05-28 04:18:40 +00:00
2026-06-03 04:59:11 +00:00
2026-05-28 04:18:40 +00:00
2026-06-03 04:59:11 +00:00
2026-05-30 00:53:24 +00:00
2026-05-28 04:18:40 +00:00
2026-05-30 03:03:09 +00:00
2026-05-28 04:18:40 +00:00
2026-05-28 04:18:40 +00:00
2026-05-28 04:18:40 +00:00
2026-06-04 08:15:30 +00:00
2026-05-28 04:18:40 +00:00
2026-05-28 04:18:40 +00:00
2026-05-29 04:46:27 +00:00
2026-06-02 21:37:58 +00:00
2026-05-28 04:18:40 +00:00
2026-06-02 21:37:58 +00:00
2026-05-28 04:18:40 +00:00
2026-06-03 11:02:16 +00:00