kebab

Files

altair823 d93b757cf1 fix(cli): thread --config through kebab eval run/aggregate/compare (facade-rule)

Cmd::Eval now loads Config via cli.config (the same pattern as all other
subcommands) before dispatching to the inner match. Each arm now calls
its *_with_config variant:

  run_eval(&opts)             → run_eval_with_config(&cfg, &opts)
  compute_aggregate(run_id)   → compute_aggregate_with_config(&cfg, run_id)
  store_aggregate(run_id, ..) → store_aggregate_with_config(&cfg, run_id, ..)

Compare already called compare_runs_with_config but sourced cfg from
Config::load(None); that redundant load is removed, and cfg now comes
from the shared binding above.
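
The shape of the change can be sketched as follows. This is only an
illustration under stated assumptions, not the actual kebab code: the
Config, Cli and EvalCmd definitions, the dispatch_eval function, the
"xdg-default" placeholder and the simplified *_with_config signatures
(which return strings here) are all hypothetical stand-ins.

```rust
use std::path::{Path, PathBuf};

// Hypothetical stand-in for kebab's Config; it only records where it was loaded from.
#[derive(Debug, Clone, PartialEq)]
struct Config {
    source: PathBuf,
}

impl Config {
    // Assumed behavior: an explicit path wins, None falls back to a default location.
    fn load(path: Option<&Path>) -> Config {
        match path {
            Some(p) => Config { source: p.to_path_buf() },
            None => Config { source: PathBuf::from("xdg-default") },
        }
    }
}

// Hypothetical CLI shape: a global --config flag plus an eval subcommand.
struct Cli {
    config: Option<PathBuf>,
    cmd: EvalCmd,
}

enum EvalCmd {
    Run,
    Aggregate { run_id: u64 },
    Compare { a: u64, b: u64 },
}

// Simplified stand-ins for the *_with_config functions named above.
fn run_eval_with_config(cfg: &Config) -> String {
    format!("run against {}", cfg.source.display())
}

fn compute_aggregate_with_config(cfg: &Config, run_id: u64) -> String {
    format!("aggregate {} against {}", run_id, cfg.source.display())
}

fn compare_runs_with_config(cfg: &Config, a: u64, b: u64) -> String {
    format!("compare {} vs {} against {}", a, b, cfg.source.display())
}

fn dispatch_eval(cli: &Cli) -> String {
    // Load once, from the CLI flag, before the inner match.
    let cfg = Config::load(cli.config.as_deref());
    // Every arm borrows the same binding; no arm calls Config::load(None) itself.
    match &cli.cmd {
        EvalCmd::Run => run_eval_with_config(&cfg),
        EvalCmd::Aggregate { run_id } => compute_aggregate_with_config(&cfg, *run_id),
        EvalCmd::Compare { a, b } => compare_runs_with_config(&cfg, *a, *b),
    }
}
```

Because the config is loaded in exactly one place, an arm cannot
silently fall back to the default location the way the old Compare arm
did.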

Fixes the same facade-rule regression pattern as P3-5 / P4-3: previously
`kebab --config /build/dogfood/config.toml eval run` silently evaluated
the XDG-default (empty) KB instead of the dogfood KB.

Also fixes a runner.rs test that still hardcoded rag-v2 even though
commit 5719969 bumped the default prompt_template_version to rag-v3.
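
One way to keep a test like this from going stale on the next bump is
to compare against the default itself rather than a string literal.
This is a sketch, not necessarily how the commit fixed the test:
EvalOptions, DEFAULT_PROMPT_TEMPLATE_VERSION and
template_version_for_run are hypothetical names.

```rust
// Hypothetical single source of truth for the default template version.
const DEFAULT_PROMPT_TEMPLATE_VERSION: &str = "rag-v3";

// Hypothetical options struct; only the field relevant to the test is shown.
struct EvalOptions {
    prompt_template_version: String,
}

impl Default for EvalOptions {
    fn default() -> Self {
        EvalOptions {
            prompt_template_version: DEFAULT_PROMPT_TEMPLATE_VERSION.to_string(),
        }
    }
}

// Stand-in for whatever the runner records about the template it used.
fn template_version_for_run(opts: &EvalOptions) -> &str {
    &opts.prompt_template_version
}

// A test written against the constant keeps passing when the default is bumped.
fn default_version_matches(opts: &EvalOptions) -> bool {
    template_version_for_run(opts) == DEFAULT_PROMPT_TEMPLATE_VERSION
}
```

The trade-off is that such a test no longer notices an unintended bump
of the default, so a separate assertion on the literal may still be
worth keeping somewhere.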

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-05-29 03:42:40 +00:00

fixtures/eval

test(eval): regenerate runner_per_query_snapshot for V009 baseline

2026-05-28 11:51:49 +00:00

loader.rs

style: cargo fmt --all (round 4 ingest log feature follow-up)

2026-05-28 04:18:40 +00:00

metrics_and_compare.rs

style: cargo fmt --all (round 4 ingest log feature follow-up)

2026-05-28 04:18:40 +00:00

runner.rs

fix(cli): thread --config through kebab eval run/aggregate/compare (facade-rule)

2026-05-29 03:42:40 +00:00