altair823-org/kebab

feat(fb-34): output budget controls #125

Merged

altair823 merged 14 commits from feat/fb-34-output-budget-controls into main

2026-05-09 12:52:40 +00:00

Author	SHA1	Message	Date
th-kim0823	e084b306e5	fix(fb-34): align next_cursor semantics with docs (PR #125 round 2) Previous round-1 fix dropped the speculative cursor branch on the truncated path, leaving a contradiction with the docs: - snippet-only shrunk → cursor emitted (returned == k_effective) - k-popped → cursor null (returned < k_effective) But docs promised the opposite. R2 resolution: emit cursor whenever more hits may be reachable (either retriever filled the page OR budget popped hits — the popped ones remain fetchable from offset+returned). Drop the artificial "widen vs paginate" copy; truncated and next_cursor are now independent signals — caller may do either or both. Updates: app.rs::search_with_opts logic + SearchResponse doc + schema description + SKILL.md two bullets + max_tokens=0 test asserts cursor IS emitted on k-pop case. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-09 21:07:04 +09:00
th-kim0823	f485608108	fix(fb-34): address PR #125 round 1 review - error_wire: StructuredError wrapper preserves ErrorV1 through anyhow → classify pipeline. Adds downcast short-circuit so cursor::decode's typed code = "stale_cursor" reaches the wire instead of being string-formatted to code = "generic". - app: search_with_opts now wraps cursor::decode error in StructuredError instead of anyhow! string format. - test: error_wire pins both negative (bare anyhow → not stale_cursor) AND positive (StructuredError → stale_cursor) invariants. CLI integration test runs end-to-end and asserts error.v1.code on stderr. - app: next_cursor only emitted on full-page (k-pop) path; drop speculative emit on snippet-only truncation that would point at a different page than the agent expected. - cursor: differentiate malformed-base64 / malformed-payload / revision-mismatch error messages; all keep code = stale_cursor. - test: cursor_rejected fixture uses .expect() to fail loud on cursor non-emission instead of silent skip. - test: max_tokens=0 → 1-hit floor + truncated=true. - docs: SKILL.md + schema description distinguish snippet-shrink (widen) vs k-pop (paginate) truncated cases. HOTFIXES notes --no-cache semantic shift (cached path + clear vs uncached path). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-09 20:49:27 +09:00
th-kim0823	9f076003e2	docs(fb-34): README + SMOKE + INDEX + HOTFIXES + skill notes Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-09 20:20:58 +09:00
th-kim0823	e1fcea6313	chore: clippy fix for fb-34 — allow result_large_err on cursor::decode ErrorV1 is the workspace wire error struct; boxing here would force every call site to deref through a Box for no win — the err-path is rare. Single allow at the function level. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-09 20:20:36 +09:00
th-kim0823	5e0cff1b92	feat(mcp): search tool emits search_response.v1 + budget inputs (fb-34) SearchInput gains max_tokens / snippet_chars / cursor (all optional). Output wrapped in search_response.v1 to match CLI; existing tools_call_search test updated to read v["hits"] instead of the bare array. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-09 20:12:05 +09:00
th-kim0823	603061fb86	test(cli): wire_search_response + budget integration (fb-34) 4 lexical-only tests covering search_response.v1 wrapper shape, --max-tokens truncation, --cursor pagination, plain stderr hint. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-09 20:09:01 +09:00
th-kim0823	21220f6d39	feat(cli): kebab search --max-tokens / --snippet-chars / --cursor (fb-34) JSON output wrapped in search_response.v1 (breaking — agent must adapt). Plain output unchanged + [truncated; use --cursor X] stderr hint when budget tripped. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-09 20:02:50 +09:00
th-kim0823	f25ad31741	feat(wire): search_response.v1 schema (fb-34) Wrapper around search_hit.v1[] with next_cursor + truncated. Wire breaking — agent that parses bare array must adapt. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-09 18:00:58 +09:00
th-kim0823	af80cedd81	feat(app): App::search_with_opts + SearchResponse (fb-34) Budget loop: snippet shorten → k pop → ≥1 hit floor. Cursor encode/decode threads corpus_revision; mismatch surfaces as stale_cursor anyhow error. App::search retained as thin wrapper. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-09 17:59:48 +09:00
th-kim0823	aabe66f5e2	docs(error_wire): note stale_cursor convention (fb-34) stale_cursor is built by cursor::decode, not classify. Test locks the invariant. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-09 17:50:39 +09:00
th-kim0823	ebbc3a46ae	feat(app): cursor encode/decode for paginated search (fb-34) Opaque base64(JSON{offset, corpus_revision}). Mismatch or malformed input returns ErrorV1 with code = stale_cursor. base64 promoted to workspace dep. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-09 17:49:23 +09:00
th-kim0823	e00418537f	feat(core): SearchOpts domain type for budget controls (fb-34) 3 optional knobs (max_tokens, snippet_chars, cursor); Default = all None = no enforcement (backwards-compat existing search behavior). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-09 17:46:40 +09:00
th-kim0823	dbb7b54d5d	plan(fb-34): output budget controls implementation plan 11 tasks: SearchOpts (kebab-core), cursor module + base64 dep (kebab-app), error_wire stale_cursor convention, App::search_with_opts + SearchResponse + budget loop, wire schema search_response.v1, CLI flags + plain truncated hint, CLI integration tests, MCP wrapper + inputs, workspace+clippy gate, docs (README/SMOKE/INDEX/HOTFIXES/ skill), smoke+PR. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-09 17:43:26 +09:00
th-kim0823	a80f65c6f2	spec(fb-34): output budget controls — design `kebab search` 에 --max-tokens / --snippet-chars / --cursor 신규. chars/4 token approximation. truncate priority: snippet → k → 멈춤 (최소 1 hit 보장). cursor = opaque base64(offset + corpus_revision) — mismatch 시 error.v1.code = stale_cursor. wire breaking: stdout array → search_response.v1 wrapper. agent 갱신 필요. App::search 시그니처는 thin wrapper 로 보존 (TUI 무영향). ask path 는 scope out (rag.max_context_tokens 가 이미 budget 담당). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-09 17:36:51 +09:00