kebab

Author	SHA1	Message	Date
altair823	a8fd76499c	feat(expansion): doc-side expansion 별칭 개별 dense 벡터 + 파생물 캐시(V012) 별칭을 줄별 개별 dense 벡터(sentinel `{chunk}#alias#N`)로 색인하고 boilerplate 청크는 별칭 생성을 skip. 묶음 1벡터 방식은 평균화로 특정 표현이 희석돼 오히려 회귀(13/18)했던 것을 폐기. 변형 일관성 14/18 → 16/18, mean_spread@10 0.222 → 0.111 (나무위키 ~1000 문서 CS corpus). `kebab-core::strip_alias_suffix` 가 suffix 형과 per-alias 형 둘 다 처리. 파생물 캐시(V012): embedding 벡터 + 별칭 LLM 결과를 청크 내용 해시 키로 캐싱해 재색인 시 내용 불변 청크의 재계산을 skip. cache_key = blake3(kind ‖ text_blake3 ‖ version_key)[:32], version_key 에 model/prompt/dimensions 포함 → §9 cascade 와 정합(버전 bump 시 자동 miss). 측정: 정답 3개 cold 1879s → warm 13s ≈ 145배. 순수 가산이라 corpus_revision bump 없음. search/ask 는 kebab.sqlite+lancedb 만으로 동작 → 외부 서버 색인 후 DB 만 복사하는 이식 워크플로 가능. V012 schema migration + 신규 surface 로 workspace version 0.20.2 → 0.21.0 (minor) bump. README/HANDOFF/ARCHITECTURE/HOTFIXES sync. known limitation: stack·svm 설명형 2개 잔존 + grounded 판정이 부분 인용을 grounded 로 오분류(후속 후보). 측정 상세: docs/superpowers/handoffs/2026-05-31-namu-wiki-alias-cache-study.md Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-31 08:24:04 +00:00
altair823	0282a81c67	fix(store): CASCADE 대체 4번째 경로 + V011 CHECK 복원 (Task 4.5 리뷰) 리뷰 MAJOR: purge_document_at_workspace_path_except_doc_id(parser-bump 경로)에 원본+sentinel embedding_records 명시 DELETE 누락 → tombstone 누적. 추가 + 회귀 테스트. MINOR: V011 status CHECK(pending/committed/tombstone) 복원. NIT: foreign_keys PRAGMA no-op 주석. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 14:02:46 +00:00
altair823	f3587b7143	feat(store): filter_chunks sentinel 별칭 candidate strip (committed 통과) LanceDB 후보의 sentinel chunk_id({orig}#alias)는 chunks JOIN 에서 탈락해 VectorRetriever strip 이전에 사라진다. candidate 를 kebab_core::strip_alias_suffix 로 원본 chunk_id 로 strip 해 IN-list/JOIN 에 넣어(committed 판정은 원본 body chunk 기준) 통과시키되, 반환은 입력 candidate 형태(sentinel 유지) — VectorRetriever 가 그 sentinel 을 받아 strip+dedup 한다. SQL replace 대신 (b) Rust strip 채택(명확). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 13:41:28 +00:00
altair823	483b1ec06b	feat(store): V011 embedding_records FK 제거 + CASCADE 대체 명시 DELETE (sentinel 별칭 벡터) 별칭 dense 벡터를 sentinel chunk_id({orig}#alias)로 색인하려면 chunks 에 없는 chunk_id 가 embedding_records 에 들어가야 한다. V001 의 chunk_id REFERENCES chunks ON DELETE CASCADE FK 가 이를 SQLite 787 로 막으므로 테이블을 FK 없이 재생성한다. status/vector_committed(V003) + 3개 인덱스 보존, chunks_bd_tombstone_embeddings trigger 무수정. DROP→RENAME 시 dangling trigger 재파싱을 피하려 legacy_alter_table=ON. 사라진 CASCADE 는 put_chunks + purge 두 경로(purge_orphan_at_workspace_path, purge_deleted_workspace_path)의 명시 DELETE 로 대체 — chunks 삭제 직전 원본 + {id}#alias sentinel embedding_records 를 함께 정리. corpus_revision baseline 2→3. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 13:41:20 +00:00
altair823	d279f343e7	docs(spec,plan): 별도 벡터 인프라 — FK 제거(V011) + CASCADE 대체 + filter_chunks PoC: 별칭 순수 벡터가 영어 설명형 rank 7~30 (concat 본문 희석으로 미회복) → 별도 벡터 명분. 차단요인 3건: embedding_records FK(787, V011 재생성), CASCADE 대체(명시 DELETE), filter_chunks sentinel strip. plan Task 4.5/4.6. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 13:25:45 +00:00
altair823	b56469f010	fix(core): clippy uninlined_format_args — strip_alias 테스트 (리뷰 MAJOR-1) workspace clippy --all-targets -D warnings 게이트 통과. format! 인자 인라인. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 11:24:04 +00:00
altair823	6ba8cb2c88	feat(search): VectorRetriever sentinel 별칭 strip + dedup 별칭 dense 벡터({orig}#alias) hit 을 원본 chunk_id 로 strip 해 hydrate, body+alias 중복은 첫(높은 score) 하나만 유지. overfetch 2→3 (dedup 후 k 확보). wire/RetrievalDetail 무변경. vector/hybrid 회귀 0, clippy green. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 11:09:32 +00:00
altair823	afa8af0f88	feat(app): 별칭 dense 별도 벡터 색인 + purge (sentinel)	2026-05-30 10:48:58 +00:00
altair823	b9d20d23d1	feat(config): ingest.expansion.embed_aliases flag (default off)	2026-05-30 10:31:07 +00:00
altair823	86b4e1ebd0	feat(core): ALIAS_SUFFIX + strip_alias_suffix (dense alias vectors)	2026-05-30 10:31:03 +00:00
altair823	825543549d	docs(plan): 별칭 dense 별도 벡터 구현 plan ALIAS_SUFFIX(core) → embed_aliases flag → ingest sentinel 벡터+purge → VectorRetriever strip+dedup → 측정. TDD, 완성 코드. doc-side expansion PR. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 10:28:43 +00:00
altair823	bcb8b93751	docs(spec): 별칭 dense 별도 벡터 설계 spec PoC(concat) 측정: dense 별칭이 6/0/2/0.25 (설명형은 dense 본령 실증), 단 영어 설명형 2개는 concat 본문 희석으로 미회복. 처방: 별칭을 sentinel chunk_id 별도 벡터로 색인(본문 벡터 불변=회귀 안전, 별칭 순수 신호). flag ingest.expansion.embed_aliases default off. lexical 완화는 폐기. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 10:26:24 +00:00
altair823	116b3e6377	fix(app): clippy unused_self — build_request 를 associated fn 으로 CI 게이트(clippy --workspace --all-targets -D warnings) 통과. 동작 동일. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 03:47:06 +00:00
altair823	69b53d1c97	docs(spec): doc-side expansion 검색 메커니즘을 shipped 구현에 맞춰 정정 Task 6 리뷰 MINOR-1: spec 본문이 단일 UNION ALL+GROUP BY 로 기술됐으나 shipped = 2-query(run_query+run_alias_query) + Rust merge_body_alias(body 우선). 서로 다른 FTS 테이블 bm25 절대값 비교가 무의미해 body-우선 merge 가 더 깨끗. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 03:20:13 +00:00
altair823	a271352e33	feat(search): lexical body+alias 병합 검색 (pool-rescue) Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-30 03:12:14 +00:00
altair823	cde4d75f6b	feat(app): ingest 별칭 생성 hook (flag off 기본, fail-soft)	2026-05-30 03:03:09 +00:00
altair823	bddcd53688	fix(app): parse_aliases 접두 제거가 숫자/하이픈 선두 별칭 손상 (Task 4 리뷰 MAJOR-1) 탐욕적 trim_start_matches → 명시적 strip_list_marker(마커+공백 패턴만 1회). "3D 렌더링"/"2단계"/"-fast" 보존, "- "/"1. " 마커만 제거. 회귀 테스트 2개. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 02:49:25 +00:00
altair823	2a207f9868	feat(app): ExpansionGenerator — 청크당 별칭 생성 (fail-soft) Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-30 02:36:20 +00:00
altair823	cc31868d24	feat(config): [ingest.expansion] flag (default off)	2026-05-30 02:26:41 +00:00
altair823	0df47febf0	test(store): doc-side expansion Task 2 리뷰 보강 (M1/M2/N1) - M1: chunk_aliases trigger 가드에 AND aliases <> '' (빈 문자열 미색인) - M2: 재색인 멱등 테스트 (재-put 후 별칭 행 1개) - N1: 본문 격리 음성 단언 (별칭 term 이 chunks_fts 로 누출 안 됨) Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 02:24:24 +00:00
altair823	b12a616ab2	feat(store): V010 chunk_aliases_fts + put_chunks 별칭 영속화 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-30 02:15:27 +00:00
altair823	848b75c069	feat(core): Chunk.aliases 필드 (doc-side expansion) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-30 02:09:39 +00:00
altair823	467a974901	docs(plan): doc-side expansion 구현 plan + spec 정제 (별도 FTS 테이블) spec: chunks_fts §5.5 verbatim 충돌 회피 → 별도 chunk_aliases_fts 테이블 + lexical 내부 body+alias 병합(RetrievalDetail/wire schema 무변경)으로 정제. plan: 7 task TDD (Chunk 필드 → V010 → config → ExpansionGenerator → ingest hook → lexical 병합 → 측정/문서). 완성 코드 + 빌드 규약. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 02:04:58 +00:00
altair823	098413922b	docs(spec): 색인시 doc-side expansion 설계 spec (Phase 2) brainstorm 확정: 청크당 별칭 생성(같은언어+한↔영 번역), additive+수동 재색인, 1차 단순 품질제어. 별도 FTS5 aliases 채널 → RRF 3채널 융합. flag off 기본, kebab eval variants 로 on/off 측정. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 01:54:46 +00:00
altair823	695010ea7a	Merge pull request 'docs: Phase 2 doc-side expansion 킥오프 + 구현 방법론 핸드오프' (#194 ) from docs/phase2-doc-expansion-kickoff into main Reviewed-on: #194	2026-05-30 01:23:48 +00:00
altair823	8bb7c276d0	docs: Phase 2 doc-side expansion 킥오프 + 구현 방법론 핸드오프 새 세션이 Phase 2(색인시 doc-side expansion)를 자립적으로 이어받을 컨텍스트 문서. 배경(rerank 반증→재정의→Phase1 진단 B우세→딥리서치→PoC), 설계 방향(KO↔EN 번역 별칭 + 별도 FTS5 필드 + RRF, flag off), 이미 만든 측정 도구(kebab eval variants + dogfood golden), 그리고 지금까지와 동일한 구현 방법론(brainstorm→spec→plan→OMC teammate sequential 구현+리뷰 +독립검증, 모델 라우팅, 빌드 redirect+exit, 측정=variant eval 프록시금지, gitea-pr 리뷰루프)을 담음. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 01:19:14 +00:00
altair823	01a03463a6	Merge pull request 'feat(eval): 변형 일관성(query-paraphrase robustness) 평가 프레임워크' (#193 ) from feat/paraphrase-robustness-eval into main Reviewed-on: #193	2026-05-30 01:12:26 +00:00
altair823	b6ad947378	docs: README 명령 표 슬림 + ARCHITECTURE 상세 이전·동기화 README 의 괴물 셀(ingest 2891→544, search 2952→687, ask 1244→415, tui 2300→453자)을 "무엇 + 핵심 flag + 포인터"로 축소. 빠진 구조 detail 은 ARCHITECTURE 로 이전: - symbol path 형식에 Go/Java/Kotlin/C/C++ 추가 + code chunk provenance(citation.kind/code_lang/repo) - Markdown title 자동 채움 순서(md-frontmatter-v2) - RAG groundedness 검증(mDeBERTa-v3 XNLI, nli_threshold gate) 결정 행 신설 - TUI 행을 P9-1~4 완료 + F1 cheatsheet 로 최신화 (stale "진행 예정" 제거) flag 망라는 --help, TUI 키는 in-app F1 cheatsheet(권위 런타임 소스)로 위임 — stale 방지. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 01:10:39 +00:00
altair823	1529e6d991	docs(readme): PR #193 회차 1 리뷰 반영 — eval 명령 표에 aggregate/variants 추가 Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 01:00:32 +00:00
altair823	5ad1f98227	docs(handoff): doc-side expansion 딥리서치 + PoC 결과 (Phase 2 방향 확정) 딥리서치(104 agent): 어휘격차 pool-miss 최선책 = 색인시 doc-side expansion. PoC(dogfood KB): recall@50=0 이던 3쿼리가 별칭 추가로 rank1~2 부활(hybrid+vector, 골든 verbatim 아님=일반화). 핵심 미검증 고리 실 corpus 정량 확인. Phase 2 = 색인시 doc-side expansion(KO↔EN 번역 별칭) → 별도 FTS5 필드 → RRF, flag off. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 00:53:24 +00:00
altair823	a58cae2ff3	docs(research): 어휘격차 pool-miss 해결 딥리서치 레퍼런스 deep-research 워크플로(104 agent, 5각도, 22소스, 25 claim 3-vote 검증, 22 confirmed/3 killed). 결론: 색인시 doc-side expansion(doc2query)이 pool-miss 최선책 — pool 자체를 키우고 per-query 지연 ~0(색인시 1회), 정확매칭 보존(별도 필드 append). 단 vanilla mt5는 같은언어라 한/영 갭은 색인시 KO↔EN 대체 query 생성 필요. query-side(HyDE=거부된 per-query LLM, Vector-PRF=recall 주장 기각)는 부적합. 검증은 기존 variant eval 로 가능. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 00:53:24 +00:00
altair823	7a1dff1684	docs(handoff): query-paraphrase robustness Phase 1 완료 + (A)/(B) 진단 8그룹×4변형(dogfood) 측정: groups=8 A_dominant=2 B_dominant=4 spread@10=0.750. 진단 — 문제는 한/영이 아니라 어휘 거리(영어 풀어쓴 문장도 miss, 일부 한국어는 OK). B(어휘격차, recall@50=0, rerank 불가) 우세 → 쿼리 확장/번역 처방 신호. A(순위출렁)는 cap_theorem/vector_database 2그룹뿐. "측정 먼저" 논제 정량 검증(rerank 단독은 부분해법). Phase 2 처방 결정 대기. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 00:53:24 +00:00
altair823	0988f66331	feat(cli): kebab eval variants <run_id> — 변형 일관성 진단 리포트 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-30 00:53:24 +00:00
altair823	82e02aa4fe	fix(eval): 변형 일관성 리뷰 H1/M1 — pool truncation 방어 + answer 판정 정렬 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-30 00:53:24 +00:00
altair823	db4af0cc72	feat(eval): 변형 일관성 메트릭 + A/B(순위출렁/어휘격차) 분류 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-30 00:53:24 +00:00
altair823	ab20202241	test(eval): Task1 리뷰 nit — 3+멤버 그룹/group=None 테스트 + 에러 메시지에 divergent query id Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 00:53:24 +00:00
altair823	a51e6395c0	feat(eval): GoldenQuery.group + 그룹 정합성 검증 (변형 일관성 기반)	2026-05-30 00:53:24 +00:00
altair823	fe4c854673	docs(plan): query-paraphrase robustness Phase 1 구현 계획 5개 task: (1) GoldenQuery.group + 그룹 정합성 검증, (2) 변형 일관성 메트릭 모듈 + A/B(순위출렁/어휘격차) 분류, (3) kebab eval variants CLI, (4) dogfood golden 변형 그룹 큐레이션, (5) 측정 + 진단 리포트. TDD bite-sized, 완성 코드. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 00:53:24 +00:00
altair823	1de3f4ffca	docs(spec): query-paraphrase robustness 평가 프레임워크 설계 (측정 먼저) 목표 재정의: 한/영 overlap → 같은 의미의 다양한 표현(동의어·다른 어휘·풀어쓴 문장·한영)에서 일관된 답변 품질. 지난 reranker 실험이 overlap 프록시 최적화로 헛돈 교훈 반영 — 처방 전 진짜 지표(변형 일관성)를 직접 재는 평가부터. Phase 1(본 spec 구현): kebab-eval golden suite에 변형 그룹(intent group) + 변형 일관성 메트릭(recall_spread, answer_consistency) + recall@pool vs recall@k로 (A)순위출렁/(B)어휘격차 자동 판별. Phase 2(처방)는 측정 결과 게이트 뒤 조건부. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-30 00:53:24 +00:00
altair823	7fbfec647d	Merge pull request 'feat: v0.20.2 — Ask 응답언어 rag-v3 + 8 dogfood findings + 검색 품질 eval baseline' (#192 ) from fix/v0-20-2-dogfood-findings into main Reviewed-on: #192 v0.20.2	2026-05-29 05:27:52 +00:00
altair823	ca8c83b1ba	chore(hotfixes): PR #192 회차 1 리뷰 반영 — refusal marker 표기 정정 `<REFUSE>` marker → citation marker(`[#번호]`) 유무 기반 (pipeline.rs:463-486). release-notes 정정과 일관. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-29 05:22:57 +00:00
altair823	6c611990d8	chore: bump version 0.20.1 -> 0.20.2 v0.20.2 — Ask 응답언어 자동매칭(rag-v3) + 8 dogfood findings + eval --config facade 패치 + 검색 품질 golden baseline (hybrid hit@3=1.0 / MRR=0.833, lexical hit@3=1.0). evidence: tasks/HOTFIXES.md 2026-05-29, docs/release-notes/v0.20.2-draft.md. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-29 05:10:42 +00:00
altair823	166b1404e4	docs(release-notes): correct refusal判정 mechanism + O-2 phrasing leader review of writer draft: refusal 판정은 citation marker(`[#번호]`) 유무 기반이며 `<REFUSE>` 특수 마커가 아님. O-2 문구 예시도 실제 rag-v3 규칙으로 정정. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-29 04:58:08 +00:00
altair823	2d0168b7ab	docs(sync): README + HANDOFF v0.20.2 surface 반영 CLAUDE.md docs-split 규칙에 따라 사용자 visible surface 변경 동기화. README: - [rag] prompt_template_version default rag-v2 → rag-v3 (v0.20.2) - v3 규칙 설명 (답변 언어 = 질문 언어) - O-2 known limitation (소형 모델 refusal 언어 불일치) HANDOFF: - 머지 후 발견된 버그/결정 에 v0.20.2 1줄 요약 추가 - 검색 품질 baseline (hybrid MRR=0.833) + O-2 known limitation 언급 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-29 04:54:26 +00:00
altair823	4afcaf96d2	docs(release-notes): add v0.20.2-draft (rag-v3 응답언어 + 검색 품질 eval 인프라) v0.20.2 릴리즈 노트 초안 작성. 사용자 영향 4단락 구조로 각 finding 기술. - Finding #1/O-2: rag-v3 응답언어 자동 매칭 + refusal 언어중립화 - Finding #2: bulk search input schema 확정 (15필드) - Finding #3: list docs human-readable path 보강 - Finding #7: index_version 두 곳 구분 (vector vs FTS5) - eval --config facade + 검색 품질 baseline (hybrid hit@3=1.0 / MRR=0.833) - Finding #4/#5/#6/#8: docs/schema 정비 - version cascade 주의 (rag-v3 → eval compare) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-29 04:54:19 +00:00
altair823	16c4579399	docs(hotfixes): add 2026-05-29 v0.20.2 dogfood findings + 검색 품질 baseline 8-finding 도그푸딩 라운드 및 검색 품질 baseline 결과를 HOTFIXES 에 기록. - 8 findings 요약 표 (rag-v3, bulk schema, list docs, index_version 등) - Finding O-2 known limitation (소형 모델 refusal 언어 불일치) - 검색 품질 baseline 표 (hybrid MRR=0.833, lexical MRR=0.7) - golden 큐레이션 교훈 (dispatch.py 정답 정정 → hit@3 0.9→1.0) - eval logs cross-link Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-29 04:54:11 +00:00
altair823	40d7faee71	docs(dogfood): add §10.2 search quality baseline scenario (v0.20.2 golden suite) eval --config facade 패치로 dogfood KB 직접 평가 가능해짐에 따라 §10 Eval 에 §10.2 검색 품질 baseline 섹션 추가. - golden suite 실행 명령 (hybrid + lexical eval run → aggregate) - v0.20.2 metric baseline 표 (hybrid hit@3=1.0 / MRR=0.833) - 정성 체크리스트 (한국어 2자 hit@3, empty=0, MRR 임계치) - golden 큐레이션 절차 + dispatch.py 오류 교훈 - §10.1 로 기존 basic eval run 재구성 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-29 04:54:03 +00:00
altair823	a3bb2580bf	test(rag): add rag-v3 dispatch integration test + refresh stale rag-v2 docs (code-review) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-29 04:46:27 +00:00
altair823	2429189447	docs(spec): reflect search-quality critic round-1 (eval --config, lang-filter non-goal, curation) Incorporates all critic (opus) round-1 findings into the dogfood search-quality eval design spec: BLOCKER-1: §4.4 execution commands now use --config /build/dogfood/config.toml (Task A facade-rule patch makes this the canonical path). §5.1 re-titled from "(후속 패치)" to "Task A로 적용됨 — 권장 운영 경로"; XDG workarounds demoted to "패치 전 fallback". Intro paragraph updated accordingly. MAJOR-1: §3 Non-Goals gains an explicit bullet: lang/media/code_lang SearchFilters validation is out of scope for this harness (runner uses SearchFilters::default(), runner.rs:151). §4.1 "code 검색" row no longer claims code_lang filter coverage. MINOR-1: §4.3 step 3 now names kebab inspect doc <id> as the primary chunk-selection path (breaks chunk-level curation loop); search hits demoted to "보조 확인용". MINOR-2: §4.1 golden category table gains two new rows — 한국어 N-gram fallback query (복합어/신조어 coverage) and 영어 whole-token exact query (separates substring artefacts). MINOR-3: §4.1 YAML header note added: record corpus_revision in golden file so stale-bail root cause is immediately traceable. NIT: §9 References line numbers corrected (runner.rs:31, metrics.rs:116/144); runner.rs:151 SearchFilters::default() reference added. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-29 03:43:00 +00:00
altair823	d93b757cf1	fix(cli): thread --config through kebab eval run/aggregate/compare (facade-rule) Cmd::Eval now loads Config via cli.config (same pattern as all other subcommands) before dispatching to the inner match. Each arm now calls the *_with_config variant: run_eval(&opts) → run_eval_with_config(&cfg, &opts) compute_aggregate(run_id) → compute_aggregate_with_config(&cfg, run_id) store_aggregate(run_id, ..) → store_aggregate_with_config(&cfg, run_id, ..) Compare already called compare_runs_with_config but sourced cfg from Config::load(None) — that redundant load is removed; cfg comes from the shared binding above. Fixes the same facade-rule regression pattern as P3-5 / P4-3: previously `kebab --config /build/dogfood/config.toml eval run` silently evaluated the XDG-default (empty) KB instead of the dogfood KB. Also fixes runner.rs test that hardcoded rag-v2 after commit `5719969` bumped the default prompt_template_version to rag-v3. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-29 03:42:40 +00:00

1 2 3 4 5 ...

1037 Commits