feat: add BM25 pre-search to QA workflow and remove keyword-search (#382) by kiyotis · Pull Request #385 · nablarch/nabledge-dev

kiyotis · 2026-06-25T01:48:24Z

See steering.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…n, add incremental test steps Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…tional Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Add scripts/bm25-search.sh that wraps bm25s Python library. Builds index from knowledge/*.json on first run, saves to scripts/.bm25-index/, and reloads on subsequent calls. Detects staleness by comparing index mtime to newest JSON mtime. Returns top-20 sections as JSON array with file, section_id, section_title, and score fields. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Insert Step 3 (BM25 pre-search) between Step 2 (ask user) and the former Step 3 (semantic search). Renumber former Steps 3-8 to Steps 4-9 and update all internal cross-references (Step 5→6 in answer generation pointer, Step 6→7 in verify pointer, Step 8→9 in output pointer). BM25 path: extract narrow terms → bm25-search.sh → read-sections.sh → generate answer → verify → if PASS skip Steps 4-8 to Step 9. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…qa.md Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…h.md Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…er/verify-answer) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…ntent check Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…reed Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…w recorded Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Aligns parameter name with verify-answer.md's output field name. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Adds check-answerable.md workflow that judges whether BM25 hits are sufficient to answer the question. Updates qa.md to the agreed flow: Step 1 hearing → Step 2 full-text-search → Step 3 check-answerable (OK→Step5 / NG→Step4) → Step 4 semantic-search → Step 5 generate-answer → Step 6 verify-answer (FAIL→regenerate with {findings}) → Step 7 output. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…rding Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…8 issue noted Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…ge title lookup Pass the full component page title list (libraries/handlers/adapters, 130 files) to LLM as a translation table. LLM maps question concepts to Nablarch-specific terms by finding related page titles and extracting their hyphenated keywords. Example: "バリデーション" → sees libraries-bean-validation in titles → uses "bean-validation" as BM25 search term. This eliminates the need for LLM to know Nablarch-specific terminology a priori. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…riment results recorded Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…-search.md Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…p procedure clarified Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…W-TO-RUN.md reference Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…full benchmark steps Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

… pending Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

kiyotis and others added 30 commits June 25, 2026 10:48

chore: start session — issue-382

21ee3c5

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs: revise steering — drop baseline re-run, add BM25 engine decisio…

5891f46

…n, add incremental test steps Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs: revise steering #2 — BM25 library selection is required, not op…

75df1c7

…tional Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs: record benchmark baseline for issue-382

f58b6a4

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs: complete task #1 — confirm benchmark baseline

170d5b1

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs: add bm25-step-draft.md — library selection and qa.md step design

45a12be

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs: complete task #2 — BM25 library selection and step design

ee90311

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs: fix step-number refs in bm25-step-draft.md (Step 5→6, Step 6→7)

86a2b1f

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

chore: remove bm25-step-draft.md — decisions in steering.md, impl in …

a736f8c

…qa.md Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

refactor: extract BM25 pre-search to workflows/full-text-search.md

a3ab32a

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

refactor: qa.md Step 3 calls full-text-search.md workflow

60bd529

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

wip: task-3 in progress — Phase A done, refactored to full-text-searc…

9644567

…h.md Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

refactor: extract qa.md into focused workflows (hearing/generate-answ…

92f8f1f

…er/verify-answer) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

fix: verify-answer reads full pages instead of section pointers

cc1fd89

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

fix: add {question} input to verify-answer.md — needed for missing-co…

2f58446

…ntent check Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

wip: task-3 design redesign in progress — check-answerable pattern ag…

1e84ac7

…reed Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs: pause task-3 — check-answerable pattern agreed, qa.md final flo…

64d46f1

…w recorded Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

refactor: rename {excluded_claims} to {findings} in generate-answer.md

ecaa615

Aligns parameter name with verify-answer.md's output field name. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

feat: update benchmark infra for new qa.md BM25+check-answerable flow

fcbcc7c

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

fix: add cascade-fallback/regeneration tests and qa.md NG-fallback wo…

5423ab9

…rding Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs: update steering.md — pre-benchmark stabilization loop, review-0…

082dee1

…8 issue noted Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

wip: save task-3 check file and path-coverage notes

a0ee91c

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

wip: update steering.md — fts-hints.md design agreed, 4-scenario expe…

a0d6fae

…riment results recorded Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs: reconcile steering.md — clear State section for resume

a0c1ac7

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

feat: generate fts-hints.md in RBKC and load dynamically in full-text…

722345a

…-search.md Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

fix: fts-hints.md uses file stems not JSON titles for BM25 term lookup

394983c

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

kiyotis and others added 7 commits June 26, 2026 10:15

wip: update steering.md — fts-hints.md implemented, stabilization loo…

874dfe6

…p procedure clarified Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

wip: update steering.md — benchmark must follow HOW-TO-RUN.md strictly

957e09b

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

wip: update steering.md — replace transcribed benchmark steps with HO…

168b3b8

…W-TO-RUN.md reference Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

wip: update steering.md — split benchmark into stabilization run and …

e4d45cc

…full benchmark steps Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

wip: update steering.md — State updated for session suspend

d562f33

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

chore: save 20260626-1108-stabilization-bm25 run-1 (benchmark + report)

14eda86

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

wip: update steering.md — stabilization run-1 complete, user approval…

2f5d428

… pending Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add BM25 pre-search to QA workflow and remove keyword-search (#382)#385

feat: add BM25 pre-search to QA workflow and remove keyword-search (#382)#385
kiyotis wants to merge 37 commits into
mainfrom
worktree-text-search

kiyotis commented Jun 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

kiyotis commented Jun 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant