SubMiner/backlog/tasks/task-107 - Fix-Yomitan-scan-token-fallback-fragmentation.md


id: TASK-107
title: Fix Yomitan scan-token fallback fragmentation on exact-source misses
status: Done
assignee:
created_date: 2026-03-07 01:10
updated_date: 2026-03-07 01:12
labels:
dependencies:
priority: high
ordinal: 9007

Description

Left-to-right Yomitan scanning can emit bogus fallback tokens when termsFind returns entries but none of their headwords carries an exact primary source for the consumed substring. Repro: だが それでも届かぬ高みがあった currently yields trailing fragments such as があ, which blocks the real あった token from receiving frequency highlighting.

Acceptance Criteria

  • #1 Scanner skips termsFind fallback entries that are not backed by an exact primary source for the consumed substring.
  • #2 Repro line no longer yields bogus trailing fragments such as があ.
  • #3 Regression coverage added for the scan-token path.
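
Criterion #1 can be illustrated with a small predicate. This is a minimal sketch, not the project's actual helper: the interfaces and the name findExactPrimaryHeadword are hypothetical, and the field names (originalText, matchType, isPrimary) are assumptions loosely modeled on the source metadata a termsFind-style lookup attaches to each headword.

```typescript
// Hypothetical shapes; the real termsFind result may carry more fields.
interface TermSource {
  originalText: string; // substring of the scanned text that produced the match
  matchType: "exact" | "prefix" | "suffix";
  isPrimary: boolean;
}

interface Headword {
  term: string;
  sources: TermSource[];
}

interface DictionaryEntry {
  headwords: Headword[];
}

// Returns the first headword backed by an exact primary source for `consumed`,
// or null when no such headword exists. There is deliberately no
// "first returned headword" fallback.
function findExactPrimaryHeadword(
  entries: DictionaryEntry[],
  consumed: string,
): Headword | null {
  for (const entry of entries) {
    for (const headword of entry.headwords) {
      const backed = headword.sources.some(
        (source) =>
          source.isPrimary &&
          source.matchType === "exact" &&
          source.originalText === consumed,
      );
      if (backed) return headword;
    }
  }
  return null;
}
```

Under these assumptions, a lookup that returns entries for があ only through a non-exact source produces null, so the scanner has nothing to emit for that substring.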

Final Summary

Removed the scan-token helper fallback that previously emitted a token from the first returned headword even when Yomitan did not report an exact primary source for the consumed substring. Added a focused regression test covering だが それでも届かぬ高みがあった, ensuring bogus があ fragmentation is skipped so the later あった exact match can still be tokenized and highlighted.
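
The effect of removing the fallback can be sketched with a toy left-to-right scanner. Everything here is an assumption made for illustration: toyTermsFind is a hard-coded stand-in for the real dictionary lookup, the legacyFallback flag exists only to contrast the old and new behavior, and advancing by one character on a miss is a guess at how the scan continues.

```typescript
type Source = {
  originalText: string;
  matchType: "exact" | "prefix";
  isPrimary: boolean;
};
type Lookup = {
  originalTextLength: number;
  headwords: { term: string; sources: Source[] }[];
};

// Toy stand-in for the lookup: returns entries for a few hard-coded prefixes.
function toyTermsFind(text: string): Lookup {
  const table: [string, string, Source["matchType"]][] = [
    ["高み", "高み", "exact"],
    ["あった", "ある", "exact"],
    ["があ", "がある", "prefix"], // entries come back, but not as an exact match
  ];
  for (const [surface, term, matchType] of table) {
    if (text.startsWith(surface)) {
      const sources = [{ originalText: surface, matchType, isPrimary: true }];
      return { originalTextLength: surface.length, headwords: [{ term, sources }] };
    }
  }
  return { originalTextLength: 0, headwords: [] };
}

function scanTokens(text: string, legacyFallback: boolean): string[] {
  const tokens: string[] = [];
  let pos = 0;
  while (pos < text.length) {
    const { originalTextLength, headwords } = toyTermsFind(text.slice(pos));
    const consumed = text.slice(pos, pos + originalTextLength);
    const exact = headwords.some((h) =>
      h.sources.some(
        (s) => s.isPrimary && s.matchType === "exact" && s.originalText === consumed,
      ),
    );
    // Old behavior: any returned headword was enough to emit a token.
    const emit = exact || (legacyFallback && headwords.length > 0);
    if (originalTextLength > 0 && emit) {
      tokens.push(consumed);
      pos += originalTextLength;
    } else {
      pos += 1; // skip one character and keep scanning
    }
  }
  return tokens;
}

console.log(scanTokens("高みがあった", true));  // ["高み", "があ"]
console.log(scanTokens("高みがあった", false)); // ["高み", "あった"]
```

With the fallback enabled, があ is consumed and the remaining った matches nothing, so あった is never produced. With it disabled, the scanner steps past が and reaches あった as an exact match. The toy table has no entry for が on its own, which is why it appears in neither result.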

Verification:

  • bun test src/core/services/tokenizer/yomitan-parser-runtime.test.ts src/core/services/tokenizer.test.ts --timeout 20000