Files
SubMiner/backlog/tasks/task-23.1 - Implement-JLPT-token-lookup-service-for-subtitle-words.md
2026-02-13 18:29:17 -08:00

1.2 KiB

id, title, status, assignee, created_date, labels, dependencies, parent_task_id, priority
id title status assignee created_date labels dependencies parent_task_id priority
TASK-23.1 Implement JLPT token lookup service for subtitle words To Do
2026-02-13 16:42
TASK-23 high

Description

Create a lookup layer that parses/queries the bundled JLPT dictionary file and returns JLPT level for a given token/word. Integrate with subtitle tokenization path with minimal performance overhead.

Acceptance Criteria

  • #1 Service accepts a token/normalized token and returns JLPT level or no-match deterministically.
  • #2 Lookup handles expected dictionary format edge cases and unknown tokens without throwing.
  • #3 Lookup path is efficient enough for frame-by-frame subtitle updates.
  • #4 Tokenizer interaction preserves existing token ordering and positions needed for rendering spans/underlines.
  • #5 Behavior on malformed/unsupported dictionary format is documented with fallback semantics.

Definition of Done

  • #1 Lookup service returns JLPT level with deterministic output for test fixtures.