Files
SubMiner/docs-site/mining-workflow.md
T

12 KiB
Raw Blame History

Mining Workflow

This guide walks through the sentence mining loop — from watching a video to creating Anki cards with audio, screenshots, and context.

Overview

Sentence mining means turning real sentences you encounter while watching native video into Anki flashcards, so you learn vocabulary in the context where you actually met it. SubMiner automates the tedious parts of that loop.

SubMiner runs as a transparent overlay on top of mpv (the video player). As subtitles play, the overlay displays them as interactive text. You hover a word, trigger a Yomitan dictionary lookup with your configured lookup key/modifier, then create an Anki card with a single action. SubMiner automatically attaches the sentence, an audio clip, and a screenshot to that card — no manual copy-pasting or screen capturing.

Yomitan is the popup dictionary that shows definitions when you hover or scan a word. AnkiConnect is the add-on that lets SubMiner talk to Anki. Both are set up during installation — see Anki Integration if you have not configured them yet.

Creating Anki Cards

There are four ways to create or enrich cards, depending on your workflow.

1. Auto-Update from Yomitan

This is the most common flow. Yomitan creates a card in Anki, and SubMiner enriches it automatically.

  1. Hover a word, then trigger Yomitan lookup → Yomitan popup appears.
  2. Click the Anki icon in Yomitan to add the word.
  3. SubMiner receives or detects the new card:
    • Proxy mode (default, ankiConnect.proxy.enabled: true): immediate enrich after a successful addNote / addNotes is pushed through the local proxy.
    • Polling mode (fallback, when the proxy is disabled): detects new cards via AnkiConnect polling (ankiConnect.pollingRate, default 3 seconds).
  4. SubMiner updates the card with:
    • Sentence: The current subtitle line.
    • Audio: Extracted from the video using the subtitle's start/end timing (plus optional configured padding).
    • Image: A screenshot or animated clip from the current playback position.
    • Translation: From the secondary subtitle track, or generated via AI if configured.
    • MiscInfo: Metadata like filename and timestamp.

Configure which fields to fill in ankiConnect.fields. See Anki Integration for details.

2. Manual Update from Clipboard

If you prefer a hands-on approach (animecards-style), you can copy the current subtitle to the clipboard and then paste it onto the last-added Anki card:

  1. Add a word via Yomitan as usual.
  2. Press Ctrl/Cmd+C to copy the current subtitle line to the clipboard.
    • For multiple lines: press Ctrl/Cmd+Shift+C, then a digit 19 to select how many recent subtitle lines to combine. The combined text is copied to the clipboard.
  3. Press Ctrl/Cmd+V to update the last-added card with the clipboard contents plus audio, image, and translation — the same fields auto-update would fill.

Manual clipboard updates always replace generated sentence audio, even when ankiConnect.behavior.overwriteAudio is disabled. The word audio field is left unchanged because the word itself does not change in this flow.

This is useful when auto-update is disabled or when you want explicit control over which subtitle line gets attached to the card.

Shortcut Action Config key
Ctrl/Cmd+C Copy current subtitle shortcuts.copySubtitle
Ctrl/Cmd+Shift+C + digit Copy multiple recent lines shortcuts.copySubtitleMultiple
Ctrl/Cmd+V Update last card from clipboard shortcuts.updateLastCardFromClipboard

3. Mine Sentence (Hotkey)

Create a standalone sentence card without going through Yomitan:

  • Mine current sentence: Ctrl/Cmd+S (configurable via shortcuts.mineSentence)
  • Mine multiple lines: Ctrl/Cmd+Shift+S followed by a digit 19 to select how many recent subtitle lines to combine.

The sentence card uses the note type configured in isLapis.sentenceCardModel and always maps sentence/audio to Sentence and SentenceAudio.

::: warning Requires Lapis/Kiku note type Sentence card creation requires a Lapis or Kiku compatible note type and ankiConnect.isLapis.enabled: true in your config. See Anki Integration — Sentence Cards for setup. :::

4. Mark as Audio Card

After adding a word via Yomitan, press the audio card shortcut to overwrite the audio with a longer clip spanning the full subtitle timing.

::: warning Requires Lapis/Kiku note type Audio card marking requires a Lapis or Kiku compatible note type and ankiConnect.isLapis.enabled: true in your config. See Anki Integration — Sentence Cards for setup. :::

Field Grouping (Kiku)

If you mine the same word from different sentences, SubMiner can merge the cards instead of creating duplicates. This feature is designed for use with Kiku and similar note types that support grouped fields.

  1. You add a word via Yomitan.
  2. SubMiner detects the new card and checks if a card with the same expression already exists.
  3. If a duplicate is found (this requires ankiConnect.isKiku.fieldGrouping to be set to "auto" or "manual"; it defaults to "disabled"):
    • Auto mode (ankiConnect.isKiku.fieldGrouping: "auto"): Merges automatically. Both sentences, audio clips, and images are combined into the existing card. The duplicate is optionally deleted.
    • Manual mode (ankiConnect.isKiku.fieldGrouping: "manual"): A modal appears showing both cards side by side. You choose which card to keep and preview the merged result before confirming.

See Anki Integration — Field Grouping for configuration options, merge behavior, and modal keyboard shortcuts.

Overlay Model

SubMiner uses one overlay window with modal surfaces. It carries two subtitle bars — a primary reading bar and a secondary translation/context bar — plus modal dialogs that open on top.

Toggle the entire overlay window with Alt+Shift+O (global) or y-t (mpv plugin).

Primary Subtitle Layer

The primary bar renders subtitles as tokenized hoverable word spans. Each word is a separate element with reading and headword data attached. This plane is styled independently from mpv subtitles and supports:

  • Word-level hover targets for Yomitan lookup
  • Auto pause/resume on subtitle hover (enabled by default via subtitleStyle.autoPauseVideoOnHover)
  • Auto pause/resume while the Yomitan popup is open (enabled by default via subtitleStyle.autoPauseVideoOnYomitanPopup)
  • Right-click to pause/resume
  • Right-click + drag to reposition subtitles
  • Reading annotations — known words, N+1 targets, character-name matches, JLPT levels, and frequency hits can all be visually highlighted

Secondary Subtitle Bar

The secondary bar is a compact top-strip region in the same overlay window. It shows a secondary subtitle track (typically English) for translation/context while keeping the primary reading flow below. It is useful for:

  • Quick comprehension checks without leaving the mining flow.
  • Auto-populating the translation field on mined cards — when a card is created, SubMiner uses the secondary subtitle text as the translation field value (unless AI translation is configured to override it).

It is controlled by secondarySub configuration and shares its lifecycle with the main overlay window. Cycle which track feeds it with Shift+J.

Display Modes

Both the primary and secondary subtitle bars share the same three visibility modes, and each can be changed independently at runtime:

  • Hidden — the bar is not shown.
  • Visible — the bar is always shown.
  • Hover — the bar is revealed only while you hover over the overlay.

By default the primary bar is visible (subtitleStyle.primaryDefaultMode) and the secondary bar is hover (secondarySub.defaultMode).

Cycle each bar's mode at runtime with its own shortcut:

Shortcut Action Config key
V Cycle primary subtitle mode (hidden → visible → hover) overlay-local
Ctrl/Cmd+Shift+V Cycle secondary subtitle mode (hidden → visible → hover) shortcuts.toggleSecondarySub

Modal Surfaces

Jimaku search, field-grouping, runtime options, and manual subsync open as modal surfaces on top of the same overlay window.

Looking Up Words

  1. Hover over the subtitle area — the overlay activates pointer events.
  2. Hover the word you want. SubMiner keeps per-token boundaries so Yomitan can target that token cleanly.
  3. Trigger Yomitan lookup with your configured lookup key/modifier (for example Shift if that is how your Yomitan profile is set up).
  4. Yomitan opens its lookup popup for the hovered token.
  5. From the popup, add the word to Anki.

Controller Workflow

With a gamepad connected and keyboard-only mode enabled, the full mining loop works without a mouse or keyboard:

  1. Navigate — push the left stick left/right to move the token highlight across subtitle words.
  2. Look up — press A to trigger Yomitan lookup on the highlighted word.
  3. Browse the popup — push the left stick up/down to smooth-scroll through the Yomitan popup, or use the right stick for larger jumps.
  4. Cycle audio — press R1 to move to the next dictionary audio entry, L1 to play the current one.
  5. Mine — press X to create an Anki card for the current sentence (same as Ctrl+S).
  6. Close — press B to dismiss the Yomitan popup and return to subtitle navigation.
  7. Pause/resume — press L3 (left stick click) to toggle mpv pause at any time.

After controller support is enabled, the controller and keyboard can be used interchangeably — switching mid-session is seamless. Toggle keyboard-only mode on or off with Y on the controller.

See Usage — Controller Support for setup details and Configuration — Controller Support for the full mapping and tuning options.

Subtitle Sync (Subsync)

If your subtitle file is out of sync with the audio, SubMiner can resynchronize it using alass or ffsubsync.

  1. Open the subsync modal from the overlay.
  2. Select the sync engine (alass or ffsubsync).
  3. For alass, select a reference subtitle track from the video.
  4. SubMiner runs the sync and reloads the corrected subtitle.

Install the sync tools separately — see Troubleshooting if the tools are not found.

Texthooker

SubMiner runs a local HTTP server at http://127.0.0.1:5174 (configurable port) that serves a texthooker UI. This allows external tools — such as a browser-based Yomitan instance — to receive subtitle text in real time.

The texthooker page displays the current subtitle and updates as new lines arrive. This is useful if you prefer to do lookups in a browser rather than through the overlay's built-in Yomitan.

If you want to build your own browser client, websocket consumer, or automation relay, see WebSocket / Texthooker API & Integration.

These features support the mining loop but have their own dedicated pages:

  • Jimaku subtitle search — search and download anime subtitle files directly from the overlay (Ctrl+Shift+J by default), then load them into mpv.
  • N+1 word highlighting — cross-reference your Anki decks to highlight known words, making true N+1 sentences (exactly one unknown word) easy to spot during immersion.
  • Immersion tracking — log watching and mining activity to a local database and view session times, words seen, and cards mined in the built-in stats dashboard.

Next: Anki Integration — field mapping, media generation, and card enrichment configuration.