When using Input Folder Path above, whether to also include files in its subdirectories.
When using Input Folder Path above, whether to also save output in the same directory as the inputs, in addition to the original output directory.
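The two folder options above can be sketched as follows. This is a minimal illustration, not the application's actual code: the function names, the `.srt` suffix, and the parameter names are assumptions made for the example.

```python
from pathlib import Path


def collect_inputs(input_folder, include_subdirectories=False):
    """Return the files in input_folder, descending into subfolders only if asked."""
    root = Path(input_folder)
    # rglob walks the whole tree; glob("*") stays at the top level.
    candidates = root.rglob("*") if include_subdirectories else root.glob("*")
    return sorted(p for p in candidates if p.is_file())


def output_paths(input_file, output_dir, save_same_dir=False, suffix=".srt"):
    """Always target output_dir; also target the input's own folder if requested."""
    input_file = Path(input_file)
    name = input_file.stem + suffix  # hypothetical output naming
    targets = [Path(output_dir) / name]
    if save_same_dir:
        targets.append(input_file.parent / name)
    return targets
```

With `save_same_dir=True` each result is written twice: once to the output directory and once next to its input file.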
Computation type for transcription
Use previous output as prompt for next window
Suppress blank outputs at start of sampling
Extract word-level timestamps
Enable this to remove background music
Enable this to transcribe only the parts where voice is detected
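As a rough sketch of how these options might be passed on, assuming a faster-whisper backend (an assumption; the text above does not name the backend): `compute_type` is an argument of `WhisperModel`, while `condition_on_previous_text`, `suppress_blank`, `word_timestamps`, and `vad_filter` are arguments of its `transcribe` method. The `TranscriptionOptions` class, its default values, and the `remove_background_music` flag are hypothetical names for illustration only.

```python
from dataclasses import dataclass


@dataclass
class TranscriptionOptions:
    # Illustrative defaults, not the application's actual defaults.
    compute_type: str = "float16"             # Computation type for transcription
    condition_on_previous_text: bool = True   # Use previous output as prompt for next window
    suppress_blank: bool = True               # Suppress blank outputs at start of sampling
    word_timestamps: bool = False             # Extract word-level timestamps
    remove_background_music: bool = False     # Hypothetical: handled by a separate preprocessing step
    vad_filter: bool = False                  # Transcribe only the parts where voice is detected


def split_options(opts):
    """Split options into model-constructor kwargs and transcribe() kwargs."""
    model_kwargs = {"compute_type": opts.compute_type}
    transcribe_kwargs = {
        "condition_on_previous_text": opts.condition_on_previous_text,
        "suppress_blank": opts.suppress_blank,
        "word_timestamps": opts.word_timestamps,
        "vad_filter": opts.vad_filter,
    }
    return model_kwargs, transcribe_kwargs
```

Background music removal is kept out of both dictionaries because, under this assumption, it is not a transcription parameter and would run on the audio beforehand.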
VRAM usage for each model
| Model name | Required VRAM |
|---|---|
| nllb-200-3.3B | ~16GB |
| nllb-200-1.3B | ~8GB |
| nllb-200-distilled-600M | ~4GB |
Note: Be mindful of your VRAM! The table above lists the approximate VRAM required by each model.
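The table can also be used to pick a model programmatically. The sketch below simply copies the approximate figures from the table; the function name and the idea of choosing the largest model that fits are assumptions for illustration.

```python
# Approximate VRAM requirements in GB, copied from the table above.
NLLB_VRAM_GB = {
    "nllb-200-3.3B": 16,
    "nllb-200-1.3B": 8,
    "nllb-200-distilled-600M": 4,
}


def largest_model_that_fits(available_gb):
    """Return the most demanding model whose requirement fits, or None if none does."""
    fitting = [(need, name) for name, need in NLLB_VRAM_GB.items() if need <= available_gb]
    if not fitting:
        return None
    # max() compares on the VRAM requirement first, so the largest model wins.
    return max(fitting)[1]
```

For example, `largest_model_that_fits(12)` selects `nllb-200-1.3B`. Because the figures are approximate, leaving some headroom below the card's total VRAM is a safer choice than matching it exactly.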