Whisper Large V3
Transcribe or translate audio from mic, files, or YouTube
mcp_server & FLUX 4-bit Quantization + Enhanced
Text to Audio (Sound SFX) Generator
Easily remove your video's background!
Generate a talking face video from an image and audio
FLUX 4-bit Quantization (just 8GB VRAM)
Flux Animations (GIF) Generation
Try clothes on an image of a person
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Spanish finetune for the original F5 model.