moka-ai/m3e-base
0.1B • Updated • 87.2k • 980
Transcribe speech from audio or YouTube videos into text
Generate speech from text using a reference voice
Audio-based Lip Sync for Talking Head Video Editing