UniVerse-1: Unified Audio-Video Generation via Stitching of Experts Paper • 2509.06155 • Published Sep 7 • 13
Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model Paper • 2502.16779 • Published Feb 24 • 4
SpeakerVid-5M: A Large-Scale High-Quality Dataset for Audio-Visual Dyadic Interactive Human Generation Paper • 2507.09862 • Published Jul 14 • 49