The FID Lottery: Quantifying Hidden Randomness in Generative-Model Evaluation Paper • 2606.20536 • Published 12 days ago • 10
MERIT: Learning Disentangled Music Representations for Audio Similarity Paper • 2605.27346 • Published May 26 • 8
Configuration error Agents 22 SongFormer 🎵 22 State-of-the-art music analysis with multi-scale datasets
Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models Paper • 2603.25750 • Published Mar 20 • 36
SLA2: Sparse-Linear Attention with Learnable Routing and QAT Paper • 2602.12675 • Published Feb 13 • 59
BitDance: Scaling Autoregressive Generative Models with Binary Tokens Paper • 2602.14041 • Published Mar 13 • 55