LongNet: Scaling Transformers to 1,000,000,000 Tokens Paper • 2307.02486 • Published Jul 5, 2023 • 82 • 17
Rethinking Cross-Layer Information Routing in Diffusion Transformers Paper • 2605.20708 • Published May 20 • 111 • 6
The GAN is dead; long live the GAN! A Modern GAN Baseline Paper • 2501.05441 • Published Jan 9, 2025 • 98 • 5