view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 2 days ago ⢠467
Less Gaussians, Texture More: 4K Feed-Forward Textured Splatting Paper ⢠2603.25745 ⢠Published 9 days ago ⢠13
Group3D: MLLM-Driven Semantic Grouping for Open-Vocabulary 3D Object Detection Paper ⢠2603.21944 ⢠Published 12 days ago ⢠26
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper ⢠2603.21986 ⢠Published 12 days ago ⢠120
Hidden Dynamics of Massive Activations in Transformer Training Paper ⢠2508.03616 ⢠Published Aug 5, 2025 ⢠19
OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation Paper ⢠2410.17799 ⢠Published Oct 23, 2024 ⢠12
Grounding World Simulation Models in a Real-World Metropolis Paper ⢠2603.15583 ⢠Published 19 days ago ⢠152
FMB: a Functional Manipulation Benchmark for Generalizable Robotic Learning Paper ⢠2401.08553 ⢠Published Jan 16, 2024 ⢠2
FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization Paper ⢠2303.14189 ⢠Published Mar 24, 2023 ⢠5
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models ⢠15 items ⢠Updated 4 days ago ⢠259
view article Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data +7 Jun 3, 2025 ⢠342
DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation Paper ⢠2601.22153 ⢠Published Jan 29 ⢠74
AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications Paper ⢠2508.16279 ⢠Published Aug 22, 2025 ⢠61
Helios: Real Real-Time Long Video Generation Model Paper ⢠2603.04379 ⢠Published about 1 month ago ⢠178