ViMoGen Collection The Quest for Generalizable Motion Generation: Data, Model, and Evaluation • 3 items • Updated 19 days ago
SenseNova-SI Collection Scaling Spatial Intelligence with Multimodal Foundation Models • 10 items • Updated 13 days ago • 15
sensenova/SenseNova-SI-1.1-InternVL3-8B-800K Image-Text-to-Text • 8B • Updated Dec 23, 2025 • 18 • 2
The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding Paper • 2512.19693 • Published Dec 22, 2025 • 64
ViMoGen Collection The Quest for Generalizable Motion Generation: Data, Model, and Evaluation • 3 items • Updated 19 days ago
LongVie 2: Multimodal Controllable Ultra-Long Video World Model Paper • 2512.13604 • Published Dec 15, 2025 • 74
sensenova/SenseNova-SI-1.2-InternVL3-8B Image-Text-to-Text • 8B • Updated Dec 10, 2025 • 3.94k • 10
sensenova/SenseNova-SI-1.2-InternVL3-8B Image-Text-to-Text • 8B • Updated Dec 10, 2025 • 3.94k • 10
SenseNova-SI Collection Scaling Spatial Intelligence with Multimodal Foundation Models • 10 items • Updated 13 days ago • 15
Scaling Spatial Intelligence with Multimodal Foundation Models Paper • 2511.13719 • Published Nov 17, 2025 • 47