BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions
Paper • 2308.09936 • Published • 1
None defined yet.
SIM1: Physics-Aligned Simulator as Zero-Shot Data Scaler in Deformable Worlds
SynthVerse: A Large-Scale Diverse Synthetic Dataset for Point Tracking