vgbench

community

https://vgbench.github.io/

AI & ML interests

Vector Graphics, LLM

Recent Activity

BochengZou authored a paper 14 days ago

Agent Skills Should Go Beyond Text: The Case for Visual Skills

HanSolo9682 authored a paper 24 days ago

Your Embedding Model is SMARTer Than You Think

HanSolo9682 submitted a paper 24 days ago

Your Embedding Model is SMARTer Than You Think

View all activity

authored a paper 14 days ago

Agent Skills Should Go Beyond Text: The Case for Visual Skills

Paper • 2606.01414 • Published 19 days ago • 10

authored a paper 24 days ago

Your Embedding Model is SMARTer Than You Think

Paper • 2605.24938 • Published 26 days ago • 25

submitted a paper to Daily Papers 24 days ago

Your Embedding Model is SMARTer Than You Think

Paper • 2605.24938 • Published 26 days ago • 25

authored a paper 30 days ago

MementoGUI: Learning Agentic Multimodal Memory Control for Long-Horizon GUI Agents

Paper • 2605.18652 • Published May 18 • 8

authored a paper 3 months ago

MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models

Paper • 2603.25744 • Published Mar 26 • 13

submitted a paper to Daily Papers 3 months ago

MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models

Paper • 2603.25744 • Published Mar 26 • 13

authored a paper 3 months ago

Unified Spatio-Temporal Token Scoring for Efficient Video VLMs

Paper • 2603.18004 • Published Mar 18 • 14

authored 2 papers 4 months ago

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

Paper • 2601.10611 • Published Jan 15 • 35

Reasoning-Augmented Representations for Multimodal Retrieval

Paper • 2602.07125 • Published Feb 6

submitted a paper to Daily Papers 4 months ago

Reasoning-Augmented Representations for Multimodal Retrieval

Paper • 2602.07125 • Published Feb 6

updated a dataset over 1 year ago

vgbench/VGen

Viewer • Updated Dec 31, 2024 • 5.84k • 24 • 4

authored a paper over 1 year ago

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Paper • 2410.10818 • Published Oct 14, 2024 • 16

authored a paper over 1 year ago

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Paper • 2410.10818 • Published Oct 14, 2024 • 16

authored a paper over 1 year ago

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Paper • 2410.10818 • Published Oct 14, 2024 • 16

authored 3 papers over 1 year ago

CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples

Paper • 2402.13254 • Published Feb 20, 2024

VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation

Paper • 2407.10972 • Published Jul 15, 2024 • 1

Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos

Paper • 2410.02763 • Published Oct 3, 2024 • 7

authored a paper over 1 year ago

Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos

Paper • 2410.02763 • Published Oct 3, 2024 • 7

authored a paper almost 2 years ago

VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation

Paper • 2407.10972 • Published Jul 15, 2024 • 1

updated a dataset almost 2 years ago

vgbench/VGQA

Viewer • Updated Jul 16, 2024 • 4.28k • 132 • 6