Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark • arXiv:2510.26802 • Published Oct 30, 2025
Video-Thinker: Sparking "Thinking with Videos" via Reinforcement Learning • arXiv:2510.23473 • Published Oct 27, 2025
Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks • arXiv:2510.19195 • Published Oct 22, 2025
Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs • arXiv:2510.24514 • Published Oct 28, 2025
UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning • arXiv:2510.13515 • Published Oct 15, 2025
IVY-FAKE: A Unified Explainable Framework and Benchmark for Image and Video AIGC Detection • arXiv:2506.00979 • Published Jun 1, 2025
FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding • arXiv:2504.09925 • Published Apr 14, 2025
VisionZip: Longer is Better but Not Necessary in Vision Language Models • arXiv:2412.04467 • Published Dec 5, 2024