Qianjia Cheng

CajZella

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 24 days ago

Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks

Paper • 2511.15065 • Published 25 days ago • 74

upvoted 2 papers 26 days ago

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Paper • 2511.13704 • Published 27 days ago • 42

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published 27 days ago • 132

upvoted 2 papers about 1 month ago

VideoSSR: Video Self-Supervised Reinforcement Learning

Paper • 2511.06281 • Published Nov 9 • 24

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6 • 208

upvoted 6 papers 3 months ago

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Paper • 2509.15221 • Published Sep 18 • 111

Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Deliberation

Paper • 2509.14760 • Published Sep 18 • 53

MiMo-VL Technical Report

Paper • 2506.03569 • Published Jun 4 • 80

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 256

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 265

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Paper • 2509.07894 • Published Sep 9 • 31

upvoted a paper 4 months ago

CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics

Paper • 2508.18124 • Published Aug 25 • 48