Huanyu_Zhang

huanyu112

AI & ML interests

None yet

Recent Activity

new activity about 1 month ago
huanyu112/Latent-Sketchpad.Sketch_Decoder: Add pipeline tag and sample usage for Sketch Decoder
liked a model about 1 month ago
huanyu112/Latent-Sketchpad.Sketch_Decoder
upvoted a paper about 1 month ago
Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs

Organizations

None yet

upvoted a paper about 1 month ago

Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs

Paper • 2510.24514 • Published Oct 28 • 21
upvoted a paper 3 months ago

BaseReward: A Strong Baseline for Multimodal Reward Model

Paper • 2509.16127 • Published Sep 19 • 21
upvoted a paper 7 months ago

Visual Planning: Let's Think Only with Images

Paper • 2505.11409 • Published May 16 • 57
upvoted a collection 10 months ago

SYNTHETIC-1

Collection
A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Oct 7 • 66
upvoted a paper over 1 year ago

MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Paper • 2408.13257 • Published Aug 23, 2024 • 26