Building on HF

25 77 84

Yuhang Zang PRO

yuhangzang

https://yuhangzang.github.io/

AI & ML interests

🤗 HuggingFace is all you need

Recent Activity

updated a dataset 20 minutes ago

internlm/WildClawBench

published a dataset about 2 hours ago

internlm/WildClawBench

updated a model about 15 hours ago

internlm/Visual-ERM

View all activity

Organizations

upvoted 2 papers 9 days ago

From Sparse to Dense: Multi-View GRPO for Flow Models via Augmented Condition Space

Paper • 2603.12648 • Published 11 days ago • 14

Visual-ERM: Reward Modeling for Visual Equivalence

Paper • 2603.13224 • Published 11 days ago • 21

upvoted a paper 12 days ago

EndoCoT: Scaling Endogenous Chain-of-Thought Reasoning in Diffusion Models

Paper • 2603.12252 • Published 12 days ago • 10

upvoted 2 papers about 1 month ago

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Paper • 2602.08794 • Published Feb 9 • 156

Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition

Paper • 2602.08439 • Published Feb 9 • 28

upvoted 2 papers about 2 months ago

Unified Personalized Reward Model for Vision Generation

Paper • 2602.02380 • Published Feb 2 • 20

FourierSampler: Unlocking Non-Autoregressive Potential in Diffusion Language Models via Frequency-Guided Generation

Paper • 2601.23182 • Published Jan 30 • 21

upvoted a collection about 2 months ago

UnifiedReward Flex

Collection

13 items • Updated 2 days ago • 5

upvoted a collection 2 months ago

LightOnOCR-2 🦉

Collection

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family • 12 items • Updated 21 days ago • 23

upvoted a collection 3 months ago

RoPE++

Collection

19 items • Updated Dec 9, 2025 • 2

upvoted 4 papers 4 months ago

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs

Paper • 2512.07525 • Published Dec 8, 2025 • 60

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Paper • 2512.05111 • Published Dec 4, 2025 • 50

ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation

Paper • 2512.03036 • Published Dec 2, 2025 • 22

Think Visually, Reason Textually: Vision-Language Synergy in ARC

Paper • 2511.15703 • Published Nov 19, 2025 • 9

upvoted 4 papers 5 months ago

Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning

Paper • 2510.27606 • Published Oct 31, 2025 • 31

STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence

Paper • 2510.24693 • Published Oct 28, 2025 • 19

UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation

Paper • 2510.18701 • Published Oct 21, 2025 • 68

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Paper • 2510.08673 • Published Oct 9, 2025 • 128

upvoted 2 papers 6 months ago

G^2RPO: Granular GRPO for Precise Reward in Flow Models

Paper • 2510.01982 • Published Oct 2, 2025 • 7

SPARK: Synergistic Policy And Reward Co-Evolving Framework

Paper • 2509.22624 • Published Sep 26, 2025 • 19

Yuhang Zang PRO

AI & ML interests

Recent Activity

Organizations

yuhangzang's activity