Fengzhuo Zhang's picture

3 14

Fengzhuo Zhang

Fengzhuo

·

AI & ML interests

None yet

Recent Activity

updated a dataset 12 days ago

Fengzhuo/optimizer_results

upvoted a paper 13 days ago

Demystifying the Slash Pattern in Attention: The Role of RoPE

submitted a paper 13 days ago

Demystifying the Slash Pattern in Attention: The Role of RoPE

View all activity

Organizations

upvoted a paper 13 days ago

Demystifying the Slash Pattern in Attention: The Role of RoPE

Paper • 2601.08297 • Published 16 days ago • 3

upvoted a paper 2 months ago

WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation

Paper • 2511.11434 • Published Nov 14, 2025 • 45

upvoted 3 papers 3 months ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5, 2025 • 129

Unlocking Out-of-Distribution Generalization in Transformers via Recursive Latent Space Reasoning

Paper • 2510.14095 • Published Oct 15, 2025 • 6

Build Your Personalized Research Group: A Multiagent Framework for Continual and Interactive Science Automation

Paper • 2510.15624 • Published Oct 17, 2025 • 15

upvoted a paper 4 months ago

GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published Oct 1, 2025 • 90

upvoted a paper 5 months ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2, 2025 • 84

upvoted a paper 6 months ago

VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning

Paper • 2507.22607 • Published Jul 30, 2025 • 47

upvoted 4 papers 8 months ago

LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs

Paper • 2506.14429 • Published Jun 17, 2025 • 44

Taming Polysemanticity in LLMs: Provable Feature Recovery via Sparse Autoencoders

Paper • 2506.14002 • Published Jun 16, 2025 • 5

Reinforcing General Reasoning without Verifiers

Paper • 2505.21493 • Published May 27, 2025 • 26

Lifelong Safety Alignment for Language Models

Paper • 2505.20259 • Published May 26, 2025 • 23

upvoted a paper 10 months ago

Understanding R1-Zero-Like Training: A Critical Perspective

Paper • 2503.20783 • Published Mar 26, 2025 • 59

upvoted a paper 11 months ago

Error Analyses of Auto-Regressive Video Diffusion Models: A Unified Framework

Paper • 2503.10704 • Published Mar 12, 2025 • 5