Zhiwei He's picture

Zhiwei He

zwhe99

·

https://zwhe99.github.io/

AI & ML interests

Natural Language Processing

Recent Activity

upvoted a paper 27 days ago

Too Good to be Bad: On the Failure of LLMs to Role-Play Villains

liked a model about 1 month ago

MiniMaxAI/MiniMax-M2

updated a dataset about 1 month ago

zwhe99/lcbv5

View all activity

Organizations

None yet

upvoted a paper 27 days ago

Too Good to be Bad: On the Failure of LLMs to Role-Play Villains

Paper • 2511.04962 • Published about 1 month ago • 52

upvoted a collection 2 months ago

DeepSeek-V3.2

4 items • Updated 6 days ago • 501

upvoted a paper 3 months ago

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published Sep 8 • 78

upvoted a paper 5 months ago

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

Paper • 2507.06229 • Published Jul 8 • 75

upvoted a paper 6 months ago

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published May 26 • 104

upvoted a paper 7 months ago

Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models

Paper • 2505.02847 • Published May 1 • 28

upvoted 2 collections 7 months ago

Qwen3

84 items • Updated Aug 6 • 1.47k

DeepMath

A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning • 5 items • Updated May 22 • 4

upvoted 2 papers 8 months ago

The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks

Paper • 2504.15521 • Published Apr 22 • 64

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Paper • 2504.11456 • Published Apr 15 • 12

upvoted a paper 11 months ago

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published Dec 30, 2024 • 40

upvoted a collection over 1 year ago

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 248