Evolution Fine-Tuning
Collection
Internalizing Discovery Capability into LLM • 5 items • Updated
NLP group at University of Minnesota
Abstain-R1: Calibrated Abstention and Post-Refusal Clarification via Verifiable RL
The Amazing Agent Race: Strong Tool Users, Weak Navigators