ClawBench: Can AI Agents Complete Everyday Online Tasks? Paper • 2604.08523 • Published 2 days ago • 91
ClawBench: Can AI Agents Complete Everyday Online Tasks? Paper • 2604.08523 • Published 2 days ago • 91
Watch Before You Answer: Learning from Visually Grounded Post-Training Paper • 2604.05117 • Published 5 days ago • 30
Watch Before You Answer: Learning from Visually Grounded Post-Training Paper • 2604.05117 • Published 5 days ago • 30
PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published Jan 30 • 223