sophia peng
sophiapeng
ยท
AI & ML interests
None yet
Recent Activity
upvoted a paper about 7 hours ago
Harnessing Negative Signals: Reinforcement Distillation from Teacher
Data for LLM Reasoning liked a model 7 months ago
lixiaoxi45/WebThinker-QwQ-32B upvoted a paper 7 months ago
WebSailor: Navigating Super-human Reasoning for Web Agent