Wei Wu

Wei-Wu

AI & ML interests

None yet

Recent Activity

reacted to Shrijanagain's post with 👍 about 21 hours ago

Surya-1.1T: Scaling Beyond Human-Level Reasoning via 146 Trillion Token Pre-training
Author: Shrijan Kumar Tiwari
Affiliation: SKT AI Labs / Project Surya
Model Architecture: Optimized Dense Transformer
Parameters: 1.1 Trillion
Training Tokens: 146 Trillion
Want to collaborate with us? Friends, let's start the journey. We have collected 146 trillion tokens and completed pre-training, but we need to make it more powerful.
Whitepaper: https://github.com/SHRIJANAGAIN/PROFF

liked a dataset 8 months ago

dgslibisey/MuSiQue

liked a model 8 months ago

jiulaikankan/Qwen2.5-14B-ReasonGenRM

Organizations

models 0

None public yet

datasets 0

None public yet