jasonjiang
mikinyaa
AI & ML interests
None yet
Recent Activity
upvoted a paper about 2 hours ago
FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling upvoted an article 3 days ago
Using OCR models with llama.cpp upvoted a paper 3 days ago
RAGEN-2: Reasoning Collapse in Agentic RLOrganizations
None yet