Charles Cai

charlescai2016

AI & ML interests

None yet

Recent Activity

Organizations

upvoted a paper about 15 hours ago

HeartMuLa: A Family of Open Sourced Music Foundation Models

Paper • 2601.10547 • Published 5 days ago • 23

upvoted a paper 4 days ago

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Paper • 2601.07372 • Published 8 days ago • 28

liked 2 models 7 days ago

cerebras/GLM-4.7-REAP-218B-A32B-FP8

Text Generation • Updated 10 days ago • 645 • 13

cerebras/GLM-4.7-REAP-268B-A32B

Updated 10 days ago • 16

liked a model 13 days ago

Lightricks/LTX-2

Image-to-Video • Updated about 23 hours ago • 1.74M • 1.2k

liked a model 15 days ago

tencent/HY-Motion-1.0

Text-to-3D • Updated 20 days ago • 898 • 341

upvoted a paper 19 days ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 20 days ago • 264

updated a collection about 1 month ago

Papers

Collection

4 items • Updated Dec 16, 2025

liked a dataset about 2 months ago

iteratehack/code19-dataset

Viewer • Updated Nov 30, 2025 • 3.06k • 8 • 1

liked a model about 2 months ago

PrimeIntellect/INTELLECT-3

Text Generation • 107B • Updated Nov 27, 2025 • 2.83k • 203

liked a model 2 months ago

ByteDance/BindWeave

Image-to-Video • Updated Nov 28, 2025 • 713 • 88

upvoted an article 3 months ago

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

Nov 3, 2025

upvoted a paper 3 months ago

ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation

Paper • 2510.04290 • Published Oct 5, 2025 • 18

upvoted an article 3 months ago

Article

Train your ControlNet with diffusers

Mar 24, 2023

upvoted a paper 3 months ago

The End of Manual Decoding: Towards Truly End-to-End Language Models

Paper • 2510.26697 • Published Oct 30, 2025 • 116

liked a Space 3 months ago

The Smol Training Playbook

📚

2.89k

The secrets to building world-class LLMs

upvoted a paper 3 months ago

Video-Thinker: Sparking "Thinking with Videos" via Reinforcement Learning

Paper • 2510.23473 • Published Oct 27, 2025 • 84

liked a model 3 months ago

deepseek-ai/DeepSeek-OCR

Image-Text-to-Text • 3B • Updated Nov 4, 2025 • 3.02M • 3.09k

liked a dataset 3 months ago

criteo/CriteoClickLogs

Updated 5 days ago • 757 • 8

upvoted a paper 3 months ago

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning

Paper • 2510.12693 • Published Oct 14, 2025 • 27
