Charles Cai

charlescai2016

AI & ML interests

None yet

Recent Activity

liked a dataset 3 days ago

netop/TeleLogs

liked a model 5 days ago

HeartMuLa/HeartMuLa-oss-3B

liked a model 5 days ago

HeartMuLa/HeartMuLaGen

View all activity

Organizations

liked a dataset 3 days ago

netop/TeleLogs

Viewer • Updated Aug 5, 2025 • 3.26k • 1.33k • 28

liked 2 models 5 days ago

HeartMuLa/HeartMuLa-oss-3B

Text-to-Audio • 4B • Updated 6 days ago • 7.55k • 205

HeartMuLa/HeartMuLaGen

Text-to-Audio • Updated 6 days ago • 21

upvoted a paper 5 days ago

HeartMuLa: A Family of Open Sourced Music Foundation Models

Paper • 2601.10547 • Published 10 days ago • 37

upvoted a paper 10 days ago

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Paper • 2601.07372 • Published 14 days ago • 36

liked 2 models 13 days ago

cerebras/GLM-4.7-REAP-218B-A32B-FP8

Text Generation • Updated 16 days ago • 1.23k • 16

cerebras/GLM-4.7-REAP-268B-A32B

Text Generation • 269B • Updated 3 days ago • 31 • 18

liked a model 18 days ago

Lightricks/LTX-2

Image-to-Video • Updated 6 days ago • 2.23M • • 1.31k

liked a model 20 days ago

tencent/HY-Motion-1.0

Text-to-3D • Updated 26 days ago • 986 • 347

upvoted a paper 24 days ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 25 days ago • 280

updated a collection about 1 month ago

Papers

Collection

4 items • Updated Dec 16, 2025

liked a dataset about 2 months ago

iteratehack/code19-dataset

Viewer • Updated Nov 30, 2025 • 3.06k • 7 • 1

liked a model about 2 months ago

PrimeIntellect/INTELLECT-3

Text Generation • 107B • Updated Nov 27, 2025 • 1.78k • 203

liked a model 2 months ago

ByteDance/BindWeave

Image-to-Video • Updated Nov 28, 2025 • 624 • 88

upvoted an article 3 months ago

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

Nov 3, 2025

•

upvoted a paper 3 months ago

ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation

Paper • 2510.04290 • Published Oct 5, 2025 • 19

upvoted an article 3 months ago

Article

Train your ControlNet with diffusers

Mar 24, 2023

•

upvoted a paper 3 months ago

The End of Manual Decoding: Towards Truly End-to-End Language Models

Paper • 2510.26697 • Published Oct 30, 2025 • 117

liked a Space 3 months ago

The Smol Training Playbook

📚

2.92k

The secrets to building world-class LLMs

upvoted a paper 3 months ago

Video-Thinker: Sparking "Thinking with Videos" via Reinforcement Learning

Paper • 2510.23473 • Published Oct 27, 2025 • 85

Charles Cai

AI & ML interests

Recent Activity

Organizations

charlescai2016's activity

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

Train your ControlNet with diffusers

The Smol Training Playbook