1 8 4

Mao Song

MaoSong2022

https://maosong2022.github.io/

MaoSong2022

AI & ML interests

None yet

Recent Activity

liked a Space 21 days ago

OpenEvals/evaluation-guidebook

upvoted a collection about 1 month ago

Olmo 3

upvoted an article 6 months ago

SmolLM3: smol, multilingual, long-context reasoner

View all activity

Organizations

liked a Space 21 days ago

Evaluation Guidebook

📝

218

Display benchmark evaluation data for LLMs

upvoted a collection about 1 month ago

Olmo 3

Collection

Artifacts for the Olmo 3 release. • 9 items • Updated 6 days ago • 157

upvoted an article 6 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8

•

740

liked a Space 7 months ago

The Ultra-Scale Playbook

🌌

3.6k

The ultimate guide to training LLM on large GPU Clusters

upvoted 2 articles 7 months ago

Article

You could have designed state of the art positional encoding

Nov 25, 2024

•

426

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

May 21

•

245

updated a dataset 8 months ago

MaoSong2022/CharXiv

Updated Apr 30 • 25

published a dataset 9 months ago

MaoSong2022/CharXiv

Updated Apr 30 • 25

updated a dataset 9 months ago

MaoSong2022/CV-Bench

Preview • Updated Apr 14 • 9

published a dataset 9 months ago

MaoSong2022/CV-Bench

Preview • Updated Apr 14 • 9

updated a dataset 9 months ago

MaoSong2022/MMVP

Viewer • Updated Apr 9 • 300 • 15

published a dataset 9 months ago

MaoSong2022/MMVP

Viewer • Updated Apr 9 • 300 • 15

liked a dataset 9 months ago

Anthropic/EconomicIndex

Preview • Updated Nov 17 • 3.4k • 380

updated a model 10 months ago

MaoSong2022/llava-sequence-append

Updated Mar 11 • 7

published a model 10 months ago

MaoSong2022/llava-sequence-append

Updated Mar 11 • 7

upvoted a paper 10 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211

liked a model 11 months ago

ValueFX9507/Tifa-Deepsex-14b-CoT

Reinforcement Learning • 15B • Updated Feb 13 • 656 • 219

upvoted 3 papers about 1 year ago

Chimera: Improving Generalist Model with Domain-Specific Experts

Paper • 2412.05983 • Published Dec 8, 2024 • 9

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 159

TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation

Paper • 2412.03069 • Published Dec 4, 2024 • 34

Mao Song

AI & ML interests

Recent Activity

Organizations

MaoSong2022's activity

Evaluation Guidebook

SmolLM3: smol, multilingual, long-context reasoner

The Ultra-Scale Playbook

You could have designed state of the art positional encoding

nanoVLM: The simplest repository to train your VLM in pure PyTorch