8 13 46

Shuhuai Ren

ShuhuaiRen

https://renshuhuai-andy.github.io/

AI & ML interests

NLP, Multi-modal

Recent Activity

upvoted a paper about 2 months ago

HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing

liked a model 6 months ago

XiaomiMiMo/MiMo-Audio-Tokenizer

upvoted a collection 6 months ago

MiMo-Audio

View all activity

Organizations

upvoted a paper about 2 months ago

HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing

Paper • 2602.03560 • Published Feb 3 • 47

liked a model 6 months ago

XiaomiMiMo/MiMo-Audio-Tokenizer

Updated Sep 19, 2025 • 220 • 32

upvoted a collection 6 months ago

MiMo-Audio

Collection

4 items • Updated 20 days ago • 25

upvoted a paper 7 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31, 2025 • 85

liked a dataset 7 months ago

apf1/datafilteringnetworks_2b

Updated Feb 28, 2025 • 150 • 20

New activity in XiaomiMiMo/MiMo-VL-7B-RL-2508 7 months ago

add hints for placing visual input and thinking control

#2 opened 7 months ago by

ShuhuaiRen

New activity in XiaomiMiMo/MiMo-VL-7B-SFT-2508 7 months ago

add hints for placing visual input and thinking control

#2 opened 7 months ago by

ShuhuaiRen

liked 2 models 8 months ago

XiaomiMiMo/MiMo-VL-7B-SFT-2508

Image-Text-to-Text • 8B • Updated Aug 21, 2025 • 4.36k • 36

XiaomiMiMo/MiMo-VL-7B-RL-2508

Image-Text-to-Text • 8B • Updated Aug 21, 2025 • 150k • 90

upvoted a collection 8 months ago

MiMo-VL

Collection

6 items • Updated Dec 17, 2025 • 39

liked a model 8 months ago

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27, 2025 • 784k • • 12.5k

liked 3 Spaces 9 months ago

RISEBench Gallery

👀

A Gallery of Generation Results on RISEBench

FineWeb: decanting the web for the finest text data at scale

🍷

1.31k

Read a detailed overview of the FineWeb web‑scale text dataset

The Ultra-Scale Playbook

🌌

3.75k

The ultimate guide to training LLM on large GPU Clusters

authored a paper 10 months ago

MiMo-VL Technical Report

Paper • 2506.03569 • Published Jun 4, 2025 • 80

upvoted a paper 10 months ago

MiMo-VL Technical Report

Paper • 2506.03569 • Published Jun 4, 2025 • 80

liked 2 models 10 months ago

XiaomiMiMo/MiMo-VL-7B-SFT

Image-Text-to-Text • 8B • Updated Jun 7, 2025 • 467 • 55

XiaomiMiMo/MiMo-VL-7B-RL

Image-Text-to-Text • 8B • Updated Jun 7, 2025 • 1.61k • 169

authored a paper 10 months ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12, 2025 • 82

liked a dataset 10 months ago

BGLab/BioTrove

Viewer • Updated Dec 13, 2024 • 163M • 956 • 18

Shuhuai Ren

AI & ML interests

Recent Activity

Organizations

ShuhuaiRen's activity

add hints for placing visual input and thinking control

add hints for placing visual input and thinking control

RISEBench Gallery

FineWeb: decanting the web for the finest text data at scale

The Ultra-Scale Playbook