Mingzhe Li

Mubuky

https://www.mubuky.com

AI & ML interests

RL & Agent

Recent Activity

upvoted 2 papers 29 days ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27, 2025 • 84

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published about 1 month ago • 242

upvoted a paper 30 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 93

liked a dataset about 2 months ago

OpenMOSS-Team/VideoThinkBench

Viewer • Updated 13 days ago • 4.9k • 896 • 12

authored a paper about 2 months ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 210

upvoted a paper about 2 months ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 210

updated a dataset 2 months ago

OpenMOSS-Team/VideoThinkBench

Viewer • Updated 13 days ago • 4.9k • 896 • 12

liked a model 3 months ago

Qwen/WorldPM-72B

Text Classification • 73B • Updated May 17, 2025 • 99 • 80