MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper • 2511.11793 • Published Nov 2025 • 158 upvotes
ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data Paper • 2509.15221 • Published Sep 18, 2025 • 111 upvotes
LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model Paper • 2509.00676 • Published Aug 31, 2025 • 84 upvotes
MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding Paper • 2410.11829 • Published Oct 15, 2024 • 2 upvotes
MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity Paper • 2407.15838 • Published Jul 22, 2024 • 3 upvotes
InternVL3.5 Collection This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 54 items • Updated Sep 28, 2025 • 103 upvotes
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper • 2508.18265 • Published Aug 25, 2025 • 208 upvotes
Visual-CoG: Stage-Aware Reinforcement Learning with Chain of Guidance for Text-to-Image Generation Paper • 2508.18032 • Published Aug 25, 2025 • 42 upvotes
MiroThinker-v0.1 Collection Models achieving high performance in deep research and tool use. • 7 items • Updated Sep 8, 2025 • 35 upvotes
ZeroGUI: Automating Online GUI Learning at Zero Human Cost Paper • 2505.23762 • Published May 29, 2025 • 45 upvotes
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents Paper • 2506.11763 • Published Jun 13, 2025 • 72 upvotes
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models Paper • 2504.15271 • Published Apr 21, 2025 • 67 upvotes
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published Apr 14, 2025 • 303 upvotes
VisualPRM: An Effective Process Reward Model for Multimodal Reasoning Paper • 2503.10291 • Published Mar 13, 2025 • 36 upvotes