Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Uasonchen 's Collections
Agent
Image Generation
Image Editing
Vision Foundation Model
Video Generation
MLLM
Open Math Data for LLM
Math Data Synthesis

MLLM

updated 12 days ago
Upvote
-

  • Apriel-1.5-15b-Thinker

    Paper • 2510.01141 • Published 21 days ago • 110

  • MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

    Paper • 2509.21268 • Published 27 days ago • 100

  • LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

    Paper • 2509.00676 • Published Aug 31 • 83

  • Visual Representation Alignment for Multimodal Large Language Models

    Paper • 2509.07979 • Published Sep 9 • 82

  • InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

    Paper • 2508.18265 • Published Aug 25 • 201
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs