Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
YitongChen (SII)'s picture
8 12 3

YitongChen (SII)

Row11n
langDU's profile picture
·
  • Row11n

AI & ML interests

None yet

Recent Activity

authored a paper 3 days ago
CaTok: Taming Mean Flows for One-Dimensional Causal Image Tokenization
upvoted a paper 3 days ago
CaTok: Taming Mean Flows for One-Dimensional Causal Image Tokenization
submitted a paper 3 days ago
CaTok: Taming Mean Flows for One-Dimensional Causal Image Tokenization
View all activity

Organizations

Slim Multimodal Models-X's profile picture SII Text 2 Image's profile picture ShareLab-SII's profile picture

authored a paper 3 days ago

CaTok: Taming Mean Flows for One-Dimensional Causal Image Tokenization

Paper • 2603.06449 • Published 6 days ago • 6
submitted a paper to Daily Papers 3 days ago

CaTok: Taming Mean Flows for One-Dimensional Causal Image Tokenization

Paper • 2603.06449 • Published 6 days ago • 6
authored 3 papers 12 months ago

Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning

Paper • 2412.03565 • Published Dec 4, 2024 • 10

Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection

Paper • 2412.17800 • Published Dec 23, 2024

CoMP: Continual Multimodal Pre-training for Vision Foundation Models

Paper • 2503.18931 • Published Mar 24, 2025 • 30
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs