10 39 59

Jiaming Han

csuhan

https://csuhan.com

csuhan

AI & ML interests

Computer Vision

Recent Activity

updated a Space about 2 hours ago

csuhan/scholar_api

published a Space about 2 hours ago

csuhan/scholar_api

upvoted a paper 4 days ago

Act Wisely: Cultivating Meta-Cognitive Tool Use in Agentic Multimodal Models

View all activity

Organizations

None yet

updated a Space about 2 hours ago

Google Scholar Citation API

📚

Fetch Google Scholar citations for an author

published a Space about 2 hours ago

Google Scholar Citation API

📚

Fetch Google Scholar citations for an author

upvoted a paper 4 days ago

Act Wisely: Cultivating Meta-Cognitive Tool Use in Agentic Multimodal Models

Paper • 2604.08545 • Published 5 days ago • 39

upvoted a paper 7 days ago

Vero: An Open RL Recipe for General Visual Reasoning

Paper • 2604.04917 • Published 8 days ago • 30

upvoted a paper 14 days ago

Gen-Searcher: Reinforcing Agentic Search for Image Generation

Paper • 2603.28767 • Published 15 days ago • 57

upvoted a paper 25 days ago

Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens

Paper • 2603.19232 • Published 26 days ago • 33

upvoted a collection about 1 month ago

BitDance

Collection

BitDance: Open-source autoregressive model with binary visual tokens. A research project for building powerful multimodal autoregressive model. • 10 items • Updated Mar 2 • 11

liked a Space about 2 months ago

BitDance-14B-64x

🚀

Open-source autoregressive model with binary visual tokens.

authored 2 papers about 2 months ago

UniWeTok: An Unified Binary Tokenizer with Codebook Size $\mathit{2^{128}}$ for Unified Multimodal Large Language Model

Paper • 2602.14178 • Published Feb 15 • 14

BitDance: Scaling Autoregressive Generative Models with Binary Tokens

Paper • 2602.14041 • Published Feb 15 • 53

upvoted 2 papers about 2 months ago

UniWeTok: An Unified Binary Tokenizer with Codebook Size 2^{128} for Unified Multimodal Large Language Model

Paper • 2602.14178 • Published Feb 15 • 14

BitDance: Scaling Autoregressive Generative Models with Binary Tokens

Paper • 2602.14041 • Published Feb 15 • 53

updated a dataset about 2 months ago

csuhan/bitdance_demo

Viewer • Updated Feb 15 • 141 • 189

published a dataset about 2 months ago

csuhan/bitdance_demo

Viewer • Updated Feb 15 • 141 • 189

liked 2 datasets 2 months ago

#2 opened 3 months ago by

Mejistus

upvoted 3 papers 4 months ago

Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing

Paper • 2512.17909 • Published Dec 19, 2025 • 37

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Paper • 2512.13281 • Published Dec 15, 2025 • 65

OneThinker: All-in-one Reasoning Model for Image and Video

Paper • 2512.03043 • Published Dec 2, 2025 • 34

Jiaming Han

AI & ML interests

Recent Activity

Organizations

csuhan's activity

Google Scholar Citation API

Google Scholar Citation API

BitDance-14B-64x

Can some details about the image generation process be added?